POSIX [[:character class:]] support for standard, locale,
[p5sagit/p5-mst-13.2.git] / pod / perldelta.pod
CommitLineData
ba8251e8 1=head1 NAME
2
2bb14304 3perldelta - what's new for perl5.006 (as of 5.005_56)
ba8251e8 4
5=head1 DESCRIPTION
6
7This document describes differences between the 5.005 release and this one.
8
9=head1 Incompatible Changes
10
e02fdbd2 11=head2 Perl Source Incompatibilities
12
13None known at this time.
14
15=head2 C Source Incompatibilities
16
17=over 4
18
19=item C<PERL_POLLUTE>
20
21Release 5.005 grandfathered old global symbol names by providing preprocessor
22macros for extension source compatibility. As of release 5.006, these
23preprocessor definitions are not available by default. You need to explicitly
14218588 24compile perl with C<-DPERL_POLLUTE> to get these definitions. For
25extensions still using the old symbols, this option can be
2aea4d40 26specified via MakeMaker:
27
14218588 28 perl Makefile.PL POLLUTE=1
e02fdbd2 29
86058a2d 30=item C<PERL_POLLUTE_MALLOC>
31
14218588 32Enabling Perl's malloc in release 5.005 and earlier caused
86058a2d 33the namespace of system versions of the malloc family of functions to
14218588 34be usurped by the Perl versions, since by default they used the
35same names.
86058a2d 36
37Besides causing problems on platforms that do not allow these functions to
38be cleanly replaced, this also meant that the system versions could not
39be called in programs that used Perl's malloc. Previous versions of Perl
14218588 40have allowed this behaviour to be suppressed with the HIDEMYMALLOC and
86058a2d 41EMBEDMYMALLOC preprocessor definitions.
42
43As of release 5.006, Perl's malloc family of functions have default names
44distinct from the system versions. You need to explicitly compile perl with
14218588 45C<-DPERL_POLLUTE_MALLOC> to get the older behaviour. HIDEMYMALLOC
46and EMBEDMYMALLOC have no effect, since the behaviour they enabled is now
86058a2d 47the default.
48
49Note that these functions do B<not> constitute Perl's memory allocation API.
50See L<perlguts/"Memory Allocation"> for further information about that.
51
e02fdbd2 52=item C<PL_na> and C<dTHR> Issues
53
54The C<PL_na> global is now thread local, so a C<dTHR> declaration is needed
14218588 55in the scope in which the global appears. XSUBs should handle this automatically,
e02fdbd2 56but if you have used C<PL_na> in support functions, you either need to
57change the C<PL_na> to a local variable (which is recommended), or put in
58a C<dTHR>.
59
60=back
61
cceca5ed 62=head2 Compatible C Source API Changes
63
64=over
65
66=item C<PATCHLEVEL> is now C<PERL_VERSION>
67
14218588 68The cpp macros C<PERL_REVISION>, C<PERL_VERSION>, and C<PERL_SUBVERSION>
cceca5ed 69are now available by default from perl.h, and reflect the base revision,
14218588 70patchlevel, and subversion respectively. C<PERL_REVISION> had no
cceca5ed 71prior equivalent, while C<PERL_VERSION> and C<PERL_SUBVERSION> were
72previously available as C<PATCHLEVEL> and C<SUBVERSION>.
73
14218588 74The new names cause less pollution of the B<cpp> namespace and reflect what
cceca5ed 75the numbers have come to stand for in common practice. For compatibility,
14218588 76the old names are still supported when F<patchlevel.h> is explicitly
cceca5ed 77included (as required before), so there is no source incompatibility
14218588 78from the change.
cceca5ed 79
80=back
81
e02fdbd2 82=head2 Binary Incompatibilities
83
14218588 84This release is not binary compatible with the 5.005 release or its
e02fdbd2 85maintenance versions.
86
ba8251e8 87=head1 Core Changes
88
9d73390d 89=head2 Unicode and UTF-8 support
90
91Perl can optionally use UTF-8 as its internal representation for character
92strings. The C<use utf8> pragma enables this support in the current lexical
93scope. See L<utf8> for more information.
94
95=head2 Lexically scoped warning categories
96
97You can now control the granularity of warnings emitted by perl at a finer
98level using the C<use warning> pragma. See L<warning> for details.
99
5fdc711f 100=head2 Binary numbers supported
101
4f19785b 102Binary numbers are now supported as literals, in s?printf formats, and
103C<oct()>:
104
14218588 105 $answer = 0b101010;
106 printf "The answer is: %b\n", oct("0b101010");
4f19785b 107
5fdc711f 108=head2 syswrite() ease-of-use
109
6c67e1bb 110The length argument of C<syswrite()> is now optional.
111
5fdc711f 112=head2 64-bit support
113
6c67e1bb 114Better 64-bit support -- but full support still a distant goal. One
115must Configure with -Duse64bits to get Configure to probe for the
116extent of 64-bit support. Depending on the platform (hints file) more
117or less 64-awareness becomes available. As of 5.005_54 at least
118somewhat 64-bit aware platforms are HP-UX 11 or better, Solaris 2.6 or
119better, IRIX 6.2 or better. Naturally 64-bit platforms like Digital
19799a22 120Unix and UNICOS also have 64-bit support.
e02fdbd2 121
62c18ce2 122=head2 Better syntax checks on parenthesized unary operators
123
b8c5462f 124TODO
125
126=head2 POSIX character class syntax [: :] supported
127
128For example to match alphabetic characters use /[[:alpha:]]/.
129See L<perlre> for details.
130
62c18ce2 131Expressions such as:
132
14218588 133 print defined(&foo,&bar,&baz);
134 print uc("foo","bar","baz");
135 undef($foo,&bar);
62c18ce2 136
7711098a 137used to be accidentally allowed in earlier versions, and produced
14218588 138unpredictable behaviour. Some produced ancillary warnings
139when used in this way; others silently did the wrong thing.
62c18ce2 140
141The parenthesized forms of most unary operators that expect a single
14218588 142argument now ensure that they are not called with more than one
143argument, making the cases shown above syntax errors. The usual
144behaviour of:
62c18ce2 145
14218588 146 print defined &foo, &bar, &baz;
147 print uc "foo", "bar", "baz";
148 undef $foo, &bar;
62c18ce2 149
150remains unchanged. See L<perlop>.
151
5a929a98 152=head2 Improved C<qw//> operator
8127e0e3 153
26ef7447 154The C<qw//> operator is now evaluated at compile time into a true list
155instead of being replaced with a run time call to C<split()>. This
14218588 156removes the confusing misbehaviour of C<qw//> in scalar context, which
157had inherited that behaviour from split().
26ef7447 158
159Thus:
160
161 $foo = ($bar) = qw(a b c); print "$foo|$bar\n";
162
163now correctly prints "3|a", instead of "2|a".
8127e0e3 164
5a929a98 165=head2 pack() format 'Z' supported
166
167The new format type 'Z' is useful for packing and unpacking null-terminated
168strings. See L<perlfunc/"pack">.
8127e0e3 169
4d0c1c44 170=head2 pack() format modifier '!' supported
ee3907e2 171
14218588 172The new format type modifier '!' is useful for packing and unpacking
ee3907e2 173native shorts, ints, and longs. See L<perlfunc/"pack">.
174
2b92dfce 175=head2 $^X variables may now have names longer than one character
176
177Formerly, $^X was synonymous with ${"\cX"}, but $^XY was a syntax
178error. Now variable names that begin with a control character may be
179arbitrarily long. However, for compatibility reasons, these variables
180I<must> be written with explicit braces, as C<${^XY}> for example.
14218588 181C<${^XYZ}> is synonymous with ${"\cXYZ"}. Variable names with more
2b92dfce 182than one control character, such as C<${^XY^Z}>, are illegal.
183
14218588 184The old syntax has not changed. As before, `^X' may be either a
185literal control-X character or the two-character sequence `caret' plus
186`X'. When braces are omitted, the variable name stops after the
2b92dfce 187control character. Thus C<"$^XYZ"> continues to be synonymous with
7711098a 188C<$^X . "YZ"> as before.
2b92dfce 189
190As before, lexical variables may not have names beginning with control
191characters. As before, variables whose names begin with a control
14218588 192character are always forced to be in package `main'. All such variables
193are reserved for future extensions, except those that begin with
194C<^_>, which may be used by user programs and is guaranteed not to
195acquire special meaning in any future version of Perl.
2b92dfce 196
fbad3eb5 197=head1 Significant bug fixes
198
199=head2 E<lt>HANDLEE<gt> on empty files
200
201With C<$/> set to C<undef>, slurping an empty file returns a string of
14218588 202zero length (instead of C<undef>, as it used to) the first time the
203HANDLE is read. Further reads yield C<undef>.
fbad3eb5 204
205This means that the following will append "foo" to an empty file (it used
14218588 206to do nothing):
fbad3eb5 207
208 perl -0777 -pi -e 's/^/foo/' empty_file
209
14218588 210The behaviour of:
fbad3eb5 211
212 perl -pi -e 's/^/foo/' empty_file
213
214is unchanged (it continues to leave the file empty).
215
0244c3a4 216=head2 C<eval '...'> improvements
217
218Line numbers (as reflected by caller() and most diagnostics) within
219C<eval '...'> were often incorrect when here documents were involved.
220This has been corrected.
221
222Lexical lookups for variables appearing in C<eval '...'> within
223functions that were themselves called within an C<eval '...'> were
14218588 224searching the wrong place for lexicals. The lexical search now
225correctly ends at the subroutine's block boundary.
0244c3a4 226
227Parsing of here documents used to be flawed when they appeared as
228the replacement expression in C<eval 's/.../.../e'>. This has
229been fixed.
230
45bc9206 231=head2 Automatic flushing of output buffers
232
14218588 233fork(), exec(), system(), qx//, and pipe open()s now flush buffers
234of all files opened for output when the operation
235was attempted. This mostly eliminates confusing
45bc9206 236buffering mishaps suffered by users unaware of how Perl internally
14218588 237handles I/O.
45bc9206 238
ba8251e8 239=head1 Supported Platforms
240
5fdc711f 241=over 4
242
243=item *
244
6c67e1bb 245VM/ESA is now supported.
246
5fdc711f 247=item *
248
ee3907e2 249Siemens BS2000 is now supported under the POSIX Shell.
250
251=item *
252
2bb14304 253The Mach CThreads (NEXTSTEP, OPENSTEP) are now supported by the Thread
254extension.
6c67e1bb 255
5fdc711f 256=item *
257
ee3907e2 258GNU/Hurd is now supported.
6c67e1bb 259
00ad96e1 260=item *
261
262Rhapsody is now supported.
263
27806c82 264=item *
265
266EPOC is is now supported (on Psion 5).
267
5fdc711f 268=back
269
6c67e1bb 270=head1 New tests
271
272=over 4
273
274=item op/io_const
275
276IO constants (SEEK_*, _IO*).
14218588 277
6c67e1bb 278=item op/io_dir
279
280Directory-related IO methods (new, read, close, rewind, tied delete).
281
282=item op/io_multihomed
283
284INET sockets with multi-homed hosts.
285
286=item op/io_poll
287
288IO poll().
289
290=item op/io_unix
291
292UNIX sockets.
293
294=item op/filetest
295
296File test operators.
297
298=item op/lex_assign
299
5fdc711f 300Verify operations that access pad objects (lexicals and temporaries).
6c67e1bb 301
302=back
e02fdbd2 303
ba8251e8 304=head1 Modules and Pragmata
305
3e8c4fa0 306=head2 Modules
307
b7d8191e 308=over 4
309
310=item Dumpvalue
311
312Added Dumpvalue module provides screen dumps of Perl data.
313
314=item Benchmark
315
868cb350 316You can now run tests for I<n> seconds instead of guessing the right
14218588 317number of tests to run: e.g. timethese(-5, ...) will run each
318code for at least 5 CPU seconds. Zero as the "number of repetitions"
155776c0 319means "for at least 3 CPU seconds". The output format has also
14218588 320changed. For example:
155776c0 321
322use Benchmark;$x=3;timethese(-5,{a=>sub{$x*$x},b=>sub{$x**2}})
323
324will now output something like this:
325
326Benchmark: running a, b, each for at least 5 CPU seconds...
327 a: 5 wallclock secs ( 5.77 usr + 0.00 sys = 5.77 CPU) @ 200551.91/s (n=1156516)
328 b: 4 wallclock secs ( 5.00 usr + 0.02 sys = 5.02 CPU) @ 159605.18/s (n=800686)
329
330New features: "each for at least N CPU seconds...", "wallclock secs",
331and the "@ operations/CPU second (n=operations)".
b7d8191e 332
f505c983 333=item Devel::Peek
334
335The Devel::Peek module provides access to the internal representation
14218588 336of Perl variables and data. It is a data debugging tool for the XS programmer.
f505c983 337
b7d8191e 338=item Fcntl
339
340More Fcntl constants added: F_SETLK64, F_SETLKW64, O_LARGEFILE for
14218588 341large (more than 4G) file access (64-bit support is not yet
b7d8191e 342working, though, so no need to get overly excited), Free/Net/OpenBSD
343locking behaviour flags F_FLOCK, F_POSIX, Linux F_SHLCK, and
344O_ACCMODE: the mask of O_RDONLY, O_WRONLY, and O_RDWR.
345
f505c983 346=item File::Spec
347
348New methods have been added to the File::Spec module: devnull() returns
19799a22 349the name of the null device (/dev/null on Unix) and tmpdir() the name of
14218588 350the temp directory (normally /tmp on Unix). There are now also methods
f505c983 351to convert between absolute and relative filenames: abs2rel() and
14218588 352rel2abs(). For compatibility with operating systems that specify volume
353names in file paths, the splitpath(), splitdir(), and catdir() methods
f505c983 354have been added.
355
356=item File::Spec::Functions
357
358The new File::Spec::Functions modules provides a function interface
14218588 359to the File::Spec module. Allows shorthand
f505c983 360
14218588 361 $fullname = catfile($dir1, $dir2, $file);
f505c983 362
363instead of
364
14218588 365 $fullname = File::Spec->catfile($dir1, $dir2, $file);
f505c983 366
e16b8f49 367=item Math::BigInt
368
14218588 369The logical operations C<E<lt>E<lt>>, C<E<gt>E<gt>>, C<&>, C<|>,
e16b8f49 370and C<~> are now supported on bigints.
371
b7d8191e 372=item Math::Complex
7711098a 373
14218588 374The accessor methods Re, Im, arg, abs, rho, and theta can now also
868cb350 375act as mutators (accessor $z->Re(), mutator $z->Re(3)).
b7d8191e 376
377=item Math::Trig
378
14218588 379A little bit of radial trigonometry (cylindrical and spherical),
380radial coordinate conversions, and the great circle distance were added.
b7d8191e 381
f4b9d880 382=item SDBM_File
383
384An EXISTS method has been added to this module (and sdbm_exists() has
385been added to the underlying sdbm library), so one can now call exists
14218588 386on an SDBM_File tied hash and get the correct result, rather than a
f4b9d880 387runtime error.
388
06ef4121 389=item Time::Local
390
391The timelocal() and timegm() functions used to silently return bogus
392results when the date exceeded the machine's integer range. They
14218588 393now consistently croak() if the date falls in an unsupported range.
06ef4121 394
8fe0a5c4 395=item Win32
396
397The error return value in list context has been changed for all functions
14218588 398that return a list of values. Previously these functions returned a list
399with a single element C<undef> if an error occurred. Now these functions
400return the empty list in these situations. This applies to the following
8fe0a5c4 401functions:
402
14218588 403 Win32::FsType
404 Win32::GetOSVersion
8fe0a5c4 405
406The remaining functions are unchanged and continue to return C<undef> on
407error even in list context.
408
409The Win32::SetLastError(ERROR) function has been added as a complement
410to the Win32::GetLastError() function.
411
412The new Win32::GetFullPathName(FILENAME) returns the full absolute
14218588 413pathname for FILENAME in scalar context. In list context it returns
414a two-element list containing the fully qualified directory name and
8fe0a5c4 415the filename.
416
9fe6733a 417=item DBM Filters
418
419A new feature called "DBM Filters" has been added to all the
14218588 420DBM modules--DB_File, GDBM_File, NDBM_File, ODBM_File, and SDBM_File.
421DBM Filters add four new methods to each DBM module:
9fe6733a 422
423 filter_store_key
424 filter_store_value
425 filter_fetch_key
426 filter_fetch_value
427
14218588 428These can be used to filter key-value pairs before the pairs are
9fe6733a 429written to the database or just after they are read from the database.
430See L<perldbmfilter> for further information.
431
b7d8191e 432=back
3e8c4fa0 433
434=head2 Pragmata
435
14218588 436C<use utf8> to enable UTF-8 and Unicode support.
9d73390d 437
43165c05 438C<use caller 'encoding'> allows modules to inherit pragmatic attributes
439from the caller's context. C<encoding> is currently the only supported
440attribute.
9d73390d 441
442Lexical warnings pragma, C<use warning;>, to control optional warnings.
6c67e1bb 443
14218588 444C<use filetest> to control the behaviour of filetests (C<-r> C<-w> ...).
6c67e1bb 445Currently only one subpragma implemented, "use filetest 'access';",
14218588 446that enables the use of access(2) or equivalent to check
6c67e1bb 447permissions instead of using stat(2) as usual. This matters
14218588 448in filesystems where there are ACLs (access control lists): the
449stat(2) might lie, but access(2) knows better.
6c67e1bb 450
ba8251e8 451=head1 Utility Changes
452
e02fdbd2 453Todo.
454
ba8251e8 455=head1 Documentation Changes
456
5fdc711f 457=over 4
458
459=item perlopentut.pod
f8284313 460
5fdc711f 461A tutorial on using open() effectively.
f8284313 462
5fdc711f 463=item perlreftut.pod
464
465A tutorial that introduces the essentials of references.
466
14218588 467=item perltootc.pod
468
469A tutorial on managing class data for object modules.
470
5fdc711f 471=back
e02fdbd2 472
ba8251e8 473=head1 New Diagnostics
474
6b121555 475=item /%s/: Unrecognized escape \\%c passed through
476
477(W) You used a backslash-character combination which is not recognized
7711098a 478by Perl. This combination appears in an interpolated variable or a
6b121555 479C<'>-delimited regular expression.
480
481=item Unrecognized escape \\%c passed through
482
483(W) You used a backslash-character combination which is not recognized
484by Perl.
e02fdbd2 485
06eaf0bc 486=item Missing command in piped open
487
488(W) You used the C<open(FH, "| command")> or C<open(FH, "command |")>
489construction, but the command was missing or blank.
490
f6b3007c 491=item defined(@array) is deprecated
69794302 492
493(D) defined() is not usually useful on arrays because it checks for an
494undefined I<scalar> value. If you want to see if the array is empty,
495just use C<if (@array) { # not empty }> for example.
496
f6b3007c 497=item defined(%hash) is deprecated
69794302 498
499(D) defined() is not usually useful on hashes because it checks for an
500undefined I<scalar> value. If you want to see if the hash is empty,
501just use C<if (%hash) { # not empty }> for example.
502
ba8251e8 503=head1 Obsolete Diagnostics
504
e02fdbd2 505Todo.
506
04d420f9 507=head1 Configuration Changes
508
27806c82 509=head2 installusrbinperl
510
04d420f9 511You can use "Configure -Uinstallusrbinperl" which causes installperl
512to skip installing perl also as /usr/bin/perl. This is useful if you
513prefer not to modify /usr/bin for some reason or another but harmful
555834d1 514because many scripts assume to find Perl in /usr/bin/perl.
515
27806c82 516=head2 SOCKS support
555834d1 517
27806c82 518You can use "Configure -Dusesocks" which causes Perl to probe
519for the SOCKS proxy protocol library, http://www.socks.nec.com/
04d420f9 520
ba8251e8 521=head1 BUGS
522
523If you find what you think is a bug, you might check the headers of
14218588 524articles recently posted to the comp.lang.perl.misc newsgroup.
ba8251e8 525There may also be information at http://www.perl.com/perl/, the Perl
526Home Page.
527
528If you believe you have an unreported bug, please run the B<perlbug>
14218588 529program included with your release. Make sure to trim your bug down
ba8251e8 530to a tiny but sufficient test case. Your bug report, along with the
14218588 531output of C<perl -V>, will be sent off to perlbug@perl.com to be
ba8251e8 532analysed by the Perl porting team.
533
534=head1 SEE ALSO
535
536The F<Changes> file for exhaustive details on what changed.
537
538The F<INSTALL> file for how to build Perl.
539
540The F<README> file for general stuff.
541
542The F<Artistic> and F<Copying> files for copyright information.
543
544=head1 HISTORY
545
546Written by Gurusamy Sarathy <F<gsar@umich.edu>>, with many contributions
547from The Perl Porters.
548
549Send omissions or corrections to <F<perlbug@perl.com>>.
550
551=cut