=head1 NAME

perldelta - what is new for perl v5.11.3

=head1 DESCRIPTION

This document describes differences between the 5.11.2 release and
the 5.11.3 release.

If you are upgrading from an earlier release such as 5.11.1, first read
L<perl5112delta>, which describes differences between 5.11.1 and
5.11.2.

=head1 Incompatible Changes

=over

=item Filehandles are blessed directly into C<IO::Handle::>, as C<FileHandle> is merely a wrapper around C<IO::Handle>.

The previous behaviour was to bless filehandles into L<FileHandle>
(an empty proxy class) if it was loaded into memory and otherwise
to bless them into C<IO::Handle::>.

=back
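
As a rough illustration of the change above, the IO object behind a glob can
be inspected with C<ref>. This is only a sketch: the class reported depends on
the perl that runs it (other releases may bless handles into a different
class), so no particular output is assumed here.

```perl
use strict;
use warnings;

# Fetch the IO object behind STDOUT and ask which class it is blessed into.
my $io    = *STDOUT{IO};
my $class = ref $io;

print "STDOUT's IO object is blessed into $class\n";
```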

=head1 Core Enhancements

=head2 Unicode version

Perl now ships with the latest Unicode version, 5.2, dated October 2009. See
L<http://www.unicode.org/versions/Unicode5.2.0> for details about this release
of Unicode. See L<perlunicode> for instructions on installing and using
older versions of Unicode.

=head2 Unicode properties

Perl can now handle every Unicode character property. A new pod,
L<perluniprops>, lists all available non-Unihan character properties. By
default the Unihan properties and certain others (deprecated and Unicode
internal-only ones) are not exposed. See below for more details on
these; there is also a section in the pod listing them, and why they are
not exposed.

Perl now fully supports the Unicode compound-style of using C<=> and C<:>
in writing regular expressions: C<\p{property=value}> and
C<\p{property:value}> (both of which mean the same thing).

Perl now fully supports the Unicode loose matching rules for text
between the braces in C<\p{...}> constructs. In addition, Perl allows
underscores between digits of numbers.

All the Unicode-defined synonyms for properties and property values are
now accepted.
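
The compound forms, loose matching and synonyms described above can be tried
out with a short script. This is a minimal sketch; the choice of the Script
property and of GREEK SMALL LETTER ALPHA as the test character is purely
illustrative.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# The '=' and ':' separators are interchangeable.
my $eq    = $alpha =~ /\p{Script=Greek}/ ? 1 : 0;
my $colon = $alpha =~ /\p{Script:Greek}/ ? 1 : 0;

# Loose matching ignores case differences in names and values, and the
# Unicode-defined short synonyms ("sc" for Script, "Grek" for Greek) work too.
my $loose = $alpha =~ /\p{script=greek}/ ? 1 : 0;
my $short = $alpha =~ /\p{sc=Grek}/      ? 1 : 0;

print "eq=$eq colon=$colon loose=$loose short=$short\n";
```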

C<qr/\X/>, which matches a Unicode logical character, has been expanded to work
better with various Asian languages. It is now defined as an C<extended
grapheme cluster> (see L<http://www.unicode.org/reports/tr29/>).
Anything matched by C<\X> previously will continue to be matched. In addition:

=over

=item *

C<\X> will now not break apart a C<S<CR LF>> sequence.

=item *

C<\X> will now match a sequence including the C<ZWJ> and C<ZWNJ> characters.

=item *

C<\X> will now always match at least one character, including an initial mark.
Marks generally come after a base character, but it is possible in Unicode to
have them in isolation, and C<\X> will now handle that case, for example at the
beginning of a line or after a C<ZWSP>.

=item *

C<\X> will now match a (Korean) Hangul syllable sequence, and the Thai and Lao
exception cases.

=back

Otherwise, this change should be transparent for the non-affected languages.
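
The clustering behaviour listed above can be observed by splitting a string
with C<\X>. The sample string is an assumption chosen for illustration: a base
letter with a combining mark, a CR LF pair, and a plain letter.

```perl
use strict;
use warnings;

# "e" followed by COMBINING ACUTE ACCENT, then a CR LF pair, then "a".
my $text = "e\x{0301}\r\na";

# Each \X match is one extended grapheme cluster.
my @clusters = $text =~ /(\X)/g;

printf "%d code points, %d clusters\n", length($text), scalar @clusters;
```

The combining mark stays attached to its base letter, and the CR LF pair is
kept together as a single cluster.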

C<\p{...}> matches using the Canonical_Combining_Class property were
completely broken in previous Perls. This is now fixed.

In previous Perls, the Unicode C<Decomposition_Type=Compat> property and a
Perl extension had the same name, which led to neither matching all the
correct values (with more than 100 mistakes in one, and several thousand
in the other). The Perl extension has now been renamed to be
C<Decomposition_Type=Noncanonical> (short: C<dt=noncanon>). It has the same
meaning as was previously intended, namely the union of all the
non-canonical Decomposition types, with Unicode C<Compat> being just one of
those.

C<\p{Uppercase}> and C<\p{Lowercase}> have been brought into line with the
Unicode definitions. This means they each match a few more characters
than previously.

C<\p{Cntrl}> now matches the same characters as C<\p{Control}>. This means it
no longer matches Private Use (gc=co), Surrogate (gc=cs), or Format
(gc=cf) code points. The Format code points represent the biggest
possible problem. All but 36 of them are either officially deprecated
or strongly discouraged from being used. Of those 36, likely the most
widely used are the soft hyphen (U+00AD), and BOM, ZWSP, ZWNJ, WJ, and
similar, plus Bi-directional controls.

C<\p{Alpha}> now matches the same characters as C<\p{Alphabetic}>. The
previous Perl definition included a number of things that aren't really
alpha (all marks), while omitting many that were. As a direct consequence, the
definitions of C<\p{Alnum}> and C<\p{Word}> which depend on Alpha also change.

C<\p{Word}> also now doesn't match certain characters it wasn't supposed
to, such as fractions.

C<\p{Print}> no longer matches the line control characters: Tab, LF, CR,
FF, VT, and NEL. This brings it in line with the documentation.
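
A few of the changes to C<\p{Cntrl}> and C<\p{Print}> described above can be
checked directly. This is a small sketch; the three sample characters are
assumptions picked to cover a line control character, a Format code point and
an ordinary letter.

```perl
use strict;
use warnings;

# Sample characters to probe against \p{Cntrl} and \p{Print}.
my %sample = (
    'TAB (U+0009)'         => "\x{09}",
    'SOFT HYPHEN (U+00AD)' => "\x{AD}",
    'LETTER A (U+0041)'    => "A",
);

my %result;
for my $name (sort keys %sample) {
    my $ch = $sample{$name};
    $result{$name} = {
        cntrl => ($ch =~ /\p{Cntrl}/ ? 1 : 0),
        print => ($ch =~ /\p{Print}/ ? 1 : 0),
    };
    print "$name: Cntrl=$result{$name}{cntrl} Print=$result{$name}{print}\n";
}
```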

C<\p{Decomposition_Type=Canonical}> now includes the Hangul syllables.

The Numeric_Type property has been extended to include the Unihan
characters.

There is a new Perl extension, the 'Present_In', or simply 'In',
property. This is an extension of the Unicode Age property:
C<\p{In=5.0}> matches any code point whose usage has been determined
I<as of> Unicode version 5.0, whereas C<\p{Age=5.0}> matches only code points
added in I<precisely> version 5.0.
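
The difference between the two properties can be seen by testing one code
point that predates Unicode 5.0 and one that was introduced in it. This is a
sketch under the assumption that U+1B00 (BALINESE SIGN ULU RICEM) is a fair
representative of the characters added in 5.0.

```perl
use strict;
use warnings;

my $latin_a  = "A";          # assigned long before Unicode 5.0
my $balinese = "\x{1B00}";   # BALINESE SIGN ULU RICEM, added in Unicode 5.0

my %seen;
for my $pair ([ 'U+0041', $latin_a ], [ 'U+1B00', $balinese ]) {
    my ($label, $ch) = @$pair;
    my $in  = $ch =~ /\p{In=5.0}/  ? 1 : 0;   # present as of 5.0
    my $age = $ch =~ /\p{Age=5.0}/ ? 1 : 0;   # introduced exactly in 5.0
    $seen{$label} = { in => $in, age => $age };
    print "$label: In=5.0 -> $in, Age=5.0 -> $age\n";
}
```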

A number of properties did not have the correct values for unassigned
code points. This is now fixed. The affected properties are
Bidi_Class, East_Asian_Width, Joining_Type, Decomposition_Type,
Hangul_Syllable_Type, Numeric_Type, and Line_Break.

The Default_Ignorable_Code_Point, ID_Continue, and ID_Start properties
have been updated to their current Unicode definitions.

Certain properties that are supposed to be Unicode internal-only were
erroneously exposed by previous Perls. Use of these in regular
expressions will now generate a deprecation warning, if such warnings
are enabled. The properties are: Other_Alphabetic,
Other_Default_Ignorable_Code_Point, Other_Grapheme_Extend,
Other_ID_Continue, Other_ID_Start, Other_Lowercase, Other_Math, and
Other_Uppercase.

An installation can now fairly easily change which Unicode properties
Perl understands. As mentioned above, certain properties are by default
turned off. These include all the Unihan properties (which should be
accessible via the CPAN module Unicode::Unihan) and any deprecated or
Unicode internal-only property that Perl has never exposed.

The generated files in the C<lib/unicore/To> directory are now more
clearly marked as being stable, directly usable by applications.
New hash entries in them give the format of the normal entries,
which allows for easier machine parsing. Perl can generate files
in this directory for any property, though most are suppressed. An
installation can choose to change which get written. Instructions
are in L<perluniprops>.

=head2 Regular Expressions

U+0FFFF is now a legal character in regular expressions.

=head1 Modules and Pragmata

=head2 Pragmata Changes

=over 4

=item C<constant>

Upgraded from version 1.19 to 1.20.

=item C<diagnostics>

This pragma no longer suppresses C<Use of uninitialized value in range (or flip)> warnings. [perl #71204]

=item C<feature>

Upgraded from version 1.13 to 1.14. Added the C<unicode_strings> feature:

    use feature "unicode_strings";

This feature turns on Unicode semantics for the case-changing operations
(uc/lc/ucfirst/lcfirst) on strings that don't have the internal UTF-8 flag set,
but that contain single-byte characters between 128 and 255.

=item C<legacy>

The experimental C<legacy> pragma, introduced in 5.11.2, has been removed,
and its functionality replaced by the new feature pragma, C<use feature
"unicode_strings">.

=item C<threads>

Upgraded from version 1.74 to 1.75.

=item C<warnings>

Upgraded from version 1.07 to 1.08. Added new C<warnings::fatal_enabled()> function.

=back
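
The effect of the C<unicode_strings> feature on case-changing operations, as
described under C<feature> above, can be sketched as follows. The example
assumes that neither C<use locale> nor any other construct that enables
Unicode string semantics is in effect in the outer scope.

```perl
use strict;
use warnings;

# A one-character string holding LATIN SMALL LETTER E WITH ACUTE (0xE9);
# it is stored as a single byte, so its internal UTF-8 flag is off.
my $e_acute = "\xE9";

# Without the feature (and without locale), uc() leaves the character alone.
my $plain = uc $e_acute;

my $unicode;
{
    use feature 'unicode_strings';
    # Unicode rules now apply: the result is the capital letter (0xC9).
    $unicode = uc $e_acute;
}

printf "without feature: 0x%02X, with feature: 0x%02X\n",
    ord($plain), ord($unicode);
```

Because the feature is lexically scoped, only the call inside the block is
affected.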

=head2 Updated Modules

=over 4

=item C<Archive::Extract>

Upgraded from version 0.34 to 0.36.

=item C<CPAN>

Upgraded from version 1.94_51 to 1.94_53. Includes better bzip2 support,
improved FirstTime experience with auto-selection of CPAN mirrors, proper
handling of modules removed from the Perl core, and an updated 'cpan'
utility script.

=item C<CPANPLUS>

Upgraded from version 0.89_09 to 0.90.

=item C<ExtUtils::MakeMaker>

Upgraded from version 6.55_02 to 6.56. Adds new BUILD_REQUIRES key to
indicate build-only prerequisites. Also adds support for
mingw64 and the new "package NAME VERSION" syntax.

=item C<File::Path>

Upgraded from version 2.08 to 2.08_01.

=item C<Module::Build>

Upgraded from version 0.35_09 to 0.36. Compared to 0.35, this version has a
new 'installdeps' action, supports the PERL_MB_OPT environment variable, adds a
'share_dir' property for L<File::ShareDir> support, supports the "package NAME
VERSION" syntax and has many other enhancements and bug fixes. The
'passthrough' style of Module::Build::Compat has been deprecated.

=item C<Module::CoreList>

Upgraded from version 2.23 to 2.24.

=item C<POSIX>

Upgraded from version 1.18 to 1.19. Error codes for C<getaddrinfo()> and C<getnameinfo()> are now
available.

=item C<Pod::Simple>

Upgraded from version 3.10 to 3.13.

=item C<Safe>

Upgraded from version 2.19 to 2.20.

=back
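
As an illustration of the BUILD_REQUIRES key mentioned under
C<ExtUtils::MakeMaker> above, a F<Makefile.PL> might look roughly like this.
It is a hedged sketch, not taken from any real distribution: the module name,
version and prerequisite versions are all hypothetical.

```perl
use strict;
use warnings;
use ExtUtils::MakeMaker;

WriteMakefile(
    NAME    => 'My::Example',    # hypothetical distribution name
    VERSION => '0.01',           # hypothetical version

    # Needed whenever the module is used.
    PREREQ_PM => {
        'File::Spec' => 0,
    },

    # Needed only while building and testing the distribution.
    BUILD_REQUIRES => {
        'Test::More' => 0,
    },
);
```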

=head1 Utility Changes

=over 4

=item F<perlbug>

No longer reports "Message sent" when it hasn't actually sent the message.

=back

=head1 Changes to Existing Documentation

The Pod specification (L<perlpodspec>) has been updated to bring the
specification in line with modern usage already supported by most Pod systems.
A parameter string may now follow the format name in a "begin/end" region.
Links to URIs with a text description are now allowed. The usage of
C<LE<lt>"section"E<gt>> has been marked as deprecated.

L<if.pm|if> has been documented in L<perlfunc/use> as a means to get
conditional loading of modules despite the implicit BEGIN block around C<use>.
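
A minimal sketch of conditional loading with the C<if> pragma follows. The
condition and the module chosen (C<List::Util>) are assumptions made only so
that the example runs on any reasonably recent perl.

```perl
use strict;
use warnings;

# "use" runs at compile time, so wrapping it in a run-time "if" block
# would not make the load conditional. The "if" pragma takes the
# condition as its first argument instead.
use if $] >= 5.008, 'List::Util' => qw(max);

# On any perl where the condition held, max() was imported above.
my $largest = max(3, 9, 4);
print "largest: $largest\n";
```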

=head1 Installation and Configuration Improvements

=head2 Testing improvements

=over 4

=item It's now possible to override C<PERL5OPT> and friends in F<t/TEST>

=back

=head2 Platform Specific Changes

=over 4

=item Win32

=over 4

=item *

Always add a manifest resource to C<perl.exe> to specify the <trustInfo>
settings for Windows Vista and later. Without this setting Windows
will treat C<perl.exe> as a legacy application and apply various
heuristics like redirecting access to protected file system areas
(like the "Program Files" folder) to the user's "VirtualStore"
instead of generating a proper "permission denied" error.

For VC8 and VC9 this manifest setting is automatically generated by
the compiler/linker (together with the binding information for their
respective runtime libraries); for all other compilers we need to
embed the manifest resource explicitly in the external resource file.

This change also requests the Microsoft Common-Controls version 6.0
(themed controls introduced in Windows XP) via the dependency list
in the assembly manifest. For VC8 and VC9 this is specified using the
C</manifestdependency> linker command-line option instead.

=back

=item cygwin

=over 4

=item Enable IPv6 support on cygwin 1.7 and newer

=back

=item OpenVMS

=over 4

=item Make -UDEBUGGING the default on VMS for 5.12.0.

This matches what has long been the default everywhere else. Command-line
selection of -UDEBUGGING and -DDEBUGGING now also works in
configure.com; previously the only way to turn it off was by saying
no in answer to the interactive question.

=back

=back


=head1 Selected Bug Fixes

=over 4

=item *

Ensure that pp_qr returns a new regexp SV each time. Resolves RT #69852.

Instead of returning a(nother) reference to the (pre-compiled) regexp in the
optree, use reg_temp_copy() to create a copy of it, and return a reference to
that. This resolves issues about Regexp::DESTROY not being called in a timely
fashion (the original bug tracked by RT #69852), as well as bugs related to
blessing regexps, and of assigning to regexps, as described in correspondence
added to the ticket.

It transpires that we also need to undo the SvPVX() sharing when cloning a
Regexp SV under ithreads, because mother_re is set to NULL, instead of a cloned
copy of the mother_re. This change might fix bugs with regexps and threads in
certain other situations, but as yet neither tests nor bug reports have
indicated any problems, so it might not actually be an edge case that it's
possible to reach.

=item *

Several compilation errors and segfaults when perl was built with C<-Dmad> were fixed.

=item *

Fixes for lexer API changes in 5.11.2 which broke NYTProf's savesrc option.

=item *

C<-t> now returns TRUE only for file handles connected to a TTY.

The Microsoft C version of isatty() returns TRUE for all
character mode devices, including the /dev/null style "nul"
device and printers like "lpt1".

=item *

Fixed a regression caused by commit fafafbaf which caused a panic during parameter passing. [perl #70171]

=item *

On systems which support in-place editing without backup files, C<-i'*'> now works as the documentation says it does. [perl #70802]

=item *

Saving and restoring magic flags no longer loses the readonly flag.

=item *

The malformed syntax C<grep EXPR LIST> (note the missing comma) no longer
causes abrupt and total failure.

=item *

Regular expressions compiled with C<qr{}> literals properly set C<$'> when
matching again.

=item *

Using named subroutines with C<sort> should no longer lead to bus errors. [perl
#71076]

=item *

Numerous bugfixes catch small issues caused by the recently-added Lexer API.

=item *

Smart match against C<@_> sometimes gave false negatives. [perl #71078]

=item *

C<$@> may now be assigned a read-only value (without error or busting the stack).

=item *

C<sort> called recursively from within an active comparison subroutine no longer causes a bus error if run multiple times. [perl #71076]

=back
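
The first fix in the list above (a new regexp SV from each evaluation of
C<qr//>) can be observed from Perl code. This is only a sketch of one visible
symptom, namely blessing; the class name C<My::Regexp> is hypothetical.

```perl
use strict;
use warnings;

# Every call evaluates the same qr// op in the optree.
sub make_re { return qr/ab+c/ }

my $first  = make_re();
my $second = make_re();

# Reblessing one result must not leak into the other.
bless $first, 'My::Regexp';    # hypothetical class name

print "first:  ", ref($first),  "\n";
print "second: ", ref($second), "\n";
print "second still matches\n" if "xabbc" =~ $second;
```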

=head1 New or Changed Diagnostics

=over 4

=item *

C<split> now warns when called in void context.

=item *

C<printf>-style functions called with too few arguments will now issue the warning C<"Missing argument in %s">. [perl #71000]

=back
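
The C<"Missing argument in %s"> diagnostic listed above can be triggered and
captured as in the following sketch. The C<$SIG{__WARN__}> handler is used
here only to collect the warning text for display.

```perl
use strict;
use warnings;

my @caught;
local $SIG{__WARN__} = sub { push @caught, $_[0] };

# Two conversions but only one argument: the second %s has nothing to format.
my $text = sprintf "%s and %s", "first";

print "formatted: '$text'\n";
print "warning:   $_" for @caught;
```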

=head1 New Tests

Many modules updated from CPAN incorporate new tests.

=over 4

=item t/comp/final_line_num.t

See if line numbers are correct at EOF

=item t/comp/form_scope.t

See if format scoping works

=item t/comp/line_debug.t

See if @{"_<$file"} works

=item t/op/filetest_t.t

See if -t file test works

=item t/op/qr.t

See if qr works

=item t/op/utf8cache.t

Tests malfunctions of utf8 cache

=item t/re/uniprops.t

Test Unicode \p{} regex constructs

=back

=head1 Deprecations

The following items are now deprecated.

=over 4

=item Use of "goto" to jump into a construct is deprecated

Using C<goto> to jump from an outer scope into an inner
scope is now deprecated. This rare use case was causing
problems in the implementation of scopes.

=back

=head1 Acknowledgements

Perl 5.11.3 represents approximately one month of development since
Perl 5.11.2 and contains 61407 lines of changes across 396 files
from 40 authors and committers:

Abigail, Alex Davies, Alexandr Ciornii, Andrew Rodland, Andy
Dougherty, Bram, brian d foy, Chip Salzenberg, Chris Williams, Craig
A. Berry, Daniel Frederick Crisman, David Golden, Dennis Kaarsemaker,
Eric Brine, Father Chrysostomos, Gene Sullivan, Gerard Goossen, H.
Merijn Brand, Hugo van der Sanden, Jan Dubois, Jerry D. Hedden,
Jesse Vincent, Jim Cromie, Karl Williamson, Leon Brocard, Max
Maischein, Michael Breen, Moritz Lenz, Nicholas Clark, Rafael
Garcia-Suarez, Reini Urban, Ricardo Signes, Stepan Kasal, Steve
Hay, Steve Peters, Tim Bunce, Tony Cook, Vincent Pit and Zefram.

Many of the changes included in this version originated in the CPAN
modules included in Perl's core. We're grateful to the entire CPAN
community for helping Perl to flourish.

=head1 Reporting Bugs

If you find what you think is a bug, you might check the articles
recently posted to the comp.lang.perl.misc newsgroup and the perl
bug database at http://rt.perl.org/perlbug/ . There may also be
information at http://www.perl.org/ , the Perl Home Page.

If you believe you have an unreported bug, please run the B<perlbug>
program included with your release. Be sure to trim your bug down
to a tiny but sufficient test case. Your bug report, along with the
output of C<perl -V>, will be sent off to perlbug@perl.org to be
analysed by the Perl porting team.

If the bug you are reporting has security implications, which make it
inappropriate to send to a publicly archived mailing list, then please send
it to perl5-security-report@perl.org. This points to a closed subscription
unarchived mailing list, which includes all the core committers, who will be
able to help assess the impact of issues, figure out a resolution, and help
co-ordinate the release of patches to mitigate or fix the problem across all
platforms on which Perl is supported. Please only use this address for
security issues in the Perl core, not for modules independently
distributed on CPAN.

=head1 SEE ALSO

The F<Changes> file for an explanation of how to view exhaustive details
on what changed.

The F<INSTALL> file for how to build Perl.

The F<README> file for general stuff.

The F<Artistic> and F<Copying> files for copyright information.

=cut