Make L<perltrap> refer to L<perldelta>
[p5sagit/p5-mst-13.2.git] / pod / perltrap.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perltrap - Perl traps for the unwary
4
5=head1 DESCRIPTION
6
cb1a09d0 7The biggest trap of all is forgetting to use the B<-w> switch; see
8L<perlrun>. The second biggest trap is not making your entire program
daff0e37 9runnable under C<use strict>. The third biggest trap is not reading
10the list of changes in this version of Perl; see L<perldelta>.
a0d0e21e 11
12=head2 Awk Traps
13
14Accustomed B<awk> users should take special note of the following:
15
16=over 4
17
18=item *
19
20The English module, loaded via
21
22 use English;
23
54310121 24allows you to refer to special variables (like C<$/>) with names (like
25C<$RS>), as though they were in B<awk>; see L<perlvar> for details.
a0d0e21e 26
27=item *
28
29Semicolons are required after all simple statements in Perl (except
30at the end of a block). Newline is not a statement delimiter.
31
32=item *
33
34Curly brackets are required on C<if>s and C<while>s.
35
36=item *
37
38Variables begin with "$" or "@" in Perl.
39
40=item *
41
42Arrays index from 0. Likewise string positions in substr() and
43index().
44
45=item *
46
47You have to decide whether your array has numeric or string indices.
48
49=item *
50
aa689395 51Hash values do not spring into existence upon mere reference.
a0d0e21e 52
53=item *
54
55You have to decide whether you want to use string or numeric
56comparisons.
57
58=item *
59
60Reading an input line does not split it for you. You get to split it
54310121 61to an array yourself. And the split() operator has different
62arguments than B<awk>'s.
a0d0e21e 63
64=item *
65
66The current input line is normally in $_, not $0. It generally does
67not have the newline stripped. ($0 is the name of the program
68executed.) See L<perlvar>.
69
70=item *
71
8b0a4b75 72$E<lt>I<digit>E<gt> does not refer to fields--it refers to substrings matched
73by the last match pattern.
a0d0e21e 74
75=item *
76
77The print() statement does not add field and record separators unless
8b0a4b75 78you set C<$,> and C<$\>. You can set $OFS and $ORS if you're using
a0d0e21e 79the English module.
80
81=item *
82
83You must open your files before you print to them.
84
85=item *
86
87The range operator is "..", not comma. The comma operator works as in
88C.
89
90=item *
91
92The match operator is "=~", not "~". ("~" is the one's complement
93operator, as in C.)
94
95=item *
96
97The exponentiation operator is "**", not "^". "^" is the XOR
98operator, as in C. (You know, one could get the feeling that B<awk> is
99basically incompatible with C.)
100
101=item *
102
103The concatenation operator is ".", not the null string. (Using the
5f05dabc 104null string would render C</pat/ /pat/> unparsable, because the third slash
105would be interpreted as a division operator--the tokenizer is in fact
8b0a4b75 106slightly context sensitive for operators like "/", "?", and "E<gt>".
a0d0e21e 107And in fact, "." itself can be the beginning of a number.)
108
109=item *
110
111The C<next>, C<exit>, and C<continue> keywords work differently.
112
113=item *
114
115
116The following variables work differently:
117
118 Awk Perl
119 ARGC $#ARGV or scalar @ARGV
120 ARGV[0] $0
121 FILENAME $ARGV
122 FNR $. - something
123 FS (whatever you like)
124 NF $#Fld, or some such
125 NR $.
126 OFMT $#
127 OFS $,
128 ORS $\
129 RLENGTH length($&)
130 RS $/
131 RSTART length($`)
132 SUBSEP $;
133
134=item *
135
136You cannot set $RS to a pattern, only a string.
137
138=item *
139
140When in doubt, run the B<awk> construct through B<a2p> and see what it
141gives you.
142
143=back
144
145=head2 C Traps
146
147Cerebral C programmers should take note of the following:
148
149=over 4
150
151=item *
152
153Curly brackets are required on C<if>'s and C<while>'s.
154
155=item *
156
157You must use C<elsif> rather than C<else if>.
158
159=item *
160
54310121 161The C<break> and C<continue> keywords from C become in
a0d0e21e 162Perl C<last> and C<next>, respectively.
163Unlike in C, these do I<NOT> work within a C<do { } while> construct.
164
165=item *
166
167There's no switch statement. (But it's easy to build one on the fly.)
168
169=item *
170
171Variables begin with "$" or "@" in Perl.
172
173=item *
174
6dbacca0 175C<printf()> does not implement the "*" format for interpolating
a0d0e21e 176field widths, but it's trivial to use interpolation of double-quoted
177strings to achieve the same effect.
178
179=item *
180
181Comments begin with "#", not "/*".
182
183=item *
184
185You can't take the address of anything, although a similar operator
5f05dabc 186in Perl is the backslash, which creates a reference.
a0d0e21e 187
188=item *
189
4633a7c4 190C<ARGV> must be capitalized. C<$ARGV[0]> is C's C<argv[1]>, and C<argv[0]>
191ends up in C<$0>.
a0d0e21e 192
193=item *
194
195System calls such as link(), unlink(), rename(), etc. return nonzero for
196success, not 0.
197
198=item *
199
200Signal handlers deal with signal names, not numbers. Use C<kill -l>
201to find their names on your system.
202
203=back
204
205=head2 Sed Traps
206
207Seasoned B<sed> programmers should take note of the following:
208
209=over 4
210
211=item *
212
213Backreferences in substitutions use "$" rather than "\".
214
215=item *
216
217The pattern matching metacharacters "(", ")", and "|" do not have backslashes
218in front.
219
220=item *
221
222The range operator is C<...>, rather than comma.
223
224=back
225
226=head2 Shell Traps
227
228Sharp shell programmers should take note of the following:
229
230=over 4
231
232=item *
233
54310121 234The backtick operator does variable interpolation without regard to
a0d0e21e 235the presence of single quotes in the command.
236
237=item *
238
54310121 239The backtick operator does no translation of the return value, unlike B<csh>.
a0d0e21e 240
241=item *
242
243Shells (especially B<csh>) do several levels of substitution on each
5f05dabc 244command line. Perl does substitution in only certain constructs
54310121 245such as double quotes, backticks, angle brackets, and search patterns.
a0d0e21e 246
247=item *
248
249Shells interpret scripts a little bit at a time. Perl compiles the
250entire program before executing it (except for C<BEGIN> blocks, which
251execute at compile time).
252
253=item *
254
255The arguments are available via @ARGV, not $1, $2, etc.
256
257=item *
258
259The environment is not automatically made available as separate scalar
260variables.
261
262=back
263
264=head2 Perl Traps
265
266Practicing Perl Programmers should take note of the following:
267
268=over 4
269
270=item *
271
272Remember that many operations behave differently in a list
273context than they do in a scalar one. See L<perldata> for details.
274
275=item *
276
68dc0745 277Avoid barewords if you can, especially all lowercase ones.
54310121 278You can't tell by just looking at it whether a bareword is
279a function or a string. By using quotes on strings and
5f05dabc 280parentheses on function calls, you won't ever get them confused.
a0d0e21e 281
282=item *
283
54310121 284You cannot discern from mere inspection which builtins
285are unary operators (like chop() and chdir())
a0d0e21e 286and which are list operators (like print() and unlink()).
5f05dabc 287(User-defined subroutines can be B<only> list operators, never
a0d0e21e 288unary ones.) See L<perlop>.
289
290=item *
291
748a9306 292People have a hard time remembering that some functions
a0d0e21e 293default to $_, or @ARGV, or whatever, but that others which
54310121 294you might expect to do not.
a0d0e21e 295
6dbacca0 296=item *
a0d0e21e 297
8b0a4b75 298The E<lt>FHE<gt> construct is not the name of the filehandle, it is a readline
5f05dabc 299operation on that handle. The data read is assigned to $_ only if the
748a9306 300file read is the sole condition in a while loop:
301
302 while (<FH>) { }
54310121 303 while (defined($_ = <FH>)) { }..
748a9306 304 <FH>; # data discarded!
305
6dbacca0 306=item *
748a9306 307
a0d0e21e 308Remember not to use "C<=>" when you need "C<=~>";
309these two constructs are quite different:
310
311 $x = /foo/;
312 $x =~ /foo/;
313
314=item *
315
54310121 316The C<do {}> construct isn't a real loop that you can use
a0d0e21e 317loop control on.
318
319=item *
320
54310121 321Use C<my()> for local variables whenever you can get away with
322it (but see L<perlform> for where you can't).
323Using C<local()> actually gives a local value to a global
a0d0e21e 324variable, which leaves you open to unforeseen side-effects
325of dynamic scoping.
326
c07a80fd 327=item *
328
329If you localize an exported variable in a module, its exported value will
330not change. The local name becomes an alias to a new value but the
331external name is still an alias for the original.
332
a0d0e21e 333=back
334
5f05dabc 335=head2 Perl4 to Perl5 Traps
a0d0e21e 336
54310121 337Practicing Perl4 Programmers should take note of the following
6dbacca0 338Perl4-to-Perl5 specific traps.
339
340They're crudely ordered according to the following list:
a0d0e21e 341
342=over 4
343
6dbacca0 344=item Discontinuance, Deprecation, and BugFix traps
a0d0e21e 345
6dbacca0 346Anything that's been fixed as a perl4 bug, removed as a perl4 feature
347or deprecated as a perl4 feature with the intent to encourage usage of
348some other perl5 feature.
a0d0e21e 349
6dbacca0 350=item Parsing Traps
748a9306 351
6dbacca0 352Traps that appear to stem from the new parser.
a0d0e21e 353
6dbacca0 354=item Numerical Traps
a0d0e21e 355
6dbacca0 356Traps having to do with numerical or mathematical operators.
a0d0e21e 357
6dbacca0 358=item General data type traps
a0d0e21e 359
6dbacca0 360Traps involving perl standard data types.
a0d0e21e 361
6dbacca0 362=item Context Traps - scalar, list contexts
363
364Traps related to context within lists, scalar statements/declarations.
365
366=item Precedence Traps
367
368Traps related to the precedence of parsing, evaluation, and execution of
369code.
370
371=item General Regular Expression Traps using s///, etc.
372
373Traps related to the use of pattern matching.
374
375=item Subroutine, Signal, Sorting Traps
376
377Traps related to the use of signals and signal handlers, general subroutines,
378and sorting, along with sorting subroutines.
379
380=item OS Traps
381
382OS-specific traps.
383
384=item DBM Traps
385
386Traps specific to the use of C<dbmopen()>, and specific dbm implementations.
387
388=item Unclassified Traps
389
390Everything else.
391
392=back
393
394If you find an example of a conversion trap that is not listed here,
9607fc9c 395please submit it to Bill Middleton <F<wjm@best.com>> for inclusion.
396Also note that at least some of these can be caught with B<-w>.
6dbacca0 397
398=head2 Discontinuance, Deprecation, and BugFix traps
399
400Anything that has been discontinued, deprecated, or fixed as
54310121 401a bug from perl4.
a0d0e21e 402
6dbacca0 403=over 4
404
54310121 405=item * Discontinuance
6dbacca0 406
407Symbols starting with "_" are no longer forced into package main, except
408for C<$_> itself (and C<@_>, etc.).
409
410 package test;
411 $_legacy = 1;
cb1a09d0 412
6dbacca0 413 package main;
414 print "\$_legacy is ",$_legacy,"\n";
54310121 415
6dbacca0 416 # perl4 prints: $_legacy is 1
417 # perl5 prints: $_legacy is
418
54310121 419=item * Deprecation
6dbacca0 420
421Double-colon is now a valid package separator in a variable name. Thus these
5f05dabc 422behave differently in perl4 vs. perl5, because the packages don't exist.
6dbacca0 423
424 $a=1;$b=2;$c=3;$var=4;
425 print "$a::$b::$c ";
cb1a09d0 426 print "$var::abc::xyz\n";
6dbacca0 427
428 # perl4 prints: 1::2::3 4::abc::xyz
429 # perl5 prints: 3
cb1a09d0 430
6dbacca0 431Given that C<::> is now the preferred package delimiter, it is debatable
432whether this should be classed as a bug or not.
433(The older package delimiter, ' ,is used here)
cb1a09d0 434
6dbacca0 435 $x = 10 ;
436 print "x=${'x}\n" ;
54310121 437
6dbacca0 438 # perl4 prints: x=10
439 # perl5 prints: Can't find string terminator "'" anywhere before EOF
a0d0e21e 440
54310121 441Also see precedence traps, for parsing C<$:>.
a0d0e21e 442
6dbacca0 443=item * BugFix
a0d0e21e 444
6dbacca0 445The second and third arguments of C<splice()> are now evaluated in scalar
446context (as the Camel says) rather than list context.
a0d0e21e 447
6dbacca0 448 sub sub1{return(0,2) } # return a 2-elem array
449 sub sub2{ return(1,2,3)} # return a 3-elem array
54310121 450 @a1 = ("a","b","c","d","e");
6dbacca0 451 @a2 = splice(@a1,&sub1,&sub2);
452 print join(' ',@a2),"\n";
54310121 453
6dbacca0 454 # perl4 prints: a b
54310121 455 # perl5 prints: c d e
a0d0e21e 456
54310121 457=item * Discontinuance
a0d0e21e 458
6dbacca0 459You can't do a C<goto> into a block that is optimized away. Darn.
a0d0e21e 460
6dbacca0 461 goto marker1;
a0d0e21e 462
54310121 463 for(1){
6dbacca0 464 marker1:
465 print "Here I is!\n";
54310121 466 }
467
6dbacca0 468 # perl4 prints: Here I is!
469 # perl5 dumps core (SEGV)
470
54310121 471=item * Discontinuance
6dbacca0 472
473It is no longer syntactically legal to use whitespace as the name
474of a variable, or as a delimiter for any kind of quote construct.
54310121 475Double darn.
6dbacca0 476
477 $a = ("foo bar");
478 $b = q baz ;
479 print "a is $a, b is $b\n";
54310121 480
6dbacca0 481 # perl4 prints: a is foo bar, b is baz
54310121 482 # perl5 errors: Bareword found where operator expected
5e378fdf 483
6dbacca0 484=item * Discontinuance
485
486The archaic while/if BLOCK BLOCK syntax is no longer supported.
487
488 if { 1 } {
489 print "True!";
490 }
491 else {
492 print "False!";
493 }
54310121 494
6dbacca0 495 # perl4 prints: True!
496 # perl5 errors: syntax error at test.pl line 1, near "if {"
497
498=item * BugFix
499
500The C<**> operator now binds more tightly than unary minus.
501It was documented to work this way before, but didn't.
502
503 print -4**2,"\n";
54310121 504
6dbacca0 505 # perl4 prints: 16
506 # perl5 prints: -16
507
54310121 508=item * Discontinuance
6dbacca0 509
510The meaning of C<foreach{}> has changed slightly when it is iterating over a
511list which is not an array. This used to assign the list to a
512temporary array, but no longer does so (for efficiency). This means
513that you'll now be iterating over the actual values, not over copies of
514the values. Modifications to the loop variable can change the original
515values.
516
517 @list = ('ab','abc','bcd','def');
518 foreach $var (grep(/ab/,@list)){
519 $var = 1;
520 }
521 print (join(':',@list));
54310121 522
6dbacca0 523 # perl4 prints: ab:abc:bcd:def
524 # perl5 prints: 1:1:bcd:def
525
526To retain Perl4 semantics you need to assign your list
54310121 527explicitly to a temporary array and then iterate over that. For
6dbacca0 528example, you might need to change
529
530 foreach $var (grep(/ab/,@list)){
531
532to
533
534 foreach $var (@tmp = grep(/ab/,@list)){
535
536Otherwise changing $var will clobber the values of @list. (This most often
537happens when you use C<$_> for the loop variable, and call subroutines in
538the loop that don't properly localize C<$_>.)
539
5e378fdf 540=item * Discontinuance
541
542C<split> with no arguments now behaves like C<split ' '> (which doesn't
543return an initial null field if $_ starts with whitespace), it used to
544behave like C<split /\s+/> (which does).
545
546 $_ = ' hi mom';
547 print join(':', split);
548
549 # perl4 prints: :hi:mom
550 # perl5 prints: hi:mom
551
55497cff 552=item * BugFix
553
9607fc9c 554Perl 4 would ignore any text which was attached to an B<-e> switch,
55497cff 555always taking the code snippet from the following arg. Additionally, it
9607fc9c 556would silently accept an B<-e> switch without a following arg. Both of
55497cff 557these behaviors have been fixed.
558
559 perl -e'print "attached to -e"' 'print "separate arg"'
54310121 560
55497cff 561 # perl4 prints: separate arg
562 # perl5 prints: attached to -e
54310121 563
55497cff 564 perl -e
565
566 # perl4 prints:
567 # perl5 dies: No code specified for -e.
568
569=item * Discontinuance
570
571In Perl 4 the return value of C<push> was undocumented, but it was
572actually the last value being pushed onto the target list. In Perl 5
573the return value of C<push> is documented, but has changed, it is the
574number of elements in the resulting list.
575
576 @x = ('existing');
577 print push(@x, 'first new', 'second new');
54310121 578
55497cff 579 # perl4 prints: second new
580 # perl5 prints: 3
581
68dc0745 582=item * Discontinuance
583
584In Perl 4 (and versions of Perl 5 before 5.004), C<'\r'> characters in
585Perl code were silently allowed, although they could cause (mysterious!)
586failures in certain constructs, particularly here documents. Now,
587C<'\r'> characters cause an immediate fatal error. (Note: In this
588example, the notation B<\015> represents the incorrect line
589ending. Depending upon your text viewer, it will look different.)
590
591 print "foo";\015
592 print "bar";
593
594 # perl4 prints: foobar
595 # perl5.003 prints: foobar
596 # perl5.004 dies: Illegal character \015 (carriage return)
597
598See L<perldiag> for full details.
599
6dbacca0 600=item * Deprecation
601
602Some error messages will be different.
603
54310121 604=item * Discontinuance
6dbacca0 605
606Some bugs may have been inadvertently removed. :-)
607
608=back
609
610=head2 Parsing Traps
611
612Perl4-to-Perl5 traps from having to do with parsing.
613
614=over 4
615
616=item * Parsing
617
618Note the space between . and =
619
620 $string . = "more string";
621 print $string;
54310121 622
6dbacca0 623 # perl4 prints: more string
624 # perl5 prints: syntax error at - line 1, near ". ="
625
626=item * Parsing
627
628Better parsing in perl 5
629
630 sub foo {}
631 &foo
632 print("hello, world\n");
54310121 633
6dbacca0 634 # perl4 prints: hello, world
635 # perl5 prints: syntax error
636
637=item * Parsing
638
639"if it looks like a function, it is a function" rule.
640
641 print
642 ($foo == 1) ? "is one\n" : "is zero\n";
54310121 643
6dbacca0 644 # perl4 prints: is zero
645 # perl5 warns: "Useless use of a constant in void context" if using -w
646
647=back
648
649=head2 Numerical Traps
650
651Perl4-to-Perl5 traps having to do with numerical operators,
652operands, or output from same.
653
654=over 5
655
656=item * Numerical
657
658Formatted output and significant digits
659
660 print 7.373504 - 0, "\n";
54310121 661 printf "%20.18f\n", 7.373504 - 0;
662
6dbacca0 663 # Perl4 prints:
664 7.375039999999996141
665 7.37503999999999614
54310121 666
6dbacca0 667 # Perl5 prints:
668 7.373504
669 7.37503999999999614
670
671=item * Numerical
672
5f05dabc 673This specific item has been deleted. It demonstrated how the auto-increment
5e378fdf 674operator would not catch when a number went over the signed int limit. Fixed
a6006777 675in version 5.003_04. But always be wary when using large integers.
676If in doubt:
6dbacca0 677
5e378fdf 678 use Math::BigInt;
6dbacca0 679
54310121 680=item * Numerical
6dbacca0 681
682Assignment of return values from numeric equality tests
683does not work in perl5 when the test evaluates to false (0).
684Logical tests now return an null, instead of 0
a6006777 685
6dbacca0 686 $p = ($test == 1);
687 print $p,"\n";
a6006777 688
6dbacca0 689 # perl4 prints: 0
690 # perl5 prints:
691
dc848c6f 692Also see L<"General Regular Expression Traps using s///, etc.">
693for another example of this new feature...
6dbacca0 694
695=back
696
697=head2 General data type traps
698
699Perl4-to-Perl5 traps involving most data-types, and their usage
700within certain expressions and/or context.
701
702=over 5
703
704=item * (Arrays)
705
706Negative array subscripts now count from the end of the array.
707
708 @a = (1, 2, 3, 4, 5);
709 print "The third element of the array is $a[3] also expressed as $a[-2] \n";
54310121 710
6dbacca0 711 # perl4 prints: The third element of the array is 4 also expressed as
712 # perl5 prints: The third element of the array is 4 also expressed as 4
713
714=item * (Arrays)
715
716Setting C<$#array> lower now discards array elements, and makes them
717impossible to recover.
718
54310121 719 @a = (a,b,c,d,e);
6dbacca0 720 print "Before: ",join('',@a);
54310121 721 $#a =1;
6dbacca0 722 print ", After: ",join('',@a);
723 $#a =3;
724 print ", Recovered: ",join('',@a),"\n";
54310121 725
6dbacca0 726 # perl4 prints: Before: abcde, After: ab, Recovered: abcd
727 # perl5 prints: Before: abcde, After: ab, Recovered: ab
728
729=item * (Hashes)
730
731Hashes get defined before use
732
54310121 733 local($s,@a,%h);
6dbacca0 734 die "scalar \$s defined" if defined($s);
735 die "array \@a defined" if defined(@a);
736 die "hash \%h defined" if defined(%h);
54310121 737
6dbacca0 738 # perl4 prints:
739 # perl5 dies: hash %h defined
740
741=item * (Globs)
742
743glob assignment from variable to variable will fail if the assigned
744variable is localized subsequent to the assignment
745
746 @a = ("This is Perl 4");
747 *b = *a;
748 local(@a);
749 print @b,"\n";
54310121 750
6dbacca0 751 # perl4 prints: This is Perl 4
752 # perl5 prints:
54310121 753
6dbacca0 754 # Another example
54310121 755
6dbacca0 756 *fred = *barney; # fred is aliased to barney
757 @barney = (1, 2, 4);
758 # @fred;
759 print "@fred"; # should print "1, 2, 4"
54310121 760
6dbacca0 761 # perl4 prints: 1 2 4
9607fc9c 762 # perl5 prints: In string, @fred now must be written as \@fred
5e378fdf 763
6dbacca0 764=item * (Scalar String)
765
766Changes in unary negation (of strings)
767This change effects both the return value and what it
768does to auto(magic)increment.
769
770 $x = "aaa";
771 print ++$x," : ";
772 print -$x," : ";
773 print ++$x,"\n";
54310121 774
6dbacca0 775 # perl4 prints: aab : -0 : 1
776 # perl5 prints: aab : -aab : aac
777
778=item * (Constants)
779
780perl 4 lets you modify constants:
781
782 $foo = "x";
783 &mod($foo);
784 for ($x = 0; $x < 3; $x++) {
785 &mod("a");
786 }
787 sub mod {
788 print "before: $_[0]";
789 $_[0] = "m";
790 print " after: $_[0]\n";
791 }
54310121 792
6dbacca0 793 # perl4:
794 # before: x after: m
795 # before: a after: m
796 # before: m after: m
797 # before: m after: m
54310121 798
6dbacca0 799 # Perl5:
800 # before: x after: m
801 # Modification of a read-only value attempted at foo.pl line 12.
802 # before: a
803
804=item * (Scalars)
805
806The behavior is slightly different for:
807
808 print "$x", defined $x
54310121 809
6dbacca0 810 # perl 4: 1
811 # perl 5: <no output, $x is not called into existence>
812
813=item * (Variable Suicide)
814
815Variable suicide behavior is more consistent under Perl 5.
aa689395 816Perl5 exhibits the same behavior for hashes and scalars,
5f05dabc 817that perl4 exhibits for only scalars.
6dbacca0 818
819 $aGlobal{ "aKey" } = "global value";
820 print "MAIN:", $aGlobal{"aKey"}, "\n";
821 $GlobalLevel = 0;
822 &test( *aGlobal );
823
824 sub test {
825 local( *theArgument ) = @_;
826 local( %aNewLocal ); # perl 4 != 5.001l,m
54310121 827 $aNewLocal{"aKey"} = "this should never appear";
6dbacca0 828 print "SUB: ", $theArgument{"aKey"}, "\n";
829 $aNewLocal{"aKey"} = "level $GlobalLevel"; # what should print
830 $GlobalLevel++;
831 if( $GlobalLevel<4 ) {
832 &test( *aNewLocal );
833 }
834 }
54310121 835
6dbacca0 836 # Perl4:
837 # MAIN:global value
838 # SUB: global value
839 # SUB: level 0
840 # SUB: level 1
841 # SUB: level 2
54310121 842
6dbacca0 843 # Perl5:
844 # MAIN:global value
845 # SUB: global value
846 # SUB: this should never appear
847 # SUB: this should never appear
848 # SUB: this should never appear
849
84dc3c4d 850=back
6dbacca0 851
852=head2 Context Traps - scalar, list contexts
853
854=over 5
855
856=item * (list context)
857
858The elements of argument lists for formats are now evaluated in list
859context. This means you can interpolate list values now.
860
861 @fmt = ("foo","bar","baz");
862 format STDOUT=
863 @<<<<< @||||| @>>>>>
864 @fmt;
865 .
54310121 866 write;
867
6dbacca0 868 # perl4 errors: Please use commas to separate fields in file
869 # perl5 prints: foo bar baz
870
871=item * (scalar context)
872
54310121 873The C<caller()> function now returns a false value in a scalar context
874if there is no caller. This lets library files determine if they're
6dbacca0 875being required.
876
877 caller() ? (print "You rang?\n") : (print "Got a 0\n");
54310121 878
6dbacca0 879 # perl4 errors: There is no caller
880 # perl5 prints: Got a 0
5e378fdf 881
6dbacca0 882=item * (scalar context)
883
884The comma operator in a scalar context is now guaranteed to give a
885scalar context to its arguments.
886
887 @y= ('a','b','c');
888 $x = (1, 2, @y);
889 print "x = $x\n";
54310121 890
6dbacca0 891 # Perl4 prints: x = c # Thinks list context interpolates list
892 # Perl5 prints: x = 3 # Knows scalar uses length of list
893
894=item * (list, builtin)
895
896C<sprintf()> funkiness (array argument converted to scalar array count)
897This test could be added to t/op/sprintf.t
898
899 @z = ('%s%s', 'foo', 'bar');
900 $x = sprintf(@z);
901 if ($x eq 'foobar') {print "ok 2\n";} else {print "not ok 2 '$x'\n";}
54310121 902
6dbacca0 903 # perl4 prints: ok 2
904 # perl5 prints: not ok 2
905
906C<printf()> works fine, though:
907
908 printf STDOUT (@z);
54310121 909 print "\n";
910
6dbacca0 911 # perl4 prints: foobar
912 # perl5 prints: foobar
913
914Probably a bug.
915
916=back
917
918=head2 Precedence Traps
919
920Perl4-to-Perl5 traps involving precedence order.
921
84dc3c4d 922=over 5
923
5e378fdf 924=item * Precedence
925
926LHS vs. RHS when both sides are getting an op.
927
928 @arr = ( 'left', 'right' );
929 $a{shift @arr} = shift @arr;
930 print join( ' ', keys %a );
931
932 # perl4 prints: left
933 # perl5 prints: right
934
935=item * Precedence
6dbacca0 936
937These are now semantic errors because of precedence:
938
939 @list = (1,2,3,4,5);
940 %map = ("a",1,"b",2,"c",3,"d",4);
941 $n = shift @list + 2; # first item in list plus 2
942 print "n is $n, ";
943 $m = keys %map + 2; # number of items in hash plus 2
944 print "m is $m\n";
54310121 945
6dbacca0 946 # perl4 prints: n is 3, m is 6
947 # perl5 errors and fails to compile
948
949=item * Precedence
a0d0e21e 950
4633a7c4 951The precedence of assignment operators is now the same as the precedence
952of assignment. Perl 4 mistakenly gave them the precedence of the associated
953operator. So you now must parenthesize them in expressions like
954
955 /foo/ ? ($a += 2) : ($a -= 2);
a6006777 956
4633a7c4 957Otherwise
958
6dbacca0 959 /foo/ ? $a += 2 : $a -= 2
4633a7c4 960
961would be erroneously parsed as
962
963 (/foo/ ? $a += 2 : $a) -= 2;
964
965On the other hand,
966
54310121 967 $a += /foo/ ? 1 : 2;
4633a7c4 968
969now works as a C programmer would expect.
970
6dbacca0 971=item * Precedence
4633a7c4 972
6dbacca0 973 open FOO || die;
a0d0e21e 974
5f05dabc 975is now incorrect. You need parentheses around the filehandle.
976Otherwise, perl5 leaves the statement as its default precedence:
a0d0e21e 977
6dbacca0 978 open(FOO || die);
54310121 979
6dbacca0 980 # perl4 opens or dies
981 # perl5 errors: Precedence problem: open FOO should be open(FOO)
a0d0e21e 982
6dbacca0 983=item * Precedence
a0d0e21e 984
6dbacca0 985perl4 gives the special variable, C<$:> precedence, where perl5
986treats C<$::> as main C<package>
a0d0e21e 987
6dbacca0 988 $a = "x"; print "$::a";
54310121 989
6dbacca0 990 # perl 4 prints: -:a
991 # perl 5 prints: x
5e378fdf 992
6dbacca0 993=item * Precedence
a0d0e21e 994
54310121 995concatenation precedence over filetest operator?
996
997 -e $foo .= "q"
a0d0e21e 998
6dbacca0 999 # perl4 prints: no output
1000 # perl5 prints: Can't modify -e in concatenation
a0d0e21e 1001
6dbacca0 1002=item * Precedence
a0d0e21e 1003
6dbacca0 1004Assignment to value takes precedence over assignment to key in
1005perl5 when using the shift operator on both sides.
1006
1007 @arr = ( 'left', 'right' );
1008 $a{shift @arr} = shift @arr;
1009 print join( ' ', keys %a );
1010
1011 # perl4 prints: left
1012 # perl5 prints: right
1013
1014=back
1015
1016=head2 General Regular Expression Traps using s///, etc.
1017
1018All types of RE traps.
1019
1020=over 5
1021
1022=item * Regular Expression
1023
1024C<s'$lhs'$rhs'> now does no interpolation on either side. It used to
54310121 1025interpolate C<$lhs> but not C<$rhs>. (And still does not match a literal
6dbacca0 1026'$' in string)
1027
1028 $a=1;$b=2;
1029 $string = '1 2 $a $b';
1030 $string =~ s'$a'$b';
1031 print $string,"\n";
54310121 1032
6dbacca0 1033 # perl4 prints: $b 2 $a $b
1034 # perl5 prints: 1 2 $a $b
1035
1036=item * Regular Expression
a0d0e21e 1037
1038C<m//g> now attaches its state to the searched string rather than the
6dbacca0 1039regular expression. (Once the scope of a block is left for the sub, the
1040state of the searched string is lost)
1041
1042 $_ = "ababab";
1043 while(m/ab/g){
1044 &doit("blah");
1045 }
1046 sub doit{local($_) = shift; print "Got $_ "}
54310121 1047
6dbacca0 1048 # perl4 prints: blah blah blah
1049 # perl5 prints: infinite loop blah...
1050
1051=item * Regular Expression
1052
68dc0745 1053Currently, if you use the C<m//o> qualifier on a regular expression
1054within an anonymous sub, I<all> closures generated from that anonymous
1055sub will use the regular expression as it was compiled when it was used
1056the very first time in any such closure. For instance, if you say
1057
1058 sub build_match {
1059 my($left,$right) = @_;
1060 return sub { $_[0] =~ /$left stuff $right/o; };
1061 }
1062
1063build_match() will always return a sub which matches the contents of
1064C<$left> and C<$right> as they were the I<first> time that build_match()
1065was called, not as they are in the current call.
1066
1067This is probably a bug, and may change in future versions of Perl.
1068
1069=item * Regular Expression
1070
6dbacca0 1071If no parentheses are used in a match, Perl4 sets C<$+> to
1072the whole match, just like C<$&>. Perl5 does not.
1073
1074 "abcdef" =~ /b.*e/;
1075 print "\$+ = $+\n";
54310121 1076
6dbacca0 1077 # perl4 prints: bcde
1078 # perl5 prints:
1079
1080=item * Regular Expression
1081
1082substitution now returns the null string if it fails
1083
1084 $string = "test";
1085 $value = ($string =~ s/foo//);
1086 print $value, "\n";
54310121 1087
6dbacca0 1088 # perl4 prints: 0
1089 # perl5 prints:
1090
1091Also see L<Numerical Traps> for another example of this new feature.
1092
1093=item * Regular Expression
1094
54310121 1095C<s`lhs`rhs`> (using backticks) is now a normal substitution, with no
1096backtick expansion
6dbacca0 1097
1098 $string = "";
1099 $string =~ s`^`hostname`;
1100 print $string, "\n";
54310121 1101
6dbacca0 1102 # perl4 prints: <the local hostname>
1103 # perl5 prints: hostname
1104
1105=item * Regular Expression
1106
1107Stricter parsing of variables used in regular expressions
1108
1109 s/^([^$grpc]*$grpc[$opt$plus$rep]?)//o;
54310121 1110
6dbacca0 1111 # perl4: compiles w/o error
1112 # perl5: with Scalar found where operator expected ..., near "$opt$plus"
1113
1114an added component of this example, apparently from the same script, is
1115the actual value of the s'd string after the substitution.
1116C<[$opt]> is a character class in perl4 and an array subscript in perl5
1117
54310121 1118 $grpc = 'a';
6dbacca0 1119 $opt = 'r';
1120 $_ = 'bar';
1121 s/^([^$grpc]*$grpc[$opt]?)/foo/;
1122 print ;
54310121 1123
6dbacca0 1124 # perl4 prints: foo
1125 # perl5 prints: foobar
1126
1127=item * Regular Expression
1128
1129Under perl5, C<m?x?> matches only once, like C<?x?>. Under perl4, it matched
1130repeatedly, like C</x/> or C<m!x!>.
1131
1132 $test = "once";
1133 sub match { $test =~ m?once?; }
1134 &match();
1135 if( &match() ) {
1136 # m?x? matches more then once
1137 print "perl4\n";
54310121 1138 } else {
6dbacca0 1139 # m?x? matches only once
54310121 1140 print "perl5\n";
6dbacca0 1141 }
54310121 1142
6dbacca0 1143 # perl4 prints: perl4
1144 # perl5 prints: perl5
a0d0e21e 1145
a0d0e21e 1146
44a8e56a 1147=item * Regular Expression
1148
1149Under perl4 and upto version 5.003, a failed C<m//g> match used to
1150reset the internal iterator, so that subsequent C<m//g> match attempts
1151began from the beginning of the string. In perl version 5.004 and later,
1152failed C<m//g> matches do not reset the iterator position (which can be
1153found using the C<pos()> function--see L<perlfunc/pos>).
1154
1155 $test = "foop";
1156 for (1..3) {
1157 print $1 while ($test =~ /(o)/g);
1158 # pos $test = 0; # to get old behavior
1159 }
54310121 1160
44a8e56a 1161 # perl4 prints: oooooo
1162 # perl5.004 prints: oo
1163
1164You may always reset the iterator yourself as shown in the commented line
1165to get the old behavior.
1166
6dbacca0 1167=back
1168
1169=head2 Subroutine, Signal, Sorting Traps
a0d0e21e 1170
6dbacca0 1171The general group of Perl4-to-Perl5 traps having to do with
1172Signals, Sorting, and their related subroutines, as well as
1173general subroutine traps. Includes some OS-Specific traps.
a0d0e21e 1174
6dbacca0 1175=over 5
a0d0e21e 1176
6dbacca0 1177=item * (Signals)
a0d0e21e 1178
6dbacca0 1179Barewords that used to look like strings to Perl will now look like subroutine
1180calls if a subroutine by that name is defined before the compiler sees them.
a0d0e21e 1181
6dbacca0 1182 sub SeeYa { warn"Hasta la vista, baby!" }
1183 $SIG{'TERM'} = SeeYa;
1184 print "SIGTERM is now $SIG{'TERM'}\n";
54310121 1185
6dbacca0 1186 # perl4 prints: SIGTERM is main'SeeYa
1187 # perl5 prints: SIGTERM is now main::1
a0d0e21e 1188
6dbacca0 1189Use B<-w> to catch this one
a0d0e21e 1190
6dbacca0 1191=item * (Sort Subroutine)
a0d0e21e 1192
6dbacca0 1193reverse is no longer allowed as the name of a sort subroutine.
a0d0e21e 1194
6dbacca0 1195 sub reverse{ print "yup "; $a <=> $b }
54310121 1196 print sort reverse a,b,c;
1197
6dbacca0 1198 # perl4 prints: yup yup yup yup abc
54310121 1199 # perl5 prints: abc
a0d0e21e 1200
b996531f 1201=item * warn() won't let you specify a filehandle.
1202
1203Although it _always_ printed to STDERR, warn() would let you specify a
1204filehandle in perl4. With perl5 it does not.
5e378fdf 1205
1206 warn STDERR "Foo!";
1207
1208 # perl4 prints: Foo!
54310121 1209 # perl5 prints: String found where operator expected
5e378fdf 1210
6dbacca0 1211=back
a0d0e21e 1212
6dbacca0 1213=head2 OS Traps
1214
1215=over 5
1216
1217=item * (SysV)
1218
54310121 1219Under HPUX, and some other SysV OSes, one had to reset any signal handler,
1220within the signal handler function, each time a signal was handled with
1221perl4. With perl5, the reset is now done correctly. Any code relying
6dbacca0 1222on the handler _not_ being reset will have to be reworked.
1223
a6006777 1224Since version 5.002, Perl uses sigaction() under SysV.
6dbacca0 1225
1226 sub gotit {
54310121 1227 print "Got @_... ";
1228 }
6dbacca0 1229 $SIG{'INT'} = 'gotit';
54310121 1230
6dbacca0 1231 $| = 1;
1232 $pid = fork;
1233 if ($pid) {
1234 kill('INT', $pid);
1235 sleep(1);
1236 kill('INT', $pid);
54310121 1237 } else {
6dbacca0 1238 while (1) {sleep(10);}
54310121 1239 }
1240
6dbacca0 1241 # perl4 (HPUX) prints: Got INT...
1242 # perl5 (HPUX) prints: Got INT... Got INT...
1243
1244=item * (SysV)
1245
54310121 1246Under SysV OSes, C<seek()> on a file opened to append C<E<gt>E<gt>> now does
1247the right thing w.r.t. the fopen() manpage. e.g., - When a file is opened
6dbacca0 1248for append, it is impossible to overwrite information already in
1249the file.
1250
1251 open(TEST,">>seek.test");
54310121 1252 $start = tell TEST ;
6dbacca0 1253 foreach(1 .. 9){
1254 print TEST "$_ ";
1255 }
1256 $end = tell TEST ;
1257 seek(TEST,$start,0);
1258 print TEST "18 characters here";
54310121 1259
6dbacca0 1260 # perl4 (solaris) seek.test has: 18 characters here
1261 # perl5 (solaris) seek.test has: 1 2 3 4 5 6 7 8 9 18 characters here
a0d0e21e 1262
a0d0e21e 1263
a0d0e21e 1264
6dbacca0 1265=back
a0d0e21e 1266
6dbacca0 1267=head2 Interpolation Traps
a0d0e21e 1268
8b0a4b75 1269Perl4-to-Perl5 traps having to do with how things get interpolated
1270within certain expressions, statements, contexts, or whatever.
1271
6dbacca0 1272=over 5
a0d0e21e 1273
6dbacca0 1274=item * Interpolation
a0d0e21e 1275
6dbacca0 1276@ now always interpolates an array in double-quotish strings.
1277
54310121 1278 print "To: someone@somewhere.com\n";
1279
6dbacca0 1280 # perl4 prints: To:someone@somewhere.com
9607fc9c 1281 # perl5 errors : In string, @somewhere now must be written as \@somewhere
6dbacca0 1282
1283=item * Interpolation
1284
6dbacca0 1285Double-quoted strings may no longer end with an unescaped $ or @.
1286
1287 $foo = "foo$";
1288 $bar = "bar@";
1289 print "foo is $foo, bar is $bar\n";
54310121 1290
6dbacca0 1291 # perl4 prints: foo is foo$, bar is bar@
1292 # perl5 errors: Final $ should be \$ or $name
1293
1294Note: perl5 DOES NOT error on the terminating @ in $bar
1295
1296=item * Interpolation
a0d0e21e 1297
8b0a4b75 1298Perl now sometimes evaluates arbitrary expressions inside braces that occur
1299within double quotes (usually when the opening brace is preceded by C<$>
1300or C<@>).
1301
1302 @www = "buz";
1303 $foo = "foo";
1304 $bar = "bar";
1305 sub foo { return "bar" };
1306 print "|@{w.w.w}|${main'foo}|";
1307
1308 # perl4 prints: |@{w.w.w}|foo|
1309 # perl5 prints: |buz|bar|
1310
1311Note that you can C<use strict;> to ward off such trappiness under perl5.
1312
1313=item * Interpolation
1314
748a9306 1315The construct "this is $$x" used to interpolate the pid at that
6dbacca0 1316point, but now apparently tries to dereference C<$x>. C<$$> by itself still
748a9306 1317works fine, however.
1318
6dbacca0 1319 print "this is $$x\n";
748a9306 1320
6dbacca0 1321 # perl4 prints: this is XXXx (XXX is the current pid)
1322 # perl5 prints: this is
1323
1324=item * Interpolation
1325
54310121 1326Creation of hashes on the fly with C<eval "EXPR"> now requires either both
1327C<$>'s to be protected in the specification of the hash name, or both curlies
6dbacca0 1328to be protected. If both curlies are protected, the result will be compatible
1329with perl4 and perl5. This is a very common practice, and should be changed
1330to use the block form of C<eval{}> if possible.
c07a80fd 1331
6dbacca0 1332 $hashname = "foobar";
1333 $key = "baz";
1334 $value = 1234;
1335 eval "\$$hashname{'$key'} = q|$value|";
1336 (defined($foobar{'baz'})) ? (print "Yup") : (print "Nope");
1337
1338 # perl4 prints: Yup
1339 # perl5 prints: Nope
1340
1341Changing
1342
1343 eval "\$$hashname{'$key'} = q|$value|";
c07a80fd 1344
1345to
1346
6dbacca0 1347 eval "\$\$hashname{'$key'} = q|$value|";
c07a80fd 1348
6dbacca0 1349causes the following result:
c07a80fd 1350
6dbacca0 1351 # perl4 prints: Nope
1352 # perl5 prints: Yup
c07a80fd 1353
6dbacca0 1354or, changing to
a0d0e21e 1355
6dbacca0 1356 eval "\$$hashname\{'$key'\} = q|$value|";
1357
1358causes the following result:
1359
1360 # perl4 prints: Yup
1361 # perl5 prints: Yup
1362 # and is compatible for both versions
1363
1364
1365=item * Interpolation
1366
1367perl4 programs which unconsciously rely on the bugs in earlier perl versions.
1368
1369 perl -e '$bar=q/not/; print "This is $foo{$bar} perl5"'
54310121 1370
6dbacca0 1371 # perl4 prints: This is not perl5
1372 # perl5 prints: This is perl5
1373
1374=item * Interpolation
1375
54310121 1376You also have to be careful about array references.
6dbacca0 1377
1378 print "$foo{"
1379
1380 perl 4 prints: {
1381 perl 5 prints: syntax error
1382
1383=item * Interpolation
1384
1385Similarly, watch out for:
1386
1387 $foo = "array";
1388 print "\$$foo{bar}\n";
54310121 1389
6dbacca0 1390 # perl4 prints: $array{bar}
1391 # perl5 prints: $
1392
1393Perl 5 is looking for C<$array{bar}> which doesn't exist, but perl 4 is
1394happy just to expand $foo to "array" by itself. Watch out for this
1395especially in C<eval>'s.
1396
1397=item * Interpolation
1398
1399C<qq()> string passed to C<eval>
1400
1401 eval qq(
1402 foreach \$y (keys %\$x\) {
1403 \$count++;
1404 }
1405 );
54310121 1406
6dbacca0 1407 # perl4 runs this ok
54310121 1408 # perl5 prints: Can't find string terminator ")"
a0d0e21e 1409
6dbacca0 1410=back
1411
1412=head2 DBM Traps
1413
1414General DBM traps.
1415
1416=over 5
1417
1418=item * DBM
1419
1420Existing dbm databases created under perl4 (or any other dbm/ndbm tool)
1421may cause the same script, run under perl5, to fail. The build of perl5
1422must have been linked with the same dbm/ndbm as the default for C<dbmopen()>
1423to function properly without C<tie>'ing to an extension dbm implementation.
1424
1425 dbmopen (%dbm, "file", undef);
1426 print "ok\n";
1427
1428 # perl4 prints: ok
1429 # perl5 prints: ok (IFF linked with -ldbm or -lndbm)
1430
1431
1432=item * DBM
1433
1434Existing dbm databases created under perl4 (or any other dbm/ndbm tool)
1435may cause the same script, run under perl5, to fail. The error generated
1436when exceeding the limit on the key/value size will cause perl5 to exit
1437immediately.
1438
1439 dbmopen(DB, "testdb",0600) || die "couldn't open db! $!";
1440 $DB{'trap'} = "x" x 1024; # value too large for most dbm/ndbm
1441 print "YUP\n";
1442
1443 # perl4 prints:
1444 dbm store returned -1, errno 28, key "trap" at - line 3.
1445 YUP
1446
1447 # perl5 prints:
1448 dbm store returned -1, errno 28, key "trap" at - line 3.
a0d0e21e 1449
1450=back
6dbacca0 1451
1452=head2 Unclassified Traps
1453
1454Everything else.
1455
84dc3c4d 1456=over 5
1457
6dbacca0 1458=item * Unclassified
1459
1460C<require>/C<do> trap using returned value
1461
1462If the file doit.pl has:
1463
1464 sub foo {
1465 $rc = do "./do.pl";
1466 return 8;
54310121 1467 }
6dbacca0 1468 print &foo, "\n";
1469
1470And the do.pl file has the following single line:
1471
1472 return 3;
1473
1474Running doit.pl gives the following:
1475
1476 # perl 4 prints: 3 (aborts the subroutine early)
54310121 1477 # perl 5 prints: 8
6dbacca0 1478
1479Same behavior if you replace C<do> with C<require>.
1480
1481=back
1482
54310121 1483As always, if any of these are ever officially declared as bugs,
6dbacca0 1484they'll be fixed and removed.
1485