perlfunc.pod grammar fixes
[p5sagit/p5-mst-13.2.git] / pod / perlsyn.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perlsyn - Perl syntax
4
5=head1 DESCRIPTION
6
6014d0cb 7A Perl program consists of a sequence of declarations and statements
8which run from the top to the bottom. Loops, subroutines and other
9control structures allow you to jump around within the code.
10
11Perl is a B<free-form> language, you can format and indent it however
12you like. Whitespace mostly serves to separate tokens, unlike
13languages like Python where it is an important part of the syntax.
14
15Many of Perl's syntactic elements are B<optional>. Rather than
110b9c83 16requiring you to put parentheses around every function call and
6014d0cb 17declare every variable, you can often leave such explicit elements off
18and Perl will figure out what you meant. This is known as B<Do What I
19Mean>, abbreviated B<DWIM>. It allows programmers to be B<lazy> and to
110b9c83 20code in a style with which they are comfortable.
6014d0cb 21
22Perl B<borrows syntax> and concepts from many languages: awk, sed, C,
23Bourne Shell, Smalltalk, Lisp and even English. Other
24languages have borrowed syntax from Perl, particularly its regular
25expression extensions. So if you have programmed in another language
26you will see familiar pieces in Perl. They often work the same, but
27see L<perltrap> for information about how they differ.
a0d0e21e 28
0b8d69e9 29=head2 Declarations
30
cf48932e 31The only things you need to declare in Perl are report formats and
32subroutines (and sometimes not even subroutines). A variable holds
33the undefined value (C<undef>) until it has been assigned a defined
34value, which is anything other than C<undef>. When used as a number,
35C<undef> is treated as C<0>; when used as a string, it is treated as
36the empty string, C<"">; and when used as a reference that isn't being
37assigned to, it is treated as an error. If you enable warnings,
38you'll be notified of an uninitialized value whenever you treat
39C<undef> as a string or a number. Well, usually. Boolean contexts,
40such as:
7bd1983c 41
42 my $a;
43 if ($a) {}
44
a6b1f6d8 45are exempt from warnings (because they care about truth rather than
46definedness). Operators such as C<++>, C<-->, C<+=>,
7bd1983c 47C<-=>, and C<.=>, that operate on undefined left values such as:
48
49 my $a;
50 $a++;
51
52are also always exempt from such warnings.
0b8d69e9 53
a0d0e21e 54A declaration can be put anywhere a statement can, but has no effect on
55the execution of the primary sequence of statements--declarations all
56take effect at compile time. Typically all the declarations are put at
54310121 57the beginning or the end of the script. However, if you're using
0b8d69e9 58lexically-scoped private variables created with C<my()>, you'll
59have to make sure
4633a7c4 60your format or subroutine definition is within the same block scope
5f05dabc 61as the my if you expect to be able to access those private variables.
a0d0e21e 62
4633a7c4 63Declaring a subroutine allows a subroutine name to be used as if it were a
64list operator from that point forward in the program. You can declare a
54310121 65subroutine without defining it by saying C<sub name>, thus:
a0d0e21e 66
54310121 67 sub myname;
a0d0e21e 68 $me = myname $0 or die "can't get myname";
69
1f950eb4 70Note that myname() functions as a list operator, not as a unary operator;
71so be careful to use C<or> instead of C<||> in this case. However, if
54310121 72you were to declare the subroutine as C<sub myname ($)>, then
02c45c47 73C<myname> would function as a unary operator, so either C<or> or
54310121 74C<||> would work.
a0d0e21e 75
4633a7c4 76Subroutines declarations can also be loaded up with the C<require> statement
77or both loaded and imported into your namespace with a C<use> statement.
78See L<perlmod> for details on this.
a0d0e21e 79
4633a7c4 80A statement sequence may contain declarations of lexically-scoped
81variables, but apart from declaring a variable name, the declaration acts
82like an ordinary statement, and is elaborated within the sequence of
83statements as if it were an ordinary statement. That means it actually
84has both compile-time and run-time effects.
a0d0e21e 85
6014d0cb 86=head2 Comments
87
88Text from a C<"#"> character until the end of the line is a comment,
89and is ignored. Exceptions include C<"#"> inside a string or regular
90expression.
91
6ec4bd10 92=head2 Simple Statements
a0d0e21e 93
94The only kind of simple statement is an expression evaluated for its
95side effects. Every simple statement must be terminated with a
96semicolon, unless it is the final statement in a block, in which case
f386e492 97the semicolon is optional. (A semicolon is still encouraged if the
98block takes up more than one line, because you may eventually add
cf48932e 99another line.) Note that there are some operators like C<eval {}> and
100C<do {}> that look like compound statements, but aren't (they're just
101TERMs in an expression), and thus need an explicit termination if used
102as the last item in a statement.
103
104=head2 Truth and Falsehood
105
f92061c1 106The number 0, the strings C<'0'> and C<''>, the empty list C<()>, and
107C<undef> are all false in a boolean context. All other values are true.
52ea55c9 108Negation of a true value by C<!> or C<not> returns a special false value.
109When evaluated as a string it is treated as C<''>, but as a number, it
110is treated as 0.
cf48932e 111
cf48932e 112=head2 Statement Modifiers
a0d0e21e 113
114Any simple statement may optionally be followed by a I<SINGLE> modifier,
115just before the terminating semicolon (or block ending). The possible
116modifiers are:
117
118 if EXPR
119 unless EXPR
120 while EXPR
121 until EXPR
cf48932e 122 foreach LIST
123
124The C<EXPR> following the modifier is referred to as the "condition".
125Its truth or falsehood determines how the modifier will behave.
126
127C<if> executes the statement once I<if> and only if the condition is
128true. C<unless> is the opposite, it executes the statement I<unless>
129the condition is true (i.e., if the condition is false).
130
131 print "Basset hounds got long ears" if length $ear >= 10;
132 go_outside() and play() unless $is_raining;
133
134The C<foreach> modifier is an iterator: it executes the statement once
135for each item in the LIST (with C<$_> aliased to each item in turn).
136
137 print "Hello $_!\n" foreach qw(world Dolly nurse);
138
139C<while> repeats the statement I<while> the condition is true.
140C<until> does the opposite, it repeats the statement I<until> the
141condition is true (or while the condition is false):
142
143 # Both of these count from 0 to 10.
144 print $i++ while $i <= 10;
145 print $j++ until $j > 10;
146
147The C<while> and C<until> modifiers have the usual "C<while> loop"
148semantics (conditional evaluated first), except when applied to a
149C<do>-BLOCK (or to the deprecated C<do>-SUBROUTINE statement), in
150which case the block executes once before the conditional is
151evaluated. This is so that you can write loops like:
a0d0e21e 152
153 do {
4633a7c4 154 $line = <STDIN>;
a0d0e21e 155 ...
4633a7c4 156 } until $line eq ".\n";
a0d0e21e 157
5a964f20 158See L<perlfunc/do>. Note also that the loop control statements described
159later will I<NOT> work in this construct, because modifiers don't take
160loop labels. Sorry. You can always put another block inside of it
161(for C<next>) or around it (for C<last>) to do that sort of thing.
f86cebdf 162For C<next>, just double the braces:
5a964f20 163
164 do {{
165 next if $x == $y;
166 # do something here
167 }} until $x++ > $z;
168
f86cebdf 169For C<last>, you have to be more elaborate:
5a964f20 170
171 LOOP: {
172 do {
173 last if $x = $y**2;
174 # do something here
175 } while $x++ <= $z;
176 }
a0d0e21e 177
457b36cb 178B<NOTE:> The behaviour of a C<my> statement modified with a statement
179modifier conditional or loop construct (e.g. C<my $x if ...>) is
180B<undefined>. The value of the C<my> variable may be C<undef>, any
181previously assigned value, or possibly anything else. Don't rely on
182it. Future versions of perl might do something different from the
183version of perl you try it out on. Here be dragons.
184
6ec4bd10 185=head2 Compound Statements
a0d0e21e 186
187In Perl, a sequence of statements that defines a scope is called a block.
188Sometimes a block is delimited by the file containing it (in the case
189of a required file, or the program as a whole), and sometimes a block
190is delimited by the extent of a string (in the case of an eval).
191
192But generally, a block is delimited by curly brackets, also known as braces.
193We will call this syntactic construct a BLOCK.
194
195The following compound statements may be used to control flow:
196
197 if (EXPR) BLOCK
198 if (EXPR) BLOCK else BLOCK
199 if (EXPR) BLOCK elsif (EXPR) BLOCK ... else BLOCK
200 LABEL while (EXPR) BLOCK
201 LABEL while (EXPR) BLOCK continue BLOCK
5ec6d87f 202 LABEL until (EXPR) BLOCK
203 LABEL until (EXPR) BLOCK continue BLOCK
a0d0e21e 204 LABEL for (EXPR; EXPR; EXPR) BLOCK
748a9306 205 LABEL foreach VAR (LIST) BLOCK
b303ae78 206 LABEL foreach VAR (LIST) BLOCK continue BLOCK
a0d0e21e 207 LABEL BLOCK continue BLOCK
208
209Note that, unlike C and Pascal, these are defined in terms of BLOCKs,
210not statements. This means that the curly brackets are I<required>--no
211dangling statements allowed. If you want to write conditionals without
212curly brackets there are several other ways to do it. The following
213all do the same thing:
214
215 if (!open(FOO)) { die "Can't open $FOO: $!"; }
216 die "Can't open $FOO: $!" unless open(FOO);
217 open(FOO) or die "Can't open $FOO: $!"; # FOO or bust!
218 open(FOO) ? 'hi mom' : die "Can't open $FOO: $!";
219 # a bit exotic, that last one
220
5f05dabc 221The C<if> statement is straightforward. Because BLOCKs are always
a0d0e21e 222bounded by curly brackets, there is never any ambiguity about which
223C<if> an C<else> goes with. If you use C<unless> in place of C<if>,
224the sense of the test is reversed.
225
226The C<while> statement executes the block as long as the expression is
0eb389d5 227true (does not evaluate to the null string C<""> or C<0> or C<"0">).
1d5653dd 228The C<until> statement executes the block as long as the expression is
229false.
b78218b7 230The LABEL is optional, and if present, consists of an identifier followed
231by a colon. The LABEL identifies the loop for the loop control
232statements C<next>, C<last>, and C<redo>.
233If the LABEL is omitted, the loop control statement
4633a7c4 234refers to the innermost enclosing loop. This may include dynamically
235looking back your call-stack at run time to find the LABEL. Such
9f1b1f2d 236desperate behavior triggers a warning if you use the C<use warnings>
a2293a43 237pragma or the B<-w> flag.
4633a7c4 238
239If there is a C<continue> BLOCK, it is always executed just before the
6ec4bd10 240conditional is about to be evaluated again. Thus it can be used to
241increment a loop variable, even when the loop has been continued via
242the C<next> statement.
4633a7c4 243
244=head2 Loop Control
245
6ec4bd10 246The C<next> command starts the next iteration of the loop:
4633a7c4 247
248 LINE: while (<STDIN>) {
249 next LINE if /^#/; # discard comments
250 ...
251 }
252
6ec4bd10 253The C<last> command immediately exits the loop in question. The
4633a7c4 254C<continue> block, if any, is not executed:
255
256 LINE: while (<STDIN>) {
257 last LINE if /^$/; # exit when done with header
258 ...
259 }
260
261The C<redo> command restarts the loop block without evaluating the
262conditional again. The C<continue> block, if any, is I<not> executed.
263This command is normally used by programs that want to lie to themselves
264about what was just input.
265
266For example, when processing a file like F</etc/termcap>.
267If your input lines might end in backslashes to indicate continuation, you
268want to skip ahead and get the next record.
269
270 while (<>) {
271 chomp;
54310121 272 if (s/\\$//) {
273 $_ .= <>;
4633a7c4 274 redo unless eof();
275 }
276 # now process $_
54310121 277 }
4633a7c4 278
279which is Perl short-hand for the more explicitly written version:
280
54310121 281 LINE: while (defined($line = <ARGV>)) {
4633a7c4 282 chomp($line);
54310121 283 if ($line =~ s/\\$//) {
284 $line .= <ARGV>;
4633a7c4 285 redo LINE unless eof(); # not eof(ARGV)!
286 }
287 # now process $line
54310121 288 }
4633a7c4 289
36e7a065 290Note that if there were a C<continue> block on the above code, it would
291get executed only on lines discarded by the regex (since redo skips the
292continue block). A continue block is often used to reset line counters
293or C<?pat?> one-time matches:
4633a7c4 294
5a964f20 295 # inspired by :1,$g/fred/s//WILMA/
296 while (<>) {
297 ?(fred)? && s//WILMA $1 WILMA/;
298 ?(barney)? && s//BETTY $1 BETTY/;
299 ?(homer)? && s//MARGE $1 MARGE/;
300 } continue {
301 print "$ARGV $.: $_";
302 close ARGV if eof(); # reset $.
303 reset if eof(); # reset ?pat?
4633a7c4 304 }
305
a0d0e21e 306If the word C<while> is replaced by the word C<until>, the sense of the
307test is reversed, but the conditional is still tested before the first
308iteration.
309
5a964f20 310The loop control statements don't work in an C<if> or C<unless>, since
311they aren't loops. You can double the braces to make them such, though.
312
313 if (/pattern/) {{
7bd1983c 314 last if /fred/;
315 next if /barney/; # same effect as "last", but doesn't document as well
316 # do something here
5a964f20 317 }}
318
7bd1983c 319This is caused by the fact that a block by itself acts as a loop that
320executes once, see L<"Basic BLOCKs and Switch Statements">.
321
5b23ba8b 322The form C<while/if BLOCK BLOCK>, available in Perl 4, is no longer
323available. Replace any occurrence of C<if BLOCK> by C<if (do BLOCK)>.
4633a7c4 324
cb1a09d0 325=head2 For Loops
a0d0e21e 326
b78df5de 327Perl's C-style C<for> loop works like the corresponding C<while> loop;
cb1a09d0 328that means that this:
a0d0e21e 329
330 for ($i = 1; $i < 10; $i++) {
331 ...
332 }
333
cb1a09d0 334is the same as this:
a0d0e21e 335
336 $i = 1;
337 while ($i < 10) {
338 ...
339 } continue {
340 $i++;
341 }
342
b78df5de 343There is one minor difference: if variables are declared with C<my>
344in the initialization section of the C<for>, the lexical scope of
345those variables is exactly the C<for> loop (the body of the loop
346and the control sections).
55497cff 347
cb1a09d0 348Besides the normal array index looping, C<for> can lend itself
349to many other interesting applications. Here's one that avoids the
54310121 350problem you get into if you explicitly test for end-of-file on
351an interactive file descriptor causing your program to appear to
cb1a09d0 352hang.
353
354 $on_a_tty = -t STDIN && -t STDOUT;
355 sub prompt { print "yes? " if $on_a_tty }
356 for ( prompt(); <STDIN>; prompt() ) {
357 # do something
54310121 358 }
cb1a09d0 359
00cb5da1 360Using C<readline> (or the operator form, C<< <EXPR> >>) as the
361conditional of a C<for> loop is shorthand for the following. This
362behaviour is the same as a C<while> loop conditional.
363
364 for ( prompt(); defined( $_ = <STDIN> ); prompt() ) {
365 # do something
366 }
367
cb1a09d0 368=head2 Foreach Loops
369
4633a7c4 370The C<foreach> loop iterates over a normal list value and sets the
55497cff 371variable VAR to be each element of the list in turn. If the variable
372is preceded with the keyword C<my>, then it is lexically scoped, and
373is therefore visible only within the loop. Otherwise, the variable is
374implicitly local to the loop and regains its former value upon exiting
375the loop. If the variable was previously declared with C<my>, it uses
376that variable instead of the global one, but it's still localized to
5c502d37 377the loop. This implicit localisation occurs I<only> in a C<foreach>
378loop.
4633a7c4 379
380The C<foreach> keyword is actually a synonym for the C<for> keyword, so
5a964f20 381you can use C<foreach> for readability or C<for> for brevity. (Or because
382the Bourne shell is more familiar to you than I<csh>, so writing C<for>
f86cebdf 383comes more naturally.) If VAR is omitted, C<$_> is set to each value.
c5674021 384
385If any element of LIST is an lvalue, you can modify it by modifying
386VAR inside the loop. Conversely, if any element of LIST is NOT an
387lvalue, any attempt to modify that element will fail. In other words,
388the C<foreach> loop index variable is an implicit alias for each item
389in the list that you're looping over.
302617ea 390
391If any part of LIST is an array, C<foreach> will get very confused if
392you add or remove elements within the loop body, for example with
393C<splice>. So don't do that.
394
395C<foreach> probably won't do what you expect if VAR is a tied or other
396special variable. Don't do that either.
4633a7c4 397
748a9306 398Examples:
a0d0e21e 399
4633a7c4 400 for (@ary) { s/foo/bar/ }
a0d0e21e 401
96f2dc66 402 for my $elem (@elements) {
a0d0e21e 403 $elem *= 2;
404 }
405
4633a7c4 406 for $count (10,9,8,7,6,5,4,3,2,1,'BOOM') {
407 print $count, "\n"; sleep(1);
a0d0e21e 408 }
409
410 for (1..15) { print "Merry Christmas\n"; }
411
4633a7c4 412 foreach $item (split(/:[\\\n:]*/, $ENV{TERMCAP})) {
a0d0e21e 413 print "Item: $item\n";
414 }
415
4633a7c4 416Here's how a C programmer might code up a particular algorithm in Perl:
417
55497cff 418 for (my $i = 0; $i < @ary1; $i++) {
419 for (my $j = 0; $j < @ary2; $j++) {
4633a7c4 420 if ($ary1[$i] > $ary2[$j]) {
421 last; # can't go to outer :-(
422 }
423 $ary1[$i] += $ary2[$j];
424 }
cb1a09d0 425 # this is where that last takes me
4633a7c4 426 }
427
184e9718 428Whereas here's how a Perl programmer more comfortable with the idiom might
cb1a09d0 429do it:
4633a7c4 430
96f2dc66 431 OUTER: for my $wid (@ary1) {
432 INNER: for my $jet (@ary2) {
cb1a09d0 433 next OUTER if $wid > $jet;
434 $wid += $jet;
54310121 435 }
436 }
4633a7c4 437
cb1a09d0 438See how much easier this is? It's cleaner, safer, and faster. It's
439cleaner because it's less noisy. It's safer because if code gets added
c07a80fd 440between the inner and outer loops later on, the new code won't be
5f05dabc 441accidentally executed. The C<next> explicitly iterates the other loop
c07a80fd 442rather than merely terminating the inner one. And it's faster because
443Perl executes a C<foreach> statement more rapidly than it would the
444equivalent C<for> loop.
4633a7c4 445
446=head2 Basic BLOCKs and Switch Statements
447
55497cff 448A BLOCK by itself (labeled or not) is semantically equivalent to a
449loop that executes once. Thus you can use any of the loop control
450statements in it to leave or restart the block. (Note that this is
451I<NOT> true in C<eval{}>, C<sub{}>, or contrary to popular belief
452C<do{}> blocks, which do I<NOT> count as loops.) The C<continue>
453block is optional.
4633a7c4 454
455The BLOCK construct is particularly nice for doing case
a0d0e21e 456structures.
457
458 SWITCH: {
459 if (/^abc/) { $abc = 1; last SWITCH; }
460 if (/^def/) { $def = 1; last SWITCH; }
461 if (/^xyz/) { $xyz = 1; last SWITCH; }
462 $nothing = 1;
463 }
464
f86cebdf 465There is no official C<switch> statement in Perl, because there are
83df6a1d 466already several ways to write the equivalent.
467
468However, starting from Perl 5.8 to get switch and case one can use
469the Switch extension and say:
470
471 use Switch;
472
473after which one has switch and case. It is not as fast as it could be
474because it's not really part of the language (it's done using source
475filters) but it is available, and it's very flexible.
476
477In addition to the above BLOCK construct, you could write
a0d0e21e 478
479 SWITCH: {
480 $abc = 1, last SWITCH if /^abc/;
481 $def = 1, last SWITCH if /^def/;
482 $xyz = 1, last SWITCH if /^xyz/;
483 $nothing = 1;
484 }
485
cb1a09d0 486(That's actually not as strange as it looks once you realize that you can
6ec4bd10 487use loop control "operators" within an expression. That's just the binary
488comma operator in scalar context. See L<perlop/"Comma Operator">.)
a0d0e21e 489
490or
491
492 SWITCH: {
493 /^abc/ && do { $abc = 1; last SWITCH; };
494 /^def/ && do { $def = 1; last SWITCH; };
495 /^xyz/ && do { $xyz = 1; last SWITCH; };
496 $nothing = 1;
497 }
498
f86cebdf 499or formatted so it stands out more as a "proper" C<switch> statement:
a0d0e21e 500
501 SWITCH: {
54310121 502 /^abc/ && do {
503 $abc = 1;
504 last SWITCH;
a0d0e21e 505 };
506
54310121 507 /^def/ && do {
508 $def = 1;
509 last SWITCH;
a0d0e21e 510 };
511
54310121 512 /^xyz/ && do {
513 $xyz = 1;
514 last SWITCH;
a0d0e21e 515 };
516 $nothing = 1;
517 }
518
519or
520
521 SWITCH: {
522 /^abc/ and $abc = 1, last SWITCH;
523 /^def/ and $def = 1, last SWITCH;
524 /^xyz/ and $xyz = 1, last SWITCH;
525 $nothing = 1;
526 }
527
528or even, horrors,
529
530 if (/^abc/)
531 { $abc = 1 }
532 elsif (/^def/)
533 { $def = 1 }
534 elsif (/^xyz/)
535 { $xyz = 1 }
536 else
537 { $nothing = 1 }
538
f86cebdf 539A common idiom for a C<switch> statement is to use C<foreach>'s aliasing to make
540a temporary assignment to C<$_> for convenient matching:
4633a7c4 541
542 SWITCH: for ($where) {
543 /In Card Names/ && do { push @flags, '-e'; last; };
544 /Anywhere/ && do { push @flags, '-h'; last; };
545 /In Rulings/ && do { last; };
546 die "unknown value for form variable where: `$where'";
54310121 547 }
4633a7c4 548
cb1a09d0 549Another interesting approach to a switch statement is arrange
550for a C<do> block to return the proper value:
551
552 $amode = do {
5a964f20 553 if ($flag & O_RDONLY) { "r" } # XXX: isn't this 0?
54310121 554 elsif ($flag & O_WRONLY) { ($flag & O_APPEND) ? "a" : "w" }
cb1a09d0 555 elsif ($flag & O_RDWR) {
556 if ($flag & O_CREAT) { "w+" }
c07a80fd 557 else { ($flag & O_APPEND) ? "a+" : "r+" }
cb1a09d0 558 }
559 };
560
5a964f20 561Or
562
563 print do {
564 ($flags & O_WRONLY) ? "write-only" :
565 ($flags & O_RDWR) ? "read-write" :
566 "read-only";
567 };
568
a031eab2 569Or if you are certain that all the C<&&> clauses are true, you can use
5a964f20 570something like this, which "switches" on the value of the
a2293a43 571C<HTTP_USER_AGENT> environment variable.
5a964f20 572
573 #!/usr/bin/perl
574 # pick out jargon file page based on browser
575 $dir = 'http://www.wins.uva.nl/~mes/jargon';
576 for ($ENV{HTTP_USER_AGENT}) {
577 $page = /Mac/ && 'm/Macintrash.html'
578 || /Win(dows )?NT/ && 'e/evilandrude.html'
579 || /Win|MSIE|WebTV/ && 'm/MicroslothWindows.html'
580 || /Linux/ && 'l/Linux.html'
581 || /HP-UX/ && 'h/HP-SUX.html'
582 || /SunOS/ && 's/ScumOS.html'
583 || 'a/AppendixB.html';
584 }
585 print "Location: $dir/$page\015\012\015\012";
586
587That kind of switch statement only works when you know the C<&&> clauses
588will be true. If you don't, the previous C<?:> example should be used.
589
19799a22 590You might also consider writing a hash of subroutine references
591instead of synthesizing a C<switch> statement.
5a964f20 592
4633a7c4 593=head2 Goto
594
19799a22 595Although not for the faint of heart, Perl does support a C<goto>
596statement. There are three forms: C<goto>-LABEL, C<goto>-EXPR, and
597C<goto>-&NAME. A loop's LABEL is not actually a valid target for
598a C<goto>; it's just the name of the loop.
4633a7c4 599
f86cebdf 600The C<goto>-LABEL form finds the statement labeled with LABEL and resumes
4633a7c4 601execution there. It may not be used to go into any construct that
f86cebdf 602requires initialization, such as a subroutine or a C<foreach> loop. It
4633a7c4 603also can't be used to go into a construct that is optimized away. It
604can be used to go almost anywhere else within the dynamic scope,
605including out of subroutines, but it's usually better to use some other
f86cebdf 606construct such as C<last> or C<die>. The author of Perl has never felt the
607need to use this form of C<goto> (in Perl, that is--C is another matter).
4633a7c4 608
f86cebdf 609The C<goto>-EXPR form expects a label name, whose scope will be resolved
610dynamically. This allows for computed C<goto>s per FORTRAN, but isn't
4633a7c4 611necessarily recommended if you're optimizing for maintainability:
612
96f2dc66 613 goto(("FOO", "BAR", "GLARCH")[$i]);
4633a7c4 614
f86cebdf 615The C<goto>-&NAME form is highly magical, and substitutes a call to the
4633a7c4 616named subroutine for the currently running subroutine. This is used by
f86cebdf 617C<AUTOLOAD()> subroutines that wish to load another subroutine and then
4633a7c4 618pretend that the other subroutine had been called in the first place
f86cebdf 619(except that any modifications to C<@_> in the current subroutine are
620propagated to the other subroutine.) After the C<goto>, not even C<caller()>
4633a7c4 621will be able to tell that this routine was called first.
622
c07a80fd 623In almost all cases like this, it's usually a far, far better idea to use the
624structured control flow mechanisms of C<next>, C<last>, or C<redo> instead of
4633a7c4 625resorting to a C<goto>. For certain applications, the catch and throw pair of
626C<eval{}> and die() for exception processing can also be a prudent approach.
cb1a09d0 627
628=head2 PODs: Embedded Documentation
629
630Perl has a mechanism for intermixing documentation with source code.
c07a80fd 631While it's expecting the beginning of a new statement, if the compiler
cb1a09d0 632encounters a line that begins with an equal sign and a word, like this
633
634 =head1 Here There Be Pods!
635
636Then that text and all remaining text up through and including a line
637beginning with C<=cut> will be ignored. The format of the intervening
54310121 638text is described in L<perlpod>.
cb1a09d0 639
640This allows you to intermix your source code
641and your documentation text freely, as in
642
643 =item snazzle($)
644
54310121 645 The snazzle() function will behave in the most spectacular
cb1a09d0 646 form that you can possibly imagine, not even excepting
647 cybernetic pyrotechnics.
648
649 =cut back to the compiler, nuff of this pod stuff!
650
651 sub snazzle($) {
652 my $thingie = shift;
653 .........
54310121 654 }
cb1a09d0 655
54310121 656Note that pod translators should look at only paragraphs beginning
184e9718 657with a pod directive (it makes parsing easier), whereas the compiler
54310121 658actually knows to look for pod escapes even in the middle of a
cb1a09d0 659paragraph. This means that the following secret stuff will be
660ignored by both the compiler and the translators.
661
662 $a=3;
663 =secret stuff
664 warn "Neither POD nor CODE!?"
665 =cut back
666 print "got $a\n";
667
f86cebdf 668You probably shouldn't rely upon the C<warn()> being podded out forever.
cb1a09d0 669Not all pod translators are well-behaved in this regard, and perhaps
670the compiler will become pickier.
774d564b 671
672One may also use pod directives to quickly comment out a section
673of code.
674
675=head2 Plain Old Comments (Not!)
676
6ec4bd10 677Perl can process line directives, much like the C preprocessor. Using
5a964f20 678this, one can control Perl's idea of filenames and line numbers in
774d564b 679error or warning messages (especially for strings that are processed
f86cebdf 680with C<eval()>). The syntax for this mechanism is the same as for most
774d564b 681C preprocessors: it matches the regular expression
6ec4bd10 682
683 # example: '# line 42 "new_filename.plx"'
82d4537c 684 /^\# \s*
6ec4bd10 685 line \s+ (\d+) \s*
7b6e93a8 686 (?:\s("?)([^"]+)\2)? \s*
6ec4bd10 687 $/x
688
7b6e93a8 689with C<$1> being the line number for the next line, and C<$3> being
690the optional filename (specified with or without quotes).
774d564b 691
003183f2 692There is a fairly obvious gotcha included with the line directive:
693Debuggers and profilers will only show the last source line to appear
694at a particular line number in a given file. Care should be taken not
695to cause line number collisions in code you'd like to debug later.
696
774d564b 697Here are some examples that you should be able to type into your command
698shell:
699
700 % perl
701 # line 200 "bzzzt"
702 # the `#' on the previous line must be the first char on line
703 die 'foo';
704 __END__
705 foo at bzzzt line 201.
54310121 706
774d564b 707 % perl
708 # line 200 "bzzzt"
709 eval qq[\n#line 2001 ""\ndie 'foo']; print $@;
710 __END__
711 foo at - line 2001.
54310121 712
774d564b 713 % perl
714 eval qq[\n#line 200 "foo bar"\ndie 'foo']; print $@;
715 __END__
716 foo at foo bar line 200.
54310121 717
774d564b 718 % perl
719 # line 345 "goop"
720 eval "\n#line " . __LINE__ . ' "' . __FILE__ ."\"\ndie 'foo'";
721 print $@;
722 __END__
723 foo at goop line 345.
724
725=cut