further speeding up of is_utf8_string()
[p5sagit/p5-mst-13.2.git] / pod / perlsyn.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perlsyn - Perl syntax
4
5=head1 DESCRIPTION
6
6014d0cb 7A Perl program consists of a sequence of declarations and statements
8which run from the top to the bottom. Loops, subroutines and other
9control structures allow you to jump around within the code.
10
11Perl is a B<free-form> language, you can format and indent it however
12you like. Whitespace mostly serves to separate tokens, unlike
13languages like Python where it is an important part of the syntax.
14
15Many of Perl's syntactic elements are B<optional>. Rather than
110b9c83 16requiring you to put parentheses around every function call and
6014d0cb 17declare every variable, you can often leave such explicit elements off
18and Perl will figure out what you meant. This is known as B<Do What I
19Mean>, abbreviated B<DWIM>. It allows programmers to be B<lazy> and to
110b9c83 20code in a style with which they are comfortable.
6014d0cb 21
22Perl B<borrows syntax> and concepts from many languages: awk, sed, C,
23Bourne Shell, Smalltalk, Lisp and even English. Other
24languages have borrowed syntax from Perl, particularly its regular
25expression extensions. So if you have programmed in another language
26you will see familiar pieces in Perl. They often work the same, but
27see L<perltrap> for information about how they differ.
a0d0e21e 28
0b8d69e9 29=head2 Declarations
30
cf48932e 31The only things you need to declare in Perl are report formats and
32subroutines (and sometimes not even subroutines). A variable holds
33the undefined value (C<undef>) until it has been assigned a defined
34value, which is anything other than C<undef>. When used as a number,
35C<undef> is treated as C<0>; when used as a string, it is treated as
36the empty string, C<"">; and when used as a reference that isn't being
37assigned to, it is treated as an error. If you enable warnings,
38you'll be notified of an uninitialized value whenever you treat
39C<undef> as a string or a number. Well, usually. Boolean contexts,
40such as:
7bd1983c 41
42 my $a;
43 if ($a) {}
44
a6b1f6d8 45are exempt from warnings (because they care about truth rather than
46definedness). Operators such as C<++>, C<-->, C<+=>,
7bd1983c 47C<-=>, and C<.=>, that operate on undefined left values such as:
48
49 my $a;
50 $a++;
51
52are also always exempt from such warnings.
0b8d69e9 53
a0d0e21e 54A declaration can be put anywhere a statement can, but has no effect on
55the execution of the primary sequence of statements--declarations all
56take effect at compile time. Typically all the declarations are put at
54310121 57the beginning or the end of the script. However, if you're using
0b8d69e9 58lexically-scoped private variables created with C<my()>, you'll
59have to make sure
4633a7c4 60your format or subroutine definition is within the same block scope
5f05dabc 61as the my if you expect to be able to access those private variables.
a0d0e21e 62
4633a7c4 63Declaring a subroutine allows a subroutine name to be used as if it were a
64list operator from that point forward in the program. You can declare a
54310121 65subroutine without defining it by saying C<sub name>, thus:
a0d0e21e 66
54310121 67 sub myname;
a0d0e21e 68 $me = myname $0 or die "can't get myname";
69
1f950eb4 70Note that myname() functions as a list operator, not as a unary operator;
71so be careful to use C<or> instead of C<||> in this case. However, if
54310121 72you were to declare the subroutine as C<sub myname ($)>, then
02c45c47 73C<myname> would function as a unary operator, so either C<or> or
54310121 74C<||> would work.
a0d0e21e 75
4633a7c4 76Subroutines declarations can also be loaded up with the C<require> statement
77or both loaded and imported into your namespace with a C<use> statement.
78See L<perlmod> for details on this.
a0d0e21e 79
4633a7c4 80A statement sequence may contain declarations of lexically-scoped
81variables, but apart from declaring a variable name, the declaration acts
82like an ordinary statement, and is elaborated within the sequence of
83statements as if it were an ordinary statement. That means it actually
84has both compile-time and run-time effects.
a0d0e21e 85
6014d0cb 86=head2 Comments
87
88Text from a C<"#"> character until the end of the line is a comment,
89and is ignored. Exceptions include C<"#"> inside a string or regular
90expression.
91
6ec4bd10 92=head2 Simple Statements
a0d0e21e 93
94The only kind of simple statement is an expression evaluated for its
95side effects. Every simple statement must be terminated with a
96semicolon, unless it is the final statement in a block, in which case
f386e492 97the semicolon is optional. (A semicolon is still encouraged if the
98block takes up more than one line, because you may eventually add
cf48932e 99another line.) Note that there are some operators like C<eval {}> and
100C<do {}> that look like compound statements, but aren't (they're just
101TERMs in an expression), and thus need an explicit termination if used
102as the last item in a statement.
103
104=head2 Truth and Falsehood
105
f92061c1 106The number 0, the strings C<'0'> and C<''>, the empty list C<()>, and
107C<undef> are all false in a boolean context. All other values are true.
52ea55c9 108Negation of a true value by C<!> or C<not> returns a special false value.
109When evaluated as a string it is treated as C<''>, but as a number, it
110is treated as 0.
cf48932e 111
cf48932e 112=head2 Statement Modifiers
a0d0e21e 113
114Any simple statement may optionally be followed by a I<SINGLE> modifier,
115just before the terminating semicolon (or block ending). The possible
116modifiers are:
117
118 if EXPR
119 unless EXPR
120 while EXPR
121 until EXPR
cf48932e 122 foreach LIST
123
124The C<EXPR> following the modifier is referred to as the "condition".
125Its truth or falsehood determines how the modifier will behave.
126
127C<if> executes the statement once I<if> and only if the condition is
128true. C<unless> is the opposite, it executes the statement I<unless>
129the condition is true (i.e., if the condition is false).
130
131 print "Basset hounds got long ears" if length $ear >= 10;
132 go_outside() and play() unless $is_raining;
133
134The C<foreach> modifier is an iterator: it executes the statement once
135for each item in the LIST (with C<$_> aliased to each item in turn).
136
137 print "Hello $_!\n" foreach qw(world Dolly nurse);
138
139C<while> repeats the statement I<while> the condition is true.
140C<until> does the opposite, it repeats the statement I<until> the
141condition is true (or while the condition is false):
142
143 # Both of these count from 0 to 10.
144 print $i++ while $i <= 10;
145 print $j++ until $j > 10;
146
147The C<while> and C<until> modifiers have the usual "C<while> loop"
148semantics (conditional evaluated first), except when applied to a
149C<do>-BLOCK (or to the deprecated C<do>-SUBROUTINE statement), in
150which case the block executes once before the conditional is
151evaluated. This is so that you can write loops like:
a0d0e21e 152
153 do {
4633a7c4 154 $line = <STDIN>;
a0d0e21e 155 ...
4633a7c4 156 } until $line eq ".\n";
a0d0e21e 157
5a964f20 158See L<perlfunc/do>. Note also that the loop control statements described
159later will I<NOT> work in this construct, because modifiers don't take
160loop labels. Sorry. You can always put another block inside of it
161(for C<next>) or around it (for C<last>) to do that sort of thing.
f86cebdf 162For C<next>, just double the braces:
5a964f20 163
164 do {{
165 next if $x == $y;
166 # do something here
167 }} until $x++ > $z;
168
f86cebdf 169For C<last>, you have to be more elaborate:
5a964f20 170
171 LOOP: {
172 do {
173 last if $x = $y**2;
174 # do something here
175 } while $x++ <= $z;
176 }
a0d0e21e 177
457b36cb 178B<NOTE:> The behaviour of a C<my> statement modified with a statement
179modifier conditional or loop construct (e.g. C<my $x if ...>) is
180B<undefined>. The value of the C<my> variable may be C<undef>, any
181previously assigned value, or possibly anything else. Don't rely on
182it. Future versions of perl might do something different from the
183version of perl you try it out on. Here be dragons.
184
6ec4bd10 185=head2 Compound Statements
a0d0e21e 186
187In Perl, a sequence of statements that defines a scope is called a block.
188Sometimes a block is delimited by the file containing it (in the case
189of a required file, or the program as a whole), and sometimes a block
190is delimited by the extent of a string (in the case of an eval).
191
192But generally, a block is delimited by curly brackets, also known as braces.
193We will call this syntactic construct a BLOCK.
194
195The following compound statements may be used to control flow:
196
197 if (EXPR) BLOCK
198 if (EXPR) BLOCK else BLOCK
199 if (EXPR) BLOCK elsif (EXPR) BLOCK ... else BLOCK
200 LABEL while (EXPR) BLOCK
201 LABEL while (EXPR) BLOCK continue BLOCK
202 LABEL for (EXPR; EXPR; EXPR) BLOCK
748a9306 203 LABEL foreach VAR (LIST) BLOCK
b303ae78 204 LABEL foreach VAR (LIST) BLOCK continue BLOCK
a0d0e21e 205 LABEL BLOCK continue BLOCK
206
207Note that, unlike C and Pascal, these are defined in terms of BLOCKs,
208not statements. This means that the curly brackets are I<required>--no
209dangling statements allowed. If you want to write conditionals without
210curly brackets there are several other ways to do it. The following
211all do the same thing:
212
213 if (!open(FOO)) { die "Can't open $FOO: $!"; }
214 die "Can't open $FOO: $!" unless open(FOO);
215 open(FOO) or die "Can't open $FOO: $!"; # FOO or bust!
216 open(FOO) ? 'hi mom' : die "Can't open $FOO: $!";
217 # a bit exotic, that last one
218
5f05dabc 219The C<if> statement is straightforward. Because BLOCKs are always
a0d0e21e 220bounded by curly brackets, there is never any ambiguity about which
221C<if> an C<else> goes with. If you use C<unless> in place of C<if>,
222the sense of the test is reversed.
223
224The C<while> statement executes the block as long as the expression is
0eb389d5 225true (does not evaluate to the null string C<""> or C<0> or C<"0">).
b78218b7 226The LABEL is optional, and if present, consists of an identifier followed
227by a colon. The LABEL identifies the loop for the loop control
228statements C<next>, C<last>, and C<redo>.
229If the LABEL is omitted, the loop control statement
4633a7c4 230refers to the innermost enclosing loop. This may include dynamically
231looking back your call-stack at run time to find the LABEL. Such
9f1b1f2d 232desperate behavior triggers a warning if you use the C<use warnings>
a2293a43 233pragma or the B<-w> flag.
4633a7c4 234
235If there is a C<continue> BLOCK, it is always executed just before the
6ec4bd10 236conditional is about to be evaluated again. Thus it can be used to
237increment a loop variable, even when the loop has been continued via
238the C<next> statement.
4633a7c4 239
240=head2 Loop Control
241
6ec4bd10 242The C<next> command starts the next iteration of the loop:
4633a7c4 243
244 LINE: while (<STDIN>) {
245 next LINE if /^#/; # discard comments
246 ...
247 }
248
6ec4bd10 249The C<last> command immediately exits the loop in question. The
4633a7c4 250C<continue> block, if any, is not executed:
251
252 LINE: while (<STDIN>) {
253 last LINE if /^$/; # exit when done with header
254 ...
255 }
256
257The C<redo> command restarts the loop block without evaluating the
258conditional again. The C<continue> block, if any, is I<not> executed.
259This command is normally used by programs that want to lie to themselves
260about what was just input.
261
262For example, when processing a file like F</etc/termcap>.
263If your input lines might end in backslashes to indicate continuation, you
264want to skip ahead and get the next record.
265
266 while (<>) {
267 chomp;
54310121 268 if (s/\\$//) {
269 $_ .= <>;
4633a7c4 270 redo unless eof();
271 }
272 # now process $_
54310121 273 }
4633a7c4 274
275which is Perl short-hand for the more explicitly written version:
276
54310121 277 LINE: while (defined($line = <ARGV>)) {
4633a7c4 278 chomp($line);
54310121 279 if ($line =~ s/\\$//) {
280 $line .= <ARGV>;
4633a7c4 281 redo LINE unless eof(); # not eof(ARGV)!
282 }
283 # now process $line
54310121 284 }
4633a7c4 285
36e7a065 286Note that if there were a C<continue> block on the above code, it would
287get executed only on lines discarded by the regex (since redo skips the
288continue block). A continue block is often used to reset line counters
289or C<?pat?> one-time matches:
4633a7c4 290
5a964f20 291 # inspired by :1,$g/fred/s//WILMA/
292 while (<>) {
293 ?(fred)? && s//WILMA $1 WILMA/;
294 ?(barney)? && s//BETTY $1 BETTY/;
295 ?(homer)? && s//MARGE $1 MARGE/;
296 } continue {
297 print "$ARGV $.: $_";
298 close ARGV if eof(); # reset $.
299 reset if eof(); # reset ?pat?
4633a7c4 300 }
301
a0d0e21e 302If the word C<while> is replaced by the word C<until>, the sense of the
303test is reversed, but the conditional is still tested before the first
304iteration.
305
5a964f20 306The loop control statements don't work in an C<if> or C<unless>, since
307they aren't loops. You can double the braces to make them such, though.
308
309 if (/pattern/) {{
7bd1983c 310 last if /fred/;
311 next if /barney/; # same effect as "last", but doesn't document as well
312 # do something here
5a964f20 313 }}
314
7bd1983c 315This is caused by the fact that a block by itself acts as a loop that
316executes once, see L<"Basic BLOCKs and Switch Statements">.
317
5b23ba8b 318The form C<while/if BLOCK BLOCK>, available in Perl 4, is no longer
319available. Replace any occurrence of C<if BLOCK> by C<if (do BLOCK)>.
4633a7c4 320
cb1a09d0 321=head2 For Loops
a0d0e21e 322
b78df5de 323Perl's C-style C<for> loop works like the corresponding C<while> loop;
cb1a09d0 324that means that this:
a0d0e21e 325
326 for ($i = 1; $i < 10; $i++) {
327 ...
328 }
329
cb1a09d0 330is the same as this:
a0d0e21e 331
332 $i = 1;
333 while ($i < 10) {
334 ...
335 } continue {
336 $i++;
337 }
338
b78df5de 339There is one minor difference: if variables are declared with C<my>
340in the initialization section of the C<for>, the lexical scope of
341those variables is exactly the C<for> loop (the body of the loop
342and the control sections).
55497cff 343
cb1a09d0 344Besides the normal array index looping, C<for> can lend itself
345to many other interesting applications. Here's one that avoids the
54310121 346problem you get into if you explicitly test for end-of-file on
347an interactive file descriptor causing your program to appear to
cb1a09d0 348hang.
349
350 $on_a_tty = -t STDIN && -t STDOUT;
351 sub prompt { print "yes? " if $on_a_tty }
352 for ( prompt(); <STDIN>; prompt() ) {
353 # do something
54310121 354 }
cb1a09d0 355
00cb5da1 356Using C<readline> (or the operator form, C<< <EXPR> >>) as the
357conditional of a C<for> loop is shorthand for the following. This
358behaviour is the same as a C<while> loop conditional.
359
360 for ( prompt(); defined( $_ = <STDIN> ); prompt() ) {
361 # do something
362 }
363
cb1a09d0 364=head2 Foreach Loops
365
4633a7c4 366The C<foreach> loop iterates over a normal list value and sets the
55497cff 367variable VAR to be each element of the list in turn. If the variable
368is preceded with the keyword C<my>, then it is lexically scoped, and
369is therefore visible only within the loop. Otherwise, the variable is
370implicitly local to the loop and regains its former value upon exiting
371the loop. If the variable was previously declared with C<my>, it uses
372that variable instead of the global one, but it's still localized to
5c502d37 373the loop. This implicit localisation occurs I<only> in a C<foreach>
374loop.
4633a7c4 375
376The C<foreach> keyword is actually a synonym for the C<for> keyword, so
5a964f20 377you can use C<foreach> for readability or C<for> for brevity. (Or because
378the Bourne shell is more familiar to you than I<csh>, so writing C<for>
f86cebdf 379comes more naturally.) If VAR is omitted, C<$_> is set to each value.
c5674021 380
381If any element of LIST is an lvalue, you can modify it by modifying
382VAR inside the loop. Conversely, if any element of LIST is NOT an
383lvalue, any attempt to modify that element will fail. In other words,
384the C<foreach> loop index variable is an implicit alias for each item
385in the list that you're looping over.
302617ea 386
387If any part of LIST is an array, C<foreach> will get very confused if
388you add or remove elements within the loop body, for example with
389C<splice>. So don't do that.
390
391C<foreach> probably won't do what you expect if VAR is a tied or other
392special variable. Don't do that either.
4633a7c4 393
748a9306 394Examples:
a0d0e21e 395
4633a7c4 396 for (@ary) { s/foo/bar/ }
a0d0e21e 397
96f2dc66 398 for my $elem (@elements) {
a0d0e21e 399 $elem *= 2;
400 }
401
4633a7c4 402 for $count (10,9,8,7,6,5,4,3,2,1,'BOOM') {
403 print $count, "\n"; sleep(1);
a0d0e21e 404 }
405
406 for (1..15) { print "Merry Christmas\n"; }
407
4633a7c4 408 foreach $item (split(/:[\\\n:]*/, $ENV{TERMCAP})) {
a0d0e21e 409 print "Item: $item\n";
410 }
411
4633a7c4 412Here's how a C programmer might code up a particular algorithm in Perl:
413
55497cff 414 for (my $i = 0; $i < @ary1; $i++) {
415 for (my $j = 0; $j < @ary2; $j++) {
4633a7c4 416 if ($ary1[$i] > $ary2[$j]) {
417 last; # can't go to outer :-(
418 }
419 $ary1[$i] += $ary2[$j];
420 }
cb1a09d0 421 # this is where that last takes me
4633a7c4 422 }
423
184e9718 424Whereas here's how a Perl programmer more comfortable with the idiom might
cb1a09d0 425do it:
4633a7c4 426
96f2dc66 427 OUTER: for my $wid (@ary1) {
428 INNER: for my $jet (@ary2) {
cb1a09d0 429 next OUTER if $wid > $jet;
430 $wid += $jet;
54310121 431 }
432 }
4633a7c4 433
cb1a09d0 434See how much easier this is? It's cleaner, safer, and faster. It's
435cleaner because it's less noisy. It's safer because if code gets added
c07a80fd 436between the inner and outer loops later on, the new code won't be
5f05dabc 437accidentally executed. The C<next> explicitly iterates the other loop
c07a80fd 438rather than merely terminating the inner one. And it's faster because
439Perl executes a C<foreach> statement more rapidly than it would the
440equivalent C<for> loop.
4633a7c4 441
442=head2 Basic BLOCKs and Switch Statements
443
55497cff 444A BLOCK by itself (labeled or not) is semantically equivalent to a
445loop that executes once. Thus you can use any of the loop control
446statements in it to leave or restart the block. (Note that this is
447I<NOT> true in C<eval{}>, C<sub{}>, or contrary to popular belief
448C<do{}> blocks, which do I<NOT> count as loops.) The C<continue>
449block is optional.
4633a7c4 450
451The BLOCK construct is particularly nice for doing case
a0d0e21e 452structures.
453
454 SWITCH: {
455 if (/^abc/) { $abc = 1; last SWITCH; }
456 if (/^def/) { $def = 1; last SWITCH; }
457 if (/^xyz/) { $xyz = 1; last SWITCH; }
458 $nothing = 1;
459 }
460
f86cebdf 461There is no official C<switch> statement in Perl, because there are
83df6a1d 462already several ways to write the equivalent.
463
464However, starting from Perl 5.8 to get switch and case one can use
465the Switch extension and say:
466
467 use Switch;
468
469after which one has switch and case. It is not as fast as it could be
470because it's not really part of the language (it's done using source
471filters) but it is available, and it's very flexible.
472
473In addition to the above BLOCK construct, you could write
a0d0e21e 474
475 SWITCH: {
476 $abc = 1, last SWITCH if /^abc/;
477 $def = 1, last SWITCH if /^def/;
478 $xyz = 1, last SWITCH if /^xyz/;
479 $nothing = 1;
480 }
481
cb1a09d0 482(That's actually not as strange as it looks once you realize that you can
6ec4bd10 483use loop control "operators" within an expression. That's just the binary
484comma operator in scalar context. See L<perlop/"Comma Operator">.)
a0d0e21e 485
486or
487
488 SWITCH: {
489 /^abc/ && do { $abc = 1; last SWITCH; };
490 /^def/ && do { $def = 1; last SWITCH; };
491 /^xyz/ && do { $xyz = 1; last SWITCH; };
492 $nothing = 1;
493 }
494
f86cebdf 495or formatted so it stands out more as a "proper" C<switch> statement:
a0d0e21e 496
497 SWITCH: {
54310121 498 /^abc/ && do {
499 $abc = 1;
500 last SWITCH;
a0d0e21e 501 };
502
54310121 503 /^def/ && do {
504 $def = 1;
505 last SWITCH;
a0d0e21e 506 };
507
54310121 508 /^xyz/ && do {
509 $xyz = 1;
510 last SWITCH;
a0d0e21e 511 };
512 $nothing = 1;
513 }
514
515or
516
517 SWITCH: {
518 /^abc/ and $abc = 1, last SWITCH;
519 /^def/ and $def = 1, last SWITCH;
520 /^xyz/ and $xyz = 1, last SWITCH;
521 $nothing = 1;
522 }
523
524or even, horrors,
525
526 if (/^abc/)
527 { $abc = 1 }
528 elsif (/^def/)
529 { $def = 1 }
530 elsif (/^xyz/)
531 { $xyz = 1 }
532 else
533 { $nothing = 1 }
534
f86cebdf 535A common idiom for a C<switch> statement is to use C<foreach>'s aliasing to make
536a temporary assignment to C<$_> for convenient matching:
4633a7c4 537
538 SWITCH: for ($where) {
539 /In Card Names/ && do { push @flags, '-e'; last; };
540 /Anywhere/ && do { push @flags, '-h'; last; };
541 /In Rulings/ && do { last; };
542 die "unknown value for form variable where: `$where'";
54310121 543 }
4633a7c4 544
cb1a09d0 545Another interesting approach to a switch statement is arrange
546for a C<do> block to return the proper value:
547
548 $amode = do {
5a964f20 549 if ($flag & O_RDONLY) { "r" } # XXX: isn't this 0?
54310121 550 elsif ($flag & O_WRONLY) { ($flag & O_APPEND) ? "a" : "w" }
cb1a09d0 551 elsif ($flag & O_RDWR) {
552 if ($flag & O_CREAT) { "w+" }
c07a80fd 553 else { ($flag & O_APPEND) ? "a+" : "r+" }
cb1a09d0 554 }
555 };
556
5a964f20 557Or
558
559 print do {
560 ($flags & O_WRONLY) ? "write-only" :
561 ($flags & O_RDWR) ? "read-write" :
562 "read-only";
563 };
564
a031eab2 565Or if you are certain that all the C<&&> clauses are true, you can use
5a964f20 566something like this, which "switches" on the value of the
a2293a43 567C<HTTP_USER_AGENT> environment variable.
5a964f20 568
569 #!/usr/bin/perl
570 # pick out jargon file page based on browser
571 $dir = 'http://www.wins.uva.nl/~mes/jargon';
572 for ($ENV{HTTP_USER_AGENT}) {
573 $page = /Mac/ && 'm/Macintrash.html'
574 || /Win(dows )?NT/ && 'e/evilandrude.html'
575 || /Win|MSIE|WebTV/ && 'm/MicroslothWindows.html'
576 || /Linux/ && 'l/Linux.html'
577 || /HP-UX/ && 'h/HP-SUX.html'
578 || /SunOS/ && 's/ScumOS.html'
579 || 'a/AppendixB.html';
580 }
581 print "Location: $dir/$page\015\012\015\012";
582
583That kind of switch statement only works when you know the C<&&> clauses
584will be true. If you don't, the previous C<?:> example should be used.
585
19799a22 586You might also consider writing a hash of subroutine references
587instead of synthesizing a C<switch> statement.
5a964f20 588
4633a7c4 589=head2 Goto
590
19799a22 591Although not for the faint of heart, Perl does support a C<goto>
592statement. There are three forms: C<goto>-LABEL, C<goto>-EXPR, and
593C<goto>-&NAME. A loop's LABEL is not actually a valid target for
594a C<goto>; it's just the name of the loop.
4633a7c4 595
f86cebdf 596The C<goto>-LABEL form finds the statement labeled with LABEL and resumes
4633a7c4 597execution there. It may not be used to go into any construct that
f86cebdf 598requires initialization, such as a subroutine or a C<foreach> loop. It
4633a7c4 599also can't be used to go into a construct that is optimized away. It
600can be used to go almost anywhere else within the dynamic scope,
601including out of subroutines, but it's usually better to use some other
f86cebdf 602construct such as C<last> or C<die>. The author of Perl has never felt the
603need to use this form of C<goto> (in Perl, that is--C is another matter).
4633a7c4 604
f86cebdf 605The C<goto>-EXPR form expects a label name, whose scope will be resolved
606dynamically. This allows for computed C<goto>s per FORTRAN, but isn't
4633a7c4 607necessarily recommended if you're optimizing for maintainability:
608
96f2dc66 609 goto(("FOO", "BAR", "GLARCH")[$i]);
4633a7c4 610
f86cebdf 611The C<goto>-&NAME form is highly magical, and substitutes a call to the
4633a7c4 612named subroutine for the currently running subroutine. This is used by
f86cebdf 613C<AUTOLOAD()> subroutines that wish to load another subroutine and then
4633a7c4 614pretend that the other subroutine had been called in the first place
f86cebdf 615(except that any modifications to C<@_> in the current subroutine are
616propagated to the other subroutine.) After the C<goto>, not even C<caller()>
4633a7c4 617will be able to tell that this routine was called first.
618
c07a80fd 619In almost all cases like this, it's usually a far, far better idea to use the
620structured control flow mechanisms of C<next>, C<last>, or C<redo> instead of
4633a7c4 621resorting to a C<goto>. For certain applications, the catch and throw pair of
622C<eval{}> and die() for exception processing can also be a prudent approach.
cb1a09d0 623
624=head2 PODs: Embedded Documentation
625
626Perl has a mechanism for intermixing documentation with source code.
c07a80fd 627While it's expecting the beginning of a new statement, if the compiler
cb1a09d0 628encounters a line that begins with an equal sign and a word, like this
629
630 =head1 Here There Be Pods!
631
632Then that text and all remaining text up through and including a line
633beginning with C<=cut> will be ignored. The format of the intervening
54310121 634text is described in L<perlpod>.
cb1a09d0 635
636This allows you to intermix your source code
637and your documentation text freely, as in
638
639 =item snazzle($)
640
54310121 641 The snazzle() function will behave in the most spectacular
cb1a09d0 642 form that you can possibly imagine, not even excepting
643 cybernetic pyrotechnics.
644
645 =cut back to the compiler, nuff of this pod stuff!
646
647 sub snazzle($) {
648 my $thingie = shift;
649 .........
54310121 650 }
cb1a09d0 651
54310121 652Note that pod translators should look at only paragraphs beginning
184e9718 653with a pod directive (it makes parsing easier), whereas the compiler
54310121 654actually knows to look for pod escapes even in the middle of a
cb1a09d0 655paragraph. This means that the following secret stuff will be
656ignored by both the compiler and the translators.
657
658 $a=3;
659 =secret stuff
660 warn "Neither POD nor CODE!?"
661 =cut back
662 print "got $a\n";
663
f86cebdf 664You probably shouldn't rely upon the C<warn()> being podded out forever.
cb1a09d0 665Not all pod translators are well-behaved in this regard, and perhaps
666the compiler will become pickier.
774d564b 667
668One may also use pod directives to quickly comment out a section
669of code.
670
671=head2 Plain Old Comments (Not!)
672
6ec4bd10 673Perl can process line directives, much like the C preprocessor. Using
5a964f20 674this, one can control Perl's idea of filenames and line numbers in
774d564b 675error or warning messages (especially for strings that are processed
f86cebdf 676with C<eval()>). The syntax for this mechanism is the same as for most
774d564b 677C preprocessors: it matches the regular expression
6ec4bd10 678
679 # example: '# line 42 "new_filename.plx"'
82d4537c 680 /^\# \s*
6ec4bd10 681 line \s+ (\d+) \s*
7b6e93a8 682 (?:\s("?)([^"]+)\2)? \s*
6ec4bd10 683 $/x
684
7b6e93a8 685with C<$1> being the line number for the next line, and C<$3> being
686the optional filename (specified with or without quotes).
774d564b 687
003183f2 688There is a fairly obvious gotcha included with the line directive:
689Debuggers and profilers will only show the last source line to appear
690at a particular line number in a given file. Care should be taken not
691to cause line number collisions in code you'd like to debug later.
692
774d564b 693Here are some examples that you should be able to type into your command
694shell:
695
696 % perl
697 # line 200 "bzzzt"
698 # the `#' on the previous line must be the first char on line
699 die 'foo';
700 __END__
701 foo at bzzzt line 201.
54310121 702
774d564b 703 % perl
704 # line 200 "bzzzt"
705 eval qq[\n#line 2001 ""\ndie 'foo']; print $@;
706 __END__
707 foo at - line 2001.
54310121 708
774d564b 709 % perl
710 eval qq[\n#line 200 "foo bar"\ndie 'foo']; print $@;
711 __END__
712 foo at foo bar line 200.
54310121 713
774d564b 714 % perl
715 # line 345 "goop"
716 eval "\n#line " . __LINE__ . ' "' . __FILE__ ."\"\ndie 'foo'";
717 print $@;
718 __END__
719 foo at goop line 345.
720
721=cut