Re: [perl #17757] s///g fails when using English & study in 5.8.0
[p5sagit/p5-mst-13.2.git] / pod / perlrun.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perlrun - how to execute the Perl interpreter
4
5=head1 SYNOPSIS
6
6537fe72 7B<perl> S<[ B<-CsTtuUWX> ]>
e0ebc809 8 S<[ B<-hv> ] [ B<-V>[:I<configvar>] ]>
9 S<[ B<-cw> ] [ B<-d>[:I<debugger>] ] [ B<-D>[I<number/list>] ]>
10 S<[ B<-pna> ] [ B<-F>I<pattern> ] [ B<-l>[I<octal>] ] [ B<-0>[I<octal>] ]>
11 S<[ B<-I>I<dir> ] [ B<-m>[B<->]I<module> ] [ B<-M>[B<->]I<'module...'> ]>
12 S<[ B<-P> ]>
13 S<[ B<-S> ]>
14 S<[ B<-x>[I<dir>] ]>
15 S<[ B<-i>[I<extension>] ]>
16 S<[ B<-e> I<'command'> ] [ B<--> ] [ I<programfile> ] [ I<argument> ]...>
a0d0e21e 17
18=head1 DESCRIPTION
19
19799a22 20The normal way to run a Perl program is by making it directly
21executable, or else by passing the name of the source file as an
22argument on the command line. (An interactive Perl environment
23is also possible--see L<perldebug> for details on how to do that.)
24Upon startup, Perl looks for your program in one of the following
a0d0e21e 25places:
26
27=over 4
28
29=item 1.
30
31Specified line by line via B<-e> switches on the command line.
32
33=item 2.
34
35Contained in the file specified by the first filename on the command line.
a3cb178b 36(Note that systems supporting the #! notation invoke interpreters this
37way. See L<Location of Perl>.)
a0d0e21e 38
39=item 3.
40
5f05dabc 41Passed in implicitly via standard input. This works only if there are
19799a22 42no filename arguments--to pass arguments to a STDIN-read program you
43must explicitly specify a "-" for the program name.
a0d0e21e 44
45=back
46
47With methods 2 and 3, Perl starts parsing the input file from the
48beginning, unless you've specified a B<-x> switch, in which case it
49scans for the first line starting with #! and containing the word
19799a22 50"perl", and starts there instead. This is useful for running a program
a0d0e21e 51embedded in a larger message. (In this case you would indicate the end
19799a22 52of the program using the C<__END__> token.)
a0d0e21e 53
5f05dabc 54The #! line is always examined for switches as the line is being
55parsed. Thus, if you're on a machine that allows only one argument
56with the #! line, or worse, doesn't even recognize the #! line, you
57still can get consistent switch behavior regardless of how Perl was
19799a22 58invoked, even if B<-x> was used to find the beginning of the program.
59
60Because historically some operating systems silently chopped off
61kernel interpretation of the #! line after 32 characters, some
62switches may be passed in on the command line, and some may not;
63you could even get a "-" without its letter, if you're not careful.
64You probably want to make sure that all your switches fall either
65before or after that 32-character boundary. Most switches don't
66actually care if they're processed redundantly, but getting a "-"
67instead of a complete switch could cause Perl to try to execute
68standard input instead of your program. And a partial B<-I> switch
a0d0e21e 69could also cause odd results.
70
19799a22 71Some switches do care if they are processed twice, for instance
72combinations of B<-l> and B<-0>. Either put all the switches after
73the 32-character boundary (if applicable), or replace the use of
74B<-0>I<digits> by C<BEGIN{ $/ = "\0digits"; }>.
fb73857a 75
a0d0e21e 76Parsing of the #! switches starts wherever "perl" is mentioned in the line.
77The sequences "-*" and "- " are specifically ignored so that you could,
78if you were so inclined, say
79
80 #!/bin/sh -- # -*- perl -*- -p
19799a22 81 eval 'exec perl -wS $0 ${1+"$@"}'
5f05dabc 82 if $running_under_some_shell;
a0d0e21e 83
44a4342c 84to let Perl see the B<-p> switch.
19799a22 85
86A similar trick involves the B<env> program, if you have it.
87
88 #!/usr/bin/env perl
89
90The examples above use a relative path to the perl interpreter,
91getting whatever version is first in the user's path. If you want
92a specific version of Perl, say, perl5.005_57, you should place
93that directly in the #! line's path.
a0d0e21e 94
95If the #! line does not contain the word "perl", the program named after
96the #! is executed instead of the Perl interpreter. This is slightly
97bizarre, but it helps people on machines that don't do #!, because they
19799a22 98can tell a program that their SHELL is F</usr/bin/perl>, and Perl will then
a0d0e21e 99dispatch the program to the correct interpreter for them.
100
19799a22 101After locating your program, Perl compiles the entire program to an
a0d0e21e 102internal form. If there are any compilation errors, execution of the
19799a22 103program is not attempted. (This is unlike the typical shell script,
54310121 104which might run part-way through before finding a syntax error.)
a0d0e21e 105
19799a22 106If the program is syntactically correct, it is executed. If the program
a0d0e21e 107runs off the end without hitting an exit() or die() operator, an implicit
108C<exit(0)> is provided to indicate successful completion.
109
68dc0745 110=head2 #! and quoting on non-Unix systems
111
112Unix's #! technique can be simulated on other systems:
113
114=over 4
115
116=item OS/2
117
118Put
119
120 extproc perl -S -your_switches
121
19799a22 122as the first line in C<*.cmd> file (B<-S> due to a bug in cmd.exe's
68dc0745 123`extproc' handling).
124
54310121 125=item MS-DOS
68dc0745 126
19799a22 127Create a batch file to run your program, and codify it in
68dc0745 128C<ALTERNATIVE_SHEBANG> (see the F<dosish.h> file in the source
129distribution for more information).
130
131=item Win95/NT
132
6c6a61e2 133The Win95/NT installation, when using the ActiveState installer for Perl,
c8db1d39 134will modify the Registry to associate the F<.pl> extension with the perl
6c6a61e2 135interpreter. If you install Perl by other means (including building from
136the sources), you may have to modify the Registry yourself. Note that
137this means you can no longer tell the difference between an executable
138Perl program and a Perl library file.
68dc0745 139
140=item Macintosh
141
19799a22 142A Macintosh perl program will have the appropriate Creator and
68dc0745 143Type, so that double-clicking them will invoke the perl application.
144
bd3fa61c 145=item VMS
146
147Put
148
149 $ perl -mysw 'f$env("procedure")' 'p1' 'p2' 'p3' 'p4' 'p5' 'p6' 'p7' 'p8' !
150 $ exit++ + ++$status != 0 and $exit = $status = undef;
151
19799a22 152at the top of your program, where B<-mysw> are any command line switches you
153want to pass to Perl. You can now invoke the program directly, by saying
154C<perl program>, or as a DCL procedure, by saying C<@program> (or implicitly
155via F<DCL$PATH> by just using the name of the program).
bd3fa61c 156
157This incantation is a bit much to remember, but Perl will display it for
158you if you say C<perl "-V:startperl">.
159
68dc0745 160=back
161
162Command-interpreters on non-Unix systems have rather different ideas
163on quoting than Unix shells. You'll need to learn the special
164characters in your command-interpreter (C<*>, C<\> and C<"> are
165common) and how to protect whitespace and these characters to run
19799a22 166one-liners (see B<-e> below).
68dc0745 167
168On some systems, you may have to change single-quotes to double ones,
e6f03d26 169which you must I<not> do on Unix or Plan 9 systems. You might also
68dc0745 170have to change a single % to a %%.
171
172For example:
173
174 # Unix
175 perl -e 'print "Hello world\n"'
176
54310121 177 # MS-DOS, etc.
68dc0745 178 perl -e "print \"Hello world\n\""
179
54310121 180 # Macintosh
68dc0745 181 print "Hello world\n"
182 (then Run "Myscript" or Shift-Command-R)
183
184 # VMS
185 perl -e "print ""Hello world\n"""
186
19799a22 187The problem is that none of this is reliable: it depends on the
188command and it is entirely possible neither works. If B<4DOS> were
189the command shell, this would probably work better:
68dc0745 190
191 perl -e "print <Ctrl-x>"Hello world\n<Ctrl-x>""
192
19799a22 193B<CMD.EXE> in Windows NT slipped a lot of standard Unix functionality in
68dc0745 194when nobody was looking, but just try to find documentation for its
195quoting rules.
196
54310121 197Under the Macintosh, it depends which environment you are using. The MacPerl
68dc0745 198shell, or MPW, is much like Unix shells in its support for several
54310121 199quoting variants, except that it makes free use of the Macintosh's non-ASCII
68dc0745 200characters as control characters.
201
202There is no general solution to all of this. It's just a mess.
203
a3cb178b 204=head2 Location of Perl
205
206It may seem obvious to say, but Perl is useful only when users can
19799a22 207easily find it. When possible, it's good for both F</usr/bin/perl>
208and F</usr/local/bin/perl> to be symlinks to the actual binary. If
209that can't be done, system administrators are strongly encouraged
210to put (symlinks to) perl and its accompanying utilities into a
211directory typically found along a user's PATH, or in some other
212obvious and convenient place.
213
214In this documentation, C<#!/usr/bin/perl> on the first line of the program
215will stand in for whatever method works on your system. You are
216advised to use a specific path if you care about a specific version.
a3cb178b 217
19799a22 218 #!/usr/local/bin/perl5.00554
a3cb178b 219
19799a22 220or if you just want to be running at least version, place a statement
221like this at the top of your program:
a0d0e21e 222
19799a22 223 use 5.005_54;
a0d0e21e 224
19799a22 225=head2 Command Switches
226
227As with all standard commands, a single-character switch may be
228clustered with the following switch, if any.
229
230 #!/usr/bin/perl -spi.orig # same as -s -p -i.orig
a0d0e21e 231
232Switches include:
233
234=over 5
235
e0ebc809 236=item B<-0>[I<digits>]
a0d0e21e 237
55497cff 238specifies the input record separator (C<$/>) as an octal number. If there are
a0d0e21e 239no digits, the null character is the separator. Other switches may
240precede or follow the digits. For example, if you have a version of
241B<find> which can print filenames terminated by the null character, you
242can say this:
243
19799a22 244 find . -name '*.orig' -print0 | perl -n0e unlink
a0d0e21e 245
246The special value 00 will cause Perl to slurp files in paragraph mode.
5f05dabc 247The value 0777 will cause Perl to slurp files whole because there is no
a0d0e21e 248legal character with that value.
249
250=item B<-a>
251
252turns on autosplit mode when used with a B<-n> or B<-p>. An implicit
253split command to the @F array is done as the first thing inside the
254implicit while loop produced by the B<-n> or B<-p>.
255
256 perl -ane 'print pop(@F), "\n";'
257
258is equivalent to
259
260 while (<>) {
261 @F = split(' ');
262 print pop(@F), "\n";
263 }
264
265An alternate delimiter may be specified using B<-F>.
266
46487f74 267=item B<-C>
268
fde18df1 269enables Perl to use the Unicode APIs on the target system.
270
271As of Perl 5.8.1, if C<-C> is used and the locale settings (the LC_ALL,
272LC_CTYPE, and LANG environment variables) indicate a UTF-8 locale,
273the STDIN is expected to be in UTF-8, the STDOUT and STDERR are
274expected to be in UTF-8, and C<:utf8> is the default file open layer.
275See L<perluniintro>, L<perlfunc/open>, and L<open> for more information.
276The magic variable C<${^UTF8_LOCALE}> reflects this state,
277see L<perlvar/"${^UTF8_LOCALE}">. (Another way of setting this
278variable is to set the environment variable PERL_UTF8_LOCALE.)
279
280(In Perls earlier than 5.8.1 the C<-C> switch was a Win32-only switch
281that enabled the use of Unicode-aware "wide system call" Win32 APIs.
282This feature was practically unused, however, and the command line
283switch was therefore "recycled".)
46487f74 284
a0d0e21e 285=item B<-c>
286
19799a22 287causes Perl to check the syntax of the program and then exit without
7d30b5c4 288executing it. Actually, it I<will> execute C<BEGIN>, C<CHECK>, and
4f25aa18 289C<use> blocks, because these are considered as occurring outside the
290execution of your program. C<INIT> and C<END> blocks, however, will
291be skipped.
a0d0e21e 292
293=item B<-d>
294
19799a22 295runs the program under the Perl debugger. See L<perldebug>.
a0d0e21e 296
70c94a19 297=item B<-d:>I<foo[=bar,baz]>
3c81428c 298
19799a22 299runs the program under the control of a debugging, profiling, or
300tracing module installed as Devel::foo. E.g., B<-d:DProf> executes
70c94a19 301the program using the Devel::DProf profiler. As with the B<-M>
302flag, options may be passed to the Devel::foo package where they
303will be received and interpreted by the Devel::foo::import routine.
304The comma-separated list of options must follow a C<=> character.
305See L<perldebug>.
3c81428c 306
db2ba183 307=item B<-D>I<letters>
a0d0e21e 308
db2ba183 309=item B<-D>I<number>
a0d0e21e 310
19799a22 311sets debugging flags. To watch how it executes your program, use
db2ba183 312B<-Dtls>. (This works only if debugging is compiled into your
313Perl.) Another nice value is B<-Dx>, which lists your compiled
4197b13f 314syntax tree. And B<-Dr> displays compiled regular expressions;
44a4342c 315the format of the output is explained in L<perldebguts>.
4197b13f 316
317As an alternative, specify a number instead of list of letters (e.g.,
318B<-D14> is equivalent to B<-Dtls>):
a0d0e21e 319
db2ba183 320 1 p Tokenizing and parsing
321 2 s Stack snapshots
d6721266 322 with v, displays all stacks
db2ba183 323 4 l Context (loop) stack processing
324 8 t Trace execution
325 16 o Method and overloading resolution
326 32 c String/numeric conversions
1045810a 327 64 P Print profiling info, preprocessor command for -P, source file input state
db2ba183 328 128 m Memory allocation
329 256 f Format processing
330 512 r Regular expression parsing and execution
331 1024 x Syntax tree dump
332 2048 u Tainting checks
7bab3ede 333 4096 (Obsolete, previously used for LEAKTEST)
db2ba183 334 8192 H Hash dump -- usurps values()
335 16384 X Scratchpad allocation
336 32768 D Cleaning up
8b73bbec 337 65536 S Thread synchronization
607df283 338 131072 T Tokenising
04932ac8 339 262144 R Include reference counts of dumped variables (eg when using -Ds)
1045810a 340 524288 J Do not s,t,P-debug (Jump over) opcodes within package DB
d6721266 341 1048576 v Verbose: use in conjunction with other flags
46187eeb 342 2097152 C Copy On Write
a0d0e21e 343
19799a22 344All these flags require B<-DDEBUGGING> when you compile the Perl
1045810a 345executable (but see L<Devel::Peek>, L<re> which may change this).
44a4342c 346See the F<INSTALL> file in the Perl source distribution
19799a22 347for how to do this. This flag is automatically set if you include B<-g>
8c52afec 348option when C<Configure> asks you about optimizer/debugger flags.
349
19799a22 350If you're just trying to get a print out of each line of Perl code
351as it executes, the way that C<sh -x> provides for shell scripts,
44a4342c 352you can't use Perl's B<-D> switch. Instead do this
19799a22 353
c406981e 354 # If you have "env" utility
355 env=PERLDB_OPTS="NonStop=1 AutoTrace=1 frame=2" perl -dS program
356
19799a22 357 # Bourne shell syntax
358 $ PERLDB_OPTS="NonStop=1 AutoTrace=1 frame=2" perl -dS program
359
360 # csh syntax
361 % (setenv PERLDB_OPTS "NonStop=1 AutoTrace=1 frame=2"; perl -dS program)
362
363See L<perldebug> for details and variations.
364
a0d0e21e 365=item B<-e> I<commandline>
366
19799a22 367may be used to enter one line of program. If B<-e> is given, Perl
368will not look for a filename in the argument list. Multiple B<-e>
369commands may be given to build up a multi-line script. Make sure
370to use semicolons where you would in a normal program.
a0d0e21e 371
e0ebc809 372=item B<-F>I<pattern>
a0d0e21e 373
e0ebc809 374specifies the pattern to split on if B<-a> is also in effect. The
5f05dabc 375pattern may be surrounded by C<//>, C<"">, or C<''>, otherwise it will be
e0ebc809 376put in single quotes.
a0d0e21e 377
e0ebc809 378=item B<-h>
379
380prints a summary of the options.
381
382=item B<-i>[I<extension>]
a0d0e21e 383
2d259d92 384specifies that files processed by the C<E<lt>E<gt>> construct are to be
385edited in-place. It does this by renaming the input file, opening the
386output file by the original name, and selecting that output file as the
387default for print() statements. The extension, if supplied, is used to
388modify the name of the old file to make a backup copy, following these
389rules:
390
391If no extension is supplied, no backup is made and the current file is
392overwritten.
393
19799a22 394If the extension doesn't contain a C<*>, then it is appended to the
395end of the current filename as a suffix. If the extension does
396contain one or more C<*> characters, then each C<*> is replaced
397with the current filename. In Perl terms, you could think of this
398as:
2d259d92 399
66606d78 400 ($backup = $extension) =~ s/\*/$file_name/g;
2d259d92 401
402This allows you to add a prefix to the backup file, instead of (or in
403addition to) a suffix:
404
19799a22 405 $ perl -pi 'orig_*' -e 's/bar/baz/' fileA # backup to 'orig_fileA'
2d259d92 406
407Or even to place backup copies of the original files into another
408directory (provided the directory already exists):
409
19799a22 410 $ perl -pi 'old/*.orig' -e 's/bar/baz/' fileA # backup to 'old/fileA.orig'
2d259d92 411
66606d78 412These sets of one-liners are equivalent:
413
414 $ perl -pi -e 's/bar/baz/' fileA # overwrite current file
19799a22 415 $ perl -pi '*' -e 's/bar/baz/' fileA # overwrite current file
66606d78 416
19799a22 417 $ perl -pi '.orig' -e 's/bar/baz/' fileA # backup to 'fileA.orig'
418 $ perl -pi '*.orig' -e 's/bar/baz/' fileA # backup to 'fileA.orig'
66606d78 419
2d259d92 420From the shell, saying
a0d0e21e 421
19799a22 422 $ perl -p -i.orig -e "s/foo/bar/; ... "
a0d0e21e 423
19799a22 424is the same as using the program:
a0d0e21e 425
19799a22 426 #!/usr/bin/perl -pi.orig
a0d0e21e 427 s/foo/bar/;
428
429which is equivalent to
430
431 #!/usr/bin/perl
19799a22 432 $extension = '.orig';
433 LINE: while (<>) {
a0d0e21e 434 if ($ARGV ne $oldargv) {
66606d78 435 if ($extension !~ /\*/) {
436 $backup = $ARGV . $extension;
437 }
438 else {
439 ($backup = $extension) =~ s/\*/$ARGV/g;
440 }
441 rename($ARGV, $backup);
a0d0e21e 442 open(ARGVOUT, ">$ARGV");
443 select(ARGVOUT);
444 $oldargv = $ARGV;
445 }
446 s/foo/bar/;
447 }
448 continue {
449 print; # this prints to original filename
450 }
451 select(STDOUT);
452
453except that the B<-i> form doesn't need to compare $ARGV to $oldargv to
454know when the filename has changed. It does, however, use ARGVOUT for
66606d78 455the selected filehandle. Note that STDOUT is restored as the default
456output filehandle after the loop.
457
458As shown above, Perl creates the backup file whether or not any output
459is actually changed. So this is just a fancy way to copy files:
460
cd2d1bac 461 $ perl -p -i'/some/file/path/*' -e 1 file1 file2 file3...
19799a22 462or
cd2d1bac 463 $ perl -p -i'.orig' -e 1 file1 file2 file3...
66606d78 464
465You can use C<eof> without parentheses to locate the end of each input
466file, in case you want to append to each file, or reset line numbering
467(see example in L<perlfunc/eof>).
468
469If, for a given file, Perl is unable to create the backup file as
470specified in the extension then it will skip that file and continue on
471with the next one (if it exists).
472
19799a22 473For a discussion of issues surrounding file permissions and B<-i>,
cea6626f 474see L<perlfaq5/Why does Perl let me delete read-only files? Why does -i clobber protected files? Isn't this a bug in Perl?>.
66606d78 475
476You cannot use B<-i> to create directories or to strip extensions from
477files.
a0d0e21e 478
19799a22 479Perl does not expand C<~> in filenames, which is good, since some
480folks use it for their backup files:
a0d0e21e 481
19799a22 482 $ perl -pi~ -e 's/foo/bar/' file1 file2 file3...
483
484Finally, the B<-i> switch does not impede execution when no
a2008d6d 485files are given on the command line. In this case, no backup is made
486(the original file cannot, of course, be determined) and processing
487proceeds from STDIN to STDOUT as might be expected.
488
a0d0e21e 489=item B<-I>I<directory>
490
e0ebc809 491Directories specified by B<-I> are prepended to the search path for
1fef88e7 492modules (C<@INC>), and also tells the C preprocessor where to search for
e0ebc809 493include files. The C preprocessor is invoked with B<-P>; by default it
494searches /usr/include and /usr/lib/perl.
a0d0e21e 495
e0ebc809 496=item B<-l>[I<octnum>]
a0d0e21e 497
19799a22 498enables automatic line-ending processing. It has two separate
499effects. First, it automatically chomps C<$/> (the input record
500separator) when used with B<-n> or B<-p>. Second, it assigns C<$\>
501(the output record separator) to have the value of I<octnum> so
502that any print statements will have that separator added back on.
503If I<octnum> is omitted, sets C<$\> to the current value of
504C<$/>. For instance, to trim lines to 80 columns:
a0d0e21e 505
506 perl -lpe 'substr($_, 80) = ""'
507
508Note that the assignment C<$\ = $/> is done when the switch is processed,
509so the input record separator can be different than the output record
510separator if the B<-l> switch is followed by a B<-0> switch:
511
512 gnufind / -print0 | perl -ln0e 'print "found $_" if -p'
513
1fef88e7 514This sets C<$\> to newline and then sets C<$/> to the null character.
a0d0e21e 515
e0ebc809 516=item B<-m>[B<->]I<module>
517
518=item B<-M>[B<->]I<module>
c07a80fd 519
e0ebc809 520=item B<-M>[B<->]I<'module ...'>
521
522=item B<-[mM]>[B<->]I<module=arg[,arg]...>
3c81428c 523
19799a22 524B<-m>I<module> executes C<use> I<module> C<();> before executing your
525program.
3c81428c 526
19799a22 527B<-M>I<module> executes C<use> I<module> C<;> before executing your
528program. You can use quotes to add extra code after the module name,
529e.g., C<'-Mmodule qw(foo bar)'>.
3c81428c 530
19799a22 531If the first character after the B<-M> or B<-m> is a dash (C<->)
a5f75d66 532then the 'use' is replaced with 'no'.
533
54310121 534A little builtin syntactic sugar means you can also say
19799a22 535B<-mmodule=foo,bar> or B<-Mmodule=foo,bar> as a shortcut for
536C<'-Mmodule qw(foo bar)'>. This avoids the need to use quotes when
537importing symbols. The actual code generated by B<-Mmodule=foo,bar> is
e0ebc809 538C<use module split(/,/,q{foo,bar})>. Note that the C<=> form
19799a22 539removes the distinction between B<-m> and B<-M>.
3c81428c 540
a0d0e21e 541=item B<-n>
542
19799a22 543causes Perl to assume the following loop around your program, which
a0d0e21e 544makes it iterate over filename arguments somewhat like B<sed -n> or
545B<awk>:
546
19799a22 547 LINE:
a0d0e21e 548 while (<>) {
19799a22 549 ... # your program goes here
a0d0e21e 550 }
551
552Note that the lines are not printed by default. See B<-p> to have
08e9d68e 553lines printed. If a file named by an argument cannot be opened for
19799a22 554some reason, Perl warns you about it and moves on to the next file.
08e9d68e 555
556Here is an efficient way to delete all files older than a week:
a0d0e21e 557
19799a22 558 find . -mtime +7 -print | perl -nle unlink
a0d0e21e 559
19799a22 560This is faster than using the B<-exec> switch of B<find> because you don't
561have to start a process on every filename found. It does suffer from
562the bug of mishandling newlines in pathnames, which you can fix if
44a4342c 563you follow the example under B<-0>.
a0d0e21e 564
565C<BEGIN> and C<END> blocks may be used to capture control before or after
19799a22 566the implicit program loop, just as in B<awk>.
a0d0e21e 567
568=item B<-p>
569
19799a22 570causes Perl to assume the following loop around your program, which
a0d0e21e 571makes it iterate over filename arguments somewhat like B<sed>:
572
573
19799a22 574 LINE:
a0d0e21e 575 while (<>) {
19799a22 576 ... # your program goes here
a0d0e21e 577 } continue {
08e9d68e 578 print or die "-p destination: $!\n";
a0d0e21e 579 }
580
08e9d68e 581If a file named by an argument cannot be opened for some reason, Perl
582warns you about it, and moves on to the next file. Note that the
c2611fb3 583lines are printed automatically. An error occurring during printing is
08e9d68e 584treated as fatal. To suppress printing use the B<-n> switch. A B<-p>
585overrides a B<-n> switch.
a0d0e21e 586
587C<BEGIN> and C<END> blocks may be used to capture control before or after
19799a22 588the implicit loop, just as in B<awk>.
a0d0e21e 589
590=item B<-P>
591
079a94c4 592B<NOTE: Use of -P is strongly discouraged because of its inherent
593problems, including poor portability.>
594
595This option causes your program to be run through the C preprocessor before
efdf3af0 596compilation by Perl. Because both comments and B<cpp> directives begin
a0d0e21e 597with the # character, you should avoid starting comments with any words
efdf3af0 598recognized by the C preprocessor such as C<"if">, C<"else">, or C<"define">.
079a94c4 599
600If you're considering using C<-P>, you might also want to look at the
601Filter::cpp module from CPAN.
602
603The problems of -P include, but are not limited to:
604
605=over 10
606
607=item *
608
609The C<#!> line is stripped, so any switches there don't apply.
610
611=item *
612
613A C<-P> on a C<#!> line doesn't work.
614
615=item *
616
617B<All> lines that begin with (whitespace and) a C<#> but
618do not look like cpp commands, are stripped, including anything
44a4342c 619inside Perl strings, regular expressions, and here-docs .
079a94c4 620
621=item *
622
623In some platforms the C preprocessor knows too much: it knows about
624the C++ -style until-end-of-line comments starting with C<"//">.
efdf3af0 625This will cause problems with common Perl constructs like
626
627 s/foo//;
628
629because after -P this will became illegal code
630
631 s/foo
632
633The workaround is to use some other quoting separator than C<"/">,
634like for example C<"!">:
635
636 s!foo!!;
a0d0e21e 637
079a94c4 638
639
640=item *
641
642It requires not only a working C preprocessor but also a working
643F<sed>. If not on UNIX, you are probably out of luck on this.
644
645=item *
646
647Script line numbers are not preserved.
648
649=item *
650
651The C<-x> does not work with C<-P>.
652
653=back
9a1f07e7 654
a0d0e21e 655=item B<-s>
656
19799a22 657enables rudimentary switch parsing for switches on the command
658line after the program name but before any filename arguments (or before
3bbcc830 659an argument of B<-->). This means you can have switches with two leading
660dashes (B<--help>). Any switch found there is removed from @ARGV and sets the
19799a22 661corresponding variable in the Perl program. The following program
3c0facb2 662prints "1" if the program is invoked with a B<-xyz> switch, and "abc"
663if it is invoked with B<-xyz=abc>.
a0d0e21e 664
665 #!/usr/bin/perl -s
3c0facb2 666 if ($xyz) { print "$xyz\n" }
a0d0e21e 667
3bbcc830 668Do note that B<--help> creates the variable ${-help}, which is not compliant
669with C<strict refs>.
670
a0d0e21e 671=item B<-S>
672
673makes Perl use the PATH environment variable to search for the
19799a22 674program (unless the name of the program contains directory separators).
675
2a92aaa0 676On some platforms, this also makes Perl append suffixes to the
677filename while searching for it. For example, on Win32 platforms,
678the ".bat" and ".cmd" suffixes are appended if a lookup for the
679original name fails, and if the name does not already end in one
680of those suffixes. If your Perl was compiled with DEBUGGING turned
681on, using the -Dp switch to Perl shows how the search progresses.
682
2a92aaa0 683Typically this is used to emulate #! startup on platforms that
684don't support #!. This example works on many platforms that
685have a shell compatible with Bourne shell:
a0d0e21e 686
687 #!/usr/bin/perl
a3cb178b 688 eval 'exec /usr/bin/perl -wS $0 ${1+"$@"}'
a0d0e21e 689 if $running_under_some_shell;
690
19799a22 691The system ignores the first line and feeds the program to F</bin/sh>,
692which proceeds to try to execute the Perl program as a shell script.
a0d0e21e 693The shell executes the second line as a normal shell command, and thus
694starts up the Perl interpreter. On some systems $0 doesn't always
695contain the full pathname, so the B<-S> tells Perl to search for the
19799a22 696program if necessary. After Perl locates the program, it parses the
a0d0e21e 697lines and ignores them because the variable $running_under_some_shell
19799a22 698is never true. If the program will be interpreted by csh, you will need
a3cb178b 699to replace C<${1+"$@"}> with C<$*>, even though that doesn't understand
700embedded spaces (and such) in the argument list. To start up sh rather
a0d0e21e 701than csh, some systems may have to replace the #! line with a line
702containing just a colon, which will be politely ignored by Perl. Other
703systems can't control that, and need a totally devious construct that
19799a22 704will work under any of B<csh>, B<sh>, or Perl, such as the following:
a0d0e21e 705
19799a22 706 eval '(exit $?0)' && eval 'exec perl -wS $0 ${1+"$@"}'
a3cb178b 707 & eval 'exec /usr/bin/perl -wS $0 $argv:q'
5f05dabc 708 if $running_under_some_shell;
a0d0e21e 709
19799a22 710If the filename supplied contains directory separators (i.e., is an
711absolute or relative pathname), and if that file is not found,
712platforms that append file extensions will do so and try to look
713for the file with those extensions added, one by one.
714
715On DOS-like platforms, if the program does not contain directory
716separators, it will first be searched for in the current directory
717before being searched for on the PATH. On Unix platforms, the
718program will be searched for strictly on the PATH.
719
6537fe72 720=item B<-t>
721
722Like B<-T>, but taint checks will issue warnings rather than fatal
317ea90d 723errors. These warnings can be controlled normally with C<no warnings
724qw(taint)>.
1dbad523 725
726B<NOTE: this is not a substitute for -T.> This is meant only to be
727used as a temporary development aid while securing legacy code:
728for real production code and for new secure code written from scratch
729always use the real B<-T>.
6537fe72 730
a0d0e21e 731=item B<-T>
732
a3cb178b 733forces "taint" checks to be turned on so you can test them. Ordinarily
19799a22 734these checks are done only when running setuid or setgid. It's a
735good idea to turn them on explicitly for programs that run on behalf
736of someone else whom you might not necessarily trust, such as CGI
737programs or any internet servers you might write in Perl. See
738L<perlsec> for details. For security reasons, this option must be
739seen by Perl quite early; usually this means it must appear early
740on the command line or in the #! line for systems which support
741that construct.
a0d0e21e 742
743=item B<-u>
744
19799a22 745This obsolete switch causes Perl to dump core after compiling your
746program. You can then in theory take this core dump and turn it
747into an executable file by using the B<undump> program (not supplied).
748This speeds startup at the expense of some disk space (which you
749can minimize by stripping the executable). (Still, a "hello world"
750executable comes out to about 200K on my machine.) If you want to
751execute a portion of your program before dumping, use the dump()
752operator instead. Note: availability of B<undump> is platform
753specific and may not be available for a specific port of Perl.
754
755This switch has been superseded in favor of the new Perl code
756generator backends to the compiler. See L<B> and L<B::Bytecode>
757for details.
a0d0e21e 758
759=item B<-U>
760
761allows Perl to do unsafe operations. Currently the only "unsafe"
762operations are the unlinking of directories while running as superuser,
763and running setuid programs with fatal taint checks turned into
19799a22 764warnings. Note that the B<-w> switch (or the C<$^W> variable) must
765be used along with this option to actually I<generate> the
fb73857a 766taint-check warnings.
a0d0e21e 767
768=item B<-v>
769
19799a22 770prints the version and patchlevel of your perl executable.
a0d0e21e 771
3c81428c 772=item B<-V>
773
774prints summary of the major perl configuration values and the current
19799a22 775values of @INC.
3c81428c 776
e0ebc809 777=item B<-V:>I<name>
3c81428c 778
779Prints to STDOUT the value of the named configuration variable.
44a4342c 780For example,
3c81428c 781
19799a22 782 $ perl -V:man.dir
783
784will provide strong clues about what your MANPATH variable should
785be set to in order to access the Perl documentation.
a0d0e21e 786
19799a22 787=item B<-w>
774d564b 788
19799a22 789prints warnings about dubious constructs, such as variable names
790that are mentioned only once and scalar variables that are used
791before being set, redefined subroutines, references to undefined
792filehandles or filehandles opened read-only that you are attempting
793to write on, values used as a number that doesn't look like numbers,
794using an array as though it were a scalar, if your subroutines
795recurse more than 100 deep, and innumerable other things.
796
797This switch really just enables the internal C<^$W> variable. You
798can disable or promote into fatal errors specific warnings using
799C<__WARN__> hooks, as described in L<perlvar> and L<perlfunc/warn>.
800See also L<perldiag> and L<perltrap>. A new, fine-grained warning
801facility is also available if you want to manipulate entire classes
9f1b1f2d 802of warnings; see L<warnings> or L<perllexwarn>.
a0d0e21e 803
0453d815 804=item B<-W>
805
3c0facb2 806Enables all warnings regardless of C<no warnings> or C<$^W>.
0453d815 807See L<perllexwarn>.
808
809=item B<-X>
810
3c0facb2 811Disables all warnings regardless of C<use warnings> or C<$^W>.
0453d815 812See L<perllexwarn>.
813
a0d0e21e 814=item B<-x> I<directory>
815
19799a22 816tells Perl that the program is embedded in a larger chunk of unrelated
817ASCII text, such as in a mail message. Leading garbage will be
818discarded until the first line that starts with #! and contains the
819string "perl". Any meaningful switches on that line will be applied.
820If a directory name is specified, Perl will switch to that directory
821before running the program. The B<-x> switch controls only the
822disposal of leading garbage. The program must be terminated with
823C<__END__> if there is trailing garbage to be ignored (the program
824can process any or all of the trailing garbage via the DATA filehandle
825if desired).
a0d0e21e 826
1e422769 827=back
828
829=head1 ENVIRONMENT
830
831=over 12
832
833=item HOME
834
835Used if chdir has no argument.
836
837=item LOGDIR
838
839Used if chdir has no argument and HOME is not set.
840
841=item PATH
842
19799a22 843Used in executing subprocesses, and in finding the program if B<-S> is
1e422769 844used.
845
846=item PERL5LIB
847
848A colon-separated list of directories in which to look for Perl library
849files before looking in the standard library and the current
951ba7fe 850directory. Any architecture-specific directories under the specified
851locations are automatically included if they exist. If PERL5LIB is not
852defined, PERLLIB is used.
853
854When running taint checks (either because the program was running setuid
855or setgid, or the B<-T> switch was used), neither variable is used.
856The program should instead say:
1e422769 857
858 use lib "/my/directory";
859
54310121 860=item PERL5OPT
861
862Command-line options (switches). Switches in this variable are taken
1c4db469 863as if they were on every Perl command line. Only the B<-[DIMUdmtw]>
19799a22 864switches are allowed. When running taint checks (because the program
54310121 865was running setuid or setgid, or the B<-T> switch was used), this
74288ac8 866variable is ignored. If PERL5OPT begins with B<-T>, tainting will be
867enabled, and any subsequent options ignored.
54310121 868
16537909 869=item PERLIO
870
44a4342c 871A space (or colon) separated list of PerlIO layers. If perl is built
03d9e98a 872to use PerlIO system for IO (the default) these layers effect perl's IO.
44a4342c 873
874It is conventional to start layer names with a colon e.g. C<:perlio> to
875emphasise their similarity to variable "attributes". But the code that parses
876layer specification strings (which is also used to decode the PERLIO
877environment variable) treats the colon as a separator.
878
879The list becomes the default for I<all> perl's IO. Consequently only built-in
880layers can appear in this list, as external layers (such as :encoding()) need
881IO in order to load them!. See L<"open pragma"|open> for how to add external
882encodings as defaults.
883
884The layers that it makes sense to include in the PERLIO environment
885variable are summarised below. For more details see L<PerlIO>.
16537909 886
887=over 8
888
889=item :bytes
890
44a4342c 891Turns I<off> the C<:utf8> flag for the layer below.
99366417 892Unlikely to be useful in global PERLIO environment variable.
16537909 893
894=item :crlf
895
44a4342c 896A layer that implements DOS/Windows like CRLF line endings.
897On read converts pairs of CR,LF to a single "\n" newline character.
898On write converts each "\n" to a CR,LF pair.
899Based on the C<:perlio> layer.
900
901=item :mmap
902
903A layer which implements "reading" of files by using C<mmap()> to
904make (whole) file appear in the process's address space, and then
905using that as PerlIO's "buffer". This I<may> be faster in certain
906circumstances for large files, and may result in less physical memory
907use when multiple processes are reading the same file.
16537909 908
44a4342c 909Files which are not C<mmap()>-able revert to behaving like the C<:perlio>
910layer. Writes also behave like C<:perlio> layer as C<mmap()> for write
911needs extra house-keeping (to extend the file) which negates any advantage.
16537909 912
44a4342c 913The C<:mmap> layer will not exist if platform does not support C<mmap()>.
16537909 914
44a4342c 915=item :perlio
16537909 916
44a4342c 917A from scratch implementation of buffering for PerlIO. Provides fast
918access to the buffer for C<sv_gets> which implements perl's readline/E<lt>E<gt>
919and in general attempts to minimize data copying.
16537909 920
44a4342c 921C<:perlio> will insert a C<:unix> layer below itself to do low level IO.
16537909 922
44a4342c 923=item :raw
16537909 924
0226bbdb 925Applying the <:raw> layer is equivalent to calling C<binmode($fh)>.
926It makes the stream pass each byte as-is without any translation.
927In particular CRLF translation, and/or :utf8 inuited from locale
928are disabled.
1cbfc93d 929
0226bbdb 930Arranges for all accesses go straight to the lowest buffered layer provided
44a4342c 931by the configration. That is it strips off any layers above that layer.
16537909 932
fae2c0fb 933In Perl 5.6 and some books the C<:raw> layer (previously sometimes also
934referred to as a "discipline") is documented as the inverse of the
935C<:crlf> layer. That is no longer the case - other layers which would
936alter binary nature of the stream are also disabled. If you want UNIX
937line endings on a platform that normally does CRLF translation, but still
938want UTF-8 or encoding defaults the appropriate thing to do is to add
939C<:perlio> to PERLIO environment variable.
16537909 940
44a4342c 941=item :stdio
942
943This layer provides PerlIO interface by wrapping system's ANSI C "stdio"
944library calls. The layer provides both buffering and IO.
945Note that C<:stdio> layer does I<not> do CRLF translation even if that
946is platforms normal behaviour. You will need a C<:crlf> layer above it
947to do that.
948
949=item :unix
950
951Lowest level layer which provides basic PerlIO operations in terms of
952UNIX/POSIX numeric file descriptor calls
953C<open(), read(), write(), lseek(), close()>
16537909 954
955=item :utf8
956
44a4342c 957Turns on a flag on the layer below to tell perl that data sent to the
958stream should be converted to perl internal "utf8" form and that data from the
959stream should be considered as so encoded. On ASCII based platforms the
960encoding is UTF-8 and on EBCDIC platforms UTF-EBCDIC.
961May be useful in PERLIO environment variable to make UTF-8 the
962default. (To turn off that behaviour use C<:bytes> layer.)
963
964=item :win32
965
ab4f7683 966On Win32 platforms this I<experimental> layer uses native "handle" IO
44a4342c 967rather than unix-like numeric file descriptor layer. Known to be
968buggy in this release.
16537909 969
970=back
971
44a4342c 972On all platforms the default set of layers should give acceptable results.
973
ab4f7683 974For UNIX platforms that will equivalent of "unix perlio" or "stdio".
44a4342c 975Configure is setup to prefer "stdio" implementation if system's library
976provides for fast access to the buffer, otherwise it uses the "unix perlio"
977implementation.
978
979On Win32 the default in this release is "unix crlf". Win32's "stdio"
980has a number of bugs/mis-features for perl IO which are somewhat
99366417 981C compiler vendor/version dependent. Using our own C<crlf> layer as
44a4342c 982the buffer avoids those issues and makes things more uniform.
983The C<crlf> layer provides CRLF to/from "\n" conversion as well as
984buffering.
985
986This release uses C<unix> as the bottom layer on Win32 and so still uses C
987compiler's numeric file descriptor routines. There is an experimental native
988C<win32> layer which is expected to be enhanced and should eventually replace
989the C<unix> layer.
990
991=item PERLIO_DEBUG
992
993If set to the name of a file or device then certain operations of PerlIO
994sub-system will be logged to that file (opened as append). Typical uses
995are UNIX:
996
997 PERLIO_DEBUG=/dev/tty perl script ...
998
999and Win32 approximate equivalent:
1000
1001 set PERLIO_DEBUG=CON
1002 perl script ...
1003
16537909 1004
1e422769 1005=item PERLLIB
1006
1007A colon-separated list of directories in which to look for Perl library
1008files before looking in the standard library and the current directory.
1009If PERL5LIB is defined, PERLLIB is not used.
1010
1011=item PERL5DB
1012
1013The command used to load the debugger code. The default is:
1014
1015 BEGIN { require 'perl5db.pl' }
1016
19799a22 1017=item PERL5SHELL (specific to the Win32 port)
174c211a 1018
1019May be set to an alternative shell that perl must use internally for
ce1da67e 1020executing "backtick" commands or system(). Default is C<cmd.exe /x/c>
1021on WindowsNT and C<command.com /c> on Windows95. The value is considered
19799a22 1022to be space-separated. Precede any character that needs to be protected
ce1da67e 1023(like a space or backslash) with a backslash.
1024
1025Note that Perl doesn't use COMSPEC for this purpose because
1026COMSPEC has a high degree of variability among users, leading to
1027portability concerns. Besides, perl can use a shell that may not be
1028fit for interactive use, and setting COMSPEC to such a shell may
1029interfere with the proper functioning of other programs (which usually
1030look in COMSPEC to find a shell fit for interactive use).
174c211a 1031
1e422769 1032=item PERL_DEBUG_MSTATS
1033
67ce8856 1034Relevant only if perl is compiled with the malloc included with the perl
a3cb178b 1035distribution (that is, if C<perl -V:d_mymalloc> is 'define').
1036If set, this causes memory statistics to be dumped after execution. If set
1e422769 1037to an integer greater than one, also causes memory statistics to be dumped
1038after compilation.
1039
1040=item PERL_DESTRUCT_LEVEL
1041
1042Relevant only if your perl executable was built with B<-DDEBUGGING>,
1043this controls the behavior of global destruction of objects and other
64cea5fd 1044references. See L<perlhack/PERL_DESTRUCT_LEVEL> for more information.
a0d0e21e 1045
5d170f3a 1046=item PERL_ENCODING
1047
1048If using the C<encoding> pragma without an explicit encoding name, the
1049PERL_ENCODING environment variable is consulted for an encoding name.
1050
3d0ae7ba 1051=item PERL_ROOT (specific to the VMS port)
1052
1053A translation concealed rooted logical name that contains perl and the
1054logical device for the @INC path on VMS only. Other logical names that
44a4342c 1055affect perl on VMS include PERLSHR, PERL_ENV_TABLES, and
1056SYS$TIMEZONE_DIFFERENTIAL but are optional and discussed further in
3d0ae7ba 1057L<perlvms> and in F<README.vms> in the Perl source distribution.
1058
1059=item SYS$LOGIN (specific to the VMS port)
1060
1061Used if chdir has no argument and HOME and LOGDIR are not set.
1062
a0d0e21e 1063=back
1e422769 1064
1065Perl also has environment variables that control how Perl handles data
1066specific to particular natural languages. See L<perllocale>.
1067
1068Apart from these, Perl uses no other environment variables, except
19799a22 1069to make them available to the program being executed, and to child
1070processes. However, programs running setuid would do well to execute
1e422769 1071the following lines before doing anything else, just to keep people
1072honest:
1073
19799a22 1074 $ENV{PATH} = '/bin:/usr/bin'; # or whatever you need
7bac28a0 1075 $ENV{SHELL} = '/bin/sh' if exists $ENV{SHELL};
c90c0ff4 1076 delete @ENV{qw(IFS CDPATH ENV BASH_ENV)};