Re: [PATCH] Re: Modulus operator inconsistency
[p5sagit/p5-mst-13.2.git] / pod / perlport.pod
CommitLineData
e41182b5 1=head1 NAME
2
3perlport - Writing portable Perl
4
e41182b5 5=head1 DESCRIPTION
6
b7df3edc 7Perl runs on numerous operating systems. While most of them share
8much in common, they also have their own unique features.
e41182b5 9
10This document is meant to help you to find out what constitutes portable
b7df3edc 11Perl code. That way once you make a decision to write portably,
e41182b5 12you know where the lines are drawn, and you can stay within them.
13
b7df3edc 14There is a tradeoff between taking full advantage of one particular
15type of computer and taking advantage of a full range of them.
16Naturally, as you broaden your range and become more diverse, the
17common factors drop, and you are left with an increasingly smaller
18area of common ground in which you can operate to accomplish a
19particular task. Thus, when you begin attacking a problem, it is
20important to consider under which part of the tradeoff curve you
21want to operate. Specifically, you must decide whether it is
22important that the task that you are coding have the full generality
23of being portable, or whether to just get the job done right now.
24This is the hardest choice to be made. The rest is easy, because
25Perl provides many choices, whichever way you want to approach your
0a47030a 26problem.
27
28Looking at it another way, writing portable code is usually about
b7df3edc 29willfully limiting your available choices. Naturally, it takes
30discipline and sacrifice to do that. The product of portability
31and convenience may be a constant. You have been warned.
e41182b5 32
33Be aware of two important points:
34
35=over 4
36
37=item Not all Perl programs have to be portable
38
b7df3edc 39There is no reason you should not use Perl as a language to glue Unix
e41182b5 40tools together, or to prototype a Macintosh application, or to manage the
41Windows registry. If it makes no sense to aim for portability for one
42reason or another in a given program, then don't bother.
43
b7df3edc 44=item Nearly all of Perl already I<is> portable
e41182b5 45
46Don't be fooled into thinking that it is hard to create portable Perl
47code. It isn't. Perl tries its level-best to bridge the gaps between
48what's available on different platforms, and all the means available to
49use those features. Thus almost all Perl code runs on any machine
6ab3f9cb 50without modification. But there are some significant issues in
e41182b5 51writing portable code, and this document is entirely about those issues.
52
53=back
54
b7df3edc 55Here's the general rule: When you approach a task commonly done
56using a whole range of platforms, think about writing portable
e41182b5 57code. That way, you don't sacrifice much by way of the implementation
58choices you can avail yourself of, and at the same time you can give
59your users lots of platform choices. On the other hand, when you have to
60take advantage of some unique feature of a particular platform, as is
61often the case with systems programming (whether for Unix, Windows,
62S<Mac OS>, VMS, etc.), consider writing platform-specific code.
63
b7df3edc 64When the code will run on only two or three operating systems, you
65may need to consider only the differences of those particular systems.
66The important thing is to decide where the code will run and to be
0a47030a 67deliberate in your decision.
68
69The material below is separated into three main sections: main issues of
70portability (L<"ISSUES">, platform-specific issues (L<"PLATFORMS">, and
b7df3edc 71built-in perl functions that behave differently on various ports
0a47030a 72(L<"FUNCTION IMPLEMENTATIONS">.
e41182b5 73
74This information should not be considered complete; it includes possibly
b8099c3d 75transient information about idiosyncrasies of some of the ports, almost
b7df3edc 76all of which are in a state of constant evolution. Thus, this material
e41182b5 77should be considered a perpetual work in progress
c47ff5f1 78(<IMG SRC="yellow_sign.gif" ALT="Under Construction">).
e41182b5 79
e41182b5 80=head1 ISSUES
81
82=head2 Newlines
83
638bc118 84In most operating systems, lines in files are terminated by newlines.
e41182b5 85Just what is used as a newline may vary from OS to OS. Unix
b7df3edc 86traditionally uses C<\012>, one type of DOSish I/O uses C<\015\012>,
e41182b5 87and S<Mac OS> uses C<\015>.
88
b7df3edc 89Perl uses C<\n> to represent the "logical" newline, where what is
90logical may depend on the platform in use. In MacPerl, C<\n> always
91means C<\015>. In DOSish perls, C<\n> usually means C<\012>, but
92when accessing a file in "text" mode, STDIO translates it to (or
56d7751a 93from) C<\015\012>, depending on whether you're reading or writing.
b7df3edc 94Unix does the same thing on ttys in canonical mode. C<\015\012>
95is commonly referred to as CRLF.
96
5b3eff12 97A common cause of unportable programs is the misuse of chop() to trim
98newlines:
99
100 # XXX UNPORTABLE!
101 while(<FILE>) {
102 chop;
103 @array = split(/:/);
104 #...
105 }
106
107You can get away with this on Unix and MacOS (they have a single
108character end-of-line), but the same program will break under DOSish
109perls because you're only chop()ing half the end-of-line. Instead,
110chomp() should be used to trim newlines. The Dunce::Files module can
111help audit your code for misuses of chop().
112
113When dealing with binary files (or text files in binary mode) be sure
114to explicitly set $/ to the appropriate value for your file format
115before using chomp().
116
b7df3edc 117Because of the "text" mode translation, DOSish perls have limitations
118in using C<seek> and C<tell> on a file accessed in "text" mode.
119Stick to C<seek>-ing to locations you got from C<tell> (and no
120others), and you are usually free to use C<seek> and C<tell> even
121in "text" mode. Using C<seek> or C<tell> or other file operations
122may be non-portable. If you use C<binmode> on a file, however, you
123can usually C<seek> and C<tell> with arbitrary values in safety.
e41182b5 124
125A common misconception in socket programming is that C<\n> eq C<\012>
0a47030a 126everywhere. When using protocols such as common Internet protocols,
e41182b5 127C<\012> and C<\015> are called for specifically, and the values of
128the logical C<\n> and C<\r> (carriage return) are not reliable.
129
130 print SOCKET "Hi there, client!\r\n"; # WRONG
131 print SOCKET "Hi there, client!\015\012"; # RIGHT
132
0a47030a 133However, using C<\015\012> (or C<\cM\cJ>, or C<\x0D\x0A>) can be tedious
134and unsightly, as well as confusing to those maintaining the code. As
6ab3f9cb 135such, the Socket module supplies the Right Thing for those who want it.
e41182b5 136
137 use Socket qw(:DEFAULT :crlf);
138 print SOCKET "Hi there, client!$CRLF" # RIGHT
139
6ab3f9cb 140When reading from a socket, remember that the default input record
b7df3edc 141separator C<$/> is C<\n>, but robust socket code will recognize as
142either C<\012> or C<\015\012> as end of line:
e41182b5 143
144 while (<SOCKET>) {
145 # ...
146 }
147
b7df3edc 148Because both CRLF and LF end in LF, the input record separator can
149be set to LF and any CR stripped later. Better to write:
e41182b5 150
151 use Socket qw(:DEFAULT :crlf);
152 local($/) = LF; # not needed if $/ is already \012
153
154 while (<SOCKET>) {
155 s/$CR?$LF/\n/; # not sure if socket uses LF or CRLF, OK
156 # s/\015?\012/\n/; # same thing
157 }
158
b7df3edc 159This example is preferred over the previous one--even for Unix
160platforms--because now any C<\015>'s (C<\cM>'s) are stripped out
e41182b5 161(and there was much rejoicing).
162
6ab3f9cb 163Similarly, functions that return text data--such as a function that
b7df3edc 164fetches a web page--should sometimes translate newlines before
165returning the data, if they've not yet been translated to the local
166newline representation. A single line of code will often suffice:
2ee0eb3c 167
b7df3edc 168 $data =~ s/\015?\012/\n/g;
169 return $data;
2ee0eb3c 170
6ab3f9cb 171Some of this may be confusing. Here's a handy reference to the ASCII CR
172and LF characters. You can print it out and stick it in your wallet.
173
174 LF == \012 == \x0A == \cJ == ASCII 10
175 CR == \015 == \x0D == \cM == ASCII 13
176
177 | Unix | DOS | Mac |
178 ---------------------------
179 \n | LF | LF | CR |
180 \r | CR | CR | LF |
181 \n * | LF | CRLF | CR |
182 \r * | CR | CR | LF |
183 ---------------------------
184 * text-mode STDIO
185
b7df3edc 186The Unix column assumes that you are not accessing a serial line
187(like a tty) in canonical mode. If you are, then CR on input becomes
188"\n", and "\n" on output becomes CRLF.
189
6ab3f9cb 190These are just the most common definitions of C<\n> and C<\r> in Perl.
191There may well be others.
192
322422de 193=head2 Numbers endianness and Width
194
195Different CPUs store integers and floating point numbers in different
196orders (called I<endianness>) and widths (32-bit and 64-bit being the
b7df3edc 197most common today). This affects your programs when they attempt to transfer
198numbers in binary format from one CPU architecture to another,
199usually either "live" via network connection, or by storing the
200numbers to secondary storage such as a disk file or tape.
322422de 201
b7df3edc 202Conflicting storage orders make utter mess out of the numbers. If a
d1e3b762 203little-endian host (Intel, VAX) stores 0x12345678 (305419896 in
b84d4f81 204decimal), a big-endian host (Motorola, Sparc, PA) reads it as
2050x78563412 (2018915346 in decimal). Alpha and MIPS can be either:
206Digital/Compaq used/uses them in little-endian mode; SGI/Cray uses
207them in big-endian mode. To avoid this problem in network (socket)
208connections use the C<pack> and C<unpack> formats C<n> and C<N>, the
209"network" orders. These are guaranteed to be portable.
322422de 210
d1e3b762 211You can explore the endianness of your platform by unpacking a
212data structure packed in native format such as:
213
214 print unpack("h*", pack("s2", 1, 2)), "\n";
215 # '10002000' on e.g. Intel x86 or Alpha 21064 in little-endian mode
216 # '00100020' on e.g. Motorola 68040
217
218If you need to distinguish between endian architectures you could use
219either of the variables set like so:
220
221 $is_big_endian = unpack("h*", pack("s", 1)) =~ /01/;
4375e838 222 $is_little_endian = unpack("h*", pack("s", 1)) =~ /^1/;
d1e3b762 223
b7df3edc 224Differing widths can cause truncation even between platforms of equal
225endianness. The platform of shorter width loses the upper parts of the
322422de 226number. There is no good solution for this problem except to avoid
227transferring or storing raw binary numbers.
228
b7df3edc 229One can circumnavigate both these problems in two ways. Either
322422de 230transfer and store numbers always in text format, instead of raw
b7df3edc 231binary, or else consider using modules like Data::Dumper (included in
83df6a1d 232the standard distribution as of Perl 5.005) and Storable (included as
233of perl 5.8). Keeping all data as text significantly simplifies matters.
322422de 234
433acd8a 235=head2 Files and Filesystems
e41182b5 236
237Most platforms these days structure files in a hierarchical fashion.
b7df3edc 238So, it is reasonably safe to assume that all platforms support the
6ab3f9cb 239notion of a "path" to uniquely identify a file on the system. How
b7df3edc 240that path is really written, though, differs considerably.
e41182b5 241
4375e838 242Although similar, file path specifications differ between Unix,
b7df3edc 243Windows, S<Mac OS>, OS/2, VMS, VOS, S<RISC OS>, and probably others.
244Unix, for example, is one of the few OSes that has the elegant idea
245of a single root directory.
322422de 246
6ab3f9cb 247DOS, OS/2, VMS, VOS, and Windows can work similarly to Unix with C</>
248as path separator, or in their own idiosyncratic ways (such as having
249several root directories and various "unrooted" device files such NIL:
250and LPT:).
322422de 251
252S<Mac OS> uses C<:> as a path separator instead of C</>.
253
6ab3f9cb 254The filesystem may support neither hard links (C<link>) nor
255symbolic links (C<symlink>, C<readlink>, C<lstat>).
433acd8a 256
6ab3f9cb 257The filesystem may support neither access timestamp nor change
433acd8a 258timestamp (meaning that about the only portable timestamp is the
259modification timestamp), or one second granularity of any timestamps
260(e.g. the FAT filesystem limits the time granularity to two seconds).
261
495c5fdc 262VOS perl can emulate Unix filenames with C</> as path separator. The
263native pathname characters greater-than, less-than, number-sign, and
264percent-sign are always accepted.
265
6ab3f9cb 266S<RISC OS> perl can emulate Unix filenames with C</> as path
322422de 267separator, or go native and use C<.> for path separator and C<:> to
6ab3f9cb 268signal filesystems and disk names.
e41182b5 269
a1667ba3 270Don't assume UNIX filesystem access semantics: that read, write,
271and execute are all the permissions there are, and even if they exist,
272that their semantics (for example what do r, w, and x mean on
273a directory) are the UNIX ones. The various UNIX/POSIX compatibility
274layers usually try to make interfaces like chmod() work, but sometimes
275there simply is no good mapping.
276
b7df3edc 277If all this is intimidating, have no (well, maybe only a little)
278fear. There are modules that can help. The File::Spec modules
279provide methods to do the Right Thing on whatever platform happens
280to be running the program.
e41182b5 281
6ab3f9cb 282 use File::Spec::Functions;
283 chdir(updir()); # go up one directory
284 $file = catfile(curdir(), 'temp', 'file.txt');
e41182b5 285 # on Unix and Win32, './temp/file.txt'
286 # on Mac OS, ':temp:file.txt'
d1e3b762 287 # on VMS, '[.temp]file.txt'
e41182b5 288
b7df3edc 289File::Spec is available in the standard distribution as of version
d1e3b762 2905.004_05. File::Spec::Functions is only in File::Spec 0.7 and later,
291and some versions of perl come with version 0.6. If File::Spec
292is not updated to 0.7 or later, you must use the object-oriented
293interface from File::Spec (or upgrade File::Spec).
e41182b5 294
b7df3edc 295In general, production code should not have file paths hardcoded.
296Making them user-supplied or read from a configuration file is
297better, keeping in mind that file path syntax varies on different
298machines.
e41182b5 299
300This is especially noticeable in scripts like Makefiles and test suites,
301which often assume C</> as a path separator for subdirectories.
302
b7df3edc 303Also of use is File::Basename from the standard distribution, which
e41182b5 304splits a pathname into pieces (base filename, full path to directory,
305and file suffix).
306
19799a22 307Even when on a single platform (if you can call Unix a single platform),
b7df3edc 308remember not to count on the existence or the contents of particular
3c075c7d 309system-specific files or directories, like F</etc/passwd>,
b7df3edc 310F</etc/sendmail.conf>, F</etc/resolv.conf>, or even F</tmp/>. For
311example, F</etc/passwd> may exist but not contain the encrypted
312passwords, because the system is using some form of enhanced security.
313Or it may not contain all the accounts, because the system is using NIS.
3c075c7d 314If code does need to rely on such a file, include a description of the
b7df3edc 315file and its format in the code's documentation, then make it easy for
3c075c7d 316the user to override the default location of the file.
317
b7df3edc 318Don't assume a text file will end with a newline. They should,
319but people forget.
e41182b5 320
ec481373 321Do not have two files or directories of the same name with different
322case, like F<test.pl> and F<Test.pl>, as many platforms have
323case-insensitive (or at least case-forgiving) filenames. Also, try
324not to have non-word characters (except for C<.>) in the names, and
325keep them to the 8.3 convention, for maximum portability, onerous a
326burden though this may appear.
dd9f0070 327
b7df3edc 328Likewise, when using the AutoSplit module, try to keep your functions to
3298.3 naming and case-insensitive conventions; or, at the least,
dd9f0070 330make it so the resulting files have a unique (case-insensitively)
331first 8 characters.
332
ec481373 333Whitespace in filenames is tolerated on most systems, but not all,
334and even on systems where it might be tolerated, some utilities
335might becoem confused by such whitespace.
336
b7df3edc 337Many systems (DOS, VMS) cannot have more than one C<.> in their filenames.
433acd8a 338
c47ff5f1 339Don't assume C<< > >> won't be the first character of a filename.
340Always use C<< < >> explicitly to open a file for reading,
b7df3edc 341unless you want the user to be able to specify a pipe open.
0a47030a 342
6ab3f9cb 343 open(FILE, "< $existing_file") or die $!;
0a47030a 344
6ab3f9cb 345If filenames might use strange characters, it is safest to open it
346with C<sysopen> instead of C<open>. C<open> is magic and can
c47ff5f1 347translate characters like C<< > >>, C<< < >>, and C<|>, which may
b7df3edc 348be the wrong thing to do. (Sometimes, though, it's the right thing.)
e41182b5 349
ec481373 350Don't use C<:> as a part of a filename since many systems use that for
351their own semantics (MacOS Classic for separating pathname components,
352many networking schemes and utilities for separating the nodename and
353the pathname, and so on).
354
355The I<portable filename characters> as defined by ANSI C are
356
357 a b c d e f g h i j k l m n o p q r t u v w x y z
358 A B C D E F G H I J K L M N O P Q R T U V W X Y Z
359 0 1 2 3 4 5 6 7 8 9
360 . _ -
361
362and the "-" shouldn't be the first character.
363
e41182b5 364=head2 System Interaction
365
b7df3edc 366Not all platforms provide a command line. These are usually platforms
367that rely primarily on a Graphical User Interface (GUI) for user
368interaction. A program requiring a command line interface might
369not work everywhere. This is probably for the user of the program
370to deal with, so don't stay up late worrying about it.
e41182b5 371
b7df3edc 372Some platforms can't delete or rename files held open by the system.
373Remember to C<close> files when you are done with them. Don't
374C<unlink> or C<rename> an open file. Don't C<tie> or C<open> a
375file already tied or opened; C<untie> or C<close> it first.
e41182b5 376
0a47030a 377Don't open the same file more than once at a time for writing, as some
378operating systems put mandatory locks on such files.
379
73e9292c 380Don't assume that write/modify permission on a directory gives the
381right to add or delete files/directories in that directory. That is
382filesystem specific: in some filesystems you need write/modify
383permission also (or even just) in the file/directory itself. In some
384filesystems (AFS, DFS) the permission to add/delete directory entries
385is a completely separate permission.
386
387Don't assume that a single C<unlink> completely gets rid of the file:
388some filesystems (most notably the ones in VMS) have versioned
389filesystems, and unlink() removes only the most recent one (it doesn't
390remove all the versions because by default the native tools on those
391platforms remove just the most recent version, too). The portable
392idiom to remove all the versions of a file is
393
94bb614c 394 1 while unlink "file";
73e9292c 395
396This will terminate if the file is undeleteable for some reason
397(protected, not there, and so on).
398
e41182b5 399Don't count on a specific environment variable existing in C<%ENV>.
0a47030a 400Don't count on C<%ENV> entries being case-sensitive, or even
4a0d0822 401case-preserving. Don't try to clear %ENV by saying C<%ENV = ();>, or,
6be8f7a6 402if you really have to, make it conditional on C<$^O ne 'VMS'> since in
4a0d0822 403VMS the C<%ENV> table is much more than a per-process key-value string
404table.
e41182b5 405
d1e3b762 406Don't count on signals or C<%SIG> for anything.
e41182b5 407
408Don't count on filename globbing. Use C<opendir>, C<readdir>, and
409C<closedir> instead.
410
b8099c3d 411Don't count on per-program environment variables, or per-program current
dd9f0070 412directories.
b8099c3d 413
3c075c7d 414Don't count on specific values of C<$!>.
415
e41182b5 416=head2 Interprocess Communication (IPC)
417
b7df3edc 418In general, don't directly access the system in code meant to be
419portable. That means, no C<system>, C<exec>, C<fork>, C<pipe>,
420C<``>, C<qx//>, C<open> with a C<|>, nor any of the other things
421that makes being a perl hacker worth being.
e41182b5 422
423Commands that launch external processes are generally supported on
b7df3edc 424most platforms (though many of them do not support any type of
425forking). The problem with using them arises from what you invoke
426them on. External tools are often named differently on different
4375e838 427platforms, may not be available in the same location, might accept
b7df3edc 428different arguments, can behave differently, and often present their
429results in a platform-dependent way. Thus, you should seldom depend
430on them to produce consistent results. (Then again, if you're calling
431I<netstat -a>, you probably don't expect it to run on both Unix and CP/M.)
e41182b5 432
b7df3edc 433One especially common bit of Perl code is opening a pipe to B<sendmail>:
e41182b5 434
b7df3edc 435 open(MAIL, '|/usr/lib/sendmail -t')
436 or die "cannot fork sendmail: $!";
e41182b5 437
438This is fine for systems programming when sendmail is known to be
439available. But it is not fine for many non-Unix systems, and even
440some Unix systems that may not have sendmail installed. If a portable
b7df3edc 441solution is needed, see the various distributions on CPAN that deal
442with it. Mail::Mailer and Mail::Send in the MailTools distribution are
443commonly used, and provide several mailing methods, including mail,
444sendmail, and direct SMTP (via Net::SMTP) if a mail transfer agent is
445not available. Mail::Sendmail is a standalone module that provides
446simple, platform-independent mailing.
447
448The Unix System V IPC (C<msg*(), sem*(), shm*()>) is not available
449even on all Unix platforms.
e41182b5 450
a81e5e2e 451Do not use either the bare result of C<pack("N", 10, 20, 30, 40)> or
452bare v-strings (such as C<v10.20.30.40>) to represent IPv4 addresses:
453both forms just pack the four bytes into network order. That this
454would be equal to the C language C<in_addr> struct (which is what the
455socket code internally uses) is not guaranteed. To be portable use
456the routines of the Socket extension, such as C<inet_aton()>,
457C<inet_ntoa()>, and C<sockaddr_in()>.
6b2463a0 458
e41182b5 459The rule of thumb for portable code is: Do it all in portable Perl, or
0a47030a 460use a module (that may internally implement it with platform-specific
461code, but expose a common interface).
e41182b5 462
e41182b5 463=head2 External Subroutines (XS)
464
b7df3edc 465XS code can usually be made to work with any platform, but dependent
e41182b5 466libraries, header files, etc., might not be readily available or
467portable, or the XS code itself might be platform-specific, just as Perl
468code might be. If the libraries and headers are portable, then it is
469normally reasonable to make sure the XS code is portable, too.
470
b7df3edc 471A different type of portability issue arises when writing XS code:
472availability of a C compiler on the end-user's system. C brings
473with it its own portability issues, and writing XS code will expose
474you to some of those. Writing purely in Perl is an easier way to
e41182b5 475achieve portability.
476
e41182b5 477=head2 Standard Modules
478
479In general, the standard modules work across platforms. Notable
6ab3f9cb 480exceptions are the CPAN module (which currently makes connections to external
e41182b5 481programs that may not be available), platform-specific modules (like
6ab3f9cb 482ExtUtils::MM_VMS), and DBM modules.
e41182b5 483
b7df3edc 484There is no one DBM module available on all platforms.
6ab3f9cb 485SDBM_File and the others are generally available on all Unix and DOSish
486ports, but not in MacPerl, where only NBDM_File and DB_File are
0a47030a 487available.
e41182b5 488
489The good news is that at least some DBM module should be available, and
6ab3f9cb 490AnyDBM_File will use whichever module it can find. Of course, then
b7df3edc 491the code needs to be fairly strict, dropping to the greatest common
492factor (e.g., not exceeding 1K for each record), so that it will
6ab3f9cb 493work with any DBM module. See L<AnyDBM_File> for more details.
e41182b5 494
e41182b5 495=head2 Time and Date
496
0a47030a 497The system's notion of time of day and calendar date is controlled in
b7df3edc 498widely different ways. Don't assume the timezone is stored in C<$ENV{TZ}>,
0a47030a 499and even if it is, don't assume that you can control the timezone through
500that variable.
e41182b5 501
322422de 502Don't assume that the epoch starts at 00:00:00, January 1, 1970,
6ab3f9cb 503because that is OS- and implementation-specific. It is better to store a date
504in an unambiguous representation. The ISO-8601 standard defines
505"YYYY-MM-DD" as the date format. A text representation (like "1987-12-18")
506can be easily converted into an OS-specific value using a module like
507Date::Parse. An array of values, such as those returned by
322422de 508C<localtime>, can be converted to an OS-specific representation using
6ab3f9cb 509Time::Local.
322422de 510
19799a22 511When calculating specific times, such as for tests in time or date modules,
512it may be appropriate to calculate an offset for the epoch.
b7df3edc 513
19799a22 514 require Time::Local;
515 $offset = Time::Local::timegm(0, 0, 0, 1, 0, 70);
b7df3edc 516
19799a22 517The value for C<$offset> in Unix will be C<0>, but in Mac OS will be
518some large number. C<$offset> can then be added to a Unix time value
519to get what should be the proper value on any system.
322422de 520
521=head2 Character sets and character encoding
522
ec481373 523Assume very little about character sets.
524
525Assume nothing about numerical values (C<ord>, C<chr>) of characters.
526Do not use explicit code point ranges (like \xHH-\xHH); use for
527example symbolic character classes like C<[:print:]>.
528
529Do not assume that the alphabetic characters are encoded contiguously
530(in the numeric sense). There may be gaps.
531
532Do not assume anything about the ordering of the characters.
533The lowercase letters may come before or after the uppercase letters;
534the lowercase and uppercase may be interlaced so that both `a' and `A'
535come before `b'; the accented and other international characters may
536be interlaced so that E<auml> comes before `b'.
322422de 537
538=head2 Internationalisation
539
b7df3edc 540If you may assume POSIX (a rather large assumption), you may read
541more about the POSIX locale system from L<perllocale>. The locale
542system at least attempts to make things a little bit more portable,
543or at least more convenient and native-friendly for non-English
544users. The system affects character sets and encoding, and date
545and time formatting--amongst other things.
e41182b5 546
547=head2 System Resources
548
0a47030a 549If your code is destined for systems with severely constrained (or
550missing!) virtual memory systems then you want to be I<especially> mindful
551of avoiding wasteful constructs such as:
e41182b5 552
553 # NOTE: this is no longer "bad" in perl5.005
554 for (0..10000000) {} # bad
555 for (my $x = 0; $x <= 10000000; ++$x) {} # good
556
557 @lines = <VERY_LARGE_FILE>; # bad
558
559 while (<FILE>) {$file .= $_} # sometimes bad
0a47030a 560 $file = join('', <FILE>); # better
e41182b5 561
b7df3edc 562The last two constructs may appear unintuitive to most people. The
563first repeatedly grows a string, whereas the second allocates a
564large chunk of memory in one go. On some systems, the second is
565more efficient that the first.
0a47030a 566
e41182b5 567=head2 Security
568
b7df3edc 569Most multi-user platforms provide basic levels of security, usually
570implemented at the filesystem level. Some, however, do
a1667ba3 571not-- unfortunately. Thus the notion of user id, or "home" directory,
b7df3edc 572or even the state of being logged-in, may be unrecognizable on many
573platforms. If you write programs that are security-conscious, it
574is usually best to know what type of system you will be running
575under so that you can write code explicitly for that platform (or
576class of platforms).
0a47030a 577
a1667ba3 578Don't assume the UNIX filesystem access semantics: the operating
579system or the filesystem may be using some ACL systems, which are
580richer languages than the usual rwx. Even if the rwx exist,
581their semantics might be different.
582
583(From security viewpoint testing for permissions before attempting to
584do something is silly anyway: if one tries this, there is potential
585for race conditions-- someone or something might change the
586permissions between the permissions check and the actual operation.
587Just try the operation.)
588
589Don't assume the UNIX user and group semantics: especially, don't
2edcc0d9 590expect the C<< $< >> and C<< $> >> (or the C<$(> and C<$)>) to work
a1667ba3 591for switching identities (or memberships).
592
593Don't assume set-uid and set-gid semantics. (And even if you do,
594think twice: set-uid and set-gid are a known can of security worms.)
595
e41182b5 596=head2 Style
597
598For those times when it is necessary to have platform-specific code,
599consider keeping the platform-specific code in one place, making porting
6ab3f9cb 600to other platforms easier. Use the Config module and the special
0a47030a 601variable C<$^O> to differentiate platforms, as described in
602L<"PLATFORMS">.
e41182b5 603
b7df3edc 604Be careful in the tests you supply with your module or programs.
605Module code may be fully portable, but its tests might not be. This
606often happens when tests spawn off other processes or call external
607programs to aid in the testing, or when (as noted above) the tests
608assume certain things about the filesystem and paths. Be careful
609not to depend on a specific output style for errors, such as when
d1be9408 610checking C<$!> after a system call. Some platforms expect a certain
b7df3edc 611output format, and perl on those platforms may have been adjusted
612accordingly. Most specifically, don't anchor a regex when testing
613an error value.
e41182b5 614
0a47030a 615=head1 CPAN Testers
e41182b5 616
0a47030a 617Modules uploaded to CPAN are tested by a variety of volunteers on
618different platforms. These CPAN testers are notified by mail of each
e41182b5 619new upload, and reply to the list with PASS, FAIL, NA (not applicable to
0a47030a 620this platform), or UNKNOWN (unknown), along with any relevant notations.
e41182b5 621
622The purpose of the testing is twofold: one, to help developers fix any
0a47030a 623problems in their code that crop up because of lack of testing on other
b7df3edc 624platforms; two, to provide users with information about whether
0a47030a 625a given module works on a given platform.
e41182b5 626
627=over 4
628
629=item Mailing list: cpan-testers@perl.org
630
c997b287 631=item Testing results: http://testers.cpan.org/
e41182b5 632
633=back
634
e41182b5 635=head1 PLATFORMS
636
637As of version 5.002, Perl is built with a C<$^O> variable that
638indicates the operating system it was built on. This was implemented
b7df3edc 639to help speed up code that would otherwise have to C<use Config>
640and use the value of C<$Config{osname}>. Of course, to get more
e41182b5 641detailed information about the system, looking into C<%Config> is
642certainly recommended.
643
b7df3edc 644C<%Config> cannot always be trusted, however, because it was built
645at compile time. If perl was built in one place, then transferred
646elsewhere, some values may be wrong. The values may even have been
647edited after the fact.
6ab3f9cb 648
e41182b5 649=head2 Unix
650
651Perl works on a bewildering variety of Unix and Unix-like platforms (see
652e.g. most of the files in the F<hints/> directory in the source code kit).
653On most of these systems, the value of C<$^O> (hence C<$Config{'osname'}>,
d1e3b762 654too) is determined either by lowercasing and stripping punctuation from the
655first field of the string returned by typing C<uname -a> (or a similar command)
656at the shell prompt or by testing the file system for the presence of
657uniquely named files such as a kernel or header file. Here, for example,
658are a few of the more popular Unix flavors:
e41182b5 659
b7df3edc 660 uname $^O $Config{'archname'}
6ab3f9cb 661 --------------------------------------------
b7df3edc 662 AIX aix aix
6ab3f9cb 663 BSD/OS bsdos i386-bsdos
664 dgux dgux AViiON-dgux
665 DYNIX/ptx dynixptx i386-dynixptx
b7df3edc 666 FreeBSD freebsd freebsd-i386
d1e3b762 667 Linux linux arm-linux
b7df3edc 668 Linux linux i386-linux
6ab3f9cb 669 Linux linux i586-linux
670 Linux linux ppc-linux
b7df3edc 671 HP-UX hpux PA-RISC1.1
672 IRIX irix irix
b787fad4 673 Mac OS X darwin darwin
d1e3b762 674 MachTen PPC machten powerpc-machten
675 NeXT 3 next next-fat
676 NeXT 4 next OPENSTEP-Mach
6ab3f9cb 677 openbsd openbsd i386-openbsd
b7df3edc 678 OSF1 dec_osf alpha-dec_osf
6ab3f9cb 679 reliantunix-n svr4 RM400-svr4
680 SCO_SV sco_sv i386-sco_sv
681 SINIX-N svr4 RM400-svr4
682 sn4609 unicos CRAY_C90-unicos
683 sn6521 unicosmk t3e-unicosmk
684 sn9617 unicos CRAY_J90-unicos
b7df3edc 685 SunOS solaris sun4-solaris
686 SunOS solaris i86pc-solaris
687 SunOS4 sunos sun4-sunos
e41182b5 688
b7df3edc 689Because the value of C<$Config{archname}> may depend on the
690hardware architecture, it can vary more than the value of C<$^O>.
6ab3f9cb 691
e41182b5 692=head2 DOS and Derivatives
693
b7df3edc 694Perl has long been ported to Intel-style microcomputers running under
e41182b5 695systems like PC-DOS, MS-DOS, OS/2, and most Windows platforms you can
696bring yourself to mention (except for Windows CE, if you count that).
b7df3edc 697Users familiar with I<COMMAND.COM> or I<CMD.EXE> style shells should
e41182b5 698be aware that each of these file specifications may have subtle
699differences:
700
701 $filespec0 = "c:/foo/bar/file.txt";
702 $filespec1 = "c:\\foo\\bar\\file.txt";
703 $filespec2 = 'c:\foo\bar\file.txt';
704 $filespec3 = 'c:\\foo\\bar\\file.txt';
705
b7df3edc 706System calls accept either C</> or C<\> as the path separator.
707However, many command-line utilities of DOS vintage treat C</> as
708the option prefix, so may get confused by filenames containing C</>.
709Aside from calling any external programs, C</> will work just fine,
710and probably better, as it is more consistent with popular usage,
711and avoids the problem of remembering what to backwhack and what
712not to.
e41182b5 713
b7df3edc 714The DOS FAT filesystem can accommodate only "8.3" style filenames. Under
715the "case-insensitive, but case-preserving" HPFS (OS/2) and NTFS (NT)
0a47030a 716filesystems you may have to be careful about case returned with functions
e41182b5 717like C<readdir> or used with functions like C<open> or C<opendir>.
718
b7df3edc 719DOS also treats several filenames as special, such as AUX, PRN,
720NUL, CON, COM1, LPT1, LPT2, etc. Unfortunately, sometimes these
721filenames won't even work if you include an explicit directory
722prefix. It is best to avoid such filenames, if you want your code
723to be portable to DOS and its derivatives. It's hard to know what
724these all are, unfortunately.
e41182b5 725
726Users of these operating systems may also wish to make use of
b7df3edc 727scripts such as I<pl2bat.bat> or I<pl2cmd> to
e41182b5 728put wrappers around your scripts.
729
730Newline (C<\n>) is translated as C<\015\012> by STDIO when reading from
6ab3f9cb 731and writing to files (see L<"Newlines">). C<binmode(FILEHANDLE)>
732will keep C<\n> translated as C<\012> for that filehandle. Since it is a
733no-op on other systems, C<binmode> should be used for cross-platform code
b7df3edc 734that deals with binary data. That's assuming you realize in advance
735that your data is in binary. General-purpose programs should
736often assume nothing about their data.
e41182b5 737
b7df3edc 738The C<$^O> variable and the C<$Config{archname}> values for various
e41182b5 739DOSish perls are as follows:
740
67ac489e 741 OS $^O $Config{archname} ID Version
742 --------------------------------------------------------
743 MS-DOS dos ?
744 PC-DOS dos ?
745 OS/2 os2 ?
746 Windows 3.1 ? ? 0 3 01
747 Windows 95 MSWin32 MSWin32-x86 1 4 00
748 Windows 98 MSWin32 MSWin32-x86 1 4 10
749 Windows ME MSWin32 MSWin32-x86 1 ?
750 Windows NT MSWin32 MSWin32-x86 2 4 xx
751 Windows NT MSWin32 MSWin32-ALPHA 2 4 xx
752 Windows NT MSWin32 MSWin32-ppc 2 4 xx
753 Windows 2000 MSWin32 MSWin32-x86 2 5 xx
754 Windows XP MSWin32 MSWin32-x86 2 ?
755 Windows CE MSWin32 ? 3
756 Cygwin cygwin ?
e41182b5 757
34aaaa84 758The various MSWin32 Perl's can distinguish the OS they are running on
759via the value of the fifth element of the list returned from
760Win32::GetOSVersion(). For example:
761
762 if ($^O eq 'MSWin32') {
763 my @os_version_info = Win32::GetOSVersion();
764 print +('3.1','95','NT')[$os_version_info[4]],"\n";
765 }
766
e41182b5 767Also see:
768
769=over 4
770
c997b287 771=item *
e41182b5 772
c997b287 773The djgpp environment for DOS, http://www.delorie.com/djgpp/
774and L<perldos>.
e41182b5 775
c997b287 776=item *
e41182b5 777
c997b287 778The EMX environment for DOS, OS/2, etc. emx@iaehv.nl,
779http://www.leo.org/pub/comp/os/os2/leo/gnu/emx+gcc/index.html or
780ftp://hobbes.nmsu.edu/pub/os2/dev/emx. Also L<perlos2>.
e41182b5 781
c997b287 782=item *
d1e3b762 783
c997b287 784Build instructions for Win32 in L<perlwin32>, or under the Cygnus environment
785in L<perlcygwin>.
786
787=item *
788
789The C<Win32::*> modules in L<Win32>.
790
791=item *
792
793The ActiveState Pages, http://www.activestate.com/
794
795=item *
796
797The Cygwin environment for Win32; F<README.cygwin> (installed
47dafe4d 798as L<perlcygwin>), http://www.cygwin.com/
c997b287 799
800=item *
801
802The U/WIN environment for Win32,
cea6626f 803http://www.research.att.com/sw/tools/uwin/
c997b287 804
cea6626f 805=item *
d1e3b762 806
cea6626f 807Build instructions for OS/2, L<perlos2>
d1e3b762 808
e41182b5 809=back
810
dd9f0070 811=head2 S<Mac OS>
e41182b5 812
813Any module requiring XS compilation is right out for most people, because
814MacPerl is built using non-free (and non-cheap!) compilers. Some XS
815modules that can work with MacPerl are built and distributed in binary
6ab3f9cb 816form on CPAN.
e41182b5 817
818Directories are specified as:
819
820 volume:folder:file for absolute pathnames
821 volume:folder: for absolute pathnames
822 :folder:file for relative pathnames
823 :folder: for relative pathnames
824 :file for relative pathnames
825 file for relative pathnames
826
b7df3edc 827Files are stored in the directory in alphabetical order. Filenames are
6ab3f9cb 828limited to 31 characters, and may include any character except for
b7df3edc 829null and C<:>, which is reserved as the path separator.
e41182b5 830
0a47030a 831Instead of C<flock>, see C<FSpSetFLock> and C<FSpRstFLock> in the
6ab3f9cb 832Mac::Files module, or C<chmod(0444, ...)> and C<chmod(0666, ...)>.
e41182b5 833
834In the MacPerl application, you can't run a program from the command line;
835programs that expect C<@ARGV> to be populated can be edited with something
836like the following, which brings up a dialog box asking for the command
837line arguments.
838
839 if (!@ARGV) {
840 @ARGV = split /\s+/, MacPerl::Ask('Arguments?');
841 }
842
b7df3edc 843A MacPerl script saved as a "droplet" will populate C<@ARGV> with the full
e41182b5 844pathnames of the files dropped onto the script.
845
b7df3edc 846Mac users can run programs under a type of command line interface
847under MPW (Macintosh Programmer's Workshop, a free development
848environment from Apple). MacPerl was first introduced as an MPW
849tool, and MPW can be used like a shell:
e41182b5 850
851 perl myscript.plx some arguments
852
853ToolServer is another app from Apple that provides access to MPW tools
0a47030a 854from MPW and the MacPerl app, which allows MacPerl programs to use
e41182b5 855C<system>, backticks, and piped C<open>.
856
857"S<Mac OS>" is the proper name for the operating system, but the value
858in C<$^O> is "MacOS". To determine architecture, version, or whether
859the application or MPW tool version is running, check:
860
861 $is_app = $MacPerl::Version =~ /App/;
862 $is_tool = $MacPerl::Version =~ /MPW/;
863 ($version) = $MacPerl::Version =~ /^(\S+)/;
864 $is_ppc = $MacPerl::Architecture eq 'MacPPC';
865 $is_68k = $MacPerl::Architecture eq 'Mac68K';
866
b787fad4 867S<Mac OS X>, based on NeXT's OpenStep OS, runs MacPerl natively, under the
868"Classic" environment. There is no "Carbon" version of MacPerl to run
869under the primary Mac OS X environment. S<Mac OS X> and its Open Source
870version, Darwin, both run Unix perl natively.
6ab3f9cb 871
e41182b5 872Also see:
873
874=over 4
875
c997b287 876=item *
877
862b5365 878MacPerl Development, http://dev.macperl.org/ .
c997b287 879
880=item *
881
862b5365 882The MacPerl Pages, http://www.macperl.com/ .
e41182b5 883
c997b287 884=item *
6ab3f9cb 885
862b5365 886The MacPerl mailing lists, http://lists.perl.org/ .
e41182b5 887
888=back
889
e41182b5 890=head2 VMS
891
c997b287 892Perl on VMS is discussed in L<perlvms> in the perl distribution.
b7df3edc 893Perl on VMS can accept either VMS- or Unix-style file
e41182b5 894specifications as in either of the following:
895
896 $ perl -ne "print if /perl_setup/i" SYS$LOGIN:LOGIN.COM
897 $ perl -ne "print if /perl_setup/i" /sys$login/login.com
898
899but not a mixture of both as in:
900
901 $ perl -ne "print if /perl_setup/i" sys$login:/login.com
902 Can't open sys$login:/login.com: file specification syntax error
903
904Interacting with Perl from the Digital Command Language (DCL) shell
905often requires a different set of quotation marks than Unix shells do.
906For example:
907
908 $ perl -e "print ""Hello, world.\n"""
909 Hello, world.
910
b7df3edc 911There are several ways to wrap your perl scripts in DCL F<.COM> files, if
e41182b5 912you are so inclined. For example:
913
914 $ write sys$output "Hello from DCL!"
915 $ if p1 .eqs. ""
916 $ then perl -x 'f$environment("PROCEDURE")
917 $ else perl -x - 'p1 'p2 'p3 'p4 'p5 'p6 'p7 'p8
918 $ deck/dollars="__END__"
919 #!/usr/bin/perl
920
921 print "Hello from Perl!\n";
922
923 __END__
924 $ endif
925
926Do take care with C<$ ASSIGN/nolog/user SYS$COMMAND: SYS$INPUT> if your
c47ff5f1 927perl-in-DCL script expects to do things like C<< $read = <STDIN>; >>.
e41182b5 928
929Filenames are in the format "name.extension;version". The maximum
930length for filenames is 39 characters, and the maximum length for
931extensions is also 39 characters. Version is a number from 1 to
93232767. Valid characters are C</[A-Z0-9$_-]/>.
933
b7df3edc 934VMS's RMS filesystem is case-insensitive and does not preserve case.
e41182b5 935C<readdir> returns lowercased filenames, but specifying a file for
b7df3edc 936opening remains case-insensitive. Files without extensions have a
e41182b5 937trailing period on them, so doing a C<readdir> with a file named F<A.;5>
0a47030a 938will return F<a.> (though that file could be opened with
939C<open(FH, 'A')>).
e41182b5 940
f34d0673 941RMS had an eight level limit on directory depths from any rooted logical
dd9f0070 942(allowing 16 levels overall) prior to VMS 7.2. Hence
943C<PERL_ROOT:[LIB.2.3.4.5.6.7.8]> is a valid directory specification but
944C<PERL_ROOT:[LIB.2.3.4.5.6.7.8.9]> is not. F<Makefile.PL> authors might
945have to take this into account, but at least they can refer to the former
f34d0673 946as C</PERL_ROOT/lib/2/3/4/5/6/7/8/>.
e41182b5 947
6ab3f9cb 948The VMS::Filespec module, which gets installed as part of the build
0a47030a 949process on VMS, is a pure Perl module that can easily be installed on
950non-VMS platforms and can be helpful for conversions to and from RMS
951native formats.
e41182b5 952
5e12dbfa 953What C<\n> represents depends on the type of file opened. It usually
954represents C<\012> but it could also be C<\015>, C<\012>, C<\015\012>,
955C<\000>, C<\040>, or nothing depending on the file organiztion and
956record format. The VMS::Stdio module provides access to the
957special fopen() requirements of files with unusual attributes on VMS.
e41182b5 958
959TCP/IP stacks are optional on VMS, so socket routines might not be
960implemented. UDP sockets may not be supported.
961
962The value of C<$^O> on OpenVMS is "VMS". To determine the architecture
963that you are running on without resorting to loading all of C<%Config>
964you can examine the content of the C<@INC> array like so:
965
966 if (grep(/VMS_AXP/, @INC)) {
967 print "I'm on Alpha!\n";
6ab3f9cb 968
e41182b5 969 } elsif (grep(/VMS_VAX/, @INC)) {
970 print "I'm on VAX!\n";
6ab3f9cb 971
e41182b5 972 } else {
973 print "I'm not so sure about where $^O is...\n";
974 }
975
b7df3edc 976On VMS, perl determines the UTC offset from the C<SYS$TIMEZONE_DIFFERENTIAL>
977logical name. Although the VMS epoch began at 17-NOV-1858 00:00:00.00,
6ab3f9cb 978calls to C<localtime> are adjusted to count offsets from
b7df3edc 97901-JAN-1970 00:00:00.00, just like Unix.
6ab3f9cb 980
e41182b5 981Also see:
982
983=over 4
984
c997b287 985=item *
986
987F<README.vms> (installed as L<README_vms>), L<perlvms>
988
989=item *
990
991vmsperl list, majordomo@perl.org
e41182b5 992
c997b287 993(Put the words C<subscribe vmsperl> in message body.)
e41182b5 994
c997b287 995=item *
e41182b5 996
c997b287 997vmsperl on the web, http://www.sidhe.org/vmsperl/index.html
e41182b5 998
999=back
1000
495c5fdc 1001=head2 VOS
1002
9a997319 1003Perl on VOS is discussed in F<README.vos> in the perl distribution
1004(installed as L<perlvos>). Perl on VOS can accept either VOS- or
1005Unix-style file specifications as in either of the following:
495c5fdc 1006
1007 $ perl -ne "print if /perl_setup/i" >system>notices
1008 $ perl -ne "print if /perl_setup/i" /system/notices
1009
1010or even a mixture of both as in:
1011
1012 $ perl -ne "print if /perl_setup/i" >system/notices
1013
b7df3edc 1014Even though VOS allows the slash character to appear in object
495c5fdc 1015names, because the VOS port of Perl interprets it as a pathname
1016delimiting character, VOS files, directories, or links whose names
1017contain a slash character cannot be processed. Such files must be
a3dfe201 1018renamed before they can be processed by Perl. Note that VOS limits
1019file names to 32 or fewer characters.
495c5fdc 1020
5b8c1387 1021See F<README.vos> for restrictions that apply when Perl is built
1022with the alpha version of VOS POSIX.1 support.
1023
1024Perl on VOS is built without any extensions and does not support
1025dynamic loading.
495c5fdc 1026
1027The value of C<$^O> on VOS is "VOS". To determine the architecture that
1028you are running on without resorting to loading all of C<%Config> you
c997b287 1029can examine the content of the @INC array like so:
495c5fdc 1030
24e8e380 1031 if ($^O =~ /VOS/) {
495c5fdc 1032 print "I'm on a Stratus box!\n";
1033 } else {
1034 print "I'm not on a Stratus box!\n";
1035 die;
1036 }
1037
1038 if (grep(/860/, @INC)) {
1039 print "This box is a Stratus XA/R!\n";
6ab3f9cb 1040
495c5fdc 1041 } elsif (grep(/7100/, @INC)) {
24e8e380 1042 print "This box is a Stratus HP 7100 or 8xxx!\n";
6ab3f9cb 1043
495c5fdc 1044 } elsif (grep(/8000/, @INC)) {
24e8e380 1045 print "This box is a Stratus HP 8xxx!\n";
6ab3f9cb 1046
495c5fdc 1047 } else {
24e8e380 1048 print "This box is a Stratus 68K!\n";
495c5fdc 1049 }
1050
1051Also see:
1052
1053=over 4
1054
c997b287 1055=item *
495c5fdc 1056
c997b287 1057F<README.vos>
1058
1059=item *
1060
1061The VOS mailing list.
495c5fdc 1062
1063There is no specific mailing list for Perl on VOS. You can post
1064comments to the comp.sys.stratus newsgroup, or subscribe to the general
1065Stratus mailing list. Send a letter with "Subscribe Info-Stratus" in
1066the message body to majordomo@list.stratagy.com.
1067
c997b287 1068=item *
1069
1070VOS Perl on the web at http://ftp.stratus.com/pub/vos/vos.html
495c5fdc 1071
1072=back
1073
e41182b5 1074=head2 EBCDIC Platforms
1075
1076Recent versions of Perl have been ported to platforms such as OS/400 on
d1e3b762 1077AS/400 minicomputers as well as OS/390, VM/ESA, and BS2000 for S/390
1078Mainframes. Such computers use EBCDIC character sets internally (usually
0cc436d0 1079Character Code Set ID 0037 for OS/400 and either 1047 or POSIX-BC for S/390
1080systems). On the mainframe perl currently works under the "Unix system
1081services for OS/390" (formerly known as OpenEdition), VM/ESA OpenEdition, or
1082the BS200 POSIX-BC system (BS2000 is supported in perl 5.6 and greater).
c997b287 1083See L<perlos390> for details.
e41182b5 1084
7c5ffed3 1085As of R2.5 of USS for OS/390 and Version 2.3 of VM/ESA these Unix
1086sub-systems do not support the C<#!> shebang trick for script invocation.
1087Hence, on OS/390 and VM/ESA perl scripts can be executed with a header
1088similar to the following simple script:
e41182b5 1089
1090 : # use perl
1091 eval 'exec /usr/local/bin/perl -S $0 ${1+"$@"}'
1092 if 0;
1093 #!/usr/local/bin/perl # just a comment really
1094
1095 print "Hello from perl!\n";
1096
d1e3b762 1097OS/390 will support the C<#!> shebang trick in release 2.8 and beyond.
1098Calls to C<system> and backticks can use POSIX shell syntax on all
1099S/390 systems.
1100
b7df3edc 1101On the AS/400, if PERL5 is in your library list, you may need
6ab3f9cb 1102to wrap your perl scripts in a CL procedure to invoke them like so:
1103
1104 BEGIN
1105 CALL PGM(PERL5/PERL) PARM('/QOpenSys/hello.pl')
1106 ENDPGM
1107
1108This will invoke the perl script F<hello.pl> in the root of the
1109QOpenSys file system. On the AS/400 calls to C<system> or backticks
1110must use CL syntax.
1111
e41182b5 1112On these platforms, bear in mind that the EBCDIC character set may have
0a47030a 1113an effect on what happens with some perl functions (such as C<chr>,
1114C<pack>, C<print>, C<printf>, C<ord>, C<sort>, C<sprintf>, C<unpack>), as
1115well as bit-fiddling with ASCII constants using operators like C<^>, C<&>
1116and C<|>, not to mention dealing with socket interfaces to ASCII computers
6ab3f9cb 1117(see L<"Newlines">).
e41182b5 1118
b7df3edc 1119Fortunately, most web servers for the mainframe will correctly
1120translate the C<\n> in the following statement to its ASCII equivalent
1121(C<\r> is the same under both Unix and OS/390 & VM/ESA):
e41182b5 1122
1123 print "Content-type: text/html\r\n\r\n";
1124
d1e3b762 1125The values of C<$^O> on some of these platforms includes:
e41182b5 1126
d1e3b762 1127 uname $^O $Config{'archname'}
1128 --------------------------------------------
1129 OS/390 os390 os390
1130 OS400 os400 os400
1131 POSIX-BC posix-bc BS2000-posix-bc
1132 VM/ESA vmesa vmesa
3c075c7d 1133
e41182b5 1134Some simple tricks for determining if you are running on an EBCDIC
1135platform could include any of the following (perhaps all):
1136
1137 if ("\t" eq "\05") { print "EBCDIC may be spoken here!\n"; }
1138
1139 if (ord('A') == 193) { print "EBCDIC may be spoken here!\n"; }
1140
1141 if (chr(169) eq 'z') { print "EBCDIC may be spoken here!\n"; }
1142
b7df3edc 1143One thing you may not want to rely on is the EBCDIC encoding
0a47030a 1144of punctuation characters since these may differ from code page to code
1145page (and once your module or script is rumoured to work with EBCDIC,
1146folks will want it to work with all EBCDIC character sets).
e41182b5 1147
1148Also see:
1149
1150=over 4
1151
c997b287 1152=item *
1153
1154*
d1e3b762 1155
dc5c060f 1156L<perlos390>, F<README.os390>, F<perlbs2000>, F<README.vmesa>,
bb462878 1157L<perlebcdic>.
c997b287 1158
1159=item *
e41182b5 1160
1161The perl-mvs@perl.org list is for discussion of porting issues as well as
1162general usage issues for all EBCDIC Perls. Send a message body of
1163"subscribe perl-mvs" to majordomo@perl.org.
1164
c997b287 1165=item *
1166
1167AS/400 Perl information at
b1866b2d 1168http://as400.rochester.ibm.com/
d1e3b762 1169as well as on CPAN in the F<ports/> directory.
e41182b5 1170
1171=back
1172
b8099c3d 1173=head2 Acorn RISC OS
1174
b7df3edc 1175Because Acorns use ASCII with newlines (C<\n>) in text files as C<\012> like
1176Unix, and because Unix filename emulation is turned on by default,
1177most simple scripts will probably work "out of the box". The native
6ab3f9cb 1178filesystem is modular, and individual filesystems are free to be
0a47030a 1179case-sensitive or insensitive, and are usually case-preserving. Some
b7df3edc 1180native filesystems have name length limits, which file and directory
6ab3f9cb 1181names are silently truncated to fit. Scripts should be aware that the
1182standard filesystem currently has a name length limit of B<10>
1183characters, with up to 77 items in a directory, but other filesystems
0a47030a 1184may not impose such limitations.
b8099c3d 1185
1186Native filenames are of the form
1187
6ab3f9cb 1188 Filesystem#Special_Field::DiskName.$.Directory.Directory.File
dd9f0070 1189
b8099c3d 1190where
1191
1192 Special_Field is not usually present, but may contain . and $ .
1193 Filesystem =~ m|[A-Za-z0-9_]|
1194 DsicName =~ m|[A-Za-z0-9_/]|
1195 $ represents the root directory
1196 . is the path separator
1197 @ is the current directory (per filesystem but machine global)
1198 ^ is the parent directory
1199 Directory and File =~ m|[^\0- "\.\$\%\&:\@\\^\|\177]+|
1200
1201The default filename translation is roughly C<tr|/.|./|;>
1202
6ab3f9cb 1203Note that C<"ADFS::HardDisk.$.File" ne 'ADFS::HardDisk.$.File'> and that
0a47030a 1204the second stage of C<$> interpolation in regular expressions will fall
1205foul of the C<$.> if scripts are not careful.
1206
1207Logical paths specified by system variables containing comma-separated
b7df3edc 1208search lists are also allowed; hence C<System:Modules> is a valid
0a47030a 1209filename, and the filesystem will prefix C<Modules> with each section of
6ab3f9cb 1210C<System$Path> until a name is made that points to an object on disk.
b7df3edc 1211Writing to a new file C<System:Modules> would be allowed only if
0a47030a 1212C<System$Path> contains a single item list. The filesystem will also
1213expand system variables in filenames if enclosed in angle brackets, so
c47ff5f1 1214C<< <System$Dir>.Modules >> would look for the file
0a47030a 1215S<C<$ENV{'System$Dir'} . 'Modules'>>. The obvious implication of this is
c47ff5f1 1216that B<fully qualified filenames can start with C<< <> >>> and should
0a47030a 1217be protected when C<open> is used for input.
b8099c3d 1218
1219Because C<.> was in use as a directory separator and filenames could not
1220be assumed to be unique after 10 characters, Acorn implemented the C
1221compiler to strip the trailing C<.c> C<.h> C<.s> and C<.o> suffix from
1222filenames specified in source code and store the respective files in
b7df3edc 1223subdirectories named after the suffix. Hence files are translated:
b8099c3d 1224
1225 foo.h h.foo
1226 C:foo.h C:h.foo (logical path variable)
1227 sys/os.h sys.h.os (C compiler groks Unix-speak)
1228 10charname.c c.10charname
1229 10charname.o o.10charname
1230 11charname_.c c.11charname (assuming filesystem truncates at 10)
1231
1232The Unix emulation library's translation of filenames to native assumes
b7df3edc 1233that this sort of translation is required, and it allows a user-defined list
1234of known suffixes that it will transpose in this fashion. This may
1235seem transparent, but consider that with these rules C<foo/bar/baz.h>
0a47030a 1236and C<foo/bar/h/baz> both map to C<foo.bar.h.baz>, and that C<readdir> and
1237C<glob> cannot and do not attempt to emulate the reverse mapping. Other
6ab3f9cb 1238C<.>'s in filenames are translated to C</>.
0a47030a 1239
b7df3edc 1240As implied above, the environment accessed through C<%ENV> is global, and
0a47030a 1241the convention is that program specific environment variables are of the
6ab3f9cb 1242form C<Program$Name>. Each filesystem maintains a current directory,
1243and the current filesystem's current directory is the B<global> current
b7df3edc 1244directory. Consequently, sociable programs don't change the current
1245directory but rely on full pathnames, and programs (and Makefiles) cannot
0a47030a 1246assume that they can spawn a child process which can change the current
1247directory without affecting its parent (and everyone else for that
1248matter).
1249
b7df3edc 1250Because native operating system filehandles are global and are currently
1251allocated down from 255, with 0 being a reserved value, the Unix emulation
0a47030a 1252library emulates Unix filehandles. Consequently, you can't rely on
1253passing C<STDIN>, C<STDOUT>, or C<STDERR> to your children.
1254
1255The desire of users to express filenames of the form
c47ff5f1 1256C<< <Foo$Dir>.Bar >> on the command line unquoted causes problems,
0a47030a 1257too: C<``> command output capture has to perform a guessing game. It
c47ff5f1 1258assumes that a string C<< <[^<>]+\$[^<>]> >> is a
0a47030a 1259reference to an environment variable, whereas anything else involving
c47ff5f1 1260C<< < >> or C<< > >> is redirection, and generally manages to be 99%
0a47030a 1261right. Of course, the problem remains that scripts cannot rely on any
1262Unix tools being available, or that any tools found have Unix-like command
1263line arguments.
1264
b7df3edc 1265Extensions and XS are, in theory, buildable by anyone using free
1266tools. In practice, many don't, as users of the Acorn platform are
1267used to binary distributions. MakeMaker does run, but no available
1268make currently copes with MakeMaker's makefiles; even if and when
1269this should be fixed, the lack of a Unix-like shell will cause
1270problems with makefile rules, especially lines of the form C<cd
1271sdbm && make all>, and anything using quoting.
b8099c3d 1272
1273"S<RISC OS>" is the proper name for the operating system, but the value
1274in C<$^O> is "riscos" (because we don't like shouting).
1275
e41182b5 1276=head2 Other perls
1277
b7df3edc 1278Perl has been ported to many platforms that do not fit into any of
1279the categories listed above. Some, such as AmigaOS, Atari MiNT,
1280BeOS, HP MPE/iX, QNX, Plan 9, and VOS, have been well-integrated
1281into the standard Perl source code kit. You may need to see the
1282F<ports/> directory on CPAN for information, and possibly binaries,
1283for the likes of: aos, Atari ST, lynxos, riscos, Novell Netware,
1284Tandem Guardian, I<etc.> (Yes, we know that some of these OSes may
1285fall under the Unix category, but we are not a standards body.)
e41182b5 1286
d1e3b762 1287Some approximate operating system names and their C<$^O> values
1288in the "OTHER" category include:
1289
1290 OS $^O $Config{'archname'}
1291 ------------------------------------------
1292 Amiga DOS amigaos m68k-amigos
1293 MPE/iX mpeix PA-RISC1.1
1294
e41182b5 1295See also:
1296
1297=over 4
1298
c997b287 1299=item *
1300
1301Amiga, F<README.amiga> (installed as L<perlamiga>).
1302
1303=item *
d1e3b762 1304
c997b287 1305Atari, F<README.mint> and Guido Flohr's web page
1306http://stud.uni-sb.de/~gufl0000/
e41182b5 1307
c997b287 1308=item *
d1e3b762 1309
c997b287 1310Be OS, F<README.beos>
e41182b5 1311
c997b287 1312=item *
1313
1314HP 300 MPE/iX, F<README.mpeix> and Mark Bixby's web page
34aaaa84 1315http://www.bixby.org/mark/perlix.html
c997b287 1316
1317=item *
e41182b5 1318
6ab3f9cb 1319A free perl5-based PERL.NLM for Novell Netware is available in
c997b287 1320precompiled binary and source code form from http://www.novell.com/
6ab3f9cb 1321as well as from CPAN.
e41182b5 1322
13a2d996 1323=item *
c997b287 1324
1325Plan 9, F<README.plan9>
d1e3b762 1326
e41182b5 1327=back
1328
e41182b5 1329=head1 FUNCTION IMPLEMENTATIONS
1330
b7df3edc 1331Listed below are functions that are either completely unimplemented
1332or else have been implemented differently on various platforms.
1333Following each description will be, in parentheses, a list of
1334platforms that the description applies to.
e41182b5 1335
b7df3edc 1336The list may well be incomplete, or even wrong in some places. When
1337in doubt, consult the platform-specific README files in the Perl
1338source distribution, and any other documentation resources accompanying
1339a given port.
e41182b5 1340
0a47030a 1341Be aware, moreover, that even among Unix-ish systems there are variations.
e41182b5 1342
b7df3edc 1343For many functions, you can also query C<%Config>, exported by
1344default from the Config module. For example, to check whether the
1345platform has the C<lstat> call, check C<$Config{d_lstat}>. See
1346L<Config> for a full description of available variables.
e41182b5 1347
1348=head2 Alphabetical Listing of Perl Functions
1349
1350=over 8
1351
1352=item -X FILEHANDLE
1353
1354=item -X EXPR
1355
1356=item -X
1357
b7df3edc 1358C<-r>, C<-w>, and C<-x> have a limited meaning only; directories
e41182b5 1359and applications are executable, and there are no uid/gid
b7df3edc 1360considerations. C<-o> is not supported. (S<Mac OS>)
e41182b5 1361
b7df3edc 1362C<-r>, C<-w>, C<-x>, and C<-o> tell whether the file is accessible,
1363which may not reflect UIC-based file protections. (VMS)
e41182b5 1364
b8099c3d 1365C<-s> returns the size of the data fork, not the total size of data fork
1366plus resource fork. (S<Mac OS>).
1367
1368C<-s> by name on an open file will return the space reserved on disk,
1369rather than the current extent. C<-s> on an open filehandle returns the
b7df3edc 1370current size. (S<RISC OS>)
b8099c3d 1371
e41182b5 1372C<-R>, C<-W>, C<-X>, C<-O> are indistinguishable from C<-r>, C<-w>,
b8099c3d 1373C<-x>, C<-o>. (S<Mac OS>, Win32, VMS, S<RISC OS>)
e41182b5 1374
1375C<-b>, C<-c>, C<-k>, C<-g>, C<-p>, C<-u>, C<-A> are not implemented.
1376(S<Mac OS>)
1377
1378C<-g>, C<-k>, C<-l>, C<-p>, C<-u>, C<-A> are not particularly meaningful.
b8099c3d 1379(Win32, VMS, S<RISC OS>)
e41182b5 1380
1381C<-d> is true if passed a device spec without an explicit directory.
1382(VMS)
1383
1384C<-T> and C<-B> are implemented, but might misclassify Mac text files
0a47030a 1385with foreign characters; this is the case will all platforms, but may
b7df3edc 1386affect S<Mac OS> often. (S<Mac OS>)
e41182b5 1387
1388C<-x> (or C<-X>) determine if a file ends in one of the executable
b7df3edc 1389suffixes. C<-S> is meaningless. (Win32)
e41182b5 1390
b8099c3d 1391C<-x> (or C<-X>) determine if a file has an executable file type.
1392(S<RISC OS>)
1393
63f87e49 1394=item alarm SECONDS
1395
1396=item alarm
1397
1398Not implemented. (Win32)
1399
e41182b5 1400=item binmode FILEHANDLE
1401
b7df3edc 1402Meaningless. (S<Mac OS>, S<RISC OS>)
e41182b5 1403
1404Reopens file and restores pointer; if function fails, underlying
1405filehandle may be closed, or pointer may be in a different position.
1406(VMS)
1407
1408The value returned by C<tell> may be affected after the call, and
1409the filehandle may be flushed. (Win32)
1410
1411=item chmod LIST
1412
b7df3edc 1413Only limited meaning. Disabling/enabling write permission is mapped to
e41182b5 1414locking/unlocking the file. (S<Mac OS>)
1415
1416Only good for changing "owner" read-write access, "group", and "other"
1417bits are meaningless. (Win32)
1418
b8099c3d 1419Only good for changing "owner" and "other" read-write access. (S<RISC OS>)
1420
495c5fdc 1421Access permissions are mapped onto VOS access-control list changes. (VOS)
1422
e41182b5 1423=item chown LIST
1424
495c5fdc 1425Not implemented. (S<Mac OS>, Win32, Plan9, S<RISC OS>, VOS)
e41182b5 1426
1427Does nothing, but won't fail. (Win32)
1428
1429=item chroot FILENAME
1430
1431=item chroot
1432
7c5ffed3 1433Not implemented. (S<Mac OS>, Win32, VMS, Plan9, S<RISC OS>, VOS, VM/ESA)
e41182b5 1434
1435=item crypt PLAINTEXT,SALT
1436
1437May not be available if library or source was not provided when building
b8099c3d 1438perl. (Win32)
e41182b5 1439
495c5fdc 1440Not implemented. (VOS)
1441
e41182b5 1442=item dbmclose HASH
1443
495c5fdc 1444Not implemented. (VMS, Plan9, VOS)
e41182b5 1445
1446=item dbmopen HASH,DBNAME,MODE
1447
495c5fdc 1448Not implemented. (VMS, Plan9, VOS)
e41182b5 1449
1450=item dump LABEL
1451
b8099c3d 1452Not useful. (S<Mac OS>, S<RISC OS>)
e41182b5 1453
1454Not implemented. (Win32)
1455
b8099c3d 1456Invokes VMS debugger. (VMS)
e41182b5 1457
1458=item exec LIST
1459
1460Not implemented. (S<Mac OS>)
1461
7c5ffed3 1462Implemented via Spawn. (VM/ESA)
3c075c7d 1463
0f897271 1464Does not automatically flush output handles on some platforms.
1465(SunOS, Solaris, HP-UX)
1466
fe12c0e8 1467=item exit EXPR
1468
1469=item exit
1470
1471Emulates UNIX exit() (which considers C<exit 1> to indicate an error) by
1472mapping the C<1> to SS$_ABORT (C<44>). This behavior may be overridden
1473with the pragma C<use vmsish 'exit'>. As with the CRTL's exit()
1474function, C<exit 0> is also mapped to an exit status of SS$_NORMAL
1475(C<1>); this mapping cannot be overridden. Any other argument to exit()
1476is used directly as Perl's exit status. (VMS)
1477
e41182b5 1478=item fcntl FILEHANDLE,FUNCTION,SCALAR
1479
1480Not implemented. (Win32, VMS)
1481
1482=item flock FILEHANDLE,OPERATION
1483
495c5fdc 1484Not implemented (S<Mac OS>, VMS, S<RISC OS>, VOS).
e41182b5 1485
1486Available only on Windows NT (not on Windows 95). (Win32)
1487
1488=item fork
1489
0f897271 1490Not implemented. (S<Mac OS>, AmigaOS, S<RISC OS>, VOS, VM/ESA)
1491
1492Emulated using multiple interpreters. See L<perlfork>. (Win32)
1493
1494Does not automatically flush output handles on some platforms.
1495(SunOS, Solaris, HP-UX)
e41182b5 1496
1497=item getlogin
1498
b8099c3d 1499Not implemented. (S<Mac OS>, S<RISC OS>)
e41182b5 1500
1501=item getpgrp PID
1502
495c5fdc 1503Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS)
e41182b5 1504
1505=item getppid
1506
b8099c3d 1507Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>)
e41182b5 1508
1509=item getpriority WHICH,WHO
1510
7c5ffed3 1511Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS, VM/ESA)
e41182b5 1512
1513=item getpwnam NAME
1514
1515Not implemented. (S<Mac OS>, Win32)
1516
b8099c3d 1517Not useful. (S<RISC OS>)
1518
e41182b5 1519=item getgrnam NAME
1520
b8099c3d 1521Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>)
e41182b5 1522
1523=item getnetbyname NAME
1524
1525Not implemented. (S<Mac OS>, Win32, Plan9)
1526
1527=item getpwuid UID
1528
1529Not implemented. (S<Mac OS>, Win32)
1530
b8099c3d 1531Not useful. (S<RISC OS>)
1532
e41182b5 1533=item getgrgid GID
1534
b8099c3d 1535Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>)
e41182b5 1536
1537=item getnetbyaddr ADDR,ADDRTYPE
1538
1539Not implemented. (S<Mac OS>, Win32, Plan9)
1540
1541=item getprotobynumber NUMBER
1542
1543Not implemented. (S<Mac OS>)
1544
1545=item getservbyport PORT,PROTO
1546
1547Not implemented. (S<Mac OS>)
1548
1549=item getpwent
1550
7c5ffed3 1551Not implemented. (S<Mac OS>, Win32, VM/ESA)
e41182b5 1552
1553=item getgrent
1554
7c5ffed3 1555Not implemented. (S<Mac OS>, Win32, VMS, VM/ESA)
e41182b5 1556
1557=item gethostent
1558
1559Not implemented. (S<Mac OS>, Win32)
1560
1561=item getnetent
1562
1563Not implemented. (S<Mac OS>, Win32, Plan9)
1564
1565=item getprotoent
1566
1567Not implemented. (S<Mac OS>, Win32, Plan9)
1568
1569=item getservent
1570
1571Not implemented. (Win32, Plan9)
1572
1573=item setpwent
1574
b8099c3d 1575Not implemented. (S<Mac OS>, Win32, S<RISC OS>)
e41182b5 1576
1577=item setgrent
1578
b8099c3d 1579Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>)
e41182b5 1580
1581=item sethostent STAYOPEN
1582
b8099c3d 1583Not implemented. (S<Mac OS>, Win32, Plan9, S<RISC OS>)
e41182b5 1584
1585=item setnetent STAYOPEN
1586
b8099c3d 1587Not implemented. (S<Mac OS>, Win32, Plan9, S<RISC OS>)
e41182b5 1588
1589=item setprotoent STAYOPEN
1590
b8099c3d 1591Not implemented. (S<Mac OS>, Win32, Plan9, S<RISC OS>)
e41182b5 1592
1593=item setservent STAYOPEN
1594
b8099c3d 1595Not implemented. (Plan9, Win32, S<RISC OS>)
e41182b5 1596
1597=item endpwent
1598
a3dfe201 1599Not implemented. (S<Mac OS>, MPE/iX, VM/ESA, Win32)
e41182b5 1600
1601=item endgrent
1602
a3dfe201 1603Not implemented. (S<Mac OS>, MPE/iX, S<RISC OS>, VM/ESA, VMS, Win32)
e41182b5 1604
1605=item endhostent
1606
1607Not implemented. (S<Mac OS>, Win32)
1608
1609=item endnetent
1610
1611Not implemented. (S<Mac OS>, Win32, Plan9)
1612
1613=item endprotoent
1614
1615Not implemented. (S<Mac OS>, Win32, Plan9)
1616
1617=item endservent
1618
1619Not implemented. (Plan9, Win32)
1620
1621=item getsockopt SOCKET,LEVEL,OPTNAME
1622
97c10e77 1623Not implemented. (Plan9)
e41182b5 1624
1625=item glob EXPR
1626
1627=item glob
1628
63f87e49 1629This operator is implemented via the File::Glob extension on most
1630platforms. See L<File::Glob> for portability information.
b8099c3d 1631
e41182b5 1632=item ioctl FILEHANDLE,FUNCTION,SCALAR
1633
1634Not implemented. (VMS)
1635
1636Available only for socket handles, and it does what the ioctlsocket() call
1637in the Winsock API does. (Win32)
1638
b8099c3d 1639Available only for socket handles. (S<RISC OS>)
1640
b350dd2f 1641=item kill SIGNAL, LIST
e41182b5 1642
862b5365 1643C<kill(0, LIST)> is implemented for the sake of taint checking;
1644use with other signals is unimplemented. (S<Mac OS>)
1645
1646Not implemented, hence not useful for taint checking. (S<RISC OS>)
e41182b5 1647
63f87e49 1648C<kill()> doesn't have the semantics of C<raise()>, i.e. it doesn't send
1649a signal to the identified process like it does on Unix platforms.
1650Instead C<kill($sig, $pid)> terminates the process identified by $pid,
1651and makes it exit immediately with exit status $sig. As in Unix, if
1652$sig is 0 and the specified process exists, it returns true without
1653actually terminating it. (Win32)
e41182b5 1654
1655=item link OLDFILE,NEWFILE
1656
a3dfe201 1657Not implemented. (S<Mac OS>, MPE/iX, VMS, S<RISC OS>)
e41182b5 1658
433acd8a 1659Link count not updated because hard links are not quite that hard
1660(They are sort of half-way between hard and soft links). (AmigaOS)
1661
a3dfe201 1662Hard links are implemented on Win32 (Windows NT and Windows 2000)
1663under NTFS only.
1664
e41182b5 1665=item lstat FILEHANDLE
1666
1667=item lstat EXPR
1668
1669=item lstat
1670
b8099c3d 1671Not implemented. (VMS, S<RISC OS>)
e41182b5 1672
63f87e49 1673Return values (especially for device and inode) may be bogus. (Win32)
e41182b5 1674
1675=item msgctl ID,CMD,ARG
1676
1677=item msgget KEY,FLAGS
1678
1679=item msgsnd ID,MSG,FLAGS
1680
1681=item msgrcv ID,VAR,SIZE,TYPE,FLAGS
1682
495c5fdc 1683Not implemented. (S<Mac OS>, Win32, VMS, Plan9, S<RISC OS>, VOS)
e41182b5 1684
1685=item open FILEHANDLE,EXPR
1686
1687=item open FILEHANDLE
1688
b7df3edc 1689The C<|> variants are supported only if ToolServer is installed.
e41182b5 1690(S<Mac OS>)
1691
c47ff5f1 1692open to C<|-> and C<-|> are unsupported. (S<Mac OS>, Win32, S<RISC OS>)
e41182b5 1693
0f897271 1694Opening a process does not automatically flush output handles on some
1695platforms. (SunOS, Solaris, HP-UX)
1696
e41182b5 1697=item pipe READHANDLE,WRITEHANDLE
1698
433acd8a 1699Very limited functionality. (MiNT)
1700
e41182b5 1701=item readlink EXPR
1702
1703=item readlink
1704
b8099c3d 1705Not implemented. (Win32, VMS, S<RISC OS>)
e41182b5 1706
1707=item select RBITS,WBITS,EBITS,TIMEOUT
1708
689c5c24 1709Only implemented on sockets. (Win32, VMS)
e41182b5 1710
b8099c3d 1711Only reliable on sockets. (S<RISC OS>)
1712
76e05f0b 1713Note that the C<select FILEHANDLE> form is generally portable.
63f87e49 1714
e41182b5 1715=item semctl ID,SEMNUM,CMD,ARG
1716
1717=item semget KEY,NSEMS,FLAGS
1718
1719=item semop KEY,OPSTRING
1720
495c5fdc 1721Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS)
e41182b5 1722
a3dfe201 1723=item setgrent
1724
1725Not implemented. (MPE/iX, Win32)
1726
e41182b5 1727=item setpgrp PID,PGRP
1728
495c5fdc 1729Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS)
e41182b5 1730
1731=item setpriority WHICH,WHO,PRIORITY
1732
495c5fdc 1733Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS)
e41182b5 1734
a3dfe201 1735=item setpwent
1736
1737Not implemented. (MPE/iX, Win32)
1738
e41182b5 1739=item setsockopt SOCKET,LEVEL,OPTNAME,OPTVAL
1740
97c10e77 1741Not implemented. (Plan9)
e41182b5 1742
1743=item shmctl ID,CMD,ARG
1744
1745=item shmget KEY,SIZE,FLAGS
1746
1747=item shmread ID,VAR,POS,SIZE
1748
1749=item shmwrite ID,STRING,POS,SIZE
1750
495c5fdc 1751Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS)
e41182b5 1752
80cbd5ad 1753=item sockatmark SOCKET
1754
1755A relatively recent addition to socket functions, may not
1756be implemented even in UNIX platforms.
1757
e41182b5 1758=item socketpair SOCKET1,SOCKET2,DOMAIN,TYPE,PROTOCOL
1759
862b5365 1760Not implemented. (Win32, VMS, S<RISC OS>, VOS, VM/ESA)
e41182b5 1761
1762=item stat FILEHANDLE
1763
1764=item stat EXPR
1765
1766=item stat
1767
d62e1b7f 1768Platforms that do not have rdev, blksize, or blocks will return these
1769as '', so numeric comparison or manipulation of these fields may cause
1770'not numeric' warnings.
1771
e41182b5 1772mtime and atime are the same thing, and ctime is creation time instead of
1773inode change time. (S<Mac OS>)
1774
1775device and inode are not meaningful. (Win32)
1776
1777device and inode are not necessarily reliable. (VMS)
1778
b8099c3d 1779mtime, atime and ctime all return the last modification time. Device and
1780inode are not necessarily reliable. (S<RISC OS>)
1781
d62e1b7f 1782dev, rdev, blksize, and blocks are not available. inode is not
1783meaningful and will differ between stat calls on the same file. (os2)
1784
73e9292c 1785some versions of cygwin when doing a stat("foo") and if not finding it
1786may then attempt to stat("foo.exe") (Cygwin)
1787
e41182b5 1788=item symlink OLDFILE,NEWFILE
1789
b8099c3d 1790Not implemented. (Win32, VMS, S<RISC OS>)
e41182b5 1791
1792=item syscall LIST
1793
7c5ffed3 1794Not implemented. (S<Mac OS>, Win32, VMS, S<RISC OS>, VOS, VM/ESA)
e41182b5 1795
f34d0673 1796=item sysopen FILEHANDLE,FILENAME,MODE,PERMS
1797
dd9f0070 1798The traditional "0", "1", and "2" MODEs are implemented with different
322422de 1799numeric values on some systems. The flags exported by C<Fcntl>
1800(O_RDONLY, O_WRONLY, O_RDWR) should work everywhere though. (S<Mac
7c5ffed3 1801OS>, OS/390, VM/ESA)
f34d0673 1802
e41182b5 1803=item system LIST
1804
9d6eb86e 1805In general, do not assume the UNIX/POSIX semantics that you can shift
7717d0e7 1806C<$?> right by eight to get the exit value, or that C<$? & 127>
9d6eb86e 1807would give you the number of the signal that terminated the program,
1808or that C<$? & 128> would test true if the program was terminated by a
1809coredump. Instead, use the POSIX W*() interfaces: for example, use
1810WIFEXITED($?) an WEXITVALUE($?) to test for a normal exit and the exit
1811value, and WIFSIGNALED($?) and WTERMSIG($?) for a signal exit and the
7717d0e7 1812signal. Core dumping is not a portable concept, so there's no portable
9d6eb86e 1813way to test for that.
1814
e41182b5 1815Only implemented if ToolServer is installed. (S<Mac OS>)
1816
1817As an optimization, may not call the command shell specified in
b7df3edc 1818C<$ENV{PERL5SHELL}>. C<system(1, @args)> spawns an external
e41182b5 1819process and immediately returns its process designator, without
1820waiting for it to terminate. Return value may be used subsequently
63f87e49 1821in C<wait> or C<waitpid>. Failure to spawn() a subprocess is indicated
1822by setting $? to "255 << 8". C<$?> is set in a way compatible with
1823Unix (i.e. the exitstatus of the subprocess is obtained by "$? >> 8",
1824as described in the documentation). (Win32)
e41182b5 1825
b8099c3d 1826There is no shell to process metacharacters, and the native standard is
1827to pass a command line terminated by "\n" "\r" or "\0" to the spawned
c47ff5f1 1828program. Redirection such as C<< > foo >> is performed (if at all) by
b8099c3d 1829the run time library of the spawned program. C<system> I<list> will call
1830the Unix emulation library's C<exec> emulation, which attempts to provide
1831emulation of the stdin, stdout, stderr in force in the parent, providing
1832the child program uses a compatible version of the emulation library.
1833I<scalar> will call the native command line direct and no such emulation
1834of a child Unix program will exists. Mileage B<will> vary. (S<RISC OS>)
1835
433acd8a 1836Far from being POSIX compliant. Because there may be no underlying
1837/bin/sh tries to work around the problem by forking and execing the
9b63e9ec 1838first token in its argument string. Handles basic redirection
c47ff5f1 1839("<" or ">") on its own behalf. (MiNT)
433acd8a 1840
0f897271 1841Does not automatically flush output handles on some platforms.
1842(SunOS, Solaris, HP-UX)
1843
9bc98430 1844The return value is POSIX-like (shifted up by 8 bits), which only allows
1845room for a made-up value derived from the severity bits of the native
184632-bit condition code (unless overridden by C<use vmsish 'status'>).
1847For more details see L<perlvms/$?>. (VMS)
1848
e41182b5 1849=item times
1850
1851Only the first entry returned is nonzero. (S<Mac OS>)
1852
63f87e49 1853"cumulative" times will be bogus. On anything other than Windows NT
1854or Windows 2000, "system" time will be bogus, and "user" time is
1855actually the time returned by the clock() function in the C runtime
1856library. (Win32)
e41182b5 1857
b8099c3d 1858Not useful. (S<RISC OS>)
1859
e41182b5 1860=item truncate FILEHANDLE,LENGTH
1861
1862=item truncate EXPR,LENGTH
1863
6d738113 1864Not implemented. (Older versions of VMS)
e41182b5 1865
495c5fdc 1866Truncation to zero-length only. (VOS)
1867
4cfdb94f 1868If a FILEHANDLE is supplied, it must be writable and opened in append
e71a7dc8 1869mode (i.e., use C<<< open(FH, '>>filename') >>>
4cfdb94f 1870or C<sysopen(FH,...,O_APPEND|O_RDWR)>. If a filename is supplied, it
1871should not be held open elsewhere. (Win32)
1872
e41182b5 1873=item umask EXPR
1874
1875=item umask
1876
1877Returns undef where unavailable, as of version 5.005.
1878
b7df3edc 1879C<umask> works but the correct permissions are set only when the file
1880is finally closed. (AmigaOS)
433acd8a 1881
e41182b5 1882=item utime LIST
1883
15c65113 1884Only the modification time is updated. (S<BeOS>, S<Mac OS>, VMS, S<RISC OS>)
e41182b5 1885
322422de 1886May not behave as expected. Behavior depends on the C runtime
1887library's implementation of utime(), and the filesystem being
1888used. The FAT filesystem typically does not support an "access
1889time" field, and it may limit timestamps to a granularity of
1890two seconds. (Win32)
e41182b5 1891
1892=item wait
1893
1894=item waitpid PID,FLAGS
1895
495c5fdc 1896Not implemented. (S<Mac OS>, VOS)
e41182b5 1897
1898Can only be applied to process handles returned for processes spawned
a6f858fb 1899using C<system(1, ...)> or pseudo processes created with C<fork()>. (Win32)
e41182b5 1900
b8099c3d 1901Not useful. (S<RISC OS>)
1902
e41182b5 1903=back
1904
b8099c3d 1905=head1 CHANGES
1906
1907=over 4
1908
fd46a41b 1909=item v1.48, 02 February 2001
1910
1911Various updates from perl5-porters over the past year, supported
1912platforms update from Jarkko Hietaniemi.
1913
c997b287 1914=item v1.47, 22 March 2000
1915
1916Various cleanups from Tom Christiansen, including migration of
1917long platform listings from L<perl>.
1918
56d7751a 1919=item v1.46, 12 February 2000
1920
1921Updates for VOS and MPE/iX. (Peter Prymmer) Other small changes.
1922
0cc436d0 1923=item v1.45, 20 December 1999
1924
1925Small changes from 5.005_63 distribution, more changes to EBCDIC info.
1926
d1e3b762 1927=item v1.44, 19 July 1999
1928
1929A bunch of updates from Peter Prymmer for C<$^O> values,
1930endianness, File::Spec, VMS, BS2000, OS/400.
1931
b7df3edc 1932=item v1.43, 24 May 1999
1933
1934Added a lot of cleaning up from Tom Christiansen.
1935
19799a22 1936=item v1.42, 22 May 1999
b7df3edc 1937
19799a22 1938Added notes about tests, sprintf/printf, and epoch offsets.
b7df3edc 1939
6ab3f9cb 1940=item v1.41, 19 May 1999
1941
1942Lots more little changes to formatting and content.
1943
d1e3b762 1944Added a bunch of C<$^O> and related values
6ab3f9cb 1945for various platforms; fixed mail and web addresses, and added
1946and changed miscellaneous notes. (Peter Prymmer)
1947
1948=item v1.40, 11 April 1999
1949
1950Miscellaneous changes.
1951
1952=item v1.39, 11 February 1999
2ee0eb3c 1953
1954Changes from Jarkko and EMX URL fixes Michael Schwern. Additional
1955note about newlines added.
1956
9b63e9ec 1957=item v1.38, 31 December 1998
1958
1959More changes from Jarkko.
1960
3c075c7d 1961=item v1.37, 19 December 1998
1962
1963More minor changes. Merge two separate version 1.35 documents.
1964
1965=item v1.36, 9 September 1998
1966
1967Updated for Stratus VOS. Also known as version 1.35.
1968
1969=item v1.35, 13 August 1998
495c5fdc 1970
3c075c7d 1971Integrate more minor changes, plus addition of new sections under
1972L<"ISSUES">: L<"Numbers endianness and Width">,
1973L<"Character sets and character encoding">,
1974L<"Internationalisation">.
495c5fdc 1975
3c075c7d 1976=item v1.33, 06 August 1998
0a47030a 1977
1978Integrate more minor changes.
1979
3c075c7d 1980=item v1.32, 05 August 1998
dd9f0070 1981
1982Integrate more minor changes.
1983
3c075c7d 1984=item v1.30, 03 August 1998
b8099c3d 1985
1986Major update for RISC OS, other minor changes.
1987
3c075c7d 1988=item v1.23, 10 July 1998
b8099c3d 1989
1990First public release with perl5.005.
1991
1992=back
e41182b5 1993
ba58ab26 1994=head1 Supported Platforms
1995
9ca74005 1996As of early 2001 (the Perl releases 5.6.1 and 5.7.1), the following
1997platforms are able to build Perl from the standard source code
a93751fa 1998distribution available at http://www.cpan.org/src/index.html
ba58ab26 1999
2000 AIX
fd46a41b 2001 AmigaOS
b787fad4 2002 Darwin (Mac OS X)
fd46a41b 2003 DG/UX
ba58ab26 2004 DOS DJGPP 1)
fd46a41b 2005 DYNIX/ptx
6ba81f13 2006 EPOC
ba58ab26 2007 FreeBSD
2008 HP-UX
2009 IRIX
2010 Linux
ba58ab26 2011 MachTen
fd46a41b 2012 MacOS Classic 2)
2013 NonStop-UX
2014 ReliantUNIX (SINIX)
ba58ab26 2015 OpenBSD
fd46a41b 2016 OpenVMS (VMS)
ba58ab26 2017 OS/2
fd46a41b 2018 OS X
ba58ab26 2019 QNX
ba58ab26 2020 Solaris
fd46a41b 2021 Tru64 UNIX (DEC OSF/1, Digital UNIX)
ba58ab26 2022 UNICOS
2023 UNICOS/mk
ba58ab26 2024 VOS
fd46a41b 2025 Win32/NT/2K 3)
ba58ab26 2026
2027 1) in DOS mode either the DOS or OS/2 ports can be used
fd46a41b 2028 2) Mac OS Classic (pre-X) is almost 5.6.1-ready; building from
2029 the source does work with 5.6.1, but additional MacOS specific
862b5365 2030 source code is needed for a complete build. See the web
2031 site http://dev.macperl.org/ for more information.
fd46a41b 2032 3) compilers: Borland, Cygwin, Mingw32 EGCS/GCC, VC++
ba58ab26 2033
9ca74005 2034The following platforms worked for the previous releases (5.6.0 and 5.7.0),
2035but we did not manage to test these in time for the 5.7.1 release.
2036There is a very good chance that these will work fine with the 5.7.1.
ba58ab26 2037
ba58ab26 2038 DomainOS
2039 Hurd
fd46a41b 2040 LynxOS
2041 MinGW
2042 MPE/iX
2043 NetBSD
ba58ab26 2044 PowerMAX
fd46a41b 2045 SCO SV
ba58ab26 2046 SunOS
fd46a41b 2047 SVR4
2048 Unixware
2049 Windows 3.1
2050 Windows 95
2051 Windows 98
2052 Windows Me
ba58ab26 2053
fd46a41b 2054The following platform worked for the 5.005_03 major release but not
9ca74005 2055for 5.6.0. Standardization on UTF-8 as the internal string
2056representation in 5.6.0 and 5.6.1 introduced incompatibilities in this
2057EBCDIC platform. While Perl 5.7.1 will build on this platform some
fd46a41b 2058regression tests may fail and the C<use utf8;> pragma typically
9ca74005 2059introduces text handling errors.
ba58ab26 2060
fd46a41b 2061 OS/390 1)
ba58ab26 2062
fd46a41b 2063 1) previously known as MVS, about to become z/OS.
ba58ab26 2064
fd46a41b 2065Strongly related to the OS/390 platform by also being EBCDIC-based
ba58ab26 2066mainframe platforms are the following platforms:
2067
fd46a41b 2068 POSIX-BC (BS2000)
ba58ab26 2069 VM/ESA
2070
fd46a41b 2071These are also expected to work, albeit with no UTF-8 support, under 5.6.1
2072for the same reasons as OS/390. Contact the mailing list perl-mvs@perl.org
2073for more details.
ba58ab26 2074
2075The following platforms have been known to build Perl from source in
fd46a41b 2076the past (5.005_03 and earlier), but we haven't been able to verify
2077their status for the current release, either because the
2078hardware/software platforms are rare or because we don't have an
2079active champion on these platforms--or both. They used to work,
2080though, so go ahead and try compiling them, and let perlbug@perl.org
2081of any trouble.
ba58ab26 2082
2083 3b1
fd46a41b 2084 A/UX
2085 BeOS
2086 BSD/OS
ba58ab26 2087 ConvexOS
2088 CX/UX
2089 DC/OSx
2090 DDE SMES
2091 DOS EMX
2092 Dynix
2093 EP/IX
2094 ESIX
2095 FPS
2096 GENIX
2097 Greenhills
2098 ISC
2099 MachTen 68k
2100 MiNT
2101 MPC
2102 NEWS-OS
fd46a41b 2103 NextSTEP
2104 OpenSTEP
ba58ab26 2105 Opus
2106 Plan 9
2107 PowerUX
2108 RISC/os
fd46a41b 2109 SCO ODT/OSR
ba58ab26 2110 Stellar
2111 SVR2
2112 TI1500
2113 TitanOS
fd46a41b 2114 Ultrix
ba58ab26 2115 Unisys Dynix
2116 Unixware
fd46a41b 2117 UTS
ba58ab26 2118
2119Support for the following platform is planned for a future Perl release:
2120
2121 Netware
2122
2123The following platforms have their own source code distributions and
a93751fa 2124binaries available via http://www.cpan.org/ports/index.html:
ba58ab26 2125
2126 Perl release
2127
ba58ab26 2128 Netware 5.003_07
fd46a41b 2129 OS/400 5.005_02
ba58ab26 2130 Tandem Guardian 5.004
2131
2132The following platforms have only binaries available via
a93751fa 2133http://www.cpan.org/ports/index.html :
ba58ab26 2134
2135 Perl release
2136
2137 Acorn RISCOS 5.005_02
2138 AOS 5.002
2139 LynxOS 5.004_02
2140
2141Although we do suggest that you always build your own Perl from
2142the source code, both for maximal configurability and for security,
2143in case you are in a hurry you can check
a93751fa 2144http://www.cpan.org/ports/index.html for binary distributions.
ba58ab26 2145
c997b287 2146=head1 SEE ALSO
2147
a83b6f46 2148L<perlaix>, L<perlapollo>, L<perlamiga>, L<perlbeos>, L<perlbs200>,
a1f19229 2149L<perlce>, L<perlcygwin>, L<perldgux>, L<perldos>, L<perlepoc>, L<perlebcdic>,
a83b6f46 2150L<perlhurd>, L<perlhpux>, L<perlmachten>, L<perlmacos>, L<perlmint>,
9038e305 2151L<perlmpeix>, L<perlnetware>, L<perlos2>, L<perlos390>, L<perlplan9>,
2152L<perlqnx>, L<perlsolaris>, L<perltru64>, L<perlunicode>,
2153L<perlvmesa>, L<perlvms>, L<perlvos>, L<perlwin32>, and L<Win32>.
c997b287 2154
e41182b5 2155=head1 AUTHORS / CONTRIBUTORS
2156
06e9666b 2157Abigail <abigail@foad.org>,
c47ff5f1 2158Charles Bailey <bailey@newman.upenn.edu>,
2159Graham Barr <gbarr@pobox.com>,
2160Tom Christiansen <tchrist@perl.com>,
06e9666b 2161Nicholas Clark <nick@ccl4.org>,
c47ff5f1 2162Thomas Dorner <Thomas.Dorner@start.de>,
06e9666b 2163Andy Dougherty <doughera@lafayette.edu>,
2164Dominic Dunlop <domo@computer.org>,
2165Neale Ferguson <neale@vma.tabnsw.com.au>,
c47ff5f1 2166David J. Fiander <davidf@mks.com>,
2167Paul Green <Paul_Green@stratus.com>,
06e9666b 2168M.J.T. Guy <mjtg@cam.ac.uk>,
61f30a5e 2169Jarkko Hietaniemi <jhi@iki.fi>,
c47ff5f1 2170Luther Huffman <lutherh@stratcom.com>,
06e9666b 2171Nick Ing-Simmons <nick@ing-simmons.net>,
2172Andreas J. KE<ouml>nig <a.koenig@mind.de>,
c47ff5f1 2173Markus Laker <mlaker@contax.co.uk>,
2174Andrew M. Langmead <aml@world.std.com>,
2175Larry Moore <ljmoore@freespace.net>,
2176Paul Moore <Paul.Moore@uk.origin-it.com>,
2177Chris Nandor <pudge@pobox.com>,
2178Matthias Neeracher <neeri@iis.ee.ethz.ch>,
e71a7dc8 2179Philip Newton <pne@cpan.org>,
c47ff5f1 2180Gary Ng <71564.1743@CompuServe.COM>,
2181Tom Phoenix <rootbeer@teleport.com>,
2182AndrE<eacute> Pirard <A.Pirard@ulg.ac.be>,
2183Peter Prymmer <pvhp@forte.com>,
2184Hugo van der Sanden <hv@crypt0.demon.co.uk>,
2185Gurusamy Sarathy <gsar@activestate.com>,
2186Paul J. Schinder <schinder@pobox.com>,
2187Michael G Schwern <schwern@pobox.com>,
06e9666b 2188Dan Sugalski <dan@sidhe.org>,
c47ff5f1 2189Nathan Torkington <gnat@frii.com>.
e41182b5 2190
e41182b5 2191=head1 VERSION
2192
b787fad4 2193Version 1.50, last modified 10 Jul 2001