perl 5.003_04: utils/perldoc.PL
[p5sagit/p5-mst-13.2.git] / pod / perlmod.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perlmod - Perl modules (packages)
4
5=head1 DESCRIPTION
6
7=head2 Packages
8
748a9306 9Perl provides a mechanism for alternative namespaces to protect packages
d0c42abe 10from stomping on each other's variables. In fact, apart from certain
cb1a09d0 11magical variables, there's really no such thing as a global variable in
12Perl. The package statement declares the compilation unit as being in the
13given namespace. The scope of the package declaration is from the
14declaration itself through the end of the enclosing block (the same scope
15as the local() operator). All further unqualified dynamic identifiers
16will be in this namespace. A package statement only affects dynamic
17variables--including those you've used local() on--but I<not> lexical
18variables created with my(). Typically it would be the first declaration
19in a file to be included by the C<require> or C<use> operator. You can
a0d0e21e 20switch into a package in more than one place; it merely influences which
21symbol table is used by the compiler for the rest of that block. You can
22refer to variables and filehandles in other packages by prefixing the
23identifier with the package name and a double colon:
24C<$Package::Variable>. If the package name is null, the C<main> package
d0c42abe 25is assumed. That is, C<$::sail> is equivalent to C<$main::sail>.
a0d0e21e 26
27(The old package delimiter was a single quote, but double colon
28is now the preferred delimiter, in part because it's more readable
29to humans, and in part because it's more readable to B<emacs> macros.
30It also makes C++ programmers feel like they know what's going on.)
31
32Packages may be nested inside other packages: C<$OUTER::INNER::var>. This
33implies nothing about the order of name lookups, however. All symbols
34are either local to the current package, or must be fully qualified
35from the outer package name down. For instance, there is nowhere
36within package C<OUTER> that C<$INNER::var> refers to C<$OUTER::INNER::var>.
37It would treat package C<INNER> as a totally separate global package.
38
39Only identifiers starting with letters (or underscore) are stored in a
cb1a09d0 40package's symbol table. All other symbols are kept in package C<main>,
41including all of the punctuation variables like $_. In addition, the
42identifiers STDIN, STDOUT, STDERR, ARGV, ARGVOUT, ENV, INC and SIG are
43forced to be in package C<main>, even when used for other purposes than
44their built-in one. Note also that, if you have a package called C<m>,
45C<s> or C<y>, then you can't use the qualified form of an identifier
46because it will be interpreted instead as a pattern match, a substitution,
47or a translation.
a0d0e21e 48
49(Variables beginning with underscore used to be forced into package
50main, but we decided it was more useful for package writers to be able
cb1a09d0 51to use leading underscore to indicate private variables and method names.
52$_ is still global though.)
a0d0e21e 53
54Eval()ed strings are compiled in the package in which the eval() was
55compiled. (Assignments to C<$SIG{}>, however, assume the signal
748a9306 56handler specified is in the C<main> package. Qualify the signal handler
a0d0e21e 57name if you wish to have a signal handler in a package.) For an
58example, examine F<perldb.pl> in the Perl library. It initially switches
59to the C<DB> package so that the debugger doesn't interfere with variables
60in the script you are trying to debug. At various points, however, it
61temporarily switches back to the C<main> package to evaluate various
62expressions in the context of the C<main> package (or wherever you came
63from). See L<perldebug>.
64
cb1a09d0 65See L<perlsub> for other scoping issues related to my() and local(),
66or L<perlref> regarding closures.
67
a0d0e21e 68=head2 Symbol Tables
69
70The symbol table for a package happens to be stored in the associative
71array of that name appended with two colons. The main symbol table's
d0c42abe 72name is thus C<%main::>, or C<%::> for short. Likewise symbol table for
73the nested package mentioned earlier is named C<%OUTER::INNER::>.
a0d0e21e 74
cb1a09d0 75The value in each entry of the associative array is what you are referring
76to when you use the C<*name> typeglob notation. In fact, the following
77have the same effect, though the first is more efficient because it does
78the symbol table lookups at compile time:
a0d0e21e 79
80 local(*main::foo) = *main::bar; local($main::{'foo'}) =
81 $main::{'bar'};
82
83You can use this to print out all the variables in a package, for
84instance. Here is F<dumpvar.pl> from the Perl library:
85
86 package dumpvar;
87 sub main::dumpvar {
88 ($package) = @_;
89 local(*stab) = eval("*${package}::");
90 while (($key,$val) = each(%stab)) {
91 local(*entry) = $val;
92 if (defined $entry) {
93 print "\$$key = '$entry'\n";
94 }
95
96 if (defined @entry) {
97 print "\@$key = (\n";
98 foreach $num ($[ .. $#entry) {
99 print " $num\t'",$entry[$num],"'\n";
100 }
101 print ")\n";
102 }
103
104 if ($key ne "${package}::" && defined %entry) {
105 print "\%$key = (\n";
106 foreach $key (sort keys(%entry)) {
107 print " $key\t'",$entry{$key},"'\n";
108 }
109 print ")\n";
110 }
111 }
112 }
113
114Note that even though the subroutine is compiled in package C<dumpvar>,
115the name of the subroutine is qualified so that its name is inserted
116into package C<main>.
117
cb1a09d0 118Assignment to a typeglob performs an aliasing operation, i.e.,
a0d0e21e 119
120 *dick = *richard;
121
748a9306 122causes variables, subroutines and file handles accessible via the
d0c42abe 123identifier C<richard> to also be accessible via the identifier C<dick>. If
a0d0e21e 124you only want to alias a particular variable or subroutine, you can
125assign a reference instead:
126
127 *dick = \$richard;
128
129makes $richard and $dick the same variable, but leaves
130@richard and @dick as separate arrays. Tricky, eh?
131
cb1a09d0 132This mechanism may be used to pass and return cheap references
133into or from subroutines if you won't want to copy the whole
134thing.
135
136 %some_hash = ();
137 *some_hash = fn( \%another_hash );
138 sub fn {
139 local *hashsym = shift;
140 # now use %hashsym normally, and you
141 # will affect the caller's %another_hash
142 my %nhash = (); # do what you want
143 return \%nhash;
144 }
145
146On return, the reference wil overwrite the hash slot in the
147symbol table specified by the *some_hash typeglob. This
148is a somewhat tricky way of passing around refernces cheaply
149when you won't want to have to remember to dereference variables
150explicitly.
151
152Another use of symbol tables is for making "constant" scalars.
153
154 *PI = \3.14159265358979;
155
156Now you cannot alter $PI, which is probably a good thing all in all.
157
a0d0e21e 158=head2 Package Constructors and Destructors
159
160There are two special subroutine definitions that function as package
161constructors and destructors. These are the C<BEGIN> and C<END>
162routines. The C<sub> is optional for these routines.
163
164A C<BEGIN> subroutine is executed as soon as possible, that is, the
165moment it is completely defined, even before the rest of the containing
166file is parsed. You may have multiple C<BEGIN> blocks within a
167file--they will execute in order of definition. Because a C<BEGIN>
168block executes immediately, it can pull in definitions of subroutines
169and such from other files in time to be visible to the rest of the
170file.
171
172An C<END> subroutine is executed as late as possible, that is, when the
173interpreter is being exited, even if it is exiting as a result of a
174die() function. (But not if it's is being blown out of the water by a
175signal--you have to trap that yourself (if you can).) You may have
748a9306 176multiple C<END> blocks within a file--they will execute in reverse
a0d0e21e 177order of definition; that is: last in, first out (LIFO).
178
179Note that when you use the B<-n> and B<-p> switches to Perl, C<BEGIN>
180and C<END> work just as they do in B<awk>, as a degenerate case.
181
182=head2 Perl Classes
183
4633a7c4 184There is no special class syntax in Perl, but a package may function
a0d0e21e 185as a class if it provides subroutines that function as methods. Such a
186package may also derive some of its methods from another class package
4633a7c4 187by listing the other package name in its @ISA array.
188
189For more on this, see L<perlobj>.
a0d0e21e 190
191=head2 Perl Modules
192
c07a80fd 193A module is just a package that is defined in a library file of
a0d0e21e 194the same name, and is designed to be reusable. It may do this by
195providing a mechanism for exporting some of its symbols into the symbol
196table of any package using it. Or it may function as a class
197definition and make its semantics available implicitly through method
198calls on the class and its objects, without explicit exportation of any
199symbols. Or it can do a little of both.
200
4633a7c4 201For example, to start a normal module called Fred, create
202a file called Fred.pm and put this at the start of it:
203
cf6f4e91 204 package Fred;
205 use Exporter ();
4633a7c4 206 @ISA = qw(Exporter);
207 @EXPORT = qw(func1 func2);
208 @EXPORT_OK = qw($sally @listabob %harry func3);
209
210Then go on to declare and use your variables in functions
211without any qualifications.
212See L<Exporter> and the I<Perl Modules File> for details on
213mechanics and style issues in module creation.
214
215Perl modules are included into your program by saying
a0d0e21e 216
217 use Module;
218
219or
220
221 use Module LIST;
222
223This is exactly equivalent to
224
225 BEGIN { require "Module.pm"; import Module; }
226
227or
228
229 BEGIN { require "Module.pm"; import Module LIST; }
230
cb1a09d0 231As a special case
232
233 use Module ();
234
235is exactly equivalent to
236
237 BEGIN { require "Module.pm"; }
238
a0d0e21e 239All Perl module files have the extension F<.pm>. C<use> assumes this so
240that you don't have to spell out "F<Module.pm>" in quotes. This also
241helps to differentiate new modules from old F<.pl> and F<.ph> files.
242Module names are also capitalized unless they're functioning as pragmas,
243"Pragmas" are in effect compiler directives, and are sometimes called
244"pragmatic modules" (or even "pragmata" if you're a classicist).
245
246Because the C<use> statement implies a C<BEGIN> block, the importation
247of semantics happens at the moment the C<use> statement is compiled,
248before the rest of the file is compiled. This is how it is able
249to function as a pragma mechanism, and also how modules are able to
250declare subroutines that are then visible as list operators for
251the rest of the current file. This will not work if you use C<require>
cb1a09d0 252instead of C<use>. With require you can get into this problem:
a0d0e21e 253
254 require Cwd; # make Cwd:: accessible
255 $here = Cwd::getcwd();
256
257 use Cwd; # import names from Cwd::
258 $here = getcwd();
259
260 require Cwd; # make Cwd:: accessible
261 $here = getcwd(); # oops! no main::getcwd()
262
cb1a09d0 263In general C<use Module ();> is recommended over C<require Module;>.
264
a0d0e21e 265Perl packages may be nested inside other package names, so we can have
266package names containing C<::>. But if we used that package name
267directly as a filename it would makes for unwieldy or impossible
268filenames on some systems. Therefore, if a module's name is, say,
269C<Text::Soundex>, then its definition is actually found in the library
270file F<Text/Soundex.pm>.
271
272Perl modules always have a F<.pm> file, but there may also be dynamically
273linked executables or autoloaded subroutine definitions associated with
274the module. If so, these will be entirely transparent to the user of
275the module. It is the responsibility of the F<.pm> file to load (or
276arrange to autoload) any additional functionality. The POSIX module
277happens to do both dynamic loading and autoloading, but the user can
278just say C<use POSIX> to get it all.
279
8e07c86e 280For more information on writing extension modules, see L<perlxs>
a0d0e21e 281and L<perlguts>.
282
283=head1 NOTE
284
285Perl does not enforce private and public parts of its modules as you may
286have been used to in other languages like C++, Ada, or Modula-17. Perl
287doesn't have an infatuation with enforced privacy. It would prefer
288that you stayed out of its living room because you weren't invited, not
289because it has a shotgun.
290
291The module and its user have a contract, part of which is common law,
292and part of which is "written". Part of the common law contract is
293that a module doesn't pollute any namespace it wasn't asked to. The
294written contract for the module (AKA documentation) may make other
295provisions. But then you know when you C<use RedefineTheWorld> that
296you're redefining the world and willing to take the consequences.
297
298=head1 THE PERL MODULE LIBRARY
299
300A number of modules are included the the Perl distribution. These are
301described below, and all end in F<.pm>. You may also discover files in
302the library directory that end in either F<.pl> or F<.ph>. These are old
748a9306 303libraries supplied so that old programs that use them still run. The
a0d0e21e 304F<.pl> files will all eventually be converted into standard modules, and
305the F<.ph> files made by B<h2ph> will probably end up as extension modules
306made by B<h2xs>. (Some F<.ph> values may already be available through the
307POSIX module.) The B<pl2pm> file in the distribution may help in your
d0c42abe 308conversion, but it's just a mechanical process, so is far from bulletproof.
a0d0e21e 309
310=head2 Pragmatic Modules
311
312They work somewhat like pragmas in that they tend to affect the compilation of
313your program, and thus will usually only work well when used within a
748a9306 314C<use>, or C<no>. These are locally scoped, so an inner BLOCK
a0d0e21e 315may countermand any of these by saying
316
317 no integer;
318 no strict 'refs';
319
320which lasts until the end of that BLOCK.
321
322The following programs are defined (and have their own documentation).
323
324=over 12
325
cb1a09d0 326=item diagnostics
4633a7c4 327
328Pragma to produce enhanced diagnostics
329
cb1a09d0 330=item integer
a0d0e21e 331
4633a7c4 332Pragma to compute arithmetic in integer instead of double
a0d0e21e 333
cb1a09d0 334=item less
a0d0e21e 335
4633a7c4 336Pragma to request less of something from the compiler
a0d0e21e 337
d0c42abe 338=item ops
339
340Pragma to restrict use of unsafe opcodes
341
cb1a09d0 342=item overload
343
344Pragma for overloading operators
345
346=item sigtrap
a0d0e21e 347
4633a7c4 348Pragma to enable stack backtrace on unexpected signals
a0d0e21e 349
cb1a09d0 350=item strict
a0d0e21e 351
4633a7c4 352Pragma to restrict unsafe constructs
a0d0e21e 353
cb1a09d0 354=item subs
a0d0e21e 355
4633a7c4 356Pragma to predeclare sub names
a0d0e21e 357
d0c42abe 358=item vars
359
360Pragma to predeclare global symbols
361
a0d0e21e 362=back
363
364=head2 Standard Modules
365
4633a7c4 366Standard, bundled modules are all expected to behave in a well-defined
a0d0e21e 367manner with respect to namespace pollution because they use the
4633a7c4 368Exporter module. See their own documentation for details.
a0d0e21e 369
cb1a09d0 370=over 12
371
372=item AnyDBM_File
373
374provide framework for multiple DBMs
375
376=item AutoLoader
377
378load functions only on demand
379
380=item AutoSplit
381
382split a package for autoloading
383
384=item Benchmark
385
386benchmark running times of code
387
388=item Carp
389
390warn of errors (from perspective of caller)
391
392=item Config
393
394access Perl configuration option
395
396=item Cwd
397
398get pathname of current working directory
399
400=item DB_File
401
402Perl access to Berkeley DB
403
404=item Devel::SelfStubber
405
406generate stubs for a SelfLoading module
407
408=item DynaLoader
409
410Dynamically load C libraries into Perl code
411
412=item English
413
414use nice English (or awk) names for ugly punctuation variables
415
416=item Env
417
418perl module that imports environment variables
419
420=item Exporter
421
422provide inport/export controls for Perl modules
423
424=item ExtUtils::Liblist
425
426determine libraries to use and how to use them
427
428=item ExtUtils::MakeMaker
429
430create an extension Makefile
431
432=item ExtUtils::Manifest
433
434utilities to write and check a MANIFEST file
435
436=item ExtUtils::Mkbootstrap
437
438make a bootstrap file for use by DynaLoader
439
440=item ExtUtils::Miniperl
441
442!!!GOOD QUESTION!!!
443
444=item Fcntl
445
446load the C Fcntl.h defines
447
448=item File::Basename
449
450parse file specifications
451
452=item File::CheckTree
453
454run many filetest checks on a tree
455
456=item File::Find
457
458traverse a file tree
459
460=item FileHandle
461
462supply object methods for filehandles
463
464=item File::Path
465
466create or remove a series of directories
467
468=item Getopt::Long
469
470extended getopt processing
471
472=item Getopt::Std
473
474Process single-character switches with switch clustering
475
476=item I18N::Collate
477
478compare 8-bit scalar data according to the current locale
479
480=item IPC::Open2
481
482a process for both reading and writing
483
484=item IPC::Open3
485
486open a process for reading, writing, and error handling
487
488=item Net::Ping
489
490check a host for upness
491
492=item POSIX
493
494Perl interface to IEEE Std 1003.1
495
496=item SelfLoader
497
498load functions only on demand
499
a2927560 500=item Safe
501
502Creation controlled compartments in which perl code can be evaluated.
503
cb1a09d0 504=item Socket
505
506load the C socket.h defines and structure manipulators
507
508=item Test::Harness
509
510run perl standard test scripts with statistics
511
512=item Text::Abbrev
513
514rceate an abbreviation table from a list
515
516=back
517
518To find out I<all> the modules installed on your system, including
519those without documentation or outside the standard release, do this:
a0d0e21e 520
4633a7c4 521 find `perl -e 'print "@INC"'` -name '*.pm' -print
a0d0e21e 522
4633a7c4 523They should all have their own documentation installed and accessible via
524your system man(1) command. If that fails, try the I<perldoc> program.
a0d0e21e 525
4633a7c4 526=head2 Extension Modules
a0d0e21e 527
4633a7c4 528Extension modules are written in C (or a mix of Perl and C) and get
529dynamically loaded into Perl if and when you need them. Supported
530extension modules include the Socket, Fcntl, and POSIX modules.
a0d0e21e 531
cb1a09d0 532Many popular C extension modules do not come bundled (at least, not
533completely) due to their size, volatility, or simply lack of time for
534adequate testing and configuration across the multitude of platforms on
535which Perl was beta-tested. You are encouraged to look for them in
536archie(1L), the Perl FAQ or Meta-FAQ, the WWW page, and even with their
537authors before randomly posting asking for their present condition and
538disposition.
a0d0e21e 539
cb1a09d0 540=head1 CPAN
a0d0e21e 541
4633a7c4 542CPAN stands for the Comprehensive Perl Archive Network. This is a globally
543replicated collection of all known Perl materials, including hundreds
544of unbunded modules. Here are the major categories of modules:
a0d0e21e 545
4633a7c4 546=over
a0d0e21e 547
4633a7c4 548=item *
549Language Extensions and Documentation Tools
a0d0e21e 550
4633a7c4 551=item *
552Development Support
a0d0e21e 553
4633a7c4 554=item *
555Operating System Interfaces
a0d0e21e 556
4633a7c4 557=item *
558Networking, Device Control (modems) and InterProcess Communication
a0d0e21e 559
4633a7c4 560=item *
561Data Types and Data Type Utilities
a0d0e21e 562
4633a7c4 563=item *
564Database Interfaces
a0d0e21e 565
4633a7c4 566=item *
567User Interfaces
a0d0e21e 568
4633a7c4 569=item *
570Interfaces to / Emulations of Other Programming Languages
a0d0e21e 571
4633a7c4 572=item *
573File Names, File Systems and File Locking (see also File Handles)
a0d0e21e 574
4633a7c4 575=item *
576String Processing, Language Text Processing, Parsing and Searching
a0d0e21e 577
4633a7c4 578=item *
579Option, Argument, Parameter and Configuration File Processing
a0d0e21e 580
4633a7c4 581=item *
582Internationalization and Locale
a0d0e21e 583
4633a7c4 584=item *
585Authentication, Security and Encryption
a0d0e21e 586
4633a7c4 587=item *
588World Wide Web, HTML, HTTP, CGI, MIME
a0d0e21e 589
4633a7c4 590=item *
591Server and Daemon Utilities
a0d0e21e 592
4633a7c4 593=item *
594Archiving and Compression
a0d0e21e 595
4633a7c4 596=item *
597Images, Pixmap and Bitmap Manipulation, Drawing and Graphing
a0d0e21e 598
4633a7c4 599=item *
600Mail and Usenet News
a0d0e21e 601
4633a7c4 602=item *
603Control Flow Utilities (callbacks and exceptions etc)
a0d0e21e 604
4633a7c4 605=item *
606File Handle and Input/Output Stream Utilities
a0d0e21e 607
4633a7c4 608=item *
609Miscellaneous Modules
a0d0e21e 610
4633a7c4 611=back
a0d0e21e 612
d0c42abe 613The registered CPAN sites as of this writing include the following.
4633a7c4 614You should try to choose one close to you:
a0d0e21e 615
4633a7c4 616=over
a0d0e21e 617
4633a7c4 618=item *
619ftp://ftp.sterling.com/programming/languages/perl/
a0d0e21e 620
4633a7c4 621=item *
622ftp://ftp.sedl.org/pub/mirrors/CPAN/
a0d0e21e 623
4633a7c4 624=item *
625ftp://ftp.uoknor.edu/mirrors/CPAN/
a0d0e21e 626
4633a7c4 627=item *
628ftp://ftp.delphi.com/pub/mirrors/packages/perl/CPAN/
a0d0e21e 629
4633a7c4 630=item *
631ftp://uiarchive.cso.uiuc.edu/pub/lang/perl/CPAN/
a0d0e21e 632
4633a7c4 633=item *
634ftp://ftp.cis.ufl.edu/pub/perl/CPAN/
a0d0e21e 635
4633a7c4 636=item *
637ftp://ftp.switch.ch/mirror/CPAN/
a0d0e21e 638
4633a7c4 639=item *
640ftp://ftp.sunet.se/pub/lang/perl/CPAN/
a0d0e21e 641
4633a7c4 642=item *
643ftp://ftp.ci.uminho.pt/pub/lang/perl/
a0d0e21e 644
4633a7c4 645=item *
646ftp://ftp.cs.ruu.nl/pub/PERL/CPAN/
a0d0e21e 647
4633a7c4 648=item *
649ftp://ftp.demon.co.uk/pub/mirrors/perl/CPAN/
a0d0e21e 650
4633a7c4 651=item *
652ftp://ftp.rz.ruhr-uni-bochum.de/pub/programming/languages/perl/CPAN/
a0d0e21e 653
4633a7c4 654=item *
655ftp://ftp.leo.org/pub/comp/programming/languages/perl/CPAN/
a0d0e21e 656
4633a7c4 657=item *
658ftp://ftp.pasteur.fr/pub/computing/unix/perl/CPAN/
a0d0e21e 659
4633a7c4 660=item *
661ftp://ftp.ibp.fr/pub/perl/CPAN/
a0d0e21e 662
4633a7c4 663=item *
664ftp://ftp.funet.fi/pub/languages/perl/CPAN/
a0d0e21e 665
4633a7c4 666=item *
667ftp://ftp.tekotago.ac.nz/pub/perl/CPAN/
a0d0e21e 668
4633a7c4 669=item *
670ftp://ftp.mame.mu.oz.au/pub/perl/CPAN/
a0d0e21e 671
4633a7c4 672=item *
673ftp://coombs.anu.edu.au/pub/perl/
a0d0e21e 674
4633a7c4 675=item *
676ftp://dongpo.math.ncu.edu.tw/perl/CPAN/
a0d0e21e 677
4633a7c4 678=item *
679ftp://ftp.lab.kdd.co.jp/lang/perl/CPAN/
a0d0e21e 680
4633a7c4 681=item *
682ftp://ftp.is.co.za/programming/perl/CPAN/
a0d0e21e 683
684=back
4633a7c4 685
686For an up-to-date listing of CPAN sites,
d0c42abe 687see F<http://www.perl.com/perl/CPAN> or F<ftp://ftp.perl.com/perl/>.
cb1a09d0 688
689=head1 Modules: Creation, Use and Abuse
690
691(The following section is borrowed directly from Tim Bunce's modules
692file, available at your nearest CPAN site.)
693
694Perl 5 implements a class using a package, but the presence of a
695package doesn't imply the presence of a class. A package is just a
696namespace. A class is a package that provides subroutines that can be
697used as methods. A method is just a subroutine that expects, as its
698first argument, either the name of a package (for "static" methods),
699or a reference to something (for "virtual" methods).
700
701A module is a file that (by convention) provides a class of the same
702name (sans the .pm), plus an import method in that class that can be
703called to fetch exported symbols. This module may implement some of
704its methods by loading dynamic C or C++ objects, but that should be
705totally transparent to the user of the module. Likewise, the module
706might set up an AUTOLOAD function to slurp in subroutine definitions on
707demand, but this is also transparent. Only the .pm file is required to
708exist.
709
710=head2 Guidelines for Module Creation
711
712=over 4
713
714=item Do similar modules already exist in some form?
715
716If so, please try to reuse the existing modules either in whole or
717by inheriting useful features into a new class. If this is not
718practical try to get together with the module authors to work on
719extending or enhancing the functionality of the existing modules.
720A perfect example is the plethora of packages in perl4 for dealing
721with command line options.
722
723If you are writing a module to expand an already existing set of
724modules, please coordinate with the author of the package. It
725helps if you follow the same naming scheme and module interaction
726scheme as the original author.
727
728=item Try to design the new module to be easy to extend and reuse.
729
730Use blessed references. Use the two argument form of bless to bless
731into the class name given as the first parameter of the constructor,
732e.g.:
733
734 sub new {
735 my $class = shift;
736 return bless {}, $class;
737 }
738
739or even this if you'd like it to be used as either a static
740or a virtual method.
741
742 sub new {
743 my $self = shift;
744 my $class = ref($self) || $self;
745 return bless {}, $class;
746 }
747
748Pass arrays as references so more parameters can be added later
749(it's also faster). Convert functions into methods where
750appropriate. Split large methods into smaller more flexible ones.
751Inherit methods from other modules if appropriate.
752
753Avoid class name tests like: die "Invalid" unless ref $ref eq 'FOO'.
754Generally you can delete the "eq 'FOO'" part with no harm at all.
755Let the objects look after themselves! Generally, avoid hardwired
756class names as far as possible.
757
d28ebecd 758Avoid $r-E<gt>Class::func() where using @ISA=qw(... Class ...) and
759$r-E<gt>func() would work (see perlbot man page for more details).
cb1a09d0 760
761Use autosplit so little used or newly added functions won't be a
762burden to programs which don't use them. Add test functions to
763the module after __END__ either using AutoSplit or by saying:
764
765 eval join('',<main::DATA>) || die $@ unless caller();
766
767Does your module pass the 'empty sub-class' test? If you say
768"@SUBCLASS::ISA = qw(YOURCLASS);" your applications should be able
769to use SUBCLASS in exactly the same way as YOURCLASS. For example,
770does your application still work if you change: $obj = new YOURCLASS;
771into: $obj = new SUBCLASS; ?
772
773Avoid keeping any state information in your packages. It makes it
774difficult for multiple other packages to use yours. Keep state
775information in objects.
776
777Always use C<-w>. Try to C<use strict;> (or C<use strict qw(...);>).
778Remember that you can add C<no strict qw(...);> to individual blocks
779of code which need less strictness. Always use C<-w>. Always use C<-w>!
780Follow the guidelines in the perlstyle(1) manual.
781
782=item Some simple style guidelines
783
784The perlstyle manual supplied with perl has many helpful points.
785
786Coding style is a matter of personal taste. Many people evolve their
787style over several years as they learn what helps them write and
788maintain good code. Here's one set of assorted suggestions that
789seem to be widely used by experienced developers:
790
791Use underscores to separate words. It is generally easier to read
792$var_names_like_this than $VarNamesLikeThis, especially for
793non-native speakers of English. It's also a simple rule that works
794consistently with VAR_NAMES_LIKE_THIS.
795
796Package/Module names are an exception to this rule. Perl informally
797reserves lowercase module names for 'pragma' modules like integer
798and strict. Other modules normally begin with a capital letter and
799use mixed case with no underscores (need to be short and portable).
800
801You may find it helpful to use letter case to indicate the scope
802or nature of a variable. For example:
803
804 $ALL_CAPS_HERE constants only (beware clashes with perl vars)
805 $Some_Caps_Here package-wide global/static
806 $no_caps_here function scope my() or local() variables
807
808Function and method names seem to work best as all lowercase.
d28ebecd 809E.g., $obj-E<gt>as_string().
cb1a09d0 810
811You can use a leading underscore to indicate that a variable or
812function should not be used outside the package that defined it.
813
814=item Select what to export.
815
816Do NOT export method names!
817
818Do NOT export anything else by default without a good reason!
819
820Exports pollute the namespace of the module user. If you must
821export try to use @EXPORT_OK in preference to @EXPORT and avoid
822short or common names to reduce the risk of name clashes.
823
824Generally anything not exported is still accessible from outside the
d28ebecd 825module using the ModuleName::item_name (or $blessed_ref-E<gt>method)
cb1a09d0 826syntax. By convention you can use a leading underscore on names to
827informally indicate that they are 'internal' and not for public use.
828
829(It is actually possible to get private functions by saying:
830my $subref = sub { ... }; &$subref; But there's no way to call that
831directly as a method, since a method must have a name in the symbol
832table.)
833
834As a general rule, if the module is trying to be object oriented
835then export nothing. If it's just a collection of functions then
836@EXPORT_OK anything but use @EXPORT with caution.
837
838=item Select a name for the module.
839
840This name should be as descriptive, accurate and complete as
841possible. Avoid any risk of ambiguity. Always try to use two or
842more whole words. Generally the name should reflect what is special
843about what the module does rather than how it does it. Please use
844nested module names to informally group or categorise a module.
845A module should have a very good reason not to have a nested name.
846Module names should begin with a capital letter.
847
848Having 57 modules all called Sort will not make life easy for anyone
849(though having 23 called Sort::Quick is only marginally better :-).
850Imagine someone trying to install your module alongside many others.
851If in any doubt ask for suggestions in comp.lang.perl.misc.
852
853If you are developing a suite of related modules/classes it's good
854practice to use nested classes with a common prefix as this will
855avoid namespace clashes. For example: Xyz::Control, Xyz::View,
856Xyz::Model etc. Use the modules in this list as a naming guide.
857
858If adding a new module to a set, follow the original author's
859standards for naming modules and the interface to methods in
860those modules.
861
862To be portable each component of a module name should be limited to
86311 characters. If it might be used on DOS then try to ensure each is
864unique in the first 8 characters. Nested modules make this easier.
865
866=item Have you got it right?
867
868How do you know that you've made the right decisions? Have you
869picked an interface design that will cause problems later? Have
870you picked the most appropriate name? Do you have any questions?
871
872The best way to know for sure, and pick up many helpful suggestions,
873is to ask someone who knows. Comp.lang.perl.misc is read by just about
874all the people who develop modules and it's the best place to ask.
875
876All you need to do is post a short summary of the module, its
877purpose and interfaces. A few lines on each of the main methods is
878probably enough. (If you post the whole module it might be ignored
879by busy people - generally the very people you want to read it!)
880
881Don't worry about posting if you can't say when the module will be
882ready - just say so in the message. It might be worth inviting
883others to help you, they may be able to complete it for you!
884
885=item README and other Additional Files.
886
887It's well known that software developers usually fully document the
888software they write. If, however, the world is in urgent need of
889your software and there is not enough time to write the full
890documentation please at least provide a README file containing:
891
892=over 10
893
894=item *
895A description of the module/package/extension etc.
896
897=item *
898A copyright notice - see below.
899
900=item *
901Prerequisites - what else you may need to have.
902
903=item *
904How to build it - possible changes to Makefile.PL etc.
905
906=item *
907How to install it.
908
909=item *
910Recent changes in this release, especially incompatibilities
911
912=item *
913Changes / enhancements you plan to make in the future.
914
915=back
916
917If the README file seems to be getting too large you may wish to
918split out some of the sections into separate files: INSTALL,
919Copying, ToDo etc.
920
d0c42abe 921=over 4
922
cb1a09d0 923=item Adding a Copyright Notice.
924
925How you choose to licence your work is a personal decision.
926The general mechanism is to assert your Copyright and then make
927a declaration of how others may copy/use/modify your work.
928
929Perl, for example, is supplied with two types of licence: The GNU
930GPL and The Artistic License (see the files README, Copying and
931Artistic). Larry has good reasons for NOT just using the GNU GPL.
932
933My personal recommendation, out of respect for Larry, Perl and the
934perl community at large is to simply state something like:
935
936 Copyright (c) 1995 Your Name. All rights reserved.
937 This program is free software; you can redistribute it and/or
938 modify it under the same terms as Perl itself.
939
940This statement should at least appear in the README file. You may
941also wish to include it in a Copying file and your source files.
942Remember to include the other words in addition to the Copyright.
943
944=item Give the module a version/issue/release number.
945
946To be fully compatible with the Exporter and MakeMaker modules you
947should store your module's version number in a non-my package
948variable called $VERSION. This should be a valid floating point
949number with at least two digits after the decimal (ie hundredths,
950e.g, $VERSION = "0.01"). Don't use a "1.3.2" style version.
951See Exporter.pm in Perl5.001m or later for details.
952
953It may be handy to add a function or method to retrieve the number.
954Use the number in announcements and archive file names when
955releasing the module (ModuleName-1.02.tar.Z).
956See perldoc ExtUtils::MakeMaker.pm for details.
957
958=item How to release and distribute a module.
959
960It's good idea to post an announcement of the availability of your
961module (or the module itself if small) to the comp.lang.perl.announce
962Usenet newsgroup. This will at least ensure very wide once-off
963distribution.
964
965If possible you should place the module into a major ftp archive and
966include details of it's location in your announcement.
967
968Some notes about ftp archives: Please use a long descriptive file
969name which includes the version number. Most incoming directories
970will not be readable/listable, i.e., you won't be able to see your
971file after uploading it. Remember to send your email notification
972message as soon as possible after uploading else your file may get
973deleted automatically. Allow time for the file to be processed
974and/or check the file has been processed before announcing its
975location.
976
977FTP Archives for Perl Modules:
978
979Follow the instructions and links on
980
981 http://franz.ww.tu-berlin.de/modulelist
982
983or upload to one of these sites:
984
985 ftp://franz.ww.tu-berlin.de/incoming
986 ftp://ftp.cis.ufl.edu/incoming
987
988and notify upload@franz.ww.tu-berlin.de.
989
990By using the WWW interface you can ask the Upload Server to mirror
991your modules from your ftp or WWW site into your own directory on
992CPAN!
993
994Please remember to send me an updated entry for the Module list!
995
996=item Take care when changing a released module.
997
998Always strive to remain compatible with previous released versions
999(see 2.2 above) Otherwise try to add a mechanism to revert to the
1000old behaviour if people rely on it. Document incompatible changes.
1001
1002=back
1003
d0c42abe 1004=back
1005
cb1a09d0 1006=head2 Guidelines for Converting Perl 4 Library Scripts into Modules
1007
1008=over 4
1009
1010=item There is no requirement to convert anything.
1011
1012If it ain't broke, don't fix it! Perl 4 library scripts should
1013continue to work with no problems. You may need to make some minor
1014changes (like escaping non-array @'s in double quoted strings) but
1015there is no need to convert a .pl file into a Module for just that.
1016
1017=item Consider the implications.
1018
1019All the perl applications which make use of the script will need to
1020be changed (slightly) if the script is converted into a module. Is
1021it worth it unless you plan to make other changes at the same time?
1022
1023=item Make the most of the opportunity.
1024
1025If you are going to convert the script to a module you can use the
1026opportunity to redesign the interface. The 'Guidelines for Module
1027Creation' above include many of the issues you should consider.
1028
1029=item The pl2pm utility will get you started.
1030
1031This utility will read *.pl files (given as parameters) and write
1032corresponding *.pm files. The pl2pm utilities does the following:
1033
1034=over 10
1035
1036=item *
1037Adds the standard Module prologue lines
1038
1039=item *
1040Converts package specifiers from ' to ::
1041
1042=item *
1043Converts die(...) to croak(...)
1044
1045=item *
1046Several other minor changes
1047
1048=back
1049
1050Being a mechanical process pl2pm is not bullet proof. The converted
1051code will need careful checking, especially any package statements.
1052Don't delete the original .pl file till the new .pm one works!
1053
1054=back
1055
1056=head2 Guidelines for Reusing Application Code
1057
1058=over 4
1059
1060=item Complete applications rarely belong in the Perl Module Library.
1061
1062=item Many applications contain some perl code which could be reused.
1063
1064Help save the world! Share your code in a form that makes it easy
1065to reuse.
1066
1067=item Break-out the reusable code into one or more separate module files.
1068
1069=item Take the opportunity to reconsider and redesign the interfaces.
1070
1071=item In some cases the 'application' can then be reduced to a small
1072
1073fragment of code built on top of the reusable modules. In these cases
1074the application could invoked as:
1075
1076 perl -e 'use Module::Name; method(@ARGV)' ...
1077or
d0c42abe 1078 perl -mModule::Name ... (in perl5.002)
cb1a09d0 1079
1080=back
1081