1 #############################################################################
2 # Pod/InputObjects.pm -- package which defines objects for input streams
3 # and paragraphs and commands when parsing POD docs.
5 # Copyright (C) 1996-1999 by Bradford Appleton. All rights reserved.
6 # This file is part of "PodParser". PodParser is free software;
7 # you can redistribute it and/or modify it under the same terms
9 #############################################################################
11 package Pod::InputObjects;
13 use vars qw($VERSION);
14 $VERSION = 1.090; ## Current version of this package
15 require 5.004; ## requires this Perl version or later
17 #############################################################################
21 Pod::InputObjects - objects representing POD input paragraphs, commands, etc.
25 use Pod::InputObjects;
37 This module defines some basic input objects used by B<Pod::Parser> when
38 reading and parsing POD text from an input source. The following objects
45 =item B<Pod::InputSource>
47 An object corresponding to a source of POD input text. It is mostly a
48 wrapper around a filehandle or C<IO::Handle>-type object (or anything
49 that implements the C<getline()> method) which keeps track of some
50 additional information relevant to the parsing of PODs.
54 =item B<Pod::Paragraph>
56 An object corresponding to a paragraph of POD input text. It may be a
57 plain paragraph, a verbatim paragraph, or a command paragraph (see
60 =item B<Pod::InteriorSequence>
62 An object corresponding to an interior sequence command from the POD
63 input text (see L<perlpod>).
65 =item B<Pod::ParseTree>
67 An object corresponding to a tree of parsed POD text. Each "node" in
68 a parse-tree (or I<ptree>) is either a text-string or a reference to
69 a B<Pod::InteriorSequence> object. The nodes appear in the parse-tree
70 in the order in which they were parsed, from left to right.
74 Each of these input objects is described in further detail in the
75 sections that follow.
79 #############################################################################
85 #############################################################################
87 package Pod::InputSource;
89 ##---------------------------------------------------------------------------
93 =head1 B<Pod::InputSource>
95 This object corresponds to an input source or stream of POD
96 documentation. When parsing PODs, it is necessary to associate and store
97 certain context information with each input source. All of this
98 information is kept together with the stream itself in one of these
99 C<Pod::InputSource> objects. Each such object is merely a wrapper around
100 an C<IO::Handle> object of some kind (or at least something that
101 implements the C<getline()> method). They have the following
108 ##---------------------------------------------------------------------------
114 my $pod_input1 = Pod::InputSource->new(-handle => $filehandle);
115 my $pod_input2 = new Pod::InputSource(-handle => $filehandle,
117 my $pod_input3 = new Pod::InputSource(-handle => \*STDIN);
118 my $pod_input4 = Pod::InputSource->new(-handle => \*STDIN,
121 This is a class method that constructs a C<Pod::InputSource> object and
122 returns a reference to the new input source object. It takes one or more
123 keyword arguments in the form of a hash. The keyword C<-handle> is
124 required and designates the corresponding input handle. The keyword
125 C<-name> is optional and specifies the name associated with the input
126 handle (typically a file name).
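
A minimal usage sketch, assuming the input is read from a file called
F<example.pod> (a hypothetical name chosen only for illustration):

    use IO::File;
    use Pod::InputObjects;

    ## 'example.pod' is a placeholder file name, not part of this module
    my $fh = IO::File->new('example.pod', 'r')
        or die "Can't open example.pod: $!";
    my $pod_input = Pod::InputSource->new(-handle => $fh,
                                          -name   => 'example.pod');
    print "Reading from ", $pod_input->name(), "\n";
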
133 ## Determine if we were called via an object-ref or a classname
135 my $class = ref($this) || $this;
137 ## Any remaining arguments are treated as initial values for the
138 ## hash that is used to represent this object. Note that we default
139 ## certain values by specifying them *before* the arguments passed.
140 ## If they are in the argument list, they will override the defaults.
141 my $self = { -name => '(unknown)',
146 ## Bless ourselves into the desired class and perform any initialization
151 ##---------------------------------------------------------------------------
157 my $filename = $pod_input->name();
158 $pod_input->name($new_filename_to_use);
160 This method gets/sets the name of the input source (usually a filename).
161 If no argument is given, it returns a string containing the name of
162 the input source; otherwise it sets the name of the input source to the
163 contents of the given argument.
170 (@_ > 1) and $_[0]->{'-name'} = $_[1];
171 return $_[0]->{'-name'};
174 ## allow 'filename' as an alias for 'name'
177 ##---------------------------------------------------------------------------
183 my $handle = $pod_input->handle();
185 Returns a reference to the handle object from which input is read (the
186 one used to construct this input source object).
193 return $_[0]->{'-handle'};
196 ##---------------------------------------------------------------------------
200 =head2 B<was_cutting()>
202 print "Yes.\n" if ($pod_input->was_cutting());
204 The value of the C<cutting> state (that the B<cutting()> method would
205 have returned) immediately before any input was read from this input
206 stream. After all input from this stream has been read, the C<cutting>
207 state is restored to this value.
214 (@_ > 1) and $_[0]->{-was_cutting} = $_[1];
215 return $_[0]->{-was_cutting};
218 ##---------------------------------------------------------------------------
220 #############################################################################
222 package Pod::Paragraph;
224 ##---------------------------------------------------------------------------
226 =head1 B<Pod::Paragraph>
228 An object representing a paragraph of POD input text.
229 It has the following methods/attributes:
233 ##---------------------------------------------------------------------------
237 my $pod_para1 = Pod::Paragraph->new(-text => $text);
238 my $pod_para2 = Pod::Paragraph->new(-name => $cmd,
240 my $pod_para3 = new Pod::Paragraph(-text => $text);
241 my $pod_para4 = new Pod::Paragraph(-name => $cmd,
243 my $pod_para5 = Pod::Paragraph->new(-name => $cmd,
246 -line => $line_number);
248 This is a class method that constructs a C<Pod::Paragraph> object and
249 returns a reference to the new paragraph object. It may be given one or
250 two keyword arguments. The C<-text> keyword indicates the corresponding
251 text of the POD paragraph. The C<-name> keyword indicates the name of
252 the corresponding POD command, such as C<head1> or C<item> (it should
253 I<not> contain the C<=> prefix); this is needed only if the POD
254 paragraph corresponds to a command paragraph. The C<-file> and C<-line>
255 keywords indicate the filename and line number corresponding to the
256 beginning of the paragraph.
261 ## Determine if we were called via an object-ref or a classname
263 my $class = ref($this) || $this;
265 ## Any remaining arguments are treated as initial values for the
266 ## hash that is used to represent this object. Note that we default
267 ## certain values by specifying them *before* the arguments passed.
268 ## If they are in the argument list, they will override the defaults.
271 -text => (@_ == 1) ? $_[0] : undef,
272 -file => '<unknown-file>',
280 ## Bless ourselves into the desired class and perform any initialization
285 ##---------------------------------------------------------------------------
289 my $para_cmd = $pod_para->cmd_name();
291 If this paragraph is a command paragraph, then this method will return
292 the name of the command (I<without> any leading C<=> prefix).
297 (@_ > 1) and $_[0]->{'-name'} = $_[1];
298 return $_[0]->{'-name'};
301 ## let name() be an alias for cmd_name()
304 ##---------------------------------------------------------------------------
308 my $para_text = $pod_para->text();
310 This method will return the corresponding text of the paragraph.
315 (@_ > 1) and $_[0]->{'-text'} = $_[1];
316 return $_[0]->{'-text'};
319 ##---------------------------------------------------------------------------
323 my $raw_pod_para = $pod_para->raw_text();
325 This method will return the I<raw> text of the POD paragraph, exactly
326 as it appeared in the input.
331 return $_[0]->{'-text'} unless (defined $_[0]->{'-name'});
332 return $_[0]->{'-prefix'} . $_[0]->{'-name'} .
333 $_[0]->{'-separator'} . $_[0]->{'-text'};
336 ##---------------------------------------------------------------------------
338 =head2 B<cmd_prefix()>
340 my $prefix = $pod_para->cmd_prefix();
342 If this paragraph is a command paragraph, then this method will return
343 the prefix used to denote the command (which should be the string "="
349 return $_[0]->{'-prefix'};
352 ##---------------------------------------------------------------------------
354 =head2 B<cmd_separator()>
356 my $separator = $pod_para->cmd_separator();
358 If this paragraph is a command paragraph, then this method will return
359 the text used to separate the command name from the rest of the
365 return $_[0]->{'-separator'};
368 ##---------------------------------------------------------------------------
370 =head2 B<parse_tree()>
372 my $ptree = $pod_parser->parse_text( $pod_para->text() );
373 $pod_para->parse_tree( $ptree );
374 $ptree = $pod_para->parse_tree();
376 This method will get/set the corresponding parse-tree of the paragraph's text.
381 (@_ > 1) and $_[0]->{'-ptree'} = $_[1];
382 return $_[0]->{'-ptree'};
385 ## let ptree() be an alias for parse_tree()
386 *ptree = \&parse_tree;
388 ##---------------------------------------------------------------------------
390 =head2 B<file_line()>
392 my ($filename, $line_number) = $pod_para->file_line();
393 my $position = $pod_para->file_line();
395 Returns the current filename and line number for the paragraph
396 object. If called in an array context, it returns a list of two
397 elements: first the filename, then the line number. If called in
398 a scalar context, it returns a string containing the filename, followed
399 by a colon (':'), followed by the line number.
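
For example, a sketch assuming a paragraph constructed by hand (the file
name and line number below are illustrative values only):

    my $para = Pod::Paragraph->new(-name => 'head1',
                                   -text => "NAME\n",
                                   -file => 'example.pod',
                                   -line => 12);
    my ($file, $line) = $para->file_line();   ## list context: two elements
    my $where         = $para->file_line();   ## scalar context: "file:line"
    warn "paragraph starts at $where\n";
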
404 my @loc = ($_[0]->{'-file'} || '<unknown-file>',
405 $_[0]->{'-line'} || 0);
406 return (wantarray) ? @loc : join(':', @loc);
409 ##---------------------------------------------------------------------------
411 #############################################################################
413 package Pod::InteriorSequence;
415 ##---------------------------------------------------------------------------
417 =head1 B<Pod::InteriorSequence>
419 An object representing a POD interior sequence command.
420 It has the following methods/attributes:
424 ##---------------------------------------------------------------------------
428 my $pod_seq1 = Pod::InteriorSequence->new(-name => $cmd,
429 -ldelim => $delimiter);
430 my $pod_seq2 = new Pod::InteriorSequence(-name => $cmd,
431 -ldelim => $delimiter);
432 my $pod_seq3 = new Pod::InteriorSequence(-name => $cmd,
433 -ldelim => $delimiter,
435 -line => $line_number);
437 my $pod_seq4 = new Pod::InteriorSequence(-name => $cmd, $ptree);
438 my $pod_seq5 = new Pod::InteriorSequence($cmd, $ptree);
440 This is a class method that constructs a C<Pod::InteriorSequence> object
441 and returns a reference to the new interior sequence object. It should
442 be given two keyword arguments. The C<-ldelim> keyword indicates the
443 corresponding left-delimiter of the interior sequence (e.g. 'E<lt>').
444 The C<-name> keyword indicates the name of the corresponding interior
445 sequence command, such as C<I> or C<B> or C<C>. The C<-file> and
446 C<-line> keywords indicate the filename and line number corresponding
447 to the beginning of the interior sequence. If the C<$ptree> argument is
448 given, it must be the last argument, and it must be either a string, or
449 else an array-ref suitable for passing to B<Pod::ParseTree::new> (or
450 it may be a reference to a Pod::ParseTree object).
455 ## Determine if we were called via an object-ref or a classname
457 my $class = ref($this) || $this;
459 ## See if first argument has no keyword
460 if (((@_ <= 2) or (@_ % 2)) and $_[0] !~ /^-\w/) {
461 ## Yup - need an implicit '-name' before first parameter
465 ## See if odd number of args
467 ## Yup - need an implicit '-ptree' before the last parameter
468 splice @_, $#_, 0, '-ptree';
471 ## Any remaining arguments are treated as initial values for the
472 ## hash that is used to represent this object. Note that we default
473 ## certain values by specifying them *before* the arguments passed.
474 ## If they are in the argument list, they will override the defaults.
476 -name => (@_ == 1) ? $_[0] : undef,
477 -file => '<unknown-file>',
484 ## Initialize contents if they haven't been already
485 my $ptree = $self->{'-ptree'} || new Pod::ParseTree();
486 if ( ref($ptree) =~ /^(ARRAY)?$/ ) {
487 ## We have an array-ref, or a normal scalar. Pass it as an
488 ## argument to the ptree-constructor (wrapping a plain scalar in an array-ref)
489 $ptree = new Pod::ParseTree($1 ? $ptree : [$ptree]);
491 $self->{'-ptree'} = $ptree;
493 ## Bless ourselves into the desired class and perform any initialization
498 ##---------------------------------------------------------------------------
502 my $seq_cmd = $pod_seq->cmd_name();
504 The name of the interior sequence command.
509 (@_ > 1) and $_[0]->{'-name'} = $_[1];
510 return $_[0]->{'-name'};
513 ## let name() be an alias for cmd_name()
516 ##---------------------------------------------------------------------------
518 ## Private subroutine to set the parent pointer of all the given
519 ## children that are interior-sequences to be $self
521 sub _set_child2parent_links {
522 my ($self, @children) = @_;
523 ## Make sure any sequences know who their parent is
525 next unless (length and ref and ref ne 'SCALAR');
526 if ($_->isa('Pod::InteriorSequence') or $_->can('nested')) {
532 ## Private subroutine to unset child->parent links
534 sub _unset_child2parent_links {
536 $self->{'-parent_sequence'} = undef;
537 my $ptree = $self->{'-ptree'};
539 next unless (length and ref and ref ne 'SCALAR');
540 $_->_unset_child2parent_links() if $_->isa('Pod::InteriorSequence');
544 ##---------------------------------------------------------------------------
548 $pod_seq->prepend($text);
549 $pod_seq1->prepend($pod_seq2);
551 Prepends the given string or parse-tree or sequence object to the parse-tree
552 of this interior sequence.
558 $self->{'-ptree'}->prepend(@_);
559 _set_child2parent_links($self, @_);
563 ##---------------------------------------------------------------------------
567 $pod_seq->append($text);
568 $pod_seq1->append($pod_seq2);
570 Appends the given string or parse-tree or sequence object to the parse-tree
571 of this interior sequence.
577 $self->{'-ptree'}->append(@_);
578 _set_child2parent_links($self, @_);
582 ##---------------------------------------------------------------------------
586 $outer_seq = $pod_seq->nested || print "not nested";
588 If this interior sequence is nested inside of another interior
589 sequence, then the outer/parent sequence that contains it is
590 returned. Otherwise C<undef> is returned.
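
A sketch of walking outward through the enclosing sequences, assuming
C<$pod_seq> is an existing B<Pod::InteriorSequence> object:

    my $depth = 0;
    for (my $outer = $pod_seq->nested; $outer; $outer = $outer->nested) {
        ++$depth;
        print "enclosed by ", $outer->cmd_name(), "\n";
    }
    print "nesting depth: $depth\n";
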
596 (@_ == 1) and $self->{'-parent_sequence'} = shift;
597 return $self->{'-parent_sequence'} || undef;
600 ##---------------------------------------------------------------------------
604 my $seq_raw_text = $pod_seq->raw_text();
606 This method will return the I<raw> text of the POD interior sequence,
607 exactly as it appeared in the input.
613 my $text = $self->{'-name'} . $self->{'-ldelim'};
614 for ( $self->{'-ptree'}->children ) {
615 $text .= (ref $_) ? $_->raw_text : $_;
617 $text .= $self->{'-rdelim'};
621 ##---------------------------------------------------------------------------
623 =head2 B<left_delimiter()>
625 my $ldelim = $pod_seq->left_delimiter();
627 The leftmost delimiter beginning the argument text to the interior
628 sequence (should be "<").
633 (@_ > 1) and $_[0]->{'-ldelim'} = $_[1];
634 return $_[0]->{'-ldelim'};
637 ## let ldelim() be an alias for left_delimiter()
638 *ldelim = \&left_delimiter;
640 ##---------------------------------------------------------------------------
642 =head2 B<right_delimiter()>
644 The rightmost delimiter ending the argument text to the interior
645 sequence (should be ">").
649 sub right_delimiter {
650 (@_ > 1) and $_[0]->{'-rdelim'} = $_[1];
651 return $_[0]->{'-rdelim'};
654 ## let rdelim() be an alias for right_delimiter()
655 *rdelim = \&right_delimiter;
657 ##---------------------------------------------------------------------------
659 =head2 B<parse_tree()>
661 my $ptree = $pod_parser->parse_text($paragraph_text);
662 $pod_seq->parse_tree( $ptree );
663 $ptree = $pod_seq->parse_tree();
665 This method will get/set the corresponding parse-tree of the interior
671 (@_ > 1) and $_[0]->{'-ptree'} = $_[1];
672 return $_[0]->{'-ptree'};
675 ## let ptree() be an alias for parse_tree()
676 *ptree = \&parse_tree;
678 ##---------------------------------------------------------------------------
680 =head2 B<file_line()>
682 my ($filename, $line_number) = $pod_seq->file_line();
683 my $position = $pod_seq->file_line();
685 Returns the current filename and line number for the interior sequence
686 object. If called in an array context, it returns a list of two
687 elements: first the filename, then the line number. If called in
688 a scalar context, it returns a string containing the filename, followed
689 by a colon (':'), followed by the line number.
694 my @loc = ($_[0]->{'-file'} || '<unknown-file>',
695 $_[0]->{'-line'} || 0);
696 return (wantarray) ? @loc : join(':', @loc);
699 ##---------------------------------------------------------------------------
703 This method performs any necessary cleanup for the interior-sequence.
704 If you override this method then it is B<imperative> that you invoke
705 the parent method from within your own method, otherwise
706 I<interior-sequence storage will not be reclaimed upon destruction!>
711 ## We need to get rid of all child->parent pointers throughout the
712 ## tree so their reference counts will go to zero and they can be
714 _unset_child2parent_links(@_);
717 ##---------------------------------------------------------------------------
719 #############################################################################
721 package Pod::ParseTree;
723 ##---------------------------------------------------------------------------
725 =head1 B<Pod::ParseTree>
727 This object corresponds to a tree of parsed POD text. As POD text is
728 scanned from left to right, it is parsed into an ordered list of
729 text-strings and B<Pod::InteriorSequence> objects (in order of
730 appearance). A B<Pod::ParseTree> object corresponds to this list of
731 strings and sequences. Each interior sequence in the parse-tree may
732 itself contain a parse-tree (since interior sequences may be nested).
736 ##---------------------------------------------------------------------------
740 my $ptree1 = Pod::ParseTree->new;
741 my $ptree2 = new Pod::ParseTree;
742 my $ptree3 = Pod::ParseTree->new($array_ref);
743 my $ptree4 = new Pod::ParseTree($array_ref);
745 This is a class method that constructs a C<Pod::ParseTree> object and
746 returns a reference to the new parse-tree. If a single argument is given,
747 it must be a reference to an array, and is used to initialize the root
748 (top) of the parse tree.
753 ## Determine if we were called via an object-ref or a classname
755 my $class = ref($this) || $this;
757 my $self = (@_ == 1 and ref $_[0]) ? $_[0] : [];
759 ## Bless ourselves into the desired class and perform any initialization
764 ##---------------------------------------------------------------------------
768 my $top_node = $ptree->top();
769 $ptree->top( $top_node );
770 $ptree->top( @children );
772 This method gets/sets the top node of the parse-tree. If no arguments are
773 given, it returns the topmost node in the tree (the root), which is also
774 a B<Pod::ParseTree>. If it is given a single argument that is a reference,
775 then the reference is assumed to be a parse-tree and becomes the new top node.
776 Otherwise, if arguments are given, they are treated as the new list of
777 children for the top node.
784 @{ $self } = (@_ == 1 and ref $_[0]) ? @{ $_[0] } : @_;
789 ## let parse_tree() & ptree() be aliases for the 'top' method
790 *parse_tree = *ptree = \&top;
792 ##---------------------------------------------------------------------------
796 This method gets/sets the children of the top node in the parse-tree.
797 If no arguments are given, it returns the list (array) of children
798 (each of which should be either a string or a B<Pod::InteriorSequence>).
799 Otherwise, if arguments are given, they are treated as the new list of
800 children for the top node.
807 @{ $self } = (@_ == 1 and ref $_[0]) ? @{ $_[0] } : @_;
812 ##---------------------------------------------------------------------------
816 This method prepends the given text or parse-tree to the current parse-tree.
817 If the first item on the parse-tree is text and the argument is also text,
818 then the text is prepended to the first item (not added as a separate string).
819 Otherwise the argument is added as a new string or parse-tree I<before>
824 use vars qw(@ptree); ## an alias used for performance reasons
828 local *ptree = $self;
831 if (@ptree and !(ref $ptree[0]) and !(ref $_)) {
832 $ptree[0] = $_ . $ptree[0];
840 ##---------------------------------------------------------------------------
844 This method appends the given text or parse-tree to the current parse-tree.
845 If the last item on the parse-tree is text and the argument is also text,
846 then the text is appended to the last item (not added as a separate string).
847 Otherwise the argument is added as a new string or parse-tree I<after>
854 local *ptree = $self;
857 if (@ptree and !(ref $ptree[-1]) and !(ref $_)) {
868 my $ptree_raw_text = $ptree->raw_text();
870 This method will return the I<raw> text of the POD parse-tree
871 exactly as it appeared in the input.
879 $text .= (ref $_) ? $_->raw_text : $_;
884 ##---------------------------------------------------------------------------
886 ## Private routines to set/unset child->parent links
888 sub _unset_child2parent_links {
890 local *ptree = $self;
892 next unless (length and ref and ref ne 'SCALAR');
893 $_->_unset_child2parent_links() if $_->isa('Pod::InteriorSequence');
897 sub _set_child2parent_links {
898 ## nothing to do, Pod::ParseTrees can't have parent pointers
903 This method performs any necessary cleanup for the parse-tree.
904 If you override this method then it is B<imperative>
905 that you invoke the parent method from within your own method,
906 otherwise I<parse-tree storage will not be reclaimed upon destruction!>
911 ## We need to get rid of all child->parent pointers throughout the
912 ## tree so their reference counts will go to zero and they can be
914 _unset_child2parent_links(@_);
917 #############################################################################
921 See L<Pod::Parser>, L<Pod::Select>, and L<Pod::Callbacks>.
925 Brad Appleton E<lt>bradapp@enteract.comE<gt>