[p5sagit/p5-mst-13.2.git] / pod / perlreref.pod

=head1 NAME

perlreref - Perl Regular Expressions Reference

=head1 DESCRIPTION

This is a quick reference to Perl's regular expressions.
For full information see L<perlre> and L<perlop>, as well
as the L</"SEE ALSO"> section in this document.

=head2 OPERATORS

C<=~> determines to which variable the regex is applied.
In its absence, $_ is used.

    $var =~ /foo/;

C<!~> determines to which variable the regex is applied,
and negates the result of the match; it returns
false if the match succeeds, and true if it fails.

    $var !~ /foo/;

C<m/pattern/msixpogc> searches a string for a pattern match,
applying the given options.

    m  Multiline mode - ^ and $ match internal lines
    s  match as a Single line - . matches \n
    i  case-Insensitive
    x  eXtended legibility - free whitespace and comments
    p  Preserve a copy of the matched string -
       ${^PREMATCH}, ${^MATCH}, ${^POSTMATCH} will be defined.
    o  compile pattern Once
    g  Global - all occurrences
    c  don't reset pos on failed matches when using /g

If 'pattern' is an empty string, the last I<successfully> matched
regex is used. Delimiters other than '/' may be used for both this
operator and the following ones. The leading C<m> can be omitted
if the delimiter is '/'.

C<qr/pattern/msixpo> lets you store a regex in a variable,
or pass one around. Modifiers as for C<m//>, and are stored
within the regex.

C<s/pattern/replacement/msixpogce> substitutes matches of
'pattern' with 'replacement'. Modifiers as for C<m//>,
with two additions:

    e  Evaluate 'replacement' as an expression
    r  Return substitution and leave the original string untouched.

'e' may be specified multiple times. 'replacement' is interpreted
as a double quoted string unless a single-quote (C<'>) is the delimiter.

C<?pattern?> is like C<m/pattern/> but matches only once. No alternate
delimiters can be used.  Must be reset with reset().

=head2 SYNTAX

 \       Escapes the character immediately following it
 .       Matches any single character except a newline (unless /s is
           used)
 ^       Matches at the beginning of the string (or line, if /m is used)
 $       Matches at the end of the string (or line, if /m is used)
 *       Matches the preceding element 0 or more times
 +       Matches the preceding element 1 or more times
 ?       Matches the preceding element 0 or 1 times
 {...}   Specifies a range of occurrences for the element preceding it
 [...]   Matches any one of the characters contained within the brackets
 (...)   Groups subexpressions for capturing to $1, $2...
 (?:...) Groups subexpressions without capturing (cluster)
 |       Matches either the subexpression preceding or following it
 \1, \2, \3 ...           Matches the text from the Nth group
 \g1 or \g{1}, \g2 ...    Matches the text from the Nth group
 \g-1 or \g{-1}, \g-2 ... Matches the text from the Nth previous group
 \g{name}     Named backreference
 \k<name>     Named backreference
 \k'name'     Named backreference
 (?P=name)    Named backreference (python syntax)

=head2 ESCAPE SEQUENCES

These work as in normal strings.

   \a       Alarm (beep)
   \e       Escape
   \f       Formfeed
   \n       Newline
   \r       Carriage return
   \t       Tab
   \037     Any octal ASCII value
   \x7f     Any hexadecimal ASCII value
   \x{263a} A wide hexadecimal value
   \cx      Control-x
   \N{name} A named character
   \N{U+263D} A Unicode character by hex ordinal

   \l  Lowercase next character
   \u  Titlecase next character
   \L  Lowercase until \E
   \U  Uppercase until \E
   \Q  Disable pattern metacharacters until \E
   \E  End modification

For Titlecase, see L</Titlecase>.

This one works differently from normal strings:

   \b  An assertion, not backspace, except in a character class

=head2 CHARACTER CLASSES

   [amy]    Match 'a', 'm' or 'y'
   [f-j]    Dash specifies "range"
   [f-j-]   Dash escaped or at start or end means 'dash'
   [^f-j]   Caret indicates "match any character _except_ these"

The following sequences (except C<\N>) work within or without a character class.
The first six are locale aware, all are Unicode aware. See L<perllocale>
and L<perlunicode> for details.

   \d      A digit
   \D      A nondigit
   \w      A word character
   \W      A non-word character
   \s      A whitespace character
   \S      A non-whitespace character
   \h      An horizontal whitespace
   \H      A non horizontal whitespace
   \N      A non newline (when not followed by '{NAME}'; experimental;
           not valid in a character class; equivalent to [^\n]; it's
           like '.' without /s modifier)
   \v      A vertical whitespace
   \V      A non vertical whitespace
   \R      A generic newline           (?>\v|\x0D\x0A)

   \C      Match a byte (with Unicode, '.' matches a character)
   \pP     Match P-named (Unicode) property
   \p{...} Match Unicode property with name longer than 1 character
   \PP     Match non-P
   \P{...} Match lack of Unicode property with name longer than 1 char
   \X      Match Unicode extended grapheme cluster

POSIX character classes and their Unicode and Perl equivalents:

           ASCII-         Full-
           range          range   backslash
 POSIX    \p{...}         \p{}    sequence       Description
 -----------------------------------------------------------------------
 alnum   PosixAlnum       Alnum               Alpha plus Digit
 alpha   PosixAlpha       Alpha               Alphabetic characters
 ascii   ASCII                                Any ASCII character
 blank   PosixBlank       Blank     \h        Horizontal whitespace;
                                                full-range also written
                                                as \p{HorizSpace} (GNU
                                                extension)
 cntrl   PosixCntrl       Cntrl               Control characters
 digit   PosixDigit       Digit     \d        Decimal digits
 graph   PosixGraph       Graph               Alnum plus Punct
 lower   PosixLower       Lower               Lowercase characters
 print   PosixPrint       Print               Graph plus Print, but not
                                                any Cntrls
 punct   PosixPunct       Punct               These aren't precisely
                                                equivalent.  See NOTE,
                                                below.
 space   PosixSpace       Space     [\s\cK]   Whitespace
         PerlSpace        SpacePerl \s        Perl's whitespace
                                                definition
 upper   PosixUpper       Upper               Uppercase characters
 word    PerlWord         Word      \w        Alnum plus '_' (Perl
                                                extension)
 xdigit  ASCII_Hex_Digit  XDigit              Hexadecimal digit,
                                                ASCII-range is
                                                [0-9A-Fa-f]

NOTE on C<[[:punct:]]>, C<\p{PosixPunct}> and C<\p{Punct}>:
In the ASCII range, C<[[:punct:]]> and C<\p{PosixPunct}> match
C<[-!"#$%&'()*+,./:;<=E<gt>?@[\\\]^_`{|}~]> (although if a locale is in
effect, it could alter the behavior of C<[[:punct:]]>); and C<\p{Punct}>
matches C<[-!"#%&'()*,./:;?@[\\\]_{}]>.  When matching a UTF-8 string,
C<[[:punct:]]> matches what it does in the ASCII range, plus what
C<\p{Punct}> matches.  C<\p{Punct}> matches, anything that isn't a
control, an alphanumeric, a space, nor a symbol.

Within a character class:

    POSIX      traditional   Unicode
  [:digit:]       \d        \p{Digit}
  [:^digit:]      \D        \P{Digit}

=head2 ANCHORS

All are zero-width assertions.

   ^  Match string start (or line, if /m is used)
   $  Match string end (or line, if /m is used) or before newline
   \b Match word boundary (between \w and \W)
   \B Match except at word boundary (between \w and \w or \W and \W)
   \A Match string start (regardless of /m)
   \Z Match string end (before optional newline)
   \z Match absolute string end
   \G Match where previous m//g left off
   \K Keep the stuff left of the \K, don't include it in $&

=head2 QUANTIFIERS

Quantifiers are greedy by default and match the B<longest> leftmost.

   Maximal Minimal Possessive Allowed range
   ------- ------- ---------- -------------
   {n,m}   {n,m}?  {n,m}+     Must occur at least n times
                              but no more than m times
   {n,}    {n,}?   {n,}+      Must occur at least n times
   {n}     {n}?    {n}+       Must occur exactly n times
   *       *?      *+         0 or more times (same as {0,})
   +       +?      ++         1 or more times (same as {1,})
   ?       ??      ?+         0 or 1 time (same as {0,1})

The possessive forms (new in Perl 5.10) prevent backtracking: what gets
matched by a pattern with a possessive quantifier will not be backtracked
into, even if that causes the whole match to fail.

There is no quantifier C<{,n}>. That's interpreted as a literal string.

=head2 EXTENDED CONSTRUCTS

   (?#text)          A comment
   (?:...)           Groups subexpressions without capturing (cluster)
   (?pimsx-imsx:...) Enable/disable option (as per m// modifiers)
   (?=...)           Zero-width positive lookahead assertion
   (?!...)           Zero-width negative lookahead assertion
   (?<=...)          Zero-width positive lookbehind assertion
   (?<!...)          Zero-width negative lookbehind assertion
   (?>...)           Grab what we can, prohibit backtracking
   (?|...)           Branch reset
   (?<name>...)      Named capture
   (?'name'...)      Named capture
   (?P<name>...)     Named capture (python syntax)
   (?{ code })       Embedded code, return value becomes $^R
   (??{ code })      Dynamic regex, return value used as regex
   (?N)              Recurse into subpattern number N
   (?-N), (?+N)      Recurse into Nth previous/next subpattern
   (?R), (?0)        Recurse at the beginning of the whole pattern
   (?&name)          Recurse into a named subpattern
   (?P>name)         Recurse into a named subpattern (python syntax)
   (?(cond)yes|no)
   (?(cond)yes)      Conditional expression, where "cond" can be:
                     (N)       subpattern N has matched something
                     (<name>)  named subpattern has matched something
                     ('name')  named subpattern has matched something
                     (?{code}) code condition
                     (R)       true if recursing
                     (RN)      true if recursing into Nth subpattern
                     (R&name)  true if recursing into named subpattern
                     (DEFINE)  always false, no no-pattern allowed

=head2 VARIABLES

   $_    Default variable for operators to use

   $`    Everything prior to matched string
   $&    Entire matched string
   $'    Everything after to matched string

   ${^PREMATCH}   Everything prior to matched string
   ${^MATCH}      Entire matched string
   ${^POSTMATCH}  Everything after to matched string

The use of C<$`>, C<$&> or C<$'> will slow down B<all> regex use
within your program. Consult L<perlvar> for C<@->
to see equivalent expressions that won't cause slow down.
See also L<Devel::SawAmpersand>. Starting with Perl 5.10, you
can also use the equivalent variables C<${^PREMATCH}>, C<${^MATCH}>
and C<${^POSTMATCH}>, but for them to be defined, you have to
specify the C</p> (preserve) modifier on your regular expression.

   $1, $2 ...  hold the Xth captured expr
   $+    Last parenthesized pattern match
   $^N   Holds the most recently closed capture
   $^R   Holds the result of the last (?{...}) expr
   @-    Offsets of starts of groups. $-[0] holds start of whole match
   @+    Offsets of ends of groups. $+[0] holds end of whole match
   %+    Named capture buffers
   %-    Named capture buffers, as array refs

Captured groups are numbered according to their I<opening> paren.

=head2 FUNCTIONS

   lc          Lowercase a string
   lcfirst     Lowercase first char of a string
   uc          Uppercase a string
   ucfirst     Titlecase first char of a string

   pos         Return or set current match position
   quotemeta   Quote metacharacters
   reset       Reset ?pattern? status
   study       Analyze string for optimizing matching

   split       Use a regex to split a string into parts

The first four of these are like the escape sequences C<\L>, C<\l>,
C<\U>, and C<\u>.  For Titlecase, see L</Titlecase>.

=head2 TERMINOLOGY

=head3 Titlecase

Unicode concept which most often is equal to uppercase, but for
certain characters like the German "sharp s" there is a difference.

=head1 AUTHOR

Iain Truskett. Updated by the Perl 5 Porters.

This document may be distributed under the same terms as Perl itself.

=head1 SEE ALSO

=over 4

=item *

L<perlretut> for a tutorial on regular expressions.

=item *

L<perlrequick> for a rapid tutorial.

=item *

L<perlre> for more details.

=item *

L<perlvar> for details on the variables.

=item *

L<perlop> for details on the operators.

=item *

L<perlfunc> for details on the functions.

=item *

L<perlfaq6> for FAQs on regular expressions.

=item *

L<perlrebackslash> for a reference on backslash sequences.

=item *

L<perlrecharclass> for a reference on character classes.

=item *

The L<re> module to alter behaviour and aid
debugging.

=item *

L<perldebug/"Debugging regular expressions">

=item *

L<perluniintro>, L<perlunicode>, L<charnames> and L<perllocale>
for details on regexes and internationalisation.

=item *

I<Mastering Regular Expressions> by Jeffrey Friedl
(F<http://oreilly.com/catalog/9780596528126/>) for a thorough grounding and
reference on the topic.

=back

=head1 THANKS

David P.C. Wollmann,
Richard Soderberg,
Sean M. Burke,
Tom Christiansen,
Jim Cromie,
and
Jeffrey Goff
for useful advice.

=cut
Commit	Line	Data
30487ceb	1	=head1 NAME
	2
	3	perlreref - Perl Regular Expressions Reference
	4
	5	=head1 DESCRIPTION
	6
	7	This is a quick reference to Perl's regular expressions.
	8	For full information see L<perlre> and L<perlop>, as well
6d014f17	9	as the L</"SEE ALSO"> section in this document.
30487ceb	10
a5365663	11	=head2 OPERATORS
30487ceb	12
e17472c5	13	C<=~> determines to which variable the regex is applied.
e17472c5	14	In its absence, $_ is used.
30487ceb	15
e17472c5	16	$var =~ /foo/;
30487ceb	17
e17472c5	18	C<!~> determines to which variable the regex is applied,
	19	and negates the result of the match; it returns
	20	false if the match succeeds, and true if it fails.
6d014f17	21
e17472c5	22	$var !~ /foo/;
6d014f17	23
e17472c5	24	C<m/pattern/msixpogc> searches a string for a pattern match,
e17472c5	25	applying the given options.
30487ceb	26
e17472c5	27	m Multiline mode - ^ and $ match internal lines
	28	s match as a Single line - . matches \n
	29	i case-Insensitive
	30	x eXtended legibility - free whitespace and comments
	31	p Preserve a copy of the matched string -
	32	${^PREMATCH}, ${^MATCH}, ${^POSTMATCH} will be defined.
	33	o compile pattern Once
	34	g Global - all occurrences
	35	c don't reset pos on failed matches when using /g
30487ceb	36
e17472c5	37	If 'pattern' is an empty string, the last I<successfully> matched
e17472c5	38	regex is used. Delimiters other than '/' may be used for both this
64c5a566	39	operator and the following ones. The leading C<m> can be omitted
e17472c5	40	if the delimiter is '/'.
30487ceb	41
e17472c5	42	C<qr/pattern/msixpo> lets you store a regex in a variable,
	43	or pass one around. Modifiers as for C<m//>, and are stored
	44	within the regex.
30487ceb	45
e17472c5	46	C<s/pattern/replacement/msixpogce> substitutes matches of
e17472c5	47	'pattern' with 'replacement'. Modifiers as for C<m//>,
4f4d7508	48	with two additions:
30487ceb	49
e17472c5	50	e Evaluate 'replacement' as an expression
4f4d7508	51	r Return substitution and leave the original string untouched.
30487ceb	52
e17472c5	53	'e' may be specified multiple times. 'replacement' is interpreted
e17472c5	54	as a double quoted string unless a single-quote (C<'>) is the delimiter.
30487ceb	55
e17472c5	56	C<?pattern?> is like C<m/pattern/> but matches only once. No alternate
e17472c5	57	delimiters can be used. Must be reset with reset().
30487ceb	58
a5365663	59	=head2 SYNTAX
30487ceb	60
9f4a55d4	61	\ Escapes the character immediately following it
	62	. Matches any single character except a newline (unless /s is
	63	used)
	64	^ Matches at the beginning of the string (or line, if /m is used)
	65	$ Matches at the end of the string (or line, if /m is used)
	66	* Matches the preceding element 0 or more times
	67	+ Matches the preceding element 1 or more times
	68	? Matches the preceding element 0 or 1 times
	69	{...} Specifies a range of occurrences for the element preceding it
	70	[...] Matches any one of the characters contained within the brackets
	71	(...) Groups subexpressions for capturing to $1, $2...
	72	(?:...) Groups subexpressions without capturing (cluster)
	73	\| Matches either the subexpression preceding or following it
	74	\1, \2, \3 ... Matches the text from the Nth group
	75	\g1 or \g{1}, \g2 ... Matches the text from the Nth group
	76	\g-1 or \g{-1}, \g-2 ... Matches the text from the Nth previous group
	77	\g{name} Named backreference
	78	\k<name> Named backreference
	79	\k'name' Named backreference
	80	(?P=name) Named backreference (python syntax)
30487ceb	81
	82	=head2 ESCAPE SEQUENCES
	83
	84	These work as in normal strings.
	85
	86	\a Alarm (beep)
	87	\e Escape
	88	\f Formfeed
	89	\n Newline
	90	\r Carriage return
	91	\t Tab
6ed007ae	92	\037 Any octal ASCII value
30487ceb	93	\x7f Any hexadecimal ASCII value
	94	\x{263a} A wide hexadecimal value
	95	\cx Control-x
	96	\N{name} A named character
e526e8bb	97	\N{U+263D} A Unicode character by hex ordinal
30487ceb	98
6d014f17	99	\l Lowercase next character
d3b55b48	100	\u Titlecase next character
30487ceb	101	\L Lowercase until \E
d3b55b48	102	\U Uppercase until \E
30487ceb	103	\Q Disable pattern metacharacters until \E
e17472c5	104	\E End modification
30487ceb	105
47e8a552	106	For Titlecase, see L</Titlecase>.
47e8a552	107
30487ceb	108	This one works differently from normal strings:
	109
	110	\b An assertion, not backspace, except in a character class
	111
	112	=head2 CHARACTER CLASSES
	113
	114	[amy] Match 'a', 'm' or 'y'
	115	[f-j] Dash specifies "range"
	116	[f-j-] Dash escaped or at start or end means 'dash'
6d014f17	117	[^f-j] Caret indicates "match any character _except_ these"
30487ceb	118
df225385	119	The following sequences (except C<\N>) work within or without a character class.
e17472c5	120	The first six are locale aware, all are Unicode aware. See L<perllocale>
	121	and L<perlunicode> for details.
	122
	123	\d A digit
	124	\D A nondigit
	125	\w A word character
	126	\W A non-word character
	127	\s A whitespace character
	128	\S A non-whitespace character
418e7b04	129	\h An horizontal whitespace
418e7b04	130	\H A non horizontal whitespace
9f4a55d4	131	\N A non newline (when not followed by '{NAME}'; experimental;
	132	not valid in a character class; equivalent to [^\n]; it's
	133	like '.' without /s modifier)
418e7b04	134	\v A vertical whitespace
418e7b04	135	\V A non vertical whitespace
e17472c5	136	\R A generic newline (?>\v\|\x0D\x0A)
e04a154e	137
e04a154e	138	\C Match a byte (with Unicode, '.' matches a character)
30487ceb	139	\pP Match P-named (Unicode) property
e1b711da	140	\p{...} Match Unicode property with name longer than 1 character
30487ceb	141	\PP Match non-P
e1b711da	142	\P{...} Match lack of Unicode property with name longer than 1 char
0111a78f	143	\X Match Unicode extended grapheme cluster
30487ceb	144
	145	POSIX character classes and their Unicode and Perl equivalents:
	146
9f4a55d4	147	ASCII- Full-
	148	range range backslash
	149	POSIX \p{...} \p{} sequence Description
	150	-----------------------------------------------------------------------
	151	alnum PosixAlnum Alnum Alpha plus Digit
	152	alpha PosixAlpha Alpha Alphabetic characters
	153	ascii ASCII Any ASCII character
	154	blank PosixBlank Blank \h Horizontal whitespace;
	155	full-range also written
	156	as \p{HorizSpace} (GNU
	157	extension)
	158	cntrl PosixCntrl Cntrl Control characters
	159	digit PosixDigit Digit \d Decimal digits
	160	graph PosixGraph Graph Alnum plus Punct
	161	lower PosixLower Lower Lowercase characters
	162	print PosixPrint Print Graph plus Print, but not
	163	any Cntrls
	164	punct PosixPunct Punct These aren't precisely
	165	equivalent. See NOTE,
	166	below.
	167	space PosixSpace Space [\s\cK] Whitespace
	168	PerlSpace SpacePerl \s Perl's whitespace
	169	definition
	170	upper PosixUpper Upper Uppercase characters
	171	word PerlWord Word \w Alnum plus '_' (Perl
	172	extension)
	173	xdigit ASCII_Hex_Digit XDigit Hexadecimal digit,
	174	ASCII-range is
	175	[0-9A-Fa-f]
	176
	177	NOTE on C<[[:punct:]]>, C<\p{PosixPunct}> and C<\p{Punct}>:
	178	In the ASCII range, C<[[:punct:]]> and C<\p{PosixPunct}> match
	179	C<[-!"#$%&'()*+,./:;<=E<gt>?@[\\\]^_`{\|}~]> (although if a locale is in
	180	effect, it could alter the behavior of C<[[:punct:]]>); and C<\p{Punct}>
	181	matches C<[-!"#%&'()*,./:;?@[\\\]_{}]>. When matching a UTF-8 string,
	182	C<[[:punct:]]> matches what it does in the ASCII range, plus what
	183	C<\p{Punct}> matches. C<\p{Punct}> matches, anything that isn't a
	184	control, an alphanumeric, a space, nor a symbol.
30487ceb	185
	186	Within a character class:
	187
9f4a55d4	188	POSIX traditional Unicode
	189	[:digit:] \d \p{Digit}
	190	[:^digit:] \D \P{Digit}
30487ceb	191
	192	=head2 ANCHORS
	193
	194	All are zero-width assertions.
	195
	196	^ Match string start (or line, if /m is used)
	197	$ Match string end (or line, if /m is used) or before newline
	198	\b Match word boundary (between \w and \W)
6d014f17	199	\B Match except at word boundary (between \w and \w or \W and \W)
30487ceb	200	\A Match string start (regardless of /m)
6d014f17	201	\Z Match string end (before optional newline)
30487ceb	202	\z Match absolute string end
30487ceb	203	\G Match where previous m//g left off
64c5a566	204	\K Keep the stuff left of the \K, don't include it in $&
64c5a566	205
30487ceb	206	=head2 QUANTIFIERS
30487ceb	207
ac036724	208	Quantifiers are greedy by default and match the B<longest> leftmost.
30487ceb	209
64c5a566	210	Maximal Minimal Possessive Allowed range
	211	------- ------- ---------- -------------
	212	{n,m} {n,m}? {n,m}+ Must occur at least n times
	213	but no more than m times
	214	{n,} {n,}? {n,}+ Must occur at least n times
	215	{n} {n}? {n}+ Must occur exactly n times
	216	* ? + 0 or more times (same as {0,})
	217	+ +? ++ 1 or more times (same as {1,})
	218	? ?? ?+ 0 or 1 time (same as {0,1})
	219
	220	The possessive forms (new in Perl 5.10) prevent backtracking: what gets
	221	matched by a pattern with a possessive quantifier will not be backtracked
	222	into, even if that causes the whole match to fail.
30487ceb	223
ac036724	224	There is no quantifier C<{,n}>. That's interpreted as a literal string.
6d014f17	225
30487ceb	226	=head2 EXTENDED CONSTRUCTS
30487ceb	227
64c5a566	228	(?#text) A comment
	229	(?:...) Groups subexpressions without capturing (cluster)
	230	(?pimsx-imsx:...) Enable/disable option (as per m// modifiers)
	231	(?=...) Zero-width positive lookahead assertion
	232	(?!...) Zero-width negative lookahead assertion
	233	(?<=...) Zero-width positive lookbehind assertion
	234	(?<!...) Zero-width negative lookbehind assertion
	235	(?>...) Grab what we can, prohibit backtracking
	236	(?\|...) Branch reset
	237	(?<name>...) Named capture
	238	(?'name'...) Named capture
	239	(?P<name>...) Named capture (python syntax)
	240	(?{ code }) Embedded code, return value becomes $^R
	241	(??{ code }) Dynamic regex, return value used as regex
	242	(?N) Recurse into subpattern number N
	243	(?-N), (?+N) Recurse into Nth previous/next subpattern
	244	(?R), (?0) Recurse at the beginning of the whole pattern
	245	(?&name) Recurse into a named subpattern
	246	(?P>name) Recurse into a named subpattern (python syntax)
	247	(?(cond)yes\|no)
	248	(?(cond)yes) Conditional expression, where "cond" can be:
	249	(N) subpattern N has matched something
	250	(<name>) named subpattern has matched something
	251	('name') named subpattern has matched something
	252	(?{code}) code condition
	253	(R) true if recursing
	254	(RN) true if recursing into Nth subpattern
	255	(R&name) true if recursing into named subpattern
	256	(DEFINE) always false, no no-pattern allowed
30487ceb	257
a5365663	258	=head2 VARIABLES
30487ceb	259
30487ceb	260	$_ Default variable for operators to use
30487ceb	261
30487ceb	262	$` Everything prior to matched string
e17472c5	263	$& Entire matched string
30487ceb	264	$' Everything after to matched string
30487ceb	265
e17472c5	266	${^PREMATCH} Everything prior to matched string
	267	${^MATCH} Entire matched string
	268	${^POSTMATCH} Everything after to matched string
	269
	270	The use of C<$`>, C<$&> or C<$'> will slow down B<all> regex use
64c5a566	271	within your program. Consult L<perlvar> for C<@->
30487ceb	272	to see equivalent expressions that won't cause slow down.
e17472c5	273	See also L<Devel::SawAmpersand>. Starting with Perl 5.10, you
	274	can also use the equivalent variables C<${^PREMATCH}>, C<${^MATCH}>
	275	and C<${^POSTMATCH}>, but for them to be defined, you have to
	276	specify the C</p> (preserve) modifier on your regular expression.
30487ceb	277
	278	$1, $2 ... hold the Xth captured expr
	279	$+ Last parenthesized pattern match
	280	$^N Holds the most recently closed capture
	281	$^R Holds the result of the last (?{...}) expr
6d014f17	282	@- Offsets of starts of groups. $-[0] holds start of whole match
6d014f17	283	@+ Offsets of ends of groups. $+[0] holds end of whole match
e17472c5	284	%+ Named capture buffers
e17472c5	285	%- Named capture buffers, as array refs
30487ceb	286
6d014f17	287	Captured groups are numbered according to their I<opening> paren.
30487ceb	288
a5365663	289	=head2 FUNCTIONS
30487ceb	290
	291	lc Lowercase a string
	292	lcfirst Lowercase first char of a string
	293	uc Uppercase a string
47e8a552	294	ucfirst Titlecase first char of a string
47e8a552	295
30487ceb	296	pos Return or set current match position
	297	quotemeta Quote metacharacters
	298	reset Reset ?pattern? status
	299	study Analyze string for optimizing matching
	300
e17472c5	301	split Use a regex to split a string into parts
30487ceb	302
d3b55b48	303	The first four of these are like the escape sequences C<\L>, C<\l>,
d3b55b48	304	C<\U>, and C<\u>. For Titlecase, see L</Titlecase>.
47e8a552	305
1501d360	306	=head2 TERMINOLOGY
47e8a552	307
a5365663	308	=head3 Titlecase
47e8a552	309
	310	Unicode concept which most often is equal to uppercase, but for
	311	certain characters like the German "sharp s" there is a difference.
	312
40506b5d	313	=head1 AUTHOR
30487ceb	314
64c5a566	315	Iain Truskett. Updated by the Perl 5 Porters.
30487ceb	316
	317	This document may be distributed under the same terms as Perl itself.
	318
40506b5d	319	=head1 SEE ALSO
30487ceb	320
	321	=over 4
	322
	323	=item *
	324
	325	L<perlretut> for a tutorial on regular expressions.
	326
	327	=item *
	328
	329	L<perlrequick> for a rapid tutorial.
	330
	331	=item *
	332
	333	L<perlre> for more details.
	334
	335	=item *
	336
	337	L<perlvar> for details on the variables.
	338
	339	=item *
	340
	341	L<perlop> for details on the operators.
	342
	343	=item *
	344
	345	L<perlfunc> for details on the functions.
	346
	347	=item *
	348
	349	L<perlfaq6> for FAQs on regular expressions.
	350
	351	=item *
	352
64c5a566	353	L<perlrebackslash> for a reference on backslash sequences.
	354
	355	=item *
	356
	357	L<perlrecharclass> for a reference on character classes.
	358
	359	=item *
	360
30487ceb	361	The L<re> module to alter behaviour and aid
	362	debugging.
	363
	364	=item *
	365
	366	L<perldebug/"Debugging regular expressions">
	367
	368	=item *
	369
e17472c5	370	L<perluniintro>, L<perlunicode>, L<charnames> and L<perllocale>
30487ceb	371	for details on regexes and internationalisation.
	372
	373	=item *
	374
	375	I<Mastering Regular Expressions> by Jeffrey Friedl
08d7a6b2	376	(F<http://oreilly.com/catalog/9780596528126/>) for a thorough grounding and
30487ceb	377	reference on the topic.
	378
	379	=back
	380
40506b5d	381	=head1 THANKS
30487ceb	382
	383	David P.C. Wollmann,
	384	Richard Soderberg,
	385	Sean M. Burke,
	386	Tom Christiansen,
e5a7b003	387	Jim Cromie,
30487ceb	388	and
	389	Jeffrey Goff
	390	for useful advice.
6d014f17	391
6d014f17	392	=cut