\S A non-whitespace character
\h An horizontal white space
\H A non horizontal white space
+ \N A non newline (when not followed by a '{'; it's like . without /s)
\v A vertical white space
\V A non vertical white space
\R A generic newline (?>\v|\x0D\x0A)
\C Match a byte (with Unicode, '.' matches a character)
\pP Match P-named (Unicode) property
- \p{...} Match Unicode property with long name
+ \p{...} Match Unicode property with name longer than 1 character
\PP Match non-P
- \P{...} Match lack of Unicode property with long name
- \X Match extended Unicode combining character sequence
+ \P{...} Match lack of Unicode property with name longer than 1 char
+ \X Match Unicode extended grapheme cluster
POSIX character classes and their Unicode and Perl equivalents:
=item *
I<Mastering Regular Expressions> by Jeffrey Friedl
-(F<http://regex.info/>) for a thorough grounding and
+(F<http://oreilly.com/catalog/9780596528126/>) for a thorough grounding and
reference on the topic.
=back