to work similarly to Unicode tech reports and Java
notation \uXXXX (and already existing \x{XXXX))?
more than four hexdigits? make also \U+XXXX work?
+ overloadable regex assertions? e.g. in Thai \b cannot
+ be deduced by any simple character class boundary rules,
+ word boundaries must algorithmically computed
+
+ see ext/Encode/Todo for notes and references about proper detection
+ of malformed UTF-8
+
+ SCSU? http://www.unicode.org/unicode/reports/tr6/
+ Collation? http://www.unicode.org/unicode/reports/tr10/
+ Normalization? http://www.unicode.org/unicode/reports/tr15/
+ EBCDIC? http://www.unicode.org/unicode/reports/tr16/
+ Regexes? http://www.unicode.org/unicode/reports/tr18/
+ Case Mappings? http://www.unicode.org/unicode/reports/tr21/
+
See also "Locales", "Regexen", and "Miscellaneous".
Multi-threading
sendmsg, recvmsg? (Configure doesn't probe for these but the units exist)
setitimer, getitimer? (the metaconfig units exist)
-Unicode
- SCSU? http://www.unicode.org/unicode/reports/tr6/
- Collation? http://www.unicode.org/unicode/reports/tr10/
- Normalization? http://www.unicode.org/unicode/reports/tr15/
- EBCDIC? http://www.unicode.org/unicode/reports/tr16/
- Regexes? http://www.unicode.org/unicode/reports/tr18/
- Case Mappings? http://www.unicode.org/unicode/reports/tr21/
-
Ongoing
keep filenames 8.3 friendly, where feasible
upgrade to newer versions of all independently maintained modules