Unicode support
finish byte <-> utf8 and localencoding <-> utf8 conversions
- make substr($bytestr,0,0,$charstr) do the right conversion
add Unicode::Map equivalent to core
add support for I/O disciplines
- a way to specify disciplines when opening things:
to work similarly to Unicode tech reports and Java
notation \uXXXX (and already existing \x{XXXX})?
more than four hex digits? also make \U+XXXX work?
+ overloadable regex assertions? e.g. in Thai \b cannot
+ be deduced by any simple character class boundary rules,
+ word boundaries must be algorithmically computed
+
+ see ext/Encode/Todo for notes and references about proper detection
+ of malformed UTF-8
+
+ SCSU? http://www.unicode.org/unicode/reports/tr6/
+ Collation? http://www.unicode.org/unicode/reports/tr10/
+ Normalization? http://www.unicode.org/unicode/reports/tr15/
+ EBCDIC? http://www.unicode.org/unicode/reports/tr16/
+ Regexes? http://www.unicode.org/unicode/reports/tr18/
+ Case Mappings? http://www.unicode.org/unicode/reports/tr21/
+
See also "Locales", "Regexen", and "Miscellaneous".
Multi-threading
turn Cwd into an XS module? (Configure already probes for getcwd())
mmap for speeding up input? (Configure already probes for the mmap family)
sendmsg, recvmsg? (Configure doesn't probe for these but the units exist)
+ sockatmark?
setitimer, getitimer? (the metaconfig units exist)
-Unicode
- SCSU? http://www.unicode.org/unicode/reports/tr6/
- Collation? http://www.unicode.org/unicode/reports/tr10/
- Normalization? http://www.unicode.org/unicode/reports/tr15/
- EBCDIC? http://www.unicode.org/unicode/reports/tr16/
- Regexes? http://www.unicode.org/unicode/reports/tr18/
- Case Mappings? http://www.unicode.org/unicode/reports/tr21/
-
Ongoing
keep filenames 8.3 friendly, where feasible
upgrade to newer versions of all independently maintained modules