Level 1 - Basic Unicode Support
2.1 Hex Notation - done [1]
- Named Notation - done [2]
+ Named Notation - done [2]
2.2 Categories - done [3][4]
2.3 Subtraction - MISSING [5][6]
2.4 Simple Word Boundaries - done [7]
[ 9] see UTR#13 Unicode Newline Guidelines
[10] should do ^ and $ also on \x{85}, \x{2028} and \x{2029})
(should also affect <>, $., and script line numbers)
+ (the \x{85}, \x{2028} and \x{2029} do match \s)
(*) You can mimic class subtraction using lookahead.
For example, what TR18 might write as