Infrastructure to use UTF-EBCDIC rather than UTF-8 as the internal
authorNick Ing-Simmons <nik@tiuk.ti.com>
Sat, 17 Mar 2001 18:29:50 +0000 (18:29 +0000)
committerNick Ing-Simmons <nik@tiuk.ti.com>
Sat, 17 Mar 2001 18:29:50 +0000 (18:29 +0000)
commit1d72bdf6104ef56ab17c3abedf522be0125851c7
treeee4fd5e4b221b1cb3519855686b6cc7d9077c9d6
parentd742c382eed38a7010c93d369ad6896d826c21d6
Infrastructure to use UTF-EBCDIC rather than UTF-8 as the internal
encoding on EBCDIC platforms. This has property that U+0000..U+009F i.e.
a superset of ASCII are invariant under the encoding. This is EBCDIC
friendly as an encoded string can be looked at as being EBCDIC by lexer
sprintf("%d",...) etc. in same manner that a UTF-8 string be considered
ASCII on ASCII machines.

 - re-arrange utf8.h to get ASCII specific vs Unicode generic bits
   seperate.
 - Add some more macros to comprehend different shift amounts and
   possible swizzle in UTF-EBCDIC vs UTF-8. Change utf8.c to use them.
 - add utfebcdic.h which provides UTF-EBCDIC versions of the macros,
   and conditionally #include it.

EBCDIC build as yet untested.  ASCII still fails the one test.

p4raw-id: //depot/perlio@9185
utf8.c
utf8.h
utfebcdic.h [new file with mode: 0644]