git.shadowcat.co.uk Git - p5sagit/p5-mst-13.2.git/commit

author	Nick Ing-Simmons <nik@tiuk.ti.com>
	Sat, 17 Mar 2001 18:29:50 +0000 (18:29 +0000)
committer	Nick Ing-Simmons <nik@tiuk.ti.com>
	Sat, 17 Mar 2001 18:29:50 +0000 (18:29 +0000)
commit	1d72bdf6104ef56ab17c3abedf522be0125851c7
tree	ee4fd5e4b221b1cb3519855686b6cc7d9077c9d6	tree \| snapshot
parent	d742c382eed38a7010c93d369ad6896d826c21d6	commit \| diff

Infrastructure to use UTF-EBCDIC rather than UTF-8 as the internal
encoding on EBCDIC platforms. This has property that U+0000..U+009F i.e.
a superset of ASCII are invariant under the encoding. This is EBCDIC
friendly as an encoded string can be looked at as being EBCDIC by lexer
sprintf("%d",...) etc. in same manner that a UTF-8 string be considered
ASCII on ASCII machines.

- re-arrange utf8.h to get ASCII specific vs Unicode generic bits
   seperate.
- Add some more macros to comprehend different shift amounts and
   possible swizzle in UTF-EBCDIC vs UTF-8. Change utf8.c to use them.
- add utfebcdic.h which provides UTF-EBCDIC versions of the macros,
   and conditionally #include it.

EBCDIC build as yet untested.  ASCII still fails the one test.

p4raw-id: //depot/perlio@9185

utf8.c		diff \| blob \| blame \| history
utf8.h		diff \| blob \| blame \| history
utfebcdic.h	[new file with mode: 0644]	blob