git.shadowcat.co.uk Git - p5sagit/p5-mst-13.2.git/commit

author	Jarkko Hietaniemi <jhi@iki.fi>
	Sun, 28 Jan 2001 19:28:40 +0000 (19:28 +0000)
committer	Jarkko Hietaniemi <jhi@iki.fi>
	Sun, 28 Jan 2001 19:28:40 +0000 (19:28 +0000)
commit	f9a6324217cffea75ff769ddd313748c0613a128
tree	9fb5b4ade5877ba969d093cfe37ec605de62d8dc	tree \| snapshot
parent	9ee2bb1a7c54b1866ff07ab9c157254810ee5205	commit \| diff

Patch from Inaba Hiroto:
- canonical UTF-8 hash keys: if a key string for a hash is
  UTF8-on, try downgrade the string and use it if
  unicode::distinct is not in effect.
  For the task, I added a function bytes_from_utf8() to utf8.c.
  It might resemble utf8_to_bytes() but it is not convenient
  to the task.
  Made a test for it and added to t/op/each.t
- Changed do_print in doio.c to apply sv_utf8_(downgrade|upgrade) to
  the mortal copy of the argument SV.
  And changed t/io/utf8.t test 18 which expects print() to
  upgrade its argument.
- re-implement sv_eq with bytes_from_utf8()
- some bug fixes
  - tr/// does not handle UTF8 range (\x{}-\x{})
  - \ before raw UTF8 character produced
    "Malformed UTF-8 character" warning.
  - "\x{100}\N{CENT SIGN}" is Malformed.
    Added tests for these 3.
  - and one silly bug (by me) with qu operator.

p4raw-id: //depot/perl@8583

doio.c		diff \| blob \| blame \| history
embed.h		diff \| blob \| blame \| history
embed.pl		diff \| blob \| blame \| history
global.sym		diff \| blob \| blame \| history
hv.c		diff \| blob \| blame \| history
objXSUB.h		diff \| blob \| blame \| history
perlapi.c		diff \| blob \| blame \| history
pod/perlapi.pod		diff \| blob \| blame \| history
proto.h		diff \| blob \| blame \| history
sv.c		diff \| blob \| blame \| history
t/io/utf8.t		diff \| blob \| blame \| history
t/lib/charnames.t		diff \| blob \| blame \| history
t/op/each.t		diff \| blob \| blame \| history
t/op/tr.t		diff \| blob \| blame \| history
t/pragma/utf8.t		diff \| blob \| blame \| history
toke.c		diff \| blob \| blame \| history
utf8.c		diff \| blob \| blame \| history