# Revision history for Perl extension Encode.
#
-# $Id: Changes,v 1.72 2002/05/20 15:49:56 dankogai Exp dankogai $
+# $Id: Changes,v 1.95 2003/05/21 08:41:11 dankogai Exp $
#
+$Revision: 1.95 $ $Date: 2003/05/21 08:41:11 $
+! encoding.pm
+ Addressed [cpan #2629] Wrong assumption in numeric comparison
+ Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
+! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
+ lib/Encode/Encoding.pm t/perlio.t
+ ! API Change: ->new_sequence() => ->renew()
+ + Encode::Unicode makes use of it so it can handle BOM on PerlIO
+ + Encode::XS and Encode::utf8 now supports ->renew()
+ + Encode::Encoding now documents this with examples
+ - Non-XS (en|de)code stripped out of Encode::Unicode
+ Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
+
+1.95 2003/05/21 08:41:11
+! ucm/8859-*.ucm
+ Since bogus entries were found in iso-8859-6, all entries are
+ re-generated once again out of
+ http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
+ Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
+ Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
++ lib/Encode/Unicode/UTF7.pm
+! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
+ lib/Encode/Supported.pod
+ UTF-7 support is now added. With this Encode now has all transcoding
+ methods in Unicode::String.
+
+1.94 2003/05/10 18:13:59
+! lib/Encode/MIME/Header.pm
+ A more sophisticated solution for double-encoding by dankogai
+! lib/Encode/MIME/Header.pm AUTHORS
+ Two bugs fixed by Bjoern Jacke
+ * "Double Encoding" was not possible
+ i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
+ * encode("MIME-Q") had UTF-8 flag on
+ Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
+! lib/Encode/MIME/Header.pm AUTHORS
+ Two occurances of "croak ()" fixed as "croak qq()".
+ Simon Cozens is added to AUTHORS as a result.
+ Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
+! bin/piconv
+ POD fixes that reflect enhancements by jhi
+! bin/piconv
+ Two enhancements by jhi.
+ + Now uses Getopt::Long so it accepts long name options
+ (--from for -f, for example)
+ + New option: -r,--resolve
+ Message-Id: <20030505114149.GA227075@kosh.hut.fi>
+! MANIFEST META.yml
+ META.yml added upon request of Schwern
+ Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
+! AUTHORS
+ Enache Adrian removed upon request -- to live longer than Encode
+ and/or FreeBSD (toy-)?thread :)
+ Message-Id: <20030425015701.GA2069@ratsnest.hole>
+! t/enc_module.t
+ "close STDOUT unless $^O eq 'freebsd';" once again relocated
+ to keep VMS happy in which case "$^O eq 'freebsd'" is required
+ to keep FreeBSD+thread happy. Sigh.
+ Message-Id: <3EA88ADC.3000300@mac.com>
+
+1.93 2003/04/24 17:43:16
+! t/enc_eucjp.t
+ added "no warnings 'pack'" in for loop to keep bleedperl from
+ complaining "Character in 'C' format wrapped in pack".
+! Makefile.PL
+ More elegant perl core detection inspired by Ilya Zakharevich
+ (but further elaborated for general cases).
+! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
+ POD fixes.
+! t/euc-jp.ucm
+ like cp9??, \x80-\x9F (control + 0x80) are zapped so they
+ are less likely to be confused w/ ISO-8859-*
+! t/CJKT.t
+ RT tests added (vendor encodings are exemplified)
+ -- that successfully found a flaw on iso-2022-kr before the patch.
+! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
+ decode("ISO-2022-KR") has been buggy but no one ever sited
+ that since no one seems to be using it. Bugs discovered by
+ SADAHIRO-san
+ Message-Id: <20030416231757.A545.BQW10602@nifty.com>
+! lib/Encode/CN/HZ.pm t/perlio.t
+ HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
+ so it adds test for HZ.
+ Message-Id: <20030416231757.A545.BQW10602@nifty.com>
+! lib/Encode/Guess.pm
+ Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
+ So long as the string contains \x{00}-\x{ff} it does not fail.
+ See perldoc for details.
+ Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
+
+1.92 2003/03/31 03:27:27
+! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
+ Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
+ removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
+ but glibc-BIG5-2.1.2.ucm does.
+ Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
+! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
+ Maps regenerated again but this time based upon
+ http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
+ (But where is THE DOCUMENT by MICROSOFT?)
+! t/enc_module.t AUTHORS
+ failure with threaded Perl on FreeBSD addressed.
+ Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
+ Message-Id: <20030322230131.GA813@ratsnest.hole>
+! lib/Encode/Guess.pm
+ Some POD fixes.
+! t/CJKT.t
+ Change 18989: Make the :bytes conditional on PerlIO.
+ further Modified by Dan Kogai
+ <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
+! t/enc_module.t
+ Chnage 18966: another fix for failing test on windows ("use encoding"
+ puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
+ Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
+! t/CJKT.t
+ Change 18970: Hopefully this works also in Win32, by jhi
+ Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
+ Change 18965: fix CJKT.t failures on windows due to incorrect
+ binmode(), by gsar
+ Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
+
+1.91 2003/03/09 20:07:37
+! encoding.pm
+ even more proofread by jhi.
+ Message-Id: <20030309194323.GT20843@kosh.hut.fi>
+! t/enc_module.t
+ -use lib 't';
+ +use lib qw(t ext/Encode/t ../ext/Encode/t);
+ Message-Id: <20030309182057.GR20843@kosh.hut.fi>
+! AUTHORS
+ s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
+ Message-Id: <20030309181748.GP20843@kosh.hut.fi>
+! encoding.pm
+ s/logner/longer/
+ Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
+
+1.90 2003/03/09 17:32:43
+! encoding.pm
++ t/enc_data.t
+ Inaba-san has added a patch for perl 5.8.1 or later that makes
+ encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
+ that. POD is further revised.
+ Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
+! encoding.pm t/enc_module.t
+ encoding vs. ${^UNICODE} resolved. POD revised accordingly.
+ Message-Id: <20030306112940.GN20652@kosh.hut.fi>
+
+1.89 2003/02/28
+! Encode.xs
+ signed vs. unsigned issue discovered by Craig on OpenVM
+ Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
+! encoding.pm AUTHORS
++ t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
+ Because binmode() stacks layers instead of overwrite, you have to
+ ":raw :encoding()" in encoding.pm or your are in trouble when you
+ call encoding.pm multiple times. There are several workarounds
+ but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
+ who was the first to address this problem was added to AUTHORS.
+ The test suites was added for this, which is a modified version
+ of SUGAWARA-san's scripts
+ Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
+
+1.88 2003/02/20 14:42:34
+! Encode.xs
+ one signedness nit for Encode by jhi
+ <200302161933.h1GJX876018710@kosh.hut.fi>
+! ucm/viscii.ucm
+ VISCII map was incorrect; fixed by Sadahiro-san
+ Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
+! t/enc_eucjp.t t/enc_utf8.t AUTHORS
+ You can't unlink files that are opened in cygwin but the last
+ file handle opened in t/enc_*.t left open. Patch submitted
+ by Yitzchak and he was added to AUTHORS.
+ Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
+! t/CJKT.t
+ now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
+ Message-Id: <20030206104513.GA11081@kosh.hut.fi>
+! Unicode/Unicode.xs
+ For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
+ Message-Id: <20030206045153.GA6826@kosh.hut.fi>
+
+1.87 2003/02/06 01:52:11
+! AUTHORS
+ * Inaba "Sensei" Hirohito added (I thought I have done so a long
+ ago but apparently I did not).
+ * SUZUKI Norio added for verious and useful bug reports.
+! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
+ lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
+ podchecked so all warnings are gone except for L<http://>.
+! encoding.pm t/enc_eucjp.t
+ * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
+ * Filter option overhaul
+ * POD revision
+! Encode.pm Encode.xs encengine.c Encode/encode.h
+ lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
+ Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
+ without filter. podchecked by Dan Kogai.
+ Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
+! lib/Encode/Alias.pm
+ decode('alias', $1) went wild because of local $_ in find_alias()
+ the evil local $_ is eradicated but that changes find_alias()
+ format for coderef aliasing. See Encode::Alias for details
+ Message-Id: <200302051704.AA00042@kipp0.nifty.com>
+
+1.86 2003/01/22 03:29:07
+! encoding.pm
+ * Don't forget to canonize when you attempt an exact match!
+ Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
+ * ${^ENCODING} exception is off for $] > 5.008
+ Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
+! t/enc_utf8.t
+ $] check commented out so it runs on 5.8.0
+
+1.85 2003/01/21 22:19:14
+! encoding.pm
+ ${^ENCODING} exception is now explicit rather than handled by regex.
++ t/enc_eucjp.t t/enc_utf8.t
+ Test suite for the better "encoding" pragma support for bleedperl.
+ On 5.8.0, they will just be skipped.
+
+1.84 2003/01/10 12:00:16
+! encoding.pm
+ ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
+ (That is to prevent duplicate encoding first by IO then ${^ENCODING})
+ Message-Id: <20030108213737.GK331043@lyta.hut.fi>
+! Unicode/Unicode.xs
+ %_ fixes saves the resulting .so .05% smaller, by NC
+ Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
+! Encode.pm
+ Silence Encode on undef, by Andreas
+ Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
+ Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
+! Unicode/Unicode.xs
+ s/regognised/recognised/ . British spelling left intact to pay
+ respect to two British Nicks :)
+ Message-Id: <20021203020454.GK2274@kosh.hut.fi>
+
+1.83 2002/11/18 17:28:49
+! Encode.xs lib/Encode/JIS7.pm
+ Even more patches from Inaba-san has been applied. With this
+ patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
+ Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
+
+1.82 2002/11/14 23:06:12
+! Encode.xs
+ Encode::utf8 (XS Version) assertion botch first found in Cygwin,
+ later found in perls w/ -Dusemymalloc was fixed by NC.
+ Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
+
+1.81 2002/11/08 18:29:27
+! Encode.pm Encode.xs
+ Non-XS version of Encode::utf8 is back (with XS being default).
+ Encode::predefine_encodings(0) to turn off XS.
+ This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
+ found that it was Test::More causing the problem, not Encode.
+ But I have already made it configurable so it may be useful in
+ some rare cases....
+ Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
+! bin/enc2xs
+ The ingenious patch by Nicholas Clark that reduces shlib sizes by
+ 50% with no penalty and backward compatibility preserved, is in.
+ Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
+
+1.80 2002/10/21 20:39:09
+! Encode.xs t/mime-header.t
+ Even more patches from NI-XS regarding Encode::utf8->decode().
+ And one more test to t/mime-header.t to prove it
+ Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
+
+1.79 2002/10/21 06:05:37
+! Encode.xs
+ Further patches from NI-XS. Encode::utf8->decode() now checks the
+ value of utf8 flag of the argument. As a result, the fix to
+ lib/Encode/MIME/Header.pm is no longer neccessary but since it did
+ no harm (even speedwise) I'll leave it unreverted.
+! ucm/cp949.ucm ucm/cp950.ucm
+ U+20AC EURO SIGN
+ U+00AE REGISTERED SIGN
+ were missing as a result of 1.78. Discovered by Moriyama-san.
+ Moriyama-san has also developed a test script that compares
+ (en|de)coded results to the corresponding Win32 API result and
+ all cp9?? maps are now verified.
+ Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
+
+1.78 2002/10/20 15:44:00
+! lib/Encode/MIME/Header.pm
+ fixed so that it works with new Encode::utf8
+! Encode.pm Encode.xs
+ Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
+ :encoding(UTF-8) to handle partial chars at end of buffers
+ correctly.
+ Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
+! lib/Encode/Supported.pod
+ More nitpickings applied.
++ t/rt.pl MANIFEST
+! t/CJKT.t
+ Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
+ tests were completely useless. To redeem that and get the peace of
+ mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
+ ucm/*.ucm Since this script takes too long to finish (30 seconds on
+ PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
+ but you can easily run that by either renaming it or:
+ perl -Mblib t/rt.pl
+ Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
+! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
+ Other CJKT cp9?? also updated according to the URI below;
+ http://www.microsoft.com/typography/unicode/cscp.htm
++ bin/ucmsort MANIFEST
+ ucmsort is a crude utility that sorts CHARMAP entries in UCM files
+ to proper order. intended for hardcore develpers only.
+! ucm/cp932.ucm JP/JP.pm AUTHORS
+ CP932 mapping which was based upon the mapping file at unicode.org
+ was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
+ has also supplied the patch so he was added to AUTHORS.
+! lib/Encode/Supported.pod
+ ISO-8859-11 != TIS 620
+ == TIS 620 + \xA0 ( )
+ Message-Id:
+ <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
+
+1.77 2002/10/06 03:27:02
+! t/jperl.t
+ * Modified to accomodate up and comming patch by Inaba-san that
+ will fix tr/// needing eval qq{}
+ Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
+! encoding.pm
+ * pod fixes/enhancements to reflect the changes above
+! lib/Encode/Alias.pm
+ "Encode::TW is correct, Encode::Alias not." - /Autrijus/
+ Message-Id: <20021001015648.GB18710@not.autrijus.org>
+
+1.76 2002/08/25 15:09:51
+! t/big5-eten.utf
+ To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
+ but naturally identical to previous version -- dankogai
+! ucm/big5-eten.ucm
+ Codepoint fixes -- autrijus
+ Message-Id: <20020805040236.GC5220@not.autrijus.org>
+= *
+ copied everything under perl-5.8.0/ext/Encode to make sure Encode
+ is in sync w/ perl core
+! t/CJKT.t t/guess.t
+ Change 17175 by jhi@alpha on 2002/06/10 23:24:42
+ Now that binmode(FH) does implicit ":bytes" revisit
+ the failing tests. The worrisome one is the Digest::MD5
+ test-- how will it fare in CRLF lands now?
+! t/CJKT.t t/guess.t
+ From: Radu Greab <radu@netsoft.ro>
+ Date: Mon, 10 Jun 2002 00:40:34 +0300
+ Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
+ Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
+ ones which I cannot figure out.
+! lib/Encode/Alias.pm
+ Subject: [Encode PATCH] spurious warning
+ From: Nicholas Clark <nick@unfortu.net>
+ Date: Sun, 2 Jun 2002 20:26:22 +0100
+ Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
+
+1.75 2002/06/01 18:07:49
+! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
+ glibc compliance cited by Autrijus.
+ http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
+! bin/enc2xs bin/piconv
+ Subject: Re: forewarning: usedevel and versiononly
+ Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
+
+1.74 2002/05/28 18:33:15
++ ucm/null.ucm ucm/ctrl.ucm
+! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
+ "null" and "ascii-ctrl" encodings added upon the request of Autrijus
+ Subject: Re: unicode -> &# notation
+ Message-ID: <20020518193704.GB40272@not.autrijus.org>
+
+1.73 2002/05/28 17:26:18
+! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
+ Chris Nandor has fixed Encode so that it works w/ MacPerl --
+ at least w/ PPC (68k need static linking which does not work due to
+ 64k limit). pudge is added to AUTHORS (I'm surprised he was not
+ there in the list). Encode/Makefile_PL.e2x was additionally fixed
+ by dankogai to reflect changes in other Makefile.PL
+ Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
+! t/mime-header.t
+ Subject: Change 16746: -Mutf8 cleanup.
+ Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
-$Revision: 1.72 $ $Date: 2002/05/20 15:49:56 $
+1.72 2002/05/20 15:49:56
! Makefile.PL
Subject: [PATCH] Encode should be in perl-core library path
Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
Typo fixes and improvements by jhi
Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
-1.11 $Date: 2002/05/20 15:49:56 $
+1.11 2002/03/31 22:12:13
+ t/encoding.t
+ t/jperl.t
! MANIFEST