1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.97 2003/07/08 21:52:14 dankogai Exp $
5 $Revision: 1.97 $ $Date: 2003/07/08 21:52:14 $
6 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
7 Encode test fixes for VMS by Peter Prymmer
8 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
9 ! lib/Encode/Alias.pm t/Aliases.t
10 koi-8 aliases bug detected and patched by sadahiro.
11 Further fix and test suite by dankogai
12 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
14 1.97 2003/07/08 21:52:14
15 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
16 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
17 $DEBUG replaced with DEBUG() so perl optimizes better,
18 by Rafael with further fixes by dankogai
19 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
20 ! lib/Encode/Alias.pm
21 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
22 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
23 So new hash seeding introduced in bleedperl works.
24 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
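
A minimal sketch of how the two patterns quoted above differ. The sample names are chosen for illustration only, and the link to hash seeding is a guess: the old pattern also matches "gb2312-raw", so the outcome presumably depended on the order in which aliases were tried.

```perl
use strict;
use warnings;

# The two patterns quoted in the entry above.
my $old = qr/\bGB[-_ ]?2312(?:\D.*$|$)/i;
my $new = qr/\bGB[-_ ]?2312(?!-?raw)/i;

# Sample names, chosen for illustration only.
my @names = ('GB2312', 'gb_2312-80', 'gb2312-raw');

# Returns a hash ref mapping each name to [old_matches, new_matches] as 0/1.
sub compare_patterns {
    my %result;
    for my $name (@_) {
        $result{$name} = [ ($name =~ $old ? 1 : 0), ($name =~ $new ? 1 : 0) ];
    }
    return \%result;
}

my $r = compare_patterns(@names);
printf "%-12s old=%d new=%d\n", $_, @{ $r->{$_} } for @names;
```

Only the new pattern leaves "gb2312-raw" alone while still matching the other two names.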
26 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
27 automatic utf(8|16|32) guessing -- originally by Autrijus
28 Message-Id: <20030626162731.GA2077@not.autrijus.org>
30 Addressed the following:
31 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
32 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
34 1.96 2003/06/18 09:29:02
35 ! lib/Encode/JP/JP.pm t/guess.t
36 m/(...)/ in void context then $1 is considered a Bad Thing
37 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
39 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
42 More typecast from maintperl@19739
43 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
45 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
46 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
48 Now skips for EBCDIC platform.
49 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
51 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
52 have more than one C<.> in their filenames." -- perlport.
53 Message-Id: <3ED79E01.8050401@mac.com>
55 Found and fixed the bug that -p,--perlqq does not work.
56 Induced by the change from Getopt::Std to Getopt::Long.
58 Addressed [cpan #2629] Wrong assumption in numeric comparison
59 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
60 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
61 lib/Encode/Encoding.pm t/perlio.t
62 ! API Change: ->new_sequence() => ->renew()
63 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
64 + Encode::XS and Encode::utf8 now support ->renew()
65 + Encode::Encoding now documents this with examples
66 - Non-XS (en|de)code stripped out of Encode::Unicode
67 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
69 1.95 2003/05/21 08:41:11
71 Since bogus entries were found in iso-8859-6, all entries are
72 re-generated once again out of
73 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
74 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
75 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
76 + lib/Encode/Unicode/UTF7.pm
77 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
78 lib/Encode/Supported.pod
79 UTF-7 support is now added. With this Encode now has all transcoding
80 methods in Unicode::String.
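
A short round-trip sketch of the UTF-7 support, assuming an Encode version that includes Encode::Unicode::UTF7. The sample string is illustrative only.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

my $text  = "Hi \x{3042}";            # ASCII plus HIRAGANA LETTER A
my $utf7  = encode("UTF-7", $text);   # octets restricted to 7-bit ASCII
my $again = decode("UTF-7", $utf7);   # back to the original characters

print "round trip ok\n" if $again eq $text;
```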
82 1.94 2003/05/10 18:13:59
83 ! lib/Encode/MIME/Header.pm
84 A more sophisticated solution for double-encoding by dankogai
85 ! lib/Encode/MIME/Header.pm AUTHORS
86 Two bugs fixed by Bjoern Jacke
87 * "Double Encoding" was not possible
88 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
89 * encode("MIME-Q") had UTF-8 flag on
90 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
91 ! lib/Encode/MIME/Header.pm AUTHORS
92 Two occurrences of "croak ()" fixed as "croak qq()".
93 Simon Cozens is added to AUTHORS as a result.
94 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
96 POD fixes that reflect enhancements by jhi
98 Two enhancements by jhi.
99 + Now uses Getopt::Long so it accepts long name options
100 (--from for -f, for example)
101 + New option: -r,--resolve
102 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
104 META.yml added upon request of Schwern
105 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
107 Enache Adrian removed upon request -- to live longer than Encode
108 and/or FreeBSD (toy-)?thread :)
109 Message-Id: <20030425015701.GA2069@ratsnest.hole>
111 "close STDOUT unless $^O eq 'freebsd';" once again relocated
112 to keep VMS happy in which case "$^O eq 'freebsd'" is required
113 to keep FreeBSD+thread happy. Sigh.
114 Message-Id: <3EA88ADC.3000300@mac.com>
116 1.93 2003/04/24 17:43:16
118 added "no warnings 'pack'" in for loop to keep bleedperl from
119 complaining "Character in 'C' format wrapped in pack".
121 More elegant perl core detection inspired by Ilya Zakharevich
122 (but further elaborated for general cases).
123 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
126 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
127 are less likely to be confused w/ ISO-8859-*
129 RT tests added (vendor encodings are exemplified)
130 -- that successfully found a flaw on iso-2022-kr before the patch.
131 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
132 decode("ISO-2022-KR") has been buggy but no one ever cited
133 that since no one seems to be using it. Bugs discovered by
135 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
136 ! lib/Encode/CN/HZ.pm t/perlio.t
137 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
138 so it adds test for HZ.
139 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
140 ! lib/Encode/Guess.pm
141 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
142 So long as the string contains \x{00}-\x{ff} it does not fail.
143 See perldoc for details.
144 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
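
A minimal sketch of the \x00-based guessing described above, assuming the default setting in which UTF auto-guessing is enabled. The input is illustrative: BOM-less UTF-16LE octets for an ASCII-only string.

```perl
use strict;
use warnings;
use Encode qw(encode);
use Encode::Guess;   # exports guess_encoding() by default

my $octets = encode("UTF-16LE", "hello");   # no BOM, but contains \x00 octets
my $guess  = guess_encoding($octets);

# guess_encoding() returns an encoding object on success and an error
# string otherwise, so check with ref() before calling methods on it.
print ref($guess) ? $guess->name : "failed: $guess", "\n";
```

With this input the guess should come back as the UTF-16LE encoding object.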
146 1.92 2003/03/31 03:27:27
147 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
148 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
149 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
150 but glibc-BIG5-2.1.2.ucm does.
151 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
152 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
153 Maps regenerated again but this time based upon
154 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
155 (But where is THE DOCUMENT by MICROSOFT?)
156 ! t/enc_module.t AUTHORS
157 failure with threaded Perl on FreeBSD addressed.
158 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
159 Message-Id: <20030322230131.GA813@ratsnest.hole>
160 ! lib/Encode/Guess.pm
163 Change 18989: Make the :bytes conditional on PerlIO.
164 further Modified by Dan Kogai
165 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
167 Change 18966: another fix for failing test on windows ("use encoding"
168 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
169 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
171 Change 18970: Hopefully this works also in Win32, by jhi
172 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
173 Change 18965: fix CJKT.t failures on windows due to incorrect
175 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
177 1.91 2003/03/09 20:07:37
179 even more proofread by jhi.
180 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
183 +use lib qw(t ext/Encode/t ../ext/Encode/t);
184 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
186 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
187 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
190 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
192 1.90 2003/03/09 17:32:43
195 Inaba-san has added a patch for perl 5.8.1 or later that makes
196 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
197 that. POD is further revised.
198 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
199 ! encoding.pm t/enc_module.t
200 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
201 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
205 signed vs. unsigned issue discovered by Craig on OpenVMS
206 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
207 ! encoding.pm AUTHORS
208 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
209 Because binmode() stacks layers instead of overwriting them, you have to use
210 ":raw :encoding()" in encoding.pm or you are in trouble when you
211 call encoding.pm multiple times. There are several workarounds
212 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
213 who was the first to address this problem was added to AUTHORS.
214 A test suite was added for this, which is a modified version
215 of SUGAWARA-san's scripts
216 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
218 1.88 2003/02/20 14:42:34
220 one signedness nit for Encode by jhi
221 <200302161933.h1GJX876018710@kosh.hut.fi>
223 VISCII map was incorrect; fixed by Sadahiro-san
224 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
225 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
226 You can't unlink files that are opened in cygwin but the last
227 file handle opened in t/enc_*.t left open. Patch submitted
228 by Yitzchak and he was added to AUTHORS.
229 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
231 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
232 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
234 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
235 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
237 1.87 2003/02/06 01:52:11
239 * Inaba "Sensei" Hirohito added (I thought I had done so long
240 ago but apparently I did not).
241 * SUZUKI Norio added for various and useful bug reports.
242 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
243 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
244 podchecked so all warnings are gone except for L<http://>.
245 ! encoding.pm t/enc_eucjp.t
246 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
247 * Filter option overhaul
249 ! Encode.pm Encode.xs encengine.c Encode/encode.h
250 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
251 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
252 without filter. podchecked by Dan Kogai.
253 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
254 ! lib/Encode/Alias.pm
255 decode('alias', $1) went wild because of local $_ in find_alias().
256 The evil local $_ is eradicated but that changes find_alias()
257 format for coderef aliasing. See Encode::Alias for details
258 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
260 1.86 2003/01/22 03:29:07
262 * Don't forget to canonize when you attempt an exact match!
263 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
264 * ${^ENCODING} exception is off for $] > 5.008
265 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
267 $] check commented out so it runs on 5.8.0
269 1.85 2003/01/21 22:19:14
271 ${^ENCODING} exception is now explicit rather than handled by regex.
272 + t/enc_eucjp.t t/enc_utf8.t
273 Test suite for the better "encoding" pragma support for bleedperl.
274 On 5.8.0, they will just be skipped.
276 1.84 2003/01/10 12:00:16
278 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
279 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
280 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
282 %_ fixes saves the resulting .so .05% smaller, by NC
283 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
285 Silence Encode on undef, by Andreas
286 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
287 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
289 s/regognised/recognised/ . British spelling left intact to pay
290 respect to two British Nicks :)
291 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
293 1.83 2002/11/18 17:28:49
294 ! Encode.xs lib/Encode/JIS7.pm
295 Even more patches from Inaba-san have been applied. With this
296 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
297 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
299 1.82 2002/11/14 23:06:12
301 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
302 later found in perls w/ -Dusemymalloc was fixed by NC.
303 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
305 1.81 2002/11/08 18:29:27
306 ! Encode.pm Encode.xs
307 Non-XS version of Encode::utf8 is back (with XS being default).
308 Encode::predefine_encodings(0) to turn off XS.
309 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
310 found that it was Test::More causing the problem, not Encode.
311 But I have already made it configurable so it may be useful in
313 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
315 The ingenious patch by Nicholas Clark that reduces shlib sizes by
316 50% with no penalty and backward compatibility preserved, is in.
317 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
319 1.80 2002/10/21 20:39:09
320 ! Encode.xs t/mime-header.t
321 Even more patches from NI-XS regarding Encode::utf8->decode().
322 And one more test to t/mime-header.t to prove it
323 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
325 1.79 2002/10/21 06:05:37
327 Further patches from NI-XS. Encode::utf8->decode() now checks the
328 value of utf8 flag of the argument. As a result, the fix to
329 lib/Encode/MIME/Header.pm is no longer necessary but since it did
330 no harm (even speedwise) I'll leave it unreverted.
331 ! ucm/cp949.ucm ucm/cp950.ucm
333 U+00AE REGISTERED SIGN
334 were missing as a result of 1.78. Discovered by Moriyama-san.
335 Moriyama-san has also developed a test script that compares
336 (en|de)coded results to the corresponding Win32 API result and
337 all cp9?? maps are now verified.
338 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
340 1.78 2002/10/20 15:44:00
341 ! lib/Encode/MIME/Header.pm
342 fixed so that it works with new Encode::utf8
343 ! Encode.pm Encode.xs
344 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
345 :encoding(UTF-8) to handle partial chars at end of buffers
347 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
348 ! lib/Encode/Supported.pod
349 More nitpickings applied.
352 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
353 tests were completely useless. To redeem that and get the peace of
354 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
355 ucm/*.ucm. Since this script takes too long to finish (30 seconds on
356 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
357 but you can easily run that by either renaming it or:
359 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
360 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
361 Other CJKT cp9?? also updated according to the URI below;
362 http://www.microsoft.com/typography/unicode/cscp.htm
363 + bin/ucmsort MANIFEST
364 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
365 to proper order. Intended for hardcore developers only.
366 ! ucm/cp932.ucm JP/JP.pm AUTHORS
367 CP932 mapping which was based upon the mapping file at unicode.org
368 was found obsolete by MORIYAMA Masayuki <msyk@mtg.biglobe.ne.jp>. He
369 has also supplied the patch so he was added to AUTHORS.
370 ! lib/Encode/Supported.pod
371 ISO-8859-11 != TIS 620
372 == TIS 620 + \xA0 ( )
374 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
376 1.77 2002/10/06 03:27:02
378 * Modified to accommodate an upcoming patch by Inaba-san that
379 will fix tr/// needing eval qq{}
380 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
382 * pod fixes/enhancements to reflect the changes above
383 ! lib/Encode/Alias.pm
384 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
385 Message-Id: <20021001015648.GB18710@not.autrijus.org>
387 1.76 2002/08/25 15:09:51
389 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
390 but naturally identical to previous version -- dankogai
392 Codepoint fixes -- autrijus
393 Message-Id: <20020805040236.GC5220@not.autrijus.org>
395 copied everything under perl-5.8.0/ext/Encode to make sure Encode
396 is in sync w/ perl core
398 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
399 Now that binmode(FH) does implicit ":bytes" revisit
400 the failing tests. The worrisome one is the Digest::MD5
401 test-- how will it fare in CRLF lands now?
403 From: Radu Greab <radu@netsoft.ro>
404 Date: Mon, 10 Jun 2002 00:40:34 +0300
405 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
406 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
407 ones which I cannot figure out.
408 ! lib/Encode/Alias.pm
409 Subject: [Encode PATCH] spurious warning
410 From: Nicholas Clark <nick@unfortu.net>
411 Date: Sun, 2 Jun 2002 20:26:22 +0100
412 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
414 1.75 2002/06/01 18:07:49
415 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
416 glibc compliance cited by Autrijus.
417 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
418 ! bin/enc2xs bin/piconv
419 Subject: Re: forewarning: usedevel and versiononly
420 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
422 1.74 2002/05/28 18:33:15
423 + ucm/null.ucm ucm/ctrl.ucm
424 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
425 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
426 Subject: Re: unicode -> &# notation
427 Message-ID: <20020518193704.GB40272@not.autrijus.org>
429 1.73 2002/05/28 17:26:18
430 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
431 Chris Nandor has fixed Encode so that it works w/ MacPerl --
432 at least w/ PPC (68k needs static linking which does not work due to
433 64k limit). pudge is added to AUTHORS (I'm surprised he was not
434 there in the list). Encode/Makefile_PL.e2x was additionally fixed
435 by dankogai to reflect changes in other Makefile.PL
436 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
438 Subject: Change 16746: -Mutf8 cleanup.
439 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
441 1.72 2002/05/20 15:49:56
443 Subject: [PATCH] Encode should be in perl-core library path
444 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
445 Message-Id: <20020520161201.A11019@alpha.hut.fi>
446 ! lib/Encode/MIME/Header.pm
447 Subject: [PATCH] Encode::MIME::Header
448 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
449 ! Encode/Makefile_PL.e2x
450 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
451 Message-Id: <20020519201031.GA1603@not.autrijus.org>
452 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
453 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
455 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
456 Subject: [PATCH] good day for WinCE port of perl.
457 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
460 Even more linting by Robin via @16532
462 Even more typecast by Sarathy in @16460
464 1.71 2002/05/07 16:22:42
466 even more typecasts by Robin
467 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
469 A very strange bug that was causing bogus ucm -> C table
470 generation was revealed by a UCM file that Andreas was
471 working on. This is the weirdest kind of bug I've encountered
472 in the course of Encode maintenance.
473 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
475 1.70 2002/05/06 10:26:48
477 Made more 'module-safe' in conjunction w/ 'no encoding'.
478 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
479 ! lib/Encode/Encoding.pm
480 'require Encode' because ->Define uses Encode::define_encoding();
481 problem and solution addressed by Miyagawa-kun
482 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
484 Cuts the frill to make djgpp happier, as suggested by Laszlo
485 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
487 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
488 Message-Id: <m3bsbug48n.fsf@anima.de>
490 1.69 2002/05/04 16:41:18
491 ! lib/Encode/MIME/Header.pm
492 Floating-point coerced for UNICOS (in integer arithmetics it folds
493 line one character too early). Verification by Mark is pending.
494 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
496 more doc patch from Elizabeth
497 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
498 ! Encode/Makefile_PL.e2x
499 More platform-independent patch from Benjamin
500 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
501 ! lib/Encode/Guess.pm AUTHORS
502 split regex fix by Graham Barr. Adds him to AUTHORS.
503 Message-Id: <20020504085419.E95940@valueclick.com>
504 ! Encode/Makefile_PL.e2x
505 enc2xs script discovery made smarter and more sensible, first cited
506 by Miyagawa-kun and further suggestions by Rafael and Andreas
507 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
508 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
510 1.68 2002/05/03 12:20:13
511 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
512 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
513 and Supported.pod modified to reflect the change. Elizabeth added
514 to Authors. And H.M. is also added for forwarding her patch among
515 other contributions (I was rather surprised to find his name was not
517 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
519 1.67 2002/05/02 07:33:09
521 Error message now consistent w/ perlqq (\N{U+} -> \x{})
522 done in perl@16308 but Philip linted me further. Now the error
523 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
524 ! lib/Encode/Guess.pm
525 Sanity check for happier -w by Autrijus
527 1.66 2002/05/01 05:41:06
528 ! Encode.xs t/fallback.t
529 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
530 while fallback is in effect. This even came with a welcome side-effect
531 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
532 also modified to test this.
533 And of course, the corresponding variables to UV[Xx]f are appropriately
534 cast. This should've concluded NI-XS homework.
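
A minimal sketch of warning while fallback is still in effect, written against a current Encode. The sample string and the warning handler are illustrative; WARN_ON_ERR is combined with LEAVE_SRC only so that the source string is not modified.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Collect warnings instead of printing them, for illustration.
my @warnings;
local $SIG{__WARN__} = sub { push @warnings, @_ };

my $src = "a\x{3042}b";   # U+3042 has no mapping in ascii

# WARN_ON_ERR without RETURN_ON_ERR: warn, then carry on with the
# default substitution character instead of stopping at the error.
my $out = encode("ascii", $src, Encode::WARN_ON_ERR() | Encode::LEAVE_SRC());

print "output: $out\n";
print "warnings: ", scalar(@warnings), "\n";
```

The whole string is still encoded, with the unmappable character replaced, and the warning is issued along the way.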
536 encode(undef) does warn again! Repented upon suggestion by NI-XS.
537 Document for undef vs. '' added
538 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
540 1.65 2002/04/30 16:13:37
542 encode(undef) no longer warns for C<Use of uninitialized value in
543 subroutine entry>. Suggested by Paul.
544 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
545 ! lib/Encode/Supported.pod
546 Encode::MIME::Header and Encode::Guess mentioned
547 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
548 ! lib/Encode/Guess.pm
549 POD fix by Miyagawa-kun
550 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
552 1.64 2002/04/29 06:54:06
554 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
555 encode("euc-jisx0213") are still DIFFERENT.
556 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
558 A few white spaces corrected by NI-XS via PerlIO integration to
560 Subject: Change 16247: Integrate perlio;
562 Document fixes by Andreas
563 Message-Id: <m3k7qsf1we.fsf@anima.de>
565 1.63 2002/04/27 18:59:50
566 ! lib/Encode/Encoding.pm
567 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
568 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
569 Make use of the Encode::Encoding base class!
570 And other cleanups in Encode.xs upon NI-XS suggestions
571 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
573 1.62 2002/04/27 11:17:39
575 encodings() now just check %ExtModule instead of eval{require}
576 all of them for ":all" to conserve more memory.
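
A small sketch of the call in question, assuming a stock Encode installation. Without an argument, encodings() lists what is already loaded; with ":all" it also lists encodings that can be loaded on demand.

```perl
use strict;
use warnings;
use Encode;

my @loaded = Encode->encodings();        # encodings already loaded
my @all    = Encode->encodings(":all");  # plus those available on demand

printf "%d loaded, %d available\n", scalar(@loaded), scalar(@all);
```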
578 more "%x" -> "%" UVxf stuff.
580 s/=over2/=over 2/g # oops.
582 1.61 2002/04/26 03:02:04
584 Now does decent tests besides use_ok()
585 ! lib/Encode/Guess.pm t/guess.t
586 UI streamlined, document added
588 various signed/unsigned mismatch nits (#16173)
589 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
591 POD: utf8-flag-related caveats added. A few sections completely
595 Thou shalt not assume %d works, either!
596 Robin Baker added to AUTHORS for this
597 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
599 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
601 1.60 2002/04/24 20:06:52
603 "Thou shalt not assume %x works." -- jhi
604 Message-Id: <20020424210618.E24347@alpha.hut.fi>
605 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL
606 To make low-memory build machines happy, now *.c is created for each *.ucm
607 (no table aggregation). You can still override this by setting
608 $ENV{AGGREGATE_TABLES}.
609 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
610 + lib/Encode/Guess.pm
611 + lib/Encode/JP/JIS7.pm
612 Encoding-autodetect (mainly for Japanese encoding) added. In a
613 course of development, JIS7.pm was improved.
614 + lib/Encode/MIME/Header.pm
615 + lib/Encode/Config.pm
616 MIME B/Q Header Encoding Added!
617 ! Encode.pm Encode.xs t/fallback.t
618 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
619 Message-Id: <20020424130709.GA14211@tanglefoot>
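
A minimal sketch of the two fallbacks, written against a current Encode. The sample string is illustrative only.

```perl
use strict;
use warnings;
use Encode qw(encode);

my $src = "A\x{3042}";   # U+3042 cannot be represented in ascii

# FB_HTMLCREF emits a decimal character reference,
# FB_XMLCREF a hexadecimal one.
my $html = encode("ascii", $src, Encode::FB_HTMLCREF());
my $xml  = encode("ascii", $src, Encode::FB_XMLCREF());

print "$html\n";   # A&#12354;
print "$xml\n";    # A&#x3042;
```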
621 1.59 2002/04/22 23:54:22
622 ! Encode.pm Encode.xs
623 needs_lines() and perlio_ok() are added to Internal encodings such
624 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
626 1.58 2002/04/22 23:54:22
628 s/MacChineseSimp/MacChineseTrad/ # ... oops.
633 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
634 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
635 t/(JP|KR).t are aggregated to t/CJKT.t
636 test data are all remade via bin/ucm2text.
637 And .... They are no longer skipped for -Uuseperlio !
639 1.57 2002/04/22 20:27:30
640 ! t/JP.t t/KR.t t/perlio.t
641 unless (find PerlIO::Layer 'perlio') ... line is back again.
642 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
643 need PerlIO. Sigh....
644 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
645 ->perlio_ok now does eval{ require PerlIO::encoding } there so
646 it correctly returns 1 when PerlIO::encoding is yet loaded.
648 perl-current patch #16072 reflected
650 1.56 2002/04/22 09:48:07
651 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
652 New PerlIO::encoding 0.04 compliance met
654 1.55 2002/04/22 03:43:05
655 ! Encode.pm Encode.xs Unicode/Unicode.pm
656 needs_lines() defined so Encode::Encoding is no longer needed
659 1.54 2002/04/22 02:50:01
660 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
661 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
662 now perlio_ok is true by default if PerlIO::encoding->VERSION is
663 0.03 or larger. POD in Encode::Encoding revised to reflect this.
664 Encode::XS and Encode::Unicode now have a perlio_ok() method.
665 ! lib/Encode/Supported.pod
666 s/UP-UX/HP-UX/ by jhi
667 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
668 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
669 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
670 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
671 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
672 ! lib/Encode/Supported.pod
673 Huge document fixes by Philip.
676 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
677 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
679 binmode() after "<:encoding" to make Win32 happy, by Mattia.
680 Mattia added to AUTHORS file
681 Message-Id: <3CC3150F.5798.22A05AE@localhost>
683 1.52 2002/04/20 23:43:47
685 TODO: is now SKIP:, as NI-XS requested. Also adds more
686 elaborate failure analysis.
688 A note on how to make sure of round-trip safety added to POD
689 section (so Autrijus is happier)
690 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
691 big5-(eten|hkscs) is round-trip safe again!
692 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
694 Typo fixes by Andreas
695 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
696 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
697 PerlIO coordination patches from NI-XS.
698 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
700 1.51 2002/04/20 09:58:23
702 Updated test suite by Autrijus so "make test" is happy again
703 Message-Id: <20020420082104.GA25037@not.autrijus.org>
705 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
707 TW/TW.pm TW/Makefile.PL
708 Updates by Autrijus. 'big5' is no longer a canonical name but an
709 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
710 Message-Id: <20020419195346.GA19597@not.autrijus.org>
712 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
713 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
715 PerlIO detection a little bit smarter; no longer uses eval qq{}
718 1.50 2002/04/19 06:13:02
719 ! Encode.pm Encode.xs Encode/encoding.h
721 New Fallback API implemented and documented. See "perldoc Encode"
723 ! lib/Encode/JP/JIS7.pm Encode.pm
724 + lib/Encode/PerlIO.pod t/perlio.t
725 API compliance met. However, it still does not work unless perlio
726 implements line buffer. See BUGS section in perldoc Encode::PerlIO
727 As a sensible workaround, perlio_ok() added to Encode.
729 ! lib/Encode/Supported.pod
731 Message-Id: <20020418174647.J8466@alpha.hut.fi>
733 Doc fixes from Autrijus
734 Message-Id: <20020418144131.GA10987@not.autrijus.org>
736 perlqq mode documented
738 + t/jisx0201.euc t/jisx0201.ref
739 ! t/jisx0208.euc t/jisx0208.ref
740 t/JP.t tests more rigorously and with other encodings
741 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
742 in jis0208 (used to contain jisx0201 part).
743 ! Encode/Makefile_PL.e2x
744 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
745 enc2xs and encode.h rather than hard-coded. This allows the resulting
746 module to be fully CPANizable.
747 ! encoding.pm t/JP.t t/KR.t
748 PerlIO detection simplified (checks %INC instead of eval{})
749 ! Encode.xs Encode/encode.h
750 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
751 - lib/Encode/Unicode.pm
752 (en|de)code_xs relocated to where it belongs. Source reindented
755 Additional (U8 *) cast added as suggested by jhi
756 Message-Id: <20020417165916.A28599@alpha.hut.fi>
758 1.42 Date: 2002/04/17
760 no-op module; Thought of adding a pod there but enc2xs has
765 correct mechanism to detect PerlIO::encoding layer installed.
767 PerlIO Layer detached.
769 1.41 2002/04/16 23:35:00
771 binmode(STDIN|STDOUT ...) done iff PerlIO is available
773 Cleaned up PerlIO skip conditions to prepare for the upcoming
774 Encode - PerlIO forking.
776 exported functions are now prototyped.
777 ! lib/Encode/CN/HZ.pm
780 fallback implemented # was /* FIXME */
781 affected programs revised to fit (only HZ was using the try-catch
782 approach which needed to be fixed for API-compliance).
784 ! Encode/KR/2022_KR.pm
786 can find =head1 NAME now, jhi
787 Message-Id: <20020416083059.V30639@alpha.hut.fi>
791 now compiles with fewer warnings with the pickiest compilers.
792 Suggested by Craig, fixed by Dan.
793 ! Encode/Makefile_PL.e2x
795 A bug that fails to find *.e2x in certain conditions fixed
797 1.40 2002/04/14 22:27:14
798 + Encode/ConfigLocal_PM.e2x
799 ! lib/Encode/Config.pm
801 "enc2xs -C" now generates/updates Encode::ConfigLocal.
802 ConfigLocal_PM.e2x is a skeleton thereof.
803 ! lib/Encode/Config.pm
805 "use Encode::CN::HZ;" was missing.
808 More rigorous tests added to test XS, especially on memory allocation.
810 ! lib/Encode/Unicode.pm
811 NI-S implemented an XS version -- merged
812 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
815 Source filter option added. With this option on, you can write
816 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
817 encodings. t/jperl.t enhanced to test this feature.
819 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
820 apply his suggestion, this degree of nitting is enough to add him
822 Message-Id: <3CB93223.291E5E2E@earthlink.net>
824 + lib/Encode/JP/JIS7.pm
825 - lib/Encode/JP/JIS.pm
826 - lib/Encode/JP/2022_JP.pm
827 - lib/Encode/JP/2022_JP1.pm
828 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
829 JIS7.pm for better maintainability and performance
831 Added caveat for non-ascii identifiers.
833 fixes by jhi, the original author of this pragmatic module.
834 Message-Id: <20020413231527.V1826@alpha.hut.fi>
836 1.34 2002/04/12 20:23:05 (Unreleased)
839 EBCDIC fixes addressed by jhi.
840 Message-Id: <20020412161844.D9383@alpha.hut.fi>
841 ! lib/Encode/Encoder.pm
842 POD fix by Miyagawa-kun
843 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
845 1.33 2002/04/10 22:28:40
847 Philip's mail address corrected.
850 ! lib/Encode/Encoder.pm
851 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
852 This adds Miyagawa-kun to AUTHORS.
853 * encoding() no longer exported by default but on demand
854 * t/Encoder.t updated to test all these
855 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
856 ! lib/Encode/Unicode.pm
857 ! lib/Encode/Supported.pm
858 Further doc fixes by Anton
860 1.32 2002/04/09 20:06:15
863 - ucm/macDevanaga.ucm Unicode Character Map
864 - ucm/macGujarati.ucm Unicode Character Map
865 - ucm/macGurmukhi.ucm Unicode Character Map
866 A utility to check integrity of .ucm files. t/bogus.ucm is a
867 ucm that is deliberately bogus. unused Indic mappings are removed
870 resolve_alias() added as suggested by jhi. Same as
871 find_encoding("alias")->name. For convenience. This one is
872 defined in Encode.pm instead of Alias.pm.
873 Message-Id: <20020409215846.H17022@alpha.hut.fi>
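A minimal sketch of the equivalence described in the resolve_alias() entry above, assuming Encode is installed and that 'latin1' is a defined alias (the alias is chosen only for illustration):

```perl
use strict;
use warnings;
use Encode qw(find_encoding resolve_alias);

# 'latin1' is an illustrative alias; any defined alias would do.
my $alias = 'latin1';

# The long way: look up the encoding object, then ask for its name.
my $via_find = find_encoding($alias)->name;

# The shortcut: resolve_alias() returns the canonical name directly.
my $via_resolve = resolve_alias($alias);

print "$alias resolves to $via_resolve\n";
print "same result either way\n" if $via_find eq $via_resolve;
```

resolve_alias() is imported on request here rather than relied on as a default export.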
875 Memory allocation bug detected during the devel of ucmlint -- fixed.
876 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
877 ! lib/Encode/Unicode.pm
878 valid_ucs2(0) is false but must be true.
879 3 patches from NI-S as follows. This also has fixed the incident
881 ! lib/Encode/Alias.pm
882 find_alias() recursion prevention
884 Checks for the patch above
885 ! t/Encode/Unicode.pm
886 An extra "F" that caused valid_ucs2() to return a bogus value fixed
887 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
888 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
889 2 Small Patches from jhi as follows:
891 Encode->encodings() lists in case-insensitive order (as it was)
893 -l option prints available encodings to STDOUT instead of STDERR
894 ! lib/Encode/Aliases.pm
895 s/defintion/definition/
896 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
898 ! lib/Encode/Supported.pod
899 ! lib/Encode/Unicode.pm
900 POD revise by Philip Newton. This adds Philip to AUTHORS list.
901 Thank you for the exact quote of Douglas Adams :)
902 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
904 1.31 2002/04/08 18:08:07
905 ! lib/Encode/Encoder.pm
907 Encode::Encoder, once just a placeholder of an idea, is now much more
908 practical. See t/Encode.t to find how practical it can be.
909 + lib/Encode/Config.pm
911 my false laziness at Encode.pm is fixed. Now %ExtModules are set
912 in Encode::Config and they are all literally, not programmatically
913 set. My false laziness was resulting in many encodings missing from
915 ! lib/Encode/Unicode.pm
917 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
918 so that it does not rely on Encode::Unicode for BOM values
919 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
921 1.30 2002/04/08 02:34:51
922 + lib/Encode/Encoder.pm
923 Object Oriented Encoder. I reckon something like this is in need.
926 ! lib/Encode/Supported.pod
927 * autoloading bug that prevented upper-case canonicals such as UTF-16
928 is fixed. Now even UTF/UCS are autoloaded!
929 * encodings() is now more intuitive.
930 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
932 * Obligatory fixes to the POD.
933 ! lib/Encode/Supported.pod
934 Patch from Anton applied.
935 Message-Id: <66641479.20020408033300@motor.ru>
937 ! lib/Encode/Unicode.pm
938 Cosmetic changes: "bless $obj, $class" => "bless $obj => $class"
940 1.28 2002/04/07 18:58:42
944 Just a MANIFEST for those missing files.
946 1.26 Date: 2002/04/07 15:22:04
949 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
950 And t/Aliases.t fixed to make sure they all exist.
952 1.25 2002/04/07 15:01:25 (Unreleased)
954 ! lib/Encode/Unicode.pm
957 - lib/Encode/UTF_EBCDIC.pm
958 - lib/Encode/Internal.pm
960 Integrated into Encode.pm as closures. That way "one package, one file"
961 rule is preserved yet fewer files to require.
963 commented out binmode(STDERR ...
972 ! Encode/Makefile_PL.e2x
973 Schwern's MM-compliance patch merged
974 Message-Id: <20020406082609.GA28758@blackrider>
976 ! lib/Encode/Unicode.pm
977 + lib/Encode/UTF_EBCDIC.pm
979 - lib/Encode/10646_1.pm
980 - lib/Encode/ucs2_le.pm
981 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
982 per-module based (en|de)code, I saved a number of .pm by
983 reorganizing it as per-object base (Well, this is what Encode::XS
984 does under the hood). See Encode::Unicode for details.
985 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
986 This module is used only on EBCDIC environments.
988 1.21 2002/04/05 14:46:34 (Not Released)
994 Are back to make Perl/Tk happy Smile, NI-S.
996 ! lib/Encode/Alias.pm
997 ! lib/Encode/Supported.pm
998 ! lib/Encode/10646_1.pm
999 ! lib/Encode/ucs2_le.pm
1000 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1001 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1002 updated to reflect changes.
1003 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1004 ! lib/Encode/Alias.pm
1005 ! lib/Encode/Supported.pm
1006 Anton's revision committed. Added Dan's own fixes as well.
1007 Message-Id: <159103166906.20020405161134@motor.ru>
1008 ! lib/Encode/Alias.pm
1010 < qr/^UCS2-le$/i => '"UCS-2"', );
1012 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1013 Sigh. Thank you, Anton.
1014 Message-Id: <14567692196.20020405062020@motor.ru>
1015 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1017 1.20 2002/04/04 19:50:52
1019 The last-minute addition. Just give it a try. Docs remain to be done.
1020 Not installed by default.
1021 ! lib/Encode/Supported.pod
1024 ! lib/Encode/Alias.pm
1025 ! lib/Encode/utf8.pm
1026 ! lib/Encode/10646_1.pm
1027 ! lib/Encode/ucs2_le.pm
1028 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
1029 unchanged but UTF-16BE is added as an alias. Implicit aliases
1030 moved to Encode::Alias so init_alias() works more as expected.
1031 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1032 Though pedantically wrong, this should make perl mongers happier.
1033 t/Alias.t is enhanced to test all these.
1034 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1036 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1037 of the codepoints are common. CJKT left as is because the saving is
1041 ! EBCDIC/Makefile.PL
1043 ! Encode/Makefile_PL.e2x
1047 ! Symbol/Makefile.PL
1051 All occurrences of _def.h replaced with .exh so djgpp works happily
1052 ever after! To credit this amazing discovery, Laszlo is now in
1054 Message-Id: <20020403181424.GA8778@freemail.hu>
1055 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1058 ! Encode/Makefile_PL.skel
1060 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1061 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1063 Two more tests added by jhi
1064 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1067 The showstopper fixed -- Memory reallocation bug was causing
1068 Encode::XS to fall into an infinite loop under certain conditions.
1069 t/grow.t tests that.
1070 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1075 ! lib/Encode/Supported.pod
1076 Vendor encodings rebuilt out of original map files at unicode.org.
1077 Indic languages such as MacDevanagari remain unsupported due to the
1078 shortcomings of encengine capabilities (they need algorithmic
1079 conversion and I have no knowledge of that!). Pods fixed for added
1081 Oh, macJapan.ucm renamed to macJapanese.ucm.
1082 macROMnn is macRomanian and macRUMnn is macRumanian.
1083 txt2ucm is a crude script that is used to convert them.
1085 Unicode Compound Characters (used extensively on Mac) supported
1087 Typo fixes and improvements by jhi
1088 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1090 1.11 2002/03/31 22:12:13
1094 Missing files from the MANIFEST fixed.
1095 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1096 Version incremented just to make CPAN happy.
1098 1.10 2002/03/31 21:32:42
1101 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1102 if you want. This should make Autrijus happy. Also, piconv
1103 is added to default install.
1106 Here-documented files that enc2xs generates are now exported
1107 to *.e2x. Much cleaner and easier to debug.
1109 encoding enhanced so you can make it act more like such
1110 (now prehistoric) "localized" variations of perl as Jperl.
1112 Further test for encoding.pm. Written in euc-jp
1115 Taken over from jhi.
1116 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1121 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1128 ! Encode/macIceland.ucm
1129 ! lib/Encode/Alias.pm
1130 ! lib/Encode/Supported.pod
1131 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1132 Message-Id: <m3lmcavhjt.fsf@anima.de>
1134 1.01 2002/03/29 20:59:39
1137 s/USE_SCRIPTS/MORE_SCRIPTS/
1139 installs enc2xs by default for external Encode:: modules in CPAN,
1140 such as Encode::HanExtra
1142 More sensible perl core detection via $ENV{PERL_CORE}
1144 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1146 Perl core detection via $^X =~ m/\bminiperl$/o
1147 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1149 1.00 Wed Mar 29 2002
1151 The version of all files is updated to 1.00 via "ci -f -l1.00",
1152 commemorating version 1.00. All files, including *.ucm are now
1153 under version control.
1156 encode.h moved to Encode/ so it will be installed for the later
1159 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1164 compile renamed to enc2xs.
1165 Affected Makefile.PL updated
1167 "Punt it. HanExtra can take care of that later." -- Autrijus
1168 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1171 ! Encode/ksc5601.ucm
1172 ! lib/Encode/CJKConstants.pm
1173 ! lib/Encode/KR/2022_KR.pm
1174 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1175 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1179 bin/ added for example scripts. They are not installed by default.
1180 To install them, "perl Makefile.PL USE_SCRIPTS".
1181 piconv is iconv reinvented in perl. In addition to all features
1182 of iconv, it also adds perlish features. See L<piconv/1> for more
1184 ! lib/Encode/Alias.pm
1185 qr/^ replaced with qr/\b so it directly matches locale names
1186 such as en_US.US-ASCII
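A small sketch of why the switch from `^` to `\b` matters for locale names. The two patterns below are hypothetical stand-ins for illustration, not the actual entries in Encode::Alias:

```perl
use strict;
use warnings;

# Hypothetical alias patterns, for illustration only.
my $anchored = qr/^US-?ascii$/i;     # old style: must match from the start
my $bounded  = qr/\bUS-?ascii$/i;    # new style: may start at a word boundary

# A bare charset name and a locale name that carries the charset as a suffix.
for my $name ('US-ASCII', 'en_US.US-ASCII') {
    printf "%-16s anchored=%s bounded=%s\n",
        $name,
        ($name =~ $anchored ? 'yes' : 'no'),
        ($name =~ $bounded  ? 'yes' : 'no');
}
```

The word boundary falls between the "." and the charset name, so the locale form matches without any extra stripping.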
1189 Patch by MJD to fix the following problem applied.
1190 Subject: [PATCH 5.7.3 Encode]
1191 Aliases.t not properly skipped when Encode extension not built
1192 Message-Id: <20020328091850.18677.qmail@plover.com>
1193 ! lib/Encode/KR/2022_KR.pm
1194 ! lib/Encode/CJKConstants.pm
1195 Another patch from Jungshik to make iso-2022-kr actually work
1196 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1197 ! Encode/Encode/euc-kr.ucm
1198 + Encode/Encode/johab.ucm
1199 ! Encode/Encode/ksc5601.ucm
1201 ! Encode/KR/Makefile.PL
1202 ! Encode/lib/Encode/Alias.pm
1204 Johab support and complete revision of Korean Encoding by Jungshik
1205 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1207 Revised to make up with now-dropped Encode::Details.
1208 - lib/Encode/Details.pod
1209 Dropped. Besides being obsolete, the topics are now covered in
1210 the respective pods.
1215 Korean aliases fixed thanks to Jungshik Shin
1216 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1217 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1219 =head1 NAME added to all modules to make buildtoc happy
1220 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1221 - lib/Encode/CJKguide.pod
1222 Too controversial and dropped from the dist. Will be available
1223 separately on the web.
1225 RCS tags added so table debugging gets easier (should that be
1226 needed! I hope they all stay 1.00!)
1227 + lib/Encode/CJKguide.pod
1228 A detailed guide to mainly, but not limited to, CJK multibyte
1231 + Encode/hp-roman8.ucm
1233 ! Encode/Supported.pod
1234 All occurrences of "roman8" replaced with "hp-roman8" to avoid
1236 ! Encode/Supported.pod
1239 Mac Encodings now comply with Inside Macintosh
1241 Test for '-raw' conventions added.
1243 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1244 ! Encode/gb12345.ucm
1246 ! Encode/ksc5601.ucm
1247 "-raw" appended to canonical names.
1248 File names stay unchanged thanks to UCM format.
1249 ! lib/Encode/CN/HZ.pm
1250 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1251 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1253 0.99 Tue Mar 26 2002
1254 - lib/Encode/JP/Const.pm
1255 + lib/Encode/CJKConstants.pm
1256 + lib/Encode/CN/2022_CN.pm
1257 + lib/Encode/KR/2022_KR.pm
1266 * Support for ISO-2022-KR and ISO-2022-CN added.
1268 * more t/*.{euc,ref} added, which were autogenerated by ucm2table
1269 * ucm2table autogenerates character table out of UCM files.
1272 - lib/Encode/Supports.pod
1273 + lib/Encode/Supported.pod
1274 Names reverted due to popular demand.
1275 8.3 rule applies only when there is a conflict.
1276 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1281 - lib/Encode/Format/Enc.pod
1283 * Character tables are now 100% ucm.
1284 * All files under Encode/ are now 8.3-compliant
1285 * some of the missing encodings added (e.g. gsm0338 and nextstep)
1286 * Vendor mappings aggregated with appropriate national std in
1287 Makefile.PL, resulting in smaller *.so especially for CJK.
1288 Following is result on Dan's FreeBSD box.
1290 ---------------------------------------------------------------
1291 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1292 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1293 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1294 blib/arch/auto/Encode/Encode.so 27,791 27,791
1295 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1296 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1297 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1298 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1299 ---------------------------------------------------------------
1300 Total 5,375,297 6,343,819
1302 * As a result of ucm-transition, Encode::Tcl dropped because
1303 Encode::Tcl demands *.enc.
1304 Encode::Tcl will be supplied in a separate tarball with *.enc.
1305 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1310 -lib/Encode/Supported.pod
1311 +lib/Encode/Supports.pod
1312 -lib/Encode/iso10646_1.pm
1313 +lib/Encode/10646_1.pm
1314 -lib/Encode/EncFormat.pod
1315 +lib/Encode/Format/Enc.pod
1316 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
1317 - lib/Encode/JP/Constants.pm
1318 + lib/Encode/JP/Consts.pm
1319 ! lib/Encode/JP/JIS.pm
1320 ! lib/Encode/JP/H2Z.pm
1321 Version nit problem and 8.3 rule fix.
1322 > Package namespace installed latest in CPAN file
1323 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1324 was noted by jhi; then Dan discovered "Constants.pm" does not comply with the 8.3
1325 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
1326 accordingly. In addition, legacy "use vars qw()..." are replaced with
1328 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1329 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1331 - lib/Encode/JP/ISO_2022_JP.pm
1332 - lib/Encode/JP/ISO_2022_JP_1.pm
1333 + lib/Encode/JP/2022_JP.pm
1334 + lib/Encode/JP/2022_JP1.pm
1336 8.3 naming conflict for vanilla FAT addressed by jhi
1337 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1340 Typecast fix addressed by jhi
1341 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1343 0.98 Mon Mar 25 2002
1344 ! lib/Encode/Supported.pod
1346 + lib/Encode/JP/ISO_2022_JP_1.pm
1347 ! lib/Encode/JP/ISO_2022_JP.pm
1348 ! lib/Encode/JP/JIS.pm
1350 Now Encode::JP is more strict on the difference between ISO-2022-JP
1351 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1352 makes Anton happier :) FYI the previous version implements
1353 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1354 ! lib/Encode/Supported.pod
1357 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1358 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1363 ! lib/Encode/Supported.pod
1364 pod fixes to replace F<http://...> with L<http://...>,
1365 as suggested by Autrijus in:
1366 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1367 ! lib/Encode/Supported.pod
1368 fixes and enhancements by Anton
1369 Message-Id: <10632060120.20020324103753@motor.ru>
1370 ! lib/Encode/Alias.pm
1371 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1372 added. Suggested by Anton then deobfuscated by Autrijus
1373 Message-Id: <20020324064455.GA3667@not.autrijus.org>
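A rough sketch of what the GB alias pattern above accepts. The regex is copied from the entry. It is applied by hand here rather than through define_alias(), and normalize() is a hypothetical helper that only approximates how the '"gb$1"' replacement is expanded:

```perl
use strict;
use warnings;

# Same pattern as in the entry above; nothing in Encode's alias table is touched.
my $gb = qr/^GB[- ]?(\d+)$/i;

# normalize() is a hypothetical helper for this sketch: it returns the
# lower-case "gbNNNN" form on a match and undef otherwise.
sub normalize {
    my ($name) = @_;
    return $name =~ $gb ? "gb$1" : undef;
}

for my $name ('GB2312', 'gb-12345', 'GB 2312', 'GBK') {
    my $canon = normalize($name);
    print "$name => ", (defined $canon ? $canon : '(no match)'), "\n";
}
```

Names without a trailing number, such as GBK, fall through and would need their own alias.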
1375 Further fix by Nicholas Clark
1376 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1377 - lib/EncodeFormat.pod
1378 + lib/Encode/EncFormat.pod
1380 File renamed as suggested by Autrijus
1382 ! lib/Encode/Details.pod
1383 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1384 ! Encode.pm Sun Mar 24 13:43:47 2002
1385 pod fixes by Autrijus.
1386 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1387 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1389 ! lib/Encode/Alias.pm
1391 now more EBCDIC conscious;
1392 %ExtModules on EBCDIC system excludes CJK so that you don't
1393 have to worry about the matched alias resulting cloaking.
1394 t/Alias.t also revised to reflect changes. Verified by jhi
1395 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1397 0.97 Sun Mar 24 2002
1401 EBCDIC detection mechanism installed as in JP/JP.pm
1402 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1405 ! EBCDIC/Makefile.PL
1408 ! Symbol/Makefile.PL
1410 Now all table files used by compile are postfixed '_t' to avoid
1411 namespace collisions in case insensitive file systems once and for all!
1413 Message-ID: <58290227735.20020323195659@familiehaase.de>
1415 Since the Encode::JP is unsupported under EBCDIC we
1416 cannot run this test (aliases as such should work fine) -- jhi
1417 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1419 duplicate occurrence of ascii.ucm and 8859-1.ucm
1420 causes MacOS X dyld to croak
1426 < chdir 't' if -d 't';
1428 > if (! -d 'blib' and -d 't'){ chdir 't' };
1429 When you are "make test"-ing on Encode/ directory, you must not
1430 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1431 remained unchanged. Also the situation detection was made simpler
1432 in t/JP.t, which was originally;
1433 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1435 "Use of uninitialized value in string eq at Encode.pm line 96."
1436 ! Symbol/Makefile.PL
1437 ! EBCDIC/Makefile.PL
1439 -- Problem on case insensitive file systems
1440 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1441 Message-ID: <88254111953.20020323095503@familiehaase.de>
1444 "So I think it's a bug in gcc, not perl. But it still needs to be
1446 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1447 Message-Id: <20020323170509.C96475@plum.flirble.org>
1449 0.96 Sat Mar 23 2002
1451 ! lib/Encode/Encoding.pm
1452 ! lib/Encode/Alias.pm
1453 ! lib/Encode/Supported.pod
1455 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1456 Message-ID: <20020322073908.GB10539@blackrider>
1459 "...I think we should include ISO 8859-1 as well." -- NI-S
1460 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1465 ! lib/Encode/Alias.pm
1466 alias definitions relocated to Encode::Alias so module autoloading
1467 works for aliases also.
1469 encodings() now accepts args to check ExtModules.
1473 + EBCDIC/Makefile.PL
1474 + Symbol/Makefile.PL
1478 Latin and single byte encodings are reorganized so they are
1479 demand-loaded like Encode::XX. Now only ascii is compiled into
1481 ! lib/Encode/Alias.pm
1482 for my $k (keys %hash){ delete $hash{$k}; }
1483 is deprecated; fixed.
1485 0.95 Fri Mar 22 2002
1486 In this update, pod rewrites and alias fixes are the main issues
1487 + lib/Encode/Supported.pod
1488 Describes supported encodings
1490 streamlined compiled-in encodings.
1491 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1493 + Encode/ibm-125?.ucm
1494 Added from the icu distribution with every occurrence of
1495 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
1496 some respect to icu staff, however.
1497 + lib/Encode/Alias.pm
1499 Alias definitions in Encode.pm relocated.
1502 packWARN patch from Paul Marquess via jhi
1503 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1504 Paul added to AUTHORS as a result.
1505 ! t/CJKalias.t -> t/Aliases.t
1506 Renamed. Checks even more aliases and alias overloading
1509 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1510 gbk => cp936 relocated to CN.pm
1512 Test::More with plans (by jhi)
1514 0.94 Thu Mar 21 2002
1515 + lib/Encode/Description.pod
1516 ! lib/Encode/Encoding.pm
1517 Now the pod in Encode.pm is abridged as programming references.
1518 lib/Encode/Description.pod contains the original, detailed description
1519 and Encode::Encoding explains how to write your own module to
1520 add new encodings. So far, lib/Encode/Description.pod contains
1521 the whole pod once in Encode.pm. This is intentional.
1523 Pod revisions by Anton Tagunov
1524 Message-Id: <517178431.20020320174824@motor.ru>
1526 All occurrences of Encode::Tcl::Extended removed including pod
1528 test now checks $encoding->name only; $encoding->{name} is
1529 no longer checked to find the canonical name.
1530 ! lib/Encode/JP/JIS.pm
1531 ! lib/Encode/JP/ISO_2022_JP.pm
1532 ->name() added to be more compliant with API
1538 Patch by Autrijus to add aliases to TW and fixes to POD
1539 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1541 SADAHIRO Tomoyuki added as should. My apologies.
1543 0.93 Wed Mar 20 2002
1544 * First release to be uploaded to CPAN. For prehistoric changes,
1545 please see Changes file of perl distribution as well as
1546 perl-unicode@perl.org archive, available at:
1547 http://archive.develooper.com/perl-unicode@perl.org/
1549 Changes Since 0.92 includes;
1554 + Mention to perl-unicode@perl.org added
1556 + Encoding aliases added so you can feed locale names
1557 and MIME Charset="" directly.
1558 - Mention to JISX0212 removed because it's fixed
1561 + Encoding aliases added. Note TW is left untouched because
1562 euc-tw is not implemented in TW but in Encode::HanExtra.
1563 Autrijus, you may fix Encode::HanExtra.
1565 + to test encode aliases added