1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.1 2004/05/25 16:27:14 dankogai Exp dankogai $
6 $Revision: 2.1 $ $Date: 2004/05/25 16:27:14 $
7 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
8 "If someone thinks utf8::upgrade($1) should be croaked like
9 chom?p($1),please try the following patch for Encode.pm."
11 <20040522212704.C068.BQW10602@nifty.com>
13 2.0 2004/05/16 20:55:15
14 * version updated to 2.00
15 -- sorry, no big feature change. I just hate version 1.100 :)
18 addressed UTF-(8|32LE) + BOM misguessing
19 https://rt.cpan.org/Ticket/Display.html?id=6279
21 s/is_utif8/is_utf8/ in POD
22 ! Encode/lib/Encode/CN/HZ.pm
23 Fixes "make test" failure after the patch to pp_hot.c
25 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
27 From: autrijus@autrijus.org
28 Subject: [PATCH] "piconv -C 512" badly broken
29 Message-Id: <1072870210.769.5.camel@localhost>
31 1.99 2003/12/29 02:47:16
33 find_encoding("UTF-16BE")->encode("abc") now null terminates
34 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
36 prototype bug in decode_utf8() fixed
37 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
38 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
39 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
41 Merged from maintperl@21987
43 1.98 2003/08/20 11:15:31
44 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
45 Dave Evans has found and corrected a bug in Encode::MIME::Header.
46 Test suite added by Dan Kogai.
47 Message-Id: <3F43440B.7060606@rudolf.org.uk>
49 Typo fixes rolled back in from bleedperl
51 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
52 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
54 argv case nit for VMS by Craig
55 Message-ID: <3F2B02DE.10207@mac.com>
56 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
57 Encode test fixes for VMS by Peter Prymmer
58 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
59 ! lib/Encode/Alias.pm t/Aliases.t
60 koi-8 aliases bug detected and patched by sadahiro.
61 Further fix and test suite by dankogai
62 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
64 1.97 2003/07/08 21:52:14
65 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
66 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
67 $DEBUG replaced with DEBUG() so perl optimizes better,
68 by Rafael with further fixes by dankogai
69 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
70 ! lib/Encode/Aliases.pm
71 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
72 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
73 So new hash seeding introduced in bleedperl works.
74 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
76 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
77 automatic utf(8|16|32) guessing -- originally by Autrijus
78 Message-Id: <20030626162731.GA2077@not.autrijus.org>
80 Addressed the following;
81 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
82 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
84 1.96 2003/06/18 09:29:02
85 ! lib/Encode/JP/JP.pm t/guess.t
86 m/(...)/ in void context then $1 is considered a Bad Thing
87 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
89 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
92 More typecast from maintperl@19739
93 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
95 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
96 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
98 Now skips for EBCDIC platform.
99 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
101 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
102 have more than one C<.> in their filenames." -- perlport.
103 Message-Id: <3ED79E01.8050401@mac.com>
105 Found and fixed the back that -p,--perlqq does not work.
106 Induced by the change from Getopt::Std to Getopt::Long.
108 Addressed [cpan #2629] Wrong assumption in numeric comparison
109 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
110 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
111 lib/Encode/Encoding.pm t/perlio.t
112 ! API Change: ->new_sequence() => ->renew()
113 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
114 + Encode::XS and Encode::utf8 now supports ->renew()
115 + Encode::Encoding now documents this with examples
116 - Non-XS (en|de)code stripped out of Encode::Unicode
117 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
119 1.95 2003/05/21 08:41:11
121 Since bogus entries were found in iso-8859-6, all entries are
122 re-generated once again out of
123 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
124 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
125 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
126 + lib/Encode/Unicode/UTF7.pm
127 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
128 lib/Encode/Supported.pod
129 UTF-7 support is now added. With this Encode now has all transcoding
130 methods in Unicode::String.
132 1.94 2003/05/10 18:13:59
133 ! lib/Encode/MIME/Header.pm
134 A more sophisticated solution for double-encoding by dankogai
135 ! lib/Encode/MIME/Header.pm AUTHORS
136 Two bugs fixed by Bjoern Jacke
137 * "Double Encoding" was not possible
138 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
139 * encode("MIME-Q") had UTF-8 flag on
140 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
141 ! lib/Encode/MIME/Header.pm AUTHORS
142 Two occurances of "croak ()" fixed as "croak qq()".
143 Simon Cozens is added to AUTHORS as a result.
144 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
146 POD fixes that reflect enhancements by jhi
148 Two enhancements by jhi.
149 + Now uses Getopt::Long so it accepts long name options
150 (--from for -f, for example)
151 + New option: -r,--resolve
152 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
154 META.yml added upon request of Schwern
155 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
157 Enache Adrian removed upon request -- to live longer than Encode
158 and/or FreeBSD (toy-)?thread :)
159 Message-Id: <20030425015701.GA2069@ratsnest.hole>
161 "close STDOUT unless $^O eq 'freebsd';" once again relocated
162 to keep VMS happy in which case "$^O eq 'freebsd'" is required
163 to keep FreeBSD+thread happy. Sigh.
164 Message-Id: <3EA88ADC.3000300@mac.com>
166 1.93 2003/04/24 17:43:16
168 added "no warnings 'pack'" in for loop to keep bleedperl from
169 complaining "Character in 'C' format wrapped in pack".
171 More elegant perl core detection inspired by Ilya Zakharevich
172 (but further elaborated for general cases).
173 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
176 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
177 are less likely to be confused w/ ISO-8859-*
179 RT tests added (vendor encodings are exemplified)
180 -- that successfully found a flaw on iso-2022-kr before the patch.
181 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
182 decode("ISO-2022-KR") has been buggy but no one ever sited
183 that since no one seems to be using it. Bugs discovered by
185 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
186 ! lib/Encode/CN/HZ.pm t/perlio.t
187 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
188 so it adds test for HZ.
189 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
190 ! lib/Encode/Guess.pm
191 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
192 So long as the string contains \x{00}-\x{ff} it does not fail.
193 See perldoc for details.
194 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
196 1.92 2003/03/31 03:27:27
197 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
198 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
199 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
200 but glibc-BIG5-2.1.2.ucm does.
201 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
202 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
203 Maps regenerated again but this time based upon
204 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
205 (But where is THE DOCUMENT by MICROSOFT?)
206 ! t/enc_module.t AUTHORS
207 failure with threaded Perl on FreeBSD addressed.
208 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
209 Message-Id: <20030322230131.GA813@ratsnest.hole>
210 ! lib/Encode/Guess.pm
213 Change 18989: Make the :bytes conditional on PerlIO.
214 further Modified by Dan Kogai
215 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
217 Chnage 18966: another fix for failing test on windows ("use encoding"
218 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
219 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
221 Change 18970: Hopefully this works also in Win32, by jhi
222 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
223 Change 18965: fix CJKT.t failures on windows due to incorrect
225 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
227 1.91 2003/03/09 20:07:37
229 even more proofread by jhi.
230 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
233 +use lib qw(t ext/Encode/t ../ext/Encode/t);
234 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
236 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
237 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
240 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
242 1.90 2003/03/09 17:32:43
245 Inaba-san has added a patch for perl 5.8.1 or later that makes
246 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
247 that. POD is further revised.
248 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
249 ! encoding.pm t/enc_module.t
250 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
251 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
255 signed vs. unsigned issue discovered by Craig on OpenVM
256 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
257 ! encoding.pm AUTHORS
258 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
259 Because binmode() stacks layers instead of overwrite, you have to
260 ":raw :encoding()" in encoding.pm or your are in trouble when you
261 call encoding.pm multiple times. There are several workarounds
262 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
263 who was the first to address this problem was added to AUTHORS.
264 The test suites was added for this, which is a modified version
265 of SUGAWARA-san's scripts
266 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
268 1.88 2003/02/20 14:42:34
270 one signedness nit for Encode by jhi
271 <200302161933.h1GJX876018710@kosh.hut.fi>
273 VISCII map was incorrect; fixed by Sadahiro-san
274 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
275 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
276 You can't unlink files that are opened in cygwin but the last
277 file handle opened in t/enc_*.t left open. Patch submitted
278 by Yitzchak and he was added to AUTHORS.
279 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
281 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
282 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
284 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
285 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
287 1.87 2003/02/06 01:52:11
289 * Inaba "Sensei" Hirohito added (I thought I have done so a long
290 ago but apparently I did not).
291 * SUZUKI Norio added for verious and useful bug reports.
292 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
293 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
294 podchecked so all warnings are gone except for L<http://>.
295 ! encoding.pm t/enc_eucjp.t
296 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
297 * Filter option overhaul
299 ! Encode.pm Encode.xs encengine.c Encode/encode.h
300 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
301 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
302 without filter. podchecked by Dan Kogai.
303 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
304 ! lib/Encode/Alias.pm
305 decode('alias', $1) went wild because of local $_ in find_alias()
306 the evil local $_ is eradicated but that changes find_alias()
307 format for coderef aliasing. See Encode::Alias for details
308 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
310 1.86 2003/01/22 03:29:07
312 * Don't forget to canonize when you attempt an exact match!
313 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
314 * ${^ENCODING} exception is off for $] > 5.008
315 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
317 $] check commented out so it runs on 5.8.0
319 1.85 2003/01/21 22:19:14
321 ${^ENCODING} exception is now explicit rather than handled by regex.
322 + t/enc_eucjp.t t/enc_utf8.t
323 Test suite for the better "encoding" pragma support for bleedperl.
324 On 5.8.0, they will just be skipped.
326 1.84 2003/01/10 12:00:16
328 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
329 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
330 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
332 %_ fixes saves the resulting .so .05% smaller, by NC
333 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
335 Silence Encode on undef, by Andreas
336 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
337 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
339 s/regognised/recognised/ . British spelling left intact to pay
340 respect to two British Nicks :)
341 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
343 1.83 2002/11/18 17:28:49
344 ! Encode.xs lib/Encode/JIS7.pm
345 Even more patches from Inaba-san has been applied. With this
346 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
347 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
349 1.82 2002/11/14 23:06:12
351 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
352 later found in perls w/ -Dusemymalloc was fixed by NC.
353 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
355 1.81 2002/11/08 18:29:27
356 ! Encode.pm Encode.xs
357 Non-XS version of Encode::utf8 is back (with XS being default).
358 Encode::predefine_encodings(0) to turn off XS.
359 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
360 found that it was Test::More causing the problem, not Encode.
361 But I have already made it configurable so it may be useful in
363 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
365 The ingenious patch by Nicholas Clark that reduces shlib sizes by
366 50% with no penalty and backward compatibility preserved, is in.
367 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
369 1.80 2002/10/21 20:39:09
370 ! Encode.xs t/mime-header.t
371 Even more patches from NI-XS regarding Encode::utf8->decode().
372 And one more test to t/mime-header.t to prove it
373 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
375 1.79 2002/10/21 06:05:37
377 Further patches from NI-XS. Encode::utf8->decode() now checks the
378 value of utf8 flag of the argument. As a result, the fix to
379 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
380 no harm (even speedwise) I'll leave it unreverted.
381 ! ucm/cp949.ucm ucm/cp950.ucm
383 U+00AE REGISTERED SIGN
384 were missing as a result of 1.78. Discovered by Moriyama-san.
385 Moriyama-san has also developed a test script that compares
386 (en|de)coded results to the corresponding Win32 API result and
387 all cp9?? maps are now verified.
388 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
390 1.78 2002/10/20 15:44:00
391 ! lib/Encode/MIME/Header.pm
392 fixed so that it works with new Encode::utf8
393 ! Encode.pm Encode.xs
394 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
395 :encoding(UTF-8) to handle partial chars at end of buffers
397 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
398 ! lib/Encode/Supported.pod
399 More nitpickings applied.
402 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
403 tests were completely useless. To redeem that and get the peace of
404 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
405 ucm/*.ucm Since this script takes too long to finish (30 seconds on
406 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
407 but you can easily run that by either renaming it or:
409 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
410 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
411 Other CJKT cp9?? also updated according to the URI below;
412 http://www.microsoft.com/typography/unicode/cscp.htm
413 + bin/ucmsort MANIFEST
414 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
415 to proper order. intended for hardcore develpers only.
416 ! ucm/cp932.ucm JP/JP.pm AUTHORS
417 CP932 mapping which was based upon the mapping file at unicode.org
418 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
419 has also supplied the patch so he was added to AUTHORS.
420 ! lib/Encode/Supported.pod
421 ISO-8859-11 != TIS 620
422 == TIS 620 + \xA0 ( )
424 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
426 1.77 2002/10/06 03:27:02
428 * Modified to accomodate up and comming patch by Inaba-san that
429 will fix tr/// needing eval qq{}
430 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
432 * pod fixes/enhancements to reflect the changes above
433 ! lib/Encode/Alias.pm
434 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
435 Message-Id: <20021001015648.GB18710@not.autrijus.org>
437 1.76 2002/08/25 15:09:51
439 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
440 but naturally identical to previous version -- dankogai
442 Codepoint fixes -- autrijus
443 Message-Id: <20020805040236.GC5220@not.autrijus.org>
445 copied everything under perl-5.8.0/ext/Encode to make sure Encode
446 is in sync w/ perl core
448 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
449 Now that binmode(FH) does implicit ":bytes" revisit
450 the failing tests. The worrisome one is the Digest::MD5
451 test-- how will it fare in CRLF lands now?
453 From: Radu Greab <radu@netsoft.ro>
454 Date: Mon, 10 Jun 2002 00:40:34 +0300
455 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
456 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
457 ones which I cannot figure out.
458 ! lib/Encode/Alias.pm
459 Subject: [Encode PATCH] spurious warning
460 From: Nicholas Clark <nick@unfortu.net>
461 Date: Sun, 2 Jun 2002 20:26:22 +0100
462 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
464 1.75 2002/06/01 18:07:49
465 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
466 glibc compliance cited by Autrijus.
467 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
468 ! bin/enc2xs bin/piconv
469 Subject: Re: forewarning: usedevel and versiononly
470 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
472 1.74 2002/05/28 18:33:15
473 + ucm/null.ucm ucm/ctrl.ucm
474 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
475 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
476 Subject: Re: unicode -> &# notation
477 Message-ID: <20020518193704.GB40272@not.autrijus.org>
479 1.73 2002/05/28 17:26:18
480 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
481 Chris Nandor has fixed Encode so that it works w/ MacPerl --
482 at least w/ PPC (68k need static linking which does not work due to
483 64k limit). pudge is added to AUTHORS (I'm surprised he was not
484 there in the list). Encode/Makefile_PL.e2x was additionally fixed
485 by dankogai to reflect changes in other Makefile.PL
486 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
488 Subject: Change 16746: -Mutf8 cleanup.
489 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
491 1.72 2002/05/20 15:49:56
493 Subject: [PATCH] Encode should be in perl-core library path
494 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
495 Message-Id: <20020520161201.A11019@alpha.hut.fi>
496 ! lib/Encode/MIME/Header.pm
497 Subject: [PATCH] Encode::MIME::Header
498 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
499 ! Encode/Makefile_PL.e2x
500 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
501 Message-Id: <20020519201031.GA1603@not.autrijus.org>
502 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
503 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
505 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
506 Subject: [PATCH] good day for WinCE port of perl.
507 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
510 Even more linting by Robin via @16532
512 Even more typecast by Sarathy in @16460
514 1.71 2002/05/07 16:22:42
516 even more typecasts by Robin
517 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
519 A very strange bug that was causing a bugus ucm -> C table
520 generation that was revealed by a UCM file that Andreas was
521 working. This is the king of wierdest bug I've encountered
522 in the course of Encode maintenance.
523 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
525 1.70 2002/05/06 10:26:48
527 Made more 'module-safe' with conjunction w/ 'no encoding'.
528 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
529 ! lib/Encode/Encoding.pm
530 'require Encode' because ->Define uses Encode::define_encoding();
531 problem and solution addressed by Miyagawa-kun
532 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
534 Cuts the frill to make djgpp happier, as suggested by Laszlo
535 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
537 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
538 Message-Id: <m3bsbug48n.fsf@anima.de>
540 1.69 2002/05/04 16:41:18
541 ! lib/Encode/MIME/Header
542 Floating-point coerced for UNICOS (in integer arithmetics it folds
543 line one character too early). Verification by Mark is pending.
544 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
546 more doc patch from Elizabeth
547 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
548 ! Encode/Makefile_PL.e2x
549 More platform-independent patch from Benjamin
550 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
551 ! lib/Encode/Guess AUTHORS
552 split regex fix by Graham Barr. Adds him to AUTHORS.
553 Message-Id: <20020504085419.E95940@valueclick.com>
554 ! Encode/Makefile_PL.e2x
555 enc2xs script discovery made smarter and more sensible, first cited
556 by Miyagawa-kun and further suggestions by Rafael and Andreas
557 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
558 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
560 1.68 2002/05/03 12:20:13
561 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
562 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
563 and Supported.pod modified to reflect the change. Elizabeth added
564 to Authors. And H.M. is also added for forwarding her patch among
565 other contributions (I was rather surprised to find his name was not
567 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
569 1.67 2002/05/02 07:33:09
571 Error message now consistent w/ perlqq (\N{U+} -> \x{})
572 done in perl@16308 but Philip linted me further. Now the error
573 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
574 ! lib/Encode/Guess.pm
575 Sanity check for happier -w by Autrijus
577 1.66 2002/05/01 05:41:06
578 ! Encode.xs t/fallback.t
579 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
580 while fallback is in effect. This even came with a welcome side-effect
581 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
582 also modified to test this.
583 And of course, the corresponding varialbles to UV[Xx]f are appropriately
584 cast. This should've concluded NI-XS homework.
586 encode(undef) does warn again! Repented upon suggestion by NI-XS.
587 Document for unless vs. '' added
588 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
590 1.65 2002/04/30 16:13:37
592 encode(undef) no longer warns for C<Use of uninitialized value in
593 subroutine entry>. Suggested by Paul.
594 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
595 ! lib/Encode/Supported.pod
596 Encode::MIME::Header and Encode::Guess mentioned
597 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
598 ! lib/Encode/Guess.pm
599 POD fix by Miyagawa-kun
600 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
602 1.64 2002/04/29 06:54:06
604 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
605 encocde("euc-jisx0213") are still DIFFERENT.
606 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
608 A few white spaces corrected by NI-XS via PerlIO integration to
610 Subject: Change 16247: Integrate perlio;
612 Document fixes by Andreas
613 Message-Id: <m3k7qsf1we.fsf@anima.de>
615 1.63 2002/04/27 18:59:50
616 ! lib/Encode/Encoding.pm
617 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
618 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
619 Make use of the Encode::Encoding base class!
620 And other cleanups in Encode.xs upon NI-XS suggestions
621 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
623 1.62 2002/04/27 11:17:39
625 encodings() now just check %ExtModule instead of eval{require}
626 all of them for ":all" to conserve more memory.
628 more "%x" -> "%" UVxf stuff.
630 s/=over2/=over 2/g # oops.
632 1.61 2002/04/26 03:02:04
634 Now does decent tests besides use_ok()
635 ! lib/Encode/Guess.pm t/guess.t
636 UI streamlined, document added
638 various signed/unsigned mismatch nits (#16173)
639 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
641 POD: utf8-flag-related caveats added. A few sections completely
645 Thou shalt not assume %d works, either!
646 Robin Baker added to AUTHORS for this
647 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
649 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
651 1.60 2002/04/24 20:06:52
653 "Thou shalt not assume %x works." -- jhi
654 Message-Id: <20020424210618.E24347@alpha.hut.fi>
655 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
656 low-memory build machines happy, now *.c is created for each *.ucm
657 (no table aggregation). You can still override this by setting
658 $ENV{AGGREGATE_TABLES}.
659 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
660 + lib/Encode/Guess.pm
661 + lib/Encode/JP/JIS7.pm
662 Encoding-autodetect (mainly for Japanese encoding) added. In a
663 course of development, JIS7.pm was improved.
664 + lib/Encode/HTML/Header.pm
665 + lib/Encode/Config.pm
666 MIME B/Q Header Encoding Added!
667 ! Encode.pm Encode.xs t/fallback.t
668 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
669 Message-Id: <20020424130709.GA14211@tanglefoot>
671 1.59 $ 2002/04/22 23:54:22
672 ! Encode.pm Encode.xs
673 needs_lines() and perlio_ok() are added to Internal encodings such
674 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
676 1.58 2002/04/22 23:54:22
678 s/MacChineseSimp/MacChineseTrad/ # ... oops.
683 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
684 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
685 t/(JP|KR).t are aggregated to t/CJKT.t
686 test data are all remade via bin/ucm2text.
687 And .... They are no longer skipped for -Uuseperlio !
689 1.57 2002/04/22 20:27:30
690 ! t/JP.t t/KR.t t/perlio.t
691 unless (find PerlIO::Layer 'perlio') ... line is back again.
692 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
693 need PerlIO. Sigh....
694 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
695 ->perlio_ok now does eval{ require PerlIO::encoding } there so
696 it correctly returns 1 when PerlIO::encoding is yet loaded.
698 perl-current patch #16072 reflected
700 1.56 2002/04/22 09:48:07
701 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
702 New PerlIO::encoding 0.04 compliance met
704 1.55 2002/04/22 03:43:05
705 ! Encode.pm Encode.xs Unicode/Unicode.pm
706 needs_lines() defined so Encode::Encoding is no longer needed
709 1.54 2002/04/22 02:50:01
710 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
711 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
712 now perlio_ok is true by default if PerlIO::encoding->VERSION is
713 0.03 or larger. POD in Encode::Encoding revised to reflect this.
714 Encode::XS and Encode::Unicode now has perlio_ok() method.
715 ! lib/Encode/Supported.pod
716 s/UP-UX/HP-UX/ by jhi
717 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
718 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
719 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
720 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
721 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
722 ! lib/Encode/Supported.pod
723 Huge document fixes by Philip.
726 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
727 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
729 binmode() after "<:encoding" to make Win32 happy, by Mattia.
730 Mattia added to AUTHORS file
731 Message-Id: <3CC3150F.5798.22A05AE@localhost>
733 1.52 2002/04/20 23:43:47
735 TODO: is now SKIP:, as NI-XS requested. Also adds more
736 eraborate failure analysis added.
738 A note on how to make sure of round-trip safety added to POD
739 section (so Autrijus is happier)
740 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
741 big5-(eten|hkscs) is round-trip safe again!
742 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
744 Typo fixes by Andreas
745 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
746 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
747 PerIO coodination patches from NI-XS.
748 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
750 1.51 2002/04/20 09:58:23
752 Updated test suite by Autrijis so "make test" is happy again
753 Message-Id: <20020420082104.GA25037@not.autrijus.org>
755 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
757 TW/TW.pm TW/Makefile.PL
758 Updates by Autrijus. 'big5' is no longer a canonical but an
759 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
760 Message-Id: <20020419195346.GA19597@not.autrijus.org>
762 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
763 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
765 PerlIO detection a little bit smarter; no longer uses eval qq{}
768 1.50 2002/04/19 06:13:02
769 ! ! Encode.pm Encode.xs Encode/encoding.h
771 New Fallback API imlemented and documented. See "perldoc Encode"
773 ! lib/Encode/JP/JIS7.pm Encode.pm
774 + lib/Encode/PerlIO.pod t/perlio.t
775 API compliance met. However, it still does not work unless perlio
776 implements line buffer. See BUGS section in perldoc Encode::PerlIO
777 As a sensible workaround, perlio_ok() added to Encode.
779 ! lib/Encode/Supported.pod
781 Message-Id: <20020418174647.J8466@alpha.hut.fi>
783 Doc fixes from Autrijus
784 Message-Id: <20020418144131.GA10987@not.autrijus.org>
786 perlqq mode documented
788 + t/jisx0201.euc t/jisx0201.ref
789 ! t/jisx0208.euc t/jisx0208.ref
790 t/JP.t tests more rigorously and with other encodings
791 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
792 in jis0208 (used to contain jisx0201 part).
793 ! Encode/Makefile_PL.e2x
794 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
795 enc2xs and encode.h rather than hard-coded. This allows the resulting
796 module fully CPANizable.
797 ! encoding.pm t/JP.t t/KR.t
798 PerlIO detection simplified (checks %INC instead of eval{})
799 ! Encode.xs Encode/encode.h
800 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
801 - lib/Encode/Unicode.pm
802 (en|de)code_xs relocated to where it belongs. Source reindented
805 Additional (U8 *) cast added as suggested by jhi
806 Message-Id: <20020417165916.A28599@alpha.hut.fi>
808 1.42 Date: 2002/04/17
810 no-op module; Thought of adding a pod there but enc2xs has
815 correct mechanism to detect Perlio::encoding layar installed.
817 PerlIO Layer detached.
819 1.41 2002/04/16 23:35:00
821 binmode(STDIN|STDOUT ...) done iff PerlIO is available
823 Cleaned up PerlIO skip conditions to prepare for the upcoming
824 Encode - PerlIO forking.
826 exported functions are now prototyped.
827 ! lib/Encode/CN/HZ.pm
830 fallback implemented # was /* FIXME */
831 affected programs revised to fit (only HZ was using the try-catch
832 approach which needed to be fixed for API-compliance).
834 ! Encode/KR/2022_KR.pm
836 can find =head1 NAME now, jhi
837 Message-Id: <20020416083059.V30639@alpha.hut.fi>
841 now complies with less warnings with the pickest compilers.
842 Suggested by Craig, fixed by Dan.
843 ! Encode/Makefile_PL.e2x
845 A bug that fails to find *.e2x in certain conditions fixed
847 1.40 2002/04/14 22:27:14
848 + Encode/ConfigLocal_PM.e2x
849 ! lib/Encode/Config.pm
851 "enc2xs -C" now generates/updates Encode::ConfigLocal.
852 ConfigLocal_PM.e2x is a skelton thereof.
853 ! lib/Encode/Config.pm
855 "use Encode::CN::HZ;" was missing.
858 More rigorous tests added to test XS, especially on memory allocation.
860 ! lib/Encode/Unicode.pm
861 NI-S implemented an XS version -- merged
862 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
865 Source filter option added. With this option on, you can write
866 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
867 encodings. t/jperl.t enhanced to test this feature.
869 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
870 apply his suggestion, this degree of nitting is enough to add him
872 Message-Id: <3CB93223.291E5E2E@earthlink.net>
874 + lib/Encode/JP/JIS7.pm
875 - lib/Encode/JP/JIS.pm
876 - lib/Encode/JP/2022_JP.pm
877 - lib/Encode/JP/2022_JP1.pm
878 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
879 JIS7.pm for better maintainability and performance
881 Added caveat for non-ascii identifiers.
883 fixes by jhi, the original author of this pragramtic module.
884 Message-Id: <20020413231527.V1826@alpha.hut.fi>
886 1.34 2002/04/12 20:23:05 (Unreleased)
889 EBCDIC fixes addressed by jhi.
890 Message-Id: <20020412161844.D9383@alpha.hut.fi>
891 ! lib/Encode/Encoder.pm
892 POD fix by Miyagawa-kun
893 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
895 1.33 2002/04/10 22:28:40
897 Philip's mail address corrected.
900 ! lib/Encode/Encoder.pm
901 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
902 This adds Miyagawa-kun to AUTHORS.
903 * encoding() no longer exported by default but on demand
904 * t/Encoder.t updated to test all these
905 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
906 ! lib/Encode/Unicode.pm
907 ! lib/Encode/Supported.pm
908 Further doc fixes by Anton
910 1.32 2002/04/09 20:06:15
913 - ucm/macDevanaga.ucm Unicode Character Map
914 - ucm/macGujarati.ucm Unicode Character Map
915 - ucm/macGurmukhi.ucm Unicode Character Map
916 A utility to check integrity of .ucm files. t/bogus.ucm is a
917 ucm that is deliberately bogus. unused Indic mappings are removed
920 resolve_alias() added as suggested by jhi. Same as
921 find_encoding("alias")->name. For convenience. This one is
922 defined in Encode.pm instead of Alias.pm.
923 Message-Id: <20020409215846.H17022@alpha.hut.fi>
925 Memory Allocate but detected during the devel of ucmlint -- fixed.
926 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
927 ! lib/Encode/Unicode.pm
928 valid_ucs2(0) is false but must be true.
929 3 patches from NI-S as follows. This also has fixed the incident
931 ! lib/Encode/Alias.pm
932 find_alias() recursion prevention
934 Checks for the patch above
935 ! t/Encode/Unicode.pm
936 An extra "F" that causes valid_ucs2() return a bogus value fixed
937 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
938 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
939 2 Small Patches from jhi as follows:
941 Encode->encodings() lists in case-insensitve order (as it was)
943 -l option prints avaiable encodings to STDOUT instead of STDERR
944 ! lib/Encode/Aliases.pm
945 s/defintion/definition/
946 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
948 ! lib/Encode/Supported.pod
949 ! lib/Encode/Unicode.pm
950 POD revise by Philip Newton. This adds Philip to AUTHORS list.
951 Thank you for the exact quote of Douglas Adams :)
952 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
954 1.31 2002/04/08 18:08:07
955 ! lib/Encode/Encoder.pm
957 Encode::Encoder, once just a placeholder of an idea, is now much more
958 practical. See t/Encode.t to find how practical it can be.
959 + lib/Encode/Config.pm
961 my false laziness at Encode.pm is fixed. Now %ExtModules are set
962 in Encode::Config and they are all literally, not programatically
963 set. My false laziness was resulting many encodings missing from
965 ! lib/Encode/Unicode.pm
967 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
968 so that it does not rely Encode::Unicode for BOM values
969 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
971 1.30 2002/04/08 02:34:51
972 + lib/Encode/Encoder.pm
973 Object Oriented Encoder. I reckon something like this is in need.
976 ! lib/Encode/Supported.pod
977 * autoloading bug that prevented upper-case canonicals such as UTF-16
978 is fixed. Now even UTF/UCS are autoloaded!
979 * encodings() is now more intuitive.
980 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
982 * Obligatory fixes to the POD.
983 ! lib/Encode/Supported.pod
984 Patch from Anton applied.
985 Message-Id: <66641479.20020408033300@motor.ru>
987 ! lib/Encode/Unicode.pm
988 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
990 1.28 2002/04/07 18:58:42
994 Just a MANIFEST for those missing files.
996 1.26 Date: 2002/04/07 15:22:04
999 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1000 And t/Aliases.t fixed to make sure they all exist.
1002 1.25 2002/04/07 15:01:25 (Unreleased)
1004 ! lib/Encode/Unicode.pm
1007 - lib/Encode/UTF_EBCDIC.pm
1008 - lib/Encode/Internal.pm
1009 - lib/Encode/utf8.pm
1010 Integrated into Encode.pm as closures. That way "one package, one file"
1011 rule is preserved yet less files to require.
1013 commented out binmode(STDERR ...
1017 ! EBCDIC/Makefile.PL
1020 ! Symbol/Makefile.PL
1022 ! Encode/Makefile_PL.e2x
1023 Schwarn's MM-compliance patch merged
1024 Message-Id: <20020406082609.GA28758@blackrider>
1026 ! lib/Encode/Unicode.pm
1027 + lib/Encode/UTF_EBCDIC.pm
1029 - lib/Encode/10646_1.pm
1030 - lib/Encode/ucs2_le.pm
1031 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1032 per-module based (en|de)code, I saved a number of .pm by
1033 reorganizing it as per-object base (Well, this is what Encode::XS
1034 does under the hood). See Encode::Unicode for details.
1035 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1036 This module is used only on EBCDIC environments.
1038 1.21 2002/04/05 14:46:34 (Not Released)
1044 Are back to make Perl/Tk happy Smile, NI-S.
1046 ! lib/Encode/Alias.pm
1047 ! lib/Encode/Supported.pm
1048 ! lib/Encode/10646_1.pm
1049 ! lib/Encode/ucs2_le.pm
1050 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1051 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1052 updated to reflect changes.
1053 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1054 ! lib/Encode/Alias.pm
1055 ! lib/Encode/Supported.pm
1056 Anton's revision commited. Added Dan's own fixes as well.
1057 Message-Id: <159103166906.20020405161134@motor.ru>
1058 ! lib/Encode/Alias.pm
1060 < qr/^UCS2-le$/i => '"UCS-2"', );
1062 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1063 Sigh. Thank you, Anton.
1064 Message-Id: <14567692196.20020405062020@motor.ru>
1065 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1067 1.20 2002/04/04 19:50:52
1069 the last minute addtion. Just give it a try. Docs remains to be done.
1070 Not installed by default.
1071 ! lib/Encode/Supported.pod
1074 ! lib/Encode/Alias.pm
1075 ! lib/Encode/utf8.pm
1076 ! lib/Encode/10464_1.pm
1077 ! lib/Encode/ucs2_le.pm
1078 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1079 unchanged but UTF-16BE is added as an alias. Implicit aliases
1080 move to Encode::Alias so init_alias() works more as expected.
1081 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1082 Though pedantically wrong, This should make perl mongers happier.
1083 t/Alias.t is enhanced to test all these.
1084 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1086 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1087 of the codepoints are common. CJKT left as is because the saving is
1091 ! EBCDIC/Makefile.PL
1093 ! Encode/Makefile_PL.e2x
1097 ! Symbol/Makefile.PL
1101 All occurance of _def.h replaced with .exh so djgpp works happily
1102 ever after! To credit this amazing discovery, Laszlo is now in
1104 Message-Id: <20020403181424.GA8778@freemail.hu>
1105 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1108 ! Encode/Makefile_PL.skel
1110 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1111 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1113 Two more tests by added jhi
1114 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1117 The showstopper fixed -- Memory reallocation bug was causing
1118 Encode::XS to fall into infinite loop on certain conditions.
1119 t/grow.t tests that.
1120 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1125 ! lib/Encode/Supported.pod
1126 Vendor encodings rebuilt out of original map files at unicode.org.
1127 Indic languages such as MacDevanagali remain unspported do to the
1128 shortcoming of encengine capabilities (they need algorithmical
1129 conversion and I have no knowledge on that!). Pods fixed for added
1131 Oh, macJapan.ucm renamed to macJapanese.ucm.
1132 macROMnn is macRomanian and macRUMnn is macRumanian.
1133 txt2ucm is a crude script that is used to convert them.
1135 Unicode Compound Characters (used extensively on Mac) supported
1137 Typo fixes and improvements by jhi
1138 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1140 1.11 2002/03/31 22:12:13
1144 Missing files from the MANIFEST fixed.
1145 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1146 Version incremented just to make CPAN happy.
1148 1.10 2002/03/31 21:32:42
1151 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1152 if you want. This should make Autrijus happy. Also, piconv
1153 is added to default install.
1156 Here-documented files that enc2xs generates are now exported
1157 to *.e2x. Much cleaner and easier to debug.
1159 encoding enhances so you can make it act more like such
1160 (now prehistoric ) "localized" variations of perl like Jperl.
1162 Further test for encoding.pm. Written in euc-jp
1165 Taken over form jhi.
1166 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1171 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1178 ! Encode/macIceland.ucm
1179 ! lib/Encode/Alias.pm
1180 ! lib/Encode/Supported.pod
1181 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1182 Message-Id: <m3lmcavhjt.fsf@anima.de>
1184 1.01 2002/03/29 20:59:39
1187 s/USE_SCRIPTS/MORE_SCRIPTS/
1189 installs enc2xs by default for external Encode:: modules in CPAN,
1190 such as Encode::HanExtra
1192 More sensible perl core detection via $ENV{PERL_CORE}
1194 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1196 Perl core ditection via $^X =~ m/\bminiperl$/o
1197 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1199 1.00 Wed Mar 29 2002
1201 The version of all files is updated to 1.00 via "ci -f -l1.00",
1202 commemorating version 1.00. All files, including *.ucm are now
1203 under version control.
1206 encode.h moved to Encode/ so it will be installed for the later
1209 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1214 compile renamed to enc2xs.
1215 Affected Makefle.PL updated
1217 "Punt it. HanExtra can take care of that later." -- Autrijus
1218 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1221 ! Encode/ksc5601.ucm
1222 ! lib/Encode/CJKConstants.pm
1223 ! lib/Encode/KR/2022_KR.pm
1224 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1225 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1229 bin/ added for example scripts. They are not installed by default.
1230 to install them, "perl Makefile.PL USE_SCRIPTS".
1231 piconv is iconv reinvented in perl. in addition to all features
1232 of iconv, it also adds perlish features. See L<piconv/1> for more
1234 ! lib/Encode/Alias.pm
1235 qr/^ replaced with qr/\b so it directly matches locale names
1236 such as en_US.US-ASCII
1239 Patch by MJD to fix the following problem applied.
1240 Subject: [PATCH 5.7.3 Encode]
1241 Aliases.t not properly skipped when Encode extension not built
1242 Message-Id: <20020328091850.18677.qmail@plover.com>
1243 ! lib/Encode/KR/2022_KR.pm
1244 ! lib/Encode/CJKConstants.pm
1245 Another patch from Jungshik to make iso-2022-kr actually work
1246 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1247 ! Encode/Encode/euc-kr.ucm
1248 + Encode/Encode/johab.ucm
1249 ! Encode/Encode/ksc5601.ucm
1251 ! Encode/KR/Makefile.PL
1252 ! Encode/lib/Encode/Alias.pm
1254 Johab support and complete revision of Korean Encoding by Jungshik
1255 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1257 Revised to make up with now-dropped Encode::Details.
1258 - lib/Encode/Details.pod
1259 Dropped. Besides being obsolete, the topics are now covered in
1260 respective pods now.
1265 Korean aliases fixed thanks to Jungshik Shin
1266 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1267 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1269 =head1 NAME added to all modules to make buildtoc happy
1270 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1271 - lib/Encode/CJKguide.pod
1272 Too controversial and dropped from the dist. Will be available
1273 separately on the web.
1275 RCS tags added so table debugging gets easier (should that be
1276 needed! I hope they all stay 1.00!)
1277 + lib/Encode/CJKguide.pod
1278 A detailed guide to mainly, but not limited to, CJK multibyte
1281 + Encode/hp-roman8.ucm
1283 ! Encode/Supported.pod
1284 All occurance of "roman8" replaced with "hp-roman8" to avoid
1286 ! Encode/Supported.pod
1289 Mac Encodings now comply the Inside Macintosh
1291 Test for '-raw' conventions added.
1293 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1294 ! Encode/gb12345.ucm
1296 ! Encode/ksc5601.ucm
1297 "-raw" appended to canonical names.
1298 File mames stay unchanged thanks to UCM format.
1299 ! lib/Encode/CN/HZ.pm
1300 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1301 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1303 0.99 Tue Mar 26 2002
1304 - lib/Encode/JP/Const.pm
1305 + lib/Encode/CJKConstants.pm
1306 + lib/Encode/CN/2022_CN.pm
1307 + lib/Encode/KR/2022_KR.pm
1316 * Support for ISO-2022-KR and ISO-2022-CN added.
1318 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1319 * ucm2table autogenerates character table out of UCM files.
1322 - lib/Encode/Supports.pod
1323 + lib/Encode/Supported.pod
1324 Names reverted due to popular demand.
1325 8.3 rule applies only when there is a conflict.
1326 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1331 - lib/Encode/Format/Enc.pod
1333 * Character tables is now 100% ucm.
1334 * All files under Encode/ is now 8.3-compliant
1335 * some of missing encodings added (i.e. gsm0338 and nextstep)
1336 * Vendor mappings aggregated with appropriate national std in
1337 Makefile.PL, resulting smaller *.so especially for CJK.
1338 Following is result on Dan's FreeBSD box.
1340 ---------------------------------------------------------------
1341 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1342 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1343 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1344 blib/arch/auto/Encode/Encode.so 27,791 27,791
1345 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1346 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1347 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1348 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1349 ---------------------------------------------------------------
1350 Total 5,375,297 6,343,819
1352 * As a result of ucm-transition, Encode::Tcl dropped because
1353 Encode::Tcl demands *.enc.
1354 Encode::Tcl will be supplied in a separate tarball with *.enc.
1355 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1360 -lib/Encode/Supported.pod
1361 +lib/Encode/Supports.pod
1362 -lib/Encode/iso10646_1.pm
1363 +lib/Encode/10646_1.pm
1364 -lib/Encode/EncFormat.pod
1365 +lib/Encode/Format/Enc.pod
1366 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1367 - lib/Encode/JP/Constants.pm
1368 + lib/Encode/JP/Consts.pm
1369 ! lib/Encode/JP/JIS.pm
1370 ! lib/Encode/JP/H2Z.pm
1371 Version nit problem and 8.3 rule fix.
1372 > Package namespace installed latest in CPAN file
1373 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1374 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1375 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1376 accordingly. In addition, legacy "use vars qw()..." are replaced with
1378 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1379 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1381 - lib/Encode/JP/ISO_2022_JP.pm
1382 - lib/Encode/JP/ISO_2022_JP_1.pm
1383 + lib/Encode/JP/2022_JP.pm
1384 + lib/Encode/JP/2022_JP1.pm
1386 8.3 naming conflict for vanilla fat addressed by jhi
1387 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1390 Typecast fix addressed by jhi
1391 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1393 0.98 Mon Mar 25 2002
1394 ! lib/Encode/Supported.pod
1396 + lib/Encode/JP/ISO_2022_JP_1.pm
1397 ! lib/Encode/JP/ISO_2022_JP.pm
1398 ! lib/Encode/JP/JIS.pm
1400 Now Encode::JP is more strict on the difference between ISO-2022-JP
1401 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1402 makes Anton happier :) FYI the previous version implements
1403 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1404 ! lib/Encode/Supported.pod
1407 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1408 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1413 ! lib/Encode/Suppoted.pod
1414 pod fixes to replace F<http://...> to L<http://...>,
1415 as suggested by Autrijius in:
1416 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1417 ! lib/Encode/Suppoted.pod
1418 fixes and enhancements by Anton
1419 Message-Id: <10632060120.20020324103753@motor.ru>
1420 ! lib/Encode/Alias.pm
1421 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1422 added. Suggested by Anton then deobfuscated by Autrijius
1423 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1425 Further fix by Nicholas Clark
1426 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1427 - lib/EncodeFormat.pod
1428 + lib/Encode/EncFormat.pod
1430 File renamed as suggested by Autrijius
1432 ! lib/Encode/Details.pod
1433 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1434 ! Encode.pm Sun Mar 24 13:43:47 2002
1435 pod fixes by Autrijius.
1436 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1437 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1439 ! lib/Encode/Alias.pm
1441 now more EBCDIC conscious;
1442 %ExtModules on EBCDIC system excludes CJK so that you don't
1443 have to worry about the matched alias resulting cloaking.
1444 t/Alias.t also revised to reflect changes. Verified by jhi
1445 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1447 0.97 Sun Mar 24 2002
1451 EBCDIC detection mechanism installed as in JP/JP.pm
1452 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1455 ! EBCDIC/Makefile.PL
1458 ! Symbol/Makefile.PL
1460 Now all table files used by compile are postfixed '_t' to avoid
1461 namespace collisions in case insensitive file systems once for all!
1463 Message-ID: <58290227735.20020323195659@familiehaase.de>
1465 Since the Encode::JP is unsupported under EBCDIC we
1466 cannot run this test (aliases as such should work fine) -- jhi
1467 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1469 duplicate occurance of ascii.ucm and 8859-1.ucm
1470 causes MacOS X dlyd to cloak
1476 < chdir 't' if -d 't';
1478 > if (! -d 'blib' and -d 't'){ chdir 't' };
1479 When you are "make test"-ing on Encode/ directory, you must not
1480 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1481 remain unchanced. Also the situation detection was made simpler
1482 in t/JP.t, which was originally;
1483 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1485 "Use of uninitialized value in string eq at Encode.pm line 96."
1486 ! Symbol/Makefile.PL
1487 ! EBCDIC/Makefile.PL
1489 -- Problem on case insensitive file systems
1490 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1491 Message-ID: <88254111953.20020323095503@familiehaase.de>
1494 "So I think it's a bug in gcc, not perl. But it still needs to be
1496 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1497 Message-Id: <20020323170509.C96475@plum.flirble.org>
1499 0.96 Sat Mar 23 2002
1501 ! lib/Encode/Encoding.pm
1502 ! lib/Encode/Alias.pm
1503 ! lib/Encode/Supported.pod
1505 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1506 Message-ID: <20020322073908.GB10539@blackrider>
1509 "...I think we should include ISO 8859-1 as well." -- NI-S
1510 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1515 ! lib/Encode/Alias.pm
1516 alias definitions relocated to Encode::Alias so module autoloading
1517 works for aliases also.
1519 encodings() now accepts args to check ExtModules.
1523 + EBCDIC/Makefile.PL
1524 + Symbol/Makefile.PL
1528 Latin and single byte encodings are reorganized so they are
1529 demand-loaded like Encode::XX. Now only ascii is compiled into
1531 ! lib/Encode/Alias.pm
1532 for my $k (keys %hash){ delete $hash{$k}; }
1533 is depreciated; fixed.
1535 0.95 Fri Mar 22 2002
1536 In this update, pod rewrites and alias fixes are the main issues
1537 + lib/Encode/Supported.pod
1538 Describes supported encodings
1540 streamlined compiled-in encodings.
1541 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1543 + Encode/ibm-125?.ucm
1544 Added from icu distibution with any occurance of
1545 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1546 some respect to icu staff, however.
1547 + lib/Encode/Alias.pm
1549 Alias difinitions in Encode.pm relocated.
1552 packWARN patch from Paul Marquess via jhi
1553 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1554 Paul added to AUTHORS as a result.
1555 ! t/CJKalias.t -> t/Aliases.t
1556 Renamed. Checks even more aliases and alias overloading
1559 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1560 gbk => cp936 relocated to CN.pm
1562 Test::More with plans (by jhi)
1564 0.94 Thu Mar 21 2002
1565 + lib/Encode/Description.pod
1566 ! lib/Encode/Encoding.pm
1567 Now the pod in Encode.pm is abridged as programming references.
1568 lib/Encode/Description.pod contains the original, detailed description
1569 and Encode::Encoding explains how to write your own module to
1570 add new encodings. So far, lib/Encode/Description.pod contains
1571 the whole pod once in Encode.pm. This is intentional.
1573 Pod revisions by Anton Tagunov
1574 Message-Id: <517178431.20020320174824@motor.ru>
1576 all occrance of Encode::Tcl::Extended removed including pod
1578 test now checks $encoding->name only; $encoding->{name} are
1579 no longer check to find the canonical name.
1580 ! lib/Encode/JP/JIS.pm
1581 ! lib/Encode/JP/ISO_2022_JP.pm
1582 ->name() added to be more compliant with API
1588 Patch by Autrijus to add aliases to TW and fixes to POD
1589 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1591 SADAHIRO Tomoyuki added as should. My apologies.
1593 0.93 Wed Mar 20 2002
1594 * First release to be uploaded to CPAN. For prehistoric changes,
1595 please see Changes file of perl distibution as well as
1596 perl-unicode@perl.org archive, available at:
1597 http://archive.develooper.com/perl-unicode@perl.org/
1599 Changes Since 0.92 includes;
1604 + Mention to perl-unicode@perl.org added
1606 + Encoding aliases added so you can feed locale names
1607 and MIME Charset="" directly.
1608 - Mention to JISX0212 removed because it's fixed
1611 + Encoding aliases added. Note TW is left untouched because
1612 euc-tw is not implemented in TW but in Encode::HanExtra.
1613 Autrijus, you may fix Encode::HanExtra.
1615 + to test encode aliases added