1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.15 2006/04/06 15:44:11 dankogai Exp dankogai $
6 $Revision: 2.15 $ $Date: 2006/04/06 15:44:11 $
8 Addressed: UTF-16, UTF-32, UCS, UTF-7 decoders mishandle illegal characters
9 http://rt.cpan.org/NoAuth/Bug.html?id=#18556
11 added str2bytes() as an alias to encode() and bytes2str() as an alias
13 http://rt.cpan.org/NoAuth/Bug.html?id=#17103
15 Change 26922: Avoid warning with MS Visual C compiler.
16 Message-Id: <200601231245.k0NCj2dw009484@smtp3.ActiveState.com>
18 Change 26067: As using -C to turn on utf8 IO is equivalent to the open pragma
19 Message-Id: <200511092227.jA9MRcYD009025@smtp3.ActiveState.com>
21 2.14 2006/01/15 15:43:36
23 Change 26295: Don't build manpages for Encode and Unicode::Normalize
24 Message-Id: <200512071540.jB7Fe4Gt017960@smtp3.ActiveState.com>
26 Change 26081: Pod nit in Encode.pm, found by Marc Lehmann in RT #36949.
27 Message-Id: <200511110357.jAB3vZcP023647@smtp3.ActiveState.com>
28 ! Encode.xs Encode/encode.h bin/enc2xs encengine.c
29 Change 25821: Mark more static Encode data structures as const.
30 Change 25823: use more 'const' in the Encode data structures.
31 Message-Id: <200510221243.j9MChTSu027711@smtp3.ActiveState.com>
32 Message-Id: <200510221343.j9MDhTk9001245@smtp3.ActiveState.com>
34 2.13 2006/01/15 15:06:36
36 Miyagawa's mail address updated
37 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
38 ! lib/Encode/MIME/Header.pm
39 #16413: Encode::MIME::Headers patch to solve what is probably someone else's bug
40 http://rt.cpan.org/NoAuth/Bug.html?id=16413
41 ! lib/Encode/MIME/Header.pm t/mime-header.t
42 Applied: RT #16258: Support for RFC 2184 language tag
43 http://rt.cpan.org/NoAuth/Bug.html?id=16258
45 Fixed RT #14559: fix for #8872 introduces new "bug"
46 http://rt.cpan.org/NoAuth/Bug.html?id=14559
49 from_to() now makes use of $check more naturally.
50 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
51 <B10B4DE2-9BAF-4344-B3BE-2119977D817A@dan.co.jp>
53 2.12 2005/09/08 14:17:17
54 ! Encode.xs Encode.pm t/fallback.t
55 Now accepts coderef for CHECK!
57 Updated to newer version at unicode.org
58 http://rt.cpan.org/NoAuth/Bug.html?id=14222
59 ! lib/Encode/Supported.pod
61 <42F5E243.80500@gmail.com>
63 More POD typo leftover fixed.
64 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com>
66 2.11 2005/08/05 10:58:25
68 To reflect changes below
69 ! Encode.pm encoding.pm
70 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod
71 Typo fixed by Piotr Fusik in Change 25261 & 25266
72 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec>
74 Addresses "BUG REPORT: panic in Encode.xs".
75 Message-Id: <42EDDA97.2010608@hyper.to>
76 + lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t
77 ! lib/Encode/MIME/Header.pm lib/Encode/Config.pm
78 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka
79 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp>
80 ! Encode/encode.h Encode.pm Encode.xs
81 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character.
82 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp>
83 See also ext/PerlIO/encoding/encoding.pm
84 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com>
86 2.10 2005/05/16 18:46:36
88 fixed decode_utf8() accordingly to RT#8872
89 http://rt.cpan.org/NoAuth/Bug.html?id=8872
91 s/SvIVX/SvIV_set/ by Steve Peters.
92 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com>
96 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict
98 Tests utf-8-strict, accordingly to
99 UTF-8 decoder capability and stress test" by Markus Kuhn
100 http://smontagu.damowmow.com/utf8test.html
101 Note that malformed and overlong sequences are not test here
102 because perl already does that for you, utf-8-strict or not.
103 ! Encode.pm Encode/encode.h t/fallback.t
104 Addressed "encode(..., Encode::LEAVE_SRC) does not work".
105 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so
106 you can (en|de)code constant strings with these fallbacks.
107 http://rt.cpan.org/NoAuth/Bug.html?id=8736
108 ! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t
109 Make Encode.pm support the real UTF-8, by GAAS
110 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com>
111 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com>
112 ! Encode.pm Encode.xs
113 post-2.09 comment patches from GAAS applied.
114 Message-Id: <lroehacz6q.fsf@caliper.activestate.com>
115 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com>
117 2.09 2004/12/03 19:16:53
118 ! Encode.pm Encode.xs
119 Addressed " :encoding(utf8) broken in perl-5.8.6".
120 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com>
122 Addressed "(de|en)code($valid_encoding, undef) does not warn".
123 http://rt.cpan.org/NoAuth/Bug.html?id=8723
124 ! Encode.pm t/Encode.t
125 Addressed "Can't encode URI". When a reference is fed to (en|de)code,
126 Encode now stringifies instead of returning undef.
127 http://rt.cpan.org/NoAuth/Bug.html?id=8725
128 ! Encode.xs t/fallback.t
129 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder".
130 http://rt.cpan.org/NoAuth/Bug.html?id=8694
132 Addressed "s/digit/number/".
133 http://rt.cpan.org/NoAuth/Bug.html?id=8695
135 Addressed "while (defined(read )) { ... } is an infinite loop".
136 http://rt.cpan.org/NoAuth/Bug.html?id=8696
138 Addressed "What the heck is UCM?".
139 Document fixed so that it no longer contains "UCM-Based Encodings".
140 http://rt.cpan.org/NoAuth/Bug.html?id=8697
142 2.08 2004/10/24 13:00:29
143 ! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs}
144 Resolved the issue that was raised by 2.07 -- Encode::utf8 fallbacks
145 that was introduce messed up PerlIO::encoding.
146 * To do so, ->renew() is renewed and ->renewed() was introduced to
147 tell whether the caller is PerlIO or not.
148 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp>
150 2.07 2004/10/22 19:35:52
151 ! lib/Encode/Encoding.pm
152 "Remove Carp from warnings.pm" that influences Encode, by Tels.
153 Message-Id: <200410161618.29779@bloodgate.com>
154 ! Encode.xs AUTHORS t/fallback.t
155 Now Encode::utf8's fallbacks are compliant to Encode standard.
156 Thank Bjoern Hoehrmann for persistently convincing me.
157 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
161 2.06 2004/10/22 06:23:11
163 RT #8083 reports that MacThai mapping was obsolete
164 Updated all mac* encodings accordingly to the URI below.
165 One remaining mystery is that MacRomanian vs. MacRumanian.
166 MacRumanian is not found in unicode.org...
167 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
168 ! Encode.pm t/Encode.t
169 Fixed RT #8081: "decode(..., bless{},'x') segfault"
170 Two more tests added to test that.
171 http://rt.cpan.org/NoAuth/Bug.html?id=8081
173 POD revised accordingly to RT #7966
174 http://rt.cpan.org/NoAuth/Bug.html?id=7966
176 POD updated explaining why Encode::Unicode always croaks on error
177 rather than giving users choices.
178 http://rt.cpan.org/NoAuth/Bug.html?id=7892
180 2.05 2004/10/19 04:55:01
182 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
183 Message-ID: <41210A84.6060506@iki.fi>
184 Message-ID: <20041018233442.7418113f@r2d2>
185 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
187 2.04 2004/10/16 21:22:44
189 From: craigberry@mac.com
190 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
191 Message-Id: <41716868.7000102@mac.com>
193 2.03 2004/10/06 05:07:20
194 ! lib/Encode/Alias.pm
195 Resolved some alias case sensitivity glitches reported via RT.
196 http://rt.cpan.org/NoAuth/Bug.html?id=7835
198 Resolved Win32 glitches reported via RT.
199 (Fixed by dankogai and tested by Steve Hay)
200 http://rt.cpan.org/Ticket/Display.html?id=7831
201 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
202 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
203 http://rt.cpan.org/NoAuth/Bug.html?id=6695
205 2.02 2004/08/31 10:55:34
206 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
207 New map submitted by Deng Liu and Autrijus. Test data needed
208 to be upgrade as well, done by dankogai
209 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
211 Now works for characters U+10000 and above. This fix was needed
212 to "tidy" the original map that was submitted.
214 "ucmsort" now mentioned in pod
216 2.01 2004/05/25 16:27:14
218 From: domo@computer.org
219 Subject: [PATCH] Correct statistics from enc2xs
220 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
221 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
222 ! lib/Encode/Alias.pm
223 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
224 <200405271148.i4RBm4KY026529@mail.mvnet.de>
226 2.01 2004/05/25 16:27:14
227 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
228 "If someone thinks utf8::upgrade($1) should be croaked like
229 chom?p($1),please try the following patch for Encode.pm."
231 <20040522212704.C068.BQW10602@nifty.com>
233 2.0 2004/05/16 20:55:15
234 * version updated to 2.00
235 -- sorry, no big feature change. I just hate version 1.100 :)
236 ! lib/Encode/Guess.pm
238 addressed UTF-(8|32LE) + BOM misguessing
239 https://rt.cpan.org/Ticket/Display.html?id=6279
241 s/is_utif8/is_utf8/ in POD
242 ! Encode/lib/Encode/CN/HZ.pm
243 Fixes "make test" failure after the patch to pp_hot.c
245 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
247 From: autrijus@autrijus.org
248 Subject: [PATCH] "piconv -C 512" badly broken
249 Message-Id: <1072870210.769.5.camel@localhost>
251 1.99 2003/12/29 02:47:16
253 find_encoding("UTF-16BE")->encode("abc") now null terminates
254 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
256 prototype bug in decode_utf8() fixed
257 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
258 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
259 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
261 Merged from maintperl@21987
263 1.98 2003/08/20 11:15:31
264 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
265 Dave Evans has found and corrected a bug in Encode::MIME::Header.
266 Test suite added by Dan Kogai.
267 Message-Id: <3F43440B.7060606@rudolf.org.uk>
269 Typo fixes rolled back in from bleedperl
270 ! t/at-cn.t t/at-tw.t
271 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
272 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
274 argv case nit for VMS by Craig
275 Message-ID: <3F2B02DE.10207@mac.com>
276 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
277 Encode test fixes for VMS by Peter Prymmer
278 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
279 ! lib/Encode/Alias.pm t/Aliases.t
280 koi-8 aliases bug detected and patched by sadahiro.
281 Further fix and test suite by dankogai
282 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
284 1.97 2003/07/08 21:52:14
285 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
286 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
287 $DEBUG replaced with DEBUG() so perl optimizes better,
288 by Rafael with further fixes by dankogai
289 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
290 ! lib/Encode/Aliases.pm
291 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
292 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
293 So new hash seeding introduced in bleedperl works.
294 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
295 ! lib/Encode/Guess.pm
296 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
297 automatic utf(8|16|32) guessing -- originally by Autrijus
298 Message-Id: <20030626162731.GA2077@not.autrijus.org>
300 Addressed the following;
301 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
302 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
304 1.96 2003/06/18 09:29:02
305 ! lib/Encode/JP/JP.pm t/guess.t
306 m/(...)/ in void context then $1 is considered a Bad Thing
307 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
309 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
312 More typecast from maintperl@19739
313 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
315 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
316 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
318 Now skips for EBCDIC platform.
319 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
321 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
322 have more than one C<.> in their filenames." -- perlport.
323 Message-Id: <3ED79E01.8050401@mac.com>
325 Found and fixed the back that -p,--perlqq does not work.
326 Induced by the change from Getopt::Std to Getopt::Long.
328 Addressed [cpan #2629] Wrong assumption in numeric comparison
329 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
330 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
331 lib/Encode/Encoding.pm t/perlio.t
332 ! API Change: ->new_sequence() => ->renew()
333 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
334 + Encode::XS and Encode::utf8 now supports ->renew()
335 + Encode::Encoding now documents this with examples
336 - Non-XS (en|de)code stripped out of Encode::Unicode
337 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
339 1.95 2003/05/21 08:41:11
341 Since bogus entries were found in iso-8859-6, all entries are
342 re-generated once again out of
343 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
344 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
345 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
346 + lib/Encode/Unicode/UTF7.pm
347 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
348 lib/Encode/Supported.pod
349 UTF-7 support is now added. With this Encode now has all transcoding
350 methods in Unicode::String.
352 1.94 2003/05/10 18:13:59
353 ! lib/Encode/MIME/Header.pm
354 A more sophisticated solution for double-encoding by dankogai
355 ! lib/Encode/MIME/Header.pm AUTHORS
356 Two bugs fixed by Bjoern Jacke
357 * "Double Encoding" was not possible
358 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
359 * encode("MIME-Q") had UTF-8 flag on
360 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
361 ! lib/Encode/MIME/Header.pm AUTHORS
362 Two occurances of "croak ()" fixed as "croak qq()".
363 Simon Cozens is added to AUTHORS as a result.
364 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
366 POD fixes that reflect enhancements by jhi
368 Two enhancements by jhi.
369 + Now uses Getopt::Long so it accepts long name options
370 (--from for -f, for example)
371 + New option: -r,--resolve
372 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
374 META.yml added upon request of Schwern
375 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
377 Enache Adrian removed upon request -- to live longer than Encode
378 and/or FreeBSD (toy-)?thread :)
379 Message-Id: <20030425015701.GA2069@ratsnest.hole>
381 "close STDOUT unless $^O eq 'freebsd';" once again relocated
382 to keep VMS happy in which case "$^O eq 'freebsd'" is required
383 to keep FreeBSD+thread happy. Sigh.
384 Message-Id: <3EA88ADC.3000300@mac.com>
386 1.93 2003/04/24 17:43:16
388 added "no warnings 'pack'" in for loop to keep bleedperl from
389 complaining "Character in 'C' format wrapped in pack".
391 More elegant perl core detection inspired by Ilya Zakharevich
392 (but further elaborated for general cases).
393 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
396 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
397 are less likely to be confused w/ ISO-8859-*
399 RT tests added (vendor encodings are exemplified)
400 -- that successfully found a flaw on iso-2022-kr before the patch.
401 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
402 decode("ISO-2022-KR") has been buggy but no one ever sited
403 that since no one seems to be using it. Bugs discovered by
405 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
406 ! lib/Encode/CN/HZ.pm t/perlio.t
407 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
408 so it adds test for HZ.
409 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
410 ! lib/Encode/Guess.pm
411 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
412 So long as the string contains \x{00}-\x{ff} it does not fail.
413 See perldoc for details.
414 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
416 1.92 2003/03/31 03:27:27
417 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
418 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
419 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
420 but glibc-BIG5-2.1.2.ucm does.
421 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
422 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
423 Maps regenerated again but this time based upon
424 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
425 (But where is THE DOCUMENT by MICROSOFT?)
426 ! t/enc_module.t AUTHORS
427 failure with threaded Perl on FreeBSD addressed.
428 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
429 Message-Id: <20030322230131.GA813@ratsnest.hole>
430 ! lib/Encode/Guess.pm
433 Change 18989: Make the :bytes conditional on PerlIO.
434 further Modified by Dan Kogai
435 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
437 Chnage 18966: another fix for failing test on windows ("use encoding"
438 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
439 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
441 Change 18970: Hopefully this works also in Win32, by jhi
442 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
443 Change 18965: fix CJKT.t failures on windows due to incorrect
445 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
447 1.91 2003/03/09 20:07:37
449 even more proofread by jhi.
450 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
453 +use lib qw(t ext/Encode/t ../ext/Encode/t);
454 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
456 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
457 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
460 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
462 1.90 2003/03/09 17:32:43
465 Inaba-san has added a patch for perl 5.8.1 or later that makes
466 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
467 that. POD is further revised.
468 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
469 ! encoding.pm t/enc_module.t
470 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
471 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
475 signed vs. unsigned issue discovered by Craig on OpenVM
476 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
477 ! encoding.pm AUTHORS
478 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
479 Because binmode() stacks layers instead of overwrite, you have to
480 ":raw :encoding()" in encoding.pm or your are in trouble when you
481 call encoding.pm multiple times. There are several workarounds
482 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
483 who was the first to address this problem was added to AUTHORS.
484 The test suites was added for this, which is a modified version
485 of SUGAWARA-san's scripts
486 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
488 1.88 2003/02/20 14:42:34
490 one signedness nit for Encode by jhi
491 <200302161933.h1GJX876018710@kosh.hut.fi>
493 VISCII map was incorrect; fixed by Sadahiro-san
494 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
495 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
496 You can't unlink files that are opened in cygwin but the last
497 file handle opened in t/enc_*.t left open. Patch submitted
498 by Yitzchak and he was added to AUTHORS.
499 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
501 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
502 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
504 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
505 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
507 1.87 2003/02/06 01:52:11
509 * Inaba "Sensei" Hirohito added (I thought I have done so a long
510 ago but apparently I did not).
511 * SUZUKI Norio added for verious and useful bug reports.
512 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
513 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
514 podchecked so all warnings are gone except for L<http://>.
515 ! encoding.pm t/enc_eucjp.t
516 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
517 * Filter option overhaul
519 ! Encode.pm Encode.xs encengine.c Encode/encode.h
520 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
521 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
522 without filter. podchecked by Dan Kogai.
523 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
524 ! lib/Encode/Alias.pm
525 decode('alias', $1) went wild because of local $_ in find_alias()
526 the evil local $_ is eradicated but that changes find_alias()
527 format for coderef aliasing. See Encode::Alias for details
528 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
530 1.86 2003/01/22 03:29:07
532 * Don't forget to canonize when you attempt an exact match!
533 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
534 * ${^ENCODING} exception is off for $] > 5.008
535 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
537 $] check commented out so it runs on 5.8.0
539 1.85 2003/01/21 22:19:14
541 ${^ENCODING} exception is now explicit rather than handled by regex.
542 + t/enc_eucjp.t t/enc_utf8.t
543 Test suite for the better "encoding" pragma support for bleedperl.
544 On 5.8.0, they will just be skipped.
546 1.84 2003/01/10 12:00:16
548 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
549 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
550 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
552 %_ fixes saves the resulting .so .05% smaller, by NC
553 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
555 Silence Encode on undef, by Andreas
556 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
557 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
559 s/regognised/recognised/ . British spelling left intact to pay
560 respect to two British Nicks :)
561 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
563 1.83 2002/11/18 17:28:49
564 ! Encode.xs lib/Encode/JIS7.pm
565 Even more patches from Inaba-san has been applied. With this
566 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
567 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
569 1.82 2002/11/14 23:06:12
571 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
572 later found in perls w/ -Dusemymalloc was fixed by NC.
573 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
575 1.81 2002/11/08 18:29:27
576 ! Encode.pm Encode.xs
577 Non-XS version of Encode::utf8 is back (with XS being default).
578 Encode::predefine_encodings(0) to turn off XS.
579 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
580 found that it was Test::More causing the problem, not Encode.
581 But I have already made it configurable so it may be useful in
583 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
585 The ingenious patch by Nicholas Clark that reduces shlib sizes by
586 50% with no penalty and backward compatibility preserved, is in.
587 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
589 1.80 2002/10/21 20:39:09
590 ! Encode.xs t/mime-header.t
591 Even more patches from NI-XS regarding Encode::utf8->decode().
592 And one more test to t/mime-header.t to prove it
593 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
595 1.79 2002/10/21 06:05:37
597 Further patches from NI-XS. Encode::utf8->decode() now checks the
598 value of utf8 flag of the argument. As a result, the fix to
599 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
600 no harm (even speedwise) I'll leave it unreverted.
601 ! ucm/cp949.ucm ucm/cp950.ucm
603 U+00AE REGISTERED SIGN
604 were missing as a result of 1.78. Discovered by Moriyama-san.
605 Moriyama-san has also developed a test script that compares
606 (en|de)coded results to the corresponding Win32 API result and
607 all cp9?? maps are now verified.
608 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
610 1.78 2002/10/20 15:44:00
611 ! lib/Encode/MIME/Header.pm
612 fixed so that it works with new Encode::utf8
613 ! Encode.pm Encode.xs
614 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
615 :encoding(UTF-8) to handle partial chars at end of buffers
617 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
618 ! lib/Encode/Supported.pod
619 More nitpickings applied.
622 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
623 tests were completely useless. To redeem that and get the peace of
624 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
625 ucm/*.ucm Since this script takes too long to finish (30 seconds on
626 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
627 but you can easily run that by either renaming it or:
629 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
630 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
631 Other CJKT cp9?? also updated according to the URI below;
632 http://www.microsoft.com/typography/unicode/cscp.htm
633 + bin/ucmsort MANIFEST
634 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
635 to proper order. intended for hardcore develpers only.
636 ! ucm/cp932.ucm JP/JP.pm AUTHORS
637 CP932 mapping which was based upon the mapping file at unicode.org
638 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
639 has also supplied the patch so he was added to AUTHORS.
640 ! lib/Encode/Supported.pod
641 ISO-8859-11 != TIS 620
642 == TIS 620 + \xA0 ( )
644 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
646 1.77 2002/10/06 03:27:02
648 * Modified to accomodate up and comming patch by Inaba-san that
649 will fix tr/// needing eval qq{}
650 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
652 * pod fixes/enhancements to reflect the changes above
653 ! lib/Encode/Alias.pm
654 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
655 Message-Id: <20021001015648.GB18710@not.autrijus.org>
657 1.76 2002/08/25 15:09:51
659 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
660 but naturally identical to previous version -- dankogai
662 Codepoint fixes -- autrijus
663 Message-Id: <20020805040236.GC5220@not.autrijus.org>
665 copied everything under perl-5.8.0/ext/Encode to make sure Encode
666 is in sync w/ perl core
668 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
669 Now that binmode(FH) does implicit ":bytes" revisit
670 the failing tests. The worrisome one is the Digest::MD5
671 test-- how will it fare in CRLF lands now?
673 From: Radu Greab <radu@netsoft.ro>
674 Date: Mon, 10 Jun 2002 00:40:34 +0300
675 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
676 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
677 ones which I cannot figure out.
678 ! lib/Encode/Alias.pm
679 Subject: [Encode PATCH] spurious warning
680 From: Nicholas Clark <nick@unfortu.net>
681 Date: Sun, 2 Jun 2002 20:26:22 +0100
682 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
684 1.75 2002/06/01 18:07:49
685 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
686 glibc compliance cited by Autrijus.
687 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
688 ! bin/enc2xs bin/piconv
689 Subject: Re: forewarning: usedevel and versiononly
690 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
692 1.74 2002/05/28 18:33:15
693 + ucm/null.ucm ucm/ctrl.ucm
694 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
695 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
696 Subject: Re: unicode -> &# notation
697 Message-ID: <20020518193704.GB40272@not.autrijus.org>
699 1.73 2002/05/28 17:26:18
700 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
701 Chris Nandor has fixed Encode so that it works w/ MacPerl --
702 at least w/ PPC (68k need static linking which does not work due to
703 64k limit). pudge is added to AUTHORS (I'm surprised he was not
704 there in the list). Encode/Makefile_PL.e2x was additionally fixed
705 by dankogai to reflect changes in other Makefile.PL
706 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
708 Subject: Change 16746: -Mutf8 cleanup.
709 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
711 1.72 2002/05/20 15:49:56
713 Subject: [PATCH] Encode should be in perl-core library path
714 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
715 Message-Id: <20020520161201.A11019@alpha.hut.fi>
716 ! lib/Encode/MIME/Header.pm
717 Subject: [PATCH] Encode::MIME::Header
718 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
719 ! Encode/Makefile_PL.e2x
720 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
721 Message-Id: <20020519201031.GA1603@not.autrijus.org>
722 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
723 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
725 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
726 Subject: [PATCH] good day for WinCE port of perl.
727 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
730 Even more linting by Robin via @16532
732 Even more typecast by Sarathy in @16460
734 1.71 2002/05/07 16:22:42
736 even more typecasts by Robin
737 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
739 A very strange bug that was causing a bugus ucm -> C table
740 generation that was revealed by a UCM file that Andreas was
741 working. This is the king of wierdest bug I've encountered
742 in the course of Encode maintenance.
743 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
745 1.70 2002/05/06 10:26:48
747 Made more 'module-safe' with conjunction w/ 'no encoding'.
748 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
749 ! lib/Encode/Encoding.pm
750 'require Encode' because ->Define uses Encode::define_encoding();
751 problem and solution addressed by Miyagawa-kun
752 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
754 Cuts the frill to make djgpp happier, as suggested by Laszlo
755 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
757 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
758 Message-Id: <m3bsbug48n.fsf@anima.de>
760 1.69 2002/05/04 16:41:18
761 ! lib/Encode/MIME/Header
762 Floating-point coerced for UNICOS (in integer arithmetics it folds
763 line one character too early). Verification by Mark is pending.
764 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
766 more doc patch from Elizabeth
767 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
768 ! Encode/Makefile_PL.e2x
769 More platform-independent patch from Benjamin
770 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
771 ! lib/Encode/Guess AUTHORS
772 split regex fix by Graham Barr. Adds him to AUTHORS.
773 Message-Id: <20020504085419.E95940@valueclick.com>
774 ! Encode/Makefile_PL.e2x
775 enc2xs script discovery made smarter and more sensible, first cited
776 by Miyagawa-kun and further suggestions by Rafael and Andreas
777 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
778 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
780 1.68 2002/05/03 12:20:13
781 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
782 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
783 and Supported.pod modified to reflect the change. Elizabeth added
784 to Authors. And H.M. is also added for forwarding her patch among
785 other contributions (I was rather surprised to find his name was not
787 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
789 1.67 2002/05/02 07:33:09
791 Error message now consistent w/ perlqq (\N{U+} -> \x{})
792 done in perl@16308 but Philip linted me further. Now the error
793 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
794 ! lib/Encode/Guess.pm
795 Sanity check for happier -w by Autrijus
797 1.66 2002/05/01 05:41:06
798 ! Encode.xs t/fallback.t
799 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
800 while fallback is in effect. This even came with a welcome side-effect
801 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
802 also modified to test this.
803 And of course, the corresponding varialbles to UV[Xx]f are appropriately
804 cast. This should've concluded NI-XS homework.
806 encode(undef) does warn again! Repented upon suggestion by NI-XS.
807 Document for unless vs. '' added
808 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
810 1.65 2002/04/30 16:13:37
812 encode(undef) no longer warns for C<Use of uninitialized value in
813 subroutine entry>. Suggested by Paul.
814 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
815 ! lib/Encode/Supported.pod
816 Encode::MIME::Header and Encode::Guess mentioned
817 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
818 ! lib/Encode/Guess.pm
819 POD fix by Miyagawa-kun
820 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
822 1.64 2002/04/29 06:54:06
824 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
825 encocde("euc-jisx0213") are still DIFFERENT.
826 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
828 A few white spaces corrected by NI-XS via PerlIO integration to
830 Subject: Change 16247: Integrate perlio;
832 Document fixes by Andreas
833 Message-Id: <m3k7qsf1we.fsf@anima.de>
835 1.63 2002/04/27 18:59:50
836 ! lib/Encode/Encoding.pm
837 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
838 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
839 Make use of the Encode::Encoding base class!
840 And other cleanups in Encode.xs upon NI-XS suggestions
841 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
843 1.62 2002/04/27 11:17:39
845 encodings() now just check %ExtModule instead of eval{require}
846 all of them for ":all" to conserve more memory.
848 more "%x" -> "%" UVxf stuff.
850 s/=over2/=over 2/g # oops.
852 1.61 2002/04/26 03:02:04
854 Now does decent tests besides use_ok()
855 ! lib/Encode/Guess.pm t/guess.t
856 UI streamlined, document added
858 various signed/unsigned mismatch nits (#16173)
859 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
861 POD: utf8-flag-related caveats added. A few sections completely
865 Thou shalt not assume %d works, either!
866 Robin Baker added to AUTHORS for this
867 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
869 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
871 1.60 2002/04/24 20:06:52
873 "Thou shalt not assume %x works." -- jhi
874 Message-Id: <20020424210618.E24347@alpha.hut.fi>
875 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
876 low-memory build machines happy, now *.c is created for each *.ucm
877 (no table aggregation). You can still override this by setting
878 $ENV{AGGREGATE_TABLES}.
879 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
880 + lib/Encode/Guess.pm
881 + lib/Encode/JP/JIS7.pm
882 Encoding-autodetect (mainly for Japanese encoding) added. In a
883 course of development, JIS7.pm was improved.
884 + lib/Encode/HTML/Header.pm
885 + lib/Encode/Config.pm
886 MIME B/Q Header Encoding Added!
887 ! Encode.pm Encode.xs t/fallback.t
888 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
889 Message-Id: <20020424130709.GA14211@tanglefoot>
891 1.59 $ 2002/04/22 23:54:22
892 ! Encode.pm Encode.xs
893 needs_lines() and perlio_ok() are added to Internal encodings such
894 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
896 1.58 2002/04/22 23:54:22
898 s/MacChineseSimp/MacChineseTrad/ # ... oops.
903 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
904 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
905 t/(JP|KR).t are aggregated to t/CJKT.t
906 test data are all remade via bin/ucm2text.
907 And .... They are no longer skipped for -Uuseperlio !
909 1.57 2002/04/22 20:27:30
910 ! t/JP.t t/KR.t t/perlio.t
911 unless (find PerlIO::Layer 'perlio') ... line is back again.
912 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
913 need PerlIO. Sigh....
914 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
915 ->perlio_ok now does eval{ require PerlIO::encoding } there so
916 it correctly returns 1 when PerlIO::encoding is yet loaded.
918 perl-current patch #16072 reflected
920 1.56 2002/04/22 09:48:07
921 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
922 New PerlIO::encoding 0.04 compliance met
924 1.55 2002/04/22 03:43:05
925 ! Encode.pm Encode.xs Unicode/Unicode.pm
926 needs_lines() defined so Encode::Encoding is no longer needed
929 1.54 2002/04/22 02:50:01
930 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
931 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
932 now perlio_ok is true by default if PerlIO::encoding->VERSION is
933 0.03 or larger. POD in Encode::Encoding revised to reflect this.
934 Encode::XS and Encode::Unicode now has perlio_ok() method.
935 ! lib/Encode/Supported.pod
936 s/UP-UX/HP-UX/ by jhi
937 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
938 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
939 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
940 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
941 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
942 ! lib/Encode/Supported.pod
943 Huge document fixes by Philip.
946 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
947 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
949 binmode() after "<:encoding" to make Win32 happy, by Mattia.
950 Mattia added to AUTHORS file
951 Message-Id: <3CC3150F.5798.22A05AE@localhost>
953 1.52 2002/04/20 23:43:47
955 TODO: is now SKIP:, as NI-XS requested. Also adds more
956 eraborate failure analysis added.
958 A note on how to make sure of round-trip safety added to POD
959 section (so Autrijus is happier)
960 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
961 big5-(eten|hkscs) is round-trip safe again!
962 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
964 Typo fixes by Andreas
965 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
966 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
967 PerIO coodination patches from NI-XS.
968 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
970 1.51 2002/04/20 09:58:23
972 Updated test suite by Autrijis so "make test" is happy again
973 Message-Id: <20020420082104.GA25037@not.autrijus.org>
975 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
977 TW/TW.pm TW/Makefile.PL
978 Updates by Autrijus. 'big5' is no longer a canonical but an
979 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
980 Message-Id: <20020419195346.GA19597@not.autrijus.org>
982 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
983 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
985 PerlIO detection a little bit smarter; no longer uses eval qq{}
988 1.50 2002/04/19 06:13:02
989 ! ! Encode.pm Encode.xs Encode/encoding.h
991 New Fallback API imlemented and documented. See "perldoc Encode"
993 ! lib/Encode/JP/JIS7.pm Encode.pm
994 + lib/Encode/PerlIO.pod t/perlio.t
995 API compliance met. However, it still does not work unless perlio
996 implements line buffer. See BUGS section in perldoc Encode::PerlIO
997 As a sensible workaround, perlio_ok() added to Encode.
999 ! lib/Encode/Supported.pod
1001 Message-Id: <20020418174647.J8466@alpha.hut.fi>
1003 Doc fixes from Autrijus
1004 Message-Id: <20020418144131.GA10987@not.autrijus.org>
1006 perlqq mode documented
1008 + t/jisx0201.euc t/jisx0201.ref
1009 ! t/jisx0208.euc t/jisx0208.ref
1010 t/JP.t tests more rigorously and with other encodings
1011 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
1012 in jis0208 (used to contain jisx0201 part).
1013 ! Encode/Makefile_PL.e2x
1014 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
1015 enc2xs and encode.h rather than hard-coded. This allows the resulting
1016 module fully CPANizable.
1017 ! encoding.pm t/JP.t t/KR.t
1018 PerlIO detection simplified (checks %INC instead of eval{})
1019 ! Encode.xs Encode/encode.h
1020 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
1021 - lib/Encode/Unicode.pm
1022 (en|de)code_xs relocated to where it belongs. Source reindented
1025 Additional (U8 *) cast added as suggested by jhi
1026 Message-Id: <20020417165916.A28599@alpha.hut.fi>
1028 1.42 Date: 2002/04/17
1030 no-op module; Thought of adding a pod there but enc2xs has
1035 correct mechanism to detect Perlio::encoding layar installed.
1037 PerlIO Layer detached.
1039 1.41 2002/04/16 23:35:00
1041 binmode(STDIN|STDOUT ...) done iff PerlIO is available
1043 Cleaned up PerlIO skip conditions to prepare for the upcoming
1044 Encode - PerlIO forking.
1046 exported functions are now prototyped.
1047 ! lib/Encode/CN/HZ.pm
1050 fallback implemented # was /* FIXME */
1051 affected programs revised to fit (only HZ was using the try-catch
1052 approach which needed to be fixed for API-compliance).
1054 ! Encode/KR/2022_KR.pm
1056 can find =head1 NAME now, jhi
1057 Message-Id: <20020416083059.V30639@alpha.hut.fi>
1061 now complies with less warnings with the pickest compilers.
1062 Suggested by Craig, fixed by Dan.
1063 ! Encode/Makefile_PL.e2x
1065 A bug that fails to find *.e2x in certain conditions fixed
1067 1.40 2002/04/14 22:27:14
1068 + Encode/ConfigLocal_PM.e2x
1069 ! lib/Encode/Config.pm
1071 "enc2xs -C" now generates/updates Encode::ConfigLocal.
1072 ConfigLocal_PM.e2x is a skelton thereof.
1073 ! lib/Encode/Config.pm
1075 "use Encode::CN::HZ;" was missing.
1078 More rigorous tests added to test XS, especially on memory allocation.
1080 ! lib/Encode/Unicode.pm
1081 NI-S implemented an XS version -- merged
1082 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
1085 Source filter option added. With this option on, you can write
1086 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
1087 encodings. t/jperl.t enhanced to test this feature.
1089 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
1090 apply his suggestion, this degree of nitting is enough to add him
1092 Message-Id: <3CB93223.291E5E2E@earthlink.net>
1094 + lib/Encode/JP/JIS7.pm
1095 - lib/Encode/JP/JIS.pm
1096 - lib/Encode/JP/2022_JP.pm
1097 - lib/Encode/JP/2022_JP1.pm
1098 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
1099 JIS7.pm for better maintainability and performance
1101 Added caveat for non-ascii identifiers.
1103 fixes by jhi, the original author of this pragramtic module.
1104 Message-Id: <20020413231527.V1826@alpha.hut.fi>
1106 1.34 2002/04/12 20:23:05 (Unreleased)
1109 EBCDIC fixes addressed by jhi.
1110 Message-Id: <20020412161844.D9383@alpha.hut.fi>
1111 ! lib/Encode/Encoder.pm
1112 POD fix by Miyagawa-kun
1113 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
1115 1.33 2002/04/10 22:28:40
1117 Philip's mail address corrected.
1120 ! lib/Encode/Encoder.pm
1121 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
1122 This adds Miyagawa-kun to AUTHORS.
1123 * encoding() no longer exported by default but on demand
1124 * t/Encoder.t updated to test all these
1125 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
1126 ! lib/Encode/Unicode.pm
1127 ! lib/Encode/Supported.pm
1128 Further doc fixes by Anton
1130 1.32 2002/04/09 20:06:15
1133 - ucm/macDevanaga.ucm Unicode Character Map
1134 - ucm/macGujarati.ucm Unicode Character Map
1135 - ucm/macGurmukhi.ucm Unicode Character Map
1136 A utility to check integrity of .ucm files. t/bogus.ucm is a
1137 ucm that is deliberately bogus. unused Indic mappings are removed
1140 resolve_alias() added as suggested by jhi. Same as
1141 find_encoding("alias")->name. For convenience. This one is
1142 defined in Encode.pm instead of Alias.pm.
1143 Message-Id: <20020409215846.H17022@alpha.hut.fi>
1145 Memory Allocate but detected during the devel of ucmlint -- fixed.
1146 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1147 ! lib/Encode/Unicode.pm
1148 valid_ucs2(0) is false but must be true.
1149 3 patches from NI-S as follows. This also has fixed the incident
1151 ! lib/Encode/Alias.pm
1152 find_alias() recursion prevention
1154 Checks for the patch above
1155 ! t/Encode/Unicode.pm
1156 An extra "F" that causes valid_ucs2() return a bogus value fixed
1157 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1158 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1159 2 Small Patches from jhi as follows:
1161 Encode->encodings() lists in case-insensitve order (as it was)
1163 -l option prints avaiable encodings to STDOUT instead of STDERR
1164 ! lib/Encode/Aliases.pm
1165 s/defintion/definition/
1166 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1168 ! lib/Encode/Supported.pod
1169 ! lib/Encode/Unicode.pm
1170 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1171 Thank you for the exact quote of Douglas Adams :)
1172 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1174 1.31 2002/04/08 18:08:07
1175 ! lib/Encode/Encoder.pm
1177 Encode::Encoder, once just a placeholder of an idea, is now much more
1178 practical. See t/Encode.t to find how practical it can be.
1179 + lib/Encode/Config.pm
1181 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1182 in Encode::Config and they are all literally, not programatically
1183 set. My false laziness was resulting many encodings missing from
1185 ! lib/Encode/Unicode.pm
1187 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1188 so that it does not rely Encode::Unicode for BOM values
1189 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1191 1.30 2002/04/08 02:34:51
1192 + lib/Encode/Encoder.pm
1193 Object Oriented Encoder. I reckon something like this is in need.
1196 ! lib/Encode/Supported.pod
1197 * autoloading bug that prevented upper-case canonicals such as UTF-16
1198 is fixed. Now even UTF/UCS are autoloaded!
1199 * encodings() is now more intuitive.
1200 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1202 * Obligatory fixes to the POD.
1203 ! lib/Encode/Supported.pod
1204 Patch from Anton applied.
1205 Message-Id: <66641479.20020408033300@motor.ru>
1207 ! lib/Encode/Unicode.pm
1208 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1210 1.28 2002/04/07 18:58:42
1214 Just a MANIFEST for those missing files.
1216 1.26 Date: 2002/04/07 15:22:04
1219 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1220 And t/Aliases.t fixed to make sure they all exist.
1222 1.25 2002/04/07 15:01:25 (Unreleased)
1224 ! lib/Encode/Unicode.pm
1227 - lib/Encode/UTF_EBCDIC.pm
1228 - lib/Encode/Internal.pm
1229 - lib/Encode/utf8.pm
1230 Integrated into Encode.pm as closures. That way "one package, one file"
1231 rule is preserved yet less files to require.
1233 commented out binmode(STDERR ...
1237 ! EBCDIC/Makefile.PL
1240 ! Symbol/Makefile.PL
1242 ! Encode/Makefile_PL.e2x
1243 Schwarn's MM-compliance patch merged
1244 Message-Id: <20020406082609.GA28758@blackrider>
1246 ! lib/Encode/Unicode.pm
1247 + lib/Encode/UTF_EBCDIC.pm
1249 - lib/Encode/10646_1.pm
1250 - lib/Encode/ucs2_le.pm
1251 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1252 per-module based (en|de)code, I saved a number of .pm by
1253 reorganizing it as per-object base (Well, this is what Encode::XS
1254 does under the hood). See Encode::Unicode for details.
1255 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1256 This module is used only on EBCDIC environments.
1258 1.21 2002/04/05 14:46:34 (Not Released)
1264 Are back to make Perl/Tk happy Smile, NI-S.
1266 ! lib/Encode/Alias.pm
1267 ! lib/Encode/Supported.pm
1268 ! lib/Encode/10646_1.pm
1269 ! lib/Encode/ucs2_le.pm
1270 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1271 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1272 updated to reflect changes.
1273 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1274 ! lib/Encode/Alias.pm
1275 ! lib/Encode/Supported.pm
1276 Anton's revision commited. Added Dan's own fixes as well.
1277 Message-Id: <159103166906.20020405161134@motor.ru>
1278 ! lib/Encode/Alias.pm
1280 < qr/^UCS2-le$/i => '"UCS-2"', );
1282 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1283 Sigh. Thank you, Anton.
1284 Message-Id: <14567692196.20020405062020@motor.ru>
1285 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1287 1.20 2002/04/04 19:50:52
1289 the last minute addtion. Just give it a try. Docs remains to be done.
1290 Not installed by default.
1291 ! lib/Encode/Supported.pod
1294 ! lib/Encode/Alias.pm
1295 ! lib/Encode/utf8.pm
1296 ! lib/Encode/10464_1.pm
1297 ! lib/Encode/ucs2_le.pm
1298 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1299 unchanged but UTF-16BE is added as an alias. Implicit aliases
1300 move to Encode::Alias so init_alias() works more as expected.
1301 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1302 Though pedantically wrong, This should make perl mongers happier.
1303 t/Alias.t is enhanced to test all these.
1304 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1306 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1307 of the codepoints are common. CJKT left as is because the saving is
1311 ! EBCDIC/Makefile.PL
1313 ! Encode/Makefile_PL.e2x
1317 ! Symbol/Makefile.PL
1321 All occurance of _def.h replaced with .exh so djgpp works happily
1322 ever after! To credit this amazing discovery, Laszlo is now in
1324 Message-Id: <20020403181424.GA8778@freemail.hu>
1325 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1328 ! Encode/Makefile_PL.skel
1330 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1331 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1333 Two more tests by added jhi
1334 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1337 The showstopper fixed -- Memory reallocation bug was causing
1338 Encode::XS to fall into infinite loop on certain conditions.
1339 t/grow.t tests that.
1340 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1345 ! lib/Encode/Supported.pod
1346 Vendor encodings rebuilt out of original map files at unicode.org.
1347 Indic languages such as MacDevanagali remain unspported do to the
1348 shortcoming of encengine capabilities (they need algorithmical
1349 conversion and I have no knowledge on that!). Pods fixed for added
1351 Oh, macJapan.ucm renamed to macJapanese.ucm.
1352 macROMnn is macRomanian and macRUMnn is macRumanian.
1353 txt2ucm is a crude script that is used to convert them.
1355 Unicode Compound Characters (used extensively on Mac) supported
1357 Typo fixes and improvements by jhi
1358 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1360 1.11 2002/03/31 22:12:13
1364 Missing files from the MANIFEST fixed.
1365 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1366 Version incremented just to make CPAN happy.
1368 1.10 2002/03/31 21:32:42
1371 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1372 if you want. This should make Autrijus happy. Also, piconv
1373 is added to default install.
1376 Here-documented files that enc2xs generates are now exported
1377 to *.e2x. Much cleaner and easier to debug.
1379 encoding enhances so you can make it act more like such
1380 (now prehistoric ) "localized" variations of perl like Jperl.
1382 Further test for encoding.pm. Written in euc-jp
1385 Taken over form jhi.
1386 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1391 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1398 ! Encode/macIceland.ucm
1399 ! lib/Encode/Alias.pm
1400 ! lib/Encode/Supported.pod
1401 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1402 Message-Id: <m3lmcavhjt.fsf@anima.de>
1404 1.01 2002/03/29 20:59:39
1407 s/USE_SCRIPTS/MORE_SCRIPTS/
1409 installs enc2xs by default for external Encode:: modules in CPAN,
1410 such as Encode::HanExtra
1412 More sensible perl core detection via $ENV{PERL_CORE}
1414 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1416 Perl core ditection via $^X =~ m/\bminiperl$/o
1417 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1419 1.00 Wed Mar 29 2002
1421 The version of all files is updated to 1.00 via "ci -f -l1.00",
1422 commemorating version 1.00. All files, including *.ucm are now
1423 under version control.
1426 encode.h moved to Encode/ so it will be installed for the later
1429 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1434 compile renamed to enc2xs.
1435 Affected Makefle.PL updated
1437 "Punt it. HanExtra can take care of that later." -- Autrijus
1438 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1441 ! Encode/ksc5601.ucm
1442 ! lib/Encode/CJKConstants.pm
1443 ! lib/Encode/KR/2022_KR.pm
1444 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1445 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1449 bin/ added for example scripts. They are not installed by default.
1450 to install them, "perl Makefile.PL USE_SCRIPTS".
1451 piconv is iconv reinvented in perl. in addition to all features
1452 of iconv, it also adds perlish features. See L<piconv/1> for more
1454 ! lib/Encode/Alias.pm
1455 qr/^ replaced with qr/\b so it directly matches locale names
1456 such as en_US.US-ASCII
1459 Patch by MJD to fix the following problem applied.
1460 Subject: [PATCH 5.7.3 Encode]
1461 Aliases.t not properly skipped when Encode extension not built
1462 Message-Id: <20020328091850.18677.qmail@plover.com>
1463 ! lib/Encode/KR/2022_KR.pm
1464 ! lib/Encode/CJKConstants.pm
1465 Another patch from Jungshik to make iso-2022-kr actually work
1466 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1467 ! Encode/Encode/euc-kr.ucm
1468 + Encode/Encode/johab.ucm
1469 ! Encode/Encode/ksc5601.ucm
1471 ! Encode/KR/Makefile.PL
1472 ! Encode/lib/Encode/Alias.pm
1474 Johab support and complete revision of Korean Encoding by Jungshik
1475 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1477 Revised to make up with now-dropped Encode::Details.
1478 - lib/Encode/Details.pod
1479 Dropped. Besides being obsolete, the topics are now covered in
1480 respective pods now.
1485 Korean aliases fixed thanks to Jungshik Shin
1486 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1487 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1489 =head1 NAME added to all modules to make buildtoc happy
1490 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1491 - lib/Encode/CJKguide.pod
1492 Too controversial and dropped from the dist. Will be available
1493 separately on the web.
1495 RCS tags added so table debugging gets easier (should that be
1496 needed! I hope they all stay 1.00!)
1497 + lib/Encode/CJKguide.pod
1498 A detailed guide to mainly, but not limited to, CJK multibyte
1501 + Encode/hp-roman8.ucm
1503 ! Encode/Supported.pod
1504 All occurance of "roman8" replaced with "hp-roman8" to avoid
1506 ! Encode/Supported.pod
1509 Mac Encodings now comply the Inside Macintosh
1511 Test for '-raw' conventions added.
1513 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1514 ! Encode/gb12345.ucm
1516 ! Encode/ksc5601.ucm
1517 "-raw" appended to canonical names.
1518 File mames stay unchanged thanks to UCM format.
1519 ! lib/Encode/CN/HZ.pm
1520 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1521 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1523 0.99 Tue Mar 26 2002
1524 - lib/Encode/JP/Const.pm
1525 + lib/Encode/CJKConstants.pm
1526 + lib/Encode/CN/2022_CN.pm
1527 + lib/Encode/KR/2022_KR.pm
1536 * Support for ISO-2022-KR and ISO-2022-CN added.
1538 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1539 * ucm2table autogenerates character table out of UCM files.
1542 - lib/Encode/Supports.pod
1543 + lib/Encode/Supported.pod
1544 Names reverted due to popular demand.
1545 8.3 rule applies only when there is a conflict.
1546 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1551 - lib/Encode/Format/Enc.pod
1553 * Character tables is now 100% ucm.
1554 * All files under Encode/ is now 8.3-compliant
1555 * some of missing encodings added (i.e. gsm0338 and nextstep)
1556 * Vendor mappings aggregated with appropriate national std in
1557 Makefile.PL, resulting smaller *.so especially for CJK.
1558 Following is result on Dan's FreeBSD box.
1560 ---------------------------------------------------------------
1561 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1562 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1563 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1564 blib/arch/auto/Encode/Encode.so 27,791 27,791
1565 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1566 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1567 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1568 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1569 ---------------------------------------------------------------
1570 Total 5,375,297 6,343,819
1572 * As a result of ucm-transition, Encode::Tcl dropped because
1573 Encode::Tcl demands *.enc.
1574 Encode::Tcl will be supplied in a separate tarball with *.enc.
1575 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1580 -lib/Encode/Supported.pod
1581 +lib/Encode/Supports.pod
1582 -lib/Encode/iso10646_1.pm
1583 +lib/Encode/10646_1.pm
1584 -lib/Encode/EncFormat.pod
1585 +lib/Encode/Format/Enc.pod
1586 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1587 - lib/Encode/JP/Constants.pm
1588 + lib/Encode/JP/Consts.pm
1589 ! lib/Encode/JP/JIS.pm
1590 ! lib/Encode/JP/H2Z.pm
1591 Version nit problem and 8.3 rule fix.
1592 > Package namespace installed latest in CPAN file
1593 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1594 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1595 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1596 accordingly. In addition, legacy "use vars qw()..." are replaced with
1598 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1599 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1601 - lib/Encode/JP/ISO_2022_JP.pm
1602 - lib/Encode/JP/ISO_2022_JP_1.pm
1603 + lib/Encode/JP/2022_JP.pm
1604 + lib/Encode/JP/2022_JP1.pm
1606 8.3 naming conflict for vanilla fat addressed by jhi
1607 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1610 Typecast fix addressed by jhi
1611 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1613 0.98 Mon Mar 25 2002
1614 ! lib/Encode/Supported.pod
1616 + lib/Encode/JP/ISO_2022_JP_1.pm
1617 ! lib/Encode/JP/ISO_2022_JP.pm
1618 ! lib/Encode/JP/JIS.pm
1620 Now Encode::JP is more strict on the difference between ISO-2022-JP
1621 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1622 makes Anton happier :) FYI the previous version implements
1623 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1624 ! lib/Encode/Supported.pod
1627 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1628 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1633 ! lib/Encode/Suppoted.pod
1634 pod fixes to replace F<http://...> to L<http://...>,
1635 as suggested by Autrijius in:
1636 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1637 ! lib/Encode/Suppoted.pod
1638 fixes and enhancements by Anton
1639 Message-Id: <10632060120.20020324103753@motor.ru>
1640 ! lib/Encode/Alias.pm
1641 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1642 added. Suggested by Anton then deobfuscated by Autrijius
1643 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1645 Further fix by Nicholas Clark
1646 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1647 - lib/EncodeFormat.pod
1648 + lib/Encode/EncFormat.pod
1650 File renamed as suggested by Autrijius
1652 ! lib/Encode/Details.pod
1653 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1654 ! Encode.pm Sun Mar 24 13:43:47 2002
1655 pod fixes by Autrijius.
1656 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1657 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1659 ! lib/Encode/Alias.pm
1661 now more EBCDIC conscious;
1662 %ExtModules on EBCDIC system excludes CJK so that you don't
1663 have to worry about the matched alias resulting cloaking.
1664 t/Alias.t also revised to reflect changes. Verified by jhi
1665 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1667 0.97 Sun Mar 24 2002
1671 EBCDIC detection mechanism installed as in JP/JP.pm
1672 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1675 ! EBCDIC/Makefile.PL
1678 ! Symbol/Makefile.PL
1680 Now all table files used by compile are postfixed '_t' to avoid
1681 namespace collisions in case insensitive file systems once for all!
1683 Message-ID: <58290227735.20020323195659@familiehaase.de>
1685 Since the Encode::JP is unsupported under EBCDIC we
1686 cannot run this test (aliases as such should work fine) -- jhi
1687 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1689 duplicate occurance of ascii.ucm and 8859-1.ucm
1690 causes MacOS X dlyd to cloak
1696 < chdir 't' if -d 't';
1698 > if (! -d 'blib' and -d 't'){ chdir 't' };
1699 When you are "make test"-ing on Encode/ directory, you must not
1700 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1701 remain unchanced. Also the situation detection was made simpler
1702 in t/JP.t, which was originally;
1703 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1705 "Use of uninitialized value in string eq at Encode.pm line 96."
1706 ! Symbol/Makefile.PL
1707 ! EBCDIC/Makefile.PL
1709 -- Problem on case insensitive file systems
1710 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1711 Message-ID: <88254111953.20020323095503@familiehaase.de>
1714 "So I think it's a bug in gcc, not perl. But it still needs to be
1716 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1717 Message-Id: <20020323170509.C96475@plum.flirble.org>
1719 0.96 Sat Mar 23 2002
1721 ! lib/Encode/Encoding.pm
1722 ! lib/Encode/Alias.pm
1723 ! lib/Encode/Supported.pod
1725 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1726 Message-ID: <20020322073908.GB10539@blackrider>
1729 "...I think we should include ISO 8859-1 as well." -- NI-S
1730 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1735 ! lib/Encode/Alias.pm
1736 alias definitions relocated to Encode::Alias so module autoloading
1737 works for aliases also.
1739 encodings() now accepts args to check ExtModules.
1743 + EBCDIC/Makefile.PL
1744 + Symbol/Makefile.PL
1748 Latin and single byte encodings are reorganized so they are
1749 demand-loaded like Encode::XX. Now only ascii is compiled into
1751 ! lib/Encode/Alias.pm
1752 for my $k (keys %hash){ delete $hash{$k}; }
1753 is depreciated; fixed.
1755 0.95 Fri Mar 22 2002
1756 In this update, pod rewrites and alias fixes are the main issues
1757 + lib/Encode/Supported.pod
1758 Describes supported encodings
1760 streamlined compiled-in encodings.
1761 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1763 + Encode/ibm-125?.ucm
1764 Added from icu distibution with any occurance of
1765 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1766 some respect to icu staff, however.
1767 + lib/Encode/Alias.pm
1769 Alias difinitions in Encode.pm relocated.
1772 packWARN patch from Paul Marquess via jhi
1773 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1774 Paul added to AUTHORS as a result.
1775 ! t/CJKalias.t -> t/Aliases.t
1776 Renamed. Checks even more aliases and alias overloading
1779 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1780 gbk => cp936 relocated to CN.pm
1782 Test::More with plans (by jhi)
1784 0.94 Thu Mar 21 2002
1785 + lib/Encode/Description.pod
1786 ! lib/Encode/Encoding.pm
1787 Now the pod in Encode.pm is abridged as programming references.
1788 lib/Encode/Description.pod contains the original, detailed description
1789 and Encode::Encoding explains how to write your own module to
1790 add new encodings. So far, lib/Encode/Description.pod contains
1791 the whole pod once in Encode.pm. This is intentional.
1793 Pod revisions by Anton Tagunov
1794 Message-Id: <517178431.20020320174824@motor.ru>
1796 all occrance of Encode::Tcl::Extended removed including pod
1798 test now checks $encoding->name only; $encoding->{name} are
1799 no longer check to find the canonical name.
1800 ! lib/Encode/JP/JIS.pm
1801 ! lib/Encode/JP/ISO_2022_JP.pm
1802 ->name() added to be more compliant with API
1808 Patch by Autrijus to add aliases to TW and fixes to POD
1809 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1811 SADAHIRO Tomoyuki added as should. My apologies.
1813 0.93 Wed Mar 20 2002
1814 * First release to be uploaded to CPAN. For prehistoric changes,
1815 please see Changes file of perl distibution as well as
1816 perl-unicode@perl.org archive, available at:
1817 http://archive.develooper.com/perl-unicode@perl.org/
1819 Changes Since 0.92 includes;
1824 + Mention to perl-unicode@perl.org added
1826 + Encoding aliases added so you can feed locale names
1827 and MIME Charset="" directly.
1828 - Mention to JISX0212 removed because it's fixed
1831 + Encoding aliases added. Note TW is left untouched because
1832 euc-tw is not implemented in TW but in Encode::HanExtra.
1833 Autrijus, you may fix Encode::HanExtra.
1835 + to test encode aliases added