1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.29 2009/02/01 13:14:37 dankogai Exp $
5 $Revision: 2.29 $ $Date: 2009/02/01 13:14:37 $
7 VERSION++ just to make PAUSE happy
8 Message-Id: <877i4anwwt.fsf@k75.linux.bogus>
10 2.28 2009/02/01 12:30:18
12 The latest refactoring broke backward compatibility
13 w/ Perl 5.8.6 and before; now restored.
14 Message-Id: <1233185156.DABa130.74940@basic2.hostingcompartido.com>
15 Message-Id: <693254b90902010027x277a5d0fm4f5700ba2f276239@mail.gmail.com>
16 ! lib/Encode/MIME/Header.pm
17 Addressed: Split header lines are joined incorrectly
18 http://rt.cpan.org/Ticket/Display.html?id=42902
20 2.27 2009/01/21 22:55:07
21 ! lib/Encode/MIME/Header.pm t/mime-header.t
22 Addressed: Encode::MIME::Header MIME-Q encoding truncates
23 trailing zeros in some circumstances
24 http://rt.cpan.org/Ticket/Display.html?id=42627
26 Added alias: unicode-1-1-utf-7
27 http://rt.cpan.org/Ticket/Display.html?id=38558
29 Documented: _utf8_on() does not work for tainted values
30 http://rt.cpan.org/Ticket/Display.html?id=41163
32 s[oss.software.ibm.com/icu][www.icu-project.org]g
33 http://rt.cpan.org/Ticket/Display.html?id=40245
34 ! lib/Encode/Guess.pm t/guess.t
35 Addressed: Empty file should produce an error message
36 http://rt.cpan.org/Ticket/Display.html?id=38652
37 ! Unicode/Unicode.xs AUTHORS
38 Refactored by Alex Davies
39 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/2007-10/msg00745.html
40 Message-Id: <7637669B2E3D46B187591747DA27F4C8@Amelie>
42 2.26 2008/07/01 20:56:17
44 Absence of Encode::ConfigLocal no longer carps no matter what.
45 http://bugzilla.redhat.com/show_bug.cgi?id=435505#c2
46 http://rt.cpan.org/Ticket/Display.html?id=28638
47 http://rt.cpan.org/Ticket/Display.html?id=11511
49 use encoding 'utf8' and 'iso-2022-jp' glitches on perl 5.10
51 Message-Id: <693254b90807011224h3ab50d76v50c6fea87baf223c@mail.gmail.com>
52 ! lib/Encode/Alias.pm t/Aliases.t
53 'macintosh' not recognized as MacRoman
54 http://rt.cpan.org/Ticket/Display.html?id=36326
56 s{INC => "-I./Encode"}
57 {INC => '-I' . File::Spec->catfile( '.', 'Encode' )}
58 To prevent some platforms from forgetting to include Encode/encode.h.
59 http://rt.cpan.org/Ticket/Display.html?id=36348
61 2.25 2008/05/07 20:56:05
63 added ':default' to Exporter option.
64 ! lib/Encode/GSM0338.pm
65 GSM0338 now handles coderef in CHECK
66 http://rt.cpan.org/Ticket/Display.html?id=31335
68 Perl 5.10/Encode 2.24: Tiny typo in Encode's Makefile.PL arg processing
69 Message-Id: <961C2A4F-92B3-416D-A9F9-E7B0ADA9F134@fsck.com>
71 "This fix for Encode::Alias should make Solaris happy:"
72 Message-ID: <47D886D9.6060001@iki.fi>
74 2.24 2008/03/12 09:51:11
75 ! lib/Encode/Config.pm
76 adds and fixes also adds cp858 support.
77 ! Encode.pm encoding.pm lib/Encode/Alias.pm ucm/cp858.ucm
79 > Change 33486 by rgs@scipion on 2008/03/12 08:50:11
80 An unfortunate side-effect of Encode and Encode::Alias use'ing each
81 other, and Encode::Alias exporting functions into Encode for it to use
82 as methods, broke the loading of the find_alias() Encode method in some
83 cases since 5.10. Breaking the recursive inheritance fixes it.
84 Message-Id: <b77c1dce0803120151o4166c3a0gfcfd14681ab7e10d@mail.gmail.com>
87 Message-Id: <200711281835.36125@bloodgate.com>
89 Fix by MIYAGAWA via CodeRepos
90 http://coderepos.org/share/changeset/1791
91 ! encoding.pm t/mime_header_iso2022jp.t
92 ported back from Perl 5.10-RC1
94 2.23 2007/05/29 18:15:32
96 got rid of global fallback_cb; encode_method() now takes one more
97 argument which is a coderef to fallback. This should make
98 encode_method() thread-safe.
100 Added perluniintro, perlunifaq, and perlunitut to POD
102 Plug a memory leak in Encode -- by rgs
103 Message-Id: <b77c1dce0705290858v2be239c3o2d726e3d59091493@mail.gmail.com>
105 POD fixes on UTF-16LE
106 http://aspn.activestate.com/ASPN/Mail/Message/perl5-porters/3486118
108 man page generation is now conditional; yes by default but no if $PERL_CORE
109 Message-Id: <b77c1dce0705290237h5c4667cdlf79a48b839170add@mail.gmail.com>
111 2.22 2007/05/29 07:35:27
113 from_to() does not honor the check while decoding. That's a feature.
114 To make sure it is a feature it is mentioned in the POD.
115 http://rt.cpan.org/NoAuth/Bug.html?id=27277
117 Encode used to suppress man page generation. Now it generates them.
118 http://rt.cpan.org/NoAuth/Bug.html?id=27200
119 ! Encode.pm Encode.xs t/fallback.t
120 Addressed: (de|en)code("ascii", "\x{3000}", sub{ $_[0] }) segfaults
123 2.21 2007/05/12 06:42:19
124 + lib/Encode/MIME/Name.pm t/mime-name.t
125 ! Encode.pm Encode.xs lib/Encode/Encoding.pm
126 new method: mime_name()
127 inspired by: MIYAGAWA
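A rough illustration of the new method (a sketch only, assuming an installed Encode that already ships mime_name(); the variable names are made up for this example). mime_name() reports the name used for MIME purposes, while name() reports Encode's own canonical name:

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Look up an encoding object by alias, then ask for both names.
my $enc       = find_encoding('latin1');
my $canonical = $enc->name;         # Encode's canonical name
my $mime      = $enc->mime_name;    # name registered for MIME use

print "$canonical => $mime\n";      # iso-8859-1 => ISO-8859-1
```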
129 Subject: Re: Compress::Zlib, pack "C" and utf-8 [PATCH]
130 From: Marc Lehmann <schmorp@schmorp.de>
131 Date: Thu, 12 Apr 2007 08:41:53 +0200
132 Message-ID: <20070412064153.GA22475@schmorp.de>
133 http://public.activestate.com/cgi-bin/perlbrowse/p/31194
136 Message-Id: <20070417220547.GA11999@zetta.zet>
138 2.20 2007/04/22 14:56:12
140 Pod fixes. Now find_encoding() is explained in more detail.
141 + lib/Encode/GSM0338.pm
143 ! lib/Encode/Supported.pod lib/Encode/Config.pm Bytes/Makefile.PL t/gsm0338.t
144 ETSI GSM 03.38 support is relocated from Encode::Byte to Encode::GSM0338.
145 This encoding is so kaputt it is unfit for Encode::XS!
146 Though it was okay for general cases and escape sequences,
147 '\0' => '@' IFF '\0\0' => '\0' had glitches.
148 So kaputt even t/gsm0338 wrongly interpreted that.
149 ref. http://www.csoft.co.uk/sms/character_sets/gsm.htm
150 ! encoding.pm t/Aliases.t
151 Imported from bleedperl #31015
153 2.19 2007/04/06 12:53:41
154 ! lib/Encode/JP/JIS7.pm
156 encode('iso-2022-jp') fallback support added by MIYAGAWA++
157 decode()'s fallback remains unchanged (FB_PERLQQ) since UTF-8
158 contains all characters in iso-2022-jp so there's no need for fancy stuff.
159 Message-Id: <693254b90704060526s6d850320h71cdda50dfbf7eba@mail.gmail.com>
161 #25216 ([PATCH] Encode.pm: postpone the load of Encode::Encoding)
162 http://rt.cpan.org/NoAuth/Bug.html?id=25216
163 ! lib/Encode/MIME/Header.pm t/mime-header.t
164 #24418 (Encode::MIME::Header: wrong encoding with latin1 characters)
165 http://rt.cpan.org/NoAuth/Bug.html?id=24418
167 #23876 (Add documentation for LEAVE_SRC)
168 http://rt.cpan.org/NoAuth/Bug.html?id=23876
169 ! lib/Encode/Alias.pm t/Aliases.t
170 #20781: Thai encoding needs alias for tis-620
171 http://rt.cpan.org/NoAuth/Bug.html?id=20781
173 #20344: piconv: wrong conversion of utf-16le encoded files (with PATCH)
174 http://rt.cpan.org/NoAuth/Bug.html?id=20344
175 ! Encode.pm Encode.xs bin/enc2xs encoding.pm t/Aliases.t t/utf8strict.t
176 Imported from bleedperl's 2.18_01
178 2.18 2006/06/03 20:28:48
180 overhauled the -C option
181 - added 'ascii-ctrl', 'null', 'utf-8-strict' to core
182 - auto-generated Encode::ConfigLocal no longer uses a v-string for version
183 - now searches modules via File::Find so Encode/JP/Mobile is happy
184 ! Byte/Byte.pm CN/CN.pm EBCDIC/EBCDIC.pm JP/JP.pm KR/KR.pm Symbol/Symbol.pm
185 use strict added; though all they do is load XS, it's
186 still a better practice
188 use warnings added to all of them for better practices' sake.
190 2.17 2006/05/09 17:10:09
192 'chin' =~ /^zh_CN|chin(?:a|ese)?$/i is true
193 but chin is not china or chinese.
194 http://d.hatena.ne.jp/jankogai/20060508/1147090316
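Why the quoted pattern matches 'chin' can be sketched as follows. The $tight pattern is a hypothetical tightened form written for this illustration; it is not necessarily the pattern that was actually committed:

```perl
use strict;
use warnings;

# Pattern quoted in the entry above. The alternation splits it into
# "^zh_CN" OR "chin(?:a|ese)?$", so neither branch is anchored at both ends
# and the suffix (a|ese) is optional.
my $loose = qr/^zh_CN|chin(?:a|ese)?$/i;

# Hypothetical tightened form: group the alternatives, anchor both ends,
# and make the suffix mandatory.
my $tight = qr/^(?:zh_CN|chin(?:a|ese))$/i;

for my $name (qw(chin china Chinese zh_CN machin)) {
    printf "%-8s loose=%d tight=%d\n",
        $name,
        ( $name =~ $loose ? 1 : 0 ),
        ( $name =~ $tight ? 1 : 0 );
}
```

With the loose pattern both 'chin' and 'machin' match; with the grouped form only 'china', 'Chinese' and 'zh_CN' do.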
196 Integrated maintperl change (27824|27824) which I overlooked
197 -- sorry, Nicholas and Coverity Scan.
198 Message-Id: <200604152115.k3FLF1Ar014538@smtp3.ActiveState.com>
199 Message-Id: <200605091615.k49GF1gJ016777@smtp3.ActiveState.com>
201 2.16 2006/05/03 18:24:10
203 --xmlcref and --htmlcref added.
205 Copyright Notice Added.
206 http://rt.cpan.org/NoAuth/Bug.html?id=19056
208 Replaced remaining ^\t with q( ) x 4. -- Perl Best Practices, p. 20
209 And all .pm's are now perltidy-ed.
211 2.15 2006/04/06 15:44:11
213 Addressed: UTF-16, UTF-32, UCS, UTF-7 decoders mishandle illegal characters
214 http://rt.cpan.org/NoAuth/Bug.html?id=18556
216 added str2bytes() as an alias to encode() and bytes2str() as an alias to decode()
218 http://rt.cpan.org/NoAuth/Bug.html?id=17103
220 Change 26922: Avoid warning with MS Visual C compiler.
221 Message-Id: <200601231245.k0NCj2dw009484@smtp3.ActiveState.com>
223 Change 26067: As using -C to turn on utf8 IO is equivalent to the open pragma
224 Message-Id: <200511092227.jA9MRcYD009025@smtp3.ActiveState.com>
226 2.14 2006/01/15 15:43:36
228 Change 26295: Don't build manpages for Encode and Unicode::Normalize
229 Message-Id: <200512071540.jB7Fe4Gt017960@smtp3.ActiveState.com>
231 Change 26081: Pod nit in Encode.pm, found by Marc Lehmann in RT #36949.
232 Message-Id: <200511110357.jAB3vZcP023647@smtp3.ActiveState.com>
233 ! Encode.xs Encode/encode.h bin/enc2xs encengine.c
234 Change 25821: Mark more static Encode data structures as const.
235 Change 25823: use more 'const' in the Encode data structures.
236 Message-Id: <200510221243.j9MChTSu027711@smtp3.ActiveState.com>
237 Message-Id: <200510221343.j9MDhTk9001245@smtp3.ActiveState.com>
239 2.13 2006/01/15 15:06:36
241 Miyagawa's mail address updated
242 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
243 ! lib/Encode/MIME/Header.pm
244 #16413: Encode::MIME::Headers patch to solve what is probably someone else's bug
245 http://rt.cpan.org/NoAuth/Bug.html?id=16413
246 ! lib/Encode/MIME/Header.pm t/mime-header.t
247 Applied: RT #16258: Support for RFC 2184 language tag
248 http://rt.cpan.org/NoAuth/Bug.html?id=16258
250 Fixed RT #14559: fix for #8872 introduces new "bug"
251 http://rt.cpan.org/NoAuth/Bug.html?id=14559
254 from_to() now makes use of $check more naturally.
255 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
256 <B10B4DE2-9BAF-4344-B3BE-2119977D817A@dan.co.jp>
258 2.12 2005/09/08 14:17:17
259 ! Encode.xs Encode.pm t/fallback.t
260 Now accepts coderef for CHECK!
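A minimal sketch of a coderef CHECK, assuming an Encode at least as new as this release (the sample string and the replacement format are arbitrary choices for the example). The callback is handed the ordinal of each character that cannot be encoded and returns the text to substitute:

```perl
use strict;
use warnings;
use Encode qw(encode);

# U+3000 (IDEOGRAPHIC SPACE) has no ASCII representation.
my $text = "a\x{3000}b";

# The callback receives the ordinal of the unencodable character
# and returns the replacement text.
my $ascii = encode( 'ascii', $text, sub { sprintf '<U+%04X>', shift } );

print "$ascii\n";    # a<U+3000>b
```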
262 Updated to newer version at unicode.org
263 http://rt.cpan.org/NoAuth/Bug.html?id=14222
264 ! lib/Encode/Supported.pod
266 <42F5E243.80500@gmail.com>
268 More POD typo leftover fixed.
269 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com>
271 2.11 2005/08/05 10:58:25
273 To reflect changes below
274 ! Encode.pm encoding.pm
275 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod
276 Typo fixed by Piotr Fusik in Change 25261 & 25266
277 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec>
279 Addresses "BUG REPORT: panic in Encode.xs".
280 Message-Id: <42EDDA97.2010608@hyper.to>
281 + lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t
282 ! lib/Encode/MIME/Header.pm lib/Encode/Config.pm
283 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka
284 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp>
285 ! Encode/encode.h Encode.pm Encode.xs
286 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character.
287 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp>
288 See also ext/PerlIO/encoding/encoding.pm
289 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com>
291 2.10 2005/05/16 18:46:36
293 fixed decode_utf8() according to RT#8872
294 http://rt.cpan.org/NoAuth/Bug.html?id=8872
296 s/SvIVX/SvIV_set/ by Steve Peters.
297 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com>
301 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict
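The distinction can be seen from the canonical names (a small sketch, assuming an Encode that includes utf-8-strict; it only shows name resolution, not the difference in strictness):

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# "UTF-8" (with hyphen) resolves to the strict encoding,
# "utf8" to Perl's lax flavour.
for my $label ( 'UTF-8', 'utf8' ) {
    printf "%-6s => %s\n", $label, find_encoding($label)->name;
}
```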
303 Tests utf-8-strict, according to
304 "UTF-8 decoder capability and stress test" by Markus Kuhn
305 http://smontagu.damowmow.com/utf8test.html
306 Note that malformed and overlong sequences are not tested here
307 because perl already does that for you, utf-8-strict or not.
307 because perl already does that for you, utf-8-strict or not.
308 ! Encode.pm Encode/encode.h t/fallback.t
309 Addressed "encode(..., Encode::LEAVE_SRC) does not work".
310 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so
311 you can (en|de)code constant strings with these fallbacks.
312 http://rt.cpan.org/NoAuth/Bug.html?id=8736
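For illustration (a sketch, assuming an Encode with this change applied; the sample character is arbitrary), the three fallbacks can be used directly on a string literal, which could not be modified in place:

```perl
use strict;
use warnings;
use Encode qw(encode);

# U+263A (WHITE SMILING FACE) cannot be represented in ASCII, and the
# source is a string literal, so it must be left untouched.
my $perlqq = encode( 'ascii', "\x{263A}", Encode::FB_PERLQQ() );
my $html   = encode( 'ascii', "\x{263A}", Encode::FB_HTMLCREF() );
my $xml    = encode( 'ascii', "\x{263A}", Encode::FB_XMLCREF() );

# Prints a Perl-style escape, a decimal character reference,
# and a hexadecimal character reference.
print "$perlqq $html $xml\n";
```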
313 ! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t
314 Make Encode.pm support the real UTF-8, by GAAS
315 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com>
316 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com>
317 ! Encode.pm Encode.xs
318 post-2.09 comment patches from GAAS applied.
319 Message-Id: <lroehacz6q.fsf@caliper.activestate.com>
320 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com>
322 2.09 2004/12/03 19:16:53
323 ! Encode.pm Encode.xs
324 Addressed " :encoding(utf8) broken in perl-5.8.6".
325 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com>
327 Addressed "(de|en)code($valid_encoding, undef) does not warn".
328 http://rt.cpan.org/NoAuth/Bug.html?id=8723
329 ! Encode.pm t/Encode.t
330 Addressed "Can't encode URI". When a reference is fed to (en|de)code,
331 Encode now stringifies instead of returning undef.
332 http://rt.cpan.org/NoAuth/Bug.html?id=8725
333 ! Encode.xs t/fallback.t
334 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder".
335 http://rt.cpan.org/NoAuth/Bug.html?id=8694
337 Addressed "s/digit/number/".
338 http://rt.cpan.org/NoAuth/Bug.html?id=8695
340 Addressed "while (defined(read )) { ... } is an infinite loop".
341 http://rt.cpan.org/NoAuth/Bug.html?id=8696
343 Addressed "What the heck is UCM?".
344 Document fixed so that it no longer contains "UCM-Based Encodings".
345 http://rt.cpan.org/NoAuth/Bug.html?id=8697
347 2.08 2004/10/24 13:00:29
348 ! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs}
349 Resolved the issue that was raised by 2.07 -- the Encode::utf8 fallbacks
350 that were introduced messed up PerlIO::encoding.
351 * To do so, ->renew() is renewed and ->renewed() was introduced to
352 tell whether the caller is PerlIO or not.
353 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp>
355 2.07 2004/10/22 19:35:52
356 ! lib/Encode/Encoding.pm
357 "Remove Carp from warnings.pm" that influences Encode, by Tels.
358 Message-Id: <200410161618.29779@bloodgate.com>
359 ! Encode.xs AUTHORS t/fallback.t
360 Now Encode::utf8's fallbacks are compliant with the Encode standard.
361 Thanks to Bjoern Hoehrmann for persistently convincing me.
362 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
366 2.06 2004/10/22 06:23:11
368 RT #8083 reports that MacThai mapping was obsolete
369 Updated all mac* encodings according to the URI below.
370 One remaining mystery is MacRomanian vs. MacRumanian.
371 MacRumanian is not found in unicode.org...
372 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
373 ! Encode.pm t/Encode.t
374 Fixed RT #8081: "decode(..., bless{},'x') segfault"
375 Two more tests added to test that.
376 http://rt.cpan.org/NoAuth/Bug.html?id=8081
378 POD revised according to RT #7966
379 http://rt.cpan.org/NoAuth/Bug.html?id=7966
381 POD updated explaining why Encode::Unicode always croaks on error
382 rather than giving users choices.
383 http://rt.cpan.org/NoAuth/Bug.html?id=7892
385 2.05 2004/10/19 04:55:01
387 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
388 Message-ID: <41210A84.6060506@iki.fi>
389 Message-ID: <20041018233442.7418113f@r2d2>
390 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
392 2.04 2004/10/16 21:22:44
394 From: craigberry@mac.com
395 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
396 Message-Id: <41716868.7000102@mac.com>
398 2.03 2004/10/06 05:07:20
399 ! lib/Encode/Alias.pm
400 Resolved some alias case sensitivity glitches reported via RT.
401 http://rt.cpan.org/NoAuth/Bug.html?id=7835
403 Resolved Win32 glitches reported via RT.
404 (Fixed by dankogai and tested by Steve Hay)
405 http://rt.cpan.org/Ticket/Display.html?id=7831
406 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
407 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
408 http://rt.cpan.org/NoAuth/Bug.html?id=6695
410 2.02 2004/08/31 10:55:34
411 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
412 New map submitted by Deng Liu and Autrijus. Test data needed
413 to be upgraded as well, done by dankogai
414 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
416 Now works for characters U+10000 and above. This fix was needed
417 to "tidy" the original map that was submitted.
419 "ucmsort" now mentioned in pod
421 2.01 2004/05/25 16:27:14
423 From: domo@computer.org
424 Subject: [PATCH] Correct statistics from enc2xs
425 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
426 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
427 ! lib/Encode/Alias.pm
428 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
429 <200405271148.i4RBm4KY026529@mail.mvnet.de>
431 2.01 2004/05/25 16:27:14
432 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
433 "If someone thinks utf8::upgrade($1) should be croaked like
434 chom?p($1),please try the following patch for Encode.pm."
436 <20040522212704.C068.BQW10602@nifty.com>
438 2.0 2004/05/16 20:55:15
439 * version updated to 2.00
440 -- sorry, no big feature change. I just hate version 1.100 :)
441 ! lib/Encode/Guess.pm
443 addressed UTF-(8|32LE) + BOM misguessing
444 https://rt.cpan.org/Ticket/Display.html?id=6279
446 s/is_utif8/is_utf8/ in POD
447 ! Encode/lib/Encode/CN/HZ.pm
448 Fixes "make test" failure after the patch to pp_hot.c
450 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
452 From: autrijus@autrijus.org
453 Subject: [PATCH] "piconv -C 512" badly broken
454 Message-Id: <1072870210.769.5.camel@localhost>
456 1.99 2003/12/29 02:47:16
458 find_encoding("UTF-16BE")->encode("abc") now null terminates
459 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
461 prototype bug in decode_utf8() fixed
462 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
463 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
464 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
466 Merged from maintperl@21987
468 1.98 2003/08/20 11:15:31
469 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
470 Dave Evans has found and corrected a bug in Encode::MIME::Header.
471 Test suite added by Dan Kogai.
472 Message-Id: <3F43440B.7060606@rudolf.org.uk>
474 Typo fixes rolled back in from bleedperl
475 ! t/at-cn.t t/at-tw.t
476 v-strings, now deprecated in perl 5.8.1, are replaced by sadahiro
477 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
479 argv case nit for VMS by Craig
480 Message-ID: <3F2B02DE.10207@mac.com>
481 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
482 Encode test fixes for VMS by Peter Prymmer
483 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
484 ! lib/Encode/Alias.pm t/Aliases.t
485 koi-8 aliases bug detected and patched by sadahiro.
486 Further fix and test suite by dankogai
487 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
489 1.97 2003/07/08 21:52:14
490 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
491 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
492 $DEBUG replaced with DEBUG() so perl optimizes better,
493 by Rafael with further fixes by dankogai
494 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
495 ! lib/Encode/Alias.pm
496 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
497 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
498 So new hash seeding introduced in bleedperl works.
499 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
500 ! lib/Encode/Guess.pm
501 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
502 automatic utf(8|16|32) guessing -- originally by Autrijus
503 Message-Id: <20030626162731.GA2077@not.autrijus.org>
505 Addressed the following:
506 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
507 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
509 1.96 2003/06/18 09:29:02
510 ! lib/Encode/JP/JP.pm t/guess.t
511 m/(...)/ in void context then $1 is considered a Bad Thing
512 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
514 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
517 More typecast from maintperl@19739
518 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
520 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
521 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
523 Now skips for EBCDIC platform.
524 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
526 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
527 have more than one C<.> in their filenames." -- perlport.
528 Message-Id: <3ED79E01.8050401@mac.com>
530 Found and fixed the bug that -p,--perlqq does not work.
531 Induced by the change from Getopt::Std to Getopt::Long.
533 Addressed [cpan #2629] Wrong assumption in numeric comparison
534 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
535 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
536 lib/Encode/Encoding.pm t/perlio.t
537 ! API Change: ->new_sequence() => ->renew()
538 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
539 + Encode::XS and Encode::utf8 now supports ->renew()
540 + Encode::Encoding now documents this with examples
541 - Non-XS (en|de)code stripped out of Encode::Unicode
542 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
544 1.95 2003/05/21 08:41:11
546 Since bogus entries were found in iso-8859-6, all entries are
547 re-generated once again out of
548 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
549 Thanks to David Graff <graff@unagi.cis.upenn.edu> for the discovery
550 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
551 + lib/Encode/Unicode/UTF7.pm
552 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
553 lib/Encode/Supported.pod
554 UTF-7 support is now added. With this Encode now has all transcoding
555 methods in Unicode::String.
557 1.94 2003/05/10 18:13:59
558 ! lib/Encode/MIME/Header.pm
559 A more sophisticated solution for double-encoding by dankogai
560 ! lib/Encode/MIME/Header.pm AUTHORS
561 Two bugs fixed by Bjoern Jacke
562 * "Double Encoding" was not possible
563 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
564 * encode("MIME-Q") had UTF-8 flag on
565 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
566 ! lib/Encode/MIME/Header.pm AUTHORS
567 Two occurrences of "croak ()" fixed as "croak qq()".
568 Simon Cozens is added to AUTHORS as a result.
569 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
571 POD fixes that reflect enhancements by jhi
573 Two enhancements by jhi.
574 + Now uses Getopt::Long so it accepts long name options
575 (--from for -f, for example)
576 + New option: -r,--resolve
577 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
579 META.yml added upon request of Schwern
580 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
582 Enache Adrian removed upon request -- to live longer than Encode
583 and/or FreeBSD (toy-)?thread :)
584 Message-Id: <20030425015701.GA2069@ratsnest.hole>
586 "close STDOUT unless $^O eq 'freebsd';" once again relocated
587 to keep VMS happy in which case "$^O eq 'freebsd'" is required
588 to keep FreeBSD+thread happy. Sigh.
589 Message-Id: <3EA88ADC.3000300@mac.com>
591 1.93 2003/04/24 17:43:16
593 added "no warnings 'pack'" in for loop to keep bleedperl from
594 complaining "Character in 'C' format wrapped in pack".
596 More elegant perl core detection inspired by Ilya Zakharevich
597 (but further elaborated for general cases).
598 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
601 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
602 are less likely to be confused w/ ISO-8859-*
604 RT tests added (vendor encodings are exemplified)
605 -- that successfully found a flaw on iso-2022-kr before the patch.
606 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
607 decode("ISO-2022-KR") has been buggy but no one ever cited
608 that since no one seems to be using it. Bugs discovered by
610 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
611 ! lib/Encode/CN/HZ.pm t/perlio.t
612 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
613 so it adds test for HZ.
614 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
615 ! lib/Encode/Guess.pm
616 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
617 So long as the string contains \x{00}-\x{ff} it does not fail.
618 See perldoc for details.
619 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
621 1.92 2003/03/31 03:27:27
622 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
623 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
624 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
625 but glibc-BIG5-2.1.2.ucm does.
626 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
627 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
628 Maps regenerated again but this time based upon
629 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
630 (But where is THE DOCUMENT by MICROSOFT?)
631 ! t/enc_module.t AUTHORS
632 failure with threaded Perl on FreeBSD addressed.
633 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
634 Message-Id: <20030322230131.GA813@ratsnest.hole>
635 ! lib/Encode/Guess.pm
638 Change 18989: Make the :bytes conditional on PerlIO.
639 further Modified by Dan Kogai
640 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
642 Change 18966: another fix for failing test on windows ("use encoding"
643 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
644 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
646 Change 18970: Hopefully this works also in Win32, by jhi
647 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
648 Change 18965: fix CJKT.t failures on windows due to incorrect
650 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
652 1.91 2003/03/09 20:07:37
654 even more proofread by jhi.
655 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
658 +use lib qw(t ext/Encode/t ../ext/Encode/t);
659 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
661 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
662 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
665 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
667 1.90 2003/03/09 17:32:43
670 Inaba-san has added a patch for perl 5.8.1 or later that makes
671 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
672 that. POD is further revised.
673 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
674 ! encoding.pm t/enc_module.t
675 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
676 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
680 signed vs. unsigned issue discovered by Craig on OpenVMS
681 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
682 ! encoding.pm AUTHORS
683 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
684 Because binmode() stacks layers instead of overwriting them, you have to
685 ":raw :encoding()" in encoding.pm or you are in trouble when you
686 call encoding.pm multiple times. There are several workarounds
687 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
688 who was the first to address this problem, was added to AUTHORS.
689 A test suite was added for this, which is a modified version
690 of SUGAWARA-san's scripts
691 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
693 1.88 2003/02/20 14:42:34
695 one signedness nit for Encode by jhi
696 <200302161933.h1GJX876018710@kosh.hut.fi>
698 VISCII map was incorrect; fixed by Sadahiro-san
699 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
700 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
701 You can't unlink files that are opened in cygwin but the last
702 file handle opened in t/enc_*.t left open. Patch submitted
703 by Yitzchak and he was added to AUTHORS.
704 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
706 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
707 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
709 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
710 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
712 1.87 2003/02/06 01:52:11
714 * Inaba "Sensei" Hirohito added (I thought I had done so long
715 ago but apparently I did not).
716 * SUZUKI Norio added for various and useful bug reports.
717 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
718 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
719 podchecked so all warnings are gone except for L<http://>.
720 ! encoding.pm t/enc_eucjp.t
721 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
722 * Filter option overhaul
724 ! Encode.pm Encode.xs encengine.c Encode/encode.h
725 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
726 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
727 without filter. podchecked by Dan Kogai.
728 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
729 ! lib/Encode/Alias.pm
730 decode('alias', $1) went wild because of local $_ in find_alias()
731 the evil local $_ is eradicated but that changes find_alias()
732 format for coderef aliasing. See Encode::Alias for details
733 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
735 1.86 2003/01/22 03:29:07
737 * Don't forget to canonize when you attempt an exact match!
738 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
739 * ${^ENCODING} exception is off for $] > 5.008
740 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
742 $] check commented out so it runs on 5.8.0
744 1.85 2003/01/21 22:19:14
746 ${^ENCODING} exception is now explicit rather than handled by regex.
747 + t/enc_eucjp.t t/enc_utf8.t
748 Test suite for the better "encoding" pragma support for bleedperl.
749 On 5.8.0, they will just be skipped.
751 1.84 2003/01/10 12:00:16
753 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
754 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
755 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
757 %_ fixes saves the resulting .so .05% smaller, by NC
758 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
760 Silence Encode on undef, by Andreas
761 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
762 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
764 s/regognised/recognised/ . British spelling left intact to pay
765 respect to two British Nicks :)
766 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
768 1.83 2002/11/18 17:28:49
769 ! Encode.xs lib/Encode/JIS7.pm
770 Even more patches from Inaba-san have been applied. With this
771 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
772 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
774 1.82 2002/11/14 23:06:12
776 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
777 later found in perls w/ -Dusemymalloc was fixed by NC.
778 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
780 1.81 2002/11/08 18:29:27
781 ! Encode.pm Encode.xs
782 Non-XS version of Encode::utf8 is back (with XS being default).
783 Encode::predefine_encodings(0) to turn off XS.
784 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
785 found that it was Test::More causing the problem, not Encode.
786 But I have already made it configurable so it may be useful in
788 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
790 The ingenious patch by Nicholas Clark that reduces shlib sizes by
791 50% with no penalty and backward compatibility preserved, is in.
792 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
794 1.80 2002/10/21 20:39:09
795 ! Encode.xs t/mime-header.t
796 Even more patches from NI-XS regarding Encode::utf8->decode().
797 And one more test to t/mime-header.t to prove it
798 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
800 1.79 2002/10/21 06:05:37
802 Further patches from NI-XS. Encode::utf8->decode() now checks the
803 value of utf8 flag of the argument. As a result, the fix to
804 lib/Encode/MIME/Header.pm is no longer necessary but since it did
805 no harm (even speedwise) I'll leave it unreverted.
806 ! ucm/cp949.ucm ucm/cp950.ucm
808 U+00AE REGISTERED SIGN
809 were missing as a result of 1.78. Discovered by Moriyama-san.
810 Moriyama-san has also developed a test script that compares
811 (en|de)coded results to the corresponding Win32 API result and
812 all cp9?? maps are now verified.
813 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
815 1.78 2002/10/20 15:44:00
816 ! lib/Encode/MIME/Header.pm
817 fixed so that it works with new Encode::utf8
818 ! Encode.pm Encode.xs
819 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
820 :encoding(UTF-8) to handle partial chars at end of buffers
822 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
823 ! lib/Encode/Supported.pod
824 More nitpickings applied.
827 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
828 tests were completely useless. To redeem that and get the peace of
829 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
830 ucm/*.ucm Since this script takes too long to finish (30 seconds on
831 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
832 but you can easily run that by either renaming it or:
834 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
835 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
836 Other CJKT cp9?? also updated according to the URI below;
837 http://www.microsoft.com/typography/unicode/cscp.htm
838 + bin/ucmsort MANIFEST
839 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
840 to proper order. Intended for hardcore developers only.
841 ! ucm/cp932.ucm JP/JP.pm AUTHORS
842 CP932 mapping which was based upon the mapping file at unicode.org
843 was found obsolete by MORIYAMA Masayuki <msyk@mtg.biglobe.ne.jp>. He
844 has also supplied the patch so he was added to AUTHORS.
845 ! lib/Encode/Supported.pod
846 ISO-8859-11 != TIS 620
847 == TIS 620 + \xA0 ( )
849 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
851 1.77 2002/10/06 03:27:02
853 * Modified to accommodate the upcoming patch by Inaba-san that
854 will fix tr/// needing eval qq{}
855 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
857 * pod fixes/enhancements to reflect the changes above
858 ! lib/Encode/Alias.pm
859 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
860 Message-Id: <20021001015648.GB18710@not.autrijus.org>
862 1.76 2002/08/25 15:09:51
864 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
865 but naturally identical to previous version -- dankogai
867 Codepoint fixes -- autrijus
868 Message-Id: <20020805040236.GC5220@not.autrijus.org>
870 copied everything under perl-5.8.0/ext/Encode to make sure Encode
871 is in sync w/ perl core
873 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
874 Now that binmode(FH) does implicit ":bytes" revisit
875 the failing tests. The worrisome one is the Digest::MD5
876 test-- how will it fare in CRLF lands now?
878 From: Radu Greab <radu@netsoft.ro>
879 Date: Mon, 10 Jun 2002 00:40:34 +0300
880 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
881 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
882 ones which I cannot figure out.
883 ! lib/Encode/Alias.pm
884 Subject: [Encode PATCH] spurious warning
885 From: Nicholas Clark <nick@unfortu.net>
886 Date: Sun, 2 Jun 2002 20:26:22 +0100
887 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
889 1.75 2002/06/01 18:07:49
890 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
891 glibc compliance cited by Autrijus.
892 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
893 ! bin/enc2xs bin/piconv
894 Subject: Re: forewarning: usedevel and versiononly
895 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
897 1.74 2002/05/28 18:33:15
898 + ucm/null.ucm ucm/ctrl.ucm
899 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
900 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
901 Subject: Re: unicode -> &# notation
902 Message-ID: <20020518193704.GB40272@not.autrijus.org>
904 1.73 2002/05/28 17:26:18
905 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
906 Chris Nandor has fixed Encode so that it works w/ MacPerl --
907 at least w/ PPC (68k need static linking which does not work due to
908 64k limit). pudge is added to AUTHORS (I'm surprised he was not
909 there in the list). Encode/Makefile_PL.e2x was additionally fixed
910 by dankogai to reflect changes in other Makefile.PL
911 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
913 Subject: Change 16746: -Mutf8 cleanup.
914 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
916 1.72 2002/05/20 15:49:56
918 Subject: [PATCH] Encode should be in perl-core library path
919 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
920 Message-Id: <20020520161201.A11019@alpha.hut.fi>
921 ! lib/Encode/MIME/Header.pm
922 Subject: [PATCH] Encode::MIME::Header
923 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
924 ! Encode/Makefile_PL.e2x
925 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
926 Message-Id: <20020519201031.GA1603@not.autrijus.org>
927 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
928 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
930 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
931 Subject: [PATCH] good day for WinCE port of perl.
932 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
935 Even more linting by Robin via @16532
937 Even more typecast by Sarathy in @16460
939 1.71 2002/05/07 16:22:42
941 even more typecasts by Robin
942 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
944 A very strange bug that was causing bogus ucm -> C table
945 generation, revealed by a UCM file that Andreas was
946 working on. This is the king of weirdest bugs I've encountered
947 in the course of Encode maintenance.
948 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
950 1.70 2002/05/06 10:26:48
952 Made more 'module-safe' in conjunction w/ 'no encoding'.
953 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
954 ! lib/Encode/Encoding.pm
955 'require Encode' because ->Define uses Encode::define_encoding();
956 problem and solution addressed by Miyagawa-kun
957 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
959 Cuts the frill to make djgpp happier, as suggested by Laszlo
960 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
962 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
963 Message-Id: <m3bsbug48n.fsf@anima.de>
965 1.69 2002/05/04 16:41:18
966 ! lib/Encode/MIME/Header
967 Floating-point coerced for UNICOS (in integer arithmetic it folds
968 line one character too early). Verification by Mark is pending.
969 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
971 more doc patch from Elizabeth
972 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
973 ! Encode/Makefile_PL.e2x
974 More platform-independent patch from Benjamin
975 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
976 ! lib/Encode/Guess AUTHORS
977 split regex fix by Graham Barr. Adds him to AUTHORS.
978 Message-Id: <20020504085419.E95940@valueclick.com>
979 ! Encode/Makefile_PL.e2x
980 enc2xs script discovery made smarter and more sensible, first cited
981 by Miyagawa-kun and further suggestions by Rafael and Andreas
982 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
983 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
985 1.68 2002/05/03 12:20:13
986 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
987 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
988 and Supported.pod modified to reflect the change. Elizabeth added
989 to AUTHORS. And H.M. is also added for forwarding her patch among
990 other contributions (I was rather surprised to find his name was not
992 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
994 1.67 2002/05/02 07:33:09
996 Error message now consistent w/ perlqq (\N{U+} -> \x{})
997 done in perl@16308 but Philip linted me further. Now the error
998 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
999 ! lib/Encode/Guess.pm
1000 Sanity check for happier -w by Autrijus
1002 1.66 2002/05/01 05:41:06
1003 ! Encode.xs t/fallback.t
1004 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
1005 while fallback is in effect. This even came with a welcome side-effect
1006 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
1007 also modified to test this.
1008 And of course, the corresponding variables to UV[Xx]f are appropriately
1009 cast. This should've concluded NI-XS homework.
1011 encode(undef) does warn again! Repented upon suggestion by NI-XS.
1012 Document for unless vs. '' added
1013 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
1015 1.65 2002/04/30 16:13:37
1017 encode(undef) no longer warns for C<Use of uninitialized value in
1018 subroutine entry>. Suggested by Paul.
1019 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
1020 ! lib/Encode/Supported.pod
1021 Encode::MIME::Header and Encode::Guess mentioned
1022 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
1023 ! lib/Encode/Guess.pm
1024 POD fix by Miyagawa-kun
1025 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
1027 1.64 2002/04/29 06:54:06
1029 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
1030 encode("euc-jisx0213") are still DIFFERENT.
1031 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
1033 A few white spaces corrected by NI-XS via PerlIO integration to
1035 Subject: Change 16247: Integrate perlio;
1037 Document fixes by Andreas
1038 Message-Id: <m3k7qsf1we.fsf@anima.de>
1040 1.63 2002/04/27 18:59:50
1041 ! lib/Encode/Encoding.pm
1042 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
1043 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
1044 Make use of the Encode::Encoding base class!
1045 And other cleanups in Encode.xs upon NI-XS suggestions
1046 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
1048 1.62 2002/04/27 11:17:39
1050 encodings() now just checks %ExtModule instead of eval{require}
1051 all of them for ":all" to conserve more memory.
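As a rough illustration of the entry above, the sketch below compares the
default listing with the ":all" listing. It assumes a current Encode
installation; the variable names are only for illustration.

```perl
use strict;
use warnings;
use Encode;

# Encodings whose modules are already loaded in this process.
my @loaded = Encode->encodings();

# Every encoding Encode knows about, whether or not its module is loaded.
my @all = Encode->encodings(":all");

printf "%d loaded, %d known\n", scalar @loaded, scalar @all;
```

The ":all" list is normally the longer one because it includes encodings
whose modules have not been required yet.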
1053 more "%x" -> "%" UVxf stuff.
1055 s/=over2/=over 2/g # oops.
1057 1.61 2002/04/26 03:02:04
1059 Now does decent tests besides use_ok()
1060 ! lib/Encode/Guess.pm t/guess.t
1061 UI streamlined, document added
1062 ! Unicode/Unicode.xs
1063 various signed/unsigned mismatch nits (#16173)
1064 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
1066 POD: utf8-flag-related caveats added. A few sections completely
1070 Thou shalt not assume %d works, either!
1071 Robin Baker added to AUTHORS for this
1072 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
1074 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
1076 1.60 2002/04/24 20:06:52
1078 "Thou shalt not assume %x works." -- jhi
1079 Message-Id: <20020424210618.E24347@alpha.hut.fi>
1080 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
1081 low-memory build machines happy, now *.c is created for each *.ucm
1082 (no table aggregation). You can still override this by setting
1083 $ENV{AGGREGATE_TABLES}.
1084 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
1085 + lib/Encode/Guess.pm
1086 + lib/Encode/JP/JIS7.pm
1087 Encoding-autodetect (mainly for Japanese encoding) added. In a
1088 course of development, JIS7.pm was improved.
1089 + lib/Encode/HTML/Header.pm
1090 + lib/Encode/Config.pm
1091 MIME B/Q Header Encoding Added!
1092 ! Encode.pm Encode.xs t/fallback.t
1093 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
1094 Message-Id: <20020424130709.GA14211@tanglefoot>
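A minimal sketch of the XMLCREF and HTMLCREF fallbacks mentioned above,
assuming a current Encode installation. The sample character (U+3042) is
chosen only for illustration.

```perl
use strict;
use warnings;
use Encode qw(encode);

# U+3042 (HIRAGANA LETTER A) has no mapping in ascii, so the fallback
# passed as the third argument decides what is emitted in its place.
my $html = encode("ascii", "\x{3042}", Encode::FB_HTMLCREF());  # decimal reference
my $xml  = encode("ascii", "\x{3042}", Encode::FB_XMLCREF());   # hexadecimal reference

print "$html\n";   # &#12354;
print "$xml\n";    # &#x3042;
```

Both fallbacks keep the output pure ASCII while preserving the original
code point as a character reference.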
1096 1.59 2002/04/22 23:54:22
1097 ! Encode.pm Encode.xs
1098 needs_lines() and perlio_ok() are added to Internal encodings such
1099 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
1101 1.58 2002/04/22 23:54:22
1103 s/MacChineseSimp/MacChineseTrad/ # ... oops.
1108 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
1109 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
1110 t/(JP|KR).t are aggregated to t/CJKT.t
1111 test data are all remade via bin/ucm2text.
1112 And .... They are no longer skipped for -Uuseperlio !
1114 1.57 2002/04/22 20:27:30
1115 ! t/JP.t t/KR.t t/perlio.t
1116 unless (find PerlIO::Layer 'perlio') ... line is back again.
1117 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
1118 need PerlIO. Sigh....
1119 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
1120 ->perlio_ok now does eval{ require PerlIO::encoding } there so
1121 it correctly returns 1 when PerlIO::encoding is yet loaded.
1123 perl-current patch #16072 reflected
1125 1.56 2002/04/22 09:48:07
1126 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
1127 New PerlIO::encoding 0.04 compliance met
1129 1.55 2002/04/22 03:43:05
1130 ! Encode.pm Encode.xs Unicode/Unicode.pm
1131 needs_lines() defined so Encode::Encoding is no longer needed
1134 1.54 2002/04/22 02:50:01
1135 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
1136 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
1137 now perlio_ok is true by default if PerlIO::encoding->VERSION is
1138 0.03 or larger. POD in Encode::Encoding revised to reflect this.
1139 Encode::XS and Encode::Unicode now have perlio_ok() method.
1140 ! lib/Encode/Supported.pod
1141 s/UP-UX/HP-UX/ by jhi
1142 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
1143 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
1144 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
1145 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
1146 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
1147 ! lib/Encode/Supported.pod
1148 Huge document fixes by Philip.
1151 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
1152 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
1154 binmode() after "<:encoding" to make Win32 happy, by Mattia.
1155 Mattia added to AUTHORS file
1156 Message-Id: <3CC3150F.5798.22A05AE@localhost>
1158 1.52 2002/04/20 23:43:47
1160 TODO: is now SKIP:, as NI-XS requested. Also, more
1161 elaborate failure analysis added.
1163 A note on how to make sure of round-trip safety added to POD
1164 section (so Autrijus is happier)
1165 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
1166 big5-(eten|hkscs) is round-trip safe again!
1167 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
1169 Typo fixes by Andreas
1170 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
1171 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
1172 PerlIO coordination patches from NI-XS.
1173 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
1175 1.51 2002/04/20 09:58:23
1177 Updated test suite by Autrijus so "make test" is happy again
1178 Message-Id: <20020420082104.GA25037@not.autrijus.org>
1180 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
1182 TW/TW.pm TW/Makefile.PL
1183 Updates by Autrijus. 'big5' is no longer a canonical but an
1184 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
1185 Message-Id: <20020419195346.GA19597@not.autrijus.org>
1187 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
1188 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
1190 PerlIO detection a little bit smarter; no longer uses eval qq{}
1193 1.50 2002/04/19 06:13:02
1194 ! Encode.pm Encode.xs Encode/encoding.h
1196 New Fallback API implemented and documented. See "perldoc Encode"
1198 ! lib/Encode/JP/JIS7.pm Encode.pm
1199 + lib/Encode/PerlIO.pod t/perlio.t
1200 API compliance met. However, it still does not work unless perlio
1201 implements line buffer. See BUGS section in perldoc Encode::PerlIO
1202 As a sensible workaround, perlio_ok() added to Encode.
1204 ! lib/Encode/Supported.pod
1206 Message-Id: <20020418174647.J8466@alpha.hut.fi>
1208 Doc fixes from Autrijus
1209 Message-Id: <20020418144131.GA10987@not.autrijus.org>
1211 perlqq mode documented
1213 + t/jisx0201.euc t/jisx0201.ref
1214 ! t/jisx0208.euc t/jisx0208.ref
1215 t/JP.t tests more rigorously and with other encodings
1216 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
1217 in jis0208 (used to contain jisx0201 part).
1218 ! Encode/Makefile_PL.e2x
1219 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
1220 enc2xs and encode.h rather than hard-coded. This makes the resulting
1221 module fully CPANizable.
1222 ! encoding.pm t/JP.t t/KR.t
1223 PerlIO detection simplified (checks %INC instead of eval{})
1224 ! Encode.xs Encode/encode.h
1225 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
1226 - lib/Encode/Unicode.pm
1227 (en|de)code_xs relocated to where it belongs. Source reindented
1230 Additional (U8 *) cast added as suggested by jhi
1231 Message-Id: <20020417165916.A28599@alpha.hut.fi>
1233 1.42 Date: 2002/04/17
1235 no-op module; Thought of adding a pod there but enc2xs has
1240 correct mechanism to detect PerlIO::encoding layer installed.
1242 PerlIO Layer detached.
1244 1.41 2002/04/16 23:35:00
1246 binmode(STDIN|STDOUT ...) done iff PerlIO is available
1248 Cleaned up PerlIO skip conditions to prepare for the upcoming
1249 Encode - PerlIO forking.
1251 exported functions are now prototyped.
1252 ! lib/Encode/CN/HZ.pm
1255 fallback implemented # was /* FIXME */
1256 affected programs revised to fit (only HZ was using the try-catch
1257 approach which needed to be fixed for API-compliance).
1259 ! Encode/KR/2022_KR.pm
1261 can find =head1 NAME now, jhi
1262 Message-Id: <20020416083059.V30639@alpha.hut.fi>
1266 now compiles with fewer warnings with the pickiest compilers.
1267 Suggested by Craig, fixed by Dan.
1268 ! Encode/Makefile_PL.e2x
1270 A bug that fails to find *.e2x in certain conditions fixed
1272 1.40 2002/04/14 22:27:14
1273 + Encode/ConfigLocal_PM.e2x
1274 ! lib/Encode/Config.pm
1276 "enc2xs -C" now generates/updates Encode::ConfigLocal.
1277 ConfigLocal_PM.e2x is a skeleton thereof.
1278 ! lib/Encode/Config.pm
1280 "use Encode::CN::HZ;" was missing.
1283 More rigorous tests added to test XS, especially on memory allocation.
1285 ! lib/Encode/Unicode.pm
1286 NI-S implemented an XS version -- merged
1287 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
1290 Source filter option added. With this option on, you can write
1291 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
1292 encodings. t/jperl.t enhanced to test this feature.
1294 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
1295 apply his suggestion, this degree of nitting is enough to add him
1297 Message-Id: <3CB93223.291E5E2E@earthlink.net>
1299 + lib/Encode/JP/JIS7.pm
1300 - lib/Encode/JP/JIS.pm
1301 - lib/Encode/JP/2022_JP.pm
1302 - lib/Encode/JP/2022_JP1.pm
1303 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
1304 JIS7.pm for better maintainability and performance
1306 Added caveat for non-ascii identifiers.
1308 fixes by jhi, the original author of this pragmatic module.
1309 Message-Id: <20020413231527.V1826@alpha.hut.fi>
1311 1.34 2002/04/12 20:23:05 (Unreleased)
1314 EBCDIC fixes addressed by jhi.
1315 Message-Id: <20020412161844.D9383@alpha.hut.fi>
1316 ! lib/Encode/Encoder.pm
1317 POD fix by Miyagawa-kun
1318 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
1320 1.33 2002/04/10 22:28:40
1322 Philip's mail address corrected.
1325 ! lib/Encode/Encoder.pm
1326 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
1327 This adds Miyagawa-kun to AUTHORS.
1328 * encoding() no longer exported by default but on demand
1329 * t/Encoder.t updated to test all these
1330 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
1331 ! lib/Encode/Unicode.pm
1332 ! lib/Encode/Supported.pm
1333 Further doc fixes by Anton
1335 1.32 2002/04/09 20:06:15
1338 - ucm/macDevanaga.ucm Unicode Character Map
1339 - ucm/macGujarati.ucm Unicode Character Map
1340 - ucm/macGurmukhi.ucm Unicode Character Map
1341 A utility to check integrity of .ucm files. t/bogus.ucm is a
1342 ucm that is deliberately bogus. unused Indic mappings are removed
1345 resolve_alias() added as suggested by jhi. Same as
1346 find_encoding("alias")->name. For convenience. This one is
1347 defined in Encode.pm instead of Alias.pm.
1348 Message-Id: <20020409215846.H17022@alpha.hut.fi>
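A short sketch of the equivalence described above, assuming a current
Encode installation. "latin1" and "no-such-encoding" are sample inputs
picked for illustration.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# resolve_alias() returns the canonical name for an alias ...
my $canon = Encode::resolve_alias("latin1");

# ... which matches what find_encoding(...)->name reports.
my $same = find_encoding("latin1")->name;

# Unknown names yield a false value instead of dying.
my $none = Encode::resolve_alias("no-such-encoding");

print "$canon\n";   # iso-8859-1
```

Checking the return value for truth is a cheap way to validate a
user-supplied encoding name before calling encode() or decode().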
1350 Memory allocation bug detected during the devel of ucmlint -- fixed.
1351 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1352 ! lib/Encode/Unicode.pm
1353 valid_ucs2(0) is false but must be true.
1354 3 patches from NI-S as follows. This also has fixed the incident
1356 ! lib/Encode/Alias.pm
1357 find_alias() recursion prevention
1359 Checks for the patch above
1360 ! t/Encode/Unicode.pm
1361 An extra "F" that causes valid_ucs2() return a bogus value fixed
1362 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1363 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1364 2 Small Patches from jhi as follows:
1366 Encode->encodings() lists in case-insensitive order (as it was)
1368 -l option prints available encodings to STDOUT instead of STDERR
1369 ! lib/Encode/Aliases.pm
1370 s/defintion/definition/
1371 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1373 ! lib/Encode/Supported.pod
1374 ! lib/Encode/Unicode.pm
1375 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1376 Thank you for the exact quote of Douglas Adams :)
1377 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1379 1.31 2002/04/08 18:08:07
1380 ! lib/Encode/Encoder.pm
1382 Encode::Encoder, once just a placeholder of an idea, is now much more
1383 practical. See t/Encode.t to find how practical it can be.
1384 + lib/Encode/Config.pm
1386 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1387 in Encode::Config and they are all literally, not programmatically
1388 set. My false laziness was resulting in many encodings missing from
1390 ! lib/Encode/Unicode.pm
1392 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1393 so that it does not rely on Encode::Unicode for BOM values
1394 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1396 1.30 2002/04/08 02:34:51
1397 + lib/Encode/Encoder.pm
1398 Object Oriented Encoder. I reckon something like this is in need.
1401 ! lib/Encode/Supported.pod
1402 * autoloading bug that prevented upper-case canonicals such as UTF-16
1403 is fixed. Now even UTF/UCS are autoloaded!
1404 * encodings() is now more intuitive.
1405 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1407 * Obligatory fixes to the POD.
1408 ! lib/Encode/Supported.pod
1409 Patch from Anton applied.
1410 Message-Id: <66641479.20020408033300@motor.ru>
1412 ! lib/Encode/Unicode.pm
1413 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1415 1.28 2002/04/07 18:58:42
1419 Just a MANIFEST for those missing files.
1421 1.26 Date: 2002/04/07 15:22:04
1424 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
1425 And t/Aliases.t fixed to make sure they all exist.
1427 1.25 2002/04/07 15:01:25 (Unreleased)
1429 ! lib/Encode/Unicode.pm
1432 - lib/Encode/UTF_EBCDIC.pm
1433 - lib/Encode/Internal.pm
1434 - lib/Encode/utf8.pm
1435 Integrated into Encode.pm as closures. That way "one package, one file"
1436 rule is preserved yet fewer files to require.
1438 commented out binmode(STDERR ...
1442 ! EBCDIC/Makefile.PL
1445 ! Symbol/Makefile.PL
1447 ! Encode/Makefile_PL.e2x
1448 Schwern's MM-compliance patch merged
1449 Message-Id: <20020406082609.GA28758@blackrider>
1451 ! lib/Encode/Unicode.pm
1452 + lib/Encode/UTF_EBCDIC.pm
1454 - lib/Encode/10646_1.pm
1455 - lib/Encode/ucs2_le.pm
1456 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1457 per-module based (en|de)code, I saved a number of .pm by
1458 reorganizing it as per-object base (Well, this is what Encode::XS
1459 does under the hood). See Encode::Unicode for details.
1460 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1461 This module is used only on EBCDIC environments.
1463 1.21 2002/04/05 14:46:34 (Not Released)
1469 Are back to make Perl/Tk happy Smile, NI-S.
1471 ! lib/Encode/Alias.pm
1472 ! lib/Encode/Supported.pm
1473 ! lib/Encode/10646_1.pm
1474 ! lib/Encode/ucs2_le.pm
1475 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1476 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1477 updated to reflect changes.
1478 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1479 ! lib/Encode/Alias.pm
1480 ! lib/Encode/Supported.pm
1481 Anton's revision committed. Added Dan's own fixes as well.
1482 Message-Id: <159103166906.20020405161134@motor.ru>
1483 ! lib/Encode/Alias.pm
1485 < qr/^UCS2-le$/i => '"UCS-2"', );
1487 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1488 Sigh. Thank you, Anton.
1489 Message-Id: <14567692196.20020405062020@motor.ru>
1490 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1492 1.20 2002/04/04 19:50:52
1494 the last minute addition. Just give it a try. Docs remain to be done.
1495 Not installed by default.
1496 ! lib/Encode/Supported.pod
1499 ! lib/Encode/Alias.pm
1500 ! lib/Encode/utf8.pm
1501 ! lib/Encode/10464_1.pm
1502 ! lib/Encode/ucs2_le.pm
1503 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
1504 unchanged but UTF-16BE is added as an alias. Implicit aliases
1505 move to Encode::Alias so init_alias() works more as expected.
1506 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1507 Though pedantically wrong, this should make perl mongers happier.
1508 t/Alias.t is enhanced to test all these.
1509 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1511 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1512 of the codepoints are common. CJKT left as is because the saving is
1516 ! EBCDIC/Makefile.PL
1518 ! Encode/Makefile_PL.e2x
1522 ! Symbol/Makefile.PL
1526 All occurrences of _def.h replaced with .exh so djgpp works happily
1527 ever after! To credit this amazing discovery, Laszlo is now in
1529 Message-Id: <20020403181424.GA8778@freemail.hu>
1530 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1533 ! Encode/Makefile_PL.skel
1535 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1536 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1538 Two more tests added by jhi
1539 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1542 The showstopper fixed -- Memory reallocation bug was causing
1543 Encode::XS to fall into infinite loop on certain conditions.
1544 t/grow.t tests that.
1545 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1550 ! lib/Encode/Supported.pod
1551 Vendor encodings rebuilt out of original map files at unicode.org.
1552 Indic languages such as MacDevanagari remain unsupported due to the
1553 shortcoming of encengine capabilities (they need algorithmic
1554 conversion and I have no knowledge on that!). Pods fixed for added
1556 Oh, macJapan.ucm renamed to macJapanese.ucm.
1557 macROMnn is macRomanian and macRUMnn is macRumanian.
1558 txt2ucm is a crude script that is used to convert them.
1560 Unicode Compound Characters (used extensively on Mac) supported
1562 Typo fixes and improvements by jhi
1563 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1565 1.11 2002/03/31 22:12:13
1569 Missing files from the MANIFEST fixed.
1570 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1571 Version incremented just to make CPAN happy.
1573 1.10 2002/03/31 21:32:42
1576 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1577 if you want. This should make Autrijus happy. Also, piconv
1578 is added to default install.
1581 Here-documented files that enc2xs generates are now exported
1582 to *.e2x. Much cleaner and easier to debug.
1584 encoding enhanced so you can make it act more like such
1585 (now prehistoric) "localized" variations of perl like Jperl.
1587 Further test for encoding.pm. Written in euc-jp
1590 Taken over from jhi.
1591 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1596 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1603 ! Encode/macIceland.ucm
1604 ! lib/Encode/Alias.pm
1605 ! lib/Encode/Supported.pod
1606 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1607 Message-Id: <m3lmcavhjt.fsf@anima.de>
1609 1.01 2002/03/29 20:59:39
1612 s/USE_SCRIPTS/MORE_SCRIPTS/
1614 installs enc2xs by default for external Encode:: modules in CPAN,
1615 such as Encode::HanExtra
1617 More sensible perl core detection via $ENV{PERL_CORE}
1619 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1621 Perl core detection via $^X =~ m/\bminiperl$/o
1622 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1624 1.00 Wed Mar 29 2002
1626 The version of all files is updated to 1.00 via "ci -f -l1.00",
1627 commemorating version 1.00. All files, including *.ucm are now
1628 under version control.
1631 encode.h moved to Encode/ so it will be installed for the later
1634 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1639 compile renamed to enc2xs.
1640 Affected Makefile.PL updated
1642 "Punt it. HanExtra can take care of that later." -- Autrijus
1643 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1646 ! Encode/ksc5601.ucm
1647 ! lib/Encode/CJKConstants.pm
1648 ! lib/Encode/KR/2022_KR.pm
1649 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1650 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1654 bin/ added for example scripts. They are not installed by default.
1655 to install them, "perl Makefile.PL USE_SCRIPTS".
1656 piconv is iconv reinvented in perl. In addition to all features
1657 of iconv, it also adds perlish features. See L<piconv/1> for more
1659 ! lib/Encode/Alias.pm
1660 qr/^ replaced with qr/\b so it directly matches locale names
1661 such as en_US.US-ASCII
1664 Patch by MJD to fix the following problem applied.
1665 Subject: [PATCH 5.7.3 Encode]
1666 Aliases.t not properly skipped when Encode extension not built
1667 Message-Id: <20020328091850.18677.qmail@plover.com>
1668 ! lib/Encode/KR/2022_KR.pm
1669 ! lib/Encode/CJKConstants.pm
1670 Another patch from Jungshik to make iso-2022-kr actually work
1671 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1672 ! Encode/Encode/euc-kr.ucm
1673 + Encode/Encode/johab.ucm
1674 ! Encode/Encode/ksc5601.ucm
1676 ! Encode/KR/Makefile.PL
1677 ! Encode/lib/Encode/Alias.pm
1679 Johab support and complete revision of Korean Encoding by Jungshik
1680 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1682 Revised to make up with now-dropped Encode::Details.
1683 - lib/Encode/Details.pod
1684 Dropped. Besides being obsolete, the topics are now covered in
1685 respective pods now.
1690 Korean aliases fixed thanks to Jungshik Shin
1691 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1692 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1694 =head1 NAME added to all modules to make buildtoc happy
1695 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1696 - lib/Encode/CJKguide.pod
1697 Too controversial and dropped from the dist. Will be available
1698 separately on the web.
1700 RCS tags added so table debugging gets easier (should that be
1701 needed! I hope they all stay 1.00!)
1702 + lib/Encode/CJKguide.pod
1703 A detailed guide to mainly, but not limited to, CJK multibyte
1706 + Encode/hp-roman8.ucm
1708 ! Encode/Supported.pod
1709 All occurrences of "roman8" replaced with "hp-roman8" to avoid
1711 ! Encode/Supported.pod
1714 Mac Encodings now comply with the Inside Macintosh
1716 Test for '-raw' conventions added.
1718 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1719 ! Encode/gb12345.ucm
1721 ! Encode/ksc5601.ucm
1722 "-raw" appended to canonical names.
1723 File names stay unchanged thanks to UCM format.
1724 ! lib/Encode/CN/HZ.pm
1725 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1726 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1728 0.99 Tue Mar 26 2002
1729 - lib/Encode/JP/Const.pm
1730 + lib/Encode/CJKConstants.pm
1731 + lib/Encode/CN/2022_CN.pm
1732 + lib/Encode/KR/2022_KR.pm
1741 * Support for ISO-2022-KR and ISO-2022-CN added.
1743 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1744 * ucm2table autogenerates character table out of UCM files.
1747 - lib/Encode/Supports.pod
1748 + lib/Encode/Supported.pod
1749 Names reverted due to popular demand.
1750 8.3 rule applies only when there is a conflict.
1751 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1756 - lib/Encode/Format/Enc.pod
1758 * Character tables are now 100% ucm.
1759 * All files under Encode/ are now 8.3-compliant
1760 * some of missing encodings added (i.e. gsm0338 and nextstep)
1761 * Vendor mappings aggregated with appropriate national std in
1762 Makefile.PL, resulting in smaller *.so, especially for CJK.
1763 The following is the result on Dan's FreeBSD box.
1765 ---------------------------------------------------------------
1766 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1767 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1768 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1769 blib/arch/auto/Encode/Encode.so 27,791 27,791
1770 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1771 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1772 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1773 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1774 ---------------------------------------------------------------
1775 Total 5,375,297 6,343,819
1777 * As a result of ucm-transition, Encode::Tcl dropped because
1778 Encode::Tcl demands *.enc.
1779 Encode::Tcl will be supplied in a separate tarball with *.enc.
1780 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1785 -lib/Encode/Supported.pod
1786 +lib/Encode/Supports.pod
1787 -lib/Encode/iso10646_1.pm
1788 +lib/Encode/10646_1.pm
1789 -lib/Encode/EncFormat.pod
1790 +lib/Encode/Format/Enc.pod
1791 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
1792 - lib/Encode/JP/Constants.pm
1793 + lib/Encode/JP/Consts.pm
1794 ! lib/Encode/JP/JIS.pm
1795 ! lib/Encode/JP/H2Z.pm
1796 Version nit problem and 8.3 rule fix.
1797 > Package namespace installed latest in CPAN file
1798 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1799 was noted by jhi; Dan then discovered that "Constants.pm" does not comply with the 8.3
1800 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
1801 accordingly. In addition, legacy "use vars qw()..." are replaced with
1803 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1804 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1806 - lib/Encode/JP/ISO_2022_JP.pm
1807 - lib/Encode/JP/ISO_2022_JP_1.pm
1808 + lib/Encode/JP/2022_JP.pm
1809 + lib/Encode/JP/2022_JP1.pm
1811 8.3 naming conflict for vanilla FAT addressed by jhi
1812 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1815 Typecast fix addressed by jhi
1816 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1818 0.98 Mon Mar 25 2002
1819 ! lib/Encode/Supported.pod
1821 + lib/Encode/JP/ISO_2022_JP_1.pm
1822 ! lib/Encode/JP/ISO_2022_JP.pm
1823 ! lib/Encode/JP/JIS.pm
1825 Now Encode::JP is more strict on the difference between ISO-2022-JP
1826 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1827 makes Anton happier :) FYI the previous version implements
1828 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1829 ! lib/Encode/Supported.pod
1832 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1833 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1838 ! lib/Encode/Supported.pod
1839 pod fixes to replace F<http://...> with L<http://...>,
1840 as suggested by Autrijus in:
1841 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1842 ! lib/Encode/Supported.pod
1843 fixes and enhancements by Anton
1844 Message-Id: <10632060120.20020324103753@motor.ru>
1845 ! lib/Encode/Alias.pm
1846 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1847 added. Suggested by Anton then deobfuscated by Autrijus
1848 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1850 Further fix by Nicholas Clark
1851 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1852 - lib/EncodeFormat.pod
1853 + lib/Encode/EncFormat.pod
1855 File renamed as suggested by Autrijus
1857 ! lib/Encode/Details.pod
1858 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1859 ! Encode.pm Sun Mar 24 13:43:47 2002
1860 pod fixes by Autrijus.
1861 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1862 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1864 ! lib/Encode/Alias.pm
1866 now more EBCDIC conscious;
1867 %ExtModules on EBCDIC system excludes CJK so that you don't
1868 have to worry about a matched alias causing a croak.
1869 t/Alias.t also revised to reflect changes. Verified by jhi
1870 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1872 0.97 Sun Mar 24 2002
1876 EBCDIC detection mechanism installed as in JP/JP.pm
1877 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1880 ! EBCDIC/Makefile.PL
1883 ! Symbol/Makefile.PL
1885 Now all table files used by compile are postfixed '_t' to avoid
1886 namespace collisions in case-insensitive file systems once and for all!
1888 Message-ID: <58290227735.20020323195659@familiehaase.de>
1890 Since the Encode::JP is unsupported under EBCDIC we
1891 cannot run this test (aliases as such should work fine) -- jhi
1892 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1894 duplicate occurrence of ascii.ucm and 8859-1.ucm
1895 causes MacOS X dyld to croak
1901 < chdir 't' if -d 't';
1903 > if (! -d 'blib' and -d 't'){ chdir 't' };
1904 When you are "make test"-ing in the Encode/ directory, you must not
1905 change $ENV{PWD}. t/JP.t had been fixed before, but the others somehow
1906 remained unchanged. Also the situation detection was made simpler
1907 in t/JP.t, which was originally:
1908 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1910 "Use of uninitialized value in string eq at Encode.pm line 96."
1911 ! Symbol/Makefile.PL
1912 ! EBCDIC/Makefile.PL
1914 -- Problem on case-insensitive file systems
1915 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1916 Message-ID: <88254111953.20020323095503@familiehaase.de>
1919 "So I think it's a bug in gcc, not perl. But it still needs to be
1921 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1922 Message-Id: <20020323170509.C96475@plum.flirble.org>
1924 0.96 Sat Mar 23 2002
1926 ! lib/Encode/Encoding.pm
1927 ! lib/Encode/Alias.pm
1928 ! lib/Encode/Supported.pod
1930 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1931 Message-ID: <20020322073908.GB10539@blackrider>
1934 "...I think we should include ISO 8859-1 as well." -- NI-S
1935 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1940 ! lib/Encode/Alias.pm
1941 alias definitions relocated to Encode::Alias so module autoloading
1942 works for aliases also.
1944 encodings() now accepts args to check ExtModules.
1948 + EBCDIC/Makefile.PL
1949 + Symbol/Makefile.PL
1953 Latin and single byte encodings are reorganized so they are
1954 demand-loaded like Encode::XX. Now only ascii is compiled into
1956 ! lib/Encode/Alias.pm
1957 for my $k (keys %hash){ delete $hash{$k}; }
1958 is deprecated; fixed.
1960 0.95 Fri Mar 22 2002
1961 In this update, pod rewrites and alias fixes are the main issues
1962 + lib/Encode/Supported.pod
1963 Describes supported encodings
1965 streamlined compiled-in encodings.
1966 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1968 + Encode/ibm-125?.ucm
1969 Added from the ICU distribution, with every occurrence of
1970 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
1971 some respect to icu staff, however.
1972 + lib/Encode/Alias.pm
1974 Alias definitions in Encode.pm relocated.
1977 packWARN patch from Paul Marquess via jhi
1978 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1979 Paul added to AUTHORS as a result.
1980 ! t/CJKalias.t -> t/Aliases.t
1981 Renamed. Checks even more aliases and alias overloading
1984 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1985 gbk => cp936 relocated to CN.pm
1987 Test::More with plans (by jhi)
1989 0.94 Thu Mar 21 2002
1990 + lib/Encode/Description.pod
1991 ! lib/Encode/Encoding.pm
1992 Now the pod in Encode.pm is abridged as programming references.
1993 lib/Encode/Description.pod contains the original, detailed description
1994 and Encode::Encoding explains how to write your own module to
1995 add new encodings. So far, lib/Encode/Description.pod contains
1996 the whole pod once in Encode.pm. This is intentional.
1998 Pod revisions by Anton Tagunov
1999 Message-Id: <517178431.20020320174824@motor.ru>
2001 all occurrences of Encode::Tcl::Extended removed, including pod
2003 test now checks $encoding->name only; $encoding->{name} is
2004 no longer checked to find the canonical name.
2005 ! lib/Encode/JP/JIS.pm
2006 ! lib/Encode/JP/ISO_2022_JP.pm
2007 ->name() added to be more compliant with API
2013 Patch by Autrijus to add aliases to TW and fixes to POD
2014 Message-Id: <20020320090619.GA24774@not.autrijus.org>
2016 SADAHIRO Tomoyuki added, as he should have been. My apologies.
2018 0.93 Wed Mar 20 2002
2019 * First release to be uploaded to CPAN. For prehistoric changes,
2020 please see the Changes file of the perl distribution as well as
2021 the perl-unicode@perl.org archive, available at:
2022 http://archive.develooper.com/perl-unicode@perl.org/
2024 Changes since 0.92 include:
2029 + Mention of perl-unicode@perl.org added
2031 + Encoding aliases added so you can feed locale names
2032 and MIME Charset="" directly.
2033 - Mention of JISX0212 removed because it's fixed
2036 + Encoding aliases added. Note TW is left untouched because
2037 euc-tw is not implemented in TW but in Encode::HanExtra.
2038 Autrijus, you may fix Encode::HanExtra.
2040 + to test encode aliases added