1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.27 2009/01/21 22:55:07 dankogai Exp dankogai $
5 $Revision: 2.27 $ $Date: 2009/01/21 22:55:07 $
6 ! lib/Encode/MIME/Header.pm t/mime-header.t
7 Addressed: Encode::MIME::Header MIME-Q encoding truncates
8 trailing zeros in some circumstances
9 http://rt.cpan.org/Ticket/Display.html?id=342627
11 Added alias: unicode-1-1-utf-7
12 http://rt.cpan.org/Ticket/Display.html?id=38558
14 Documented: _utf8_on() does not work for tainted values
15 http://rt.cpan.org/Ticket/Display.html?id=41163
17 s[oss.software.ibm.com/icu][www.icu-project.org]g
18 http://rt.cpan.org/Ticket/Display.html?id=40245
19 ! lib/Encode/Guess.pm t/guess.t
20 Addressed:Empty file should produce an error message
21 http://rt.cpan.org/Ticket/Display.html?id=38652
24 2.26 2008/07/01 20:56:17
25 ! Unicode/Unicode.xs AUTHORS
26 Refactored by Alex Davies
27 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/2007-10/msg00745.html
28 Message-Id: <7637669B2E3D46B187591747DA27F4C8@Amelie>
30 Absense of Encode::ConfigLocal no longer carps no matter what.
31 http://bugzilla.redhat.com/show_bug.cgi?id=435505#c2
32 http://rt.cpan.org/Ticket/Display.html?id=28638
33 http://rt.cpan.org/Ticket/Display.html?id=11511
35 use encoding 'utf8' and 'iso-2022-jp' glitches on perl 5.10
37 Message-Id: <693254b90807011224h3ab50d76v50c6fea87baf223c@mail.gmail.com>
38 ! lib/Encode/Alias.pm t/Aliases.t
39 macintosh' not recognize as MacRoman
40 http://rt.cpan.org/Ticket/Display.html?id=36326
42 s{INC => "-I./Encode"}
43 {INC => '-I' . File::Spec->catfile( '.', 'Encode' )}
44 To prevent some platforms from forgetting to include Encode/encode.h.
45 http://rt.cpan.org/Ticket/Display.html?id=36348
47 2.25 2008/05/07 20:56:05
49 added ':default' to Exporter option.
50 ! lib/Encode/GSM0338.pm
51 GSM0338 now handles coderef in CHECK
52 http://rt.cpan.org/Ticket/Display.html?id=31335
54 Perl 5.10/Encode 2.24: Tiny typo in Encode's Makefile.PL arg processing
55 Message-Id: <961C2A4F-92B3-416D-A9F9-E7B0ADA9F134@fsck.com>
57 "This fix for Encode::Alias should make Solaris happy:"
58 Message-ID: <47D886D9.6060001@iki.fi>
60 2.24 2008/03/12 09:51:11
61 ! lib/Encode/Config.pm
62 adds and fixes also adds cp858 support.
63 ! Encode.pm encoding.pm lib/Encode/Alias.pm ucm/cp858.ucm
65 > Change 33486 by rgs@scipion on 2008/03/12 08:50:11
66 An unfortunate side-effect of Encode and Encode::Alias use'ing each
67 other, and Encode::Alias exporting functions into Encode for it to use
68 as methods, broke the loading of the find_alias() Encode method in some
69 cases since 5.10. Breaking the recursive inheritance fixes it.
70 Message-Id: <b77c1dce0803120151o4166c3a0gfcfd14681ab7e10d@mail.gmail.com>
73 Message-Id: <200711281835.36125@bloodgate.com>
75 Fix by MIYAGAWA via CodeRepos
76 http://coderepos.org/share/changeset/1791
77 ! encoding.pm t/mime_header_iso2022jp.t
78 ported back from Perl 5.10-RC1
80 2.23 2007/05/29 18:15:32
82 got rid of global fallback_cb; encode_method() now takes one more
83 argument which is a coderef to fallback. This should make
84 encode_method() thread-safe.
86 Added perluniintro, perlunifaq, and perlunitut to POD
88 Plug a memory leak in Encode -- by rgs
89 Message-Id: <b77c1dce0705290858v2be239c3o2d726e3d59091493@mail.gmail.com>
92 http://aspn.activestate.com/ASPN/Mail/Message/perl5-porters/3486118
94 man page generation is now conditional; yes by default but no if $PERL_CORE
95 Message-Id: <b77c1dce0705290237h5c4667cdlf79a48b839170add@mail.gmail.com>
97 2.22 2007/05/29 07:35:27
99 from_to() does not honor the check while decoding. That's a feature.
100 To make sure it is a feature it is mentioned in the POD.
101 http://rt.cpan.org/NoAuth/Bug.html?id=27277
103 Encode used to suppress man page generation. Now it does.
104 http://rt.cpan.org/NoAuth/Bug.html?id=27200
105 ! Encode.pm Encode.xs t/fallback.t
106 Addressed: (de|en)code("ascii", "\x{3000}", sub{ $_[0] }) segfaults
109 2.21 2007/05/12 06:42:19
110 + lib/Encode/MIME/Name.pm t/mime-name.t
111 ! Encode.pm Encode.xs lib/Encode/Encoding.pm
112 new method: mime_name()
113 inspired by: MIYAGAWA
115 Subject: Re: Compress::Zlib, pack "C" and utf-8 [PATCH]
116 From: Marc Lehmann <schmorp@schmorp.de>
117 Date: Thu, 12 Apr 2007 08:41:53 +0200
118 Message-ID: <20070412064153.GA22475@schmorp.de>
119 http://public.activestate.com/cgi-bin/perlbrowse/p/31194
122 Message-Id: <20070417220547.GA11999@zetta.zet>
124 2.20 2007/04/22 14:56:12
126 Pod fixes. Now find_encoding() is explained more in details.
127 + lib/Encode/GSM0338.pm
129 ! lib/Encode/Supported.pod lib/Encode/Config.pm Bytes/Makefile.PL t/gsm0338.t
130 ESTI GSM 03.38 support is relocated from Encode::Byte to Encode::GSM0338.
131 This encoding is so kaputt it is unfit for Encode::XS!
132 Though it was okay for general cases and escape sequences,
133 '\0' => '@' IFF '\0\0' => '\0' had gliches.
134 So kaputt even t/gsm0338 wrongly interpreted that.
135 ref. http://www.csoft.co.uk/sms/character_sets/gsm.htm
136 ! encoding.pm t/Aliases.t
137 Imported from bleedperl #31015
139 2.19 2007/04/06 12:53:41
140 ! lib/Encode/JP/JIS7.pm
142 encode('iso-2022-jp') fallback support added by MIYAGAWA++
143 decode()'s fallback remains unchanged (FB_PERLQQ) since UTF-8
144 contains all characters in iso-2022-jp so there's no need for fancy stuff.
145 Message-Id: <693254b90704060526s6d850320h71cdda50dfbf7eba@mail.gmail.com>
147 #25216 ([PATCH] Encode.pm: postpone the load of Encode::Encoding)
148 http://rt.cpan.org/NoAuth/Bug.html?id=25216
149 ! lib/Encode/MIME/Header.pm t/mime-header.t
150 #24418 (Encode::MIME::Header: wrong encoding with latin1 characters)
151 http://rt.cpan.org/NoAuth/Bug.html?id=24418
153 #23876 (Add documentation for LEAVE_SRC)
154 http://rt.cpan.org/NoAuth/Bug.html?id=23876
155 ! lib/Encode/Alias.pm t/Aliases.t
156 #20781: Thai encoding needs alias for tis-620
157 http://rt.cpan.org/NoAuth/Bug.html?id=20781
159 #20344: piconv: wrong conversion of utf-16le encoded files (with PATCH)
160 http://rt.cpan.org/NoAuth/Bug.html?id=20344
161 ! Encode.pm Encode.xs bin/enc2xs encoding.pm t/Aliases.t t/utf8strict.t
162 Imported from bleedperl's 2.18_01
164 2.18 2006/06/03 20:28:48
166 overhauled the -C option
167 - added ascii-ctrl', 'null', 'utf-8-strict' to core
168 - auto-generated Encode::ConfigLocal no longer use v-string for version
169 - now searches modules via File::Find so Encode/JP/Mobile is happy
170 ! Byte/Byte.pm CN/CN.pm EBCDIC/EBCDIC.pm JP/JP.pm KR/KR.pm Symbol/Symbol.pm
171 use strict added; though all they do is load XS, it's
172 still better a practice
174 use warnings added to all of them for better practices' sake.
176 2.17 2006/05/09 17:10:09
178 'chin' =~ /^zh_CN|chin(?:a|ese)?$/i is true
179 but chin is not china or chinese.
180 http://d.hatena.ne.jp/jankogai/20060508/1147090316
182 Integrated maintperl change (27824|27824) which I overlooked
183 -- sorry, Nicholas and Coverity Scan.
184 Message-Id: <200604152115.k3FLF1Ar014538@smtp3.ActiveState.com>
185 Message-Id: <200605091615.k49GF1gJ016777@smtp3.ActiveState.com>
187 2.16 2006/05/03 18:24:10
189 --xmlcref and --htmlcref added.
191 Copyright Notice Added.
192 http://rt.cpan.org/NoAuth/Bug.html?id=19056
194 Replaced remaining ^\t with q( ) x 4. -- Perl Best Practice pp. 20
195 And all .pm's are now perltidy-ed.
197 2.15 2006/04/06 15:44:11
199 Addressed: UTF-16, UTF-32, UCS, UTF-7 decoders mishandle illegal characters
200 http://rt.cpan.org/NoAuth/Bug.html?id=18556
202 added str2bytes() as an alias to encode() and bytes2str() as an alias
204 http://rt.cpan.org/NoAuth/Bug.html?id=17103
206 Change 26922: Avoid warning with MS Visual C compiler.
207 Message-Id: <200601231245.k0NCj2dw009484@smtp3.ActiveState.com>
209 Change 26067: As using -C to turn on utf8 IO is equivalent to the open pragma
210 Message-Id: <200511092227.jA9MRcYD009025@smtp3.ActiveState.com>
212 2.14 2006/01/15 15:43:36
214 Change 26295: Don't build manpages for Encode and Unicode::Normalize
215 Message-Id: <200512071540.jB7Fe4Gt017960@smtp3.ActiveState.com>
217 Change 26081: Pod nit in Encode.pm, found by Marc Lehmann in RT #36949.
218 Message-Id: <200511110357.jAB3vZcP023647@smtp3.ActiveState.com>
219 ! Encode.xs Encode/encode.h bin/enc2xs encengine.c
220 Change 25821: Mark more static Encode data structures as const.
221 Change 25823: use more 'const' in the Encode data structures.
222 Message-Id: <200510221243.j9MChTSu027711@smtp3.ActiveState.com>
223 Message-Id: <200510221343.j9MDhTk9001245@smtp3.ActiveState.com>
225 2.13 2006/01/15 15:06:36
227 Miyagawa's mail address updated
228 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
229 ! lib/Encode/MIME/Header.pm
230 #16413: Encode::MIME::Headers patch to solve what is probably someone else's bug
231 http://rt.cpan.org/NoAuth/Bug.html?id=16413
232 ! lib/Encode/MIME/Header.pm t/mime-header.t
233 Applied: RT #16258: Support for RFC 2184 language tag
234 http://rt.cpan.org/NoAuth/Bug.html?id=16258
236 Fixed RT #14559: fix for #8872 introduces new "bug"
237 http://rt.cpan.org/NoAuth/Bug.html?id=14559
240 from_to() now makes use of $check more naturally.
241 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
242 <B10B4DE2-9BAF-4344-B3BE-2119977D817A@dan.co.jp>
244 2.12 2005/09/08 14:17:17
245 ! Encode.xs Encode.pm t/fallback.t
246 Now accepts coderef for CHECK!
248 Updated to newer version at unicode.org
249 http://rt.cpan.org/NoAuth/Bug.html?id=14222
250 ! lib/Encode/Supported.pod
252 <42F5E243.80500@gmail.com>
254 More POD typo leftover fixed.
255 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com>
257 2.11 2005/08/05 10:58:25
259 To reflect changes below
260 ! Encode.pm encoding.pm
261 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod
262 Typo fixed by Piotr Fusik in Change 25261 & 25266
263 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec>
265 Addresses "BUG REPORT: panic in Encode.xs".
266 Message-Id: <42EDDA97.2010608@hyper.to>
267 + lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t
268 ! lib/Encode/MIME/Header.pm lib/Encode/Config.pm
269 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka
270 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp>
271 ! Encode/encode.h Encode.pm Encode.xs
272 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character.
273 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp>
274 See also ext/PerlIO/encoding/encoding.pm
275 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com>
277 2.10 2005/05/16 18:46:36
279 fixed decode_utf8() accordingly to RT#8872
280 http://rt.cpan.org/NoAuth/Bug.html?id=8872
282 s/SvIVX/SvIV_set/ by Steve Peters.
283 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com>
287 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict
289 Tests utf-8-strict, accordingly to
290 UTF-8 decoder capability and stress test" by Markus Kuhn
291 http://smontagu.damowmow.com/utf8test.html
292 Note that malformed and overlong sequences are not test here
293 because perl already does that for you, utf-8-strict or not.
294 ! Encode.pm Encode/encode.h t/fallback.t
295 Addressed "encode(..., Encode::LEAVE_SRC) does not work".
296 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so
297 you can (en|de)code constant strings with these fallbacks.
298 http://rt.cpan.org/NoAuth/Bug.html?id=8736
299 ! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t
300 Make Encode.pm support the real UTF-8, by GAAS
301 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com>
302 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com>
303 ! Encode.pm Encode.xs
304 post-2.09 comment patches from GAAS applied.
305 Message-Id: <lroehacz6q.fsf@caliper.activestate.com>
306 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com>
308 2.09 2004/12/03 19:16:53
309 ! Encode.pm Encode.xs
310 Addressed " :encoding(utf8) broken in perl-5.8.6".
311 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com>
313 Addressed "(de|en)code($valid_encoding, undef) does not warn".
314 http://rt.cpan.org/NoAuth/Bug.html?id=8723
315 ! Encode.pm t/Encode.t
316 Addressed "Can't encode URI". When a reference is fed to (en|de)code,
317 Encode now stringifies instead of returning undef.
318 http://rt.cpan.org/NoAuth/Bug.html?id=8725
319 ! Encode.xs t/fallback.t
320 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder".
321 http://rt.cpan.org/NoAuth/Bug.html?id=8694
323 Addressed "s/digit/number/".
324 http://rt.cpan.org/NoAuth/Bug.html?id=8695
326 Addressed "while (defined(read )) { ... } is an infinite loop".
327 http://rt.cpan.org/NoAuth/Bug.html?id=8696
329 Addressed "What the heck is UCM?".
330 Document fixed so that it no longer contains "UCM-Based Encodings".
331 http://rt.cpan.org/NoAuth/Bug.html?id=8697
333 2.08 2004/10/24 13:00:29
334 ! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs}
335 Resolved the issue that was raised by 2.07 -- Encode::utf8 fallbacks
336 that was introduce messed up PerlIO::encoding.
337 * To do so, ->renew() is renewed and ->renewed() was introduced to
338 tell whether the caller is PerlIO or not.
339 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp>
341 2.07 2004/10/22 19:35:52
342 ! lib/Encode/Encoding.pm
343 "Remove Carp from warnings.pm" that influences Encode, by Tels.
344 Message-Id: <200410161618.29779@bloodgate.com>
345 ! Encode.xs AUTHORS t/fallback.t
346 Now Encode::utf8's fallbacks are compliant to Encode standard.
347 Thank Bjoern Hoehrmann for persistently convincing me.
348 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
352 2.06 2004/10/22 06:23:11
354 RT #8083 reports that MacThai mapping was obsolete
355 Updated all mac* encodings accordingly to the URI below.
356 One remaining mystery is that MacRomanian vs. MacRumanian.
357 MacRumanian is not found in unicode.org...
358 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
359 ! Encode.pm t/Encode.t
360 Fixed RT #8081: "decode(..., bless{},'x') segfault"
361 Two more tests added to test that.
362 http://rt.cpan.org/NoAuth/Bug.html?id=8081
364 POD revised accordingly to RT #7966
365 http://rt.cpan.org/NoAuth/Bug.html?id=7966
367 POD updated explaining why Encode::Unicode always croaks on error
368 rather than giving users choices.
369 http://rt.cpan.org/NoAuth/Bug.html?id=7892
371 2.05 2004/10/19 04:55:01
373 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
374 Message-ID: <41210A84.6060506@iki.fi>
375 Message-ID: <20041018233442.7418113f@r2d2>
376 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
378 2.04 2004/10/16 21:22:44
380 From: craigberry@mac.com
381 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
382 Message-Id: <41716868.7000102@mac.com>
384 2.03 2004/10/06 05:07:20
385 ! lib/Encode/Alias.pm
386 Resolved some alias case sensitivity glitches reported via RT.
387 http://rt.cpan.org/NoAuth/Bug.html?id=7835
389 Resolved Win32 glitches reported via RT.
390 (Fixed by dankogai and tested by Steve Hay)
391 http://rt.cpan.org/Ticket/Display.html?id=7831
392 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
393 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
394 http://rt.cpan.org/NoAuth/Bug.html?id=6695
396 2.02 2004/08/31 10:55:34
397 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
398 New map submitted by Deng Liu and Autrijus. Test data needed
399 to be upgrade as well, done by dankogai
400 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
402 Now works for characters U+10000 and above. This fix was needed
403 to "tidy" the original map that was submitted.
405 "ucmsort" now mentioned in pod
407 2.01 2004/05/25 16:27:14
409 From: domo@computer.org
410 Subject: [PATCH] Correct statistics from enc2xs
411 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
412 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
413 ! lib/Encode/Alias.pm
414 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
415 <200405271148.i4RBm4KY026529@mail.mvnet.de>
417 2.01 2004/05/25 16:27:14
418 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
419 "If someone thinks utf8::upgrade($1) should be croaked like
420 chom?p($1),please try the following patch for Encode.pm."
422 <20040522212704.C068.BQW10602@nifty.com>
424 2.0 2004/05/16 20:55:15
425 * version updated to 2.00
426 -- sorry, no big feature change. I just hate version 1.100 :)
427 ! lib/Encode/Guess.pm
429 addressed UTF-(8|32LE) + BOM misguessing
430 https://rt.cpan.org/Ticket/Display.html?id=6279
432 s/is_utif8/is_utf8/ in POD
433 ! Encode/lib/Encode/CN/HZ.pm
434 Fixes "make test" failure after the patch to pp_hot.c
436 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
438 From: autrijus@autrijus.org
439 Subject: [PATCH] "piconv -C 512" badly broken
440 Message-Id: <1072870210.769.5.camel@localhost>
442 1.99 2003/12/29 02:47:16
444 find_encoding("UTF-16BE")->encode("abc") now null terminates
445 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
447 prototype bug in decode_utf8() fixed
448 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
449 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
450 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
452 Merged from maintperl@21987
454 1.98 2003/08/20 11:15:31
455 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
456 Dave Evans has found and corrected a bug in Encode::MIME::Header.
457 Test suite added by Dan Kogai.
458 Message-Id: <3F43440B.7060606@rudolf.org.uk>
460 Typo fixes rolled back in from bleedperl
461 ! t/at-cn.t t/at-tw.t
462 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
463 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
465 argv case nit for VMS by Craig
466 Message-ID: <3F2B02DE.10207@mac.com>
467 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
468 Encode test fixes for VMS by Peter Prymmer
469 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
470 ! lib/Encode/Alias.pm t/Aliases.t
471 koi-8 aliases bug detected and patched by sadahiro.
472 Further fix and test suite by dankogai
473 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
475 1.97 2003/07/08 21:52:14
476 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
477 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
478 $DEBUG replaced with DEBUG() so perl optimizes better,
479 by Rafael with further fixes by dankogai
480 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
481 ! lib/Encode/Aliases.pm
482 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
483 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
484 So new hash seeding introduced in bleedperl works.
485 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
486 ! lib/Encode/Guess.pm
487 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
488 automatic utf(8|16|32) guessing -- originally by Autrijus
489 Message-Id: <20030626162731.GA2077@not.autrijus.org>
491 Addressed the following;
492 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
493 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
495 1.96 2003/06/18 09:29:02
496 ! lib/Encode/JP/JP.pm t/guess.t
497 m/(...)/ in void context then $1 is considered a Bad Thing
498 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
500 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
503 More typecast from maintperl@19739
504 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
506 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
507 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
509 Now skips for EBCDIC platform.
510 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
512 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
513 have more than one C<.> in their filenames." -- perlport.
514 Message-Id: <3ED79E01.8050401@mac.com>
516 Found and fixed the back that -p,--perlqq does not work.
517 Induced by the change from Getopt::Std to Getopt::Long.
519 Addressed [cpan #2629] Wrong assumption in numeric comparison
520 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
521 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
522 lib/Encode/Encoding.pm t/perlio.t
523 ! API Change: ->new_sequence() => ->renew()
524 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
525 + Encode::XS and Encode::utf8 now supports ->renew()
526 + Encode::Encoding now documents this with examples
527 - Non-XS (en|de)code stripped out of Encode::Unicode
528 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
530 1.95 2003/05/21 08:41:11
532 Since bogus entries were found in iso-8859-6, all entries are
533 re-generated once again out of
534 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
535 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
536 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
537 + lib/Encode/Unicode/UTF7.pm
538 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
539 lib/Encode/Supported.pod
540 UTF-7 support is now added. With this Encode now has all transcoding
541 methods in Unicode::String.
543 1.94 2003/05/10 18:13:59
544 ! lib/Encode/MIME/Header.pm
545 A more sophisticated solution for double-encoding by dankogai
546 ! lib/Encode/MIME/Header.pm AUTHORS
547 Two bugs fixed by Bjoern Jacke
548 * "Double Encoding" was not possible
549 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
550 * encode("MIME-Q") had UTF-8 flag on
551 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
552 ! lib/Encode/MIME/Header.pm AUTHORS
553 Two occurances of "croak ()" fixed as "croak qq()".
554 Simon Cozens is added to AUTHORS as a result.
555 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
557 POD fixes that reflect enhancements by jhi
559 Two enhancements by jhi.
560 + Now uses Getopt::Long so it accepts long name options
561 (--from for -f, for example)
562 + New option: -r,--resolve
563 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
565 META.yml added upon request of Schwern
566 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
568 Enache Adrian removed upon request -- to live longer than Encode
569 and/or FreeBSD (toy-)?thread :)
570 Message-Id: <20030425015701.GA2069@ratsnest.hole>
572 "close STDOUT unless $^O eq 'freebsd';" once again relocated
573 to keep VMS happy in which case "$^O eq 'freebsd'" is required
574 to keep FreeBSD+thread happy. Sigh.
575 Message-Id: <3EA88ADC.3000300@mac.com>
577 1.93 2003/04/24 17:43:16
579 added "no warnings 'pack'" in for loop to keep bleedperl from
580 complaining "Character in 'C' format wrapped in pack".
582 More elegant perl core detection inspired by Ilya Zakharevich
583 (but further elaborated for general cases).
584 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
587 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
588 are less likely to be confused w/ ISO-8859-*
590 RT tests added (vendor encodings are exemplified)
591 -- that successfully found a flaw on iso-2022-kr before the patch.
592 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
593 decode("ISO-2022-KR") has been buggy but no one ever sited
594 that since no one seems to be using it. Bugs discovered by
596 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
597 ! lib/Encode/CN/HZ.pm t/perlio.t
598 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
599 so it adds test for HZ.
600 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
601 ! lib/Encode/Guess.pm
602 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
603 So long as the string contains \x{00}-\x{ff} it does not fail.
604 See perldoc for details.
605 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
607 1.92 2003/03/31 03:27:27
608 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
609 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
610 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
611 but glibc-BIG5-2.1.2.ucm does.
612 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
613 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
614 Maps regenerated again but this time based upon
615 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
616 (But where is THE DOCUMENT by MICROSOFT?)
617 ! t/enc_module.t AUTHORS
618 failure with threaded Perl on FreeBSD addressed.
619 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
620 Message-Id: <20030322230131.GA813@ratsnest.hole>
621 ! lib/Encode/Guess.pm
624 Change 18989: Make the :bytes conditional on PerlIO.
625 further Modified by Dan Kogai
626 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
628 Chnage 18966: another fix for failing test on windows ("use encoding"
629 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
630 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
632 Change 18970: Hopefully this works also in Win32, by jhi
633 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
634 Change 18965: fix CJKT.t failures on windows due to incorrect
636 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
638 1.91 2003/03/09 20:07:37
640 even more proofread by jhi.
641 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
644 +use lib qw(t ext/Encode/t ../ext/Encode/t);
645 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
647 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
648 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
651 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
653 1.90 2003/03/09 17:32:43
656 Inaba-san has added a patch for perl 5.8.1 or later that makes
657 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
658 that. POD is further revised.
659 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
660 ! encoding.pm t/enc_module.t
661 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
662 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
666 signed vs. unsigned issue discovered by Craig on OpenVM
667 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
668 ! encoding.pm AUTHORS
669 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
670 Because binmode() stacks layers instead of overwrite, you have to
671 ":raw :encoding()" in encoding.pm or your are in trouble when you
672 call encoding.pm multiple times. There are several workarounds
673 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
674 who was the first to address this problem was added to AUTHORS.
675 The test suites was added for this, which is a modified version
676 of SUGAWARA-san's scripts
677 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
679 1.88 2003/02/20 14:42:34
681 one signedness nit for Encode by jhi
682 <200302161933.h1GJX876018710@kosh.hut.fi>
684 VISCII map was incorrect; fixed by Sadahiro-san
685 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
686 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
687 You can't unlink files that are opened in cygwin but the last
688 file handle opened in t/enc_*.t left open. Patch submitted
689 by Yitzchak and he was added to AUTHORS.
690 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
692 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
693 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
695 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
696 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
698 1.87 2003/02/06 01:52:11
700 * Inaba "Sensei" Hirohito added (I thought I have done so a long
701 ago but apparently I did not).
702 * SUZUKI Norio added for verious and useful bug reports.
703 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
704 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
705 podchecked so all warnings are gone except for L<http://>.
706 ! encoding.pm t/enc_eucjp.t
707 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
708 * Filter option overhaul
710 ! Encode.pm Encode.xs encengine.c Encode/encode.h
711 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
712 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
713 without filter. podchecked by Dan Kogai.
714 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
715 ! lib/Encode/Alias.pm
716 decode('alias', $1) went wild because of local $_ in find_alias()
717 the evil local $_ is eradicated but that changes find_alias()
718 format for coderef aliasing. See Encode::Alias for details
719 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
721 1.86 2003/01/22 03:29:07
723 * Don't forget to canonize when you attempt an exact match!
724 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
725 * ${^ENCODING} exception is off for $] > 5.008
726 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
728 $] check commented out so it runs on 5.8.0
730 1.85 2003/01/21 22:19:14
732 ${^ENCODING} exception is now explicit rather than handled by regex.
733 + t/enc_eucjp.t t/enc_utf8.t
734 Test suite for the better "encoding" pragma support for bleedperl.
735 On 5.8.0, they will just be skipped.
737 1.84 2003/01/10 12:00:16
739 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
740 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
741 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
743 %_ fixes saves the resulting .so .05% smaller, by NC
744 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
746 Silence Encode on undef, by Andreas
747 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
748 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
750 s/regognised/recognised/ . British spelling left intact to pay
751 respect to two British Nicks :)
752 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
754 1.83 2002/11/18 17:28:49
755 ! Encode.xs lib/Encode/JIS7.pm
756 Even more patches from Inaba-san has been applied. With this
757 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
758 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
760 1.82 2002/11/14 23:06:12
762 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
763 later found in perls w/ -Dusemymalloc was fixed by NC.
764 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
766 1.81 2002/11/08 18:29:27
767 ! Encode.pm Encode.xs
768 Non-XS version of Encode::utf8 is back (with XS being default).
769 Encode::predefine_encodings(0) to turn off XS.
770 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
771 found that it was Test::More causing the problem, not Encode.
772 But I have already made it configurable so it may be useful in
774 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
776 The ingenious patch by Nicholas Clark that reduces shlib sizes by
777 50% with no penalty and backward compatibility preserved, is in.
778 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
780 1.80 2002/10/21 20:39:09
781 ! Encode.xs t/mime-header.t
782 Even more patches from NI-XS regarding Encode::utf8->decode().
783 And one more test to t/mime-header.t to prove it
784 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
786 1.79 2002/10/21 06:05:37
788 Further patches from NI-XS. Encode::utf8->decode() now checks the
789 value of utf8 flag of the argument. As a result, the fix to
790 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
791 no harm (even speedwise) I'll leave it unreverted.
792 ! ucm/cp949.ucm ucm/cp950.ucm
794 U+00AE REGISTERED SIGN
795 were missing as a result of 1.78. Discovered by Moriyama-san.
796 Moriyama-san has also developed a test script that compares
797 (en|de)coded results to the corresponding Win32 API result and
798 all cp9?? maps are now verified.
799 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
801 1.78 2002/10/20 15:44:00
802 ! lib/Encode/MIME/Header.pm
803 fixed so that it works with new Encode::utf8
804 ! Encode.pm Encode.xs
805 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
806 :encoding(UTF-8) to handle partial chars at end of buffers
808 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
809 ! lib/Encode/Supported.pod
810 More nitpickings applied.
813 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
814 tests were completely useless. To redeem that and get the peace of
815 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
816 ucm/*.ucm Since this script takes too long to finish (30 seconds on
817 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
818 but you can easily run that by either renaming it or:
820 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
821 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
822 Other CJKT cp9?? also updated according to the URI below;
823 http://www.microsoft.com/typography/unicode/cscp.htm
824 + bin/ucmsort MANIFEST
825 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
826 to proper order. intended for hardcore develpers only.
827 ! ucm/cp932.ucm JP/JP.pm AUTHORS
828 CP932 mapping which was based upon the mapping file at unicode.org
829 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
830 has also supplied the patch so he was added to AUTHORS.
831 ! lib/Encode/Supported.pod
832 ISO-8859-11 != TIS 620
833 == TIS 620 + \xA0 ( )
835 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
837 1.77 2002/10/06 03:27:02
839 * Modified to accomodate up and comming patch by Inaba-san that
840 will fix tr/// needing eval qq{}
841 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
843 * pod fixes/enhancements to reflect the changes above
844 ! lib/Encode/Alias.pm
845 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
846 Message-Id: <20021001015648.GB18710@not.autrijus.org>
848 1.76 2002/08/25 15:09:51
850 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
851 but naturally identical to previous version -- dankogai
853 Codepoint fixes -- autrijus
854 Message-Id: <20020805040236.GC5220@not.autrijus.org>
856 copied everything under perl-5.8.0/ext/Encode to make sure Encode
857 is in sync w/ perl core
859 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
860 Now that binmode(FH) does implicit ":bytes" revisit
861 the failing tests. The worrisome one is the Digest::MD5
862 test-- how will it fare in CRLF lands now?
864 From: Radu Greab <radu@netsoft.ro>
865 Date: Mon, 10 Jun 2002 00:40:34 +0300
866 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
867 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
868 ones which I cannot figure out.
869 ! lib/Encode/Alias.pm
870 Subject: [Encode PATCH] spurious warning
871 From: Nicholas Clark <nick@unfortu.net>
872 Date: Sun, 2 Jun 2002 20:26:22 +0100
873 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
875 1.75 2002/06/01 18:07:49
876 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
877 glibc compliance cited by Autrijus.
878 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
879 ! bin/enc2xs bin/piconv
880 Subject: Re: forewarning: usedevel and versiononly
881 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
883 1.74 2002/05/28 18:33:15
884 + ucm/null.ucm ucm/ctrl.ucm
885 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
886 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
887 Subject: Re: unicode -> &# notation
888 Message-ID: <20020518193704.GB40272@not.autrijus.org>
890 1.73 2002/05/28 17:26:18
891 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
892 Chris Nandor has fixed Encode so that it works w/ MacPerl --
893 at least w/ PPC (68k need static linking which does not work due to
894 64k limit). pudge is added to AUTHORS (I'm surprised he was not
895 there in the list). Encode/Makefile_PL.e2x was additionally fixed
896 by dankogai to reflect changes in other Makefile.PL
897 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
899 Subject: Change 16746: -Mutf8 cleanup.
900 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
902 1.72 2002/05/20 15:49:56
904 Subject: [PATCH] Encode should be in perl-core library path
905 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
906 Message-Id: <20020520161201.A11019@alpha.hut.fi>
907 ! lib/Encode/MIME/Header.pm
908 Subject: [PATCH] Encode::MIME::Header
909 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
910 ! Encode/Makefile_PL.e2x
911 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
912 Message-Id: <20020519201031.GA1603@not.autrijus.org>
913 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
914 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
916 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
917 Subject: [PATCH] good day for WinCE port of perl.
918 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
921 Even more linting by Robin via @16532
923 Even more typecast by Sarathy in @16460
925 1.71 2002/05/07 16:22:42
927 even more typecasts by Robin
928 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
930 A very strange bug that was causing a bugus ucm -> C table
931 generation that was revealed by a UCM file that Andreas was
932 working. This is the king of wierdest bug I've encountered
933 in the course of Encode maintenance.
934 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
936 1.70 2002/05/06 10:26:48
938 Made more 'module-safe' with conjunction w/ 'no encoding'.
939 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
940 ! lib/Encode/Encoding.pm
941 'require Encode' because ->Define uses Encode::define_encoding();
942 problem and solution addressed by Miyagawa-kun
943 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
945 Cuts the frill to make djgpp happier, as suggested by Laszlo
946 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
948 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
949 Message-Id: <m3bsbug48n.fsf@anima.de>
951 1.69 2002/05/04 16:41:18
952 ! lib/Encode/MIME/Header
953 Floating-point coerced for UNICOS (in integer arithmetics it folds
954 line one character too early). Verification by Mark is pending.
955 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
957 more doc patch from Elizabeth
958 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
959 ! Encode/Makefile_PL.e2x
960 More platform-independent patch from Benjamin
961 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
962 ! lib/Encode/Guess AUTHORS
963 split regex fix by Graham Barr. Adds him to AUTHORS.
964 Message-Id: <20020504085419.E95940@valueclick.com>
965 ! Encode/Makefile_PL.e2x
966 enc2xs script discovery made smarter and more sensible, first cited
967 by Miyagawa-kun and further suggestions by Rafael and Andreas
968 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
969 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
971 1.68 2002/05/03 12:20:13
972 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
973 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
974 and Supported.pod modified to reflect the change. Elizabeth added
975 to Authors. And H.M. is also added for forwarding her patch among
976 other contributions (I was rather surprised to find his name was not
978 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
980 1.67 2002/05/02 07:33:09
982 Error message now consistent w/ perlqq (\N{U+} -> \x{})
983 done in perl@16308 but Philip linted me further. Now the error
984 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
985 ! lib/Encode/Guess.pm
986 Sanity check for happier -w by Autrijus
988 1.66 2002/05/01 05:41:06
989 ! Encode.xs t/fallback.t
990 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
991 while fallback is in effect. This even came with a welcome side-effect
992 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
993 also modified to test this.
994 And of course, the corresponding varialbles to UV[Xx]f are appropriately
995 cast. This should've concluded NI-XS homework.
997 encode(undef) does warn again! Repented upon suggestion by NI-XS.
998 Document for unless vs. '' added
999 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
1001 1.65 2002/04/30 16:13:37
1003 encode(undef) no longer warns for C<Use of uninitialized value in
1004 subroutine entry>. Suggested by Paul.
1005 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
1006 ! lib/Encode/Supported.pod
1007 Encode::MIME::Header and Encode::Guess mentioned
1008 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
1009 ! lib/Encode/Guess.pm
1010 POD fix by Miyagawa-kun
1011 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
1013 1.64 2002/04/29 06:54:06
1015 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
1016 encocde("euc-jisx0213") are still DIFFERENT.
1017 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
1019 A few white spaces corrected by NI-XS via PerlIO integration to
1021 Subject: Change 16247: Integrate perlio;
1023 Document fixes by Andreas
1024 Message-Id: <m3k7qsf1we.fsf@anima.de>
1026 1.63 2002/04/27 18:59:50
1027 ! lib/Encode/Encoding.pm
1028 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
1029 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
1030 Make use of the Encode::Encoding base class!
1031 And other cleanups in Encode.xs upon NI-XS suggestions
1032 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
1034 1.62 2002/04/27 11:17:39
1036 encodings() now just check %ExtModule instead of eval{require}
1037 all of them for ":all" to conserve more memory.
1039 more "%x" -> "%" UVxf stuff.
1041 s/=over2/=over 2/g # oops.
1043 1.61 2002/04/26 03:02:04
1045 Now does decent tests besides use_ok()
1046 ! lib/Encode/Guess.pm t/guess.t
1047 UI streamlined, document added
1048 ! Unicode/Unicode.xs
1049 various signed/unsigned mismatch nits (#16173)
1050 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
1052 POD: utf8-flag-related caveats added. A few sections completely
1056 Thou shalt not assume %d works, either!
1057 Robin Baker added to AUTHORS for this
1058 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
1060 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
1062 1.60 2002/04/24 20:06:52
1064 "Thou shalt not assume %x works." -- jhi
1065 Message-Id: <20020424210618.E24347@alpha.hut.fi>
1066 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
1067 low-memory build machines happy, now *.c is created for each *.ucm
1068 (no table aggregation). You can still override this by setting
1069 $ENV{AGGREGATE_TABLES}.
1070 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
1071 + lib/Encode/Guess.pm
1072 + lib/Encode/JP/JIS7.pm
1073 Encoding-autodetect (mainly for Japanese encoding) added. In a
1074 course of development, JIS7.pm was improved.
1075 + lib/Encode/HTML/Header.pm
1076 + lib/Encode/Config.pm
1077 MIME B/Q Header Encoding Added!
1078 ! Encode.pm Encode.xs t/fallback.t
1079 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
1080 Message-Id: <20020424130709.GA14211@tanglefoot>
1082 1.59 $ 2002/04/22 23:54:22
1083 ! Encode.pm Encode.xs
1084 needs_lines() and perlio_ok() are added to Internal encodings such
1085 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
1087 1.58 2002/04/22 23:54:22
1089 s/MacChineseSimp/MacChineseTrad/ # ... oops.
1094 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
1095 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
1096 t/(JP|KR).t are aggregated to t/CJKT.t
1097 test data are all remade via bin/ucm2text.
1098 And .... They are no longer skipped for -Uuseperlio !
1100 1.57 2002/04/22 20:27:30
1101 ! t/JP.t t/KR.t t/perlio.t
1102 unless (find PerlIO::Layer 'perlio') ... line is back again.
1103 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
1104 need PerlIO. Sigh....
1105 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
1106 ->perlio_ok now does eval{ require PerlIO::encoding } there so
1107 it correctly returns 1 when PerlIO::encoding is yet loaded.
1109 perl-current patch #16072 reflected
1111 1.56 2002/04/22 09:48:07
1112 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
1113 New PerlIO::encoding 0.04 compliance met
1115 1.55 2002/04/22 03:43:05
1116 ! Encode.pm Encode.xs Unicode/Unicode.pm
1117 needs_lines() defined so Encode::Encoding is no longer needed
1120 1.54 2002/04/22 02:50:01
1121 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
1122 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
1123 now perlio_ok is true by default if PerlIO::encoding->VERSION is
1124 0.03 or larger. POD in Encode::Encoding revised to reflect this.
1125 Encode::XS and Encode::Unicode now has perlio_ok() method.
1126 ! lib/Encode/Supported.pod
1127 s/UP-UX/HP-UX/ by jhi
1128 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
1129 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
1130 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
1131 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
1132 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
1133 ! lib/Encode/Supported.pod
1134 Huge document fixes by Philip.
1137 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
1138 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
1140 binmode() after "<:encoding" to make Win32 happy, by Mattia.
1141 Mattia added to AUTHORS file
1142 Message-Id: <3CC3150F.5798.22A05AE@localhost>
1144 1.52 2002/04/20 23:43:47
1146 TODO: is now SKIP:, as NI-XS requested. Also adds more
1147 eraborate failure analysis added.
1149 A note on how to make sure of round-trip safety added to POD
1150 section (so Autrijus is happier)
1151 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
1152 big5-(eten|hkscs) is round-trip safe again!
1153 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
1155 Typo fixes by Andreas
1156 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
1157 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
1158 PerIO coodination patches from NI-XS.
1159 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
1161 1.51 2002/04/20 09:58:23
1163 Updated test suite by Autrijis so "make test" is happy again
1164 Message-Id: <20020420082104.GA25037@not.autrijus.org>
1166 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
1168 TW/TW.pm TW/Makefile.PL
1169 Updates by Autrijus. 'big5' is no longer a canonical but an
1170 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
1171 Message-Id: <20020419195346.GA19597@not.autrijus.org>
1173 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
1174 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
1176 PerlIO detection a little bit smarter; no longer uses eval qq{}
1179 1.50 2002/04/19 06:13:02
1180 ! ! Encode.pm Encode.xs Encode/encoding.h
1182 New Fallback API imlemented and documented. See "perldoc Encode"
1184 ! lib/Encode/JP/JIS7.pm Encode.pm
1185 + lib/Encode/PerlIO.pod t/perlio.t
1186 API compliance met. However, it still does not work unless perlio
1187 implements line buffer. See BUGS section in perldoc Encode::PerlIO
1188 As a sensible workaround, perlio_ok() added to Encode.
1190 ! lib/Encode/Supported.pod
1192 Message-Id: <20020418174647.J8466@alpha.hut.fi>
1194 Doc fixes from Autrijus
1195 Message-Id: <20020418144131.GA10987@not.autrijus.org>
1197 perlqq mode documented
1199 + t/jisx0201.euc t/jisx0201.ref
1200 ! t/jisx0208.euc t/jisx0208.ref
1201 t/JP.t tests more rigorously and with other encodings
1202 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
1203 in jis0208 (used to contain jisx0201 part).
1204 ! Encode/Makefile_PL.e2x
1205 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
1206 enc2xs and encode.h rather than hard-coded. This allows the resulting
1207 module fully CPANizable.
1208 ! encoding.pm t/JP.t t/KR.t
1209 PerlIO detection simplified (checks %INC instead of eval{})
1210 ! Encode.xs Encode/encode.h
1211 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
1212 - lib/Encode/Unicode.pm
1213 (en|de)code_xs relocated to where it belongs. Source reindented
1216 Additional (U8 *) cast added as suggested by jhi
1217 Message-Id: <20020417165916.A28599@alpha.hut.fi>
1219 1.42 Date: 2002/04/17
1221 no-op module; Thought of adding a pod there but enc2xs has
1226 correct mechanism to detect Perlio::encoding layar installed.
1228 PerlIO Layer detached.
1230 1.41 2002/04/16 23:35:00
1232 binmode(STDIN|STDOUT ...) done iff PerlIO is available
1234 Cleaned up PerlIO skip conditions to prepare for the upcoming
1235 Encode - PerlIO forking.
1237 exported functions are now prototyped.
1238 ! lib/Encode/CN/HZ.pm
1241 fallback implemented # was /* FIXME */
1242 affected programs revised to fit (only HZ was using the try-catch
1243 approach which needed to be fixed for API-compliance).
1245 ! Encode/KR/2022_KR.pm
1247 can find =head1 NAME now, jhi
1248 Message-Id: <20020416083059.V30639@alpha.hut.fi>
1252 now complies with less warnings with the pickest compilers.
1253 Suggested by Craig, fixed by Dan.
1254 ! Encode/Makefile_PL.e2x
1256 A bug that fails to find *.e2x in certain conditions fixed
1258 1.40 2002/04/14 22:27:14
1259 + Encode/ConfigLocal_PM.e2x
1260 ! lib/Encode/Config.pm
1262 "enc2xs -C" now generates/updates Encode::ConfigLocal.
1263 ConfigLocal_PM.e2x is a skelton thereof.
1264 ! lib/Encode/Config.pm
1266 "use Encode::CN::HZ;" was missing.
1269 More rigorous tests added to test XS, especially on memory allocation.
1271 ! lib/Encode/Unicode.pm
1272 NI-S implemented an XS version -- merged
1273 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
1276 Source filter option added. With this option on, you can write
1277 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
1278 encodings. t/jperl.t enhanced to test this feature.
1280 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
1281 apply his suggestion, this degree of nitting is enough to add him
1283 Message-Id: <3CB93223.291E5E2E@earthlink.net>
1285 + lib/Encode/JP/JIS7.pm
1286 - lib/Encode/JP/JIS.pm
1287 - lib/Encode/JP/2022_JP.pm
1288 - lib/Encode/JP/2022_JP1.pm
1289 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
1290 JIS7.pm for better maintainability and performance
1292 Added caveat for non-ascii identifiers.
1294 fixes by jhi, the original author of this pragramtic module.
1295 Message-Id: <20020413231527.V1826@alpha.hut.fi>
1297 1.34 2002/04/12 20:23:05 (Unreleased)
1300 EBCDIC fixes addressed by jhi.
1301 Message-Id: <20020412161844.D9383@alpha.hut.fi>
1302 ! lib/Encode/Encoder.pm
1303 POD fix by Miyagawa-kun
1304 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
1306 1.33 2002/04/10 22:28:40
1308 Philip's mail address corrected.
1311 ! lib/Encode/Encoder.pm
1312 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
1313 This adds Miyagawa-kun to AUTHORS.
1314 * encoding() no longer exported by default but on demand
1315 * t/Encoder.t updated to test all these
1316 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
1317 ! lib/Encode/Unicode.pm
1318 ! lib/Encode/Supported.pm
1319 Further doc fixes by Anton
1321 1.32 2002/04/09 20:06:15
1324 - ucm/macDevanaga.ucm Unicode Character Map
1325 - ucm/macGujarati.ucm Unicode Character Map
1326 - ucm/macGurmukhi.ucm Unicode Character Map
1327 A utility to check integrity of .ucm files. t/bogus.ucm is a
1328 ucm that is deliberately bogus. unused Indic mappings are removed
1331 resolve_alias() added as suggested by jhi. Same as
1332 find_encoding("alias")->name. For convenience. This one is
1333 defined in Encode.pm instead of Alias.pm.
1334 Message-Id: <20020409215846.H17022@alpha.hut.fi>
1336 Memory Allocate but detected during the devel of ucmlint -- fixed.
1337 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1338 ! lib/Encode/Unicode.pm
1339 valid_ucs2(0) is false but must be true.
1340 3 patches from NI-S as follows. This also has fixed the incident
1342 ! lib/Encode/Alias.pm
1343 find_alias() recursion prevention
1345 Checks for the patch above
1346 ! t/Encode/Unicode.pm
1347 An extra "F" that causes valid_ucs2() return a bogus value fixed
1348 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1349 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1350 2 Small Patches from jhi as follows:
1352 Encode->encodings() lists in case-insensitve order (as it was)
1354 -l option prints avaiable encodings to STDOUT instead of STDERR
1355 ! lib/Encode/Aliases.pm
1356 s/defintion/definition/
1357 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1359 ! lib/Encode/Supported.pod
1360 ! lib/Encode/Unicode.pm
1361 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1362 Thank you for the exact quote of Douglas Adams :)
1363 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1365 1.31 2002/04/08 18:08:07
1366 ! lib/Encode/Encoder.pm
1368 Encode::Encoder, once just a placeholder of an idea, is now much more
1369 practical. See t/Encode.t to find how practical it can be.
1370 + lib/Encode/Config.pm
1372 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1373 in Encode::Config and they are all literally, not programatically
1374 set. My false laziness was resulting many encodings missing from
1376 ! lib/Encode/Unicode.pm
1378 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1379 so that it does not rely Encode::Unicode for BOM values
1380 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1382 1.30 2002/04/08 02:34:51
1383 + lib/Encode/Encoder.pm
1384 Object Oriented Encoder. I reckon something like this is in need.
1387 ! lib/Encode/Supported.pod
1388 * autoloading bug that prevented upper-case canonicals such as UTF-16
1389 is fixed. Now even UTF/UCS are autoloaded!
1390 * encodings() is now more intuitive.
1391 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1393 * Obligatory fixes to the POD.
1394 ! lib/Encode/Supported.pod
1395 Patch from Anton applied.
1396 Message-Id: <66641479.20020408033300@motor.ru>
1398 ! lib/Encode/Unicode.pm
1399 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1401 1.28 2002/04/07 18:58:42
1405 Just a MANIFEST for those missing files.
1407 1.26 Date: 2002/04/07 15:22:04
1410 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1411 And t/Aliases.t fixed to make sure they all exist.
1413 1.25 2002/04/07 15:01:25 (Unreleased)
1415 ! lib/Encode/Unicode.pm
1418 - lib/Encode/UTF_EBCDIC.pm
1419 - lib/Encode/Internal.pm
1420 - lib/Encode/utf8.pm
1421 Integrated into Encode.pm as closures. That way "one package, one file"
1422 rule is preserved yet less files to require.
1424 commented out binmode(STDERR ...
1428 ! EBCDIC/Makefile.PL
1431 ! Symbol/Makefile.PL
1433 ! Encode/Makefile_PL.e2x
1434 Schwarn's MM-compliance patch merged
1435 Message-Id: <20020406082609.GA28758@blackrider>
1437 ! lib/Encode/Unicode.pm
1438 + lib/Encode/UTF_EBCDIC.pm
1440 - lib/Encode/10646_1.pm
1441 - lib/Encode/ucs2_le.pm
1442 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1443 per-module based (en|de)code, I saved a number of .pm by
1444 reorganizing it as per-object base (Well, this is what Encode::XS
1445 does under the hood). See Encode::Unicode for details.
1446 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1447 This module is used only on EBCDIC environments.
1449 1.21 2002/04/05 14:46:34 (Not Released)
1455 Are back to make Perl/Tk happy Smile, NI-S.
1457 ! lib/Encode/Alias.pm
1458 ! lib/Encode/Supported.pm
1459 ! lib/Encode/10646_1.pm
1460 ! lib/Encode/ucs2_le.pm
1461 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1462 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1463 updated to reflect changes.
1464 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1465 ! lib/Encode/Alias.pm
1466 ! lib/Encode/Supported.pm
1467 Anton's revision commited. Added Dan's own fixes as well.
1468 Message-Id: <159103166906.20020405161134@motor.ru>
1469 ! lib/Encode/Alias.pm
1471 < qr/^UCS2-le$/i => '"UCS-2"', );
1473 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1474 Sigh. Thank you, Anton.
1475 Message-Id: <14567692196.20020405062020@motor.ru>
1476 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1478 1.20 2002/04/04 19:50:52
1480 the last minute addtion. Just give it a try. Docs remains to be done.
1481 Not installed by default.
1482 ! lib/Encode/Supported.pod
1485 ! lib/Encode/Alias.pm
1486 ! lib/Encode/utf8.pm
1487 ! lib/Encode/10464_1.pm
1488 ! lib/Encode/ucs2_le.pm
1489 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1490 unchanged but UTF-16BE is added as an alias. Implicit aliases
1491 move to Encode::Alias so init_alias() works more as expected.
1492 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1493 Though pedantically wrong, This should make perl mongers happier.
1494 t/Alias.t is enhanced to test all these.
1495 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1497 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1498 of the codepoints are common. CJKT left as is because the saving is
1502 ! EBCDIC/Makefile.PL
1504 ! Encode/Makefile_PL.e2x
1508 ! Symbol/Makefile.PL
1512 All occurance of _def.h replaced with .exh so djgpp works happily
1513 ever after! To credit this amazing discovery, Laszlo is now in
1515 Message-Id: <20020403181424.GA8778@freemail.hu>
1516 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1519 ! Encode/Makefile_PL.skel
1521 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1522 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1524 Two more tests by added jhi
1525 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1528 The showstopper fixed -- Memory reallocation bug was causing
1529 Encode::XS to fall into infinite loop on certain conditions.
1530 t/grow.t tests that.
1531 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1536 ! lib/Encode/Supported.pod
1537 Vendor encodings rebuilt out of original map files at unicode.org.
1538 Indic languages such as MacDevanagali remain unspported do to the
1539 shortcoming of encengine capabilities (they need algorithmical
1540 conversion and I have no knowledge on that!). Pods fixed for added
1542 Oh, macJapan.ucm renamed to macJapanese.ucm.
1543 macROMnn is macRomanian and macRUMnn is macRumanian.
1544 txt2ucm is a crude script that is used to convert them.
1546 Unicode Compound Characters (used extensively on Mac) supported
1548 Typo fixes and improvements by jhi
1549 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1551 1.11 2002/03/31 22:12:13
1555 Missing files from the MANIFEST fixed.
1556 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1557 Version incremented just to make CPAN happy.
1559 1.10 2002/03/31 21:32:42
1562 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1563 if you want. This should make Autrijus happy. Also, piconv
1564 is added to default install.
1567 Here-documented files that enc2xs generates are now exported
1568 to *.e2x. Much cleaner and easier to debug.
1570 encoding enhances so you can make it act more like such
1571 (now prehistoric ) "localized" variations of perl like Jperl.
1573 Further test for encoding.pm. Written in euc-jp
1576 Taken over form jhi.
1577 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1582 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1589 ! Encode/macIceland.ucm
1590 ! lib/Encode/Alias.pm
1591 ! lib/Encode/Supported.pod
1592 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1593 Message-Id: <m3lmcavhjt.fsf@anima.de>
1595 1.01 2002/03/29 20:59:39
1598 s/USE_SCRIPTS/MORE_SCRIPTS/
1600 installs enc2xs by default for external Encode:: modules in CPAN,
1601 such as Encode::HanExtra
1603 More sensible perl core detection via $ENV{PERL_CORE}
1605 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1607 Perl core ditection via $^X =~ m/\bminiperl$/o
1608 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1610 1.00 Wed Mar 29 2002
1612 The version of all files is updated to 1.00 via "ci -f -l1.00",
1613 commemorating version 1.00. All files, including *.ucm are now
1614 under version control.
1617 encode.h moved to Encode/ so it will be installed for the later
1620 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1625 compile renamed to enc2xs.
1626 Affected Makefle.PL updated
1628 "Punt it. HanExtra can take care of that later." -- Autrijus
1629 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1632 ! Encode/ksc5601.ucm
1633 ! lib/Encode/CJKConstants.pm
1634 ! lib/Encode/KR/2022_KR.pm
1635 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1636 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1640 bin/ added for example scripts. They are not installed by default.
1641 to install them, "perl Makefile.PL USE_SCRIPTS".
1642 piconv is iconv reinvented in perl. in addition to all features
1643 of iconv, it also adds perlish features. See L<piconv/1> for more
1645 ! lib/Encode/Alias.pm
1646 qr/^ replaced with qr/\b so it directly matches locale names
1647 such as en_US.US-ASCII
1650 Patch by MJD to fix the following problem applied.
1651 Subject: [PATCH 5.7.3 Encode]
1652 Aliases.t not properly skipped when Encode extension not built
1653 Message-Id: <20020328091850.18677.qmail@plover.com>
1654 ! lib/Encode/KR/2022_KR.pm
1655 ! lib/Encode/CJKConstants.pm
1656 Another patch from Jungshik to make iso-2022-kr actually work
1657 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1658 ! Encode/Encode/euc-kr.ucm
1659 + Encode/Encode/johab.ucm
1660 ! Encode/Encode/ksc5601.ucm
1662 ! Encode/KR/Makefile.PL
1663 ! Encode/lib/Encode/Alias.pm
1665 Johab support and complete revision of Korean Encoding by Jungshik
1666 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1668 Revised to make up with now-dropped Encode::Details.
1669 - lib/Encode/Details.pod
1670 Dropped. Besides being obsolete, the topics are now covered in
1671 respective pods now.
1676 Korean aliases fixed thanks to Jungshik Shin
1677 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1678 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1680 =head1 NAME added to all modules to make buildtoc happy
1681 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1682 - lib/Encode/CJKguide.pod
1683 Too controversial and dropped from the dist. Will be available
1684 separately on the web.
1686 RCS tags added so table debugging gets easier (should that be
1687 needed! I hope they all stay 1.00!)
1688 + lib/Encode/CJKguide.pod
1689 A detailed guide to mainly, but not limited to, CJK multibyte
1692 + Encode/hp-roman8.ucm
1694 ! Encode/Supported.pod
1695 All occurance of "roman8" replaced with "hp-roman8" to avoid
1697 ! Encode/Supported.pod
1700 Mac Encodings now comply the Inside Macintosh
1702 Test for '-raw' conventions added.
1704 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1705 ! Encode/gb12345.ucm
1707 ! Encode/ksc5601.ucm
1708 "-raw" appended to canonical names.
1709 File mames stay unchanged thanks to UCM format.
1710 ! lib/Encode/CN/HZ.pm
1711 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1712 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1714 0.99 Tue Mar 26 2002
1715 - lib/Encode/JP/Const.pm
1716 + lib/Encode/CJKConstants.pm
1717 + lib/Encode/CN/2022_CN.pm
1718 + lib/Encode/KR/2022_KR.pm
1727 * Support for ISO-2022-KR and ISO-2022-CN added.
1729 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1730 * ucm2table autogenerates character table out of UCM files.
1733 - lib/Encode/Supports.pod
1734 + lib/Encode/Supported.pod
1735 Names reverted due to popular demand.
1736 8.3 rule applies only when there is a conflict.
1737 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1742 - lib/Encode/Format/Enc.pod
1744 * Character tables is now 100% ucm.
1745 * All files under Encode/ is now 8.3-compliant
1746 * some of missing encodings added (i.e. gsm0338 and nextstep)
1747 * Vendor mappings aggregated with appropriate national std in
1748 Makefile.PL, resulting smaller *.so especially for CJK.
1749 Following is result on Dan's FreeBSD box.
1751 ---------------------------------------------------------------
1752 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1753 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1754 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1755 blib/arch/auto/Encode/Encode.so 27,791 27,791
1756 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1757 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1758 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1759 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1760 ---------------------------------------------------------------
1761 Total 5,375,297 6,343,819
1763 * As a result of ucm-transition, Encode::Tcl dropped because
1764 Encode::Tcl demands *.enc.
1765 Encode::Tcl will be supplied in a separate tarball with *.enc.
1766 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1771 -lib/Encode/Supported.pod
1772 +lib/Encode/Supports.pod
1773 -lib/Encode/iso10646_1.pm
1774 +lib/Encode/10646_1.pm
1775 -lib/Encode/EncFormat.pod
1776 +lib/Encode/Format/Enc.pod
1777 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1778 - lib/Encode/JP/Constants.pm
1779 + lib/Encode/JP/Consts.pm
1780 ! lib/Encode/JP/JIS.pm
1781 ! lib/Encode/JP/H2Z.pm
1782 Version nit problem and 8.3 rule fix.
1783 > Package namespace installed latest in CPAN file
1784 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1785 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1786 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1787 accordingly. In addition, legacy "use vars qw()..." are replaced with
1789 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1790 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1792 - lib/Encode/JP/ISO_2022_JP.pm
1793 - lib/Encode/JP/ISO_2022_JP_1.pm
1794 + lib/Encode/JP/2022_JP.pm
1795 + lib/Encode/JP/2022_JP1.pm
1797 8.3 naming conflict for vanilla fat addressed by jhi
1798 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1801 Typecast fix addressed by jhi
1802 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1804 0.98 Mon Mar 25 2002
1805 ! lib/Encode/Supported.pod
1807 + lib/Encode/JP/ISO_2022_JP_1.pm
1808 ! lib/Encode/JP/ISO_2022_JP.pm
1809 ! lib/Encode/JP/JIS.pm
1811 Now Encode::JP is more strict on the difference between ISO-2022-JP
1812 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1813 makes Anton happier :) FYI the previous version implements
1814 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1815 ! lib/Encode/Supported.pod
1818 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1819 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1824 ! lib/Encode/Suppoted.pod
1825 pod fixes to replace F<http://...> to L<http://...>,
1826 as suggested by Autrijius in:
1827 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1828 ! lib/Encode/Suppoted.pod
1829 fixes and enhancements by Anton
1830 Message-Id: <10632060120.20020324103753@motor.ru>
1831 ! lib/Encode/Alias.pm
1832 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1833 added. Suggested by Anton then deobfuscated by Autrijius
1834 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1836 Further fix by Nicholas Clark
1837 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1838 - lib/EncodeFormat.pod
1839 + lib/Encode/EncFormat.pod
1841 File renamed as suggested by Autrijius
1843 ! lib/Encode/Details.pod
1844 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1845 ! Encode.pm Sun Mar 24 13:43:47 2002
1846 pod fixes by Autrijius.
1847 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1848 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1850 ! lib/Encode/Alias.pm
1852 now more EBCDIC conscious;
1853 %ExtModules on EBCDIC system excludes CJK so that you don't
1854 have to worry about the matched alias resulting cloaking.
1855 t/Alias.t also revised to reflect changes. Verified by jhi
1856 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1858 0.97 Sun Mar 24 2002
1862 EBCDIC detection mechanism installed as in JP/JP.pm
1863 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1866 ! EBCDIC/Makefile.PL
1869 ! Symbol/Makefile.PL
1871 Now all table files used by compile are postfixed '_t' to avoid
1872 namespace collisions in case insensitive file systems once for all!
1874 Message-ID: <58290227735.20020323195659@familiehaase.de>
1876 Since the Encode::JP is unsupported under EBCDIC we
1877 cannot run this test (aliases as such should work fine) -- jhi
1878 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1880 duplicate occurance of ascii.ucm and 8859-1.ucm
1881 causes MacOS X dlyd to cloak
1887 < chdir 't' if -d 't';
1889 > if (! -d 'blib' and -d 't'){ chdir 't' };
1890 When you are "make test"-ing on Encode/ directory, you must not
1891 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1892 remain unchanced. Also the situation detection was made simpler
1893 in t/JP.t, which was originally;
1894 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1896 "Use of uninitialized value in string eq at Encode.pm line 96."
1897 ! Symbol/Makefile.PL
1898 ! EBCDIC/Makefile.PL
1900 -- Problem on case insensitive file systems
1901 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1902 Message-ID: <88254111953.20020323095503@familiehaase.de>
1905 "So I think it's a bug in gcc, not perl. But it still needs to be
1907 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1908 Message-Id: <20020323170509.C96475@plum.flirble.org>
1910 0.96 Sat Mar 23 2002
1912 ! lib/Encode/Encoding.pm
1913 ! lib/Encode/Alias.pm
1914 ! lib/Encode/Supported.pod
1916 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1917 Message-ID: <20020322073908.GB10539@blackrider>
1920 "...I think we should include ISO 8859-1 as well." -- NI-S
1921 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1926 ! lib/Encode/Alias.pm
1927 alias definitions relocated to Encode::Alias so module autoloading
1928 works for aliases also.
1930 encodings() now accepts args to check ExtModules.
1934 + EBCDIC/Makefile.PL
1935 + Symbol/Makefile.PL
1939 Latin and single byte encodings are reorganized so they are
1940 demand-loaded like Encode::XX. Now only ascii is compiled into
1942 ! lib/Encode/Alias.pm
1943 for my $k (keys %hash){ delete $hash{$k}; }
1944 is depreciated; fixed.
1946 0.95 Fri Mar 22 2002
1947 In this update, pod rewrites and alias fixes are the main issues
1948 + lib/Encode/Supported.pod
1949 Describes supported encodings
1951 streamlined compiled-in encodings.
1952 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1954 + Encode/ibm-125?.ucm
1955 Added from icu distibution with any occurance of
1956 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1957 some respect to icu staff, however.
1958 + lib/Encode/Alias.pm
1960 Alias difinitions in Encode.pm relocated.
1963 packWARN patch from Paul Marquess via jhi
1964 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1965 Paul added to AUTHORS as a result.
1966 ! t/CJKalias.t -> t/Aliases.t
1967 Renamed. Checks even more aliases and alias overloading
1970 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1971 gbk => cp936 relocated to CN.pm
1973 Test::More with plans (by jhi)
1975 0.94 Thu Mar 21 2002
1976 + lib/Encode/Description.pod
1977 ! lib/Encode/Encoding.pm
1978 Now the pod in Encode.pm is abridged as programming references.
1979 lib/Encode/Description.pod contains the original, detailed description
1980 and Encode::Encoding explains how to write your own module to
1981 add new encodings. So far, lib/Encode/Description.pod contains
1982 the whole pod once in Encode.pm. This is intentional.
1984 Pod revisions by Anton Tagunov
1985 Message-Id: <517178431.20020320174824@motor.ru>
1987 all occrance of Encode::Tcl::Extended removed including pod
1989 test now checks $encoding->name only; $encoding->{name} are
1990 no longer check to find the canonical name.
1991 ! lib/Encode/JP/JIS.pm
1992 ! lib/Encode/JP/ISO_2022_JP.pm
1993 ->name() added to be more compliant with API
1999 Patch by Autrijus to add aliases to TW and fixes to POD
2000 Message-Id: <20020320090619.GA24774@not.autrijus.org>
2002 SADAHIRO Tomoyuki added as should. My apologies.
2004 0.93 Wed Mar 20 2002
2005 * First release to be uploaded to CPAN. For prehistoric changes,
2006 please see Changes file of perl distibution as well as
2007 perl-unicode@perl.org archive, available at:
2008 http://archive.develooper.com/perl-unicode@perl.org/
2010 Changes Since 0.92 includes;
2015 + Mention to perl-unicode@perl.org added
2017 + Encoding aliases added so you can feed locale names
2018 and MIME Charset="" directly.
2019 - Mention to JISX0212 removed because it's fixed
2022 + Encoding aliases added. Note TW is left untouched because
2023 euc-tw is not implemented in TW but in Encode::HanExtra.
2024 Autrijus, you may fix Encode::HanExtra.
2026 + to test encode aliases added