1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.33 2009/03/25 07:55:57 dankogai Exp dankogai $
5 $Revision: 2.33 $ $Date: 2009/03/25 07:55:57 $
6 ! lib/Encode/MIME/Header.pm
7 Decontaminated $& which sneaked in on 2.31.
8 <67FC9F3A39C746DA95AAB6BB01539099@robmhp>
9 <693254b90903242352x2dc26ba6p5e68deb871fa88ae@mail.gmail.com>
10 http://coderepos.org/share/changeset/31542
12 2.32 2009/03/07 07:32:37
13 ! lib/Encode/Alias.pm t/Alias.t
14 Encode now resolves 'en_US.UTF-8' to utf-8-strict like 'ja_JP.euc'
15 Those who set locale on their shells should be happier now.
19 "encode(undef, 'str') should die earlier"
20 http://coderepos.org/share/changeset/30790
22 2.31 2009/02/16 06:18:09
23 ! lib/Encode/MIME/Header.pm
24 "Revert [29767] and [29771] since it breaks perl 5.8" by miyagawa
25 http://coderepos.org/share/changeset/30111
27 2.30 2009/02/15 17:44:13
29 fixed regexes, et cetera. by drry
30 http://coderepos.org/share/changeset/29767
31 ! lib/Encode/MIME/Header.pm
32 Addressed: Encode::MIME::Header::decode should respect CHECK
33 http://rt.cpan.org/Ticket/Display.html?id=43204
34 http://coderepos.org/share/changeset/29767
36 2.29 2009/02/01 13:14:37
38 VERSION++ just to make PAUSE happy
39 Message-Id: <877i4anwwt.fsf@k75.linux.bogus>
41 2.28 2009/02/01 12:30:18
43 The latest refactoring broke backward compatibility
44 with Perl 5.8.6 and earlier; this is now restored.
45 Message-Id: <1233185156.DABa130.74940@basic2.hostingcompartido.com>
46 Message-Id: <693254b90902010027x277a5d0fm4f5700ba2f276239@mail.gmail.com>
47 ! lib/Encode/MIME/Header.pm
48 Addressed: Split header lines are joined incorrectly
49 http://rt.cpan.org/Ticket/Display.html?id=42902
51 2.27 2009/01/21 22:55:07
52 ! lib/Encode/MIME/Header.pm t/mime-header.t
53 Addressed: Encode::MIME::Header MIME-Q encoding truncates
54 trailing zeros in some circumstances
55 http://rt.cpan.org/Ticket/Display.html?id=42627
57 Added alias: unicode-1-1-utf-7
58 http://rt.cpan.org/Ticket/Display.html?id=38558
60 Documented: _utf8_on() does not work for tainted values
61 http://rt.cpan.org/Ticket/Display.html?id=41163
63 s[oss.software.ibm.com/icu][www.icu-project.org]g
64 http://rt.cpan.org/Ticket/Display.html?id=40245
65 ! lib/Encode/Guess.pm t/guess.t
66 Addressed: Empty file should produce an error message
67 http://rt.cpan.org/Ticket/Display.html?id=38652
68 ! Unicode/Unicode.xs AUTHORS
69 Refactored by Alex Davies
70 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/2007-10/msg00745.html
71 Message-Id: <7637669B2E3D46B187591747DA27F4C8@Amelie>
73 2.26 2008/07/01 20:56:17
75 Absence of Encode::ConfigLocal no longer carps no matter what.
76 http://bugzilla.redhat.com/show_bug.cgi?id=435505#c2
77 http://rt.cpan.org/Ticket/Display.html?id=28638
78 http://rt.cpan.org/Ticket/Display.html?id=11511
80 use encoding 'utf8' and 'iso-2022-jp' glitches on perl 5.10
82 Message-Id: <693254b90807011224h3ab50d76v50c6fea87baf223c@mail.gmail.com>
83 ! lib/Encode/Alias.pm t/Aliases.t
84 'macintosh' not recognized as MacRoman
85 http://rt.cpan.org/Ticket/Display.html?id=36326
87 s{INC => "-I./Encode"}
88 {INC => '-I' . File::Spec->catfile( '.', 'Encode' )}
89 To prevent some platforms from forgetting to include Encode/encode.h.
90 http://rt.cpan.org/Ticket/Display.html?id=36348
92 2.25 2008/05/07 20:56:05
94 added ':default' to Exporter option.
95 ! lib/Encode/GSM0338.pm
96 GSM0338 now handles coderef in CHECK
97 http://rt.cpan.org/Ticket/Display.html?id=31335
99 Perl 5.10/Encode 2.24: Tiny typo in Encode's Makefile.PL arg processing
100 Message-Id: <961C2A4F-92B3-416D-A9F9-E7B0ADA9F134@fsck.com>
101 ! lib/Encode/Alias.pm
102 "This fix for Encode::Alias should make Solaris happy:"
103 Message-ID: <47D886D9.6060001@iki.fi>
105 2.24 2008/03/12 09:51:11
106 ! lib/Encode/Config.pm
107 adds and fixes also adds cp858 support.
108 ! Encode.pm encoding.pm lib/Encode/Alias.pm ucm/cp858.ucm
110 > Change 33486 by rgs@scipion on 2008/03/12 08:50:11
111 An unfortunate side-effect of Encode and Encode::Alias use'ing each
112 other, and Encode::Alias exporting functions into Encode for it to use
113 as methods, broke the loading of the find_alias() Encode method in some
114 cases since 5.10. Breaking the recursive inheritance fixes it.
115 Message-Id: <b77c1dce0803120151o4166c3a0gfcfd14681ab7e10d@mail.gmail.com>
118 Message-Id: <200711281835.36125@bloodgate.com>
120 Fix by MIYAGAWA via CodeRepos
121 http://coderepos.org/share/changeset/1791
122 ! encoding.pm t/mime_header_iso2022jp.t
123 ported back from Perl 5.10-RC1
125 2.23 2007/05/29 18:15:32
127 got rid of global fallback_cb; encode_method() now takes one more
128 argument which is a coderef to fallback. This should make
129 encode_method() thread-safe.
131 Added perluniintro, perlunifaq, and perlunitut to POD
133 Plug a memory leak in Encode -- by rgs
134 Message-Id: <b77c1dce0705290858v2be239c3o2d726e3d59091493@mail.gmail.com>
136 POD fixes on UTF-16LE
137 http://aspn.activestate.com/ASPN/Mail/Message/perl5-porters/3486118
139 man page generation is now conditional; yes by default but no if $PERL_CORE
140 Message-Id: <b77c1dce0705290237h5c4667cdlf79a48b839170add@mail.gmail.com>
142 2.22 2007/05/29 07:35:27
144 from_to() does not honor the check while decoding. That's a feature.
145 To make sure it is a feature it is mentioned in the POD.
146 http://rt.cpan.org/NoAuth/Bug.html?id=27277
148 Encode used to suppress man page generation. Now it generates them.
149 http://rt.cpan.org/NoAuth/Bug.html?id=27200
150 ! Encode.pm Encode.xs t/fallback.t
151 Addressed: (de|en)code("ascii", "\x{3000}", sub{ $_[0] }) segfaults
154 2.21 2007/05/12 06:42:19
155 + lib/Encode/MIME/Name.pm t/mime-name.t
156 ! Encode.pm Encode.xs lib/Encode/Encoding.pm
157 new method: mime_name()
158 inspired by: MIYAGAWA
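A minimal sketch of the new method, assuming an Encode release that ships Encode::MIME::Name (2.21 or later). The two encodings are arbitrary illustrative choices, and the variable names are not from this changelog.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Look up two encodings by name or alias.
my $utf8   = find_encoding('UTF-8');     # canonical Encode name: utf-8-strict
my $latin1 = find_encoding('latin1');    # canonical Encode name: iso-8859-1

# mime_name() maps the canonical Encode name to a MIME charset name.
my $utf8_mime   = $utf8->mime_name;
my $latin1_mime = $latin1->mime_name;

print "$utf8_mime\n";
print "$latin1_mime\n";
```

This is handy when a charset parameter has to be written into a MIME header and the encoding was looked up through one of its aliases.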
160 Subject: Re: Compress::Zlib, pack "C" and utf-8 [PATCH]
161 From: Marc Lehmann <schmorp@schmorp.de>
162 Date: Thu, 12 Apr 2007 08:41:53 +0200
163 Message-ID: <20070412064153.GA22475@schmorp.de>
164 http://public.activestate.com/cgi-bin/perlbrowse/p/31194
167 Message-Id: <20070417220547.GA11999@zetta.zet>
169 2.20 2007/04/22 14:56:12
171 Pod fixes. Now find_encoding() is explained in more detail.
172 + lib/Encode/GSM0338.pm
174 ! lib/Encode/Supported.pod lib/Encode/Config.pm Bytes/Makefile.PL t/gsm0338.t
175 ETSI GSM 03.38 support is relocated from Encode::Byte to Encode::GSM0338.
176 This encoding is so kaputt it is unfit for Encode::XS!
177 Though it was okay for general cases and escape sequences,
178 '\0' => '@' IFF '\0\0' => '\0' had glitches.
179 So kaputt even t/gsm0338 wrongly interpreted that.
180 ref. http://www.csoft.co.uk/sms/character_sets/gsm.htm
181 ! encoding.pm t/Aliases.t
182 Imported from bleedperl #31015
184 2.19 2007/04/06 12:53:41
185 ! lib/Encode/JP/JIS7.pm
187 encode('iso-2022-jp') fallback support added by MIYAGAWA++
188 decode()'s fallback remains unchanged (FB_PERLQQ) since UTF-8
189 contains all characters in iso-2022-jp so there's no need for fancy stuff.
190 Message-Id: <693254b90704060526s6d850320h71cdda50dfbf7eba@mail.gmail.com>
192 #25216 ([PATCH] Encode.pm: postpone the load of Encode::Encoding)
193 http://rt.cpan.org/NoAuth/Bug.html?id=25216
194 ! lib/Encode/MIME/Header.pm t/mime-header.t
195 #24418 (Encode::MIME::Header: wrong encoding with latin1 characters)
196 http://rt.cpan.org/NoAuth/Bug.html?id=24418
198 #23876 (Add documentation for LEAVE_SRC)
199 http://rt.cpan.org/NoAuth/Bug.html?id=23876
200 ! lib/Encode/Alias.pm t/Aliases.t
201 #20781: Thai encoding needs alias for tis-620
202 http://rt.cpan.org/NoAuth/Bug.html?id=20781
204 #20344: piconv: wrong conversion of utf-16le encoded files (with PATCH)
205 http://rt.cpan.org/NoAuth/Bug.html?id=20344
206 ! Encode.pm Encode.xs bin/enc2xs encoding.pm t/Aliases.t t/utf8strict.t
207 Imported from bleedperl's 2.18_01
209 2.18 2006/06/03 20:28:48
211 overhauled the -C option
212 - added 'ascii-ctrl', 'null', 'utf-8-strict' to core
213 - auto-generated Encode::ConfigLocal no longer uses a v-string for version
214 - now searches modules via File::Find so Encode/JP/Mobile is happy
215 ! Byte/Byte.pm CN/CN.pm EBCDIC/EBCDIC.pm JP/JP.pm KR/KR.pm Symbol/Symbol.pm
216 use strict added; though all they do is load XS, it's
217 still a better practice
219 use warnings added to all of them for better practices' sake.
221 2.17 2006/05/09 17:10:09
223 'chin' =~ /^zh_CN|chin(?:a|ese)?$/i is true
224 but chin is not china or chinese.
225 http://d.hatena.ne.jp/jankogai/20060508/1147090316
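The pitfall described in this entry can be reproduced with the quoted pattern alone. The sketch below is only an illustration: `$tight` is a hypothetical stricter pattern and not necessarily the fix that was applied.

```perl
use strict;
use warnings;

# Pattern quoted in the entry: the trailing "?" makes the suffix optional,
# so a bare "chin" satisfies the second alternative.
my $loose = qr/^zh_CN|chin(?:a|ese)?$/i;

# Hypothetical tightened pattern (an assumption): the suffix is mandatory
# and both alternatives share the same anchors.
my $tight = qr/^(?:zh_CN|chin(?:a|ese))$/i;

for my $name (qw(chin china chinese zh_CN)) {
    printf "%-8s loose=%d tight=%d\n", $name,
        ( $name =~ $loose ? 1 : 0 ),
        ( $name =~ $tight ? 1 : 0 );
}
```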
227 Integrated maintperl change (27824|27824) which I overlooked
228 -- sorry, Nicholas and Coverity Scan.
229 Message-Id: <200604152115.k3FLF1Ar014538@smtp3.ActiveState.com>
230 Message-Id: <200605091615.k49GF1gJ016777@smtp3.ActiveState.com>
232 2.16 2006/05/03 18:24:10
234 --xmlcref and --htmlcref added.
236 Copyright Notice Added.
237 http://rt.cpan.org/NoAuth/Bug.html?id=19056
239 Replaced remaining ^\t with q( ) x 4. -- Perl Best Practices, p. 20
240 And all .pm's are now perltidy-ed.
242 2.15 2006/04/06 15:44:11
244 Addressed: UTF-16, UTF-32, UCS, UTF-7 decoders mishandle illegal characters
245 http://rt.cpan.org/NoAuth/Bug.html?id=18556
247 added str2bytes() as an alias to encode() and bytes2str() as an alias to decode()
249 http://rt.cpan.org/NoAuth/Bug.html?id=17103
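A short sketch of the two aliases in use, assuming a current Encode where both names are importable. The sample string and variable names are illustrative only.

```perl
use strict;
use warnings;
use Encode qw(encode decode str2bytes bytes2str);

my $string = "caf\x{e9} \x{3042}";    # characters, not octets

# str2bytes() behaves like encode(): characters in, octets out.
my $octets = str2bytes( 'UTF-8', $string );

# bytes2str() behaves like decode(): octets in, characters out.
my $round_trip = bytes2str( 'UTF-8', $octets );

print length($octets), " octets\n";
print $round_trip eq $string ? "round trip ok\n" : "round trip failed\n";
```

The alias names state the direction of the conversion, which some readers find easier to keep straight than encode/decode.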
251 Change 26922: Avoid warning with MS Visual C compiler.
252 Message-Id: <200601231245.k0NCj2dw009484@smtp3.ActiveState.com>
254 Change 26067: As using -C to turn on utf8 IO is equivalent to the open pragma
255 Message-Id: <200511092227.jA9MRcYD009025@smtp3.ActiveState.com>
257 2.14 2006/01/15 15:43:36
259 Change 26295: Don't build manpages for Encode and Unicode::Normalize
260 Message-Id: <200512071540.jB7Fe4Gt017960@smtp3.ActiveState.com>
262 Change 26081: Pod nit in Encode.pm, found by Marc Lehmann in RT #36949.
263 Message-Id: <200511110357.jAB3vZcP023647@smtp3.ActiveState.com>
264 ! Encode.xs Encode/encode.h bin/enc2xs encengine.c
265 Change 25821: Mark more static Encode data structures as const.
266 Change 25823: use more 'const' in the Encode data structures.
267 Message-Id: <200510221243.j9MChTSu027711@smtp3.ActiveState.com>
268 Message-Id: <200510221343.j9MDhTk9001245@smtp3.ActiveState.com>
270 2.13 2006/01/15 15:06:36
272 Miyagawa's mail address updated
273 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
274 ! lib/Encode/MIME/Header.pm
275 #16413: Encode::MIME::Headers patch to solve what is probably someone else's bug
276 http://rt.cpan.org/NoAuth/Bug.html?id=16413
277 ! lib/Encode/MIME/Header.pm t/mime-header.t
278 Applied: RT #16258: Support for RFC 2184 language tag
279 http://rt.cpan.org/NoAuth/Bug.html?id=16258
281 Fixed RT #14559: fix for #8872 introduces new "bug"
282 http://rt.cpan.org/NoAuth/Bug.html?id=14559
285 from_to() now makes use of $check more naturally.
286 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
287 <B10B4DE2-9BAF-4344-B3BE-2119977D817A@dan.co.jp>
289 2.12 2005/09/08 14:17:17
290 ! Encode.xs Encode.pm t/fallback.t
291 Now accepts coderef for CHECK!
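As a sketch of what a coderef CHECK looks like in practice: the callback receives the ordinal of the offending character (when encoding) or octet (when decoding) and returns the replacement text. The replacement formats below are arbitrary choices for illustration.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

my $chars  = "a\x{3000}b";    # U+3000 has no ASCII representation
my $octets = "a\xE9b";        # 0xE9 is not a valid ASCII octet

# Encoding: the callback is given the ordinal of each unmappable character.
my $ascii = encode( 'ascii', $chars, sub { sprintf '<U+%04X>', shift } );

# Decoding: the callback is given the ordinal of each malformed octet.
my $text = decode( 'ascii', $octets, sub { sprintf '<\\x%02X>', shift } );

print "$ascii\n";    # a<U+3000>b
print "$text\n";     # a<\xE9>b
```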
293 Updated to newer version at unicode.org
294 http://rt.cpan.org/NoAuth/Bug.html?id=14222
295 ! lib/Encode/Supported.pod
297 <42F5E243.80500@gmail.com>
299 More POD typo leftover fixed.
300 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com>
302 2.11 2005/08/05 10:58:25
304 To reflect changes below
305 ! Encode.pm encoding.pm
306 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod
307 Typo fixed by Piotr Fusik in Change 25261 & 25266
308 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec>
310 Addresses "BUG REPORT: panic in Encode.xs".
311 Message-Id: <42EDDA97.2010608@hyper.to>
312 + lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t
313 ! lib/Encode/MIME/Header.pm lib/Encode/Config.pm
314 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka
315 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp>
316 ! Encode/encode.h Encode.pm Encode.xs
317 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character.
318 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp>
319 See also ext/PerlIO/encoding/encoding.pm
320 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com>
322 2.10 2005/05/16 18:46:36
324 fixed decode_utf8() according to RT#8872
325 http://rt.cpan.org/NoAuth/Bug.html?id=8872
327 s/SvIVX/SvIV_set/ by Steve Peters.
328 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com>
332 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict
334 Tests utf-8-strict, according to
335 "UTF-8 decoder capability and stress test" by Markus Kuhn
336 http://smontagu.damowmow.com/utf8test.html
337 Note that malformed and overlong sequences are not tested here
338 because perl already does that for you, utf-8-strict or not.
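The naming distinction covered by the new Pod section can be checked directly. A minimal sketch, with illustrative variable names:

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# "UTF-8" (with a hyphen, any case) resolves to the strict encoding.
my $strict_name = find_encoding('UTF-8')->name;
my $lower_name  = find_encoding('utf-8')->name;

# "utf8" (no hyphen) resolves to perl's own, more permissive encoding.
my $lax_name = find_encoding('utf8')->name;

print "$strict_name\n";    # utf-8-strict
print "$lower_name\n";     # utf-8-strict
print "$lax_name\n";       # utf8
```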
339 ! Encode.pm Encode/encode.h t/fallback.t
340 Addressed "encode(..., Encode::LEAVE_SRC) does not work".
341 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so
342 you can (en|de)code constant strings with these fallbacks.
343 http://rt.cpan.org/NoAuth/Bug.html?id=8736
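The practical effect is that a string literal can be passed straight to encode() with these fallbacks. A small sketch, assuming an Encode release that includes this change:

```perl
use strict;
use warnings;
use Encode qw(encode);

# String literals are read-only, so these calls depend on the fallback
# constants implying LEAVE_SRC (the source argument is not modified).
my $perlqq   = encode( 'ascii', "\x{3000}", Encode::FB_PERLQQ() );
my $htmlcref = encode( 'ascii', "\x{3000}", Encode::FB_HTMLCREF() );
my $xmlcref  = encode( 'ascii', "\x{3000}", Encode::FB_XMLCREF() );

print "$perlqq\n";      # \x{3000}
print "$htmlcref\n";    # &#12288;
print "$xmlcref\n";     # &#x3000;
```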
344 ! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t
345 Make Encode.pm support the real UTF-8, by GAAS
346 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com>
347 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com>
348 ! Encode.pm Encode.xs
349 post-2.09 comment patches from GAAS applied.
350 Message-Id: <lroehacz6q.fsf@caliper.activestate.com>
351 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com>
353 2.09 2004/12/03 19:16:53
354 ! Encode.pm Encode.xs
355 Addressed " :encoding(utf8) broken in perl-5.8.6".
356 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com>
358 Addressed "(de|en)code($valid_encoding, undef) does not warn".
359 http://rt.cpan.org/NoAuth/Bug.html?id=8723
360 ! Encode.pm t/Encode.t
361 Addressed "Can't encode URI". When a reference is fed to (en|de)code,
362 Encode now stringifies instead of returning undef.
363 http://rt.cpan.org/NoAuth/Bug.html?id=8725
364 ! Encode.xs t/fallback.t
365 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder".
366 http://rt.cpan.org/NoAuth/Bug.html?id=8694
368 Addressed "s/digit/number/".
369 http://rt.cpan.org/NoAuth/Bug.html?id=8695
371 Addressed "while (defined(read )) { ... } is an infinite loop".
372 http://rt.cpan.org/NoAuth/Bug.html?id=8696
374 Addressed "What the heck is UCM?".
375 Document fixed so that it no longer contains "UCM-Based Encodings".
376 http://rt.cpan.org/NoAuth/Bug.html?id=8697
378 2.08 2004/10/24 13:00:29
379 ! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs}
380 Resolved the issue raised by 2.07 -- the Encode::utf8 fallbacks
381 that were introduced messed up PerlIO::encoding.
382 * To do so, ->renew() is renewed and ->renewed() was introduced to
383 tell whether the caller is PerlIO or not.
384 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp>
386 2.07 2004/10/22 19:35:52
387 ! lib/Encode/Encoding.pm
388 "Remove Carp from warnings.pm" that influences Encode, by Tels.
389 Message-Id: <200410161618.29779@bloodgate.com>
390 ! Encode.xs AUTHORS t/fallback.t
391 Now Encode::utf8's fallbacks are compliant to Encode standard.
392 Thanks to Bjoern Hoehrmann for persistently convincing me.
393 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
397 2.06 2004/10/22 06:23:11
399 RT #8083 reports that the MacThai mapping was obsolete.
400 Updated all mac* encodings according to the URI below.
401 One remaining mystery is MacRomanian vs. MacRumanian.
402 MacRumanian is not found in unicode.org...
403 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
404 ! Encode.pm t/Encode.t
405 Fixed RT #8081: "decode(..., bless{},'x') segfault"
406 Two more tests added to test that.
407 http://rt.cpan.org/NoAuth/Bug.html?id=8081
409 POD revised according to RT #7966
410 http://rt.cpan.org/NoAuth/Bug.html?id=7966
412 POD updated explaining why Encode::Unicode always croaks on error
413 rather than giving users choices.
414 http://rt.cpan.org/NoAuth/Bug.html?id=7892
416 2.05 2004/10/19 04:55:01
418 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
419 Message-ID: <41210A84.6060506@iki.fi>
420 Message-ID: <20041018233442.7418113f@r2d2>
421 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
423 2.04 2004/10/16 21:22:44
425 From: craigberry@mac.com
426 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
427 Message-Id: <41716868.7000102@mac.com>
429 2.03 2004/10/06 05:07:20
430 ! lib/Encode/Alias.pm
431 Resolved some alias case sensitivity glitches reported via RT.
432 http://rt.cpan.org/NoAuth/Bug.html?id=7835
434 Resolved Win32 glitches reported via RT.
435 (Fixed by dankogai and tested by Steve Hay)
436 http://rt.cpan.org/Ticket/Display.html?id=7831
437 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
438 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
439 http://rt.cpan.org/NoAuth/Bug.html?id=6695
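A quick way to confirm the alias, assuming Encode::JP is installed (it ships with the core distribution):

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# The alias match is case insensitive; Encode::JP is loaded on demand.
my $enc = find_encoding('Windows-31J');
die "alias not resolved\n" unless defined $enc;

my $canonical = $enc->name;
print "$canonical\n";    # cp932
```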
441 2.02 2004/08/31 10:55:34
442 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
443 New map submitted by Deng Liu and Autrijus. Test data needed
444 to be upgraded as well, done by dankogai
445 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
447 Now works for characters U+10000 and above. This fix was needed
448 to "tidy" the original map that was submitted.
450 "ucmsort" now mentioned in pod
452 2.01 2004/05/25 16:27:14
454 From: domo@computer.org
455 Subject: [PATCH] Correct statistics from enc2xs
456 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
457 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
458 ! lib/Encode/Alias.pm
459 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
460 <200405271148.i4RBm4KY026529@mail.mvnet.de>
462 2.01 2004/05/25 16:27:14
463 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
464 "If someone thinks utf8::upgrade($1) should be croaked like
465 chom?p($1),please try the following patch for Encode.pm."
467 <20040522212704.C068.BQW10602@nifty.com>
469 2.0 2004/05/16 20:55:15
470 * version updated to 2.00
471 -- sorry, no big feature change. I just hate version 1.100 :)
472 ! lib/Encode/Guess.pm
474 addressed UTF-(8|32LE) + BOM misguessing
475 https://rt.cpan.org/Ticket/Display.html?id=6279
477 s/is_utif8/is_utf8/ in POD
478 ! Encode/lib/Encode/CN/HZ.pm
479 Fixes "make test" failure after the patch to pp_hot.c
481 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
483 From: autrijus@autrijus.org
484 Subject: [PATCH] "piconv -C 512" badly broken
485 Message-Id: <1072870210.769.5.camel@localhost>
487 1.99 2003/12/29 02:47:16
489 find_encoding("UTF-16BE")->encode("abc") now null terminates
490 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
492 prototype bug in decode_utf8() fixed
493 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
494 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
495 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
497 Merged from maintperl@21987
499 1.98 2003/08/20 11:15:31
500 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
501 Dave Evans has found and corrected a bug in Encode::MIME::Header.
502 Test suite added by Dan Kogai.
503 Message-Id: <3F43440B.7060606@rudolf.org.uk>
505 Typo fixes rolled back in from bleedperl
506 ! t/at-cn.t t/at-tw.t
507 v-strings, now deprecated in perl 5.8.1, are replaced by sadahiro
508 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
510 argv case nit for VMS by Craig
511 Message-ID: <3F2B02DE.10207@mac.com>
512 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
513 Encode test fixes for VMS by Peter Prymmer
514 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
515 ! lib/Encode/Alias.pm t/Aliases.t
516 koi-8 aliases bug detected and patched by sadahiro.
517 Further fix and test suite by dankogai
518 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
520 1.97 2003/07/08 21:52:14
521 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
522 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
523 $DEBUG replaced with DEBUG() so perl optimizes better,
524 by Rafael with further fixes by dankogai
525 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
526 ! lib/Encode/Alias.pm
527 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
528 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
529 So new hash seeding introduced in bleedperl works.
530 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
531 ! lib/Encode/Guess.pm
532 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
533 automatic utf(8|16|32) guessing -- originally by Autrijus
534 Message-Id: <20030626162731.GA2077@not.autrijus.org>
536 Addressed the following:
537 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
538 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
540 1.96 2003/06/18 09:29:02
541 ! lib/Encode/JP/JP.pm t/guess.t
542 m/(...)/ in void context then $1 is considered a Bad Thing
543 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
545 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
548 More typecast from maintperl@19739
549 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
551 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
552 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
554 Now skips for EBCDIC platform.
555 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
557 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
558 have more than one C<.> in their filenames." -- perlport.
559 Message-Id: <3ED79E01.8050401@mac.com>
561 Found and fixed the bug that -p,--perlqq does not work.
562 Induced by the change from Getopt::Std to Getopt::Long.
564 Addressed [cpan #2629] Wrong assumption in numeric comparison
565 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
566 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
567 lib/Encode/Encoding.pm t/perlio.t
568 ! API Change: ->new_sequence() => ->renew()
569 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
570 + Encode::XS and Encode::utf8 now supports ->renew()
571 + Encode::Encoding now documents this with examples
572 - Non-XS (en|de)code stripped out of Encode::Unicode
573 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
575 1.95 2003/05/21 08:41:11
577 Since bogus entries were found in iso-8859-6, all entries are
578 re-generated once again out of
579 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
580 Thanks to David Graff <graff@unagi.cis.upenn.edu> for the discovery
581 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
582 + lib/Encode/Unicode/UTF7.pm
583 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
584 lib/Encode/Supported.pod
585 UTF-7 support is now added. With this Encode now has all transcoding
586 methods in Unicode::String.
588 1.94 2003/05/10 18:13:59
589 ! lib/Encode/MIME/Header.pm
590 A more sophisticated solution for double-encoding by dankogai
591 ! lib/Encode/MIME/Header.pm AUTHORS
592 Two bugs fixed by Bjoern Jacke
593 * "Double Encoding" was not possible
594 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
595 * encode("MIME-Q") had UTF-8 flag on
596 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
597 ! lib/Encode/MIME/Header.pm AUTHORS
598 Two occurrences of "croak ()" fixed as "croak qq()".
599 Simon Cozens is added to AUTHORS as a result.
600 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
602 POD fixes that reflect enhancements by jhi
604 Two enhancements by jhi.
605 + Now uses Getopt::Long so it accepts long name options
606 (--from for -f, for example)
607 + New option: -r,--resolve
608 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
610 META.yml added upon request of Schwern
611 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
613 Enache Adrian removed upon request -- to live longer than Encode
614 and/or FreeBSD (toy-)?thread :)
615 Message-Id: <20030425015701.GA2069@ratsnest.hole>
617 "close STDOUT unless $^O eq 'freebsd';" once again relocated
618 to keep VMS happy in which case "$^O eq 'freebsd'" is required
619 to keep FreeBSD+thread happy. Sigh.
620 Message-Id: <3EA88ADC.3000300@mac.com>
622 1.93 2003/04/24 17:43:16
624 added "no warnings 'pack'" in for loop to keep bleedperl from
625 complaining "Character in 'C' format wrapped in pack".
627 More elegant perl core detection inspired by Ilya Zakharevich
628 (but further elaborated for general cases).
629 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
632 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
633 are less likely to be confused w/ ISO-8859-*
635 RT tests added (vendor encodings are exemplified)
636 -- that successfully found a flaw on iso-2022-kr before the patch.
637 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
638 decode("ISO-2022-KR") has been buggy but no one ever noticed
639 that since no one seems to be using it. Bugs discovered by
641 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
642 ! lib/Encode/CN/HZ.pm t/perlio.t
643 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
644 so it adds test for HZ.
645 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
646 ! lib/Encode/Guess.pm
647 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
648 So long as the string contains \x{00}-\x{ff} it does not fail.
649 See perldoc for details.
650 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
652 1.92 2003/03/31 03:27:27
653 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
654 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
655 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
656 but glibc-BIG5-2.1.2.ucm does.
657 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
658 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
659 Maps regenerated again but this time based upon
660 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
661 (But where is THE DOCUMENT by MICROSOFT?)
662 ! t/enc_module.t AUTHORS
663 failure with threaded Perl on FreeBSD addressed.
664 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
665 Message-Id: <20030322230131.GA813@ratsnest.hole>
666 ! lib/Encode/Guess.pm
669 Change 18989: Make the :bytes conditional on PerlIO.
670 Further modified by Dan Kogai
671 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
673 Change 18966: another fix for failing test on windows ("use encoding"
674 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
675 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
677 Change 18970: Hopefully this works also in Win32, by jhi
678 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
679 Change 18965: fix CJKT.t failures on windows due to incorrect
681 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
683 1.91 2003/03/09 20:07:37
685 even more proofread by jhi.
686 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
689 +use lib qw(t ext/Encode/t ../ext/Encode/t);
690 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
692 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
693 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
696 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
698 1.90 2003/03/09 17:32:43
701 Inaba-san has added a patch for perl 5.8.1 or later that makes
702 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
703 that. POD is further revised.
704 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
705 ! encoding.pm t/enc_module.t
706 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
707 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
711 signed vs. unsigned issue discovered by Craig on OpenVMS
712 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
713 ! encoding.pm AUTHORS
714 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
715 Because binmode() stacks layers instead of overwrite, you have to
716 ":raw :encoding()" in encoding.pm or you are in trouble when you
717 call encoding.pm multiple times. There are several workarounds
718 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
719 who was the first to address this problem was added to AUTHORS.
720 A test suite was added for this, which is a modified version
721 of SUGAWARA-san's scripts
722 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
724 1.88 2003/02/20 14:42:34
726 one signedness nit for Encode by jhi
727 <200302161933.h1GJX876018710@kosh.hut.fi>
729 VISCII map was incorrect; fixed by Sadahiro-san
730 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
731 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
732 You can't unlink files that are opened in cygwin but the last
733 file handle opened in t/enc_*.t was left open. Patch submitted
734 by Yitzchak and he was added to AUTHORS.
735 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
737 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
738 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
740 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
741 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
743 1.87 2003/02/06 01:52:11
745 * Inaba "Sensei" Hirohito added (I thought I had done so long
746 ago but apparently I had not).
747 * SUZUKI Norio added for various and useful bug reports.
748 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
749 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
750 podchecked so all warnings are gone except for L<http://>.
751 ! encoding.pm t/enc_eucjp.t
752 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
753 * Filter option overhaul
755 ! Encode.pm Encode.xs encengine.c Encode/encode.h
756 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
757 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
758 without filter. podchecked by Dan Kogai.
759 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
760 ! lib/Encode/Alias.pm
761 decode('alias', $1) went wild because of local $_ in find_alias()
762 the evil local $_ is eradicated but that changes find_alias()
763 format for coderef aliasing. See Encode::Alias for details
764 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
766 1.86 2003/01/22 03:29:07
768 * Don't forget to canonize when you attempt an exact match!
769 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
770 * ${^ENCODING} exception is off for $] > 5.008
771 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
773 $] check commented out so it runs on 5.8.0
775 1.85 2003/01/21 22:19:14
777 ${^ENCODING} exception is now explicit rather than handled by regex.
778 + t/enc_eucjp.t t/enc_utf8.t
779 Test suite for the better "encoding" pragma support for bleedperl.
780 On 5.8.0, they will just be skipped.
782 1.84 2003/01/10 12:00:16
784 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
785 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
786 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
788 %_ fixes saves the resulting .so .05% smaller, by NC
789 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
791 Silence Encode on undef, by Andreas
792 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
793 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
795 s/regognised/recognised/ . British spelling left intact to pay
796 respect to two British Nicks :)
797 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
799 1.83 2002/11/18 17:28:49
800 ! Encode.xs lib/Encode/JIS7.pm
801 Even more patches from Inaba-san have been applied. With this
802 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
803 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
805 1.82 2002/11/14 23:06:12
807 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
808 later found in perls w/ -Dusemymalloc was fixed by NC.
809 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
811 1.81 2002/11/08 18:29:27
812 ! Encode.pm Encode.xs
813 Non-XS version of Encode::utf8 is back (with XS being default).
814 Encode::predefine_encodings(0) to turn off XS.
815 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
816 found that it was Test::More causing the problem, not Encode.
817 But I have already made it configurable so it may be useful in
819 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
821 The ingenious patch by Nicholas Clark that reduces shlib sizes by
822 50% with no penalty and backward compatibility preserved, is in.
823 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
825 1.80 2002/10/21 20:39:09
826 ! Encode.xs t/mime-header.t
827 Even more patches from NI-XS regarding Encode::utf8->decode().
828 And one more test to t/mime-header.t to prove it
829 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
831 1.79 2002/10/21 06:05:37
833 Further patches from NI-XS. Encode::utf8->decode() now checks the
834 value of utf8 flag of the argument. As a result, the fix to
835 lib/Encode/MIME/Header.pm is no longer necessary but since it did
836 no harm (even speedwise) I'll leave it unreverted.
837 ! ucm/cp949.ucm ucm/cp950.ucm
839 U+00AE REGISTERED SIGN
840 were missing as a result of 1.78. Discovered by Moriyama-san.
841 Moriyama-san has also developed a test script that compares
842 (en|de)coded results to the corresponding Win32 API result and
843 all cp9?? maps are now verified.
844 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
846 1.78 2002/10/20 15:44:00
847 ! lib/Encode/MIME/Header.pm
848 fixed so that it works with new Encode::utf8
849 ! Encode.pm Encode.xs
850 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
851 :encoding(UTF-8) to handle partial chars at end of buffers
853 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
854 ! lib/Encode/Supported.pod
855 More nitpickings applied.
858 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
859 tests were completely useless. To redeem that and get the peace of
860 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
861 ucm/*.ucm. Since this script takes too long to finish (30 seconds on
862 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
863 but you can easily run that by either renaming it or:
865 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
866 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
867 Other CJKT cp9?? also updated according to the URI below;
868 http://www.microsoft.com/typography/unicode/cscp.htm
869 + bin/ucmsort MANIFEST
870 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
871 to proper order. Intended for hardcore developers only.
872 ! ucm/cp932.ucm JP/JP.pm AUTHORS
873 CP932 mapping which was based upon the mapping file at unicode.org
874 was found obsolete by MORIYAMA Masayuki <msyk@mtg.biglobe.ne.jp>. He
875 has also supplied the patch so he was added to AUTHORS.
876 ! lib/Encode/Supported.pod
877 ISO-8859-11 != TIS 620
878 == TIS 620 + \xA0 ( )
880 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
882 1.77 2002/10/06 03:27:02
884 * Modified to accommodate the upcoming patch by Inaba-san that
885 will fix tr/// needing eval qq{}
886 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
888 * pod fixes/enhancements to reflect the changes above
889 ! lib/Encode/Alias.pm
890 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
891 Message-Id: <20021001015648.GB18710@not.autrijus.org>
893 1.76 2002/08/25 15:09:51
895 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
896 but naturally identical to previous version -- dankogai
898 Codepoint fixes -- autrijus
899 Message-Id: <20020805040236.GC5220@not.autrijus.org>
901 copied everything under perl-5.8.0/ext/Encode to make sure Encode
902 is in sync w/ perl core
904 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
905 Now that binmode(FH) does implicit ":bytes" revisit
906 the failing tests. The worrisome one is the Digest::MD5
907 test-- how will it fare in CRLF lands now?
909 From: Radu Greab <radu@netsoft.ro>
910 Date: Mon, 10 Jun 2002 00:40:34 +0300
911 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
912 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
913 ones which I cannot figure out.
914 ! lib/Encode/Alias.pm
915 Subject: [Encode PATCH] spurious warning
916 From: Nicholas Clark <nick@unfortu.net>
917 Date: Sun, 2 Jun 2002 20:26:22 +0100
918 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
920 1.75 2002/06/01 18:07:49
921 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
922 glibc compliance cited by Autrijus.
923 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
924 ! bin/enc2xs bin/piconv
925 Subject: Re: forewarning: usedevel and versiononly
926 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
928 1.74 2002/05/28 18:33:15
929 + ucm/null.ucm ucm/ctrl.ucm
930 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
931 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
932 Subject: Re: unicode -> &# notation
933 Message-ID: <20020518193704.GB40272@not.autrijus.org>
935 1.73 2002/05/28 17:26:18
936 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
937 Chris Nandor has fixed Encode so that it works w/ MacPerl --
938 at least w/ PPC (68k need static linking which does not work due to
939 64k limit). pudge is added to AUTHORS (I'm surprised he was not
940 there in the list). Encode/Makefile_PL.e2x was additionally fixed
941 by dankogai to reflect changes in other Makefile.PL
942 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
944 Subject: Change 16746: -Mutf8 cleanup.
945 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
947 1.72 2002/05/20 15:49:56
949 Subject: [PATCH] Encode should be in perl-core library path
950 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
951 Message-Id: <20020520161201.A11019@alpha.hut.fi>
952 ! lib/Encode/MIME/Header.pm
953 Subject: [PATCH] Encode::MIME::Header
954 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
955 ! Encode/Makefile_PL.e2x
956 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
957 Message-Id: <20020519201031.GA1603@not.autrijus.org>
958 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
959 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
961 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
962 Subject: [PATCH] good day for WinCE port of perl.
963 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
966 Even more linting by Robin via @16532
968 Even more typecast by Sarathy in @16460
970 1.71 2002/05/07 16:22:42
972 even more typecasts by Robin
973 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
975 A very strange bug that was causing a bogus ucm -> C table
976 generation that was revealed by a UCM file that Andreas was
977 working on. This is the weirdest kind of bug I've encountered
978 in the course of Encode maintenance.
979 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
981 1.70 2002/05/06 10:26:48
983 Made more 'module-safe' in conjunction w/ 'no encoding'.
984 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
985 ! lib/Encode/Encoding.pm
986 'require Encode' because ->Define uses Encode::define_encoding();
987 problem and solution addressed by Miyagawa-kun
988 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
990 Cuts the frill to make djgpp happier, as suggested by Laszlo
991 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
993 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
994 Message-Id: <m3bsbug48n.fsf@anima.de>
996 1.69 2002/05/04 16:41:18
997 ! lib/Encode/MIME/Header
998 Floating-point coerced for UNICOS (in integer arithmetics it folds
999 line one character too early). Verification by Mark is pending.
1000 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
1001 ! Unicode/Unicode.pm
1002 more doc patch from Elizabeth
1003 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
1004 ! Encode/Makefile_PL.e2x
1005 More platform-independent patch from Benjamin
1006 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
1007 ! lib/Encode/Guess AUTHORS
1008 split regex fix by Graham Barr. Adds him to AUTHORS.
1009 Message-Id: <20020504085419.E95940@valueclick.com>
1010 ! Encode/Makefile_PL.e2x
1011 enc2xs script discovery made smarter and more sensible, first cited
1012 by Miyagawa-kun and further suggestions by Rafael and Andreas
1013 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
1014 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
1016 1.68 2002/05/03 12:20:13
1017 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
1018 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
1019 and Supported.pod modified to reflect the change. Elizabeth added
1020 to Authors. And H.M. is also added for forwarding her patch among
1021 other contributions (I was rather surprised to find his name was not
1023 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
1025 1.67 2002/05/02 07:33:09
1027 Error message now consistent w/ perlqq (\N{U+} -> \x{})
1028 done in perl@16308 but Philip linted me further. Now the error
1029 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
1030 ! lib/Encode/Guess.pm
1031 Sanity check for happier -w by Autrijus
1033 1.66 2002/05/01 05:41:06
1034 ! Encode.xs t/fallback.t
1035 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
1036 while fallback is in effect. This even came with a welcome side-effect
1037 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
1038 also modified to test this.
1039 And of course, the corresponding variables to UV[Xx]f are appropriately
1040 cast. This should've concluded NI-XS homework.
1042 encode(undef) does warn again! Repented upon suggestion by NI-XS.
1043 Document for unless vs. '' added
1044 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
1046 1.65 2002/04/30 16:13:37
1048 encode(undef) no longer warns for C<Use of uninitialized value in
1049 subroutine entry>. Suggested by Paul.
1050 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
1051 ! lib/Encode/Supported.pod
1052 Encode::MIME::Header and Encode::Guess mentioned
1053 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
1054 ! lib/Encode/Guess.pm
1055 POD fix by Miyagawa-kun
1056 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
1058 1.64 2002/04/29 06:54:06
1060 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
1061 encode("euc-jisx0213") are still DIFFERENT.
1062 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
1064 A few white spaces corrected by NI-XS via PerlIO integration to
1066 Subject: Change 16247: Integrate perlio;
1068 Document fixes by Andreas
1069 Message-Id: <m3k7qsf1we.fsf@anima.de>
1071 1.63 2002/04/27 18:59:50
1072 ! lib/Encode/Encoding.pm
1073 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
1074 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
1075 Make use of the Encode::Encoding base class!
1076 And other cleanups in Encode.xs upon NI-XS suggestions
1077 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
1079 1.62 2002/04/27 11:17:39
1081 encodings() now just checks %ExtModule instead of eval{require}
1082 all of them for ":all" to conserve more memory.
1084 more "%x" -> "%" UVxf stuff.
1086 s/=over2/=over 2/g # oops.
1088 1.61 2002/04/26 03:02:04
1090 Now does decent tests besides use_ok()
1091 ! lib/Encode/Guess.pm t/guess.t
1092 UI streamlined, document added
1093 ! Unicode/Unicode.xs
1094 various signed/unsigned mismatch nits (#16173)
1095 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
1097 POD: utf8-flag-related caveats added. A few sections completely
1101 Thou shalt not assume %d works, either!
1102 Robin Baker added to AUTHORS for this
1103 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
1105 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
1107 1.60 2002/04/24 20:06:52
1109 "Thou shalt not assume %x works." -- jhi
1110 Message-Id: <20020424210618.E24347@alpha.hut.fi>
1111 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL
1112 To make low-memory build machines happy, now *.c is created for each *.ucm
1113 (no table aggregation). You can still override this by setting
1114 $ENV{AGGREGATE_TABLES}.
1115 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
1116 + lib/Encode/Guess.pm
1117 + lib/Encode/JP/JIS7.pm
1118 Encoding-autodetect (mainly for Japanese encoding) added. In the
1119 course of development, JIS7.pm was improved.
1120 + lib/Encode/HTML/Header.pm
1121 + lib/Encode/Config.pm
1122 MIME B/Q Header Encoding Added!
1123 ! Encode.pm Encode.xs t/fallback.t
1124 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
1125 Message-Id: <20020424130709.GA14211@tanglefoot>
1127 1.59 2002/04/22 23:54:22
1128 ! Encode.pm Encode.xs
1129 needs_lines() and perlio_ok() are added to Internal encodings such
1130 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
1132 1.58 2002/04/22 23:54:22
1134 s/MacChineseSimp/MacChineseTrad/ # ... oops.
1139 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
1140 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
1141 t/(JP|KR).t are aggregated to t/CJKT.t
1142 test data are all remade via bin/ucm2text.
1143 And .... They are no longer skipped for -Uuseperlio !
1145 1.57 2002/04/22 20:27:30
1146 ! t/JP.t t/KR.t t/perlio.t
1147 unless (find PerlIO::Layer 'perlio') ... line is back again.
1148 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
1149 need PerlIO. Sigh....
1150 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
1151 ->perlio_ok now does eval{ require PerlIO::encoding } there so
1152 it correctly returns 1 when PerlIO::encoding is yet loaded.
1154 perl-current patch #16072 reflected
1156 1.56 2002/04/22 09:48:07
1157 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
1158 New PerlIO::encoding 0.04 compliance met
1160 1.55 2002/04/22 03:43:05
1161 ! Encode.pm Encode.xs Unicode/Unicode.pm
1162 needs_lines() defined so Encode::Encoding is no longer needed
1165 1.54 2002/04/22 02:50:01
1166 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
1167 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
1168 now perlio_ok is true by default if PerlIO::encoding->VERSION is
1169 0.03 or larger. POD in Encode::Encoding revised to reflect this.
1170 Encode::XS and Encode::Unicode now have perlio_ok() method.
1171 ! lib/Encode/Supported.pod
1172 s/UP-UX/HP-UX/ by jhi
1173 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
1174 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
1175 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
1176 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
1177 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
1178 ! lib/Encode/Supported.pod
1179 Huge document fixes by Philip.
1182 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
1183 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
1185 binmode() after "<:encoding" to make Win32 happy, by Mattia.
1186 Mattia added to AUTHORS file
1187 Message-Id: <3CC3150F.5798.22A05AE@localhost>
1189 1.52 2002/04/20 23:43:47
1191 TODO: is now SKIP:, as NI-XS requested. Also, more
1192 elaborate failure analysis added.
1194 A note on how to make sure of round-trip safety added to POD
1195 section (so Autrijus is happier)
1196 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
1197 big5-(eten|hkscs) is round-trip safe again!
1198 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
1200 Typo fixes by Andreas
1201 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
1202 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
1203 PerlIO coordination patches from NI-XS.
1204 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
1206 1.51 2002/04/20 09:58:23
1208 Updated test suite by Autrijus so "make test" is happy again
1209 Message-Id: <20020420082104.GA25037@not.autrijus.org>
1211 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
1213 TW/TW.pm TW/Makefile.PL
1214 Updates by Autrijus. 'big5' is no longer a canonical but an
1215 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
1216 Message-Id: <20020419195346.GA19597@not.autrijus.org>
1218 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
1219 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
1221 PerlIO detection a little bit smarter; no longer uses eval qq{}
1224 1.50 2002/04/19 06:13:02
1225 ! Encode.pm Encode.xs Encode/encoding.h
1227 New Fallback API implemented and documented. See "perldoc Encode"
1229 ! lib/Encode/JP/JIS7.pm Encode.pm
1230 + lib/Encode/PerlIO.pod t/perlio.t
1231 API compliance met. However, it still does not work unless perlio
1232 implements line buffer. See BUGS section in perldoc Encode::PerlIO
1233 As a sensible workaround, perlio_ok() added to Encode.
1235 ! lib/Encode/Supported.pod
1237 Message-Id: <20020418174647.J8466@alpha.hut.fi>
1239 Doc fixes from Autrijus
1240 Message-Id: <20020418144131.GA10987@not.autrijus.org>
1242 perlqq mode documented
1244 + t/jisx0201.euc t/jisx0201.ref
1245 ! t/jisx0208.euc t/jisx0208.ref
1246 t/JP.t tests more rigorously and with other encodings
1247 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
1248 in jis0208 (used to contain jisx0201 part).
1249 ! Encode/Makefile_PL.e2x
1250 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
1251 enc2xs and encode.h rather than hard-coding them. This makes the resulting
1252 module fully CPANizable.
1253 ! encoding.pm t/JP.t t/KR.t
1254 PerlIO detection simplified (checks %INC instead of eval{})
1255 ! Encode.xs Encode/encode.h
1256 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
1257 - lib/Encode/Unicode.pm
1258 (en|de)code_xs relocated to where it belongs. Source reindented
1261 Additional (U8 *) cast added as suggested by jhi
1262 Message-Id: <20020417165916.A28599@alpha.hut.fi>
1264 1.42 Date: 2002/04/17
1266 no-op module; Thought of adding a pod there but enc2xs has
1271 correct mechanism to detect PerlIO::encoding layer installed.
1273 PerlIO Layer detached.
1275 1.41 2002/04/16 23:35:00
1277 binmode(STDIN|STDOUT ...) done iff PerlIO is available
1279 Cleaned up PerlIO skip conditions to prepare for the upcoming
1280 Encode - PerlIO forking.
1282 exported functions are now prototyped.
1283 ! lib/Encode/CN/HZ.pm
1286 fallback implemented # was /* FIXME */
1287 affected programs revised to fit (only HZ was using the try-catch
1288 approach which needed to be fixed for API-compliance).
1290 ! Encode/KR/2022_KR.pm
1292 can find =head1 NAME now, jhi
1293 Message-Id: <20020416083059.V30639@alpha.hut.fi>
1297 now compiles with fewer warnings with the pickiest compilers.
1298 Suggested by Craig, fixed by Dan.
1299 ! Encode/Makefile_PL.e2x
1301 A bug that fails to find *.e2x in certain conditions fixed
1303 1.40 2002/04/14 22:27:14
1304 + Encode/ConfigLocal_PM.e2x
1305 ! lib/Encode/Config.pm
1307 "enc2xs -C" now generates/updates Encode::ConfigLocal.
1308 ConfigLocal_PM.e2x is a skeleton thereof.
1309 ! lib/Encode/Config.pm
1311 "use Encode::CN::HZ;" was missing.
1314 More rigorous tests added to test XS, especially on memory allocation.
1316 ! lib/Encode/Unicode.pm
1317 NI-S implemented an XS version -- merged
1318 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
1321 Source filter option added. With this option on, you can write
1322 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
1323 encodings. t/jperl.t enhanced to test this feature.
1325 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
1326 apply his suggestion, this degree of nitting is enough to add him
1328 Message-Id: <3CB93223.291E5E2E@earthlink.net>
1330 + lib/Encode/JP/JIS7.pm
1331 - lib/Encode/JP/JIS.pm
1332 - lib/Encode/JP/2022_JP.pm
1333 - lib/Encode/JP/2022_JP1.pm
1334 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
1335 JIS7.pm for better maintainability and performance
1337 Added caveat for non-ascii identifiers.
1339 fixes by jhi, the original author of this pragmatic module.
1340 Message-Id: <20020413231527.V1826@alpha.hut.fi>
1342 1.34 2002/04/12 20:23:05 (Unreleased)
1345 EBCDIC fixes addressed by jhi.
1346 Message-Id: <20020412161844.D9383@alpha.hut.fi>
1347 ! lib/Encode/Encoder.pm
1348 POD fix by Miyagawa-kun
1349 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
1351 1.33 2002/04/10 22:28:40
1353 Philip's mail address corrected.
1356 ! lib/Encode/Encoder.pm
1357 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
1358 This adds Miyagawa-kun to AUTHORS.
1359 * encoding() no longer exported by default but on demand
1360 * t/Encoder.t updated to test all these
1361 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
1362 ! lib/Encode/Unicode.pm
1363 ! lib/Encode/Supported.pm
1364 Further doc fixes by Anton
1366 1.32 2002/04/09 20:06:15
1369 - ucm/macDevanaga.ucm Unicode Character Map
1370 - ucm/macGujarati.ucm Unicode Character Map
1371 - ucm/macGurmukhi.ucm Unicode Character Map
1372 A utility to check integrity of .ucm files. t/bogus.ucm is a
1373 ucm that is deliberately bogus. unused Indic mappings are removed
1376 resolve_alias() added as suggested by jhi. Same as
1377 find_encoding("alias")->name. For convenience. This one is
1378 defined in Encode.pm instead of Alias.pm.
1379 Message-Id: <20020409215846.H17022@alpha.hut.fi>
1381 Memory allocation bug detected during the devel of ucmlint -- fixed.
1382 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1383 ! lib/Encode/Unicode.pm
1384 valid_ucs2(0) is false but must be true.
1385 3 patches from NI-S as follows. This also has fixed the incident
1387 ! lib/Encode/Alias.pm
1388 find_alias() recursion prevention
1390 Checks for the patch above
1391 ! t/Encode/Unicode.pm
1392 An extra "F" that causes valid_ucs2() to return a bogus value fixed
1393 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1394 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1395 2 Small Patches from jhi as follows:
1397 Encode->encodings() lists in case-insensitive order (as it was)
1399 -l option prints available encodings to STDOUT instead of STDERR
1400 ! lib/Encode/Aliases.pm
1401 s/defintion/definition/
1402 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1404 ! lib/Encode/Supported.pod
1405 ! lib/Encode/Unicode.pm
1406 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1407 Thank you for the exact quote of Douglas Adams :)
1408 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1410 1.31 2002/04/08 18:08:07
1411 ! lib/Encode/Encoder.pm
1413 Encode::Encoder, once just a placeholder of an idea, is now much more
1414 practical. See t/Encode.t to find how practical it can be.
1415 + lib/Encode/Config.pm
1417 My false laziness at Encode.pm is fixed. Now %ExtModules are set
1418 in Encode::Config and they are all literally, not programmatically
1419 set. My false laziness was resulting in many encodings missing from
1421 ! lib/Encode/Unicode.pm
1423 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1424 so that it does not rely on Encode::Unicode for BOM values
1425 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1427 1.30 2002/04/08 02:34:51
1428 + lib/Encode/Encoder.pm
1429 Object Oriented Encoder. I reckon something like this is in need.
1432 ! lib/Encode/Supported.pod
1433 * autoloading bug that prevented upper-case canonicals such as UTF-16
1434 is fixed. Now even UTF/UCS are autoloaded!
1435 * encodings() is now more intuitive.
1436 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1438 * Obligatory fixes to the POD.
1439 ! lib/Encode/Supported.pod
1440 Patch from Anton applied.
1441 Message-Id: <66641479.20020408033300@motor.ru>
1443 ! lib/Encode/Unicode.pm
1444 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1446 1.28 2002/04/07 18:58:42
1450 Just a MANIFEST for those missing files.
1452 1.26 Date: 2002/04/07 15:22:04
1455 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
1456 And t/Aliases.t fixed to make sure they all exist.
1458 1.25 2002/04/07 15:01:25 (Unreleased)
1460 ! lib/Encode/Unicode.pm
1463 - lib/Encode/UTF_EBCDIC.pm
1464 - lib/Encode/Internal.pm
1465 - lib/Encode/utf8.pm
1466 Integrated into Encode.pm as closures. That way "one package, one file"
1467 rule is preserved yet fewer files to require.
1469 commented out binmode(STDERR ...
1473 ! EBCDIC/Makefile.PL
1476 ! Symbol/Makefile.PL
1478 ! Encode/Makefile_PL.e2x
1479 Schwern's MM-compliance patch merged
1480 Message-Id: <20020406082609.GA28758@blackrider>
1482 ! lib/Encode/Unicode.pm
1483 + lib/Encode/UTF_EBCDIC.pm
1485 - lib/Encode/10646_1.pm
1486 - lib/Encode/ucs2_le.pm
1487 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1488 per-module based (en|de)code, I saved a number of .pm by
1489 reorganizing it as per-object base (Well, this is what Encode::XS
1490 does under the hood). See Encode::Unicode for details.
1491 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1492 This module is used only on EBCDIC environments.
1494 1.21 2002/04/05 14:46:34 (Not Released)
1500 Are back to make Perl/Tk happy. Smile, NI-S.
1502 ! lib/Encode/Alias.pm
1503 ! lib/Encode/Supported.pm
1504 ! lib/Encode/10646_1.pm
1505 ! lib/Encode/ucs2_le.pm
1506 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1507 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1508 updated to reflect changes.
1509 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1510 ! lib/Encode/Alias.pm
1511 ! lib/Encode/Supported.pm
1512 Anton's revision committed. Added Dan's own fixes as well.
1513 Message-Id: <159103166906.20020405161134@motor.ru>
1514 ! lib/Encode/Alias.pm
1516 < qr/^UCS2-le$/i => '"UCS-2"', );
1518 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1519 Sigh. Thank you, Anton.
1520 Message-Id: <14567692196.20020405062020@motor.ru>
1521 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1523 1.20 2002/04/04 19:50:52
1525 The last-minute addition. Just give it a try. Docs remain to be done.
1526 Not installed by default.
1527 ! lib/Encode/Supported.pod
1530 ! lib/Encode/Alias.pm
1531 ! lib/Encode/utf8.pm
1532 ! lib/Encode/10464_1.pm
1533 ! lib/Encode/ucs2_le.pm
1534 Canonical name for 'UCS-2le' is now "UTF-16LE". UCS-2 left
1535 unchanged but UTF-16BE is added as an alias. Implicit aliases
1536 moved to Encode::Alias so init_alias() works more as expected.
1537 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1538 Though pedantically wrong, this should make perl mongers happier.
1539 t/Alias.t is enhanced to test all these.
1540 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1542 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1543 of the codepoints are common. CJKT left as is because the saving is
1547 ! EBCDIC/Makefile.PL
1549 ! Encode/Makefile_PL.e2x
1553 ! Symbol/Makefile.PL
1557 All occurrences of _def.h replaced with .exh so djgpp works happily
1558 ever after! To credit this amazing discovery, Laszlo is now in
1560 Message-Id: <20020403181424.GA8778@freemail.hu>
1561 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1564 ! Encode/Makefile_PL.skel
1566 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1567 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1569 Two more tests added by jhi
1570 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1573 The showstopper fixed -- Memory reallocation bug was causing
1574 Encode::XS to fall into infinite loop on certain conditions.
1575 t/grow.t tests that.
1576 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1581 ! lib/Encode/Supported.pod
1582 Vendor encodings rebuilt out of original map files at unicode.org.
1583 Indic languages such as MacDevanagari remain unsupported due to the
1584 shortcomings of encengine capabilities (they need algorithmic
1585 conversion and I have no knowledge of that!). Pods fixed for added
1587 Oh, macJapan.ucm renamed to macJapanese.ucm.
1588 macROMnn is macRomanian and macRUMnn is macRumanian.
1589 txt2ucm is a crude script that is used to convert them.
1591 Unicode Compound Characters (used extensively on Mac) supported
1593 Typo fixes and improvements by jhi
1594 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1596 1.11 2002/03/31 22:12:13
1600 Missing files from the MANIFEST fixed.
1601 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1602 Version incremented just to make CPAN happy.
1604 1.10 2002/03/31 21:32:42
1607 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1608 if you want. This should make Autrijus happy. Also, piconv
1609 is added to default install.
1612 Here-documented files that enc2xs generates are now exported
1613 to *.e2x. Much cleaner and easier to debug.
1615 encoding enhanced so you can make it act more like such
1616 (now prehistoric) "localized" variations of perl like Jperl.
1618 Further test for encoding.pm. Written in euc-jp
1621 Taken over from jhi.
1622 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1627 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1634 ! Encode/macIceland.ucm
1635 ! lib/Encode/Alias.pm
1636 ! lib/Encode/Supported.pod
1637 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1638 Message-Id: <m3lmcavhjt.fsf@anima.de>
1640 1.01 2002/03/29 20:59:39
1643 s/USE_SCRIPTS/MORE_SCRIPTS/
1645 installs enc2xs by default for external Encode:: modules in CPAN,
1646 such as Encode::HanExtra
1648 More sensible perl core detection via $ENV{PERL_CORE}
1650 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1652 Perl core detection via $^X =~ m/\bminiperl$/o
1653 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1655 1.00 Wed Mar 29 2002
1657 The version of all files is updated to 1.00 via "ci -f -l1.00",
1658 commemorating version 1.00. All files, including *.ucm are now
1659 under version control.
1662 encode.h moved to Encode/ so it will be installed for the later
1665 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1670 compile renamed to enc2xs.
1671 Affected Makefile.PL updated
1673 "Punt it. HanExtra can take care of that later." -- Autrijus
1674 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1677 ! Encode/ksc5601.ucm
1678 ! lib/Encode/CJKConstants.pm
1679 ! lib/Encode/KR/2022_KR.pm
1680 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1681 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1685 bin/ added for example scripts. They are not installed by default.
1686 To install them, run "perl Makefile.PL USE_SCRIPTS".
1687 piconv is iconv reinvented in perl. In addition to all features
1688 of iconv, it also adds perlish features. See L<piconv/1> for more
1690 ! lib/Encode/Alias.pm
1691 qr/^ replaced with qr/\b so it directly matches locale names
1692 such as en_US.US-ASCII
1695 Patch by MJD to fix the following problem applied.
1696 Subject: [PATCH 5.7.3 Encode]
1697 Aliases.t not properly skipped when Encode extension not built
1698 Message-Id: <20020328091850.18677.qmail@plover.com>
1699 ! lib/Encode/KR/2022_KR.pm
1700 ! lib/Encode/CJKConstants.pm
1701 Another patch from Jungshik to make iso-2022-kr actually work
1702 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1703 ! Encode/Encode/euc-kr.ucm
1704 + Encode/Encode/johab.ucm
1705 ! Encode/Encode/ksc5601.ucm
1707 ! Encode/KR/Makefile.PL
1708 ! Encode/lib/Encode/Alias.pm
1710 Johab support and complete revision of Korean Encoding by Jungshik
1711 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1713 Revised to make up for the now-dropped Encode::Details.
1714 - lib/Encode/Details.pod
1715 Dropped. Besides being obsolete, the topics are now covered in
1716 the respective pods.
1721 Korean aliases fixed thanks to Jungshik Shin
1722 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1723 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1725 =head1 NAME added to all modules to make buildtoc happy
1726 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1727 - lib/Encode/CJKguide.pod
1728 Too controversial and dropped from the dist. Will be available
1729 separately on the web.
1731 RCS tags added so table debugging gets easier (should that be
1732 needed! I hope they all stay 1.00!)
1733 + lib/Encode/CJKguide.pod
1734 A detailed guide to mainly, but not limited to, CJK multibyte
1737 + Encode/hp-roman8.ucm
1739 ! Encode/Supported.pod
1740 All occurrences of "roman8" replaced with "hp-roman8" to avoid
1742 ! Encode/Supported.pod
1745 Mac encodings now comply with Inside Macintosh
1747 Test for '-raw' conventions added.
1749 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1750 ! Encode/gb12345.ucm
1752 ! Encode/ksc5601.ucm
1753 "-raw" appended to canonical names.
1754 File names stay unchanged thanks to UCM format.
1755 ! lib/Encode/CN/HZ.pm
1756 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1757 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1759 0.99 Tue Mar 26 2002
1760 - lib/Encode/JP/Const.pm
1761 + lib/Encode/CJKConstants.pm
1762 + lib/Encode/CN/2022_CN.pm
1763 + lib/Encode/KR/2022_KR.pm
1772 * Support for ISO-2022-KR and ISO-2022-CN added.
1774 * more t/*.{euc,ref} added, which were autogenerated by ucm2table
1775 * ucm2table autogenerates character table out of UCM files.
1778 - lib/Encode/Supports.pod
1779 + lib/Encode/Supported.pod
1780 Names reverted due to popular demand.
1781 8.3 rule applies only when there is a conflict.
1782 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1787 - lib/Encode/Format/Enc.pod
1789 * Character tables are now 100% ucm.
1790 * All files under Encode/ are now 8.3-compliant
1791 * some of the missing encodings added (e.g. gsm0338 and nextstep)
1792 * Vendor mappings aggregated with the appropriate national std in
1793 Makefile.PL, resulting in smaller *.so, especially for CJK.
1794 The following is the result on Dan's FreeBSD box.
1796 ---------------------------------------------------------------
1797 blib/arch/auto/Encode/Byte/Byte.so          157,279     171,042
1798 blib/arch/auto/Encode/CN/CN.so            1,634,476   1,626,685
1799 blib/arch/auto/Encode/EBCDIC/EBCDIC.so       18,476      18,476
1800 blib/arch/auto/Encode/Encode.so              27,791      27,791
1801 blib/arch/auto/Encode/JP/JP.so            1,408,056   1,832,811
1802 blib/arch/auto/Encode/KR/KR.so            1,156,518   1,329,587
1803 blib/arch/auto/Encode/Symbol/Symbol.so       23,940      20,990
1804 blib/arch/auto/Encode/TW/TW.so*             948,761   1,316,437
1805 ---------------------------------------------------------------
1806 Total                                     5,375,297   6,343,819
1808 * As a result of the ucm transition, Encode::Tcl was dropped because
1809 Encode::Tcl requires *.enc.
1810 Encode::Tcl will be supplied in a separate tarball with *.enc.
1811 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1816 -lib/Encode/Supported.pod
1817 +lib/Encode/Supports.pod
1818 -lib/Encode/iso10646_1.pm
1819 +lib/Encode/10646_1.pm
1820 -lib/Encode/EncFormat.pod
1821 +lib/Encode/Format/Enc.pod
1822 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
1823 - lib/Encode/JP/Constants.pm
1824 + lib/Encode/JP/Consts.pm
1825 ! lib/Encode/JP/JIS.pm
1826 ! lib/Encode/JP/H2Z.pm
1827 Version nit problem and 8.3 rule fix.
1828 > Package namespace installed latest in CPAN file
1829 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1830 was noted by jhi; then Dan discovered that "Constants.pm" does not comply with the 8.3
1831 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
1832 accordingly. In addition, legacy "use vars qw()..." are replaced with
1834 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1835 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1837 - lib/Encode/JP/ISO_2022_JP.pm
1838 - lib/Encode/JP/ISO_2022_JP_1.pm
1839 + lib/Encode/JP/2022_JP.pm
1840 + lib/Encode/JP/2022_JP1.pm
1842 8.3 naming conflict for vanilla FAT addressed by jhi
1843 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1846 Typecast fix addressed by jhi
1847 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1849 0.98 Mon Mar 25 2002
1850 ! lib/Encode/Supported.pod
1852 + lib/Encode/JP/ISO_2022_JP_1.pm
1853 ! lib/Encode/JP/ISO_2022_JP.pm
1854 ! lib/Encode/JP/JIS.pm
1856 Now Encode::JP is more strict on the difference between ISO-2022-JP
1857 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1858 makes Anton happier :) FYI the previous version implements
1859 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1860 ! lib/Encode/Supported.pod
1863 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1864 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1869 ! lib/Encode/Supported.pod
1870 pod fixes to replace F<http://...> with L<http://...>,
1871 as suggested by Autrijus in:
1872 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1873 ! lib/Encode/Supported.pod
1874 fixes and enhancements by Anton
1875 Message-Id: <10632060120.20020324103753@motor.ru>
1876 ! lib/Encode/Alias.pm
1877 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1878 added. Suggested by Anton, then deobfuscated by Autrijus
1879 Message-Id: <20020324064455.GA3667@not.autrijus.org>
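A short sketch of how a regex alias with a capture group behaves, assuming a reasonably recent Encode. The alias family `myset-N` is made up for illustration; `define_alias` and `find_encoding` are the documented Encode::Alias and Encode functions. The right-hand side is a string that is evaluated after the match, which is why the double quotes sit inside the single quotes.

```perl
use strict;
use warnings;
use Encode;          # exports find_encoding
use Encode::Alias;   # exports define_alias

# Hypothetical alias family: "myset-<N>" or "myset<N>" maps to "iso-8859-<N>".
# The inner double quotes let $1 interpolate when the alias is looked up.
define_alias( qr/^myset[- ]?(\d+)$/i => '"iso-8859-$1"' );

my $enc = find_encoding('myset-1');
print $enc ? $enc->name : 'not found', "\n";
```

Writing the target as `'"gb$1"'` in the entry above follows the same convention: the outer single quotes delay interpolation until the pattern has matched.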
1881 Further fix by Nicholas Clark
1882 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1883 - lib/EncodeFormat.pod
1884 + lib/Encode/EncFormat.pod
1886 File renamed as suggested by Autrijus
1888 ! lib/Encode/Details.pod
1889 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1890 ! Encode.pm Sun Mar 24 13:43:47 2002
1891 pod fixes by Autrijus.
1892 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1893 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1895 ! lib/Encode/Alias.pm
1897 now more EBCDIC conscious;
1898 %ExtModules on EBCDIC system excludes CJK so that you don't
1899 have to worry about a matched alias causing a croak.
1900 t/Alias.t also revised to reflect changes. Verified by jhi
1901 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1903 0.97 Sun Mar 24 2002
1907 EBCDIC detection mechanism installed as in JP/JP.pm
1908 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1911 ! EBCDIC/Makefile.PL
1914 ! Symbol/Makefile.PL
1916 Now all table files used by compile are postfixed '_t' to avoid
1917 namespace collisions on case-insensitive file systems once and for all!
1919 Message-ID: <58290227735.20020323195659@familiehaase.de>
1921 Since the Encode::JP is unsupported under EBCDIC we
1922 cannot run this test (aliases as such should work fine) -- jhi
1923 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1925 duplicate occurrence of ascii.ucm and 8859-1.ucm
1926 causes MacOS X dyld to croak
1932 < chdir 't' if -d 't';
1934 > if (! -d 'blib' and -d 't'){ chdir 't' };
1935 When you are "make test"-ing in the Encode/ directory, you must not
1936 change $ENV{PWD}. t/JP.t had been fixed before but the others somehow
1937 remained unchanged. Also the situation detection was made simpler
1938 in t/JP.t, which was originally:
1939 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1941 "Use of uninitialized value in string eq at Encode.pm line 96."
1942 ! Symbol/Makefile.PL
1943 ! EBCDIC/Makefile.PL
1945 -- Problem on case insensitive file systems
1946 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1947 Message-ID: <88254111953.20020323095503@familiehaase.de>
1950 "So I think it's a bug in gcc, not perl. But it still needs to be
1952 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1953 Message-Id: <20020323170509.C96475@plum.flirble.org>
1955 0.96 Sat Mar 23 2002
1957 ! lib/Encode/Encoding.pm
1958 ! lib/Encode/Alias.pm
1959 ! lib/Encode/Supported.pod
1961 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1962 Message-ID: <20020322073908.GB10539@blackrider>
1965 "...I think we should include ISO 8859-1 as well." -- NI-S
1966 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1971 ! lib/Encode/Alias.pm
1972 alias definitions relocated to Encode::Alias so module autoloading
1973 works for aliases also.
1975 encodings() now accepts args to check ExtModules.
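A brief sketch of the argument forms as documented in current Encode; behavior at the time of this release may have differed in detail.

```perl
use strict;
use warnings;
use Encode;

# No argument: only encodings that are already loaded.
my @loaded = Encode->encodings();

# ":all": also the encodings provided by modules not yet loaded.
my @all = Encode->encodings(':all');

# A module name: loaded encodings plus those provided by that module.
my @with_jp = Encode->encodings('Encode::JP');

printf "loaded=%d all=%d with_jp=%d\n",
    scalar(@loaded), scalar(@all), scalar(@with_jp);
```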
1979 + EBCDIC/Makefile.PL
1980 + Symbol/Makefile.PL
1984 Latin and single byte encodings are reorganized so they are
1985 demand-loaded like Encode::XX. Now only ascii is compiled into
1987 ! lib/Encode/Alias.pm
1988 for my $k (keys %hash){ delete $hash{$k}; }
1989 is deprecated; fixed.
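The entry does not say what replaced the loop. As an assumption, one common idiom for emptying a hash is to assign the empty list, sketched here:

```perl
use strict;
use warnings;

my %hash = (a => 1, b => 2, c => 3);

# Old style: delete each key in turn.
# for my $k (keys %hash) { delete $hash{$k}; }

# Common idiom: assign the empty list to clear the hash in one step.
%hash = ();

print "keys left: ", scalar(keys %hash), "\n";
```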
1991 0.95 Fri Mar 22 2002
1992 In this update, pod rewrites and alias fixes are the main issues
1993 + lib/Encode/Supported.pod
1994 Describes supported encodings
1996 streamlined compiled-in encodings.
1997 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1999 + Encode/ibm-125?.ucm
2000 Added from the ICU distribution, with every occurrence of
2001 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
2002 some respect to icu staff, however.
2003 + lib/Encode/Alias.pm
2005 Alias definitions in Encode.pm relocated.
2008 packWARN patch from Paul Marquess via jhi
2009 Message-Id: <20020321010101.O28978@alpha.hut.fi>
2010 Paul added to AUTHORS as a result.
2011 ! t/CJKalias.t -> t/Aliases.t
2012 Renamed. Checks even more aliases and alias overloading
2015 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
2016 gbk => cp936 relocated to CN.pm
2018 Test::More with plans (by jhi)
2020 0.94 Thu Mar 21 2002
2021 + lib/Encode/Description.pod
2022 ! lib/Encode/Encoding.pm
2023 Now the pod in Encode.pm is abridged as programming references.
2024 lib/Encode/Description.pod contains the original, detailed description
2025 and Encode::Encoding explains how to write your own module to
2026 add new encodings. So far, lib/Encode/Description.pod contains
2027 the whole pod formerly in Encode.pm. This is intentional.
2029 Pod revisions by Anton Tagunov
2030 Message-Id: <517178431.20020320174824@motor.ru>
2032 all occurrences of Encode::Tcl::Extended removed, including pod
2034 test now checks $encoding->name only; $encoding->{name} is
2035 no longer checked to find the canonical name.
2036 ! lib/Encode/JP/JIS.pm
2037 ! lib/Encode/JP/ISO_2022_JP.pm
2038 ->name() added to be more compliant with API
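A small sketch of the accessor in use, written against current Encode. The alias `latin1` is chosen only because it is always available; the same call applies to the JP encoding objects mentioned above.

```perl
use strict;
use warnings;
use Encode;

# find_encoding resolves an alias and returns an encoding object.
my $enc = find_encoding('latin1');

# Ask the object for its canonical name through the accessor
# instead of reaching into the hash with $enc->{name}.
print $enc->name, "\n";
```

Going through the method keeps callers independent of how an encoding class stores its name internally.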
2044 Patch by Autrijus to add aliases to TW and fixes to POD
2045 Message-Id: <20020320090619.GA24774@not.autrijus.org>
2047 SADAHIRO Tomoyuki added, as he should have been. My apologies.
2049 0.93 Wed Mar 20 2002
2050 * First release to be uploaded to CPAN. For prehistoric changes,
2051 please see Changes file of perl distribution as well as
2052 perl-unicode@perl.org archive, available at:
2053 http://archive.develooper.com/perl-unicode@perl.org/
2055 Changes since 0.92 include:
2060 + Mention of perl-unicode@perl.org added
2062 + Encoding aliases added so you can feed locale names
2063 and MIME Charset="" directly.
2064 - Mention of JISX0212 removed because it's fixed
2067 + Encoding aliases added. Note TW is left untouched because
2068 euc-tw is not implemented in TW but in Encode::HanExtra.
2069 Autrijus, you may fix Encode::HanExtra.
2071 + to test encode aliases added