1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.26 2008/07/01 20:56:17 dankogai Exp dankogai $
5 $Revision: 2.26 $ $Date: 2008/07/01 20:56:17 $
7 Absense of Encode::ConfigLocal no longer carps no matter what.
8 https://bugzilla.redhat.com/show_bug.cgi?id=435505#c2
9 https://rt.cpan.org/Ticket/Display.html?id=28638
10 https://rt.cpan.org/Ticket/Display.html?id=11511
12 use encoding 'utf8' and 'iso-2022-jp' glitches on perl 5.10
14 Message-Id: <693254b90807011224h3ab50d76v50c6fea87baf223c@mail.gmail.com>
15 ! lib/Encode/Alias.pm t/Aliases.t
16 macintosh' not recognize as MacRoman
17 http://rt.cpan.org/Ticket/Display.html?id=36326
19 s{INC => "-I./Encode"}
20 {INC => '-I' . File::Spec->catfile( '.', 'Encode' )}
21 To prevent some platforms from forgetting to include Encode/encode.h.
22 http://rt.cpan.org/Ticket/Display.html?id=36348
24 2.25 2008/05/07 20:56:05
26 added ':default' to Exporter option.
27 ! lib/Encode/GSM0338.pm
28 GSM0338 now handles coderef in CHECK
29 http://rt.cpan.org/Ticket/Display.html?id=31335
31 Perl 5.10/Encode 2.24: Tiny typo in Encode's Makefile.PL arg processing
32 Message-Id: <961C2A4F-92B3-416D-A9F9-E7B0ADA9F134@fsck.com>
34 "This fix for Encode::Alias should make Solaris happy:"
35 Message-ID: <47D886D9.6060001@iki.fi>
37 2.24 2008/03/12 09:51:11
38 ! lib/Encode/Config.pm
39 adds and fixes also adds cp858 support.
40 ! Encode.pm encoding.pm lib/Encode/Alias.pm ucm/cp858.ucm
42 > Change 33486 by rgs@scipion on 2008/03/12 08:50:11
43 An unfortunate side-effect of Encode and Encode::Alias use'ing each
44 other, and Encode::Alias exporting functions into Encode for it to use
45 as methods, broke the loading of the find_alias() Encode method in some
46 cases since 5.10. Breaking the recursive inheritance fixes it.
47 Message-Id: <b77c1dce0803120151o4166c3a0gfcfd14681ab7e10d@mail.gmail.com>
50 Message-Id: <200711281835.36125@bloodgate.com>
52 Fix by MIYAGAWA via CodeRepos
53 http://coderepos.org/share/changeset/1791
54 ! encoding.pm t/mime_header_iso2022jp.t
55 ported back from Perl 5.10-RC1
57 2.23 2007/05/29 18:15:32
59 got rid of global fallback_cb; encode_method() now takes one more
60 argument which is a coderef to fallback. This should make
61 encode_method() thread-safe.
63 Added perluniintro, perlunifaq, and perlunitut to POD
65 Plug a memory leak in Encode -- by rgs
66 Message-Id: <b77c1dce0705290858v2be239c3o2d726e3d59091493@mail.gmail.com>
69 http://aspn.activestate.com/ASPN/Mail/Message/perl5-porters/3486118
71 man page generation is now conditional; yes by default but no if $PERL_CORE
72 Message-Id: <b77c1dce0705290237h5c4667cdlf79a48b839170add@mail.gmail.com>
74 2.22 2007/05/29 07:35:27
76 from_to() does not honor the check while decoding. That's a feature.
77 To make sure it is a feature it is mentioned in the POD.
78 http://rt.cpan.org/NoAuth/Bug.html?id=27277
80 Encode used to suppress man page generation. Now it does.
81 http://rt.cpan.org/NoAuth/Bug.html?id=27200
82 ! Encode.pm Encode.xs t/fallback.t
83 Addressed: (de|en)code("ascii", "\x{3000}", sub{ $_[0] }) segfaults
86 2.21 2007/05/12 06:42:19
87 + lib/Encode/MIME/Name.pm t/mime-name.t
88 ! Encode.pm Encode.xs lib/Encode/Encoding.pm
89 new method: mime_name()
92 Subject: Re: Compress::Zlib, pack "C" and utf-8 [PATCH]
93 From: Marc Lehmann <schmorp@schmorp.de>
94 Date: Thu, 12 Apr 2007 08:41:53 +0200
95 Message-ID: <20070412064153.GA22475@schmorp.de>
96 http://public.activestate.com/cgi-bin/perlbrowse/p/31194
99 Message-Id: <20070417220547.GA11999@zetta.zet>
101 2.20 2007/04/22 14:56:12
103 Pod fixes. Now find_encoding() is explained more in details.
104 + lib/Encode/GSM0338.pm
106 ! lib/Encode/Supported.pod lib/Encode/Config.pm Bytes/Makefile.PL t/gsm0338.t
107 ESTI GSM 03.38 support is relocated from Encode::Byte to Encode::GSM0338.
108 This encoding is so kaputt it is unfit for Encode::XS!
109 Though it was okay for general cases and escape sequences,
110 '\0' => '@' IFF '\0\0' => '\0' had gliches.
111 So kaputt even t/gsm0338 wrongly interpreted that.
112 ref. http://www.csoft.co.uk/sms/character_sets/gsm.htm
113 ! encoding.pm t/Aliases.t
114 Imported from bleedperl #31015
116 2.19 2007/04/06 12:53:41
117 ! lib/Encode/JP/JIS7.pm
119 encode('iso-2022-jp') fallback support added by MIYAGAWA++
120 decode()'s fallback remains unchanged (FB_PERLQQ) since UTF-8
121 contains all characters in iso-2022-jp so there's no need for fancy stuff.
122 Message-Id: <693254b90704060526s6d850320h71cdda50dfbf7eba@mail.gmail.com>
124 #25216 ([PATCH] Encode.pm: postpone the load of Encode::Encoding)
125 http://rt.cpan.org/NoAuth/Bug.html?id=25216
126 ! lib/Encode/MIME/Header.pm t/mime-header.t
127 #24418 (Encode::MIME::Header: wrong encoding with latin1 characters)
128 http://rt.cpan.org/NoAuth/Bug.html?id=24418
130 #23876 (Add documentation for LEAVE_SRC)
131 http://rt.cpan.org/NoAuth/Bug.html?id=23876
132 ! lib/Encode/Alias.pm t/Aliases.t
133 #20781: Thai encoding needs alias for tis-620
134 http://rt.cpan.org/NoAuth/Bug.html?id=20781
136 #20344: piconv: wrong conversion of utf-16le encoded files (with PATCH)
137 http://rt.cpan.org/NoAuth/Bug.html?id=20344
138 ! Encode.pm Encode.xs bin/enc2xs encoding.pm t/Aliases.t t/utf8strict.t
139 Imported from bleedperl's 2.18_01
141 2.18 2006/06/03 20:28:48
143 overhauled the -C option
144 - added ascii-ctrl', 'null', 'utf-8-strict' to core
145 - auto-generated Encode::ConfigLocal no longer use v-string for version
146 - now searches modules via File::Find so Encode/JP/Mobile is happy
147 ! Byte/Byte.pm CN/CN.pm EBCDIC/EBCDIC.pm JP/JP.pm KR/KR.pm Symbol/Symbol.pm
148 use strict added; though all they do is load XS, it's
149 still better a practice
151 use warnings added to all of them for better practices' sake.
153 2.17 2006/05/09 17:10:09
155 'chin' =~ /^zh_CN|chin(?:a|ese)?$/i is true
156 but chin is not china or chinese.
157 http://d.hatena.ne.jp/jankogai/20060508/1147090316
159 Integrated maintperl change (27824|27824) which I overlooked
160 -- sorry, Nicholas and Coverity Scan.
161 Message-Id: <200604152115.k3FLF1Ar014538@smtp3.ActiveState.com>
162 Message-Id: <200605091615.k49GF1gJ016777@smtp3.ActiveState.com>
164 2.16 2006/05/03 18:24:10
166 --xmlcref and --htmlcref added.
168 Copyright Notice Added.
169 http://rt.cpan.org/NoAuth/Bug.html?id=19056
171 Replaced remaining ^\t with q( ) x 4. -- Perl Best Practice pp. 20
172 And all .pm's are now perltidy-ed.
174 2.15 2006/04/06 15:44:11
176 Addressed: UTF-16, UTF-32, UCS, UTF-7 decoders mishandle illegal characters
177 http://rt.cpan.org/NoAuth/Bug.html?id=18556
179 added str2bytes() as an alias to encode() and bytes2str() as an alias
181 http://rt.cpan.org/NoAuth/Bug.html?id=17103
183 Change 26922: Avoid warning with MS Visual C compiler.
184 Message-Id: <200601231245.k0NCj2dw009484@smtp3.ActiveState.com>
186 Change 26067: As using -C to turn on utf8 IO is equivalent to the open pragma
187 Message-Id: <200511092227.jA9MRcYD009025@smtp3.ActiveState.com>
189 2.14 2006/01/15 15:43:36
191 Change 26295: Don't build manpages for Encode and Unicode::Normalize
192 Message-Id: <200512071540.jB7Fe4Gt017960@smtp3.ActiveState.com>
194 Change 26081: Pod nit in Encode.pm, found by Marc Lehmann in RT #36949.
195 Message-Id: <200511110357.jAB3vZcP023647@smtp3.ActiveState.com>
196 ! Encode.xs Encode/encode.h bin/enc2xs encengine.c
197 Change 25821: Mark more static Encode data structures as const.
198 Change 25823: use more 'const' in the Encode data structures.
199 Message-Id: <200510221243.j9MChTSu027711@smtp3.ActiveState.com>
200 Message-Id: <200510221343.j9MDhTk9001245@smtp3.ActiveState.com>
202 2.13 2006/01/15 15:06:36
204 Miyagawa's mail address updated
205 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
206 ! lib/Encode/MIME/Header.pm
207 #16413: Encode::MIME::Headers patch to solve what is probably someone else's bug
208 http://rt.cpan.org/NoAuth/Bug.html?id=16413
209 ! lib/Encode/MIME/Header.pm t/mime-header.t
210 Applied: RT #16258: Support for RFC 2184 language tag
211 http://rt.cpan.org/NoAuth/Bug.html?id=16258
213 Fixed RT #14559: fix for #8872 introduces new "bug"
214 http://rt.cpan.org/NoAuth/Bug.html?id=14559
217 from_to() now makes use of $check more naturally.
218 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
219 <B10B4DE2-9BAF-4344-B3BE-2119977D817A@dan.co.jp>
221 2.12 2005/09/08 14:17:17
222 ! Encode.xs Encode.pm t/fallback.t
223 Now accepts coderef for CHECK!
225 Updated to newer version at unicode.org
226 http://rt.cpan.org/NoAuth/Bug.html?id=14222
227 ! lib/Encode/Supported.pod
229 <42F5E243.80500@gmail.com>
231 More POD typo leftover fixed.
232 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com>
234 2.11 2005/08/05 10:58:25
236 To reflect changes below
237 ! Encode.pm encoding.pm
238 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod
239 Typo fixed by Piotr Fusik in Change 25261 & 25266
240 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec>
242 Addresses "BUG REPORT: panic in Encode.xs".
243 Message-Id: <42EDDA97.2010608@hyper.to>
244 + lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t
245 ! lib/Encode/MIME/Header.pm lib/Encode/Config.pm
246 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka
247 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp>
248 ! Encode/encode.h Encode.pm Encode.xs
249 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character.
250 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp>
251 See also ext/PerlIO/encoding/encoding.pm
252 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com>
254 2.10 2005/05/16 18:46:36
256 fixed decode_utf8() accordingly to RT#8872
257 http://rt.cpan.org/NoAuth/Bug.html?id=8872
259 s/SvIVX/SvIV_set/ by Steve Peters.
260 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com>
264 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict
266 Tests utf-8-strict, accordingly to
267 UTF-8 decoder capability and stress test" by Markus Kuhn
268 http://smontagu.damowmow.com/utf8test.html
269 Note that malformed and overlong sequences are not test here
270 because perl already does that for you, utf-8-strict or not.
271 ! Encode.pm Encode/encode.h t/fallback.t
272 Addressed "encode(..., Encode::LEAVE_SRC) does not work".
273 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so
274 you can (en|de)code constant strings with these fallbacks.
275 http://rt.cpan.org/NoAuth/Bug.html?id=8736
276 ! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t
277 Make Encode.pm support the real UTF-8, by GAAS
278 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com>
279 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com>
280 ! Encode.pm Encode.xs
281 post-2.09 comment patches from GAAS applied.
282 Message-Id: <lroehacz6q.fsf@caliper.activestate.com>
283 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com>
285 2.09 2004/12/03 19:16:53
286 ! Encode.pm Encode.xs
287 Addressed " :encoding(utf8) broken in perl-5.8.6".
288 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com>
290 Addressed "(de|en)code($valid_encoding, undef) does not warn".
291 http://rt.cpan.org/NoAuth/Bug.html?id=8723
292 ! Encode.pm t/Encode.t
293 Addressed "Can't encode URI". When a reference is fed to (en|de)code,
294 Encode now stringifies instead of returning undef.
295 http://rt.cpan.org/NoAuth/Bug.html?id=8725
296 ! Encode.xs t/fallback.t
297 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder".
298 http://rt.cpan.org/NoAuth/Bug.html?id=8694
300 Addressed "s/digit/number/".
301 http://rt.cpan.org/NoAuth/Bug.html?id=8695
303 Addressed "while (defined(read )) { ... } is an infinite loop".
304 http://rt.cpan.org/NoAuth/Bug.html?id=8696
306 Addressed "What the heck is UCM?".
307 Document fixed so that it no longer contains "UCM-Based Encodings".
308 http://rt.cpan.org/NoAuth/Bug.html?id=8697
310 2.08 2004/10/24 13:00:29
311 ! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs}
312 Resolved the issue that was raised by 2.07 -- Encode::utf8 fallbacks
313 that was introduce messed up PerlIO::encoding.
314 * To do so, ->renew() is renewed and ->renewed() was introduced to
315 tell whether the caller is PerlIO or not.
316 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp>
318 2.07 2004/10/22 19:35:52
319 ! lib/Encode/Encoding.pm
320 "Remove Carp from warnings.pm" that influences Encode, by Tels.
321 Message-Id: <200410161618.29779@bloodgate.com>
322 ! Encode.xs AUTHORS t/fallback.t
323 Now Encode::utf8's fallbacks are compliant to Encode standard.
324 Thank Bjoern Hoehrmann for persistently convincing me.
325 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
329 2.06 2004/10/22 06:23:11
331 RT #8083 reports that MacThai mapping was obsolete
332 Updated all mac* encodings accordingly to the URI below.
333 One remaining mystery is that MacRomanian vs. MacRumanian.
334 MacRumanian is not found in unicode.org...
335 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
336 ! Encode.pm t/Encode.t
337 Fixed RT #8081: "decode(..., bless{},'x') segfault"
338 Two more tests added to test that.
339 http://rt.cpan.org/NoAuth/Bug.html?id=8081
341 POD revised accordingly to RT #7966
342 http://rt.cpan.org/NoAuth/Bug.html?id=7966
344 POD updated explaining why Encode::Unicode always croaks on error
345 rather than giving users choices.
346 http://rt.cpan.org/NoAuth/Bug.html?id=7892
348 2.05 2004/10/19 04:55:01
350 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
351 Message-ID: <41210A84.6060506@iki.fi>
352 Message-ID: <20041018233442.7418113f@r2d2>
353 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
355 2.04 2004/10/16 21:22:44
357 From: craigberry@mac.com
358 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
359 Message-Id: <41716868.7000102@mac.com>
361 2.03 2004/10/06 05:07:20
362 ! lib/Encode/Alias.pm
363 Resolved some alias case sensitivity glitches reported via RT.
364 http://rt.cpan.org/NoAuth/Bug.html?id=7835
366 Resolved Win32 glitches reported via RT.
367 (Fixed by dankogai and tested by Steve Hay)
368 http://rt.cpan.org/Ticket/Display.html?id=7831
369 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
370 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
371 http://rt.cpan.org/NoAuth/Bug.html?id=6695
373 2.02 2004/08/31 10:55:34
374 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
375 New map submitted by Deng Liu and Autrijus. Test data needed
376 to be upgrade as well, done by dankogai
377 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
379 Now works for characters U+10000 and above. This fix was needed
380 to "tidy" the original map that was submitted.
382 "ucmsort" now mentioned in pod
384 2.01 2004/05/25 16:27:14
386 From: domo@computer.org
387 Subject: [PATCH] Correct statistics from enc2xs
388 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
389 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
390 ! lib/Encode/Alias.pm
391 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
392 <200405271148.i4RBm4KY026529@mail.mvnet.de>
394 2.01 2004/05/25 16:27:14
395 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
396 "If someone thinks utf8::upgrade($1) should be croaked like
397 chom?p($1),please try the following patch for Encode.pm."
399 <20040522212704.C068.BQW10602@nifty.com>
401 2.0 2004/05/16 20:55:15
402 * version updated to 2.00
403 -- sorry, no big feature change. I just hate version 1.100 :)
404 ! lib/Encode/Guess.pm
406 addressed UTF-(8|32LE) + BOM misguessing
407 https://rt.cpan.org/Ticket/Display.html?id=6279
409 s/is_utif8/is_utf8/ in POD
410 ! Encode/lib/Encode/CN/HZ.pm
411 Fixes "make test" failure after the patch to pp_hot.c
413 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
415 From: autrijus@autrijus.org
416 Subject: [PATCH] "piconv -C 512" badly broken
417 Message-Id: <1072870210.769.5.camel@localhost>
419 1.99 2003/12/29 02:47:16
421 find_encoding("UTF-16BE")->encode("abc") now null terminates
422 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
424 prototype bug in decode_utf8() fixed
425 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
426 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
427 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
429 Merged from maintperl@21987
431 1.98 2003/08/20 11:15:31
432 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
433 Dave Evans has found and corrected a bug in Encode::MIME::Header.
434 Test suite added by Dan Kogai.
435 Message-Id: <3F43440B.7060606@rudolf.org.uk>
437 Typo fixes rolled back in from bleedperl
438 ! t/at-cn.t t/at-tw.t
439 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
440 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
442 argv case nit for VMS by Craig
443 Message-ID: <3F2B02DE.10207@mac.com>
444 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
445 Encode test fixes for VMS by Peter Prymmer
446 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
447 ! lib/Encode/Alias.pm t/Aliases.t
448 koi-8 aliases bug detected and patched by sadahiro.
449 Further fix and test suite by dankogai
450 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
452 1.97 2003/07/08 21:52:14
453 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
454 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
455 $DEBUG replaced with DEBUG() so perl optimizes better,
456 by Rafael with further fixes by dankogai
457 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
458 ! lib/Encode/Aliases.pm
459 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
460 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
461 So new hash seeding introduced in bleedperl works.
462 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
463 ! lib/Encode/Guess.pm
464 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
465 automatic utf(8|16|32) guessing -- originally by Autrijus
466 Message-Id: <20030626162731.GA2077@not.autrijus.org>
468 Addressed the following;
469 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
470 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
472 1.96 2003/06/18 09:29:02
473 ! lib/Encode/JP/JP.pm t/guess.t
474 m/(...)/ in void context then $1 is considered a Bad Thing
475 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
477 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
480 More typecast from maintperl@19739
481 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
483 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
484 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
486 Now skips for EBCDIC platform.
487 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
489 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
490 have more than one C<.> in their filenames." -- perlport.
491 Message-Id: <3ED79E01.8050401@mac.com>
493 Found and fixed the back that -p,--perlqq does not work.
494 Induced by the change from Getopt::Std to Getopt::Long.
496 Addressed [cpan #2629] Wrong assumption in numeric comparison
497 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
498 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
499 lib/Encode/Encoding.pm t/perlio.t
500 ! API Change: ->new_sequence() => ->renew()
501 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
502 + Encode::XS and Encode::utf8 now supports ->renew()
503 + Encode::Encoding now documents this with examples
504 - Non-XS (en|de)code stripped out of Encode::Unicode
505 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
507 1.95 2003/05/21 08:41:11
509 Since bogus entries were found in iso-8859-6, all entries are
510 re-generated once again out of
511 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
512 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
513 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
514 + lib/Encode/Unicode/UTF7.pm
515 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
516 lib/Encode/Supported.pod
517 UTF-7 support is now added. With this Encode now has all transcoding
518 methods in Unicode::String.
520 1.94 2003/05/10 18:13:59
521 ! lib/Encode/MIME/Header.pm
522 A more sophisticated solution for double-encoding by dankogai
523 ! lib/Encode/MIME/Header.pm AUTHORS
524 Two bugs fixed by Bjoern Jacke
525 * "Double Encoding" was not possible
526 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
527 * encode("MIME-Q") had UTF-8 flag on
528 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
529 ! lib/Encode/MIME/Header.pm AUTHORS
530 Two occurances of "croak ()" fixed as "croak qq()".
531 Simon Cozens is added to AUTHORS as a result.
532 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
534 POD fixes that reflect enhancements by jhi
536 Two enhancements by jhi.
537 + Now uses Getopt::Long so it accepts long name options
538 (--from for -f, for example)
539 + New option: -r,--resolve
540 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
542 META.yml added upon request of Schwern
543 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
545 Enache Adrian removed upon request -- to live longer than Encode
546 and/or FreeBSD (toy-)?thread :)
547 Message-Id: <20030425015701.GA2069@ratsnest.hole>
549 "close STDOUT unless $^O eq 'freebsd';" once again relocated
550 to keep VMS happy in which case "$^O eq 'freebsd'" is required
551 to keep FreeBSD+thread happy. Sigh.
552 Message-Id: <3EA88ADC.3000300@mac.com>
554 1.93 2003/04/24 17:43:16
556 added "no warnings 'pack'" in for loop to keep bleedperl from
557 complaining "Character in 'C' format wrapped in pack".
559 More elegant perl core detection inspired by Ilya Zakharevich
560 (but further elaborated for general cases).
561 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
564 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
565 are less likely to be confused w/ ISO-8859-*
567 RT tests added (vendor encodings are exemplified)
568 -- that successfully found a flaw on iso-2022-kr before the patch.
569 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
570 decode("ISO-2022-KR") has been buggy but no one ever sited
571 that since no one seems to be using it. Bugs discovered by
573 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
574 ! lib/Encode/CN/HZ.pm t/perlio.t
575 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
576 so it adds test for HZ.
577 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
578 ! lib/Encode/Guess.pm
579 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
580 So long as the string contains \x{00}-\x{ff} it does not fail.
581 See perldoc for details.
582 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
584 1.92 2003/03/31 03:27:27
585 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
586 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
587 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
588 but glibc-BIG5-2.1.2.ucm does.
589 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
590 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
591 Maps regenerated again but this time based upon
592 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
593 (But where is THE DOCUMENT by MICROSOFT?)
594 ! t/enc_module.t AUTHORS
595 failure with threaded Perl on FreeBSD addressed.
596 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
597 Message-Id: <20030322230131.GA813@ratsnest.hole>
598 ! lib/Encode/Guess.pm
601 Change 18989: Make the :bytes conditional on PerlIO.
602 further Modified by Dan Kogai
603 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
605 Chnage 18966: another fix for failing test on windows ("use encoding"
606 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
607 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
609 Change 18970: Hopefully this works also in Win32, by jhi
610 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
611 Change 18965: fix CJKT.t failures on windows due to incorrect
613 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
615 1.91 2003/03/09 20:07:37
617 even more proofread by jhi.
618 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
621 +use lib qw(t ext/Encode/t ../ext/Encode/t);
622 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
624 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
625 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
628 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
630 1.90 2003/03/09 17:32:43
633 Inaba-san has added a patch for perl 5.8.1 or later that makes
634 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
635 that. POD is further revised.
636 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
637 ! encoding.pm t/enc_module.t
638 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
639 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
643 signed vs. unsigned issue discovered by Craig on OpenVM
644 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
645 ! encoding.pm AUTHORS
646 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
647 Because binmode() stacks layers instead of overwrite, you have to
648 ":raw :encoding()" in encoding.pm or your are in trouble when you
649 call encoding.pm multiple times. There are several workarounds
650 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
651 who was the first to address this problem was added to AUTHORS.
652 The test suites was added for this, which is a modified version
653 of SUGAWARA-san's scripts
654 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
656 1.88 2003/02/20 14:42:34
658 one signedness nit for Encode by jhi
659 <200302161933.h1GJX876018710@kosh.hut.fi>
661 VISCII map was incorrect; fixed by Sadahiro-san
662 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
663 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
664 You can't unlink files that are opened in cygwin but the last
665 file handle opened in t/enc_*.t left open. Patch submitted
666 by Yitzchak and he was added to AUTHORS.
667 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
669 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
670 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
672 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
673 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
675 1.87 2003/02/06 01:52:11
677 * Inaba "Sensei" Hirohito added (I thought I have done so a long
678 ago but apparently I did not).
679 * SUZUKI Norio added for verious and useful bug reports.
680 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
681 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
682 podchecked so all warnings are gone except for L<http://>.
683 ! encoding.pm t/enc_eucjp.t
684 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
685 * Filter option overhaul
687 ! Encode.pm Encode.xs encengine.c Encode/encode.h
688 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
689 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
690 without filter. podchecked by Dan Kogai.
691 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
692 ! lib/Encode/Alias.pm
693 decode('alias', $1) went wild because of local $_ in find_alias()
694 the evil local $_ is eradicated but that changes find_alias()
695 format for coderef aliasing. See Encode::Alias for details
696 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
698 1.86 2003/01/22 03:29:07
700 * Don't forget to canonize when you attempt an exact match!
701 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
702 * ${^ENCODING} exception is off for $] > 5.008
703 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
705 $] check commented out so it runs on 5.8.0
707 1.85 2003/01/21 22:19:14
709 ${^ENCODING} exception is now explicit rather than handled by regex.
710 + t/enc_eucjp.t t/enc_utf8.t
711 Test suite for the better "encoding" pragma support for bleedperl.
712 On 5.8.0, they will just be skipped.
714 1.84 2003/01/10 12:00:16
716 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
717 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
718 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
720 %_ fixes saves the resulting .so .05% smaller, by NC
721 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
723 Silence Encode on undef, by Andreas
724 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
725 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
727 s/regognised/recognised/ . British spelling left intact to pay
728 respect to two British Nicks :)
729 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
731 1.83 2002/11/18 17:28:49
732 ! Encode.xs lib/Encode/JIS7.pm
733 Even more patches from Inaba-san has been applied. With this
734 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
735 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
737 1.82 2002/11/14 23:06:12
739 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
740 later found in perls w/ -Dusemymalloc was fixed by NC.
741 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
743 1.81 2002/11/08 18:29:27
744 ! Encode.pm Encode.xs
745 Non-XS version of Encode::utf8 is back (with XS being default).
746 Encode::predefine_encodings(0) to turn off XS.
747 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
748 found that it was Test::More causing the problem, not Encode.
749 But I have already made it configurable so it may be useful in
751 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
753 The ingenious patch by Nicholas Clark that reduces shlib sizes by
754 50% with no penalty and backward compatibility preserved, is in.
755 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
757 1.80 2002/10/21 20:39:09
758 ! Encode.xs t/mime-header.t
759 Even more patches from NI-XS regarding Encode::utf8->decode().
760 And one more test to t/mime-header.t to prove it
761 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
763 1.79 2002/10/21 06:05:37
765 Further patches from NI-XS. Encode::utf8->decode() now checks the
766 value of utf8 flag of the argument. As a result, the fix to
767 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
768 no harm (even speedwise) I'll leave it unreverted.
769 ! ucm/cp949.ucm ucm/cp950.ucm
771 U+00AE REGISTERED SIGN
772 were missing as a result of 1.78. Discovered by Moriyama-san.
773 Moriyama-san has also developed a test script that compares
774 (en|de)coded results to the corresponding Win32 API result and
775 all cp9?? maps are now verified.
776 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
778 1.78 2002/10/20 15:44:00
779 ! lib/Encode/MIME/Header.pm
780 fixed so that it works with new Encode::utf8
781 ! Encode.pm Encode.xs
782 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
783 :encoding(UTF-8) to handle partial chars at end of buffers
785 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
786 ! lib/Encode/Supported.pod
787 More nitpickings applied.
790 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
791 tests were completely useless. To redeem that and get the peace of
792 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
793 ucm/*.ucm Since this script takes too long to finish (30 seconds on
794 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
795 but you can easily run that by either renaming it or:
797 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
798 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
799 Other CJKT cp9?? also updated according to the URI below;
800 http://www.microsoft.com/typography/unicode/cscp.htm
801 + bin/ucmsort MANIFEST
802 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
803 to proper order. intended for hardcore develpers only.
804 ! ucm/cp932.ucm JP/JP.pm AUTHORS
805 CP932 mapping which was based upon the mapping file at unicode.org
806 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
807 has also supplied the patch so he was added to AUTHORS.
808 ! lib/Encode/Supported.pod
809 ISO-8859-11 != TIS 620
810 == TIS 620 + \xA0 ( )
812 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
814 1.77 2002/10/06 03:27:02
816 * Modified to accomodate up and comming patch by Inaba-san that
817 will fix tr/// needing eval qq{}
818 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
820 * pod fixes/enhancements to reflect the changes above
821 ! lib/Encode/Alias.pm
822 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
823 Message-Id: <20021001015648.GB18710@not.autrijus.org>
825 1.76 2002/08/25 15:09:51
827 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
828 but naturally identical to previous version -- dankogai
830 Codepoint fixes -- autrijus
831 Message-Id: <20020805040236.GC5220@not.autrijus.org>
833 copied everything under perl-5.8.0/ext/Encode to make sure Encode
834 is in sync w/ perl core
836 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
837 Now that binmode(FH) does implicit ":bytes" revisit
838 the failing tests. The worrisome one is the Digest::MD5
839 test-- how will it fare in CRLF lands now?
841 From: Radu Greab <radu@netsoft.ro>
842 Date: Mon, 10 Jun 2002 00:40:34 +0300
843 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
844 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
845 ones which I cannot figure out.
846 ! lib/Encode/Alias.pm
847 Subject: [Encode PATCH] spurious warning
848 From: Nicholas Clark <nick@unfortu.net>
849 Date: Sun, 2 Jun 2002 20:26:22 +0100
850 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
852 1.75 2002/06/01 18:07:49
853 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
854 glibc compliance cited by Autrijus.
855 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
856 ! bin/enc2xs bin/piconv
857 Subject: Re: forewarning: usedevel and versiononly
858 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
860 1.74 2002/05/28 18:33:15
861 + ucm/null.ucm ucm/ctrl.ucm
862 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
863 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
864 Subject: Re: unicode -> &# notation
865 Message-ID: <20020518193704.GB40272@not.autrijus.org>
867 1.73 2002/05/28 17:26:18
868 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
869 Chris Nandor has fixed Encode so that it works w/ MacPerl --
870 at least w/ PPC (68k need static linking which does not work due to
871 64k limit). pudge is added to AUTHORS (I'm surprised he was not
872 there in the list). Encode/Makefile_PL.e2x was additionally fixed
873 by dankogai to reflect changes in other Makefile.PL
874 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
876 Subject: Change 16746: -Mutf8 cleanup.
877 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
879 1.72 2002/05/20 15:49:56
881 Subject: [PATCH] Encode should be in perl-core library path
882 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
883 Message-Id: <20020520161201.A11019@alpha.hut.fi>
884 ! lib/Encode/MIME/Header.pm
885 Subject: [PATCH] Encode::MIME::Header
886 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
887 ! Encode/Makefile_PL.e2x
888 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
889 Message-Id: <20020519201031.GA1603@not.autrijus.org>
890 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
891 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
893 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
894 Subject: [PATCH] good day for WinCE port of perl.
895 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
898 Even more linting by Robin via @16532
900 Even more typecast by Sarathy in @16460
902 1.71 2002/05/07 16:22:42
904 even more typecasts by Robin
905 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
907 A very strange bug that was causing a bugus ucm -> C table
908 generation that was revealed by a UCM file that Andreas was
909 working. This is the king of wierdest bug I've encountered
910 in the course of Encode maintenance.
911 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
913 1.70 2002/05/06 10:26:48
915 Made more 'module-safe' with conjunction w/ 'no encoding'.
916 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
917 ! lib/Encode/Encoding.pm
918 'require Encode' because ->Define uses Encode::define_encoding();
919 problem and solution addressed by Miyagawa-kun
920 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
922 Cuts the frill to make djgpp happier, as suggested by Laszlo
923 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
925 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
926 Message-Id: <m3bsbug48n.fsf@anima.de>
928 1.69 2002/05/04 16:41:18
929 ! lib/Encode/MIME/Header
930 Floating-point coerced for UNICOS (in integer arithmetics it folds
931 line one character too early). Verification by Mark is pending.
932 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
934 more doc patch from Elizabeth
935 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
936 ! Encode/Makefile_PL.e2x
937 More platform-independent patch from Benjamin
938 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
939 ! lib/Encode/Guess AUTHORS
940 split regex fix by Graham Barr. Adds him to AUTHORS.
941 Message-Id: <20020504085419.E95940@valueclick.com>
942 ! Encode/Makefile_PL.e2x
943 enc2xs script discovery made smarter and more sensible, first cited
944 by Miyagawa-kun and further suggestions by Rafael and Andreas
945 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
946 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
948 1.68 2002/05/03 12:20:13
949 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
950 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
951 and Supported.pod modified to reflect the change. Elizabeth added
952 to Authors. And H.M. is also added for forwarding her patch among
953 other contributions (I was rather surprised to find his name was not
955 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
957 1.67 2002/05/02 07:33:09
959 Error message now consistent w/ perlqq (\N{U+} -> \x{})
960 done in perl@16308 but Philip linted me further. Now the error
961 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
962 ! lib/Encode/Guess.pm
963 Sanity check for happier -w by Autrijus
965 1.66 2002/05/01 05:41:06
966 ! Encode.xs t/fallback.t
967 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
968 while fallback is in effect. This even came with a welcome side-effect
969 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
970 also modified to test this.
971 And of course, the corresponding varialbles to UV[Xx]f are appropriately
972 cast. This should've concluded NI-XS homework.
974 encode(undef) does warn again! Repented upon suggestion by NI-XS.
975 Document for unless vs. '' added
976 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
978 1.65 2002/04/30 16:13:37
980 encode(undef) no longer warns for C<Use of uninitialized value in
981 subroutine entry>. Suggested by Paul.
982 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
983 ! lib/Encode/Supported.pod
984 Encode::MIME::Header and Encode::Guess mentioned
985 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
986 ! lib/Encode/Guess.pm
987 POD fix by Miyagawa-kun
988 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
990 1.64 2002/04/29 06:54:06
992 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
993 encocde("euc-jisx0213") are still DIFFERENT.
994 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
996 A few white spaces corrected by NI-XS via PerlIO integration to
998 Subject: Change 16247: Integrate perlio;
1000 Document fixes by Andreas
1001 Message-Id: <m3k7qsf1we.fsf@anima.de>
1003 1.63 2002/04/27 18:59:50
1004 ! lib/Encode/Encoding.pm
1005 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
1006 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
1007 Make use of the Encode::Encoding base class!
1008 And other cleanups in Encode.xs upon NI-XS suggestions
1009 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
1011 1.62 2002/04/27 11:17:39
1013 encodings() now just check %ExtModule instead of eval{require}
1014 all of them for ":all" to conserve more memory.
1016 more "%x" -> "%" UVxf stuff.
1018 s/=over2/=over 2/g # oops.
1020 1.61 2002/04/26 03:02:04
1022 Now does decent tests besides use_ok()
1023 ! lib/Encode/Guess.pm t/guess.t
1024 UI streamlined, document added
1025 ! Unicode/Unicode.xs
1026 various signed/unsigned mismatch nits (#16173)
1027 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
1029 POD: utf8-flag-related caveats added. A few sections completely
1033 Thou shalt not assume %d works, either!
1034 Robin Baker added to AUTHORS for this
1035 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
1037 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
1039 1.60 2002/04/24 20:06:52
1041 "Thou shalt not assume %x works." -- jhi
1042 Message-Id: <20020424210618.E24347@alpha.hut.fi>
1043 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
1044 low-memory build machines happy, now *.c is created for each *.ucm
1045 (no table aggregation). You can still override this by setting
1046 $ENV{AGGREGATE_TABLES}.
1047 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
1048 + lib/Encode/Guess.pm
1049 + lib/Encode/JP/JIS7.pm
1050 Encoding-autodetect (mainly for Japanese encoding) added. In a
1051 course of development, JIS7.pm was improved.
1052 + lib/Encode/HTML/Header.pm
1053 + lib/Encode/Config.pm
1054 MIME B/Q Header Encoding Added!
1055 ! Encode.pm Encode.xs t/fallback.t
1056 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
1057 Message-Id: <20020424130709.GA14211@tanglefoot>
1059 1.59 $ 2002/04/22 23:54:22
1060 ! Encode.pm Encode.xs
1061 needs_lines() and perlio_ok() are added to Internal encodings such
1062 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
1064 1.58 2002/04/22 23:54:22
1066 s/MacChineseSimp/MacChineseTrad/ # ... oops.
1071 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
1072 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
1073 t/(JP|KR).t are aggregated to t/CJKT.t
1074 test data are all remade via bin/ucm2text.
1075 And .... They are no longer skipped for -Uuseperlio !
1077 1.57 2002/04/22 20:27:30
1078 ! t/JP.t t/KR.t t/perlio.t
1079 unless (find PerlIO::Layer 'perlio') ... line is back again.
1080 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
1081 need PerlIO. Sigh....
1082 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
1083 ->perlio_ok now does eval{ require PerlIO::encoding } there so
1084 it correctly returns 1 when PerlIO::encoding is yet loaded.
1086 perl-current patch #16072 reflected
1088 1.56 2002/04/22 09:48:07
1089 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
1090 New PerlIO::encoding 0.04 compliance met
1092 1.55 2002/04/22 03:43:05
1093 ! Encode.pm Encode.xs Unicode/Unicode.pm
1094 needs_lines() defined so Encode::Encoding is no longer needed
1097 1.54 2002/04/22 02:50:01
1098 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
1099 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
1100 now perlio_ok is true by default if PerlIO::encoding->VERSION is
1101 0.03 or larger. POD in Encode::Encoding revised to reflect this.
1102 Encode::XS and Encode::Unicode now has perlio_ok() method.
1103 ! lib/Encode/Supported.pod
1104 s/UP-UX/HP-UX/ by jhi
1105 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
1106 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
1107 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
1108 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
1109 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
1110 ! lib/Encode/Supported.pod
1111 Huge document fixes by Philip.
1114 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
1115 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
1117 binmode() after "<:encoding" to make Win32 happy, by Mattia.
1118 Mattia added to AUTHORS file
1119 Message-Id: <3CC3150F.5798.22A05AE@localhost>
1121 1.52 2002/04/20 23:43:47
1123 TODO: is now SKIP:, as NI-XS requested. Also adds more
1124 eraborate failure analysis added.
1126 A note on how to make sure of round-trip safety added to POD
1127 section (so Autrijus is happier)
1128 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
1129 big5-(eten|hkscs) is round-trip safe again!
1130 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
1132 Typo fixes by Andreas
1133 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
1134 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
1135 PerIO coodination patches from NI-XS.
1136 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
1138 1.51 2002/04/20 09:58:23
1140 Updated test suite by Autrijis so "make test" is happy again
1141 Message-Id: <20020420082104.GA25037@not.autrijus.org>
1143 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
1145 TW/TW.pm TW/Makefile.PL
1146 Updates by Autrijus. 'big5' is no longer a canonical but an
1147 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
1148 Message-Id: <20020419195346.GA19597@not.autrijus.org>
1150 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
1151 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
1153 PerlIO detection a little bit smarter; no longer uses eval qq{}
1156 1.50 2002/04/19 06:13:02
1157 ! ! Encode.pm Encode.xs Encode/encoding.h
1159 New Fallback API imlemented and documented. See "perldoc Encode"
1161 ! lib/Encode/JP/JIS7.pm Encode.pm
1162 + lib/Encode/PerlIO.pod t/perlio.t
1163 API compliance met. However, it still does not work unless perlio
1164 implements line buffer. See BUGS section in perldoc Encode::PerlIO
1165 As a sensible workaround, perlio_ok() added to Encode.
1167 ! lib/Encode/Supported.pod
1169 Message-Id: <20020418174647.J8466@alpha.hut.fi>
1171 Doc fixes from Autrijus
1172 Message-Id: <20020418144131.GA10987@not.autrijus.org>
1174 perlqq mode documented
1176 + t/jisx0201.euc t/jisx0201.ref
1177 ! t/jisx0208.euc t/jisx0208.ref
1178 t/JP.t tests more rigorously and with other encodings
1179 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
1180 in jis0208 (used to contain jisx0201 part).
1181 ! Encode/Makefile_PL.e2x
1182 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
1183 enc2xs and encode.h rather than hard-coded. This allows the resulting
1184 module fully CPANizable.
1185 ! encoding.pm t/JP.t t/KR.t
1186 PerlIO detection simplified (checks %INC instead of eval{})
1187 ! Encode.xs Encode/encode.h
1188 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
1189 - lib/Encode/Unicode.pm
1190 (en|de)code_xs relocated to where it belongs. Source reindented
1193 Additional (U8 *) cast added as suggested by jhi
1194 Message-Id: <20020417165916.A28599@alpha.hut.fi>
1196 1.42 Date: 2002/04/17
1198 no-op module; Thought of adding a pod there but enc2xs has
1203 correct mechanism to detect Perlio::encoding layar installed.
1205 PerlIO Layer detached.
1207 1.41 2002/04/16 23:35:00
1209 binmode(STDIN|STDOUT ...) done iff PerlIO is available
1211 Cleaned up PerlIO skip conditions to prepare for the upcoming
1212 Encode - PerlIO forking.
1214 exported functions are now prototyped.
1215 ! lib/Encode/CN/HZ.pm
1218 fallback implemented # was /* FIXME */
1219 affected programs revised to fit (only HZ was using the try-catch
1220 approach which needed to be fixed for API-compliance).
1222 ! Encode/KR/2022_KR.pm
1224 can find =head1 NAME now, jhi
1225 Message-Id: <20020416083059.V30639@alpha.hut.fi>
1229 now complies with less warnings with the pickest compilers.
1230 Suggested by Craig, fixed by Dan.
1231 ! Encode/Makefile_PL.e2x
1233 A bug that fails to find *.e2x in certain conditions fixed
1235 1.40 2002/04/14 22:27:14
1236 + Encode/ConfigLocal_PM.e2x
1237 ! lib/Encode/Config.pm
1239 "enc2xs -C" now generates/updates Encode::ConfigLocal.
1240 ConfigLocal_PM.e2x is a skelton thereof.
1241 ! lib/Encode/Config.pm
1243 "use Encode::CN::HZ;" was missing.
1246 More rigorous tests added to test XS, especially on memory allocation.
1248 ! lib/Encode/Unicode.pm
1249 NI-S implemented an XS version -- merged
1250 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
1253 Source filter option added. With this option on, you can write
1254 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
1255 encodings. t/jperl.t enhanced to test this feature.
1257 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
1258 apply his suggestion, this degree of nitting is enough to add him
1260 Message-Id: <3CB93223.291E5E2E@earthlink.net>
1262 + lib/Encode/JP/JIS7.pm
1263 - lib/Encode/JP/JIS.pm
1264 - lib/Encode/JP/2022_JP.pm
1265 - lib/Encode/JP/2022_JP1.pm
1266 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
1267 JIS7.pm for better maintainability and performance
1269 Added caveat for non-ascii identifiers.
1271 fixes by jhi, the original author of this pragramtic module.
1272 Message-Id: <20020413231527.V1826@alpha.hut.fi>
1274 1.34 2002/04/12 20:23:05 (Unreleased)
1277 EBCDIC fixes addressed by jhi.
1278 Message-Id: <20020412161844.D9383@alpha.hut.fi>
1279 ! lib/Encode/Encoder.pm
1280 POD fix by Miyagawa-kun
1281 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
1283 1.33 2002/04/10 22:28:40
1285 Philip's mail address corrected.
1288 ! lib/Encode/Encoder.pm
1289 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
1290 This adds Miyagawa-kun to AUTHORS.
1291 * encoding() no longer exported by default but on demand
1292 * t/Encoder.t updated to test all these
1293 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
1294 ! lib/Encode/Unicode.pm
1295 ! lib/Encode/Supported.pm
1296 Further doc fixes by Anton
1298 1.32 2002/04/09 20:06:15
1301 - ucm/macDevanaga.ucm Unicode Character Map
1302 - ucm/macGujarati.ucm Unicode Character Map
1303 - ucm/macGurmukhi.ucm Unicode Character Map
1304 A utility to check integrity of .ucm files. t/bogus.ucm is a
1305 ucm that is deliberately bogus. unused Indic mappings are removed
1308 resolve_alias() added as suggested by jhi. Same as
1309 find_encoding("alias")->name. For convenience. This one is
1310 defined in Encode.pm instead of Alias.pm.
1311 Message-Id: <20020409215846.H17022@alpha.hut.fi>
1313 Memory Allocate but detected during the devel of ucmlint -- fixed.
1314 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1315 ! lib/Encode/Unicode.pm
1316 valid_ucs2(0) is false but must be true.
1317 3 patches from NI-S as follows. This also has fixed the incident
1319 ! lib/Encode/Alias.pm
1320 find_alias() recursion prevention
1322 Checks for the patch above
1323 ! t/Encode/Unicode.pm
1324 An extra "F" that causes valid_ucs2() return a bogus value fixed
1325 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1326 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1327 2 Small Patches from jhi as follows:
1329 Encode->encodings() lists in case-insensitve order (as it was)
1331 -l option prints avaiable encodings to STDOUT instead of STDERR
1332 ! lib/Encode/Aliases.pm
1333 s/defintion/definition/
1334 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1336 ! lib/Encode/Supported.pod
1337 ! lib/Encode/Unicode.pm
1338 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1339 Thank you for the exact quote of Douglas Adams :)
1340 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1342 1.31 2002/04/08 18:08:07
1343 ! lib/Encode/Encoder.pm
1345 Encode::Encoder, once just a placeholder of an idea, is now much more
1346 practical. See t/Encode.t to find how practical it can be.
1347 + lib/Encode/Config.pm
1349 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1350 in Encode::Config and they are all literally, not programatically
1351 set. My false laziness was resulting many encodings missing from
1353 ! lib/Encode/Unicode.pm
1355 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1356 so that it does not rely Encode::Unicode for BOM values
1357 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1359 1.30 2002/04/08 02:34:51
1360 + lib/Encode/Encoder.pm
1361 Object Oriented Encoder. I reckon something like this is in need.
1364 ! lib/Encode/Supported.pod
1365 * autoloading bug that prevented upper-case canonicals such as UTF-16
1366 is fixed. Now even UTF/UCS are autoloaded!
1367 * encodings() is now more intuitive.
1368 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1370 * Obligatory fixes to the POD.
1371 ! lib/Encode/Supported.pod
1372 Patch from Anton applied.
1373 Message-Id: <66641479.20020408033300@motor.ru>
1375 ! lib/Encode/Unicode.pm
1376 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1378 1.28 2002/04/07 18:58:42
1382 Just a MANIFEST for those missing files.
1384 1.26 Date: 2002/04/07 15:22:04
1387 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1388 And t/Aliases.t fixed to make sure they all exist.
1390 1.25 2002/04/07 15:01:25 (Unreleased)
1392 ! lib/Encode/Unicode.pm
1395 - lib/Encode/UTF_EBCDIC.pm
1396 - lib/Encode/Internal.pm
1397 - lib/Encode/utf8.pm
1398 Integrated into Encode.pm as closures. That way "one package, one file"
1399 rule is preserved yet less files to require.
1401 commented out binmode(STDERR ...
1405 ! EBCDIC/Makefile.PL
1408 ! Symbol/Makefile.PL
1410 ! Encode/Makefile_PL.e2x
1411 Schwarn's MM-compliance patch merged
1412 Message-Id: <20020406082609.GA28758@blackrider>
1414 ! lib/Encode/Unicode.pm
1415 + lib/Encode/UTF_EBCDIC.pm
1417 - lib/Encode/10646_1.pm
1418 - lib/Encode/ucs2_le.pm
1419 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1420 per-module based (en|de)code, I saved a number of .pm by
1421 reorganizing it as per-object base (Well, this is what Encode::XS
1422 does under the hood). See Encode::Unicode for details.
1423 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1424 This module is used only on EBCDIC environments.
1426 1.21 2002/04/05 14:46:34 (Not Released)
1432 Are back to make Perl/Tk happy Smile, NI-S.
1434 ! lib/Encode/Alias.pm
1435 ! lib/Encode/Supported.pm
1436 ! lib/Encode/10646_1.pm
1437 ! lib/Encode/ucs2_le.pm
1438 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1439 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1440 updated to reflect changes.
1441 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1442 ! lib/Encode/Alias.pm
1443 ! lib/Encode/Supported.pm
1444 Anton's revision commited. Added Dan's own fixes as well.
1445 Message-Id: <159103166906.20020405161134@motor.ru>
1446 ! lib/Encode/Alias.pm
1448 < qr/^UCS2-le$/i => '"UCS-2"', );
1450 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1451 Sigh. Thank you, Anton.
1452 Message-Id: <14567692196.20020405062020@motor.ru>
1453 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1455 1.20 2002/04/04 19:50:52
1457 the last minute addtion. Just give it a try. Docs remains to be done.
1458 Not installed by default.
1459 ! lib/Encode/Supported.pod
1462 ! lib/Encode/Alias.pm
1463 ! lib/Encode/utf8.pm
1464 ! lib/Encode/10464_1.pm
1465 ! lib/Encode/ucs2_le.pm
1466 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1467 unchanged but UTF-16BE is added as an alias. Implicit aliases
1468 move to Encode::Alias so init_alias() works more as expected.
1469 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1470 Though pedantically wrong, This should make perl mongers happier.
1471 t/Alias.t is enhanced to test all these.
1472 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1474 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1475 of the codepoints are common. CJKT left as is because the saving is
1479 ! EBCDIC/Makefile.PL
1481 ! Encode/Makefile_PL.e2x
1485 ! Symbol/Makefile.PL
1489 All occurance of _def.h replaced with .exh so djgpp works happily
1490 ever after! To credit this amazing discovery, Laszlo is now in
1492 Message-Id: <20020403181424.GA8778@freemail.hu>
1493 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1496 ! Encode/Makefile_PL.skel
1498 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1499 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1501 Two more tests by added jhi
1502 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1505 The showstopper fixed -- Memory reallocation bug was causing
1506 Encode::XS to fall into infinite loop on certain conditions.
1507 t/grow.t tests that.
1508 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1513 ! lib/Encode/Supported.pod
1514 Vendor encodings rebuilt out of original map files at unicode.org.
1515 Indic languages such as MacDevanagali remain unspported do to the
1516 shortcoming of encengine capabilities (they need algorithmical
1517 conversion and I have no knowledge on that!). Pods fixed for added
1519 Oh, macJapan.ucm renamed to macJapanese.ucm.
1520 macROMnn is macRomanian and macRUMnn is macRumanian.
1521 txt2ucm is a crude script that is used to convert them.
1523 Unicode Compound Characters (used extensively on Mac) supported
1525 Typo fixes and improvements by jhi
1526 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1528 1.11 2002/03/31 22:12:13
1532 Missing files from the MANIFEST fixed.
1533 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1534 Version incremented just to make CPAN happy.
1536 1.10 2002/03/31 21:32:42
1539 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1540 if you want. This should make Autrijus happy. Also, piconv
1541 is added to default install.
1544 Here-documented files that enc2xs generates are now exported
1545 to *.e2x. Much cleaner and easier to debug.
1547 encoding enhances so you can make it act more like such
1548 (now prehistoric ) "localized" variations of perl like Jperl.
1550 Further test for encoding.pm. Written in euc-jp
1553 Taken over form jhi.
1554 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1559 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1566 ! Encode/macIceland.ucm
1567 ! lib/Encode/Alias.pm
1568 ! lib/Encode/Supported.pod
1569 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1570 Message-Id: <m3lmcavhjt.fsf@anima.de>
1572 1.01 2002/03/29 20:59:39
1575 s/USE_SCRIPTS/MORE_SCRIPTS/
1577 installs enc2xs by default for external Encode:: modules in CPAN,
1578 such as Encode::HanExtra
1580 More sensible perl core detection via $ENV{PERL_CORE}
1582 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1584 Perl core ditection via $^X =~ m/\bminiperl$/o
1585 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1587 1.00 Wed Mar 29 2002
1589 The version of all files is updated to 1.00 via "ci -f -l1.00",
1590 commemorating version 1.00. All files, including *.ucm are now
1591 under version control.
1594 encode.h moved to Encode/ so it will be installed for the later
1597 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1602 compile renamed to enc2xs.
1603 Affected Makefle.PL updated
1605 "Punt it. HanExtra can take care of that later." -- Autrijus
1606 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1609 ! Encode/ksc5601.ucm
1610 ! lib/Encode/CJKConstants.pm
1611 ! lib/Encode/KR/2022_KR.pm
1612 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1613 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1617 bin/ added for example scripts. They are not installed by default.
1618 to install them, "perl Makefile.PL USE_SCRIPTS".
1619 piconv is iconv reinvented in perl. in addition to all features
1620 of iconv, it also adds perlish features. See L<piconv/1> for more
1622 ! lib/Encode/Alias.pm
1623 qr/^ replaced with qr/\b so it directly matches locale names
1624 such as en_US.US-ASCII
1627 Patch by MJD to fix the following problem applied.
1628 Subject: [PATCH 5.7.3 Encode]
1629 Aliases.t not properly skipped when Encode extension not built
1630 Message-Id: <20020328091850.18677.qmail@plover.com>
1631 ! lib/Encode/KR/2022_KR.pm
1632 ! lib/Encode/CJKConstants.pm
1633 Another patch from Jungshik to make iso-2022-kr actually work
1634 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1635 ! Encode/Encode/euc-kr.ucm
1636 + Encode/Encode/johab.ucm
1637 ! Encode/Encode/ksc5601.ucm
1639 ! Encode/KR/Makefile.PL
1640 ! Encode/lib/Encode/Alias.pm
1642 Johab support and complete revision of Korean Encoding by Jungshik
1643 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1645 Revised to make up with now-dropped Encode::Details.
1646 - lib/Encode/Details.pod
1647 Dropped. Besides being obsolete, the topics are now covered in
1648 respective pods now.
1653 Korean aliases fixed thanks to Jungshik Shin
1654 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1655 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1657 =head1 NAME added to all modules to make buildtoc happy
1658 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1659 - lib/Encode/CJKguide.pod
1660 Too controversial and dropped from the dist. Will be available
1661 separately on the web.
1663 RCS tags added so table debugging gets easier (should that be
1664 needed! I hope they all stay 1.00!)
1665 + lib/Encode/CJKguide.pod
1666 A detailed guide to mainly, but not limited to, CJK multibyte
1669 + Encode/hp-roman8.ucm
1671 ! Encode/Supported.pod
1672 All occurance of "roman8" replaced with "hp-roman8" to avoid
1674 ! Encode/Supported.pod
1677 Mac Encodings now comply the Inside Macintosh
1679 Test for '-raw' conventions added.
1681 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1682 ! Encode/gb12345.ucm
1684 ! Encode/ksc5601.ucm
1685 "-raw" appended to canonical names.
1686 File mames stay unchanged thanks to UCM format.
1687 ! lib/Encode/CN/HZ.pm
1688 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1689 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1691 0.99 Tue Mar 26 2002
1692 - lib/Encode/JP/Const.pm
1693 + lib/Encode/CJKConstants.pm
1694 + lib/Encode/CN/2022_CN.pm
1695 + lib/Encode/KR/2022_KR.pm
1704 * Support for ISO-2022-KR and ISO-2022-CN added.
1706 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1707 * ucm2table autogenerates character table out of UCM files.
1710 - lib/Encode/Supports.pod
1711 + lib/Encode/Supported.pod
1712 Names reverted due to popular demand.
1713 8.3 rule applies only when there is a conflict.
1714 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1719 - lib/Encode/Format/Enc.pod
1721 * Character tables is now 100% ucm.
1722 * All files under Encode/ is now 8.3-compliant
1723 * some of missing encodings added (i.e. gsm0338 and nextstep)
1724 * Vendor mappings aggregated with appropriate national std in
1725 Makefile.PL, resulting smaller *.so especially for CJK.
1726 Following is result on Dan's FreeBSD box.
1728 ---------------------------------------------------------------
1729 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1730 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1731 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1732 blib/arch/auto/Encode/Encode.so 27,791 27,791
1733 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1734 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1735 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1736 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1737 ---------------------------------------------------------------
1738 Total 5,375,297 6,343,819
1740 * As a result of ucm-transition, Encode::Tcl dropped because
1741 Encode::Tcl demands *.enc.
1742 Encode::Tcl will be supplied in a separate tarball with *.enc.
1743 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1748 -lib/Encode/Supported.pod
1749 +lib/Encode/Supports.pod
1750 -lib/Encode/iso10646_1.pm
1751 +lib/Encode/10646_1.pm
1752 -lib/Encode/EncFormat.pod
1753 +lib/Encode/Format/Enc.pod
1754 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1755 - lib/Encode/JP/Constants.pm
1756 + lib/Encode/JP/Consts.pm
1757 ! lib/Encode/JP/JIS.pm
1758 ! lib/Encode/JP/H2Z.pm
1759 Version nit problem and 8.3 rule fix.
1760 > Package namespace installed latest in CPAN file
1761 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1762 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1763 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1764 accordingly. In addition, legacy "use vars qw()..." are replaced with
1766 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1767 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1769 - lib/Encode/JP/ISO_2022_JP.pm
1770 - lib/Encode/JP/ISO_2022_JP_1.pm
1771 + lib/Encode/JP/2022_JP.pm
1772 + lib/Encode/JP/2022_JP1.pm
1774 8.3 naming conflict for vanilla fat addressed by jhi
1775 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1778 Typecast fix addressed by jhi
1779 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1781 0.98 Mon Mar 25 2002
1782 ! lib/Encode/Supported.pod
1784 + lib/Encode/JP/ISO_2022_JP_1.pm
1785 ! lib/Encode/JP/ISO_2022_JP.pm
1786 ! lib/Encode/JP/JIS.pm
1788 Now Encode::JP is more strict on the difference between ISO-2022-JP
1789 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1790 makes Anton happier :) FYI the previous version implements
1791 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1792 ! lib/Encode/Supported.pod
1795 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1796 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1801 ! lib/Encode/Suppoted.pod
1802 pod fixes to replace F<http://...> to L<http://...>,
1803 as suggested by Autrijius in:
1804 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1805 ! lib/Encode/Suppoted.pod
1806 fixes and enhancements by Anton
1807 Message-Id: <10632060120.20020324103753@motor.ru>
1808 ! lib/Encode/Alias.pm
1809 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1810 added. Suggested by Anton then deobfuscated by Autrijius
1811 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1813 Further fix by Nicholas Clark
1814 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1815 - lib/EncodeFormat.pod
1816 + lib/Encode/EncFormat.pod
1818 File renamed as suggested by Autrijius
1820 ! lib/Encode/Details.pod
1821 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1822 ! Encode.pm Sun Mar 24 13:43:47 2002
1823 pod fixes by Autrijius.
1824 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1825 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1827 ! lib/Encode/Alias.pm
1829 now more EBCDIC conscious;
1830 %ExtModules on EBCDIC system excludes CJK so that you don't
1831 have to worry about the matched alias resulting cloaking.
1832 t/Alias.t also revised to reflect changes. Verified by jhi
1833 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1835 0.97 Sun Mar 24 2002
1839 EBCDIC detection mechanism installed as in JP/JP.pm
1840 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1843 ! EBCDIC/Makefile.PL
1846 ! Symbol/Makefile.PL
1848 Now all table files used by compile are postfixed '_t' to avoid
1849 namespace collisions in case insensitive file systems once for all!
1851 Message-ID: <58290227735.20020323195659@familiehaase.de>
1853 Since the Encode::JP is unsupported under EBCDIC we
1854 cannot run this test (aliases as such should work fine) -- jhi
1855 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1857 duplicate occurance of ascii.ucm and 8859-1.ucm
1858 causes MacOS X dlyd to cloak
1864 < chdir 't' if -d 't';
1866 > if (! -d 'blib' and -d 't'){ chdir 't' };
1867 When you are "make test"-ing on Encode/ directory, you must not
1868 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1869 remain unchanced. Also the situation detection was made simpler
1870 in t/JP.t, which was originally;
1871 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1873 "Use of uninitialized value in string eq at Encode.pm line 96."
1874 ! Symbol/Makefile.PL
1875 ! EBCDIC/Makefile.PL
1877 -- Problem on case insensitive file systems
1878 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1879 Message-ID: <88254111953.20020323095503@familiehaase.de>
1882 "So I think it's a bug in gcc, not perl. But it still needs to be
1884 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1885 Message-Id: <20020323170509.C96475@plum.flirble.org>
1887 0.96 Sat Mar 23 2002
1889 ! lib/Encode/Encoding.pm
1890 ! lib/Encode/Alias.pm
1891 ! lib/Encode/Supported.pod
1893 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1894 Message-ID: <20020322073908.GB10539@blackrider>
1897 "...I think we should include ISO 8859-1 as well." -- NI-S
1898 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1903 ! lib/Encode/Alias.pm
1904 alias definitions relocated to Encode::Alias so module autoloading
1905 works for aliases also.
1907 encodings() now accepts args to check ExtModules.
1911 + EBCDIC/Makefile.PL
1912 + Symbol/Makefile.PL
1916 Latin and single byte encodings are reorganized so they are
1917 demand-loaded like Encode::XX. Now only ascii is compiled into
1919 ! lib/Encode/Alias.pm
1920 for my $k (keys %hash){ delete $hash{$k}; }
1921 is depreciated; fixed.
1923 0.95 Fri Mar 22 2002
1924 In this update, pod rewrites and alias fixes are the main issues
1925 + lib/Encode/Supported.pod
1926 Describes supported encodings
1928 streamlined compiled-in encodings.
1929 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1931 + Encode/ibm-125?.ucm
1932 Added from icu distibution with any occurance of
1933 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1934 some respect to icu staff, however.
1935 + lib/Encode/Alias.pm
1937 Alias difinitions in Encode.pm relocated.
1940 packWARN patch from Paul Marquess via jhi
1941 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1942 Paul added to AUTHORS as a result.
1943 ! t/CJKalias.t -> t/Aliases.t
1944 Renamed. Checks even more aliases and alias overloading
1947 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1948 gbk => cp936 relocated to CN.pm
1950 Test::More with plans (by jhi)
1952 0.94 Thu Mar 21 2002
1953 + lib/Encode/Description.pod
1954 ! lib/Encode/Encoding.pm
1955 Now the pod in Encode.pm is abridged as programming references.
1956 lib/Encode/Description.pod contains the original, detailed description
1957 and Encode::Encoding explains how to write your own module to
1958 add new encodings. So far, lib/Encode/Description.pod contains
1959 the whole pod once in Encode.pm. This is intentional.
1961 Pod revisions by Anton Tagunov
1962 Message-Id: <517178431.20020320174824@motor.ru>
1964 all occrance of Encode::Tcl::Extended removed including pod
1966 test now checks $encoding->name only; $encoding->{name} are
1967 no longer check to find the canonical name.
1968 ! lib/Encode/JP/JIS.pm
1969 ! lib/Encode/JP/ISO_2022_JP.pm
1970 ->name() added to be more compliant with API
1976 Patch by Autrijus to add aliases to TW and fixes to POD
1977 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1979 SADAHIRO Tomoyuki added as should. My apologies.
1981 0.93 Wed Mar 20 2002
1982 * First release to be uploaded to CPAN. For prehistoric changes,
1983 please see Changes file of perl distibution as well as
1984 perl-unicode@perl.org archive, available at:
1985 http://archive.develooper.com/perl-unicode@perl.org/
1987 Changes Since 0.92 includes;
1992 + Mention to perl-unicode@perl.org added
1994 + Encoding aliases added so you can feed locale names
1995 and MIME Charset="" directly.
1996 - Mention to JISX0212 removed because it's fixed
1999 + Encoding aliases added. Note TW is left untouched because
2000 euc-tw is not implemented in TW but in Encode::HanExtra.
2001 Autrijus, you may fix Encode::HanExtra.
2003 + to test encode aliases added