1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.39 2009/11/26 09:23:59 dankogai Exp dankogai $
4 ! Encode.xs t/fallback.t
5 $utf8 = decode('utf8', $malformed, sub{ ... }) # now works!
6 http://rt.cpan.org/Ticket/Display.html?id=51204
7 ! t/CJKT.t t/guess.t t/perlio.t
8 $ENV{'PERL_CORE'} tricks removed since they are no longer necessary.
9 Message-Id: <20091116161513.GA25556@bestpractical.com>
11 $Revision: 2.39 $ $Date: 2009/11/26 09:23:59 $
13 Addressed: Encode memory corruption [perl #70528]
14 Message-Id: <alpine.LFD.2.00.0911152328070.9483@ein.m-l.org>
15 ! t/Unicode.t Unicode/Unicode.xs
16 Patched: #51263: set magic is not applied when modifying encode arguments
17 http://rt.cpan.org/Ticket/Display.html?id=51263
19 Patched: #51204: Callback CHECK not supported for UTF-8 decoder/encoder
20 http://rt.cpan.org/Ticket/Display.html?id=51204
21 ! Byte/Byte.pm CN/CN.pm Changes JP/JP.pm KR/KR.pm TW/TW.pm
22 Unicode/Unicode.pm bin/enc2xs lib/Encode/Supported.pod
24 http://rt.cpan.org/Ticket/Display.html?id=49776
25 ! t/CJKT.t t/guess.t t/perlio.t t/piconv.t
26 $PERL_CORE trick is now off for perl 5.11 or better.
27 Message-Id: <b77c1dce0909070245s59b294bq8a8a8166e7342793@mail.gmail.com>
28 Message-Id: <E7FADA6C-D5A7-4ECA-BE4C-85911A97677E@dan.co.jp>
29 Message-Id: <20090907154908.GS60303@plum.flirble.org>
30 Message-Id: <20090907161509.GN8057@iabyn.com>
32 2.37 2009/09/06 14:32:21
34 fixed: compilation failure on compilers not supporting C99
35 http://rt.cpan.org/Ticket/Display.html?id=49466
37 2.36 2009/09/06 09:03:07
39 fixed: 'find_encoding("utf8")->decode(undef)' causes segmentation fault
40 http://rt.cpan.org/Ticket/Display.html?id=49462
42 2.35 2009/07/13 02:06:30
43 ! lib/Encode/MIME/Header.pm
45 decode of MIME-Header removes too much whitespace
46 http://rt.cpan.org/Ticket/Display.html?id=40027
47 http://rt.cpan.org/Ticket/Display.html?id=42902
49 Addressed by CSJEWELL: t/piconv.t loops infinitely on Win32
50 http://rt.cpan.org/Ticket/Display.html?id=47760
52 2.34 2009/07/08 13:34:15
54 duplicate-BOM problem now fixed.
55 Message-Id: <10ECB9B7-006E-4570-9EB6-51C49F04ADCF@dan.co.jp>
58 patches and tests by SREZIC
59 Message-Id: <4A5366DA.8050801@iconmobile.com>
61 man* removed on behalf of blead
62 Message-Id: <20090326135219.GU18164@plum.flirble.org>
64 2.33 2009/03/25 07:55:57
65 ! lib/Encode/MIME/Header.pm
66 Decontaminated $& which sneaked in on 2.31.
67 Message-Id: <67FC9F3A39C746DA95AAB6BB01539099@robmhp>
68 Message-Id: <693254b90903242352x2dc26ba6p5e68deb871fa88ae@mail.gmail.com>
69 http://coderepos.org/share/changeset/31542
71 2.32 2009/03/07 07:32:37
72 ! lib/Encode/Alias.pm t/Alias.t
73 Encode now resolves 'en_US.UTF-8' to utf-8-strict like 'ja_JP.euc'
74 Those who set locale on their shells should be happier now.
78 "encode(undef, 'str') should die earlier"
79 http://coderepos.org/share/changeset/30790
81 2.31 2009/02/16 06:18:09
82 ! lib/Encode/MIME/Header.pm
83 "Revert [29767] and [29771] since it breaks perl 5.8" by miyagawa
84 http://coderepos.org/share/changeset/30111
86 2.30 2009/02/15 17:44:13
88 fixed regexes, et cetera. by drry
89 http://coderepos.org/share/changeset/29767
90 ! lib/Encode/MIME/Header.pm
91 Addressed: Encode::MIME::Header::decode should respect CHECK
92 http://rt.cpan.org/Ticket/Display.html?id=43204
93 http://coderepos.org/share/changeset/29767
95 2.29 2009/02/01 13:14:37
97 VERSION++ just to make PAUSE happy
98 Message-Id: <877i4anwwt.fsf@k75.linux.bogus>
100 2.28 Date: 2009/02/01 12:30:18
102 Latest refactoring broke the backward compatibility
103 w/ Perl 5.8.6 and before now restored
104 Message-Id: <1233185156.DABa130.74940@basic2.hostingcompartido.com>
105 Message-Id: <693254b90902010027x277a5d0fm4f5700ba2f276239@mail.gmail.com>
106 ! lib/Encode/MIME/Header.pm
107 Addressed: Split header lines are joined incorrectly
108 http://rt.cpan.org/Ticket/Display.html?id=42902
110 2.27 2009/01/21 22:55:07
111 ! lib/Encode/MIME/Header.pm t/mime-header.t
112 Addressed: Encode::MIME::Header MIME-Q encoding truncates
113 trailing zeros in some circumstances
114 http://rt.cpan.org/Ticket/Display.html?id=42627
115 ! lib/Encode/Alias.pm
116 Added alias: unicode-1-1-utf-7
117 http://rt.cpan.org/Ticket/Display.html?id=38558
119 Documented: _utf8_on() does not work for tainted values
120 http://rt.cpan.org/Ticket/Display.html?id=41163
122 s[oss.software.ibm.com/icu][www.icu-project.org]g
123 http://rt.cpan.org/Ticket/Display.html?id=40245
124 ! lib/Encode/Guess.pm t/guess.t
125 Addressed:Empty file should produce an error message
126 http://rt.cpan.org/Ticket/Display.html?id=38652
127 ! Unicode/Unicode.xs AUTHORS
128 Refactored by Alex Davies
129 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/2007-10/msg00745.html
130 Message-Id: <7637669B2E3D46B187591747DA27F4C8@Amelie>
132 2.26 2008/07/01 20:56:17
134 Absense of Encode::ConfigLocal no longer carps no matter what.
135 http://bugzilla.redhat.com/show_bug.cgi?id=435505#c2
136 http://rt.cpan.org/Ticket/Display.html?id=28638
137 http://rt.cpan.org/Ticket/Display.html?id=11511
139 use encoding 'utf8' and 'iso-2022-jp' glitches on perl 5.10
141 Message-Id: <693254b90807011224h3ab50d76v50c6fea87baf223c@mail.gmail.com>
142 ! lib/Encode/Alias.pm t/Aliases.t
143 macintosh' not recognize as MacRoman
144 http://rt.cpan.org/Ticket/Display.html?id=36326
146 s{INC => "-I./Encode"}
147 {INC => '-I' . File::Spec->catfile( '.', 'Encode' )}
148 To prevent some platforms from forgetting to include Encode/encode.h.
149 http://rt.cpan.org/Ticket/Display.html?id=36348
151 2.25 2008/05/07 20:56:05
153 added ':default' to Exporter option.
154 ! lib/Encode/GSM0338.pm
155 GSM0338 now handles coderef in CHECK
156 http://rt.cpan.org/Ticket/Display.html?id=31335
158 Perl 5.10/Encode 2.24: Tiny typo in Encode's Makefile.PL arg processing
159 Message-Id: <961C2A4F-92B3-416D-A9F9-E7B0ADA9F134@fsck.com>
160 ! lib/Encode/Alias.pm
161 "This fix for Encode::Alias should make Solaris happy:"
162 Message-ID: <47D886D9.6060001@iki.fi>
164 2.24 2008/03/12 09:51:11
165 ! lib/Encode/Config.pm
166 adds and fixes also adds cp858 support.
167 ! Encode.pm encoding.pm lib/Encode/Alias.pm ucm/cp858.ucm
169 > Change 33486 by rgs@scipion on 2008/03/12 08:50:11
170 An unfortunate side-effect of Encode and Encode::Alias use'ing each
171 other, and Encode::Alias exporting functions into Encode for it to use
172 as methods, broke the loading of the find_alias() Encode method in some
173 cases since 5.10. Breaking the recursive inheritance fixes it.
174 Message-Id: <b77c1dce0803120151o4166c3a0gfcfd14681ab7e10d@mail.gmail.com>
177 Message-Id: <200711281835.36125@bloodgate.com>
179 Fix by MIYAGAWA via CodeRepos
180 http://coderepos.org/share/changeset/1791
181 ! encoding.pm t/mime_header_iso2022jp.t
182 ported back from Perl 5.10-RC1
184 2.23 2007/05/29 18:15:32
186 got rid of global fallback_cb; encode_method() now takes one more
187 argument which is a coderef to fallback. This should make
188 encode_method() thread-safe.
190 Added perluniintro, perlunifaq, and perlunitut to POD
192 Plug a memory leak in Encode -- by rgs
193 Message-Id: <b77c1dce0705290858v2be239c3o2d726e3d59091493@mail.gmail.com>
195 POD fixes on UTF-16LE
196 http://aspn.activestate.com/ASPN/Mail/Message/perl5-porters/3486118
198 man page generation is now conditional; yes by default but no if $PERL_CORE
199 Message-Id: <b77c1dce0705290237h5c4667cdlf79a48b839170add@mail.gmail.com>
201 2.22 2007/05/29 07:35:27
203 from_to() does not honor the check while decoding. That's a feature.
204 To make sure it is a feature it is mentioned in the POD.
205 http://rt.cpan.org/NoAuth/Bug.html?id=27277
207 Encode used to suppress man page generation. Now it does.
208 http://rt.cpan.org/NoAuth/Bug.html?id=27200
209 ! Encode.pm Encode.xs t/fallback.t
210 Addressed: (de|en)code("ascii", "\x{3000}", sub{ $_[0] }) segfaults
213 2.21 2007/05/12 06:42:19
214 + lib/Encode/MIME/Name.pm t/mime-name.t
215 ! Encode.pm Encode.xs lib/Encode/Encoding.pm
216 new method: mime_name()
217 inspired by: MIYAGAWA
219 Subject: Re: Compress::Zlib, pack "C" and utf-8 [PATCH]
220 From: Marc Lehmann <schmorp@schmorp.de>
221 Date: Thu, 12 Apr 2007 08:41:53 +0200
222 Message-ID: <20070412064153.GA22475@schmorp.de>
223 http://public.activestate.com/cgi-bin/perlbrowse/p/31194
226 Message-Id: <20070417220547.GA11999@zetta.zet>
228 2.20 2007/04/22 14:56:12
230 Pod fixes. Now find_encoding() is explained more in details.
231 + lib/Encode/GSM0338.pm
233 ! lib/Encode/Supported.pod lib/Encode/Config.pm Bytes/Makefile.PL t/gsm0338.t
234 ESTI GSM 03.38 support is relocated from Encode::Byte to Encode::GSM0338.
235 This encoding is so kaputt it is unfit for Encode::XS!
236 Though it was okay for general cases and escape sequences,
237 '\0' => '@' IFF '\0\0' => '\0' had gliches.
238 So kaputt even t/gsm0338 wrongly interpreted that.
239 ref. http://www.csoft.co.uk/sms/character_sets/gsm.htm
240 ! encoding.pm t/Aliases.t
241 Imported from bleedperl #31015
243 2.19 2007/04/06 12:53:41
244 ! lib/Encode/JP/JIS7.pm
246 encode('iso-2022-jp') fallback support added by MIYAGAWA++
247 decode()'s fallback remains unchanged (FB_PERLQQ) since UTF-8
248 contains all characters in iso-2022-jp so there's no need for fancy stuff.
249 Message-Id: <693254b90704060526s6d850320h71cdda50dfbf7eba@mail.gmail.com>
251 #25216 ([PATCH] Encode.pm: postpone the load of Encode::Encoding)
252 http://rt.cpan.org/NoAuth/Bug.html?id=25216
253 ! lib/Encode/MIME/Header.pm t/mime-header.t
254 #24418 (Encode::MIME::Header: wrong encoding with latin1 characters)
255 http://rt.cpan.org/NoAuth/Bug.html?id=24418
257 #23876 (Add documentation for LEAVE_SRC)
258 http://rt.cpan.org/NoAuth/Bug.html?id=23876
259 ! lib/Encode/Alias.pm t/Aliases.t
260 #20781: Thai encoding needs alias for tis-620
261 http://rt.cpan.org/NoAuth/Bug.html?id=20781
263 #20344: piconv: wrong conversion of utf-16le encoded files (with PATCH)
264 http://rt.cpan.org/NoAuth/Bug.html?id=20344
265 ! Encode.pm Encode.xs bin/enc2xs encoding.pm t/Aliases.t t/utf8strict.t
266 Imported from bleedperl's 2.18_01
268 2.18 2006/06/03 20:28:48
270 overhauled the -C option
271 - added ascii-ctrl', 'null', 'utf-8-strict' to core
272 - auto-generated Encode::ConfigLocal no longer use v-string for version
273 - now searches modules via File::Find so Encode/JP/Mobile is happy
274 ! Byte/Byte.pm CN/CN.pm EBCDIC/EBCDIC.pm JP/JP.pm KR/KR.pm Symbol/Symbol.pm
275 use strict added; though all they do is load XS, it's
276 still better a practice
278 use warnings added to all of them for better practices' sake.
280 2.17 2006/05/09 17:10:09
282 'chin' =~ /^zh_CN|chin(?:a|ese)?$/i is true
283 but chin is not china or chinese.
284 http://d.hatena.ne.jp/jankogai/20060508/1147090316
286 Integrated maintperl change (27824|27824) which I overlooked
287 -- sorry, Nicholas and Coverity Scan.
288 Message-Id: <200604152115.k3FLF1Ar014538@smtp3.ActiveState.com>
289 Message-Id: <200605091615.k49GF1gJ016777@smtp3.ActiveState.com>
291 2.16 2006/05/03 18:24:10
293 --xmlcref and --htmlcref added.
295 Copyright Notice Added.
296 http://rt.cpan.org/NoAuth/Bug.html?id=19056
298 Replaced remaining ^\t with q( ) x 4. -- Perl Best Practice pp. 20
299 And all .pm's are now perltidy-ed.
301 2.15 2006/04/06 15:44:11
303 Addressed: UTF-16, UTF-32, UCS, UTF-7 decoders mishandle illegal characters
304 http://rt.cpan.org/NoAuth/Bug.html?id=18556
306 added str2bytes() as an alias to encode() and bytes2str() as an alias
308 http://rt.cpan.org/NoAuth/Bug.html?id=17103
310 Change 26922: Avoid warning with MS Visual C compiler.
311 Message-Id: <200601231245.k0NCj2dw009484@smtp3.ActiveState.com>
313 Change 26067: As using -C to turn on utf8 IO is equivalent to the open pragma
314 Message-Id: <200511092227.jA9MRcYD009025@smtp3.ActiveState.com>
316 2.14 2006/01/15 15:43:36
318 Change 26295: Don't build manpages for Encode and Unicode::Normalize
319 Message-Id: <200512071540.jB7Fe4Gt017960@smtp3.ActiveState.com>
321 Change 26081: Pod nit in Encode.pm, found by Marc Lehmann in RT #36949.
322 Message-Id: <200511110357.jAB3vZcP023647@smtp3.ActiveState.com>
323 ! Encode.xs Encode/encode.h bin/enc2xs encengine.c
324 Change 25821: Mark more static Encode data structures as const.
325 Change 25823: use more 'const' in the Encode data structures.
326 Message-Id: <200510221243.j9MChTSu027711@smtp3.ActiveState.com>
327 Message-Id: <200510221343.j9MDhTk9001245@smtp3.ActiveState.com>
329 2.13 2006/01/15 15:06:36
331 Miyagawa's mail address updated
332 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
333 ! lib/Encode/MIME/Header.pm
334 #16413: Encode::MIME::Headers patch to solve what is probably someone else's bug
335 http://rt.cpan.org/NoAuth/Bug.html?id=16413
336 ! lib/Encode/MIME/Header.pm t/mime-header.t
337 Applied: RT #16258: Support for RFC 2184 language tag
338 http://rt.cpan.org/NoAuth/Bug.html?id=16258
340 Fixed RT #14559: fix for #8872 introduces new "bug"
341 http://rt.cpan.org/NoAuth/Bug.html?id=14559
344 from_to() now makes use of $check more naturally.
345 Message-Id: <693254b90601150535o767e10bai4f4732c275b4ebe0@mail.gmail.com>
346 <B10B4DE2-9BAF-4344-B3BE-2119977D817A@dan.co.jp>
348 2.12 2005/09/08 14:17:17
349 ! Encode.xs Encode.pm t/fallback.t
350 Now accepts coderef for CHECK!
352 Updated to newer version at unicode.org
353 http://rt.cpan.org/NoAuth/Bug.html?id=14222
354 ! lib/Encode/Supported.pod
356 <42F5E243.80500@gmail.com>
358 More POD typo leftover fixed.
359 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com>
361 2.11 2005/08/05 10:58:25
363 To reflect changes below
364 ! Encode.pm encoding.pm
365 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod
366 Typo fixed by Piotr Fusik in Change 25261 & 25266
367 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec>
369 Addresses "BUG REPORT: panic in Encode.xs".
370 Message-Id: <42EDDA97.2010608@hyper.to>
371 + lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t
372 ! lib/Encode/MIME/Header.pm lib/Encode/Config.pm
373 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka
374 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp>
375 ! Encode/encode.h Encode.pm Encode.xs
376 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character.
377 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp>
378 See also ext/PerlIO/encoding/encoding.pm
379 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com>
381 2.10 2005/05/16 18:46:36
383 fixed decode_utf8() accordingly to RT#8872
384 http://rt.cpan.org/NoAuth/Bug.html?id=8872
386 s/SvIVX/SvIV_set/ by Steve Peters.
387 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com>
391 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict
393 Tests utf-8-strict, accordingly to
394 UTF-8 decoder capability and stress test" by Markus Kuhn
395 http://smontagu.damowmow.com/utf8test.html
396 Note that malformed and overlong sequences are not test here
397 because perl already does that for you, utf-8-strict or not.
398 ! Encode.pm Encode/encode.h t/fallback.t
399 Addressed "encode(..., Encode::LEAVE_SRC) does not work".
400 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so
401 you can (en|de)code constant strings with these fallbacks.
402 http://rt.cpan.org/NoAuth/Bug.html?id=8736
403 ! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t
404 Make Encode.pm support the real UTF-8, by GAAS
405 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com>
406 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com>
407 ! Encode.pm Encode.xs
408 post-2.09 comment patches from GAAS applied.
409 Message-Id: <lroehacz6q.fsf@caliper.activestate.com>
410 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com>
412 2.09 2004/12/03 19:16:53
413 ! Encode.pm Encode.xs
414 Addressed " :encoding(utf8) broken in perl-5.8.6".
415 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com>
417 Addressed "(de|en)code($valid_encoding, undef) does not warn".
418 http://rt.cpan.org/NoAuth/Bug.html?id=8723
419 ! Encode.pm t/Encode.t
420 Addressed "Can't encode URI". When a reference is fed to (en|de)code,
421 Encode now stringifies instead of returning undef.
422 http://rt.cpan.org/NoAuth/Bug.html?id=8725
423 ! Encode.xs t/fallback.t
424 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder".
425 http://rt.cpan.org/NoAuth/Bug.html?id=8694
427 Addressed "s/digit/number/".
428 http://rt.cpan.org/NoAuth/Bug.html?id=8695
430 Addressed "while (defined(read )) { ... } is an infinite loop".
431 http://rt.cpan.org/NoAuth/Bug.html?id=8696
433 Addressed "What the heck is UCM?".
434 Document fixed so that it no longer contains "UCM-Based Encodings".
435 http://rt.cpan.org/NoAuth/Bug.html?id=8697
437 2.08 2004/10/24 13:00:29
438 ! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs}
439 Resolved the issue that was raised by 2.07 -- Encode::utf8 fallbacks
440 that was introduce messed up PerlIO::encoding.
441 * To do so, ->renew() is renewed and ->renewed() was introduced to
442 tell whether the caller is PerlIO or not.
443 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp>
445 2.07 2004/10/22 19:35:52
446 ! lib/Encode/Encoding.pm
447 "Remove Carp from warnings.pm" that influences Encode, by Tels.
448 Message-Id: <200410161618.29779@bloodgate.com>
449 ! Encode.xs AUTHORS t/fallback.t
450 Now Encode::utf8's fallbacks are compliant to Encode standard.
451 Thank Bjoern Hoehrmann for persistently convincing me.
452 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
456 2.06 2004/10/22 06:23:11
458 RT #8083 reports that MacThai mapping was obsolete
459 Updated all mac* encodings accordingly to the URI below.
460 One remaining mystery is that MacRomanian vs. MacRumanian.
461 MacRumanian is not found in unicode.org...
462 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
463 ! Encode.pm t/Encode.t
464 Fixed RT #8081: "decode(..., bless{},'x') segfault"
465 Two more tests added to test that.
466 http://rt.cpan.org/NoAuth/Bug.html?id=8081
468 POD revised accordingly to RT #7966
469 http://rt.cpan.org/NoAuth/Bug.html?id=7966
471 POD updated explaining why Encode::Unicode always croaks on error
472 rather than giving users choices.
473 http://rt.cpan.org/NoAuth/Bug.html?id=7892
475 2.05 2004/10/19 04:55:01
477 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
478 Message-ID: <41210A84.6060506@iki.fi>
479 Message-ID: <20041018233442.7418113f@r2d2>
480 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
482 2.04 2004/10/16 21:22:44
484 From: craigberry@mac.com
485 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
486 Message-Id: <41716868.7000102@mac.com>
488 2.03 2004/10/06 05:07:20
489 ! lib/Encode/Alias.pm
490 Resolved some alias case sensitivity glitches reported via RT.
491 http://rt.cpan.org/NoAuth/Bug.html?id=7835
493 Resolved Win32 glitches reported via RT.
494 (Fixed by dankogai and tested by Steve Hay)
495 http://rt.cpan.org/Ticket/Display.html?id=7831
496 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
497 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
498 http://rt.cpan.org/NoAuth/Bug.html?id=6695
500 2.02 2004/08/31 10:55:34
501 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
502 New map submitted by Deng Liu and Autrijus. Test data needed
503 to be upgrade as well, done by dankogai
504 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
506 Now works for characters U+10000 and above. This fix was needed
507 to "tidy" the original map that was submitted.
509 "ucmsort" now mentioned in pod
511 2.01 2004/05/25 16:27:14
513 From: domo@computer.org
514 Subject: [PATCH] Correct statistics from enc2xs
515 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
516 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
517 ! lib/Encode/Alias.pm
518 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
519 <200405271148.i4RBm4KY026529@mail.mvnet.de>
521 2.01 2004/05/25 16:27:14
522 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
523 "If someone thinks utf8::upgrade($1) should be croaked like
524 chom?p($1),please try the following patch for Encode.pm."
526 <20040522212704.C068.BQW10602@nifty.com>
528 2.0 2004/05/16 20:55:15
529 * version updated to 2.00
530 -- sorry, no big feature change. I just hate version 1.100 :)
531 ! lib/Encode/Guess.pm
533 addressed UTF-(8|32LE) + BOM misguessing
534 https://rt.cpan.org/Ticket/Display.html?id=6279
536 s/is_utif8/is_utf8/ in POD
537 ! Encode/lib/Encode/CN/HZ.pm
538 Fixes "make test" failure after the patch to pp_hot.c
540 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
542 From: autrijus@autrijus.org
543 Subject: [PATCH] "piconv -C 512" badly broken
544 Message-Id: <1072870210.769.5.camel@localhost>
546 1.99 2003/12/29 02:47:16
548 find_encoding("UTF-16BE")->encode("abc") now null terminates
549 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/2003-10/threads.html#00258
551 prototype bug in decode_utf8() fixed
552 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
553 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
554 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
556 Merged from maintperl@21987
558 1.98 2003/08/20 11:15:31
559 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
560 Dave Evans has found and corrected a bug in Encode::MIME::Header.
561 Test suite added by Dan Kogai.
562 Message-Id: <3F43440B.7060606@rudolf.org.uk>
564 Typo fixes rolled back in from bleedperl
565 ! t/at-cn.t t/at-tw.t
566 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
567 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
569 argv case nit for VMS by Craig
570 Message-ID: <3F2B02DE.10207@mac.com>
571 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
572 Encode test fixes for VMS by Peter Prymmer
573 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
574 ! lib/Encode/Alias.pm t/Aliases.t
575 koi-8 aliases bug detected and patched by sadahiro.
576 Further fix and test suite by dankogai
577 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
579 1.97 2003/07/08 21:52:14
580 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
581 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
582 $DEBUG replaced with DEBUG() so perl optimizes better,
583 by Rafael with further fixes by dankogai
584 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
585 ! lib/Encode/Aliases.pm
586 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
587 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
588 So new hash seeding introduced in bleedperl works.
589 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
590 ! lib/Encode/Guess.pm
591 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
592 automatic utf(8|16|32) guessing -- originally by Autrijus
593 Message-Id: <20030626162731.GA2077@not.autrijus.org>
595 Addressed the following;
596 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
597 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
599 1.96 2003/06/18 09:29:02
600 ! lib/Encode/JP/JP.pm t/guess.t
601 m/(...)/ in void context then $1 is considered a Bad Thing
602 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
604 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
607 More typecast from maintperl@19739
608 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
610 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
611 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
613 Now skips for EBCDIC platform.
614 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
616 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
617 have more than one C<.> in their filenames." -- perlport.
618 Message-Id: <3ED79E01.8050401@mac.com>
620 Found and fixed the back that -p,--perlqq does not work.
621 Induced by the change from Getopt::Std to Getopt::Long.
623 Addressed [cpan #2629] Wrong assumption in numeric comparison
624 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
625 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
626 lib/Encode/Encoding.pm t/perlio.t
627 ! API Change: ->new_sequence() => ->renew()
628 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
629 + Encode::XS and Encode::utf8 now supports ->renew()
630 + Encode::Encoding now documents this with examples
631 - Non-XS (en|de)code stripped out of Encode::Unicode
632 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
634 1.95 2003/05/21 08:41:11
636 Since bogus entries were found in iso-8859-6, all entries are
637 re-generated once again out of
638 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
639 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
640 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
641 + lib/Encode/Unicode/UTF7.pm
642 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
643 lib/Encode/Supported.pod
644 UTF-7 support is now added. With this Encode now has all transcoding
645 methods in Unicode::String.
647 1.94 2003/05/10 18:13:59
648 ! lib/Encode/MIME/Header.pm
649 A more sophisticated solution for double-encoding by dankogai
650 ! lib/Encode/MIME/Header.pm AUTHORS
651 Two bugs fixed by Bjoern Jacke
652 * "Double Encoding" was not possible
653 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
654 * encode("MIME-Q") had UTF-8 flag on
655 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
656 ! lib/Encode/MIME/Header.pm AUTHORS
657 Two occurances of "croak ()" fixed as "croak qq()".
658 Simon Cozens is added to AUTHORS as a result.
659 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
661 POD fixes that reflect enhancements by jhi
663 Two enhancements by jhi.
664 + Now uses Getopt::Long so it accepts long name options
665 (--from for -f, for example)
666 + New option: -r,--resolve
667 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
669 META.yml added upon request of Schwern
670 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
672 Enache Adrian removed upon request -- to live longer than Encode
673 and/or FreeBSD (toy-)?thread :)
674 Message-Id: <20030425015701.GA2069@ratsnest.hole>
676 "close STDOUT unless $^O eq 'freebsd';" once again relocated
677 to keep VMS happy in which case "$^O eq 'freebsd'" is required
678 to keep FreeBSD+thread happy. Sigh.
679 Message-Id: <3EA88ADC.3000300@mac.com>
681 1.93 2003/04/24 17:43:16
683 added "no warnings 'pack'" in for loop to keep bleedperl from
684 complaining "Character in 'C' format wrapped in pack".
686 More elegant perl core detection inspired by Ilya Zakharevich
687 (but further elaborated for general cases).
688 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
691 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
692 are less likely to be confused w/ ISO-8859-*
694 RT tests added (vendor encodings are exemplified)
695 -- that successfully found a flaw on iso-2022-kr before the patch.
696 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
697 decode("ISO-2022-KR") has been buggy but no one ever sited
698 that since no one seems to be using it. Bugs discovered by
700 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
701 ! lib/Encode/CN/HZ.pm t/perlio.t
702 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
703 so it adds test for HZ.
704 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
705 ! lib/Encode/Guess.pm
706 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
707 So long as the string contains \x{00}-\x{ff} it does not fail.
708 See perldoc for details.
709 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
711 1.92 2003/03/31 03:27:27
712 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
713 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
714 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
715 but glibc-BIG5-2.1.2.ucm does.
716 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
717 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
718 Maps regenerated again but this time based upon
719 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
720 (But where is THE DOCUMENT by MICROSOFT?)
721 ! t/enc_module.t AUTHORS
722 failure with threaded Perl on FreeBSD addressed.
723 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
724 Message-Id: <20030322230131.GA813@ratsnest.hole>
725 ! lib/Encode/Guess.pm
728 Change 18989: Make the :bytes conditional on PerlIO.
729 further Modified by Dan Kogai
730 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
732 Chnage 18966: another fix for failing test on windows ("use encoding"
733 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
734 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
736 Change 18970: Hopefully this works also in Win32, by jhi
737 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
738 Change 18965: fix CJKT.t failures on windows due to incorrect
740 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
742 1.91 2003/03/09 20:07:37
744 even more proofread by jhi.
745 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
748 +use lib qw(t ext/Encode/t ../ext/Encode/t);
749 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
751 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
752 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
755 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
757 1.90 2003/03/09 17:32:43
760 Inaba-san has added a patch for perl 5.8.1 or later that makes
761 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
762 that. POD is further revised.
763 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
764 ! encoding.pm t/enc_module.t
765 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
766 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
770 signed vs. unsigned issue discovered by Craig on OpenVM
771 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
772 ! encoding.pm AUTHORS
773 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
774 Because binmode() stacks layers instead of overwrite, you have to
775 ":raw :encoding()" in encoding.pm or your are in trouble when you
776 call encoding.pm multiple times. There are several workarounds
777 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
778 who was the first to address this problem was added to AUTHORS.
779 The test suites was added for this, which is a modified version
780 of SUGAWARA-san's scripts
781 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
783 1.88 2003/02/20 14:42:34
785 one signedness nit for Encode by jhi
786 <200302161933.h1GJX876018710@kosh.hut.fi>
788 VISCII map was incorrect; fixed by Sadahiro-san
789 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
790 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
791 You can't unlink files that are opened in cygwin but the last
792 file handle opened in t/enc_*.t left open. Patch submitted
793 by Yitzchak and he was added to AUTHORS.
794 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
796 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
797 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
799 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
800 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
802 1.87 2003/02/06 01:52:11
804 * Inaba "Sensei" Hirohito added (I thought I have done so a long
805 ago but apparently I did not).
806 * SUZUKI Norio added for verious and useful bug reports.
807 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
808 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
809 podchecked so all warnings are gone except for L<http://>.
810 ! encoding.pm t/enc_eucjp.t
811 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
812 * Filter option overhaul
814 ! Encode.pm Encode.xs encengine.c Encode/encode.h
815 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
816 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
817 without filter. podchecked by Dan Kogai.
818 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
819 ! lib/Encode/Alias.pm
820 decode('alias', $1) went wild because of local $_ in find_alias()
821 the evil local $_ is eradicated but that changes find_alias()
822 format for coderef aliasing. See Encode::Alias for details
823 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
825 1.86 2003/01/22 03:29:07
827 * Don't forget to canonize when you attempt an exact match!
828 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
829 * ${^ENCODING} exception is off for $] > 5.008
830 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
832 $] check commented out so it runs on 5.8.0
834 1.85 2003/01/21 22:19:14
836 ${^ENCODING} exception is now explicit rather than handled by regex.
837 + t/enc_eucjp.t t/enc_utf8.t
838 Test suite for the better "encoding" pragma support for bleedperl.
839 On 5.8.0, they will just be skipped.
841 1.84 2003/01/10 12:00:16
843 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
844 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
845 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
847 %_ fixes saves the resulting .so .05% smaller, by NC
848 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
850 Silence Encode on undef, by Andreas
851 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
852 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
854 s/regognised/recognised/ . British spelling left intact to pay
855 respect to two British Nicks :)
856 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
858 1.83 2002/11/18 17:28:49
859 ! Encode.xs lib/Encode/JIS7.pm
860 Even more patches from Inaba-san has been applied. With this
861 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
862 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
864 1.82 2002/11/14 23:06:12
866 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
867 later found in perls w/ -Dusemymalloc was fixed by NC.
868 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
870 1.81 2002/11/08 18:29:27
871 ! Encode.pm Encode.xs
872 Non-XS version of Encode::utf8 is back (with XS being default).
873 Encode::predefine_encodings(0) to turn off XS.
874 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
875 found that it was Test::More causing the problem, not Encode.
876 But I have already made it configurable so it may be useful in
878 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
880 The ingenious patch by Nicholas Clark that reduces shlib sizes by
881 50% with no penalty and backward compatibility preserved, is in.
882 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
884 1.80 2002/10/21 20:39:09
885 ! Encode.xs t/mime-header.t
886 Even more patches from NI-XS regarding Encode::utf8->decode().
887 And one more test to t/mime-header.t to prove it
888 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
890 1.79 2002/10/21 06:05:37
892 Further patches from NI-XS. Encode::utf8->decode() now checks the
893 value of utf8 flag of the argument. As a result, the fix to
894 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
895 no harm (even speedwise) I'll leave it unreverted.
896 ! ucm/cp949.ucm ucm/cp950.ucm
898 U+00AE REGISTERED SIGN
899 were missing as a result of 1.78. Discovered by Moriyama-san.
900 Moriyama-san has also developed a test script that compares
901 (en|de)coded results to the corresponding Win32 API result and
902 all cp9?? maps are now verified.
903 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
905 1.78 2002/10/20 15:44:00
906 ! lib/Encode/MIME/Header.pm
907 fixed so that it works with new Encode::utf8
908 ! Encode.pm Encode.xs
909 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
910 :encoding(UTF-8) to handle partial chars at end of buffers
912 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
913 ! lib/Encode/Supported.pod
914 More nitpickings applied.
917 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
918 tests were completely useless. To redeem that and get the peace of
919 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
920 ucm/*.ucm Since this script takes too long to finish (30 seconds on
921 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
922 but you can easily run that by either renaming it or:
924 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
925 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
926 Other CJKT cp9?? also updated according to the URI below;
927 http://www.microsoft.com/typography/unicode/cscp.htm
928 + bin/ucmsort MANIFEST
929 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
930 to proper order. intended for hardcore develpers only.
931 ! ucm/cp932.ucm JP/JP.pm AUTHORS
932 CP932 mapping which was based upon the mapping file at unicode.org
933 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
934 has also supplied the patch so he was added to AUTHORS.
935 ! lib/Encode/Supported.pod
936 ISO-8859-11 != TIS 620
937 == TIS 620 + \xA0 ( )
939 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
941 1.77 2002/10/06 03:27:02
943 * Modified to accomodate up and comming patch by Inaba-san that
944 will fix tr/// needing eval qq{}
945 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
947 * pod fixes/enhancements to reflect the changes above
948 ! lib/Encode/Alias.pm
949 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
950 Message-Id: <20021001015648.GB18710@not.autrijus.org>
952 1.76 2002/08/25 15:09:51
954 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
955 but naturally identical to previous version -- dankogai
957 Codepoint fixes -- autrijus
958 Message-Id: <20020805040236.GC5220@not.autrijus.org>
960 copied everything under perl-5.8.0/ext/Encode to make sure Encode
961 is in sync w/ perl core
963 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
964 Now that binmode(FH) does implicit ":bytes" revisit
965 the failing tests. The worrisome one is the Digest::MD5
966 test-- how will it fare in CRLF lands now?
968 From: Radu Greab <radu@netsoft.ro>
969 Date: Mon, 10 Jun 2002 00:40:34 +0300
970 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
971 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
972 ones which I cannot figure out.
973 ! lib/Encode/Alias.pm
974 Subject: [Encode PATCH] spurious warning
975 From: Nicholas Clark <nick@unfortu.net>
976 Date: Sun, 2 Jun 2002 20:26:22 +0100
977 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
979 1.75 2002/06/01 18:07:49
980 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
981 glibc compliance cited by Autrijus.
982 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
983 ! bin/enc2xs bin/piconv
984 Subject: Re: forewarning: usedevel and versiononly
985 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
987 1.74 2002/05/28 18:33:15
988 + ucm/null.ucm ucm/ctrl.ucm
989 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
990 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
991 Subject: Re: unicode -> &# notation
992 Message-ID: <20020518193704.GB40272@not.autrijus.org>
994 1.73 2002/05/28 17:26:18
995 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
996 Chris Nandor has fixed Encode so that it works w/ MacPerl --
997 at least w/ PPC (68k need static linking which does not work due to
998 64k limit). pudge is added to AUTHORS (I'm surprised he was not
999 there in the list). Encode/Makefile_PL.e2x was additionally fixed
1000 by dankogai to reflect changes in other Makefile.PL
1001 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
1003 Subject: Change 16746: -Mutf8 cleanup.
1004 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
1006 1.72 2002/05/20 15:49:56
1008 Subject: [PATCH] Encode should be in perl-core library path
1009 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
1010 Message-Id: <20020520161201.A11019@alpha.hut.fi>
1011 ! lib/Encode/MIME/Header.pm
1012 Subject: [PATCH] Encode::MIME::Header
1013 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
1014 ! Encode/Makefile_PL.e2x
1015 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
1016 Message-Id: <20020519201031.GA1603@not.autrijus.org>
1017 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
1018 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
1020 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
1021 Subject: [PATCH] good day for WinCE port of perl.
1022 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
1024 ! Unicode/Unicode.xs
1025 Even more linting by Robin via @16532
1027 Even more typecast by Sarathy in @16460
1029 1.71 2002/05/07 16:22:42
1031 even more typecasts by Robin
1032 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
1034 A very strange bug that was causing a bugus ucm -> C table
1035 generation that was revealed by a UCM file that Andreas was
1036 working. This is the king of wierdest bug I've encountered
1037 in the course of Encode maintenance.
1038 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
1040 1.70 2002/05/06 10:26:48
1042 Made more 'module-safe' with conjunction w/ 'no encoding'.
1043 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
1044 ! lib/Encode/Encoding.pm
1045 'require Encode' because ->Define uses Encode::define_encoding();
1046 problem and solution addressed by Miyagawa-kun
1047 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
1049 Cuts the frill to make djgpp happier, as suggested by Laszlo
1050 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
1052 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
1053 Message-Id: <m3bsbug48n.fsf@anima.de>
1055 1.69 2002/05/04 16:41:18
1056 ! lib/Encode/MIME/Header
1057 Floating-point coerced for UNICOS (in integer arithmetics it folds
1058 line one character too early). Verification by Mark is pending.
1059 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
1060 ! Unicode/Unicode.pm
1061 more doc patch from Elizabeth
1062 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
1063 ! Encode/Makefile_PL.e2x
1064 More platform-independent patch from Benjamin
1065 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
1066 ! lib/Encode/Guess AUTHORS
1067 split regex fix by Graham Barr. Adds him to AUTHORS.
1068 Message-Id: <20020504085419.E95940@valueclick.com>
1069 ! Encode/Makefile_PL.e2x
1070 enc2xs script discovery made smarter and more sensible, first cited
1071 by Miyagawa-kun and further suggestions by Rafael and Andreas
1072 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
1073 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
1075 1.68 2002/05/03 12:20:13
1076 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
1077 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
1078 and Supported.pod modified to reflect the change. Elizabeth added
1079 to Authors. And H.M. is also added for forwarding her patch among
1080 other contributions (I was rather surprised to find his name was not
1082 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
1084 1.67 2002/05/02 07:33:09
1086 Error message now consistent w/ perlqq (\N{U+} -> \x{})
1087 done in perl@16308 but Philip linted me further. Now the error
1088 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
1089 ! lib/Encode/Guess.pm
1090 Sanity check for happier -w by Autrijus
1092 1.66 2002/05/01 05:41:06
1093 ! Encode.xs t/fallback.t
1094 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
1095 while fallback is in effect. This even came with a welcome side-effect
1096 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
1097 also modified to test this.
1098 And of course, the corresponding varialbles to UV[Xx]f are appropriately
1099 cast. This should've concluded NI-XS homework.
1101 encode(undef) does warn again! Repented upon suggestion by NI-XS.
1102 Document for unless vs. '' added
1103 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
1105 1.65 2002/04/30 16:13:37
1107 encode(undef) no longer warns for C<Use of uninitialized value in
1108 subroutine entry>. Suggested by Paul.
1109 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
1110 ! lib/Encode/Supported.pod
1111 Encode::MIME::Header and Encode::Guess mentioned
1112 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
1113 ! lib/Encode/Guess.pm
1114 POD fix by Miyagawa-kun
1115 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
1117 1.64 2002/04/29 06:54:06
1119 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
1120 encocde("euc-jisx0213") are still DIFFERENT.
1121 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
1123 A few white spaces corrected by NI-XS via PerlIO integration to
1125 Subject: Change 16247: Integrate perlio;
1127 Document fixes by Andreas
1128 Message-Id: <m3k7qsf1we.fsf@anima.de>
1130 1.63 2002/04/27 18:59:50
1131 ! lib/Encode/Encoding.pm
1132 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
1133 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
1134 Make use of the Encode::Encoding base class!
1135 And other cleanups in Encode.xs upon NI-XS suggestions
1136 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
1138 1.62 2002/04/27 11:17:39
1140 encodings() now just check %ExtModule instead of eval{require}
1141 all of them for ":all" to conserve more memory.
1143 more "%x" -> "%" UVxf stuff.
1145 s/=over2/=over 2/g # oops.
1147 1.61 2002/04/26 03:02:04
1149 Now does decent tests besides use_ok()
1150 ! lib/Encode/Guess.pm t/guess.t
1151 UI streamlined, document added
1152 ! Unicode/Unicode.xs
1153 various signed/unsigned mismatch nits (#16173)
1154 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
1156 POD: utf8-flag-related caveats added. A few sections completely
1160 Thou shalt not assume %d works, either!
1161 Robin Baker added to AUTHORS for this
1162 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
1164 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
1166 1.60 2002/04/24 20:06:52
1168 "Thou shalt not assume %x works." -- jhi
1169 Message-Id: <20020424210618.E24347@alpha.hut.fi>
1170 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
1171 low-memory build machines happy, now *.c is created for each *.ucm
1172 (no table aggregation). You can still override this by setting
1173 $ENV{AGGREGATE_TABLES}.
1174 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
1175 + lib/Encode/Guess.pm
1176 + lib/Encode/JP/JIS7.pm
1177 Encoding-autodetect (mainly for Japanese encoding) added. In a
1178 course of development, JIS7.pm was improved.
1179 + lib/Encode/HTML/Header.pm
1180 + lib/Encode/Config.pm
1181 MIME B/Q Header Encoding Added!
1182 ! Encode.pm Encode.xs t/fallback.t
1183 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
1184 Message-Id: <20020424130709.GA14211@tanglefoot>
1186 1.59 $ 2002/04/22 23:54:22
1187 ! Encode.pm Encode.xs
1188 needs_lines() and perlio_ok() are added to Internal encodings such
1189 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
1191 1.58 2002/04/22 23:54:22
1193 s/MacChineseSimp/MacChineseTrad/ # ... oops.
1198 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
1199 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
1200 t/(JP|KR).t are aggregated to t/CJKT.t
1201 test data are all remade via bin/ucm2text.
1202 And .... They are no longer skipped for -Uuseperlio !
1204 1.57 2002/04/22 20:27:30
1205 ! t/JP.t t/KR.t t/perlio.t
1206 unless (find PerlIO::Layer 'perlio') ... line is back again.
1207 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
1208 need PerlIO. Sigh....
1209 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
1210 ->perlio_ok now does eval{ require PerlIO::encoding } there so
1211 it correctly returns 1 when PerlIO::encoding is yet loaded.
1213 perl-current patch #16072 reflected
1215 1.56 2002/04/22 09:48:07
1216 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
1217 New PerlIO::encoding 0.04 compliance met
1219 1.55 2002/04/22 03:43:05
1220 ! Encode.pm Encode.xs Unicode/Unicode.pm
1221 needs_lines() defined so Encode::Encoding is no longer needed
1224 1.54 2002/04/22 02:50:01
1225 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
1226 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
1227 now perlio_ok is true by default if PerlIO::encoding->VERSION is
1228 0.03 or larger. POD in Encode::Encoding revised to reflect this.
1229 Encode::XS and Encode::Unicode now has perlio_ok() method.
1230 ! lib/Encode/Supported.pod
1231 s/UP-UX/HP-UX/ by jhi
1232 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
1233 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
1234 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
1235 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
1236 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
1237 ! lib/Encode/Supported.pod
1238 Huge document fixes by Philip.
1241 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
1242 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
1244 binmode() after "<:encoding" to make Win32 happy, by Mattia.
1245 Mattia added to AUTHORS file
1246 Message-Id: <3CC3150F.5798.22A05AE@localhost>
1248 1.52 2002/04/20 23:43:47
1250 TODO: is now SKIP:, as NI-XS requested. Also adds more
1251 eraborate failure analysis added.
1253 A note on how to make sure of round-trip safety added to POD
1254 section (so Autrijus is happier)
1255 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
1256 big5-(eten|hkscs) is round-trip safe again!
1257 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
1259 Typo fixes by Andreas
1260 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
1261 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
1262 PerIO coodination patches from NI-XS.
1263 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
1265 1.51 2002/04/20 09:58:23
1267 Updated test suite by Autrijis so "make test" is happy again
1268 Message-Id: <20020420082104.GA25037@not.autrijus.org>
1270 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
1272 TW/TW.pm TW/Makefile.PL
1273 Updates by Autrijus. 'big5' is no longer a canonical but an
1274 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
1275 Message-Id: <20020419195346.GA19597@not.autrijus.org>
1277 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
1278 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
1280 PerlIO detection a little bit smarter; no longer uses eval qq{}
1283 1.50 2002/04/19 06:13:02
1284 ! ! Encode.pm Encode.xs Encode/encoding.h
1286 New Fallback API imlemented and documented. See "perldoc Encode"
1288 ! lib/Encode/JP/JIS7.pm Encode.pm
1289 + lib/Encode/PerlIO.pod t/perlio.t
1290 API compliance met. However, it still does not work unless perlio
1291 implements line buffer. See BUGS section in perldoc Encode::PerlIO
1292 As a sensible workaround, perlio_ok() added to Encode.
1294 ! lib/Encode/Supported.pod
1296 Message-Id: <20020418174647.J8466@alpha.hut.fi>
1298 Doc fixes from Autrijus
1299 Message-Id: <20020418144131.GA10987@not.autrijus.org>
1301 perlqq mode documented
1303 + t/jisx0201.euc t/jisx0201.ref
1304 ! t/jisx0208.euc t/jisx0208.ref
1305 t/JP.t tests more rigorously and with other encodings
1306 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
1307 in jis0208 (used to contain jisx0201 part).
1308 ! Encode/Makefile_PL.e2x
1309 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
1310 enc2xs and encode.h rather than hard-coded. This allows the resulting
1311 module fully CPANizable.
1312 ! encoding.pm t/JP.t t/KR.t
1313 PerlIO detection simplified (checks %INC instead of eval{})
1314 ! Encode.xs Encode/encode.h
1315 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
1316 - lib/Encode/Unicode.pm
1317 (en|de)code_xs relocated to where it belongs. Source reindented
1320 Additional (U8 *) cast added as suggested by jhi
1321 Message-Id: <20020417165916.A28599@alpha.hut.fi>
1323 1.42 Date: 2002/04/17
1325 no-op module; Thought of adding a pod there but enc2xs has
1330 correct mechanism to detect Perlio::encoding layar installed.
1332 PerlIO Layer detached.
1334 1.41 2002/04/16 23:35:00
1336 binmode(STDIN|STDOUT ...) done iff PerlIO is available
1338 Cleaned up PerlIO skip conditions to prepare for the upcoming
1339 Encode - PerlIO forking.
1341 exported functions are now prototyped.
1342 ! lib/Encode/CN/HZ.pm
1345 fallback implemented # was /* FIXME */
1346 affected programs revised to fit (only HZ was using the try-catch
1347 approach which needed to be fixed for API-compliance).
1349 ! Encode/KR/2022_KR.pm
1351 can find =head1 NAME now, jhi
1352 Message-Id: <20020416083059.V30639@alpha.hut.fi>
1356 now complies with less warnings with the pickest compilers.
1357 Suggested by Craig, fixed by Dan.
1358 ! Encode/Makefile_PL.e2x
1360 A bug that fails to find *.e2x in certain conditions fixed
1362 1.40 2002/04/14 22:27:14
1363 + Encode/ConfigLocal_PM.e2x
1364 ! lib/Encode/Config.pm
1366 "enc2xs -C" now generates/updates Encode::ConfigLocal.
1367 ConfigLocal_PM.e2x is a skelton thereof.
1368 ! lib/Encode/Config.pm
1370 "use Encode::CN::HZ;" was missing.
1373 More rigorous tests added to test XS, especially on memory allocation.
1375 ! lib/Encode/Unicode.pm
1376 NI-S implemented an XS version -- merged
1377 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
1380 Source filter option added. With this option on, you can write
1381 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
1382 encodings. t/jperl.t enhanced to test this feature.
1384 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
1385 apply his suggestion, this degree of nitting is enough to add him
1387 Message-Id: <3CB93223.291E5E2E@earthlink.net>
1389 + lib/Encode/JP/JIS7.pm
1390 - lib/Encode/JP/JIS.pm
1391 - lib/Encode/JP/2022_JP.pm
1392 - lib/Encode/JP/2022_JP1.pm
1393 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
1394 JIS7.pm for better maintainability and performance
1396 Added caveat for non-ascii identifiers.
1398 fixes by jhi, the original author of this pragramtic module.
1399 Message-Id: <20020413231527.V1826@alpha.hut.fi>
1401 1.34 2002/04/12 20:23:05 (Unreleased)
1404 EBCDIC fixes addressed by jhi.
1405 Message-Id: <20020412161844.D9383@alpha.hut.fi>
1406 ! lib/Encode/Encoder.pm
1407 POD fix by Miyagawa-kun
1408 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
1410 1.33 2002/04/10 22:28:40
1412 Philip's mail address corrected.
1415 ! lib/Encode/Encoder.pm
1416 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
1417 This adds Miyagawa-kun to AUTHORS.
1418 * encoding() no longer exported by default but on demand
1419 * t/Encoder.t updated to test all these
1420 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
1421 ! lib/Encode/Unicode.pm
1422 ! lib/Encode/Supported.pm
1423 Further doc fixes by Anton
1425 1.32 2002/04/09 20:06:15
1428 - ucm/macDevanaga.ucm Unicode Character Map
1429 - ucm/macGujarati.ucm Unicode Character Map
1430 - ucm/macGurmukhi.ucm Unicode Character Map
1431 A utility to check integrity of .ucm files. t/bogus.ucm is a
1432 ucm that is deliberately bogus. unused Indic mappings are removed
1435 resolve_alias() added as suggested by jhi. Same as
1436 find_encoding("alias")->name. For convenience. This one is
1437 defined in Encode.pm instead of Alias.pm.
1438 Message-Id: <20020409215846.H17022@alpha.hut.fi>
1440 Memory Allocate but detected during the devel of ucmlint -- fixed.
1441 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1442 ! lib/Encode/Unicode.pm
1443 valid_ucs2(0) is false but must be true.
1444 3 patches from NI-S as follows. This also has fixed the incident
1446 ! lib/Encode/Alias.pm
1447 find_alias() recursion prevention
1449 Checks for the patch above
1450 ! t/Encode/Unicode.pm
1451 An extra "F" that causes valid_ucs2() return a bogus value fixed
1452 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1453 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1454 2 Small Patches from jhi as follows:
1456 Encode->encodings() lists in case-insensitve order (as it was)
1458 -l option prints avaiable encodings to STDOUT instead of STDERR
1459 ! lib/Encode/Aliases.pm
1460 s/defintion/definition/
1461 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1463 ! lib/Encode/Supported.pod
1464 ! lib/Encode/Unicode.pm
1465 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1466 Thank you for the exact quote of Douglas Adams :)
1467 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1469 1.31 2002/04/08 18:08:07
1470 ! lib/Encode/Encoder.pm
1472 Encode::Encoder, once just a placeholder of an idea, is now much more
1473 practical. See t/Encode.t to find how practical it can be.
1474 + lib/Encode/Config.pm
1476 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1477 in Encode::Config and they are all literally, not programatically
1478 set. My false laziness was resulting many encodings missing from
1480 ! lib/Encode/Unicode.pm
1482 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1483 so that it does not rely Encode::Unicode for BOM values
1484 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1486 1.30 2002/04/08 02:34:51
1487 + lib/Encode/Encoder.pm
1488 Object Oriented Encoder. I reckon something like this is in need.
1491 ! lib/Encode/Supported.pod
1492 * autoloading bug that prevented upper-case canonicals such as UTF-16
1493 is fixed. Now even UTF/UCS are autoloaded!
1494 * encodings() is now more intuitive.
1495 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1497 * Obligatory fixes to the POD.
1498 ! lib/Encode/Supported.pod
1499 Patch from Anton applied.
1500 Message-Id: <66641479.20020408033300@motor.ru>
1502 ! lib/Encode/Unicode.pm
1503 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1505 1.28 2002/04/07 18:58:42
1509 Just a MANIFEST for those missing files.
1511 1.26 Date: 2002/04/07 15:22:04
1514 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1515 And t/Aliases.t fixed to make sure they all exist.
1517 1.25 2002/04/07 15:01:25 (Unreleased)
1519 ! lib/Encode/Unicode.pm
1522 - lib/Encode/UTF_EBCDIC.pm
1523 - lib/Encode/Internal.pm
1524 - lib/Encode/utf8.pm
1525 Integrated into Encode.pm as closures. That way "one package, one file"
1526 rule is preserved yet less files to require.
1528 commented out binmode(STDERR ...
1532 ! EBCDIC/Makefile.PL
1535 ! Symbol/Makefile.PL
1537 ! Encode/Makefile_PL.e2x
1538 Schwarn's MM-compliance patch merged
1539 Message-Id: <20020406082609.GA28758@blackrider>
1541 ! lib/Encode/Unicode.pm
1542 + lib/Encode/UTF_EBCDIC.pm
1544 - lib/Encode/10646_1.pm
1545 - lib/Encode/ucs2_le.pm
1546 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1547 per-module based (en|de)code, I saved a number of .pm by
1548 reorganizing it as per-object base (Well, this is what Encode::XS
1549 does under the hood). See Encode::Unicode for details.
1550 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1551 This module is used only on EBCDIC environments.
1553 1.21 2002/04/05 14:46:34 (Not Released)
1559 Are back to make Perl/Tk happy Smile, NI-S.
1561 ! lib/Encode/Alias.pm
1562 ! lib/Encode/Supported.pm
1563 ! lib/Encode/10646_1.pm
1564 ! lib/Encode/ucs2_le.pm
1565 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1566 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1567 updated to reflect changes.
1568 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1569 ! lib/Encode/Alias.pm
1570 ! lib/Encode/Supported.pm
1571 Anton's revision commited. Added Dan's own fixes as well.
1572 Message-Id: <159103166906.20020405161134@motor.ru>
1573 ! lib/Encode/Alias.pm
1575 < qr/^UCS2-le$/i => '"UCS-2"', );
1577 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1578 Sigh. Thank you, Anton.
1579 Message-Id: <14567692196.20020405062020@motor.ru>
1580 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1582 1.20 2002/04/04 19:50:52
1584 the last minute addtion. Just give it a try. Docs remains to be done.
1585 Not installed by default.
1586 ! lib/Encode/Supported.pod
1589 ! lib/Encode/Alias.pm
1590 ! lib/Encode/utf8.pm
1591 ! lib/Encode/10464_1.pm
1592 ! lib/Encode/ucs2_le.pm
1593 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1594 unchanged but UTF-16BE is added as an alias. Implicit aliases
1595 move to Encode::Alias so init_alias() works more as expected.
1596 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1597 Though pedantically wrong, This should make perl mongers happier.
1598 t/Alias.t is enhanced to test all these.
1599 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1601 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1602 of the codepoints are common. CJKT left as is because the saving is
1606 ! EBCDIC/Makefile.PL
1608 ! Encode/Makefile_PL.e2x
1612 ! Symbol/Makefile.PL
1616 All occurance of _def.h replaced with .exh so djgpp works happily
1617 ever after! To credit this amazing discovery, Laszlo is now in
1619 Message-Id: <20020403181424.GA8778@freemail.hu>
1620 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1623 ! Encode/Makefile_PL.skel
1625 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1626 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1628 Two more tests by added jhi
1629 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1632 The showstopper fixed -- Memory reallocation bug was causing
1633 Encode::XS to fall into infinite loop on certain conditions.
1634 t/grow.t tests that.
1635 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1640 ! lib/Encode/Supported.pod
1641 Vendor encodings rebuilt out of original map files at unicode.org.
1642 Indic languages such as MacDevanagali remain unspported do to the
1643 shortcoming of encengine capabilities (they need algorithmical
1644 conversion and I have no knowledge on that!). Pods fixed for added
1646 Oh, macJapan.ucm renamed to macJapanese.ucm.
1647 macROMnn is macRomanian and macRUMnn is macRumanian.
1648 txt2ucm is a crude script that is used to convert them.
1650 Unicode Compound Characters (used extensively on Mac) supported
1652 Typo fixes and improvements by jhi
1653 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1655 1.11 2002/03/31 22:12:13
1659 Missing files from the MANIFEST fixed.
1660 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1661 Version incremented just to make CPAN happy.
1663 1.10 2002/03/31 21:32:42
1666 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1667 if you want. This should make Autrijus happy. Also, piconv
1668 is added to default install.
1671 Here-documented files that enc2xs generates are now exported
1672 to *.e2x. Much cleaner and easier to debug.
1674 encoding enhances so you can make it act more like such
1675 (now prehistoric ) "localized" variations of perl like Jperl.
1677 Further test for encoding.pm. Written in euc-jp
1680 Taken over form jhi.
1681 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1686 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1693 ! Encode/macIceland.ucm
1694 ! lib/Encode/Alias.pm
1695 ! lib/Encode/Supported.pod
1696 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1697 Message-Id: <m3lmcavhjt.fsf@anima.de>
1699 1.01 2002/03/29 20:59:39
1702 s/USE_SCRIPTS/MORE_SCRIPTS/
1704 installs enc2xs by default for external Encode:: modules in CPAN,
1705 such as Encode::HanExtra
1707 More sensible perl core detection via $ENV{PERL_CORE}
1709 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1711 Perl core ditection via $^X =~ m/\bminiperl$/o
1712 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1714 1.00 Wed Mar 29 2002
1716 The version of all files is updated to 1.00 via "ci -f -l1.00",
1717 commemorating version 1.00. All files, including *.ucm are now
1718 under version control.
1721 encode.h moved to Encode/ so it will be installed for the later
1724 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1729 compile renamed to enc2xs.
1730 Affected Makefle.PL updated
1732 "Punt it. HanExtra can take care of that later." -- Autrijus
1733 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1736 ! Encode/ksc5601.ucm
1737 ! lib/Encode/CJKConstants.pm
1738 ! lib/Encode/KR/2022_KR.pm
1739 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1740 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1744 bin/ added for example scripts. They are not installed by default.
1745 to install them, "perl Makefile.PL USE_SCRIPTS".
1746 piconv is iconv reinvented in perl. in addition to all features
1747 of iconv, it also adds perlish features. See L<piconv/1> for more
1749 ! lib/Encode/Alias.pm
1750 qr/^ replaced with qr/\b so it directly matches locale names
1751 such as en_US.US-ASCII
1754 Patch by MJD to fix the following problem applied.
1755 Subject: [PATCH 5.7.3 Encode]
1756 Aliases.t not properly skipped when Encode extension not built
1757 Message-Id: <20020328091850.18677.qmail@plover.com>
1758 ! lib/Encode/KR/2022_KR.pm
1759 ! lib/Encode/CJKConstants.pm
1760 Another patch from Jungshik to make iso-2022-kr actually work
1761 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1762 ! Encode/Encode/euc-kr.ucm
1763 + Encode/Encode/johab.ucm
1764 ! Encode/Encode/ksc5601.ucm
1766 ! Encode/KR/Makefile.PL
1767 ! Encode/lib/Encode/Alias.pm
1769 Johab support and complete revision of Korean Encoding by Jungshik
1770 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1772 Revised to make up with now-dropped Encode::Details.
1773 - lib/Encode/Details.pod
1774 Dropped. Besides being obsolete, the topics are now covered in
1775 respective pods now.
1780 Korean aliases fixed thanks to Jungshik Shin
1781 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1782 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1784 =head1 NAME added to all modules to make buildtoc happy
1785 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1786 - lib/Encode/CJKguide.pod
1787 Too controversial and dropped from the dist. Will be available
1788 separately on the web.
1790 RCS tags added so table debugging gets easier (should that be
1791 needed! I hope they all stay 1.00!)
1792 + lib/Encode/CJKguide.pod
1793 A detailed guide to mainly, but not limited to, CJK multibyte
1796 + Encode/hp-roman8.ucm
1798 ! Encode/Supported.pod
1799 All occurance of "roman8" replaced with "hp-roman8" to avoid
1801 ! Encode/Supported.pod
1804 Mac Encodings now comply the Inside Macintosh
1806 Test for '-raw' conventions added.
1808 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1809 ! Encode/gb12345.ucm
1811 ! Encode/ksc5601.ucm
1812 "-raw" appended to canonical names.
1813 File mames stay unchanged thanks to UCM format.
1814 ! lib/Encode/CN/HZ.pm
1815 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1816 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1818 0.99 Tue Mar 26 2002
1819 - lib/Encode/JP/Const.pm
1820 + lib/Encode/CJKConstants.pm
1821 + lib/Encode/CN/2022_CN.pm
1822 + lib/Encode/KR/2022_KR.pm
1831 * Support for ISO-2022-KR and ISO-2022-CN added.
1833 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1834 * ucm2table autogenerates character table out of UCM files.
1837 - lib/Encode/Supports.pod
1838 + lib/Encode/Supported.pod
1839 Names reverted due to popular demand.
1840 8.3 rule applies only when there is a conflict.
1841 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1846 - lib/Encode/Format/Enc.pod
1848 * Character tables is now 100% ucm.
1849 * All files under Encode/ is now 8.3-compliant
1850 * some of missing encodings added (i.e. gsm0338 and nextstep)
1851 * Vendor mappings aggregated with appropriate national std in
1852 Makefile.PL, resulting smaller *.so especially for CJK.
1853 Following is result on Dan's FreeBSD box.
1855 ---------------------------------------------------------------
1856 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1857 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1858 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1859 blib/arch/auto/Encode/Encode.so 27,791 27,791
1860 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1861 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1862 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1863 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1864 ---------------------------------------------------------------
1865 Total 5,375,297 6,343,819
1867 * As a result of ucm-transition, Encode::Tcl dropped because
1868 Encode::Tcl demands *.enc.
1869 Encode::Tcl will be supplied in a separate tarball with *.enc.
1870 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1875 -lib/Encode/Supported.pod
1876 +lib/Encode/Supports.pod
1877 -lib/Encode/iso10646_1.pm
1878 +lib/Encode/10646_1.pm
1879 -lib/Encode/EncFormat.pod
1880 +lib/Encode/Format/Enc.pod
1881 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1882 - lib/Encode/JP/Constants.pm
1883 + lib/Encode/JP/Consts.pm
1884 ! lib/Encode/JP/JIS.pm
1885 ! lib/Encode/JP/H2Z.pm
1886 Version nit problem and 8.3 rule fix.
1887 > Package namespace installed latest in CPAN file
1888 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1889 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1890 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1891 accordingly. In addition, legacy "use vars qw()..." are replaced with
1893 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1894 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1896 - lib/Encode/JP/ISO_2022_JP.pm
1897 - lib/Encode/JP/ISO_2022_JP_1.pm
1898 + lib/Encode/JP/2022_JP.pm
1899 + lib/Encode/JP/2022_JP1.pm
1901 8.3 naming conflict for vanilla fat addressed by jhi
1902 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1905 Typecast fix addressed by jhi
1906 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1908 0.98 Mon Mar 25 2002
1909 ! lib/Encode/Supported.pod
1911 + lib/Encode/JP/ISO_2022_JP_1.pm
1912 ! lib/Encode/JP/ISO_2022_JP.pm
1913 ! lib/Encode/JP/JIS.pm
1915 Now Encode::JP is more strict on the difference between ISO-2022-JP
1916 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1917 makes Anton happier :) FYI the previous version implements
1918 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1919 ! lib/Encode/Supported.pod
1922 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1923 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1928 ! lib/Encode/Suppoted.pod
1929 pod fixes to replace F<http://...> to L<http://...>,
1930 as suggested by Autrijius in:
1931 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1932 ! lib/Encode/Suppoted.pod
1933 fixes and enhancements by Anton
1934 Message-Id: <10632060120.20020324103753@motor.ru>
1935 ! lib/Encode/Alias.pm
1936 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1937 added. Suggested by Anton then deobfuscated by Autrijius
1938 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1940 Further fix by Nicholas Clark
1941 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1942 - lib/EncodeFormat.pod
1943 + lib/Encode/EncFormat.pod
1945 File renamed as suggested by Autrijius
1947 ! lib/Encode/Details.pod
1948 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1949 ! Encode.pm Sun Mar 24 13:43:47 2002
1950 pod fixes by Autrijius.
1951 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1952 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1954 ! lib/Encode/Alias.pm
1956 now more EBCDIC conscious;
1957 %ExtModules on EBCDIC system excludes CJK so that you don't
1958 have to worry about the matched alias resulting cloaking.
1959 t/Alias.t also revised to reflect changes. Verified by jhi
1960 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1962 0.97 Sun Mar 24 2002
1966 EBCDIC detection mechanism installed as in JP/JP.pm
1967 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1970 ! EBCDIC/Makefile.PL
1973 ! Symbol/Makefile.PL
1975 Now all table files used by compile are postfixed '_t' to avoid
1976 namespace collisions in case insensitive file systems once for all!
1978 Message-ID: <58290227735.20020323195659@familiehaase.de>
1980 Since the Encode::JP is unsupported under EBCDIC we
1981 cannot run this test (aliases as such should work fine) -- jhi
1982 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1984 duplicate occurance of ascii.ucm and 8859-1.ucm
1985 causes MacOS X dlyd to cloak
1991 < chdir 't' if -d 't';
1993 > if (! -d 'blib' and -d 't'){ chdir 't' };
1994 When you are "make test"-ing on Encode/ directory, you must not
1995 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1996 remain unchanced. Also the situation detection was made simpler
1997 in t/JP.t, which was originally;
1998 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
2000 "Use of uninitialized value in string eq at Encode.pm line 96."
2001 ! Symbol/Makefile.PL
2002 ! EBCDIC/Makefile.PL
2004 -- Problem on case insensitive file systems
2005 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
2006 Message-ID: <88254111953.20020323095503@familiehaase.de>
2009 "So I think it's a bug in gcc, not perl. But it still needs to be
2011 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
2012 Message-Id: <20020323170509.C96475@plum.flirble.org>
2014 0.96 Sat Mar 23 2002
2016 ! lib/Encode/Encoding.pm
2017 ! lib/Encode/Alias.pm
2018 ! lib/Encode/Supported.pod
2020 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
2021 Message-ID: <20020322073908.GB10539@blackrider>
2024 "...I think we should include ISO 8859-1 as well." -- NI-S
2025 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
2030 ! lib/Encode/Alias.pm
2031 alias definitions relocated to Encode::Alias so module autoloading
2032 works for aliases also.
2034 encodings() now accepts args to check ExtModules.
2038 + EBCDIC/Makefile.PL
2039 + Symbol/Makefile.PL
2043 Latin and single byte encodings are reorganized so they are
2044 demand-loaded like Encode::XX. Now only ascii is compiled into
2046 ! lib/Encode/Alias.pm
2047 for my $k (keys %hash){ delete $hash{$k}; }
2048 is depreciated; fixed.
2050 0.95 Fri Mar 22 2002
2051 In this update, pod rewrites and alias fixes are the main issues
2052 + lib/Encode/Supported.pod
2053 Describes supported encodings
2055 streamlined compiled-in encodings.
2056 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
2058 + Encode/ibm-125?.ucm
2059 Added from icu distibution with any occurance of
2060 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
2061 some respect to icu staff, however.
2062 + lib/Encode/Alias.pm
2064 Alias difinitions in Encode.pm relocated.
2067 packWARN patch from Paul Marquess via jhi
2068 Message-Id: <20020321010101.O28978@alpha.hut.fi>
2069 Paul added to AUTHORS as a result.
2070 ! t/CJKalias.t -> t/Aliases.t
2071 Renamed. Checks even more aliases and alias overloading
2074 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
2075 gbk => cp936 relocated to CN.pm
2077 Test::More with plans (by jhi)
2079 0.94 Thu Mar 21 2002
2080 + lib/Encode/Description.pod
2081 ! lib/Encode/Encoding.pm
2082 Now the pod in Encode.pm is abridged as programming references.
2083 lib/Encode/Description.pod contains the original, detailed description
2084 and Encode::Encoding explains how to write your own module to
2085 add new encodings. So far, lib/Encode/Description.pod contains
2086 the whole pod once in Encode.pm. This is intentional.
2088 Pod revisions by Anton Tagunov
2089 Message-Id: <517178431.20020320174824@motor.ru>
2091 all occrance of Encode::Tcl::Extended removed including pod
2093 test now checks $encoding->name only; $encoding->{name} are
2094 no longer check to find the canonical name.
2095 ! lib/Encode/JP/JIS.pm
2096 ! lib/Encode/JP/ISO_2022_JP.pm
2097 ->name() added to be more compliant with API
2103 Patch by Autrijus to add aliases to TW and fixes to POD
2104 Message-Id: <20020320090619.GA24774@not.autrijus.org>
2106 SADAHIRO Tomoyuki added as should. My apologies.
2108 0.93 Wed Mar 20 2002
2109 * First release to be uploaded to CPAN. For prehistoric changes,
2110 please see Changes file of perl distibution as well as
2111 perl-unicode@perl.org archive, available at:
2112 http://archive.develooper.com/perl-unicode@perl.org/
2114 Changes Since 0.92 includes;
2119 + Mention to perl-unicode@perl.org added
2121 + Encoding aliases added so you can feed locale names
2122 and MIME Charset="" directly.
2123 - Mention to JISX0212 removed because it's fixed
2126 + Encoding aliases added. Note TW is left untouched because
2127 euc-tw is not implemented in TW but in Encode::HanExtra.
2128 Autrijus, you may fix Encode::HanExtra.
2130 + to test encode aliases added