1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.2 2004/08/31 10:55:34 dankogai Exp dankogai $
6 $Revision: 2.2 $ $Date: 2004/08/31 10:55:34 $
7 ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
8 New map submitted by Deng Liu and Autrijus. Test data needed
9 to be upgrade as well, done by dankogai
10 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
12 Now works for characters U+10000 and above. This fix was needed
13 to "tidy" the original map that was submitted.
15 "ucmsort" now mentioned in pod
17 2.01 2004/05/25 16:27:14
19 From: domo@computer.org
20 Subject: [PATCH] Correct statistics from enc2xs
21 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
22 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
24 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
25 <200405271148.i4RBm4KY026529@mail.mvnet.de>
27 2.01 2004/05/25 16:27:14
28 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
29 "If someone thinks utf8::upgrade($1) should be croaked like
30 chom?p($1),please try the following patch for Encode.pm."
32 <20040522212704.C068.BQW10602@nifty.com>
34 2.0 2004/05/16 20:55:15
35 * version updated to 2.00
36 -- sorry, no big feature change. I just hate version 1.100 :)
39 addressed UTF-(8|32LE) + BOM misguessing
40 https://rt.cpan.org/Ticket/Display.html?id=6279
42 s/is_utif8/is_utf8/ in POD
43 ! Encode/lib/Encode/CN/HZ.pm
44 Fixes "make test" failure after the patch to pp_hot.c
46 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
48 From: autrijus@autrijus.org
49 Subject: [PATCH] "piconv -C 512" badly broken
50 Message-Id: <1072870210.769.5.camel@localhost>
52 1.99 2003/12/29 02:47:16
54 find_encoding("UTF-16BE")->encode("abc") now null terminates
55 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
57 prototype bug in decode_utf8() fixed
58 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
59 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
60 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
62 Merged from maintperl@21987
64 1.98 2003/08/20 11:15:31
65 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
66 Dave Evans has found and corrected a bug in Encode::MIME::Header.
67 Test suite added by Dan Kogai.
68 Message-Id: <3F43440B.7060606@rudolf.org.uk>
70 Typo fixes rolled back in from bleedperl
72 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
73 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
75 argv case nit for VMS by Craig
76 Message-ID: <3F2B02DE.10207@mac.com>
77 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
78 Encode test fixes for VMS by Peter Prymmer
79 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
80 ! lib/Encode/Alias.pm t/Aliases.t
81 koi-8 aliases bug detected and patched by sadahiro.
82 Further fix and test suite by dankogai
83 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
85 1.97 2003/07/08 21:52:14
86 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
87 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
88 $DEBUG replaced with DEBUG() so perl optimizes better,
89 by Rafael with further fixes by dankogai
90 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
91 ! lib/Encode/Aliases.pm
92 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
93 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
94 So new hash seeding introduced in bleedperl works.
95 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
97 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
98 automatic utf(8|16|32) guessing -- originally by Autrijus
99 Message-Id: <20030626162731.GA2077@not.autrijus.org>
101 Addressed the following;
102 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
103 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
105 1.96 2003/06/18 09:29:02
106 ! lib/Encode/JP/JP.pm t/guess.t
107 m/(...)/ in void context then $1 is considered a Bad Thing
108 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
110 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
113 More typecast from maintperl@19739
114 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
116 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
117 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
119 Now skips for EBCDIC platform.
120 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
122 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
123 have more than one C<.> in their filenames." -- perlport.
124 Message-Id: <3ED79E01.8050401@mac.com>
126 Found and fixed the back that -p,--perlqq does not work.
127 Induced by the change from Getopt::Std to Getopt::Long.
129 Addressed [cpan #2629] Wrong assumption in numeric comparison
130 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
131 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
132 lib/Encode/Encoding.pm t/perlio.t
133 ! API Change: ->new_sequence() => ->renew()
134 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
135 + Encode::XS and Encode::utf8 now supports ->renew()
136 + Encode::Encoding now documents this with examples
137 - Non-XS (en|de)code stripped out of Encode::Unicode
138 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
140 1.95 2003/05/21 08:41:11
142 Since bogus entries were found in iso-8859-6, all entries are
143 re-generated once again out of
144 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
145 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
146 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
147 + lib/Encode/Unicode/UTF7.pm
148 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
149 lib/Encode/Supported.pod
150 UTF-7 support is now added. With this Encode now has all transcoding
151 methods in Unicode::String.
153 1.94 2003/05/10 18:13:59
154 ! lib/Encode/MIME/Header.pm
155 A more sophisticated solution for double-encoding by dankogai
156 ! lib/Encode/MIME/Header.pm AUTHORS
157 Two bugs fixed by Bjoern Jacke
158 * "Double Encoding" was not possible
159 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
160 * encode("MIME-Q") had UTF-8 flag on
161 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
162 ! lib/Encode/MIME/Header.pm AUTHORS
163 Two occurances of "croak ()" fixed as "croak qq()".
164 Simon Cozens is added to AUTHORS as a result.
165 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
167 POD fixes that reflect enhancements by jhi
169 Two enhancements by jhi.
170 + Now uses Getopt::Long so it accepts long name options
171 (--from for -f, for example)
172 + New option: -r,--resolve
173 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
175 META.yml added upon request of Schwern
176 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
178 Enache Adrian removed upon request -- to live longer than Encode
179 and/or FreeBSD (toy-)?thread :)
180 Message-Id: <20030425015701.GA2069@ratsnest.hole>
182 "close STDOUT unless $^O eq 'freebsd';" once again relocated
183 to keep VMS happy in which case "$^O eq 'freebsd'" is required
184 to keep FreeBSD+thread happy. Sigh.
185 Message-Id: <3EA88ADC.3000300@mac.com>
187 1.93 2003/04/24 17:43:16
189 added "no warnings 'pack'" in for loop to keep bleedperl from
190 complaining "Character in 'C' format wrapped in pack".
192 More elegant perl core detection inspired by Ilya Zakharevich
193 (but further elaborated for general cases).
194 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
197 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
198 are less likely to be confused w/ ISO-8859-*
200 RT tests added (vendor encodings are exemplified)
201 -- that successfully found a flaw on iso-2022-kr before the patch.
202 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
203 decode("ISO-2022-KR") has been buggy but no one ever sited
204 that since no one seems to be using it. Bugs discovered by
206 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
207 ! lib/Encode/CN/HZ.pm t/perlio.t
208 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
209 so it adds test for HZ.
210 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
211 ! lib/Encode/Guess.pm
212 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
213 So long as the string contains \x{00}-\x{ff} it does not fail.
214 See perldoc for details.
215 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
217 1.92 2003/03/31 03:27:27
218 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
219 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
220 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
221 but glibc-BIG5-2.1.2.ucm does.
222 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
223 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
224 Maps regenerated again but this time based upon
225 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
226 (But where is THE DOCUMENT by MICROSOFT?)
227 ! t/enc_module.t AUTHORS
228 failure with threaded Perl on FreeBSD addressed.
229 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
230 Message-Id: <20030322230131.GA813@ratsnest.hole>
231 ! lib/Encode/Guess.pm
234 Change 18989: Make the :bytes conditional on PerlIO.
235 further Modified by Dan Kogai
236 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
238 Chnage 18966: another fix for failing test on windows ("use encoding"
239 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
240 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
242 Change 18970: Hopefully this works also in Win32, by jhi
243 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
244 Change 18965: fix CJKT.t failures on windows due to incorrect
246 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
248 1.91 2003/03/09 20:07:37
250 even more proofread by jhi.
251 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
254 +use lib qw(t ext/Encode/t ../ext/Encode/t);
255 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
257 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
258 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
261 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
263 1.90 2003/03/09 17:32:43
266 Inaba-san has added a patch for perl 5.8.1 or later that makes
267 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
268 that. POD is further revised.
269 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
270 ! encoding.pm t/enc_module.t
271 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
272 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
276 signed vs. unsigned issue discovered by Craig on OpenVM
277 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
278 ! encoding.pm AUTHORS
279 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
280 Because binmode() stacks layers instead of overwrite, you have to
281 ":raw :encoding()" in encoding.pm or your are in trouble when you
282 call encoding.pm multiple times. There are several workarounds
283 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
284 who was the first to address this problem was added to AUTHORS.
285 The test suites was added for this, which is a modified version
286 of SUGAWARA-san's scripts
287 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
289 1.88 2003/02/20 14:42:34
291 one signedness nit for Encode by jhi
292 <200302161933.h1GJX876018710@kosh.hut.fi>
294 VISCII map was incorrect; fixed by Sadahiro-san
295 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
296 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
297 You can't unlink files that are opened in cygwin but the last
298 file handle opened in t/enc_*.t left open. Patch submitted
299 by Yitzchak and he was added to AUTHORS.
300 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
302 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
303 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
305 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
306 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
308 1.87 2003/02/06 01:52:11
310 * Inaba "Sensei" Hirohito added (I thought I have done so a long
311 ago but apparently I did not).
312 * SUZUKI Norio added for verious and useful bug reports.
313 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
314 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
315 podchecked so all warnings are gone except for L<http://>.
316 ! encoding.pm t/enc_eucjp.t
317 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
318 * Filter option overhaul
320 ! Encode.pm Encode.xs encengine.c Encode/encode.h
321 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
322 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
323 without filter. podchecked by Dan Kogai.
324 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
325 ! lib/Encode/Alias.pm
326 decode('alias', $1) went wild because of local $_ in find_alias()
327 the evil local $_ is eradicated but that changes find_alias()
328 format for coderef aliasing. See Encode::Alias for details
329 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
331 1.86 2003/01/22 03:29:07
333 * Don't forget to canonize when you attempt an exact match!
334 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
335 * ${^ENCODING} exception is off for $] > 5.008
336 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
338 $] check commented out so it runs on 5.8.0
340 1.85 2003/01/21 22:19:14
342 ${^ENCODING} exception is now explicit rather than handled by regex.
343 + t/enc_eucjp.t t/enc_utf8.t
344 Test suite for the better "encoding" pragma support for bleedperl.
345 On 5.8.0, they will just be skipped.
347 1.84 2003/01/10 12:00:16
349 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
350 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
351 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
353 %_ fixes saves the resulting .so .05% smaller, by NC
354 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
356 Silence Encode on undef, by Andreas
357 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
358 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
360 s/regognised/recognised/ . British spelling left intact to pay
361 respect to two British Nicks :)
362 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
364 1.83 2002/11/18 17:28:49
365 ! Encode.xs lib/Encode/JIS7.pm
366 Even more patches from Inaba-san has been applied. With this
367 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
368 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
370 1.82 2002/11/14 23:06:12
372 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
373 later found in perls w/ -Dusemymalloc was fixed by NC.
374 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
376 1.81 2002/11/08 18:29:27
377 ! Encode.pm Encode.xs
378 Non-XS version of Encode::utf8 is back (with XS being default).
379 Encode::predefine_encodings(0) to turn off XS.
380 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
381 found that it was Test::More causing the problem, not Encode.
382 But I have already made it configurable so it may be useful in
384 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
386 The ingenious patch by Nicholas Clark that reduces shlib sizes by
387 50% with no penalty and backward compatibility preserved, is in.
388 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
390 1.80 2002/10/21 20:39:09
391 ! Encode.xs t/mime-header.t
392 Even more patches from NI-XS regarding Encode::utf8->decode().
393 And one more test to t/mime-header.t to prove it
394 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
396 1.79 2002/10/21 06:05:37
398 Further patches from NI-XS. Encode::utf8->decode() now checks the
399 value of utf8 flag of the argument. As a result, the fix to
400 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
401 no harm (even speedwise) I'll leave it unreverted.
402 ! ucm/cp949.ucm ucm/cp950.ucm
404 U+00AE REGISTERED SIGN
405 were missing as a result of 1.78. Discovered by Moriyama-san.
406 Moriyama-san has also developed a test script that compares
407 (en|de)coded results to the corresponding Win32 API result and
408 all cp9?? maps are now verified.
409 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
411 1.78 2002/10/20 15:44:00
412 ! lib/Encode/MIME/Header.pm
413 fixed so that it works with new Encode::utf8
414 ! Encode.pm Encode.xs
415 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
416 :encoding(UTF-8) to handle partial chars at end of buffers
418 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
419 ! lib/Encode/Supported.pod
420 More nitpickings applied.
423 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
424 tests were completely useless. To redeem that and get the peace of
425 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
426 ucm/*.ucm Since this script takes too long to finish (30 seconds on
427 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
428 but you can easily run that by either renaming it or:
430 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
431 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
432 Other CJKT cp9?? also updated according to the URI below;
433 http://www.microsoft.com/typography/unicode/cscp.htm
434 + bin/ucmsort MANIFEST
435 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
436 to proper order. intended for hardcore develpers only.
437 ! ucm/cp932.ucm JP/JP.pm AUTHORS
438 CP932 mapping which was based upon the mapping file at unicode.org
439 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
440 has also supplied the patch so he was added to AUTHORS.
441 ! lib/Encode/Supported.pod
442 ISO-8859-11 != TIS 620
443 == TIS 620 + \xA0 ( )
445 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
447 1.77 2002/10/06 03:27:02
449 * Modified to accomodate up and comming patch by Inaba-san that
450 will fix tr/// needing eval qq{}
451 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
453 * pod fixes/enhancements to reflect the changes above
454 ! lib/Encode/Alias.pm
455 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
456 Message-Id: <20021001015648.GB18710@not.autrijus.org>
458 1.76 2002/08/25 15:09:51
460 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
461 but naturally identical to previous version -- dankogai
463 Codepoint fixes -- autrijus
464 Message-Id: <20020805040236.GC5220@not.autrijus.org>
466 copied everything under perl-5.8.0/ext/Encode to make sure Encode
467 is in sync w/ perl core
469 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
470 Now that binmode(FH) does implicit ":bytes" revisit
471 the failing tests. The worrisome one is the Digest::MD5
472 test-- how will it fare in CRLF lands now?
474 From: Radu Greab <radu@netsoft.ro>
475 Date: Mon, 10 Jun 2002 00:40:34 +0300
476 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
477 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
478 ones which I cannot figure out.
479 ! lib/Encode/Alias.pm
480 Subject: [Encode PATCH] spurious warning
481 From: Nicholas Clark <nick@unfortu.net>
482 Date: Sun, 2 Jun 2002 20:26:22 +0100
483 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
485 1.75 2002/06/01 18:07:49
486 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
487 glibc compliance cited by Autrijus.
488 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
489 ! bin/enc2xs bin/piconv
490 Subject: Re: forewarning: usedevel and versiononly
491 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
493 1.74 2002/05/28 18:33:15
494 + ucm/null.ucm ucm/ctrl.ucm
495 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
496 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
497 Subject: Re: unicode -> &# notation
498 Message-ID: <20020518193704.GB40272@not.autrijus.org>
500 1.73 2002/05/28 17:26:18
501 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
502 Chris Nandor has fixed Encode so that it works w/ MacPerl --
503 at least w/ PPC (68k need static linking which does not work due to
504 64k limit). pudge is added to AUTHORS (I'm surprised he was not
505 there in the list). Encode/Makefile_PL.e2x was additionally fixed
506 by dankogai to reflect changes in other Makefile.PL
507 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
509 Subject: Change 16746: -Mutf8 cleanup.
510 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
512 1.72 2002/05/20 15:49:56
514 Subject: [PATCH] Encode should be in perl-core library path
515 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
516 Message-Id: <20020520161201.A11019@alpha.hut.fi>
517 ! lib/Encode/MIME/Header.pm
518 Subject: [PATCH] Encode::MIME::Header
519 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
520 ! Encode/Makefile_PL.e2x
521 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
522 Message-Id: <20020519201031.GA1603@not.autrijus.org>
523 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
524 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
526 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
527 Subject: [PATCH] good day for WinCE port of perl.
528 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
531 Even more linting by Robin via @16532
533 Even more typecast by Sarathy in @16460
535 1.71 2002/05/07 16:22:42
537 even more typecasts by Robin
538 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
540 A very strange bug that was causing a bugus ucm -> C table
541 generation that was revealed by a UCM file that Andreas was
542 working. This is the king of wierdest bug I've encountered
543 in the course of Encode maintenance.
544 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
546 1.70 2002/05/06 10:26:48
548 Made more 'module-safe' with conjunction w/ 'no encoding'.
549 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
550 ! lib/Encode/Encoding.pm
551 'require Encode' because ->Define uses Encode::define_encoding();
552 problem and solution addressed by Miyagawa-kun
553 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
555 Cuts the frill to make djgpp happier, as suggested by Laszlo
556 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
558 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
559 Message-Id: <m3bsbug48n.fsf@anima.de>
561 1.69 2002/05/04 16:41:18
562 ! lib/Encode/MIME/Header
563 Floating-point coerced for UNICOS (in integer arithmetics it folds
564 line one character too early). Verification by Mark is pending.
565 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
567 more doc patch from Elizabeth
568 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
569 ! Encode/Makefile_PL.e2x
570 More platform-independent patch from Benjamin
571 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
572 ! lib/Encode/Guess AUTHORS
573 split regex fix by Graham Barr. Adds him to AUTHORS.
574 Message-Id: <20020504085419.E95940@valueclick.com>
575 ! Encode/Makefile_PL.e2x
576 enc2xs script discovery made smarter and more sensible, first cited
577 by Miyagawa-kun and further suggestions by Rafael and Andreas
578 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
579 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
581 1.68 2002/05/03 12:20:13
582 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
583 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
584 and Supported.pod modified to reflect the change. Elizabeth added
585 to Authors. And H.M. is also added for forwarding her patch among
586 other contributions (I was rather surprised to find his name was not
588 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
590 1.67 2002/05/02 07:33:09
592 Error message now consistent w/ perlqq (\N{U+} -> \x{})
593 done in perl@16308 but Philip linted me further. Now the error
594 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
595 ! lib/Encode/Guess.pm
596 Sanity check for happier -w by Autrijus
598 1.66 2002/05/01 05:41:06
599 ! Encode.xs t/fallback.t
600 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
601 while fallback is in effect. This even came with a welcome side-effect
602 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
603 also modified to test this.
604 And of course, the corresponding varialbles to UV[Xx]f are appropriately
605 cast. This should've concluded NI-XS homework.
607 encode(undef) does warn again! Repented upon suggestion by NI-XS.
608 Document for unless vs. '' added
609 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
611 1.65 2002/04/30 16:13:37
613 encode(undef) no longer warns for C<Use of uninitialized value in
614 subroutine entry>. Suggested by Paul.
615 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
616 ! lib/Encode/Supported.pod
617 Encode::MIME::Header and Encode::Guess mentioned
618 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
619 ! lib/Encode/Guess.pm
620 POD fix by Miyagawa-kun
621 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
623 1.64 2002/04/29 06:54:06
625 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
626 encocde("euc-jisx0213") are still DIFFERENT.
627 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
629 A few white spaces corrected by NI-XS via PerlIO integration to
631 Subject: Change 16247: Integrate perlio;
633 Document fixes by Andreas
634 Message-Id: <m3k7qsf1we.fsf@anima.de>
636 1.63 2002/04/27 18:59:50
637 ! lib/Encode/Encoding.pm
638 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
639 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
640 Make use of the Encode::Encoding base class!
641 And other cleanups in Encode.xs upon NI-XS suggestions
642 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
644 1.62 2002/04/27 11:17:39
646 encodings() now just check %ExtModule instead of eval{require}
647 all of them for ":all" to conserve more memory.
649 more "%x" -> "%" UVxf stuff.
651 s/=over2/=over 2/g # oops.
653 1.61 2002/04/26 03:02:04
655 Now does decent tests besides use_ok()
656 ! lib/Encode/Guess.pm t/guess.t
657 UI streamlined, document added
659 various signed/unsigned mismatch nits (#16173)
660 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
662 POD: utf8-flag-related caveats added. A few sections completely
666 Thou shalt not assume %d works, either!
667 Robin Baker added to AUTHORS for this
668 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
670 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
672 1.60 2002/04/24 20:06:52
674 "Thou shalt not assume %x works." -- jhi
675 Message-Id: <20020424210618.E24347@alpha.hut.fi>
676 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
677 low-memory build machines happy, now *.c is created for each *.ucm
678 (no table aggregation). You can still override this by setting
679 $ENV{AGGREGATE_TABLES}.
680 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
681 + lib/Encode/Guess.pm
682 + lib/Encode/JP/JIS7.pm
683 Encoding-autodetect (mainly for Japanese encoding) added. In a
684 course of development, JIS7.pm was improved.
685 + lib/Encode/HTML/Header.pm
686 + lib/Encode/Config.pm
687 MIME B/Q Header Encoding Added!
688 ! Encode.pm Encode.xs t/fallback.t
689 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
690 Message-Id: <20020424130709.GA14211@tanglefoot>
692 1.59 $ 2002/04/22 23:54:22
693 ! Encode.pm Encode.xs
694 needs_lines() and perlio_ok() are added to Internal encodings such
695 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
697 1.58 2002/04/22 23:54:22
699 s/MacChineseSimp/MacChineseTrad/ # ... oops.
704 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
705 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
706 t/(JP|KR).t are aggregated to t/CJKT.t
707 test data are all remade via bin/ucm2text.
708 And .... They are no longer skipped for -Uuseperlio !
710 1.57 2002/04/22 20:27:30
711 ! t/JP.t t/KR.t t/perlio.t
712 unless (find PerlIO::Layer 'perlio') ... line is back again.
713 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
714 need PerlIO. Sigh....
715 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
716 ->perlio_ok now does eval{ require PerlIO::encoding } there so
717 it correctly returns 1 when PerlIO::encoding is yet loaded.
719 perl-current patch #16072 reflected
721 1.56 2002/04/22 09:48:07
722 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
723 New PerlIO::encoding 0.04 compliance met
725 1.55 2002/04/22 03:43:05
726 ! Encode.pm Encode.xs Unicode/Unicode.pm
727 needs_lines() defined so Encode::Encoding is no longer needed
730 1.54 2002/04/22 02:50:01
731 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
732 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
733 now perlio_ok is true by default if PerlIO::encoding->VERSION is
734 0.03 or larger. POD in Encode::Encoding revised to reflect this.
735 Encode::XS and Encode::Unicode now has perlio_ok() method.
736 ! lib/Encode/Supported.pod
737 s/UP-UX/HP-UX/ by jhi
738 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
739 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
740 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
741 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
742 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
743 ! lib/Encode/Supported.pod
744 Huge document fixes by Philip.
747 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
748 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
750 binmode() after "<:encoding" to make Win32 happy, by Mattia.
751 Mattia added to AUTHORS file
752 Message-Id: <3CC3150F.5798.22A05AE@localhost>
754 1.52 2002/04/20 23:43:47
756 TODO: is now SKIP:, as NI-XS requested. Also adds more
757 eraborate failure analysis added.
759 A note on how to make sure of round-trip safety added to POD
760 section (so Autrijus is happier)
761 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
762 big5-(eten|hkscs) is round-trip safe again!
763 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
765 Typo fixes by Andreas
766 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
767 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
768 PerIO coodination patches from NI-XS.
769 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
771 1.51 2002/04/20 09:58:23
773 Updated test suite by Autrijis so "make test" is happy again
774 Message-Id: <20020420082104.GA25037@not.autrijus.org>
776 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
778 TW/TW.pm TW/Makefile.PL
779 Updates by Autrijus. 'big5' is no longer a canonical but an
780 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
781 Message-Id: <20020419195346.GA19597@not.autrijus.org>
783 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
784 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
786 PerlIO detection a little bit smarter; no longer uses eval qq{}
789 1.50 2002/04/19 06:13:02
790 ! ! Encode.pm Encode.xs Encode/encoding.h
792 New Fallback API imlemented and documented. See "perldoc Encode"
794 ! lib/Encode/JP/JIS7.pm Encode.pm
795 + lib/Encode/PerlIO.pod t/perlio.t
796 API compliance met. However, it still does not work unless perlio
797 implements line buffer. See BUGS section in perldoc Encode::PerlIO
798 As a sensible workaround, perlio_ok() added to Encode.
800 ! lib/Encode/Supported.pod
802 Message-Id: <20020418174647.J8466@alpha.hut.fi>
804 Doc fixes from Autrijus
805 Message-Id: <20020418144131.GA10987@not.autrijus.org>
807 perlqq mode documented
809 + t/jisx0201.euc t/jisx0201.ref
810 ! t/jisx0208.euc t/jisx0208.ref
811 t/JP.t tests more rigorously and with other encodings
812 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
813 in jis0208 (used to contain jisx0201 part).
814 ! Encode/Makefile_PL.e2x
815 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
816 enc2xs and encode.h rather than hard-coded. This allows the resulting
817 module fully CPANizable.
818 ! encoding.pm t/JP.t t/KR.t
819 PerlIO detection simplified (checks %INC instead of eval{})
820 ! Encode.xs Encode/encode.h
821 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
822 - lib/Encode/Unicode.pm
823 (en|de)code_xs relocated to where it belongs. Source reindented
826 Additional (U8 *) cast added as suggested by jhi
827 Message-Id: <20020417165916.A28599@alpha.hut.fi>
829 1.42 Date: 2002/04/17
831 no-op module; Thought of adding a pod there but enc2xs has
836 correct mechanism to detect Perlio::encoding layar installed.
838 PerlIO Layer detached.
840 1.41 2002/04/16 23:35:00
842 binmode(STDIN|STDOUT ...) done iff PerlIO is available
844 Cleaned up PerlIO skip conditions to prepare for the upcoming
845 Encode - PerlIO forking.
847 exported functions are now prototyped.
848 ! lib/Encode/CN/HZ.pm
851 fallback implemented # was /* FIXME */
852 affected programs revised to fit (only HZ was using the try-catch
853 approach which needed to be fixed for API-compliance).
855 ! Encode/KR/2022_KR.pm
857 can find =head1 NAME now, jhi
858 Message-Id: <20020416083059.V30639@alpha.hut.fi>
862 now complies with less warnings with the pickest compilers.
863 Suggested by Craig, fixed by Dan.
864 ! Encode/Makefile_PL.e2x
866 A bug that fails to find *.e2x in certain conditions fixed
868 1.40 2002/04/14 22:27:14
869 + Encode/ConfigLocal_PM.e2x
870 ! lib/Encode/Config.pm
872 "enc2xs -C" now generates/updates Encode::ConfigLocal.
873 ConfigLocal_PM.e2x is a skelton thereof.
874 ! lib/Encode/Config.pm
876 "use Encode::CN::HZ;" was missing.
879 More rigorous tests added to test XS, especially on memory allocation.
881 ! lib/Encode/Unicode.pm
882 NI-S implemented an XS version -- merged
883 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
886 Source filter option added. With this option on, you can write
887 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
888 encodings. t/jperl.t enhanced to test this feature.
890 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
891 apply his suggestion, this degree of nitting is enough to add him
893 Message-Id: <3CB93223.291E5E2E@earthlink.net>
895 + lib/Encode/JP/JIS7.pm
896 - lib/Encode/JP/JIS.pm
897 - lib/Encode/JP/2022_JP.pm
898 - lib/Encode/JP/2022_JP1.pm
899 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
900 JIS7.pm for better maintainability and performance
902 Added caveat for non-ascii identifiers.
904 fixes by jhi, the original author of this pragramtic module.
905 Message-Id: <20020413231527.V1826@alpha.hut.fi>
907 1.34 2002/04/12 20:23:05 (Unreleased)
910 EBCDIC fixes addressed by jhi.
911 Message-Id: <20020412161844.D9383@alpha.hut.fi>
912 ! lib/Encode/Encoder.pm
913 POD fix by Miyagawa-kun
914 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
916 1.33 2002/04/10 22:28:40
918 Philip's mail address corrected.
921 ! lib/Encode/Encoder.pm
922 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
923 This adds Miyagawa-kun to AUTHORS.
924 * encoding() no longer exported by default but on demand
925 * t/Encoder.t updated to test all these
926 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
927 ! lib/Encode/Unicode.pm
928 ! lib/Encode/Supported.pm
929 Further doc fixes by Anton
931 1.32 2002/04/09 20:06:15
934 - ucm/macDevanaga.ucm Unicode Character Map
935 - ucm/macGujarati.ucm Unicode Character Map
936 - ucm/macGurmukhi.ucm Unicode Character Map
937 A utility to check integrity of .ucm files. t/bogus.ucm is a
938 ucm that is deliberately bogus. unused Indic mappings are removed
941 resolve_alias() added as suggested by jhi. Same as
942 find_encoding("alias")->name. For convenience. This one is
943 defined in Encode.pm instead of Alias.pm.
944 Message-Id: <20020409215846.H17022@alpha.hut.fi>
946 Memory Allocate but detected during the devel of ucmlint -- fixed.
947 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
948 ! lib/Encode/Unicode.pm
949 valid_ucs2(0) is false but must be true.
950 3 patches from NI-S as follows. This also has fixed the incident
952 ! lib/Encode/Alias.pm
953 find_alias() recursion prevention
955 Checks for the patch above
956 ! t/Encode/Unicode.pm
957 An extra "F" that causes valid_ucs2() return a bogus value fixed
958 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
959 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
960 2 Small Patches from jhi as follows:
962 Encode->encodings() lists in case-insensitve order (as it was)
964 -l option prints avaiable encodings to STDOUT instead of STDERR
965 ! lib/Encode/Aliases.pm
966 s/defintion/definition/
967 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
969 ! lib/Encode/Supported.pod
970 ! lib/Encode/Unicode.pm
971 POD revise by Philip Newton. This adds Philip to AUTHORS list.
972 Thank you for the exact quote of Douglas Adams :)
973 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
975 1.31 2002/04/08 18:08:07
976 ! lib/Encode/Encoder.pm
978 Encode::Encoder, once just a placeholder of an idea, is now much more
979 practical. See t/Encode.t to find how practical it can be.
980 + lib/Encode/Config.pm
982 my false laziness at Encode.pm is fixed. Now %ExtModules are set
983 in Encode::Config and they are all literally, not programatically
984 set. My false laziness was resulting many encodings missing from
986 ! lib/Encode/Unicode.pm
988 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
989 so that it does not rely Encode::Unicode for BOM values
990 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
992 1.30 2002/04/08 02:34:51
993 + lib/Encode/Encoder.pm
994 Object Oriented Encoder. I reckon something like this is in need.
997 ! lib/Encode/Supported.pod
998 * autoloading bug that prevented upper-case canonicals such as UTF-16
999 is fixed. Now even UTF/UCS are autoloaded!
1000 * encodings() is now more intuitive.
1001 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1003 * Obligatory fixes to the POD.
1004 ! lib/Encode/Supported.pod
1005 Patch from Anton applied.
1006 Message-Id: <66641479.20020408033300@motor.ru>
1008 ! lib/Encode/Unicode.pm
1009 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1011 1.28 2002/04/07 18:58:42
1015 Just a MANIFEST for those missing files.
1017 1.26 Date: 2002/04/07 15:22:04
1020 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1021 And t/Aliases.t fixed to make sure they all exist.
1023 1.25 2002/04/07 15:01:25 (Unreleased)
1025 ! lib/Encode/Unicode.pm
1028 - lib/Encode/UTF_EBCDIC.pm
1029 - lib/Encode/Internal.pm
1030 - lib/Encode/utf8.pm
1031 Integrated into Encode.pm as closures. That way "one package, one file"
1032 rule is preserved yet less files to require.
1034 commented out binmode(STDERR ...
1038 ! EBCDIC/Makefile.PL
1041 ! Symbol/Makefile.PL
1043 ! Encode/Makefile_PL.e2x
1044 Schwarn's MM-compliance patch merged
1045 Message-Id: <20020406082609.GA28758@blackrider>
1047 ! lib/Encode/Unicode.pm
1048 + lib/Encode/UTF_EBCDIC.pm
1050 - lib/Encode/10646_1.pm
1051 - lib/Encode/ucs2_le.pm
1052 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1053 per-module based (en|de)code, I saved a number of .pm by
1054 reorganizing it as per-object base (Well, this is what Encode::XS
1055 does under the hood). See Encode::Unicode for details.
1056 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1057 This module is used only on EBCDIC environments.
1059 1.21 2002/04/05 14:46:34 (Not Released)
1065 Are back to make Perl/Tk happy Smile, NI-S.
1067 ! lib/Encode/Alias.pm
1068 ! lib/Encode/Supported.pm
1069 ! lib/Encode/10646_1.pm
1070 ! lib/Encode/ucs2_le.pm
1071 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1072 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1073 updated to reflect changes.
1074 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1075 ! lib/Encode/Alias.pm
1076 ! lib/Encode/Supported.pm
1077 Anton's revision commited. Added Dan's own fixes as well.
1078 Message-Id: <159103166906.20020405161134@motor.ru>
1079 ! lib/Encode/Alias.pm
1081 < qr/^UCS2-le$/i => '"UCS-2"', );
1083 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1084 Sigh. Thank you, Anton.
1085 Message-Id: <14567692196.20020405062020@motor.ru>
1086 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1088 1.20 2002/04/04 19:50:52
1090 the last minute addtion. Just give it a try. Docs remains to be done.
1091 Not installed by default.
1092 ! lib/Encode/Supported.pod
1095 ! lib/Encode/Alias.pm
1096 ! lib/Encode/utf8.pm
1097 ! lib/Encode/10464_1.pm
1098 ! lib/Encode/ucs2_le.pm
1099 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1100 unchanged but UTF-16BE is added as an alias. Implicit aliases
1101 move to Encode::Alias so init_alias() works more as expected.
1102 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1103 Though pedantically wrong, This should make perl mongers happier.
1104 t/Alias.t is enhanced to test all these.
1105 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1107 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1108 of the codepoints are common. CJKT left as is because the saving is
1112 ! EBCDIC/Makefile.PL
1114 ! Encode/Makefile_PL.e2x
1118 ! Symbol/Makefile.PL
1122 All occurance of _def.h replaced with .exh so djgpp works happily
1123 ever after! To credit this amazing discovery, Laszlo is now in
1125 Message-Id: <20020403181424.GA8778@freemail.hu>
1126 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1129 ! Encode/Makefile_PL.skel
1131 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1132 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1134 Two more tests by added jhi
1135 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1138 The showstopper fixed -- Memory reallocation bug was causing
1139 Encode::XS to fall into infinite loop on certain conditions.
1140 t/grow.t tests that.
1141 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1146 ! lib/Encode/Supported.pod
1147 Vendor encodings rebuilt out of original map files at unicode.org.
1148 Indic languages such as MacDevanagali remain unspported do to the
1149 shortcoming of encengine capabilities (they need algorithmical
1150 conversion and I have no knowledge on that!). Pods fixed for added
1152 Oh, macJapan.ucm renamed to macJapanese.ucm.
1153 macROMnn is macRomanian and macRUMnn is macRumanian.
1154 txt2ucm is a crude script that is used to convert them.
1156 Unicode Compound Characters (used extensively on Mac) supported
1158 Typo fixes and improvements by jhi
1159 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1161 1.11 2002/03/31 22:12:13
1165 Missing files from the MANIFEST fixed.
1166 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1167 Version incremented just to make CPAN happy.
1169 1.10 2002/03/31 21:32:42
1172 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1173 if you want. This should make Autrijus happy. Also, piconv
1174 is added to default install.
1177 Here-documented files that enc2xs generates are now exported
1178 to *.e2x. Much cleaner and easier to debug.
1180 encoding enhances so you can make it act more like such
1181 (now prehistoric ) "localized" variations of perl like Jperl.
1183 Further test for encoding.pm. Written in euc-jp
1186 Taken over form jhi.
1187 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1192 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1199 ! Encode/macIceland.ucm
1200 ! lib/Encode/Alias.pm
1201 ! lib/Encode/Supported.pod
1202 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1203 Message-Id: <m3lmcavhjt.fsf@anima.de>
1205 1.01 2002/03/29 20:59:39
1208 s/USE_SCRIPTS/MORE_SCRIPTS/
1210 installs enc2xs by default for external Encode:: modules in CPAN,
1211 such as Encode::HanExtra
1213 More sensible perl core detection via $ENV{PERL_CORE}
1215 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1217 Perl core ditection via $^X =~ m/\bminiperl$/o
1218 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1220 1.00 Wed Mar 29 2002
1222 The version of all files is updated to 1.00 via "ci -f -l1.00",
1223 commemorating version 1.00. All files, including *.ucm are now
1224 under version control.
1227 encode.h moved to Encode/ so it will be installed for the later
1230 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1235 compile renamed to enc2xs.
1236 Affected Makefle.PL updated
1238 "Punt it. HanExtra can take care of that later." -- Autrijus
1239 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1242 ! Encode/ksc5601.ucm
1243 ! lib/Encode/CJKConstants.pm
1244 ! lib/Encode/KR/2022_KR.pm
1245 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1246 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1250 bin/ added for example scripts. They are not installed by default.
1251 to install them, "perl Makefile.PL USE_SCRIPTS".
1252 piconv is iconv reinvented in perl. in addition to all features
1253 of iconv, it also adds perlish features. See L<piconv/1> for more
1255 ! lib/Encode/Alias.pm
1256 qr/^ replaced with qr/\b so it directly matches locale names
1257 such as en_US.US-ASCII
1260 Patch by MJD to fix the following problem applied.
1261 Subject: [PATCH 5.7.3 Encode]
1262 Aliases.t not properly skipped when Encode extension not built
1263 Message-Id: <20020328091850.18677.qmail@plover.com>
1264 ! lib/Encode/KR/2022_KR.pm
1265 ! lib/Encode/CJKConstants.pm
1266 Another patch from Jungshik to make iso-2022-kr actually work
1267 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1268 ! Encode/Encode/euc-kr.ucm
1269 + Encode/Encode/johab.ucm
1270 ! Encode/Encode/ksc5601.ucm
1272 ! Encode/KR/Makefile.PL
1273 ! Encode/lib/Encode/Alias.pm
1275 Johab support and complete revision of Korean Encoding by Jungshik
1276 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1278 Revised to make up with now-dropped Encode::Details.
1279 - lib/Encode/Details.pod
1280 Dropped. Besides being obsolete, the topics are now covered in
1281 respective pods now.
1286 Korean aliases fixed thanks to Jungshik Shin
1287 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1288 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1290 =head1 NAME added to all modules to make buildtoc happy
1291 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1292 - lib/Encode/CJKguide.pod
1293 Too controversial and dropped from the dist. Will be available
1294 separately on the web.
1296 RCS tags added so table debugging gets easier (should that be
1297 needed! I hope they all stay 1.00!)
1298 + lib/Encode/CJKguide.pod
1299 A detailed guide to mainly, but not limited to, CJK multibyte
1302 + Encode/hp-roman8.ucm
1304 ! Encode/Supported.pod
1305 All occurance of "roman8" replaced with "hp-roman8" to avoid
1307 ! Encode/Supported.pod
1310 Mac Encodings now comply the Inside Macintosh
1312 Test for '-raw' conventions added.
1314 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1315 ! Encode/gb12345.ucm
1317 ! Encode/ksc5601.ucm
1318 "-raw" appended to canonical names.
1319 File mames stay unchanged thanks to UCM format.
1320 ! lib/Encode/CN/HZ.pm
1321 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1322 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1324 0.99 Tue Mar 26 2002
1325 - lib/Encode/JP/Const.pm
1326 + lib/Encode/CJKConstants.pm
1327 + lib/Encode/CN/2022_CN.pm
1328 + lib/Encode/KR/2022_KR.pm
1337 * Support for ISO-2022-KR and ISO-2022-CN added.
1339 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1340 * ucm2table autogenerates character table out of UCM files.
1343 - lib/Encode/Supports.pod
1344 + lib/Encode/Supported.pod
1345 Names reverted due to popular demand.
1346 8.3 rule applies only when there is a conflict.
1347 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1352 - lib/Encode/Format/Enc.pod
1354 * Character tables is now 100% ucm.
1355 * All files under Encode/ is now 8.3-compliant
1356 * some of missing encodings added (i.e. gsm0338 and nextstep)
1357 * Vendor mappings aggregated with appropriate national std in
1358 Makefile.PL, resulting smaller *.so especially for CJK.
1359 Following is result on Dan's FreeBSD box.
1361 ---------------------------------------------------------------
1362 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1363 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1364 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1365 blib/arch/auto/Encode/Encode.so 27,791 27,791
1366 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1367 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1368 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1369 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1370 ---------------------------------------------------------------
1371 Total 5,375,297 6,343,819
1373 * As a result of ucm-transition, Encode::Tcl dropped because
1374 Encode::Tcl demands *.enc.
1375 Encode::Tcl will be supplied in a separate tarball with *.enc.
1376 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1381 -lib/Encode/Supported.pod
1382 +lib/Encode/Supports.pod
1383 -lib/Encode/iso10646_1.pm
1384 +lib/Encode/10646_1.pm
1385 -lib/Encode/EncFormat.pod
1386 +lib/Encode/Format/Enc.pod
1387 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1388 - lib/Encode/JP/Constants.pm
1389 + lib/Encode/JP/Consts.pm
1390 ! lib/Encode/JP/JIS.pm
1391 ! lib/Encode/JP/H2Z.pm
1392 Version nit problem and 8.3 rule fix.
1393 > Package namespace installed latest in CPAN file
1394 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1395 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1396 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1397 accordingly. In addition, legacy "use vars qw()..." are replaced with
1399 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1400 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1402 - lib/Encode/JP/ISO_2022_JP.pm
1403 - lib/Encode/JP/ISO_2022_JP_1.pm
1404 + lib/Encode/JP/2022_JP.pm
1405 + lib/Encode/JP/2022_JP1.pm
1407 8.3 naming conflict for vanilla fat addressed by jhi
1408 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1411 Typecast fix addressed by jhi
1412 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1414 0.98 Mon Mar 25 2002
1415 ! lib/Encode/Supported.pod
1417 + lib/Encode/JP/ISO_2022_JP_1.pm
1418 ! lib/Encode/JP/ISO_2022_JP.pm
1419 ! lib/Encode/JP/JIS.pm
1421 Now Encode::JP is more strict on the difference between ISO-2022-JP
1422 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1423 makes Anton happier :) FYI the previous version implements
1424 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1425 ! lib/Encode/Supported.pod
1428 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1429 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1434 ! lib/Encode/Suppoted.pod
1435 pod fixes to replace F<http://...> to L<http://...>,
1436 as suggested by Autrijius in:
1437 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1438 ! lib/Encode/Suppoted.pod
1439 fixes and enhancements by Anton
1440 Message-Id: <10632060120.20020324103753@motor.ru>
1441 ! lib/Encode/Alias.pm
1442 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1443 added. Suggested by Anton then deobfuscated by Autrijius
1444 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1446 Further fix by Nicholas Clark
1447 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1448 - lib/EncodeFormat.pod
1449 + lib/Encode/EncFormat.pod
1451 File renamed as suggested by Autrijius
1453 ! lib/Encode/Details.pod
1454 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1455 ! Encode.pm Sun Mar 24 13:43:47 2002
1456 pod fixes by Autrijius.
1457 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1458 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1460 ! lib/Encode/Alias.pm
1462 now more EBCDIC conscious;
1463 %ExtModules on EBCDIC system excludes CJK so that you don't
1464 have to worry about the matched alias resulting cloaking.
1465 t/Alias.t also revised to reflect changes. Verified by jhi
1466 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1468 0.97 Sun Mar 24 2002
1472 EBCDIC detection mechanism installed as in JP/JP.pm
1473 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1476 ! EBCDIC/Makefile.PL
1479 ! Symbol/Makefile.PL
1481 Now all table files used by compile are postfixed '_t' to avoid
1482 namespace collisions in case insensitive file systems once for all!
1484 Message-ID: <58290227735.20020323195659@familiehaase.de>
1486 Since the Encode::JP is unsupported under EBCDIC we
1487 cannot run this test (aliases as such should work fine) -- jhi
1488 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1490 duplicate occurance of ascii.ucm and 8859-1.ucm
1491 causes MacOS X dlyd to cloak
1497 < chdir 't' if -d 't';
1499 > if (! -d 'blib' and -d 't'){ chdir 't' };
1500 When you are "make test"-ing on Encode/ directory, you must not
1501 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1502 remain unchanced. Also the situation detection was made simpler
1503 in t/JP.t, which was originally;
1504 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1506 "Use of uninitialized value in string eq at Encode.pm line 96."
1507 ! Symbol/Makefile.PL
1508 ! EBCDIC/Makefile.PL
1510 -- Problem on case insensitive file systems
1511 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1512 Message-ID: <88254111953.20020323095503@familiehaase.de>
1515 "So I think it's a bug in gcc, not perl. But it still needs to be
1517 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1518 Message-Id: <20020323170509.C96475@plum.flirble.org>
1520 0.96 Sat Mar 23 2002
1522 ! lib/Encode/Encoding.pm
1523 ! lib/Encode/Alias.pm
1524 ! lib/Encode/Supported.pod
1526 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1527 Message-ID: <20020322073908.GB10539@blackrider>
1530 "...I think we should include ISO 8859-1 as well." -- NI-S
1531 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1536 ! lib/Encode/Alias.pm
1537 alias definitions relocated to Encode::Alias so module autoloading
1538 works for aliases also.
1540 encodings() now accepts args to check ExtModules.
1544 + EBCDIC/Makefile.PL
1545 + Symbol/Makefile.PL
1549 Latin and single byte encodings are reorganized so they are
1550 demand-loaded like Encode::XX. Now only ascii is compiled into
1552 ! lib/Encode/Alias.pm
1553 for my $k (keys %hash){ delete $hash{$k}; }
1554 is depreciated; fixed.
1556 0.95 Fri Mar 22 2002
1557 In this update, pod rewrites and alias fixes are the main issues
1558 + lib/Encode/Supported.pod
1559 Describes supported encodings
1561 streamlined compiled-in encodings.
1562 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1564 + Encode/ibm-125?.ucm
1565 Added from icu distibution with any occurance of
1566 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1567 some respect to icu staff, however.
1568 + lib/Encode/Alias.pm
1570 Alias difinitions in Encode.pm relocated.
1573 packWARN patch from Paul Marquess via jhi
1574 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1575 Paul added to AUTHORS as a result.
1576 ! t/CJKalias.t -> t/Aliases.t
1577 Renamed. Checks even more aliases and alias overloading
1580 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1581 gbk => cp936 relocated to CN.pm
1583 Test::More with plans (by jhi)
1585 0.94 Thu Mar 21 2002
1586 + lib/Encode/Description.pod
1587 ! lib/Encode/Encoding.pm
1588 Now the pod in Encode.pm is abridged as programming references.
1589 lib/Encode/Description.pod contains the original, detailed description
1590 and Encode::Encoding explains how to write your own module to
1591 add new encodings. So far, lib/Encode/Description.pod contains
1592 the whole pod once in Encode.pm. This is intentional.
1594 Pod revisions by Anton Tagunov
1595 Message-Id: <517178431.20020320174824@motor.ru>
1597 all occrance of Encode::Tcl::Extended removed including pod
1599 test now checks $encoding->name only; $encoding->{name} are
1600 no longer check to find the canonical name.
1601 ! lib/Encode/JP/JIS.pm
1602 ! lib/Encode/JP/ISO_2022_JP.pm
1603 ->name() added to be more compliant with API
1609 Patch by Autrijus to add aliases to TW and fixes to POD
1610 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1612 SADAHIRO Tomoyuki added as should. My apologies.
1614 0.93 Wed Mar 20 2002
1615 * First release to be uploaded to CPAN. For prehistoric changes,
1616 please see Changes file of perl distibution as well as
1617 perl-unicode@perl.org archive, available at:
1618 http://archive.develooper.com/perl-unicode@perl.org/
1620 Changes Since 0.92 includes;
1625 + Mention to perl-unicode@perl.org added
1627 + Encoding aliases added so you can feed locale names
1628 and MIME Charset="" directly.
1629 - Mention to JISX0212 removed because it's fixed
1632 + Encoding aliases added. Note TW is left untouched because
1633 euc-tw is not implemented in TW but in Encode::HanExtra.
1634 Autrijus, you may fix Encode::HanExtra.
1636 + to test encode aliases added