1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.4 2004/10/16 21:22:44 dankogai Exp dankogai $
6 $Revision: 2.4 $ $Date: 2004/10/16 21:22:44 $
8 From: craigberry@mac.com
9 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
10 Message-Id: <41716868.7000102@mac.com>
12 2.03 2004/10/06 05:07:20
14 Resolved some alias case sensitivity glitches reported via RT.
15 http://rt.cpan.org/NoAuth/Bug.html?id=7835
17 Resolved Win32 glitches reported via RT.
18 (Fixed by dankogai and tested by Steve Hay)
19 http://rt.cpan.org/Ticket/Display.html?id=7831
20 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
21 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
22 http://rt.cpan.org/NoAuth/Bug.html?id=6695
24 2.02 2004/08/31 10:55:34
25 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
26 New map submitted by Deng Liu and Autrijus. Test data needed
27 to be upgrade as well, done by dankogai
28 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
30 Now works for characters U+10000 and above. This fix was needed
31 to "tidy" the original map that was submitted.
33 "ucmsort" now mentioned in pod
35 2.01 2004/05/25 16:27:14
37 From: domo@computer.org
38 Subject: [PATCH] Correct statistics from enc2xs
39 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
40 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
42 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
43 <200405271148.i4RBm4KY026529@mail.mvnet.de>
45 2.01 2004/05/25 16:27:14
46 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
47 "If someone thinks utf8::upgrade($1) should be croaked like
48 chom?p($1),please try the following patch for Encode.pm."
50 <20040522212704.C068.BQW10602@nifty.com>
52 2.0 2004/05/16 20:55:15
53 * version updated to 2.00
54 -- sorry, no big feature change. I just hate version 1.100 :)
57 addressed UTF-(8|32LE) + BOM misguessing
58 https://rt.cpan.org/Ticket/Display.html?id=6279
60 s/is_utif8/is_utf8/ in POD
61 ! Encode/lib/Encode/CN/HZ.pm
62 Fixes "make test" failure after the patch to pp_hot.c
64 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
66 From: autrijus@autrijus.org
67 Subject: [PATCH] "piconv -C 512" badly broken
68 Message-Id: <1072870210.769.5.camel@localhost>
70 1.99 2003/12/29 02:47:16
72 find_encoding("UTF-16BE")->encode("abc") now null terminates
73 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
75 prototype bug in decode_utf8() fixed
76 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
77 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
78 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
80 Merged from maintperl@21987
82 1.98 2003/08/20 11:15:31
83 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
84 Dave Evans has found and corrected a bug in Encode::MIME::Header.
85 Test suite added by Dan Kogai.
86 Message-Id: <3F43440B.7060606@rudolf.org.uk>
88 Typo fixes rolled back in from bleedperl
90 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
91 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
93 argv case nit for VMS by Craig
94 Message-ID: <3F2B02DE.10207@mac.com>
95 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
96 Encode test fixes for VMS by Peter Prymmer
97 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
98 ! lib/Encode/Alias.pm t/Aliases.t
99 koi-8 aliases bug detected and patched by sadahiro.
100 Further fix and test suite by dankogai
101 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
103 1.97 2003/07/08 21:52:14
104 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
105 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
106 $DEBUG replaced with DEBUG() so perl optimizes better,
107 by Rafael with further fixes by dankogai
108 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
109 ! lib/Encode/Aliases.pm
110 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
111 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
112 So new hash seeding introduced in bleedperl works.
113 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
114 ! lib/Encode/Guess.pm
115 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
116 automatic utf(8|16|32) guessing -- originally by Autrijus
117 Message-Id: <20030626162731.GA2077@not.autrijus.org>
119 Addressed the following;
120 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
121 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
123 1.96 2003/06/18 09:29:02
124 ! lib/Encode/JP/JP.pm t/guess.t
125 m/(...)/ in void context then $1 is considered a Bad Thing
126 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
128 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
131 More typecast from maintperl@19739
132 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
134 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
135 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
137 Now skips for EBCDIC platform.
138 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
140 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
141 have more than one C<.> in their filenames." -- perlport.
142 Message-Id: <3ED79E01.8050401@mac.com>
144 Found and fixed the back that -p,--perlqq does not work.
145 Induced by the change from Getopt::Std to Getopt::Long.
147 Addressed [cpan #2629] Wrong assumption in numeric comparison
148 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
149 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
150 lib/Encode/Encoding.pm t/perlio.t
151 ! API Change: ->new_sequence() => ->renew()
152 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
153 + Encode::XS and Encode::utf8 now supports ->renew()
154 + Encode::Encoding now documents this with examples
155 - Non-XS (en|de)code stripped out of Encode::Unicode
156 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
158 1.95 2003/05/21 08:41:11
160 Since bogus entries were found in iso-8859-6, all entries are
161 re-generated once again out of
162 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
163 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
164 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
165 + lib/Encode/Unicode/UTF7.pm
166 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
167 lib/Encode/Supported.pod
168 UTF-7 support is now added. With this Encode now has all transcoding
169 methods in Unicode::String.
171 1.94 2003/05/10 18:13:59
172 ! lib/Encode/MIME/Header.pm
173 A more sophisticated solution for double-encoding by dankogai
174 ! lib/Encode/MIME/Header.pm AUTHORS
175 Two bugs fixed by Bjoern Jacke
176 * "Double Encoding" was not possible
177 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
178 * encode("MIME-Q") had UTF-8 flag on
179 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
180 ! lib/Encode/MIME/Header.pm AUTHORS
181 Two occurances of "croak ()" fixed as "croak qq()".
182 Simon Cozens is added to AUTHORS as a result.
183 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
185 POD fixes that reflect enhancements by jhi
187 Two enhancements by jhi.
188 + Now uses Getopt::Long so it accepts long name options
189 (--from for -f, for example)
190 + New option: -r,--resolve
191 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
193 META.yml added upon request of Schwern
194 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
196 Enache Adrian removed upon request -- to live longer than Encode
197 and/or FreeBSD (toy-)?thread :)
198 Message-Id: <20030425015701.GA2069@ratsnest.hole>
200 "close STDOUT unless $^O eq 'freebsd';" once again relocated
201 to keep VMS happy in which case "$^O eq 'freebsd'" is required
202 to keep FreeBSD+thread happy. Sigh.
203 Message-Id: <3EA88ADC.3000300@mac.com>
205 1.93 2003/04/24 17:43:16
207 added "no warnings 'pack'" in for loop to keep bleedperl from
208 complaining "Character in 'C' format wrapped in pack".
210 More elegant perl core detection inspired by Ilya Zakharevich
211 (but further elaborated for general cases).
212 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
215 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
216 are less likely to be confused w/ ISO-8859-*
218 RT tests added (vendor encodings are exemplified)
219 -- that successfully found a flaw on iso-2022-kr before the patch.
220 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
221 decode("ISO-2022-KR") has been buggy but no one ever sited
222 that since no one seems to be using it. Bugs discovered by
224 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
225 ! lib/Encode/CN/HZ.pm t/perlio.t
226 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
227 so it adds test for HZ.
228 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
229 ! lib/Encode/Guess.pm
230 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
231 So long as the string contains \x{00}-\x{ff} it does not fail.
232 See perldoc for details.
233 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
235 1.92 2003/03/31 03:27:27
236 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
237 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
238 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
239 but glibc-BIG5-2.1.2.ucm does.
240 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
241 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
242 Maps regenerated again but this time based upon
243 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
244 (But where is THE DOCUMENT by MICROSOFT?)
245 ! t/enc_module.t AUTHORS
246 failure with threaded Perl on FreeBSD addressed.
247 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
248 Message-Id: <20030322230131.GA813@ratsnest.hole>
249 ! lib/Encode/Guess.pm
252 Change 18989: Make the :bytes conditional on PerlIO.
253 further Modified by Dan Kogai
254 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
256 Chnage 18966: another fix for failing test on windows ("use encoding"
257 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
258 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
260 Change 18970: Hopefully this works also in Win32, by jhi
261 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
262 Change 18965: fix CJKT.t failures on windows due to incorrect
264 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
266 1.91 2003/03/09 20:07:37
268 even more proofread by jhi.
269 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
272 +use lib qw(t ext/Encode/t ../ext/Encode/t);
273 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
275 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
276 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
279 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
281 1.90 2003/03/09 17:32:43
284 Inaba-san has added a patch for perl 5.8.1 or later that makes
285 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
286 that. POD is further revised.
287 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
288 ! encoding.pm t/enc_module.t
289 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
290 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
294 signed vs. unsigned issue discovered by Craig on OpenVM
295 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
296 ! encoding.pm AUTHORS
297 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
298 Because binmode() stacks layers instead of overwrite, you have to
299 ":raw :encoding()" in encoding.pm or your are in trouble when you
300 call encoding.pm multiple times. There are several workarounds
301 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
302 who was the first to address this problem was added to AUTHORS.
303 The test suites was added for this, which is a modified version
304 of SUGAWARA-san's scripts
305 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
307 1.88 2003/02/20 14:42:34
309 one signedness nit for Encode by jhi
310 <200302161933.h1GJX876018710@kosh.hut.fi>
312 VISCII map was incorrect; fixed by Sadahiro-san
313 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
314 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
315 You can't unlink files that are opened in cygwin but the last
316 file handle opened in t/enc_*.t left open. Patch submitted
317 by Yitzchak and he was added to AUTHORS.
318 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
320 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
321 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
323 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
324 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
326 1.87 2003/02/06 01:52:11
328 * Inaba "Sensei" Hirohito added (I thought I have done so a long
329 ago but apparently I did not).
330 * SUZUKI Norio added for verious and useful bug reports.
331 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
332 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
333 podchecked so all warnings are gone except for L<http://>.
334 ! encoding.pm t/enc_eucjp.t
335 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
336 * Filter option overhaul
338 ! Encode.pm Encode.xs encengine.c Encode/encode.h
339 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
340 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
341 without filter. podchecked by Dan Kogai.
342 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
343 ! lib/Encode/Alias.pm
344 decode('alias', $1) went wild because of local $_ in find_alias()
345 the evil local $_ is eradicated but that changes find_alias()
346 format for coderef aliasing. See Encode::Alias for details
347 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
349 1.86 2003/01/22 03:29:07
351 * Don't forget to canonize when you attempt an exact match!
352 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
353 * ${^ENCODING} exception is off for $] > 5.008
354 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
356 $] check commented out so it runs on 5.8.0
358 1.85 2003/01/21 22:19:14
360 ${^ENCODING} exception is now explicit rather than handled by regex.
361 + t/enc_eucjp.t t/enc_utf8.t
362 Test suite for the better "encoding" pragma support for bleedperl.
363 On 5.8.0, they will just be skipped.
365 1.84 2003/01/10 12:00:16
367 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
368 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
369 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
371 %_ fixes saves the resulting .so .05% smaller, by NC
372 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
374 Silence Encode on undef, by Andreas
375 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
376 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
378 s/regognised/recognised/ . British spelling left intact to pay
379 respect to two British Nicks :)
380 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
382 1.83 2002/11/18 17:28:49
383 ! Encode.xs lib/Encode/JIS7.pm
384 Even more patches from Inaba-san has been applied. With this
385 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
386 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
388 1.82 2002/11/14 23:06:12
390 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
391 later found in perls w/ -Dusemymalloc was fixed by NC.
392 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
394 1.81 2002/11/08 18:29:27
395 ! Encode.pm Encode.xs
396 Non-XS version of Encode::utf8 is back (with XS being default).
397 Encode::predefine_encodings(0) to turn off XS.
398 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
399 found that it was Test::More causing the problem, not Encode.
400 But I have already made it configurable so it may be useful in
402 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
404 The ingenious patch by Nicholas Clark that reduces shlib sizes by
405 50% with no penalty and backward compatibility preserved, is in.
406 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
408 1.80 2002/10/21 20:39:09
409 ! Encode.xs t/mime-header.t
410 Even more patches from NI-XS regarding Encode::utf8->decode().
411 And one more test to t/mime-header.t to prove it
412 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
414 1.79 2002/10/21 06:05:37
416 Further patches from NI-XS. Encode::utf8->decode() now checks the
417 value of utf8 flag of the argument. As a result, the fix to
418 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
419 no harm (even speedwise) I'll leave it unreverted.
420 ! ucm/cp949.ucm ucm/cp950.ucm
422 U+00AE REGISTERED SIGN
423 were missing as a result of 1.78. Discovered by Moriyama-san.
424 Moriyama-san has also developed a test script that compares
425 (en|de)coded results to the corresponding Win32 API result and
426 all cp9?? maps are now verified.
427 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
429 1.78 2002/10/20 15:44:00
430 ! lib/Encode/MIME/Header.pm
431 fixed so that it works with new Encode::utf8
432 ! Encode.pm Encode.xs
433 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
434 :encoding(UTF-8) to handle partial chars at end of buffers
436 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
437 ! lib/Encode/Supported.pod
438 More nitpickings applied.
441 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
442 tests were completely useless. To redeem that and get the peace of
443 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
444 ucm/*.ucm Since this script takes too long to finish (30 seconds on
445 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
446 but you can easily run that by either renaming it or:
448 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
449 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
450 Other CJKT cp9?? also updated according to the URI below;
451 http://www.microsoft.com/typography/unicode/cscp.htm
452 + bin/ucmsort MANIFEST
453 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
454 to proper order. intended for hardcore develpers only.
455 ! ucm/cp932.ucm JP/JP.pm AUTHORS
456 CP932 mapping which was based upon the mapping file at unicode.org
457 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
458 has also supplied the patch so he was added to AUTHORS.
459 ! lib/Encode/Supported.pod
460 ISO-8859-11 != TIS 620
461 == TIS 620 + \xA0 ( )
463 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
465 1.77 2002/10/06 03:27:02
467 * Modified to accomodate up and comming patch by Inaba-san that
468 will fix tr/// needing eval qq{}
469 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
471 * pod fixes/enhancements to reflect the changes above
472 ! lib/Encode/Alias.pm
473 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
474 Message-Id: <20021001015648.GB18710@not.autrijus.org>
476 1.76 2002/08/25 15:09:51
478 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
479 but naturally identical to previous version -- dankogai
481 Codepoint fixes -- autrijus
482 Message-Id: <20020805040236.GC5220@not.autrijus.org>
484 copied everything under perl-5.8.0/ext/Encode to make sure Encode
485 is in sync w/ perl core
487 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
488 Now that binmode(FH) does implicit ":bytes" revisit
489 the failing tests. The worrisome one is the Digest::MD5
490 test-- how will it fare in CRLF lands now?
492 From: Radu Greab <radu@netsoft.ro>
493 Date: Mon, 10 Jun 2002 00:40:34 +0300
494 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
495 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
496 ones which I cannot figure out.
497 ! lib/Encode/Alias.pm
498 Subject: [Encode PATCH] spurious warning
499 From: Nicholas Clark <nick@unfortu.net>
500 Date: Sun, 2 Jun 2002 20:26:22 +0100
501 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
503 1.75 2002/06/01 18:07:49
504 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
505 glibc compliance cited by Autrijus.
506 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
507 ! bin/enc2xs bin/piconv
508 Subject: Re: forewarning: usedevel and versiononly
509 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
511 1.74 2002/05/28 18:33:15
512 + ucm/null.ucm ucm/ctrl.ucm
513 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
514 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
515 Subject: Re: unicode -> &# notation
516 Message-ID: <20020518193704.GB40272@not.autrijus.org>
518 1.73 2002/05/28 17:26:18
519 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
520 Chris Nandor has fixed Encode so that it works w/ MacPerl --
521 at least w/ PPC (68k need static linking which does not work due to
522 64k limit). pudge is added to AUTHORS (I'm surprised he was not
523 there in the list). Encode/Makefile_PL.e2x was additionally fixed
524 by dankogai to reflect changes in other Makefile.PL
525 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
527 Subject: Change 16746: -Mutf8 cleanup.
528 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
530 1.72 2002/05/20 15:49:56
532 Subject: [PATCH] Encode should be in perl-core library path
533 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
534 Message-Id: <20020520161201.A11019@alpha.hut.fi>
535 ! lib/Encode/MIME/Header.pm
536 Subject: [PATCH] Encode::MIME::Header
537 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
538 ! Encode/Makefile_PL.e2x
539 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
540 Message-Id: <20020519201031.GA1603@not.autrijus.org>
541 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
542 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
544 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
545 Subject: [PATCH] good day for WinCE port of perl.
546 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
549 Even more linting by Robin via @16532
551 Even more typecast by Sarathy in @16460
553 1.71 2002/05/07 16:22:42
555 even more typecasts by Robin
556 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
558 A very strange bug that was causing a bugus ucm -> C table
559 generation that was revealed by a UCM file that Andreas was
560 working. This is the king of wierdest bug I've encountered
561 in the course of Encode maintenance.
562 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
564 1.70 2002/05/06 10:26:48
566 Made more 'module-safe' with conjunction w/ 'no encoding'.
567 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
568 ! lib/Encode/Encoding.pm
569 'require Encode' because ->Define uses Encode::define_encoding();
570 problem and solution addressed by Miyagawa-kun
571 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
573 Cuts the frill to make djgpp happier, as suggested by Laszlo
574 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
576 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
577 Message-Id: <m3bsbug48n.fsf@anima.de>
579 1.69 2002/05/04 16:41:18
580 ! lib/Encode/MIME/Header
581 Floating-point coerced for UNICOS (in integer arithmetics it folds
582 line one character too early). Verification by Mark is pending.
583 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
585 more doc patch from Elizabeth
586 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
587 ! Encode/Makefile_PL.e2x
588 More platform-independent patch from Benjamin
589 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
590 ! lib/Encode/Guess AUTHORS
591 split regex fix by Graham Barr. Adds him to AUTHORS.
592 Message-Id: <20020504085419.E95940@valueclick.com>
593 ! Encode/Makefile_PL.e2x
594 enc2xs script discovery made smarter and more sensible, first cited
595 by Miyagawa-kun and further suggestions by Rafael and Andreas
596 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
597 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
599 1.68 2002/05/03 12:20:13
600 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
601 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
602 and Supported.pod modified to reflect the change. Elizabeth added
603 to Authors. And H.M. is also added for forwarding her patch among
604 other contributions (I was rather surprised to find his name was not
606 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
608 1.67 2002/05/02 07:33:09
610 Error message now consistent w/ perlqq (\N{U+} -> \x{})
611 done in perl@16308 but Philip linted me further. Now the error
612 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
613 ! lib/Encode/Guess.pm
614 Sanity check for happier -w by Autrijus
616 1.66 2002/05/01 05:41:06
617 ! Encode.xs t/fallback.t
618 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
619 while fallback is in effect. This even came with a welcome side-effect
620 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
621 also modified to test this.
622 And of course, the corresponding varialbles to UV[Xx]f are appropriately
623 cast. This should've concluded NI-XS homework.
625 encode(undef) does warn again! Repented upon suggestion by NI-XS.
626 Document for unless vs. '' added
627 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
629 1.65 2002/04/30 16:13:37
631 encode(undef) no longer warns for C<Use of uninitialized value in
632 subroutine entry>. Suggested by Paul.
633 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
634 ! lib/Encode/Supported.pod
635 Encode::MIME::Header and Encode::Guess mentioned
636 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
637 ! lib/Encode/Guess.pm
638 POD fix by Miyagawa-kun
639 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
641 1.64 2002/04/29 06:54:06
643 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
644 encocde("euc-jisx0213") are still DIFFERENT.
645 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
647 A few white spaces corrected by NI-XS via PerlIO integration to
649 Subject: Change 16247: Integrate perlio;
651 Document fixes by Andreas
652 Message-Id: <m3k7qsf1we.fsf@anima.de>
654 1.63 2002/04/27 18:59:50
655 ! lib/Encode/Encoding.pm
656 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
657 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
658 Make use of the Encode::Encoding base class!
659 And other cleanups in Encode.xs upon NI-XS suggestions
660 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
662 1.62 2002/04/27 11:17:39
664 encodings() now just check %ExtModule instead of eval{require}
665 all of them for ":all" to conserve more memory.
667 more "%x" -> "%" UVxf stuff.
669 s/=over2/=over 2/g # oops.
671 1.61 2002/04/26 03:02:04
673 Now does decent tests besides use_ok()
674 ! lib/Encode/Guess.pm t/guess.t
675 UI streamlined, document added
677 various signed/unsigned mismatch nits (#16173)
678 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
680 POD: utf8-flag-related caveats added. A few sections completely
684 Thou shalt not assume %d works, either!
685 Robin Baker added to AUTHORS for this
686 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
688 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
690 1.60 2002/04/24 20:06:52
692 "Thou shalt not assume %x works." -- jhi
693 Message-Id: <20020424210618.E24347@alpha.hut.fi>
694 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
695 low-memory build machines happy, now *.c is created for each *.ucm
696 (no table aggregation). You can still override this by setting
697 $ENV{AGGREGATE_TABLES}.
698 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
699 + lib/Encode/Guess.pm
700 + lib/Encode/JP/JIS7.pm
701 Encoding-autodetect (mainly for Japanese encoding) added. In a
702 course of development, JIS7.pm was improved.
703 + lib/Encode/HTML/Header.pm
704 + lib/Encode/Config.pm
705 MIME B/Q Header Encoding Added!
706 ! Encode.pm Encode.xs t/fallback.t
707 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
708 Message-Id: <20020424130709.GA14211@tanglefoot>
710 1.59 $ 2002/04/22 23:54:22
711 ! Encode.pm Encode.xs
712 needs_lines() and perlio_ok() are added to Internal encodings such
713 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
715 1.58 2002/04/22 23:54:22
717 s/MacChineseSimp/MacChineseTrad/ # ... oops.
722 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
723 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
724 t/(JP|KR).t are aggregated to t/CJKT.t
725 test data are all remade via bin/ucm2text.
726 And .... They are no longer skipped for -Uuseperlio !
728 1.57 2002/04/22 20:27:30
729 ! t/JP.t t/KR.t t/perlio.t
730 unless (find PerlIO::Layer 'perlio') ... line is back again.
731 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
732 need PerlIO. Sigh....
733 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
734 ->perlio_ok now does eval{ require PerlIO::encoding } there so
735 it correctly returns 1 when PerlIO::encoding is yet loaded.
737 perl-current patch #16072 reflected
739 1.56 2002/04/22 09:48:07
740 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
741 New PerlIO::encoding 0.04 compliance met
743 1.55 2002/04/22 03:43:05
744 ! Encode.pm Encode.xs Unicode/Unicode.pm
745 needs_lines() defined so Encode::Encoding is no longer needed
748 1.54 2002/04/22 02:50:01
749 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
750 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
751 now perlio_ok is true by default if PerlIO::encoding->VERSION is
752 0.03 or larger. POD in Encode::Encoding revised to reflect this.
753 Encode::XS and Encode::Unicode now has perlio_ok() method.
754 ! lib/Encode/Supported.pod
755 s/UP-UX/HP-UX/ by jhi
756 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
757 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
758 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
759 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
760 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
761 ! lib/Encode/Supported.pod
762 Huge document fixes by Philip.
765 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
766 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
768 binmode() after "<:encoding" to make Win32 happy, by Mattia.
769 Mattia added to AUTHORS file
770 Message-Id: <3CC3150F.5798.22A05AE@localhost>
772 1.52 2002/04/20 23:43:47
774 TODO: is now SKIP:, as NI-XS requested. Also adds more
775 eraborate failure analysis added.
777 A note on how to make sure of round-trip safety added to POD
778 section (so Autrijus is happier)
779 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
780 big5-(eten|hkscs) is round-trip safe again!
781 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
783 Typo fixes by Andreas
784 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
785 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
786 PerIO coodination patches from NI-XS.
787 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
789 1.51 2002/04/20 09:58:23
791 Updated test suite by Autrijis so "make test" is happy again
792 Message-Id: <20020420082104.GA25037@not.autrijus.org>
794 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
796 TW/TW.pm TW/Makefile.PL
797 Updates by Autrijus. 'big5' is no longer a canonical but an
798 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
799 Message-Id: <20020419195346.GA19597@not.autrijus.org>
801 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
802 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
804 PerlIO detection a little bit smarter; no longer uses eval qq{}
807 1.50 2002/04/19 06:13:02
808 ! ! Encode.pm Encode.xs Encode/encoding.h
810 New Fallback API imlemented and documented. See "perldoc Encode"
812 ! lib/Encode/JP/JIS7.pm Encode.pm
813 + lib/Encode/PerlIO.pod t/perlio.t
814 API compliance met. However, it still does not work unless perlio
815 implements line buffer. See BUGS section in perldoc Encode::PerlIO
816 As a sensible workaround, perlio_ok() added to Encode.
818 ! lib/Encode/Supported.pod
820 Message-Id: <20020418174647.J8466@alpha.hut.fi>
822 Doc fixes from Autrijus
823 Message-Id: <20020418144131.GA10987@not.autrijus.org>
825 perlqq mode documented
827 + t/jisx0201.euc t/jisx0201.ref
828 ! t/jisx0208.euc t/jisx0208.ref
829 t/JP.t tests more rigorously and with other encodings
830 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
831 in jis0208 (used to contain jisx0201 part).
832 ! Encode/Makefile_PL.e2x
833 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
834 enc2xs and encode.h rather than hard-coded. This allows the resulting
835 module fully CPANizable.
836 ! encoding.pm t/JP.t t/KR.t
837 PerlIO detection simplified (checks %INC instead of eval{})
838 ! Encode.xs Encode/encode.h
839 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
840 - lib/Encode/Unicode.pm
841 (en|de)code_xs relocated to where it belongs. Source reindented
844 Additional (U8 *) cast added as suggested by jhi
845 Message-Id: <20020417165916.A28599@alpha.hut.fi>
847 1.42 Date: 2002/04/17
849 no-op module; Thought of adding a pod there but enc2xs has
854 correct mechanism to detect Perlio::encoding layar installed.
856 PerlIO Layer detached.
858 1.41 2002/04/16 23:35:00
860 binmode(STDIN|STDOUT ...) done iff PerlIO is available
862 Cleaned up PerlIO skip conditions to prepare for the upcoming
863 Encode - PerlIO forking.
865 exported functions are now prototyped.
866 ! lib/Encode/CN/HZ.pm
869 fallback implemented # was /* FIXME */
870 affected programs revised to fit (only HZ was using the try-catch
871 approach which needed to be fixed for API-compliance).
873 ! Encode/KR/2022_KR.pm
875 can find =head1 NAME now, jhi
876 Message-Id: <20020416083059.V30639@alpha.hut.fi>
880 now complies with less warnings with the pickest compilers.
881 Suggested by Craig, fixed by Dan.
882 ! Encode/Makefile_PL.e2x
884 A bug that fails to find *.e2x in certain conditions fixed
886 1.40 2002/04/14 22:27:14
887 + Encode/ConfigLocal_PM.e2x
888 ! lib/Encode/Config.pm
890 "enc2xs -C" now generates/updates Encode::ConfigLocal.
891 ConfigLocal_PM.e2x is a skelton thereof.
892 ! lib/Encode/Config.pm
894 "use Encode::CN::HZ;" was missing.
897 More rigorous tests added to test XS, especially on memory allocation.
899 ! lib/Encode/Unicode.pm
900 NI-S implemented an XS version -- merged
901 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
904 Source filter option added. With this option on, you can write
905 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
906 encodings. t/jperl.t enhanced to test this feature.
908 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
909 apply his suggestion, this degree of nitting is enough to add him
911 Message-Id: <3CB93223.291E5E2E@earthlink.net>
913 + lib/Encode/JP/JIS7.pm
914 - lib/Encode/JP/JIS.pm
915 - lib/Encode/JP/2022_JP.pm
916 - lib/Encode/JP/2022_JP1.pm
917 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
918 JIS7.pm for better maintainability and performance
920 Added caveat for non-ascii identifiers.
922 fixes by jhi, the original author of this pragramtic module.
923 Message-Id: <20020413231527.V1826@alpha.hut.fi>
925 1.34 2002/04/12 20:23:05 (Unreleased)
928 EBCDIC fixes addressed by jhi.
929 Message-Id: <20020412161844.D9383@alpha.hut.fi>
930 ! lib/Encode/Encoder.pm
931 POD fix by Miyagawa-kun
932 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
934 1.33 2002/04/10 22:28:40
936 Philip's mail address corrected.
939 ! lib/Encode/Encoder.pm
940 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
941 This adds Miyagawa-kun to AUTHORS.
942 * encoding() no longer exported by default but on demand
943 * t/Encoder.t updated to test all these
944 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
945 ! lib/Encode/Unicode.pm
946 ! lib/Encode/Supported.pm
947 Further doc fixes by Anton
949 1.32 2002/04/09 20:06:15
952 - ucm/macDevanaga.ucm Unicode Character Map
953 - ucm/macGujarati.ucm Unicode Character Map
954 - ucm/macGurmukhi.ucm Unicode Character Map
955 A utility to check integrity of .ucm files. t/bogus.ucm is a
956 ucm that is deliberately bogus. unused Indic mappings are removed
959 resolve_alias() added as suggested by jhi. Same as
960 find_encoding("alias")->name. For convenience. This one is
961 defined in Encode.pm instead of Alias.pm.
962 Message-Id: <20020409215846.H17022@alpha.hut.fi>
964 Memory Allocate but detected during the devel of ucmlint -- fixed.
965 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
966 ! lib/Encode/Unicode.pm
967 valid_ucs2(0) is false but must be true.
968 3 patches from NI-S as follows. This also has fixed the incident
970 ! lib/Encode/Alias.pm
971 find_alias() recursion prevention
973 Checks for the patch above
974 ! t/Encode/Unicode.pm
975 An extra "F" that causes valid_ucs2() return a bogus value fixed
976 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
977 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
978 2 Small Patches from jhi as follows:
980 Encode->encodings() lists in case-insensitve order (as it was)
982 -l option prints avaiable encodings to STDOUT instead of STDERR
983 ! lib/Encode/Aliases.pm
984 s/defintion/definition/
985 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
987 ! lib/Encode/Supported.pod
988 ! lib/Encode/Unicode.pm
989 POD revise by Philip Newton. This adds Philip to AUTHORS list.
990 Thank you for the exact quote of Douglas Adams :)
991 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
993 1.31 2002/04/08 18:08:07
994 ! lib/Encode/Encoder.pm
996 Encode::Encoder, once just a placeholder of an idea, is now much more
997 practical. See t/Encode.t to find how practical it can be.
998 + lib/Encode/Config.pm
1000 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1001 in Encode::Config and they are all literally, not programatically
1002 set. My false laziness was resulting many encodings missing from
1004 ! lib/Encode/Unicode.pm
1006 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1007 so that it does not rely Encode::Unicode for BOM values
1008 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1010 1.30 2002/04/08 02:34:51
1011 + lib/Encode/Encoder.pm
1012 Object Oriented Encoder. I reckon something like this is in need.
1015 ! lib/Encode/Supported.pod
1016 * autoloading bug that prevented upper-case canonicals such as UTF-16
1017 is fixed. Now even UTF/UCS are autoloaded!
1018 * encodings() is now more intuitive.
1019 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1021 * Obligatory fixes to the POD.
1022 ! lib/Encode/Supported.pod
1023 Patch from Anton applied.
1024 Message-Id: <66641479.20020408033300@motor.ru>
1026 ! lib/Encode/Unicode.pm
1027 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1029 1.28 2002/04/07 18:58:42
1033 Just a MANIFEST for those missing files.
1035 1.26 Date: 2002/04/07 15:22:04
1038 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1039 And t/Aliases.t fixed to make sure they all exist.
1041 1.25 2002/04/07 15:01:25 (Unreleased)
1043 ! lib/Encode/Unicode.pm
1046 - lib/Encode/UTF_EBCDIC.pm
1047 - lib/Encode/Internal.pm
1048 - lib/Encode/utf8.pm
1049 Integrated into Encode.pm as closures. That way "one package, one file"
1050 rule is preserved yet less files to require.
1052 commented out binmode(STDERR ...
1056 ! EBCDIC/Makefile.PL
1059 ! Symbol/Makefile.PL
1061 ! Encode/Makefile_PL.e2x
1062 Schwarn's MM-compliance patch merged
1063 Message-Id: <20020406082609.GA28758@blackrider>
1065 ! lib/Encode/Unicode.pm
1066 + lib/Encode/UTF_EBCDIC.pm
1068 - lib/Encode/10646_1.pm
1069 - lib/Encode/ucs2_le.pm
1070 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1071 per-module based (en|de)code, I saved a number of .pm by
1072 reorganizing it as per-object base (Well, this is what Encode::XS
1073 does under the hood). See Encode::Unicode for details.
1074 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1075 This module is used only on EBCDIC environments.
1077 1.21 2002/04/05 14:46:34 (Not Released)
1083 Are back to make Perl/Tk happy Smile, NI-S.
1085 ! lib/Encode/Alias.pm
1086 ! lib/Encode/Supported.pm
1087 ! lib/Encode/10646_1.pm
1088 ! lib/Encode/ucs2_le.pm
1089 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1090 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1091 updated to reflect changes.
1092 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1093 ! lib/Encode/Alias.pm
1094 ! lib/Encode/Supported.pm
1095 Anton's revision commited. Added Dan's own fixes as well.
1096 Message-Id: <159103166906.20020405161134@motor.ru>
1097 ! lib/Encode/Alias.pm
1099 < qr/^UCS2-le$/i => '"UCS-2"', );
1101 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1102 Sigh. Thank you, Anton.
1103 Message-Id: <14567692196.20020405062020@motor.ru>
1104 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1106 1.20 2002/04/04 19:50:52
1108 the last minute addtion. Just give it a try. Docs remains to be done.
1109 Not installed by default.
1110 ! lib/Encode/Supported.pod
1113 ! lib/Encode/Alias.pm
1114 ! lib/Encode/utf8.pm
1115 ! lib/Encode/10464_1.pm
1116 ! lib/Encode/ucs2_le.pm
1117 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1118 unchanged but UTF-16BE is added as an alias. Implicit aliases
1119 move to Encode::Alias so init_alias() works more as expected.
1120 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1121 Though pedantically wrong, This should make perl mongers happier.
1122 t/Alias.t is enhanced to test all these.
1123 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1125 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1126 of the codepoints are common. CJKT left as is because the saving is
1130 ! EBCDIC/Makefile.PL
1132 ! Encode/Makefile_PL.e2x
1136 ! Symbol/Makefile.PL
1140 All occurance of _def.h replaced with .exh so djgpp works happily
1141 ever after! To credit this amazing discovery, Laszlo is now in
1143 Message-Id: <20020403181424.GA8778@freemail.hu>
1144 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1147 ! Encode/Makefile_PL.skel
1149 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1150 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1152 Two more tests by added jhi
1153 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1156 The showstopper fixed -- Memory reallocation bug was causing
1157 Encode::XS to fall into infinite loop on certain conditions.
1158 t/grow.t tests that.
1159 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1164 ! lib/Encode/Supported.pod
1165 Vendor encodings rebuilt out of original map files at unicode.org.
1166 Indic languages such as MacDevanagali remain unspported do to the
1167 shortcoming of encengine capabilities (they need algorithmical
1168 conversion and I have no knowledge on that!). Pods fixed for added
1170 Oh, macJapan.ucm renamed to macJapanese.ucm.
1171 macROMnn is macRomanian and macRUMnn is macRumanian.
1172 txt2ucm is a crude script that is used to convert them.
1174 Unicode Compound Characters (used extensively on Mac) supported
1176 Typo fixes and improvements by jhi
1177 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1179 1.11 2002/03/31 22:12:13
1183 Missing files from the MANIFEST fixed.
1184 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1185 Version incremented just to make CPAN happy.
1187 1.10 2002/03/31 21:32:42
1190 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1191 if you want. This should make Autrijus happy. Also, piconv
1192 is added to default install.
1195 Here-documented files that enc2xs generates are now exported
1196 to *.e2x. Much cleaner and easier to debug.
1198 encoding enhances so you can make it act more like such
1199 (now prehistoric ) "localized" variations of perl like Jperl.
1201 Further test for encoding.pm. Written in euc-jp
1204 Taken over form jhi.
1205 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1210 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1217 ! Encode/macIceland.ucm
1218 ! lib/Encode/Alias.pm
1219 ! lib/Encode/Supported.pod
1220 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1221 Message-Id: <m3lmcavhjt.fsf@anima.de>
1223 1.01 2002/03/29 20:59:39
1226 s/USE_SCRIPTS/MORE_SCRIPTS/
1228 installs enc2xs by default for external Encode:: modules in CPAN,
1229 such as Encode::HanExtra
1231 More sensible perl core detection via $ENV{PERL_CORE}
1233 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1235 Perl core ditection via $^X =~ m/\bminiperl$/o
1236 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1238 1.00 Wed Mar 29 2002
1240 The version of all files is updated to 1.00 via "ci -f -l1.00",
1241 commemorating version 1.00. All files, including *.ucm are now
1242 under version control.
1245 encode.h moved to Encode/ so it will be installed for the later
1248 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1253 compile renamed to enc2xs.
1254 Affected Makefle.PL updated
1256 "Punt it. HanExtra can take care of that later." -- Autrijus
1257 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1260 ! Encode/ksc5601.ucm
1261 ! lib/Encode/CJKConstants.pm
1262 ! lib/Encode/KR/2022_KR.pm
1263 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1264 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1268 bin/ added for example scripts. They are not installed by default.
1269 to install them, "perl Makefile.PL USE_SCRIPTS".
1270 piconv is iconv reinvented in perl. in addition to all features
1271 of iconv, it also adds perlish features. See L<piconv/1> for more
1273 ! lib/Encode/Alias.pm
1274 qr/^ replaced with qr/\b so it directly matches locale names
1275 such as en_US.US-ASCII
1278 Patch by MJD to fix the following problem applied.
1279 Subject: [PATCH 5.7.3 Encode]
1280 Aliases.t not properly skipped when Encode extension not built
1281 Message-Id: <20020328091850.18677.qmail@plover.com>
1282 ! lib/Encode/KR/2022_KR.pm
1283 ! lib/Encode/CJKConstants.pm
1284 Another patch from Jungshik to make iso-2022-kr actually work
1285 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1286 ! Encode/Encode/euc-kr.ucm
1287 + Encode/Encode/johab.ucm
1288 ! Encode/Encode/ksc5601.ucm
1290 ! Encode/KR/Makefile.PL
1291 ! Encode/lib/Encode/Alias.pm
1293 Johab support and complete revision of Korean Encoding by Jungshik
1294 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1296 Revised to make up with now-dropped Encode::Details.
1297 - lib/Encode/Details.pod
1298 Dropped. Besides being obsolete, the topics are now covered in
1299 respective pods now.
1304 Korean aliases fixed thanks to Jungshik Shin
1305 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1306 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1308 =head1 NAME added to all modules to make buildtoc happy
1309 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1310 - lib/Encode/CJKguide.pod
1311 Too controversial and dropped from the dist. Will be available
1312 separately on the web.
1314 RCS tags added so table debugging gets easier (should that be
1315 needed! I hope they all stay 1.00!)
1316 + lib/Encode/CJKguide.pod
1317 A detailed guide to mainly, but not limited to, CJK multibyte
1320 + Encode/hp-roman8.ucm
1322 ! Encode/Supported.pod
1323 All occurance of "roman8" replaced with "hp-roman8" to avoid
1325 ! Encode/Supported.pod
1328 Mac Encodings now comply the Inside Macintosh
1330 Test for '-raw' conventions added.
1332 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1333 ! Encode/gb12345.ucm
1335 ! Encode/ksc5601.ucm
1336 "-raw" appended to canonical names.
1337 File mames stay unchanged thanks to UCM format.
1338 ! lib/Encode/CN/HZ.pm
1339 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1340 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1342 0.99 Tue Mar 26 2002
1343 - lib/Encode/JP/Const.pm
1344 + lib/Encode/CJKConstants.pm
1345 + lib/Encode/CN/2022_CN.pm
1346 + lib/Encode/KR/2022_KR.pm
1355 * Support for ISO-2022-KR and ISO-2022-CN added.
1357 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1358 * ucm2table autogenerates character table out of UCM files.
1361 - lib/Encode/Supports.pod
1362 + lib/Encode/Supported.pod
1363 Names reverted due to popular demand.
1364 8.3 rule applies only when there is a conflict.
1365 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1370 - lib/Encode/Format/Enc.pod
1372 * Character tables is now 100% ucm.
1373 * All files under Encode/ is now 8.3-compliant
1374 * some of missing encodings added (i.e. gsm0338 and nextstep)
1375 * Vendor mappings aggregated with appropriate national std in
1376 Makefile.PL, resulting smaller *.so especially for CJK.
1377 Following is result on Dan's FreeBSD box.
1379 ---------------------------------------------------------------
1380 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1381 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1382 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1383 blib/arch/auto/Encode/Encode.so 27,791 27,791
1384 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1385 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1386 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1387 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1388 ---------------------------------------------------------------
1389 Total 5,375,297 6,343,819
1391 * As a result of ucm-transition, Encode::Tcl dropped because
1392 Encode::Tcl demands *.enc.
1393 Encode::Tcl will be supplied in a separate tarball with *.enc.
1394 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1399 -lib/Encode/Supported.pod
1400 +lib/Encode/Supports.pod
1401 -lib/Encode/iso10646_1.pm
1402 +lib/Encode/10646_1.pm
1403 -lib/Encode/EncFormat.pod
1404 +lib/Encode/Format/Enc.pod
1405 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1406 - lib/Encode/JP/Constants.pm
1407 + lib/Encode/JP/Consts.pm
1408 ! lib/Encode/JP/JIS.pm
1409 ! lib/Encode/JP/H2Z.pm
1410 Version nit problem and 8.3 rule fix.
1411 > Package namespace installed latest in CPAN file
1412 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1413 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1414 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1415 accordingly. In addition, legacy "use vars qw()..." are replaced with
1417 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1418 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1420 - lib/Encode/JP/ISO_2022_JP.pm
1421 - lib/Encode/JP/ISO_2022_JP_1.pm
1422 + lib/Encode/JP/2022_JP.pm
1423 + lib/Encode/JP/2022_JP1.pm
1425 8.3 naming conflict for vanilla fat addressed by jhi
1426 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1429 Typecast fix addressed by jhi
1430 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1432 0.98 Mon Mar 25 2002
1433 ! lib/Encode/Supported.pod
1435 + lib/Encode/JP/ISO_2022_JP_1.pm
1436 ! lib/Encode/JP/ISO_2022_JP.pm
1437 ! lib/Encode/JP/JIS.pm
1439 Now Encode::JP is more strict on the difference between ISO-2022-JP
1440 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1441 makes Anton happier :) FYI the previous version implements
1442 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1443 ! lib/Encode/Supported.pod
1446 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1447 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1452 ! lib/Encode/Suppoted.pod
1453 pod fixes to replace F<http://...> to L<http://...>,
1454 as suggested by Autrijius in:
1455 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1456 ! lib/Encode/Suppoted.pod
1457 fixes and enhancements by Anton
1458 Message-Id: <10632060120.20020324103753@motor.ru>
1459 ! lib/Encode/Alias.pm
1460 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1461 added. Suggested by Anton then deobfuscated by Autrijius
1462 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1464 Further fix by Nicholas Clark
1465 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1466 - lib/EncodeFormat.pod
1467 + lib/Encode/EncFormat.pod
1469 File renamed as suggested by Autrijius
1471 ! lib/Encode/Details.pod
1472 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1473 ! Encode.pm Sun Mar 24 13:43:47 2002
1474 pod fixes by Autrijius.
1475 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1476 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1478 ! lib/Encode/Alias.pm
1480 now more EBCDIC conscious;
1481 %ExtModules on EBCDIC system excludes CJK so that you don't
1482 have to worry about the matched alias resulting cloaking.
1483 t/Alias.t also revised to reflect changes. Verified by jhi
1484 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1486 0.97 Sun Mar 24 2002
1490 EBCDIC detection mechanism installed as in JP/JP.pm
1491 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1494 ! EBCDIC/Makefile.PL
1497 ! Symbol/Makefile.PL
1499 Now all table files used by compile are postfixed '_t' to avoid
1500 namespace collisions in case insensitive file systems once for all!
1502 Message-ID: <58290227735.20020323195659@familiehaase.de>
1504 Since the Encode::JP is unsupported under EBCDIC we
1505 cannot run this test (aliases as such should work fine) -- jhi
1506 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1508 duplicate occurance of ascii.ucm and 8859-1.ucm
1509 causes MacOS X dlyd to cloak
1515 < chdir 't' if -d 't';
1517 > if (! -d 'blib' and -d 't'){ chdir 't' };
1518 When you are "make test"-ing on Encode/ directory, you must not
1519 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1520 remain unchanced. Also the situation detection was made simpler
1521 in t/JP.t, which was originally;
1522 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1524 "Use of uninitialized value in string eq at Encode.pm line 96."
1525 ! Symbol/Makefile.PL
1526 ! EBCDIC/Makefile.PL
1528 -- Problem on case insensitive file systems
1529 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1530 Message-ID: <88254111953.20020323095503@familiehaase.de>
1533 "So I think it's a bug in gcc, not perl. But it still needs to be
1535 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1536 Message-Id: <20020323170509.C96475@plum.flirble.org>
1538 0.96 Sat Mar 23 2002
1540 ! lib/Encode/Encoding.pm
1541 ! lib/Encode/Alias.pm
1542 ! lib/Encode/Supported.pod
1544 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1545 Message-ID: <20020322073908.GB10539@blackrider>
1548 "...I think we should include ISO 8859-1 as well." -- NI-S
1549 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1554 ! lib/Encode/Alias.pm
1555 alias definitions relocated to Encode::Alias so module autoloading
1556 works for aliases also.
1558 encodings() now accepts args to check ExtModules.
1562 + EBCDIC/Makefile.PL
1563 + Symbol/Makefile.PL
1567 Latin and single byte encodings are reorganized so they are
1568 demand-loaded like Encode::XX. Now only ascii is compiled into
1570 ! lib/Encode/Alias.pm
1571 for my $k (keys %hash){ delete $hash{$k}; }
1572 is depreciated; fixed.
1574 0.95 Fri Mar 22 2002
1575 In this update, pod rewrites and alias fixes are the main issues
1576 + lib/Encode/Supported.pod
1577 Describes supported encodings
1579 streamlined compiled-in encodings.
1580 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1582 + Encode/ibm-125?.ucm
1583 Added from icu distibution with any occurance of
1584 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1585 some respect to icu staff, however.
1586 + lib/Encode/Alias.pm
1588 Alias difinitions in Encode.pm relocated.
1591 packWARN patch from Paul Marquess via jhi
1592 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1593 Paul added to AUTHORS as a result.
1594 ! t/CJKalias.t -> t/Aliases.t
1595 Renamed. Checks even more aliases and alias overloading
1598 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1599 gbk => cp936 relocated to CN.pm
1601 Test::More with plans (by jhi)
1603 0.94 Thu Mar 21 2002
1604 + lib/Encode/Description.pod
1605 ! lib/Encode/Encoding.pm
1606 Now the pod in Encode.pm is abridged as programming references.
1607 lib/Encode/Description.pod contains the original, detailed description
1608 and Encode::Encoding explains how to write your own module to
1609 add new encodings. So far, lib/Encode/Description.pod contains
1610 the whole pod once in Encode.pm. This is intentional.
1612 Pod revisions by Anton Tagunov
1613 Message-Id: <517178431.20020320174824@motor.ru>
1615 all occrance of Encode::Tcl::Extended removed including pod
1617 test now checks $encoding->name only; $encoding->{name} are
1618 no longer check to find the canonical name.
1619 ! lib/Encode/JP/JIS.pm
1620 ! lib/Encode/JP/ISO_2022_JP.pm
1621 ->name() added to be more compliant with API
1627 Patch by Autrijus to add aliases to TW and fixes to POD
1628 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1630 SADAHIRO Tomoyuki added as should. My apologies.
1632 0.93 Wed Mar 20 2002
1633 * First release to be uploaded to CPAN. For prehistoric changes,
1634 please see Changes file of perl distibution as well as
1635 perl-unicode@perl.org archive, available at:
1636 http://archive.develooper.com/perl-unicode@perl.org/
1638 Changes Since 0.92 includes;
1643 + Mention to perl-unicode@perl.org added
1645 + Encoding aliases added so you can feed locale names
1646 and MIME Charset="" directly.
1647 - Mention to JISX0212 removed because it's fixed
1650 + Encoding aliases added. Note TW is left untouched because
1651 euc-tw is not implemented in TW but in Encode::HanExtra.
1652 Autrijus, you may fix Encode::HanExtra.
1654 + to test encode aliases added