1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 2.7 2004/10/22 19:35:52 dankogai Exp $
5 $Revision: 2.7 $ $Date: 2004/10/22 19:35:52 $
6 ! lib/Encode/Encoding.pm
7 "Remove Carp from warnings.pm" that influences Encode, by Tels.
8 Message-Id: <200410161618.29779@bloodgate.com>
9 ! Encode.xs AUTHORS t/fallback.t
10 Now Encode::utf8's fallbacks are compliant to Encode standard.
11 Thank Bjoern Hoehrmann for persistently convincing me.
12 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de>
16 2.06 2004/10/22 06:23:11
18 RT #8083 reports that MacThai mapping was obsolete
19 Updated all mac* encodings accordingly to the URI below.
20 One remaining mystery is that MacRomanian vs. MacRumanian.
21 MacRumanian is not found in unicode.org...
22 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
23 ! Encode.pm t/Encode.t
24 Fixed RT #8081: "decode(..., bless{},'x') segfault"
25 Two more tests added to test that.
26 http://rt.cpan.org/NoAuth/Bug.html?id=8081
28 POD revised accordingly to RT #7966
29 http://rt.cpan.org/NoAuth/Bug.html?id=7966
31 POD updated explaining why Encode::Unicode always croaks on error
32 rather than giving users choices.
33 http://rt.cpan.org/NoAuth/Bug.html?id=7892
35 2.05 2004/10/19 04:55:01
37 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai.
38 Message-ID: <41210A84.6060506@iki.fi>
39 Message-ID: <20041018233442.7418113f@r2d2>
40 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp>
42 2.04 2004/10/16 21:22:44
44 From: craigberry@mac.com
45 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit
46 Message-Id: <41716868.7000102@mac.com>
48 2.03 2004/10/06 05:07:20
50 Resolved some alias case sensitivity glitches reported via RT.
51 http://rt.cpan.org/NoAuth/Bug.html?id=7835
53 Resolved Win32 glitches reported via RT.
54 (Fixed by dankogai and tested by Steve Hay)
55 http://rt.cpan.org/Ticket/Display.html?id=7831
56 ! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS
57 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay.
58 http://rt.cpan.org/NoAuth/Bug.html?id=6695
60 2.02 2004/08/31 10:55:34
61 ! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf
62 New map submitted by Deng Liu and Autrijus. Test data needed
63 to be upgrade as well, done by dankogai
64 Message-Id: <20040824204828.GB6999@aut.dyndns.org>
66 Now works for characters U+10000 and above. This fix was needed
67 to "tidy" the original map that was submitted.
69 "ucmsort" now mentioned in pod
71 2.01 2004/05/25 16:27:14
73 From: domo@computer.org
74 Subject: [PATCH] Correct statistics from enc2xs
75 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org>
76 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org>
78 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm
79 <200405271148.i4RBm4KY026529@mail.mvnet.de>
81 2.01 2004/05/25 16:27:14
82 ! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm
83 "If someone thinks utf8::upgrade($1) should be croaked like
84 chom?p($1),please try the following patch for Encode.pm."
86 <20040522212704.C068.BQW10602@nifty.com>
88 2.0 2004/05/16 20:55:15
89 * version updated to 2.00
90 -- sorry, no big feature change. I just hate version 1.100 :)
93 addressed UTF-(8|32LE) + BOM misguessing
94 https://rt.cpan.org/Ticket/Display.html?id=6279
96 s/is_utif8/is_utf8/ in POD
97 ! Encode/lib/Encode/CN/HZ.pm
98 Fixes "make test" failure after the patch to pp_hot.c
100 Message-Id: <20040222182357.6B39.BQW10602@nifty.com>
102 From: autrijus@autrijus.org
103 Subject: [PATCH] "piconv -C 512" badly broken
104 Message-Id: <1072870210.769.5.camel@localhost>
106 1.99 2003/12/29 02:47:16
108 find_encoding("UTF-16BE")->encode("abc") now null terminates
109 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
111 prototype bug in decode_utf8() fixed
112 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
113 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
114 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
116 Merged from maintperl@21987
118 1.98 2003/08/20 11:15:31
119 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
120 Dave Evans has found and corrected a bug in Encode::MIME::Header.
121 Test suite added by Dan Kogai.
122 Message-Id: <3F43440B.7060606@rudolf.org.uk>
124 Typo fixes rolled back in from bleedperl
125 ! t/at-cn.t t/at-tw.t
126 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
127 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
129 argv case nit for VMS by Craig
130 Message-ID: <3F2B02DE.10207@mac.com>
131 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
132 Encode test fixes for VMS by Peter Prymmer
133 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
134 ! lib/Encode/Alias.pm t/Aliases.t
135 koi-8 aliases bug detected and patched by sadahiro.
136 Further fix and test suite by dankogai
137 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
139 1.97 2003/07/08 21:52:14
140 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
141 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
142 $DEBUG replaced with DEBUG() so perl optimizes better,
143 by Rafael with further fixes by dankogai
144 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
145 ! lib/Encode/Aliases.pm
146 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
147 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
148 So new hash seeding introduced in bleedperl works.
149 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
150 ! lib/Encode/Guess.pm
151 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
152 automatic utf(8|16|32) guessing -- originally by Autrijus
153 Message-Id: <20030626162731.GA2077@not.autrijus.org>
155 Addressed the following;
156 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
157 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
159 1.96 2003/06/18 09:29:02
160 ! lib/Encode/JP/JP.pm t/guess.t
161 m/(...)/ in void context then $1 is considered a Bad Thing
162 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
164 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
167 More typecast from maintperl@19739
168 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
170 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
171 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
173 Now skips for EBCDIC platform.
174 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
176 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
177 have more than one C<.> in their filenames." -- perlport.
178 Message-Id: <3ED79E01.8050401@mac.com>
180 Found and fixed the back that -p,--perlqq does not work.
181 Induced by the change from Getopt::Std to Getopt::Long.
183 Addressed [cpan #2629] Wrong assumption in numeric comparison
184 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
185 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
186 lib/Encode/Encoding.pm t/perlio.t
187 ! API Change: ->new_sequence() => ->renew()
188 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
189 + Encode::XS and Encode::utf8 now supports ->renew()
190 + Encode::Encoding now documents this with examples
191 - Non-XS (en|de)code stripped out of Encode::Unicode
192 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
194 1.95 2003/05/21 08:41:11
196 Since bogus entries were found in iso-8859-6, all entries are
197 re-generated once again out of
198 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
199 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
200 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
201 + lib/Encode/Unicode/UTF7.pm
202 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
203 lib/Encode/Supported.pod
204 UTF-7 support is now added. With this Encode now has all transcoding
205 methods in Unicode::String.
207 1.94 2003/05/10 18:13:59
208 ! lib/Encode/MIME/Header.pm
209 A more sophisticated solution for double-encoding by dankogai
210 ! lib/Encode/MIME/Header.pm AUTHORS
211 Two bugs fixed by Bjoern Jacke
212 * "Double Encoding" was not possible
213 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
214 * encode("MIME-Q") had UTF-8 flag on
215 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
216 ! lib/Encode/MIME/Header.pm AUTHORS
217 Two occurances of "croak ()" fixed as "croak qq()".
218 Simon Cozens is added to AUTHORS as a result.
219 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
221 POD fixes that reflect enhancements by jhi
223 Two enhancements by jhi.
224 + Now uses Getopt::Long so it accepts long name options
225 (--from for -f, for example)
226 + New option: -r,--resolve
227 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
229 META.yml added upon request of Schwern
230 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
232 Enache Adrian removed upon request -- to live longer than Encode
233 and/or FreeBSD (toy-)?thread :)
234 Message-Id: <20030425015701.GA2069@ratsnest.hole>
236 "close STDOUT unless $^O eq 'freebsd';" once again relocated
237 to keep VMS happy in which case "$^O eq 'freebsd'" is required
238 to keep FreeBSD+thread happy. Sigh.
239 Message-Id: <3EA88ADC.3000300@mac.com>
241 1.93 2003/04/24 17:43:16
243 added "no warnings 'pack'" in for loop to keep bleedperl from
244 complaining "Character in 'C' format wrapped in pack".
246 More elegant perl core detection inspired by Ilya Zakharevich
247 (but further elaborated for general cases).
248 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
251 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
252 are less likely to be confused w/ ISO-8859-*
254 RT tests added (vendor encodings are exemplified)
255 -- that successfully found a flaw on iso-2022-kr before the patch.
256 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
257 decode("ISO-2022-KR") has been buggy but no one ever sited
258 that since no one seems to be using it. Bugs discovered by
260 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
261 ! lib/Encode/CN/HZ.pm t/perlio.t
262 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
263 so it adds test for HZ.
264 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
265 ! lib/Encode/Guess.pm
266 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
267 So long as the string contains \x{00}-\x{ff} it does not fail.
268 See perldoc for details.
269 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
271 1.92 2003/03/31 03:27:27
272 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
273 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
274 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
275 but glibc-BIG5-2.1.2.ucm does.
276 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
277 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
278 Maps regenerated again but this time based upon
279 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
280 (But where is THE DOCUMENT by MICROSOFT?)
281 ! t/enc_module.t AUTHORS
282 failure with threaded Perl on FreeBSD addressed.
283 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
284 Message-Id: <20030322230131.GA813@ratsnest.hole>
285 ! lib/Encode/Guess.pm
288 Change 18989: Make the :bytes conditional on PerlIO.
289 further Modified by Dan Kogai
290 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
292 Chnage 18966: another fix for failing test on windows ("use encoding"
293 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
294 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
296 Change 18970: Hopefully this works also in Win32, by jhi
297 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
298 Change 18965: fix CJKT.t failures on windows due to incorrect
300 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
302 1.91 2003/03/09 20:07:37
304 even more proofread by jhi.
305 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
308 +use lib qw(t ext/Encode/t ../ext/Encode/t);
309 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
311 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
312 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
315 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
317 1.90 2003/03/09 17:32:43
320 Inaba-san has added a patch for perl 5.8.1 or later that makes
321 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
322 that. POD is further revised.
323 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
324 ! encoding.pm t/enc_module.t
325 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
326 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
330 signed vs. unsigned issue discovered by Craig on OpenVM
331 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
332 ! encoding.pm AUTHORS
333 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
334 Because binmode() stacks layers instead of overwrite, you have to
335 ":raw :encoding()" in encoding.pm or your are in trouble when you
336 call encoding.pm multiple times. There are several workarounds
337 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
338 who was the first to address this problem was added to AUTHORS.
339 The test suites was added for this, which is a modified version
340 of SUGAWARA-san's scripts
341 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
343 1.88 2003/02/20 14:42:34
345 one signedness nit for Encode by jhi
346 <200302161933.h1GJX876018710@kosh.hut.fi>
348 VISCII map was incorrect; fixed by Sadahiro-san
349 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
350 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
351 You can't unlink files that are opened in cygwin but the last
352 file handle opened in t/enc_*.t left open. Patch submitted
353 by Yitzchak and he was added to AUTHORS.
354 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
356 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
357 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
359 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
360 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
362 1.87 2003/02/06 01:52:11
364 * Inaba "Sensei" Hirohito added (I thought I have done so a long
365 ago but apparently I did not).
366 * SUZUKI Norio added for verious and useful bug reports.
367 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
368 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
369 podchecked so all warnings are gone except for L<http://>.
370 ! encoding.pm t/enc_eucjp.t
371 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
372 * Filter option overhaul
374 ! Encode.pm Encode.xs encengine.c Encode/encode.h
375 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
376 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
377 without filter. podchecked by Dan Kogai.
378 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
379 ! lib/Encode/Alias.pm
380 decode('alias', $1) went wild because of local $_ in find_alias()
381 the evil local $_ is eradicated but that changes find_alias()
382 format for coderef aliasing. See Encode::Alias for details
383 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
385 1.86 2003/01/22 03:29:07
387 * Don't forget to canonize when you attempt an exact match!
388 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
389 * ${^ENCODING} exception is off for $] > 5.008
390 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
392 $] check commented out so it runs on 5.8.0
394 1.85 2003/01/21 22:19:14
396 ${^ENCODING} exception is now explicit rather than handled by regex.
397 + t/enc_eucjp.t t/enc_utf8.t
398 Test suite for the better "encoding" pragma support for bleedperl.
399 On 5.8.0, they will just be skipped.
401 1.84 2003/01/10 12:00:16
403 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
404 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
405 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
407 %_ fixes saves the resulting .so .05% smaller, by NC
408 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
410 Silence Encode on undef, by Andreas
411 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
412 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
414 s/regognised/recognised/ . British spelling left intact to pay
415 respect to two British Nicks :)
416 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
418 1.83 2002/11/18 17:28:49
419 ! Encode.xs lib/Encode/JIS7.pm
420 Even more patches from Inaba-san has been applied. With this
421 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
422 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
424 1.82 2002/11/14 23:06:12
426 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
427 later found in perls w/ -Dusemymalloc was fixed by NC.
428 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
430 1.81 2002/11/08 18:29:27
431 ! Encode.pm Encode.xs
432 Non-XS version of Encode::utf8 is back (with XS being default).
433 Encode::predefine_encodings(0) to turn off XS.
434 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
435 found that it was Test::More causing the problem, not Encode.
436 But I have already made it configurable so it may be useful in
438 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
440 The ingenious patch by Nicholas Clark that reduces shlib sizes by
441 50% with no penalty and backward compatibility preserved, is in.
442 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
444 1.80 2002/10/21 20:39:09
445 ! Encode.xs t/mime-header.t
446 Even more patches from NI-XS regarding Encode::utf8->decode().
447 And one more test to t/mime-header.t to prove it
448 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
450 1.79 2002/10/21 06:05:37
452 Further patches from NI-XS. Encode::utf8->decode() now checks the
453 value of utf8 flag of the argument. As a result, the fix to
454 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
455 no harm (even speedwise) I'll leave it unreverted.
456 ! ucm/cp949.ucm ucm/cp950.ucm
458 U+00AE REGISTERED SIGN
459 were missing as a result of 1.78. Discovered by Moriyama-san.
460 Moriyama-san has also developed a test script that compares
461 (en|de)coded results to the corresponding Win32 API result and
462 all cp9?? maps are now verified.
463 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
465 1.78 2002/10/20 15:44:00
466 ! lib/Encode/MIME/Header.pm
467 fixed so that it works with new Encode::utf8
468 ! Encode.pm Encode.xs
469 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
470 :encoding(UTF-8) to handle partial chars at end of buffers
472 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
473 ! lib/Encode/Supported.pod
474 More nitpickings applied.
477 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
478 tests were completely useless. To redeem that and get the peace of
479 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
480 ucm/*.ucm Since this script takes too long to finish (30 seconds on
481 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
482 but you can easily run that by either renaming it or:
484 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
485 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
486 Other CJKT cp9?? also updated according to the URI below;
487 http://www.microsoft.com/typography/unicode/cscp.htm
488 + bin/ucmsort MANIFEST
489 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
490 to proper order. intended for hardcore develpers only.
491 ! ucm/cp932.ucm JP/JP.pm AUTHORS
492 CP932 mapping which was based upon the mapping file at unicode.org
493 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
494 has also supplied the patch so he was added to AUTHORS.
495 ! lib/Encode/Supported.pod
496 ISO-8859-11 != TIS 620
497 == TIS 620 + \xA0 ( )
499 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
501 1.77 2002/10/06 03:27:02
503 * Modified to accomodate up and comming patch by Inaba-san that
504 will fix tr/// needing eval qq{}
505 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
507 * pod fixes/enhancements to reflect the changes above
508 ! lib/Encode/Alias.pm
509 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
510 Message-Id: <20021001015648.GB18710@not.autrijus.org>
512 1.76 2002/08/25 15:09:51
514 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
515 but naturally identical to previous version -- dankogai
517 Codepoint fixes -- autrijus
518 Message-Id: <20020805040236.GC5220@not.autrijus.org>
520 copied everything under perl-5.8.0/ext/Encode to make sure Encode
521 is in sync w/ perl core
523 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
524 Now that binmode(FH) does implicit ":bytes" revisit
525 the failing tests. The worrisome one is the Digest::MD5
526 test-- how will it fare in CRLF lands now?
528 From: Radu Greab <radu@netsoft.ro>
529 Date: Mon, 10 Jun 2002 00:40:34 +0300
530 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
531 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
532 ones which I cannot figure out.
533 ! lib/Encode/Alias.pm
534 Subject: [Encode PATCH] spurious warning
535 From: Nicholas Clark <nick@unfortu.net>
536 Date: Sun, 2 Jun 2002 20:26:22 +0100
537 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
539 1.75 2002/06/01 18:07:49
540 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
541 glibc compliance cited by Autrijus.
542 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
543 ! bin/enc2xs bin/piconv
544 Subject: Re: forewarning: usedevel and versiononly
545 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
547 1.74 2002/05/28 18:33:15
548 + ucm/null.ucm ucm/ctrl.ucm
549 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
550 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
551 Subject: Re: unicode -> &# notation
552 Message-ID: <20020518193704.GB40272@not.autrijus.org>
554 1.73 2002/05/28 17:26:18
555 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
556 Chris Nandor has fixed Encode so that it works w/ MacPerl --
557 at least w/ PPC (68k need static linking which does not work due to
558 64k limit). pudge is added to AUTHORS (I'm surprised he was not
559 there in the list). Encode/Makefile_PL.e2x was additionally fixed
560 by dankogai to reflect changes in other Makefile.PL
561 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
563 Subject: Change 16746: -Mutf8 cleanup.
564 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
566 1.72 2002/05/20 15:49:56
568 Subject: [PATCH] Encode should be in perl-core library path
569 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
570 Message-Id: <20020520161201.A11019@alpha.hut.fi>
571 ! lib/Encode/MIME/Header.pm
572 Subject: [PATCH] Encode::MIME::Header
573 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
574 ! Encode/Makefile_PL.e2x
575 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
576 Message-Id: <20020519201031.GA1603@not.autrijus.org>
577 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
578 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
580 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
581 Subject: [PATCH] good day for WinCE port of perl.
582 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
585 Even more linting by Robin via @16532
587 Even more typecast by Sarathy in @16460
589 1.71 2002/05/07 16:22:42
591 even more typecasts by Robin
592 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
594 A very strange bug that was causing a bugus ucm -> C table
595 generation that was revealed by a UCM file that Andreas was
596 working. This is the king of wierdest bug I've encountered
597 in the course of Encode maintenance.
598 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
600 1.70 2002/05/06 10:26:48
602 Made more 'module-safe' with conjunction w/ 'no encoding'.
603 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
604 ! lib/Encode/Encoding.pm
605 'require Encode' because ->Define uses Encode::define_encoding();
606 problem and solution addressed by Miyagawa-kun
607 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
609 Cuts the frill to make djgpp happier, as suggested by Laszlo
610 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
612 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
613 Message-Id: <m3bsbug48n.fsf@anima.de>
615 1.69 2002/05/04 16:41:18
616 ! lib/Encode/MIME/Header
617 Floating-point coerced for UNICOS (in integer arithmetics it folds
618 line one character too early). Verification by Mark is pending.
619 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
621 more doc patch from Elizabeth
622 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
623 ! Encode/Makefile_PL.e2x
624 More platform-independent patch from Benjamin
625 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
626 ! lib/Encode/Guess AUTHORS
627 split regex fix by Graham Barr. Adds him to AUTHORS.
628 Message-Id: <20020504085419.E95940@valueclick.com>
629 ! Encode/Makefile_PL.e2x
630 enc2xs script discovery made smarter and more sensible, first cited
631 by Miyagawa-kun and further suggestions by Rafael and Andreas
632 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
633 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
635 1.68 2002/05/03 12:20:13
636 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
637 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
638 and Supported.pod modified to reflect the change. Elizabeth added
639 to Authors. And H.M. is also added for forwarding her patch among
640 other contributions (I was rather surprised to find his name was not
642 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
644 1.67 2002/05/02 07:33:09
646 Error message now consistent w/ perlqq (\N{U+} -> \x{})
647 done in perl@16308 but Philip linted me further. Now the error
648 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
649 ! lib/Encode/Guess.pm
650 Sanity check for happier -w by Autrijus
652 1.66 2002/05/01 05:41:06
653 ! Encode.xs t/fallback.t
654 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
655 while fallback is in effect. This even came with a welcome side-effect
656 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
657 also modified to test this.
658 And of course, the corresponding varialbles to UV[Xx]f are appropriately
659 cast. This should've concluded NI-XS homework.
661 encode(undef) does warn again! Repented upon suggestion by NI-XS.
662 Document for unless vs. '' added
663 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
665 1.65 2002/04/30 16:13:37
667 encode(undef) no longer warns for C<Use of uninitialized value in
668 subroutine entry>. Suggested by Paul.
669 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
670 ! lib/Encode/Supported.pod
671 Encode::MIME::Header and Encode::Guess mentioned
672 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
673 ! lib/Encode/Guess.pm
674 POD fix by Miyagawa-kun
675 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
677 1.64 2002/04/29 06:54:06
679 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
680 encocde("euc-jisx0213") are still DIFFERENT.
681 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
683 A few white spaces corrected by NI-XS via PerlIO integration to
685 Subject: Change 16247: Integrate perlio;
687 Document fixes by Andreas
688 Message-Id: <m3k7qsf1we.fsf@anima.de>
690 1.63 2002/04/27 18:59:50
691 ! lib/Encode/Encoding.pm
692 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
693 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
694 Make use of the Encode::Encoding base class!
695 And other cleanups in Encode.xs upon NI-XS suggestions
696 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
698 1.62 2002/04/27 11:17:39
700 encodings() now just check %ExtModule instead of eval{require}
701 all of them for ":all" to conserve more memory.
703 more "%x" -> "%" UVxf stuff.
705 s/=over2/=over 2/g # oops.
707 1.61 2002/04/26 03:02:04
709 Now does decent tests besides use_ok()
710 ! lib/Encode/Guess.pm t/guess.t
711 UI streamlined, document added
713 various signed/unsigned mismatch nits (#16173)
714 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
716 POD: utf8-flag-related caveats added. A few sections completely
720 Thou shalt not assume %d works, either!
721 Robin Baker added to AUTHORS for this
722 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
724 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
726 1.60 2002/04/24 20:06:52
728 "Thou shalt not assume %x works." -- jhi
729 Message-Id: <20020424210618.E24347@alpha.hut.fi>
730 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
731 low-memory build machines happy, now *.c is created for each *.ucm
732 (no table aggregation). You can still override this by setting
733 $ENV{AGGREGATE_TABLES}.
734 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
735 + lib/Encode/Guess.pm
736 + lib/Encode/JP/JIS7.pm
737 Encoding-autodetect (mainly for Japanese encoding) added. In a
738 course of development, JIS7.pm was improved.
739 + lib/Encode/HTML/Header.pm
740 + lib/Encode/Config.pm
741 MIME B/Q Header Encoding Added!
742 ! Encode.pm Encode.xs t/fallback.t
743 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
744 Message-Id: <20020424130709.GA14211@tanglefoot>
746 1.59 $ 2002/04/22 23:54:22
747 ! Encode.pm Encode.xs
748 needs_lines() and perlio_ok() are added to Internal encodings such
749 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
751 1.58 2002/04/22 23:54:22
753 s/MacChineseSimp/MacChineseTrad/ # ... oops.
758 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
759 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
760 t/(JP|KR).t are aggregated to t/CJKT.t
761 test data are all remade via bin/ucm2text.
762 And .... They are no longer skipped for -Uuseperlio !
764 1.57 2002/04/22 20:27:30
765 ! t/JP.t t/KR.t t/perlio.t
766 unless (find PerlIO::Layer 'perlio') ... line is back again.
767 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
768 need PerlIO. Sigh....
769 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
770 ->perlio_ok now does eval{ require PerlIO::encoding } there so
771 it correctly returns 1 when PerlIO::encoding is yet loaded.
773 perl-current patch #16072 reflected
775 1.56 2002/04/22 09:48:07
776 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
777 New PerlIO::encoding 0.04 compliance met
779 1.55 2002/04/22 03:43:05
780 ! Encode.pm Encode.xs Unicode/Unicode.pm
781 needs_lines() defined so Encode::Encoding is no longer needed
784 1.54 2002/04/22 02:50:01
785 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
786 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
787 now perlio_ok is true by default if PerlIO::encoding->VERSION is
788 0.03 or larger. POD in Encode::Encoding revised to reflect this.
789 Encode::XS and Encode::Unicode now has perlio_ok() method.
790 ! lib/Encode/Supported.pod
791 s/UP-UX/HP-UX/ by jhi
792 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
793 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
794 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
795 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
796 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
797 ! lib/Encode/Supported.pod
798 Huge document fixes by Philip.
801 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
802 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
804 binmode() after "<:encoding" to make Win32 happy, by Mattia.
805 Mattia added to AUTHORS file
806 Message-Id: <3CC3150F.5798.22A05AE@localhost>
808 1.52 2002/04/20 23:43:47
810 TODO: is now SKIP:, as NI-XS requested. Also adds more
811 eraborate failure analysis added.
813 A note on how to make sure of round-trip safety added to POD
814 section (so Autrijus is happier)
815 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
816 big5-(eten|hkscs) is round-trip safe again!
817 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
819 Typo fixes by Andreas
820 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
821 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
822 PerIO coodination patches from NI-XS.
823 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
825 1.51 2002/04/20 09:58:23
827 Updated test suite by Autrijis so "make test" is happy again
828 Message-Id: <20020420082104.GA25037@not.autrijus.org>
830 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
832 TW/TW.pm TW/Makefile.PL
833 Updates by Autrijus. 'big5' is no longer a canonical but an
834 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
835 Message-Id: <20020419195346.GA19597@not.autrijus.org>
837 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
838 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
840 PerlIO detection a little bit smarter; no longer uses eval qq{}
843 1.50 2002/04/19 06:13:02
844 ! ! Encode.pm Encode.xs Encode/encoding.h
846 New Fallback API imlemented and documented. See "perldoc Encode"
848 ! lib/Encode/JP/JIS7.pm Encode.pm
849 + lib/Encode/PerlIO.pod t/perlio.t
850 API compliance met. However, it still does not work unless perlio
851 implements line buffer. See BUGS section in perldoc Encode::PerlIO
852 As a sensible workaround, perlio_ok() added to Encode.
854 ! lib/Encode/Supported.pod
856 Message-Id: <20020418174647.J8466@alpha.hut.fi>
858 Doc fixes from Autrijus
859 Message-Id: <20020418144131.GA10987@not.autrijus.org>
861 perlqq mode documented
863 + t/jisx0201.euc t/jisx0201.ref
864 ! t/jisx0208.euc t/jisx0208.ref
865 t/JP.t tests more rigorously and with other encodings
866 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
867 in jis0208 (used to contain jisx0201 part).
868 ! Encode/Makefile_PL.e2x
869 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
870 enc2xs and encode.h rather than hard-coded. This allows the resulting
871 module fully CPANizable.
872 ! encoding.pm t/JP.t t/KR.t
873 PerlIO detection simplified (checks %INC instead of eval{})
874 ! Encode.xs Encode/encode.h
875 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
876 - lib/Encode/Unicode.pm
877 (en|de)code_xs relocated to where it belongs. Source reindented
880 Additional (U8 *) cast added as suggested by jhi
881 Message-Id: <20020417165916.A28599@alpha.hut.fi>
883 1.42 Date: 2002/04/17
885 no-op module; Thought of adding a pod there but enc2xs has
890 correct mechanism to detect Perlio::encoding layar installed.
892 PerlIO Layer detached.
894 1.41 2002/04/16 23:35:00
896 binmode(STDIN|STDOUT ...) done iff PerlIO is available
898 Cleaned up PerlIO skip conditions to prepare for the upcoming
899 Encode - PerlIO forking.
901 exported functions are now prototyped.
902 ! lib/Encode/CN/HZ.pm
905 fallback implemented # was /* FIXME */
906 affected programs revised to fit (only HZ was using the try-catch
907 approach which needed to be fixed for API-compliance).
909 ! Encode/KR/2022_KR.pm
911 can find =head1 NAME now, jhi
912 Message-Id: <20020416083059.V30639@alpha.hut.fi>
916 now complies with less warnings with the pickest compilers.
917 Suggested by Craig, fixed by Dan.
918 ! Encode/Makefile_PL.e2x
920 A bug that fails to find *.e2x in certain conditions fixed
922 1.40 2002/04/14 22:27:14
923 + Encode/ConfigLocal_PM.e2x
924 ! lib/Encode/Config.pm
926 "enc2xs -C" now generates/updates Encode::ConfigLocal.
927 ConfigLocal_PM.e2x is a skelton thereof.
928 ! lib/Encode/Config.pm
930 "use Encode::CN::HZ;" was missing.
933 More rigorous tests added to test XS, especially on memory allocation.
935 ! lib/Encode/Unicode.pm
936 NI-S implemented an XS version -- merged
937 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
940 Source filter option added. With this option on, you can write
941 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
942 encodings. t/jperl.t enhanced to test this feature.
944 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
945 apply his suggestion, this degree of nitting is enough to add him
947 Message-Id: <3CB93223.291E5E2E@earthlink.net>
949 + lib/Encode/JP/JIS7.pm
950 - lib/Encode/JP/JIS.pm
951 - lib/Encode/JP/2022_JP.pm
952 - lib/Encode/JP/2022_JP1.pm
953 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
954 JIS7.pm for better maintainability and performance
956 Added caveat for non-ascii identifiers.
958 fixes by jhi, the original author of this pragramtic module.
959 Message-Id: <20020413231527.V1826@alpha.hut.fi>
961 1.34 2002/04/12 20:23:05 (Unreleased)
964 EBCDIC fixes addressed by jhi.
965 Message-Id: <20020412161844.D9383@alpha.hut.fi>
966 ! lib/Encode/Encoder.pm
967 POD fix by Miyagawa-kun
968 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
970 1.33 2002/04/10 22:28:40
972 Philip's mail address corrected.
975 ! lib/Encode/Encoder.pm
976 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
977 This adds Miyagawa-kun to AUTHORS.
978 * encoding() no longer exported by default but on demand
979 * t/Encoder.t updated to test all these
980 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
981 ! lib/Encode/Unicode.pm
982 ! lib/Encode/Supported.pm
983 Further doc fixes by Anton
985 1.32 2002/04/09 20:06:15
988 - ucm/macDevanaga.ucm Unicode Character Map
989 - ucm/macGujarati.ucm Unicode Character Map
990 - ucm/macGurmukhi.ucm Unicode Character Map
991 A utility to check integrity of .ucm files. t/bogus.ucm is a
992 ucm that is deliberately bogus. unused Indic mappings are removed
995 resolve_alias() added as suggested by jhi. Same as
996 find_encoding("alias")->name. For convenience. This one is
997 defined in Encode.pm instead of Alias.pm.
998 Message-Id: <20020409215846.H17022@alpha.hut.fi>
1000 Memory Allocate but detected during the devel of ucmlint -- fixed.
1001 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
1002 ! lib/Encode/Unicode.pm
1003 valid_ucs2(0) is false but must be true.
1004 3 patches from NI-S as follows. This also has fixed the incident
1006 ! lib/Encode/Alias.pm
1007 find_alias() recursion prevention
1009 Checks for the patch above
1010 ! t/Encode/Unicode.pm
1011 An extra "F" that causes valid_ucs2() return a bogus value fixed
1012 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
1013 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
1014 2 Small Patches from jhi as follows:
1016 Encode->encodings() lists in case-insensitve order (as it was)
1018 -l option prints avaiable encodings to STDOUT instead of STDERR
1019 ! lib/Encode/Aliases.pm
1020 s/defintion/definition/
1021 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
1023 ! lib/Encode/Supported.pod
1024 ! lib/Encode/Unicode.pm
1025 POD revise by Philip Newton. This adds Philip to AUTHORS list.
1026 Thank you for the exact quote of Douglas Adams :)
1027 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
1029 1.31 2002/04/08 18:08:07
1030 ! lib/Encode/Encoder.pm
1032 Encode::Encoder, once just a placeholder of an idea, is now much more
1033 practical. See t/Encode.t to find how practical it can be.
1034 + lib/Encode/Config.pm
1036 my false laziness at Encode.pm is fixed. Now %ExtModules are set
1037 in Encode::Config and they are all literally, not programatically
1038 set. My false laziness was resulting many encodings missing from
1040 ! lib/Encode/Unicode.pm
1042 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
1043 so that it does not rely Encode::Unicode for BOM values
1044 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
1046 1.30 2002/04/08 02:34:51
1047 + lib/Encode/Encoder.pm
1048 Object Oriented Encoder. I reckon something like this is in need.
1051 ! lib/Encode/Supported.pod
1052 * autoloading bug that prevented upper-case canonicals such as UTF-16
1053 is fixed. Now even UTF/UCS are autoloaded!
1054 * encodings() is now more intuitive.
1055 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
1057 * Obligatory fixes to the POD.
1058 ! lib/Encode/Supported.pod
1059 Patch from Anton applied.
1060 Message-Id: <66641479.20020408033300@motor.ru>
1062 ! lib/Encode/Unicode.pm
1063 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
1065 1.28 2002/04/07 18:58:42
1069 Just a MANIFEST for those missing files.
1071 1.26 Date: 2002/04/07 15:22:04
1074 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
1075 And t/Aliases.t fixed to make sure they all exist.
1077 1.25 2002/04/07 15:01:25 (Unreleased)
1079 ! lib/Encode/Unicode.pm
1082 - lib/Encode/UTF_EBCDIC.pm
1083 - lib/Encode/Internal.pm
1084 - lib/Encode/utf8.pm
1085 Integrated into Encode.pm as closures. That way "one package, one file"
1086 rule is preserved yet less files to require.
1088 commented out binmode(STDERR ...
1092 ! EBCDIC/Makefile.PL
1095 ! Symbol/Makefile.PL
1097 ! Encode/Makefile_PL.e2x
1098 Schwarn's MM-compliance patch merged
1099 Message-Id: <20020406082609.GA28758@blackrider>
1101 ! lib/Encode/Unicode.pm
1102 + lib/Encode/UTF_EBCDIC.pm
1104 - lib/Encode/10646_1.pm
1105 - lib/Encode/ucs2_le.pm
1106 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1107 per-module based (en|de)code, I saved a number of .pm by
1108 reorganizing it as per-object base (Well, this is what Encode::XS
1109 does under the hood). See Encode::Unicode for details.
1110 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1111 This module is used only on EBCDIC environments.
1113 1.21 2002/04/05 14:46:34 (Not Released)
1119 Are back to make Perl/Tk happy Smile, NI-S.
1121 ! lib/Encode/Alias.pm
1122 ! lib/Encode/Supported.pm
1123 ! lib/Encode/10646_1.pm
1124 ! lib/Encode/ucs2_le.pm
1125 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1126 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1127 updated to reflect changes.
1128 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1129 ! lib/Encode/Alias.pm
1130 ! lib/Encode/Supported.pm
1131 Anton's revision commited. Added Dan's own fixes as well.
1132 Message-Id: <159103166906.20020405161134@motor.ru>
1133 ! lib/Encode/Alias.pm
1135 < qr/^UCS2-le$/i => '"UCS-2"', );
1137 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1138 Sigh. Thank you, Anton.
1139 Message-Id: <14567692196.20020405062020@motor.ru>
1140 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1142 1.20 2002/04/04 19:50:52
1144 the last minute addtion. Just give it a try. Docs remains to be done.
1145 Not installed by default.
1146 ! lib/Encode/Supported.pod
1149 ! lib/Encode/Alias.pm
1150 ! lib/Encode/utf8.pm
1151 ! lib/Encode/10464_1.pm
1152 ! lib/Encode/ucs2_le.pm
1153 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1154 unchanged but UTF-16BE is added as an alias. Implicit aliases
1155 move to Encode::Alias so init_alias() works more as expected.
1156 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1157 Though pedantically wrong, This should make perl mongers happier.
1158 t/Alias.t is enhanced to test all these.
1159 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1161 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1162 of the codepoints are common. CJKT left as is because the saving is
1166 ! EBCDIC/Makefile.PL
1168 ! Encode/Makefile_PL.e2x
1172 ! Symbol/Makefile.PL
1176 All occurance of _def.h replaced with .exh so djgpp works happily
1177 ever after! To credit this amazing discovery, Laszlo is now in
1179 Message-Id: <20020403181424.GA8778@freemail.hu>
1180 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1183 ! Encode/Makefile_PL.skel
1185 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1186 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1188 Two more tests by added jhi
1189 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1192 The showstopper fixed -- Memory reallocation bug was causing
1193 Encode::XS to fall into infinite loop on certain conditions.
1194 t/grow.t tests that.
1195 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1200 ! lib/Encode/Supported.pod
1201 Vendor encodings rebuilt out of original map files at unicode.org.
1202 Indic languages such as MacDevanagali remain unspported do to the
1203 shortcoming of encengine capabilities (they need algorithmical
1204 conversion and I have no knowledge on that!). Pods fixed for added
1206 Oh, macJapan.ucm renamed to macJapanese.ucm.
1207 macROMnn is macRomanian and macRUMnn is macRumanian.
1208 txt2ucm is a crude script that is used to convert them.
1210 Unicode Compound Characters (used extensively on Mac) supported
1212 Typo fixes and improvements by jhi
1213 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1215 1.11 2002/03/31 22:12:13
1219 Missing files from the MANIFEST fixed.
1220 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1221 Version incremented just to make CPAN happy.
1223 1.10 2002/03/31 21:32:42
1226 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1227 if you want. This should make Autrijus happy. Also, piconv
1228 is added to default install.
1231 Here-documented files that enc2xs generates are now exported
1232 to *.e2x. Much cleaner and easier to debug.
1234 encoding enhances so you can make it act more like such
1235 (now prehistoric ) "localized" variations of perl like Jperl.
1237 Further test for encoding.pm. Written in euc-jp
1240 Taken over form jhi.
1241 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1246 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1253 ! Encode/macIceland.ucm
1254 ! lib/Encode/Alias.pm
1255 ! lib/Encode/Supported.pod
1256 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1257 Message-Id: <m3lmcavhjt.fsf@anima.de>
1259 1.01 2002/03/29 20:59:39
1262 s/USE_SCRIPTS/MORE_SCRIPTS/
1264 installs enc2xs by default for external Encode:: modules in CPAN,
1265 such as Encode::HanExtra
1267 More sensible perl core detection via $ENV{PERL_CORE}
1269 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1271 Perl core ditection via $^X =~ m/\bminiperl$/o
1272 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1274 1.00 Wed Mar 29 2002
1276 The version of all files is updated to 1.00 via "ci -f -l1.00",
1277 commemorating version 1.00. All files, including *.ucm are now
1278 under version control.
1281 encode.h moved to Encode/ so it will be installed for the later
1284 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1289 compile renamed to enc2xs.
1290 Affected Makefle.PL updated
1292 "Punt it. HanExtra can take care of that later." -- Autrijus
1293 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1296 ! Encode/ksc5601.ucm
1297 ! lib/Encode/CJKConstants.pm
1298 ! lib/Encode/KR/2022_KR.pm
1299 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1300 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1304 bin/ added for example scripts. They are not installed by default.
1305 to install them, "perl Makefile.PL USE_SCRIPTS".
1306 piconv is iconv reinvented in perl. in addition to all features
1307 of iconv, it also adds perlish features. See L<piconv/1> for more
1309 ! lib/Encode/Alias.pm
1310 qr/^ replaced with qr/\b so it directly matches locale names
1311 such as en_US.US-ASCII
1314 Patch by MJD to fix the following problem applied.
1315 Subject: [PATCH 5.7.3 Encode]
1316 Aliases.t not properly skipped when Encode extension not built
1317 Message-Id: <20020328091850.18677.qmail@plover.com>
1318 ! lib/Encode/KR/2022_KR.pm
1319 ! lib/Encode/CJKConstants.pm
1320 Another patch from Jungshik to make iso-2022-kr actually work
1321 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1322 ! Encode/Encode/euc-kr.ucm
1323 + Encode/Encode/johab.ucm
1324 ! Encode/Encode/ksc5601.ucm
1326 ! Encode/KR/Makefile.PL
1327 ! Encode/lib/Encode/Alias.pm
1329 Johab support and complete revision of Korean Encoding by Jungshik
1330 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1332 Revised to make up with now-dropped Encode::Details.
1333 - lib/Encode/Details.pod
1334 Dropped. Besides being obsolete, the topics are now covered in
1335 respective pods now.
1340 Korean aliases fixed thanks to Jungshik Shin
1341 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1342 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1344 =head1 NAME added to all modules to make buildtoc happy
1345 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1346 - lib/Encode/CJKguide.pod
1347 Too controversial and dropped from the dist. Will be available
1348 separately on the web.
1350 RCS tags added so table debugging gets easier (should that be
1351 needed! I hope they all stay 1.00!)
1352 + lib/Encode/CJKguide.pod
1353 A detailed guide to mainly, but not limited to, CJK multibyte
1356 + Encode/hp-roman8.ucm
1358 ! Encode/Supported.pod
1359 All occurance of "roman8" replaced with "hp-roman8" to avoid
1361 ! Encode/Supported.pod
1364 Mac Encodings now comply the Inside Macintosh
1366 Test for '-raw' conventions added.
1368 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1369 ! Encode/gb12345.ucm
1371 ! Encode/ksc5601.ucm
1372 "-raw" appended to canonical names.
1373 File mames stay unchanged thanks to UCM format.
1374 ! lib/Encode/CN/HZ.pm
1375 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1376 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1378 0.99 Tue Mar 26 2002
1379 - lib/Encode/JP/Const.pm
1380 + lib/Encode/CJKConstants.pm
1381 + lib/Encode/CN/2022_CN.pm
1382 + lib/Encode/KR/2022_KR.pm
1391 * Support for ISO-2022-KR and ISO-2022-CN added.
1393 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1394 * ucm2table autogenerates character table out of UCM files.
1397 - lib/Encode/Supports.pod
1398 + lib/Encode/Supported.pod
1399 Names reverted due to popular demand.
1400 8.3 rule applies only when there is a conflict.
1401 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1406 - lib/Encode/Format/Enc.pod
1408 * Character tables is now 100% ucm.
1409 * All files under Encode/ is now 8.3-compliant
1410 * some of missing encodings added (i.e. gsm0338 and nextstep)
1411 * Vendor mappings aggregated with appropriate national std in
1412 Makefile.PL, resulting smaller *.so especially for CJK.
1413 Following is result on Dan's FreeBSD box.
1415 ---------------------------------------------------------------
1416 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1417 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1418 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1419 blib/arch/auto/Encode/Encode.so 27,791 27,791
1420 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1421 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1422 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1423 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1424 ---------------------------------------------------------------
1425 Total 5,375,297 6,343,819
1427 * As a result of ucm-transition, Encode::Tcl dropped because
1428 Encode::Tcl demands *.enc.
1429 Encode::Tcl will be supplied in a separate tarball with *.enc.
1430 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1435 -lib/Encode/Supported.pod
1436 +lib/Encode/Supports.pod
1437 -lib/Encode/iso10646_1.pm
1438 +lib/Encode/10646_1.pm
1439 -lib/Encode/EncFormat.pod
1440 +lib/Encode/Format/Enc.pod
1441 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1442 - lib/Encode/JP/Constants.pm
1443 + lib/Encode/JP/Consts.pm
1444 ! lib/Encode/JP/JIS.pm
1445 ! lib/Encode/JP/H2Z.pm
1446 Version nit problem and 8.3 rule fix.
1447 > Package namespace installed latest in CPAN file
1448 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1449 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1450 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1451 accordingly. In addition, legacy "use vars qw()..." are replaced with
1453 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1454 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1456 - lib/Encode/JP/ISO_2022_JP.pm
1457 - lib/Encode/JP/ISO_2022_JP_1.pm
1458 + lib/Encode/JP/2022_JP.pm
1459 + lib/Encode/JP/2022_JP1.pm
1461 8.3 naming conflict for vanilla fat addressed by jhi
1462 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1465 Typecast fix addressed by jhi
1466 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1468 0.98 Mon Mar 25 2002
1469 ! lib/Encode/Supported.pod
1471 + lib/Encode/JP/ISO_2022_JP_1.pm
1472 ! lib/Encode/JP/ISO_2022_JP.pm
1473 ! lib/Encode/JP/JIS.pm
1475 Now Encode::JP is more strict on the difference between ISO-2022-JP
1476 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1477 makes Anton happier :) FYI the previous version implements
1478 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1479 ! lib/Encode/Supported.pod
1482 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1483 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1488 ! lib/Encode/Suppoted.pod
1489 pod fixes to replace F<http://...> to L<http://...>,
1490 as suggested by Autrijius in:
1491 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1492 ! lib/Encode/Suppoted.pod
1493 fixes and enhancements by Anton
1494 Message-Id: <10632060120.20020324103753@motor.ru>
1495 ! lib/Encode/Alias.pm
1496 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1497 added. Suggested by Anton then deobfuscated by Autrijius
1498 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1500 Further fix by Nicholas Clark
1501 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1502 - lib/EncodeFormat.pod
1503 + lib/Encode/EncFormat.pod
1505 File renamed as suggested by Autrijius
1507 ! lib/Encode/Details.pod
1508 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1509 ! Encode.pm Sun Mar 24 13:43:47 2002
1510 pod fixes by Autrijius.
1511 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1512 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1514 ! lib/Encode/Alias.pm
1516 now more EBCDIC conscious;
1517 %ExtModules on EBCDIC system excludes CJK so that you don't
1518 have to worry about the matched alias resulting cloaking.
1519 t/Alias.t also revised to reflect changes. Verified by jhi
1520 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1522 0.97 Sun Mar 24 2002
1526 EBCDIC detection mechanism installed as in JP/JP.pm
1527 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1530 ! EBCDIC/Makefile.PL
1533 ! Symbol/Makefile.PL
1535 Now all table files used by compile are postfixed '_t' to avoid
1536 namespace collisions in case insensitive file systems once for all!
1538 Message-ID: <58290227735.20020323195659@familiehaase.de>
1540 Since the Encode::JP is unsupported under EBCDIC we
1541 cannot run this test (aliases as such should work fine) -- jhi
1542 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1544 duplicate occurance of ascii.ucm and 8859-1.ucm
1545 causes MacOS X dlyd to cloak
1551 < chdir 't' if -d 't';
1553 > if (! -d 'blib' and -d 't'){ chdir 't' };
1554 When you are "make test"-ing on Encode/ directory, you must not
1555 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1556 remain unchanced. Also the situation detection was made simpler
1557 in t/JP.t, which was originally;
1558 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1560 "Use of uninitialized value in string eq at Encode.pm line 96."
1561 ! Symbol/Makefile.PL
1562 ! EBCDIC/Makefile.PL
1564 -- Problem on case insensitive file systems
1565 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1566 Message-ID: <88254111953.20020323095503@familiehaase.de>
1569 "So I think it's a bug in gcc, not perl. But it still needs to be
1571 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1572 Message-Id: <20020323170509.C96475@plum.flirble.org>
1574 0.96 Sat Mar 23 2002
1576 ! lib/Encode/Encoding.pm
1577 ! lib/Encode/Alias.pm
1578 ! lib/Encode/Supported.pod
1580 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1581 Message-ID: <20020322073908.GB10539@blackrider>
1584 "...I think we should include ISO 8859-1 as well." -- NI-S
1585 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1590 ! lib/Encode/Alias.pm
1591 alias definitions relocated to Encode::Alias so module autoloading
1592 works for aliases also.
1594 encodings() now accepts args to check ExtModules.
1598 + EBCDIC/Makefile.PL
1599 + Symbol/Makefile.PL
1603 Latin and single byte encodings are reorganized so they are
1604 demand-loaded like Encode::XX. Now only ascii is compiled into
1606 ! lib/Encode/Alias.pm
1607 for my $k (keys %hash){ delete $hash{$k}; }
1608 is depreciated; fixed.
1610 0.95 Fri Mar 22 2002
1611 In this update, pod rewrites and alias fixes are the main issues
1612 + lib/Encode/Supported.pod
1613 Describes supported encodings
1615 streamlined compiled-in encodings.
1616 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1618 + Encode/ibm-125?.ucm
1619 Added from icu distibution with any occurance of
1620 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1621 some respect to icu staff, however.
1622 + lib/Encode/Alias.pm
1624 Alias difinitions in Encode.pm relocated.
1627 packWARN patch from Paul Marquess via jhi
1628 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1629 Paul added to AUTHORS as a result.
1630 ! t/CJKalias.t -> t/Aliases.t
1631 Renamed. Checks even more aliases and alias overloading
1634 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1635 gbk => cp936 relocated to CN.pm
1637 Test::More with plans (by jhi)
1639 0.94 Thu Mar 21 2002
1640 + lib/Encode/Description.pod
1641 ! lib/Encode/Encoding.pm
1642 Now the pod in Encode.pm is abridged as programming references.
1643 lib/Encode/Description.pod contains the original, detailed description
1644 and Encode::Encoding explains how to write your own module to
1645 add new encodings. So far, lib/Encode/Description.pod contains
1646 the whole pod once in Encode.pm. This is intentional.
1648 Pod revisions by Anton Tagunov
1649 Message-Id: <517178431.20020320174824@motor.ru>
1651 all occrance of Encode::Tcl::Extended removed including pod
1653 test now checks $encoding->name only; $encoding->{name} are
1654 no longer check to find the canonical name.
1655 ! lib/Encode/JP/JIS.pm
1656 ! lib/Encode/JP/ISO_2022_JP.pm
1657 ->name() added to be more compliant with API
1663 Patch by Autrijus to add aliases to TW and fixes to POD
1664 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1666 SADAHIRO Tomoyuki added as should. My apologies.
1668 0.93 Wed Mar 20 2002
1669 * First release to be uploaded to CPAN. For prehistoric changes,
1670 please see Changes file of perl distibution as well as
1671 perl-unicode@perl.org archive, available at:
1672 http://archive.develooper.com/perl-unicode@perl.org/
1674 Changes Since 0.92 includes;
1679 + Mention to perl-unicode@perl.org added
1681 + Encoding aliases added so you can feed locale names
1682 and MIME Charset="" directly.
1683 - Mention to JISX0212 removed because it's fixed
1686 + Encoding aliases added. Note TW is left untouched because
1687 euc-tw is not implemented in TW but in Encode::HanExtra.
1688 Autrijus, you may fix Encode::HanExtra.
1690 + to test encode aliases added