1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.98 2003/08/20 11:15:31 dankogai Exp dankogai $
5 $Revision: 1.98 $ $Date: 2003/08/20 11:15:31 $
6 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
7 Dave Evans has found and corrected a bug in Encode::MIME::Header.
8 Test suite added by Dan Kogai.
9 Message-Id: <3F43440B.7060606@rudolf.org.uk>
11 Typo fixes rolled back in from bleedperl
13 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
14 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
16 argv case nit for VMS by Craig
17 Message-ID: <3F2B02DE.10207@mac.com>
18 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
19 Encode test fixes for VMS by Peter Prymmer
20 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
21 ! lib/Encode/Alias.pm t/Aliases.t
22 koi-8 aliases bug detected and patched by sadahiro.
23 Further fix and test suite by dankogai
24 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
26 1.97 2003/07/08 21:52:14
27 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
28 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
29 $DEBUG replaced with DEBUG() so perl optimizes better,
30 by Rafael with further fixes by dankogai
31 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
32 ! lib/Encode/Aliases.pm
33 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
34 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
35 So new hash seeding introduced in bleedperl works.
36 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
38 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
39 automatic utf(8|16|32) guessing -- originally by Autrijus
40 Message-Id: <20030626162731.GA2077@not.autrijus.org>
42 Addressed the following;
43 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
44 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
46 1.96 2003/06/18 09:29:02
47 ! lib/Encode/JP/JP.pm t/guess.t
48 m/(...)/ in void context then $1 is considered a Bad Thing
49 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
51 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
54 More typecast from maintperl@19739
55 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
57 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
58 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
60 Now skips for EBCDIC platform.
61 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
63 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
64 have more than one C<.> in their filenames." -- perlport.
65 Message-Id: <3ED79E01.8050401@mac.com>
67 Found and fixed the back that -p,--perlqq does not work.
68 Induced by the change from Getopt::Std to Getopt::Long.
70 Addressed [cpan #2629] Wrong assumption in numeric comparison
71 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
72 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
73 lib/Encode/Encoding.pm t/perlio.t
74 ! API Change: ->new_sequence() => ->renew()
75 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
76 + Encode::XS and Encode::utf8 now supports ->renew()
77 + Encode::Encoding now documents this with examples
78 - Non-XS (en|de)code stripped out of Encode::Unicode
79 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
81 1.95 2003/05/21 08:41:11
83 Since bogus entries were found in iso-8859-6, all entries are
84 re-generated once again out of
85 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
86 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
87 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
88 + lib/Encode/Unicode/UTF7.pm
89 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
90 lib/Encode/Supported.pod
91 UTF-7 support is now added. With this Encode now has all transcoding
92 methods in Unicode::String.
94 1.94 2003/05/10 18:13:59
95 ! lib/Encode/MIME/Header.pm
96 A more sophisticated solution for double-encoding by dankogai
97 ! lib/Encode/MIME/Header.pm AUTHORS
98 Two bugs fixed by Bjoern Jacke
99 * "Double Encoding" was not possible
100 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
101 * encode("MIME-Q") had UTF-8 flag on
102 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
103 ! lib/Encode/MIME/Header.pm AUTHORS
104 Two occurances of "croak ()" fixed as "croak qq()".
105 Simon Cozens is added to AUTHORS as a result.
106 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
108 POD fixes that reflect enhancements by jhi
110 Two enhancements by jhi.
111 + Now uses Getopt::Long so it accepts long name options
112 (--from for -f, for example)
113 + New option: -r,--resolve
114 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
116 META.yml added upon request of Schwern
117 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
119 Enache Adrian removed upon request -- to live longer than Encode
120 and/or FreeBSD (toy-)?thread :)
121 Message-Id: <20030425015701.GA2069@ratsnest.hole>
123 "close STDOUT unless $^O eq 'freebsd';" once again relocated
124 to keep VMS happy in which case "$^O eq 'freebsd'" is required
125 to keep FreeBSD+thread happy. Sigh.
126 Message-Id: <3EA88ADC.3000300@mac.com>
128 1.93 2003/04/24 17:43:16
130 added "no warnings 'pack'" in for loop to keep bleedperl from
131 complaining "Character in 'C' format wrapped in pack".
133 More elegant perl core detection inspired by Ilya Zakharevich
134 (but further elaborated for general cases).
135 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
138 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
139 are less likely to be confused w/ ISO-8859-*
141 RT tests added (vendor encodings are exemplified)
142 -- that successfully found a flaw on iso-2022-kr before the patch.
143 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
144 decode("ISO-2022-KR") has been buggy but no one ever sited
145 that since no one seems to be using it. Bugs discovered by
147 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
148 ! lib/Encode/CN/HZ.pm t/perlio.t
149 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
150 so it adds test for HZ.
151 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
152 ! lib/Encode/Guess.pm
153 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
154 So long as the string contains \x{00}-\x{ff} it does not fail.
155 See perldoc for details.
156 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
158 1.92 2003/03/31 03:27:27
159 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
160 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
161 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
162 but glibc-BIG5-2.1.2.ucm does.
163 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
164 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
165 Maps regenerated again but this time based upon
166 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
167 (But where is THE DOCUMENT by MICROSOFT?)
168 ! t/enc_module.t AUTHORS
169 failure with threaded Perl on FreeBSD addressed.
170 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
171 Message-Id: <20030322230131.GA813@ratsnest.hole>
172 ! lib/Encode/Guess.pm
175 Change 18989: Make the :bytes conditional on PerlIO.
176 further Modified by Dan Kogai
177 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
179 Chnage 18966: another fix for failing test on windows ("use encoding"
180 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
181 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
183 Change 18970: Hopefully this works also in Win32, by jhi
184 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
185 Change 18965: fix CJKT.t failures on windows due to incorrect
187 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
189 1.91 2003/03/09 20:07:37
191 even more proofread by jhi.
192 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
195 +use lib qw(t ext/Encode/t ../ext/Encode/t);
196 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
198 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
199 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
202 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
204 1.90 2003/03/09 17:32:43
207 Inaba-san has added a patch for perl 5.8.1 or later that makes
208 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
209 that. POD is further revised.
210 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
211 ! encoding.pm t/enc_module.t
212 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
213 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
217 signed vs. unsigned issue discovered by Craig on OpenVM
218 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
219 ! encoding.pm AUTHORS
220 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
221 Because binmode() stacks layers instead of overwrite, you have to
222 ":raw :encoding()" in encoding.pm or your are in trouble when you
223 call encoding.pm multiple times. There are several workarounds
224 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
225 who was the first to address this problem was added to AUTHORS.
226 The test suites was added for this, which is a modified version
227 of SUGAWARA-san's scripts
228 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
230 1.88 2003/02/20 14:42:34
232 one signedness nit for Encode by jhi
233 <200302161933.h1GJX876018710@kosh.hut.fi>
235 VISCII map was incorrect; fixed by Sadahiro-san
236 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
237 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
238 You can't unlink files that are opened in cygwin but the last
239 file handle opened in t/enc_*.t left open. Patch submitted
240 by Yitzchak and he was added to AUTHORS.
241 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
243 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
244 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
246 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
247 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
249 1.87 2003/02/06 01:52:11
251 * Inaba "Sensei" Hirohito added (I thought I have done so a long
252 ago but apparently I did not).
253 * SUZUKI Norio added for verious and useful bug reports.
254 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
255 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
256 podchecked so all warnings are gone except for L<http://>.
257 ! encoding.pm t/enc_eucjp.t
258 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
259 * Filter option overhaul
261 ! Encode.pm Encode.xs encengine.c Encode/encode.h
262 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
263 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
264 without filter. podchecked by Dan Kogai.
265 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
266 ! lib/Encode/Alias.pm
267 decode('alias', $1) went wild because of local $_ in find_alias()
268 the evil local $_ is eradicated but that changes find_alias()
269 format for coderef aliasing. See Encode::Alias for details
270 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
272 1.86 2003/01/22 03:29:07
274 * Don't forget to canonize when you attempt an exact match!
275 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
276 * ${^ENCODING} exception is off for $] > 5.008
277 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
279 $] check commented out so it runs on 5.8.0
281 1.85 2003/01/21 22:19:14
283 ${^ENCODING} exception is now explicit rather than handled by regex.
284 + t/enc_eucjp.t t/enc_utf8.t
285 Test suite for the better "encoding" pragma support for bleedperl.
286 On 5.8.0, they will just be skipped.
288 1.84 2003/01/10 12:00:16
290 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
291 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
292 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
294 %_ fixes saves the resulting .so .05% smaller, by NC
295 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
297 Silence Encode on undef, by Andreas
298 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
299 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
301 s/regognised/recognised/ . British spelling left intact to pay
302 respect to two British Nicks :)
303 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
305 1.83 2002/11/18 17:28:49
306 ! Encode.xs lib/Encode/JIS7.pm
307 Even more patches from Inaba-san has been applied. With this
308 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
309 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
311 1.82 2002/11/14 23:06:12
313 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
314 later found in perls w/ -Dusemymalloc was fixed by NC.
315 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
317 1.81 2002/11/08 18:29:27
318 ! Encode.pm Encode.xs
319 Non-XS version of Encode::utf8 is back (with XS being default).
320 Encode::predefine_encodings(0) to turn off XS.
321 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
322 found that it was Test::More causing the problem, not Encode.
323 But I have already made it configurable so it may be useful in
325 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
327 The ingenious patch by Nicholas Clark that reduces shlib sizes by
328 50% with no penalty and backward compatibility preserved, is in.
329 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
331 1.80 2002/10/21 20:39:09
332 ! Encode.xs t/mime-header.t
333 Even more patches from NI-XS regarding Encode::utf8->decode().
334 And one more test to t/mime-header.t to prove it
335 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
337 1.79 2002/10/21 06:05:37
339 Further patches from NI-XS. Encode::utf8->decode() now checks the
340 value of utf8 flag of the argument. As a result, the fix to
341 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
342 no harm (even speedwise) I'll leave it unreverted.
343 ! ucm/cp949.ucm ucm/cp950.ucm
345 U+00AE REGISTERED SIGN
346 were missing as a result of 1.78. Discovered by Moriyama-san.
347 Moriyama-san has also developed a test script that compares
348 (en|de)coded results to the corresponding Win32 API result and
349 all cp9?? maps are now verified.
350 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
352 1.78 2002/10/20 15:44:00
353 ! lib/Encode/MIME/Header.pm
354 fixed so that it works with new Encode::utf8
355 ! Encode.pm Encode.xs
356 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
357 :encoding(UTF-8) to handle partial chars at end of buffers
359 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
360 ! lib/Encode/Supported.pod
361 More nitpickings applied.
364 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
365 tests were completely useless. To redeem that and get the peace of
366 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
367 ucm/*.ucm Since this script takes too long to finish (30 seconds on
368 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
369 but you can easily run that by either renaming it or:
371 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
372 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
373 Other CJKT cp9?? also updated according to the URI below;
374 http://www.microsoft.com/typography/unicode/cscp.htm
375 + bin/ucmsort MANIFEST
376 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
377 to proper order. intended for hardcore develpers only.
378 ! ucm/cp932.ucm JP/JP.pm AUTHORS
379 CP932 mapping which was based upon the mapping file at unicode.org
380 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
381 has also supplied the patch so he was added to AUTHORS.
382 ! lib/Encode/Supported.pod
383 ISO-8859-11 != TIS 620
384 == TIS 620 + \xA0 ( )
386 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
388 1.77 2002/10/06 03:27:02
390 * Modified to accomodate up and comming patch by Inaba-san that
391 will fix tr/// needing eval qq{}
392 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
394 * pod fixes/enhancements to reflect the changes above
395 ! lib/Encode/Alias.pm
396 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
397 Message-Id: <20021001015648.GB18710@not.autrijus.org>
399 1.76 2002/08/25 15:09:51
401 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
402 but naturally identical to previous version -- dankogai
404 Codepoint fixes -- autrijus
405 Message-Id: <20020805040236.GC5220@not.autrijus.org>
407 copied everything under perl-5.8.0/ext/Encode to make sure Encode
408 is in sync w/ perl core
410 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
411 Now that binmode(FH) does implicit ":bytes" revisit
412 the failing tests. The worrisome one is the Digest::MD5
413 test-- how will it fare in CRLF lands now?
415 From: Radu Greab <radu@netsoft.ro>
416 Date: Mon, 10 Jun 2002 00:40:34 +0300
417 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
418 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
419 ones which I cannot figure out.
420 ! lib/Encode/Alias.pm
421 Subject: [Encode PATCH] spurious warning
422 From: Nicholas Clark <nick@unfortu.net>
423 Date: Sun, 2 Jun 2002 20:26:22 +0100
424 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
426 1.75 2002/06/01 18:07:49
427 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
428 glibc compliance cited by Autrijus.
429 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
430 ! bin/enc2xs bin/piconv
431 Subject: Re: forewarning: usedevel and versiononly
432 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
434 1.74 2002/05/28 18:33:15
435 + ucm/null.ucm ucm/ctrl.ucm
436 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
437 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
438 Subject: Re: unicode -> &# notation
439 Message-ID: <20020518193704.GB40272@not.autrijus.org>
441 1.73 2002/05/28 17:26:18
442 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
443 Chris Nandor has fixed Encode so that it works w/ MacPerl --
444 at least w/ PPC (68k need static linking which does not work due to
445 64k limit). pudge is added to AUTHORS (I'm surprised he was not
446 there in the list). Encode/Makefile_PL.e2x was additionally fixed
447 by dankogai to reflect changes in other Makefile.PL
448 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
450 Subject: Change 16746: -Mutf8 cleanup.
451 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
453 1.72 2002/05/20 15:49:56
455 Subject: [PATCH] Encode should be in perl-core library path
456 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
457 Message-Id: <20020520161201.A11019@alpha.hut.fi>
458 ! lib/Encode/MIME/Header.pm
459 Subject: [PATCH] Encode::MIME::Header
460 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
461 ! Encode/Makefile_PL.e2x
462 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
463 Message-Id: <20020519201031.GA1603@not.autrijus.org>
464 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
465 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
467 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
468 Subject: [PATCH] good day for WinCE port of perl.
469 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
472 Even more linting by Robin via @16532
474 Even more typecast by Sarathy in @16460
476 1.71 2002/05/07 16:22:42
478 even more typecasts by Robin
479 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
481 A very strange bug that was causing a bugus ucm -> C table
482 generation that was revealed by a UCM file that Andreas was
483 working. This is the king of wierdest bug I've encountered
484 in the course of Encode maintenance.
485 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
487 1.70 2002/05/06 10:26:48
489 Made more 'module-safe' with conjunction w/ 'no encoding'.
490 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
491 ! lib/Encode/Encoding.pm
492 'require Encode' because ->Define uses Encode::define_encoding();
493 problem and solution addressed by Miyagawa-kun
494 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
496 Cuts the frill to make djgpp happier, as suggested by Laszlo
497 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
499 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
500 Message-Id: <m3bsbug48n.fsf@anima.de>
502 1.69 2002/05/04 16:41:18
503 ! lib/Encode/MIME/Header
504 Floating-point coerced for UNICOS (in integer arithmetics it folds
505 line one character too early). Verification by Mark is pending.
506 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
508 more doc patch from Elizabeth
509 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
510 ! Encode/Makefile_PL.e2x
511 More platform-independent patch from Benjamin
512 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
513 ! lib/Encode/Guess AUTHORS
514 split regex fix by Graham Barr. Adds him to AUTHORS.
515 Message-Id: <20020504085419.E95940@valueclick.com>
516 ! Encode/Makefile_PL.e2x
517 enc2xs script discovery made smarter and more sensible, first cited
518 by Miyagawa-kun and further suggestions by Rafael and Andreas
519 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
520 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
522 1.68 2002/05/03 12:20:13
523 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
524 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
525 and Supported.pod modified to reflect the change. Elizabeth added
526 to Authors. And H.M. is also added for forwarding her patch among
527 other contributions (I was rather surprised to find his name was not
529 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
531 1.67 2002/05/02 07:33:09
533 Error message now consistent w/ perlqq (\N{U+} -> \x{})
534 done in perl@16308 but Philip linted me further. Now the error
535 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
536 ! lib/Encode/Guess.pm
537 Sanity check for happier -w by Autrijus
539 1.66 2002/05/01 05:41:06
540 ! Encode.xs t/fallback.t
541 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
542 while fallback is in effect. This even came with a welcome side-effect
543 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
544 also modified to test this.
545 And of course, the corresponding varialbles to UV[Xx]f are appropriately
546 cast. This should've concluded NI-XS homework.
548 encode(undef) does warn again! Repented upon suggestion by NI-XS.
549 Document for unless vs. '' added
550 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
552 1.65 2002/04/30 16:13:37
554 encode(undef) no longer warns for C<Use of uninitialized value in
555 subroutine entry>. Suggested by Paul.
556 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
557 ! lib/Encode/Supported.pod
558 Encode::MIME::Header and Encode::Guess mentioned
559 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
560 ! lib/Encode/Guess.pm
561 POD fix by Miyagawa-kun
562 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
564 1.64 2002/04/29 06:54:06
566 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
567 encocde("euc-jisx0213") are still DIFFERENT.
568 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
570 A few white spaces corrected by NI-XS via PerlIO integration to
572 Subject: Change 16247: Integrate perlio;
574 Document fixes by Andreas
575 Message-Id: <m3k7qsf1we.fsf@anima.de>
577 1.63 2002/04/27 18:59:50
578 ! lib/Encode/Encoding.pm
579 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
580 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
581 Make use of the Encode::Encoding base class!
582 And other cleanups in Encode.xs upon NI-XS suggestions
583 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
585 1.62 2002/04/27 11:17:39
587 encodings() now just check %ExtModule instead of eval{require}
588 all of them for ":all" to conserve more memory.
590 more "%x" -> "%" UVxf stuff.
592 s/=over2/=over 2/g # oops.
594 1.61 2002/04/26 03:02:04
596 Now does decent tests besides use_ok()
597 ! lib/Encode/Guess.pm t/guess.t
598 UI streamlined, document added
600 various signed/unsigned mismatch nits (#16173)
601 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
603 POD: utf8-flag-related caveats added. A few sections completely
607 Thou shalt not assume %d works, either!
608 Robin Baker added to AUTHORS for this
609 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
611 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
613 1.60 2002/04/24 20:06:52
615 "Thou shalt not assume %x works." -- jhi
616 Message-Id: <20020424210618.E24347@alpha.hut.fi>
617 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
618 low-memory build machines happy, now *.c is created for each *.ucm
619 (no table aggregation). You can still override this by setting
620 $ENV{AGGREGATE_TABLES}.
621 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
622 + lib/Encode/Guess.pm
623 + lib/Encode/JP/JIS7.pm
624 Encoding-autodetect (mainly for Japanese encoding) added. In a
625 course of development, JIS7.pm was improved.
626 + lib/Encode/HTML/Header.pm
627 + lib/Encode/Config.pm
628 MIME B/Q Header Encoding Added!
629 ! Encode.pm Encode.xs t/fallback.t
630 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
631 Message-Id: <20020424130709.GA14211@tanglefoot>
633 1.59 $ 2002/04/22 23:54:22
634 ! Encode.pm Encode.xs
635 needs_lines() and perlio_ok() are added to Internal encodings such
636 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
638 1.58 2002/04/22 23:54:22
640 s/MacChineseSimp/MacChineseTrad/ # ... oops.
645 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
646 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
647 t/(JP|KR).t are aggregated to t/CJKT.t
648 test data are all remade via bin/ucm2text.
649 And .... They are no longer skipped for -Uuseperlio !
651 1.57 2002/04/22 20:27:30
652 ! t/JP.t t/KR.t t/perlio.t
653 unless (find PerlIO::Layer 'perlio') ... line is back again.
654 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
655 need PerlIO. Sigh....
656 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
657 ->perlio_ok now does eval{ require PerlIO::encoding } there so
658 it correctly returns 1 when PerlIO::encoding is yet loaded.
660 perl-current patch #16072 reflected
662 1.56 2002/04/22 09:48:07
663 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
664 New PerlIO::encoding 0.04 compliance met
666 1.55 2002/04/22 03:43:05
667 ! Encode.pm Encode.xs Unicode/Unicode.pm
668 needs_lines() defined so Encode::Encoding is no longer needed
671 1.54 2002/04/22 02:50:01
672 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
673 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
674 now perlio_ok is true by default if PerlIO::encoding->VERSION is
675 0.03 or larger. POD in Encode::Encoding revised to reflect this.
676 Encode::XS and Encode::Unicode now has perlio_ok() method.
677 ! lib/Encode/Supported.pod
678 s/UP-UX/HP-UX/ by jhi
679 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
680 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
681 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
682 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
683 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
684 ! lib/Encode/Supported.pod
685 Huge document fixes by Philip.
688 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
689 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
691 binmode() after "<:encoding" to make Win32 happy, by Mattia.
692 Mattia added to AUTHORS file
693 Message-Id: <3CC3150F.5798.22A05AE@localhost>
695 1.52 2002/04/20 23:43:47
697 TODO: is now SKIP:, as NI-XS requested. Also adds more
698 eraborate failure analysis added.
700 A note on how to make sure of round-trip safety added to POD
701 section (so Autrijus is happier)
702 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
703 big5-(eten|hkscs) is round-trip safe again!
704 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
706 Typo fixes by Andreas
707 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
708 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
709 PerIO coodination patches from NI-XS.
710 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
712 1.51 2002/04/20 09:58:23
714 Updated test suite by Autrijis so "make test" is happy again
715 Message-Id: <20020420082104.GA25037@not.autrijus.org>
717 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
719 TW/TW.pm TW/Makefile.PL
720 Updates by Autrijus. 'big5' is no longer a canonical but an
721 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
722 Message-Id: <20020419195346.GA19597@not.autrijus.org>
724 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
725 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
727 PerlIO detection a little bit smarter; no longer uses eval qq{}
730 1.50 2002/04/19 06:13:02
731 ! ! Encode.pm Encode.xs Encode/encoding.h
733 New Fallback API imlemented and documented. See "perldoc Encode"
735 ! lib/Encode/JP/JIS7.pm Encode.pm
736 + lib/Encode/PerlIO.pod t/perlio.t
737 API compliance met. However, it still does not work unless perlio
738 implements line buffer. See BUGS section in perldoc Encode::PerlIO
739 As a sensible workaround, perlio_ok() added to Encode.
741 ! lib/Encode/Supported.pod
743 Message-Id: <20020418174647.J8466@alpha.hut.fi>
745 Doc fixes from Autrijus
746 Message-Id: <20020418144131.GA10987@not.autrijus.org>
748 perlqq mode documented
750 + t/jisx0201.euc t/jisx0201.ref
751 ! t/jisx0208.euc t/jisx0208.ref
752 t/JP.t tests more rigorously and with other encodings
753 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
754 in jis0208 (used to contain jisx0201 part).
755 ! Encode/Makefile_PL.e2x
756 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
757 enc2xs and encode.h rather than hard-coded. This allows the resulting
758 module fully CPANizable.
759 ! encoding.pm t/JP.t t/KR.t
760 PerlIO detection simplified (checks %INC instead of eval{})
761 ! Encode.xs Encode/encode.h
762 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
763 - lib/Encode/Unicode.pm
764 (en|de)code_xs relocated to where it belongs. Source reindented
767 Additional (U8 *) cast added as suggested by jhi
768 Message-Id: <20020417165916.A28599@alpha.hut.fi>
770 1.42 Date: 2002/04/17
772 no-op module; Thought of adding a pod there but enc2xs has
777 correct mechanism to detect Perlio::encoding layar installed.
779 PerlIO Layer detached.
781 1.41 2002/04/16 23:35:00
783 binmode(STDIN|STDOUT ...) done iff PerlIO is available
785 Cleaned up PerlIO skip conditions to prepare for the upcoming
786 Encode - PerlIO forking.
788 exported functions are now prototyped.
789 ! lib/Encode/CN/HZ.pm
792 fallback implemented # was /* FIXME */
793 affected programs revised to fit (only HZ was using the try-catch
794 approach which needed to be fixed for API-compliance).
796 ! Encode/KR/2022_KR.pm
798 can find =head1 NAME now, jhi
799 Message-Id: <20020416083059.V30639@alpha.hut.fi>
803 now complies with less warnings with the pickest compilers.
804 Suggested by Craig, fixed by Dan.
805 ! Encode/Makefile_PL.e2x
807 A bug that fails to find *.e2x in certain conditions fixed
809 1.40 2002/04/14 22:27:14
810 + Encode/ConfigLocal_PM.e2x
811 ! lib/Encode/Config.pm
813 "enc2xs -C" now generates/updates Encode::ConfigLocal.
814 ConfigLocal_PM.e2x is a skelton thereof.
815 ! lib/Encode/Config.pm
817 "use Encode::CN::HZ;" was missing.
820 More rigorous tests added to test XS, especially on memory allocation.
822 ! lib/Encode/Unicode.pm
823 NI-S implemented an XS version -- merged
824 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
827 Source filter option added. With this option on, you can write
828 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
829 encodings. t/jperl.t enhanced to test this feature.
831 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
832 apply his suggestion, this degree of nitting is enough to add him
834 Message-Id: <3CB93223.291E5E2E@earthlink.net>
836 + lib/Encode/JP/JIS7.pm
837 - lib/Encode/JP/JIS.pm
838 - lib/Encode/JP/2022_JP.pm
839 - lib/Encode/JP/2022_JP1.pm
840 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
841 JIS7.pm for better maintainability and performance
843 Added caveat for non-ascii identifiers.
845 fixes by jhi, the original author of this pragramtic module.
846 Message-Id: <20020413231527.V1826@alpha.hut.fi>
848 1.34 2002/04/12 20:23:05 (Unreleased)
851 EBCDIC fixes addressed by jhi.
852 Message-Id: <20020412161844.D9383@alpha.hut.fi>
853 ! lib/Encode/Encoder.pm
854 POD fix by Miyagawa-kun
855 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
857 1.33 2002/04/10 22:28:40
859 Philip's mail address corrected.
862 ! lib/Encode/Encoder.pm
863 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
864 This adds Miyagawa-kun to AUTHORS.
865 * encoding() no longer exported by default but on demand
866 * t/Encoder.t updated to test all these
867 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
868 ! lib/Encode/Unicode.pm
869 ! lib/Encode/Supported.pm
870 Further doc fixes by Anton
872 1.32 2002/04/09 20:06:15
875 - ucm/macDevanaga.ucm Unicode Character Map
876 - ucm/macGujarati.ucm Unicode Character Map
877 - ucm/macGurmukhi.ucm Unicode Character Map
878 A utility to check integrity of .ucm files. t/bogus.ucm is a
879 ucm that is deliberately bogus. unused Indic mappings are removed
882 resolve_alias() added as suggested by jhi. Same as
883 find_encoding("alias")->name. For convenience. This one is
884 defined in Encode.pm instead of Alias.pm.
885 Message-Id: <20020409215846.H17022@alpha.hut.fi>
887 Memory Allocate but detected during the devel of ucmlint -- fixed.
888 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
889 ! lib/Encode/Unicode.pm
890 valid_ucs2(0) is false but must be true.
891 3 patches from NI-S as follows. This also has fixed the incident
893 ! lib/Encode/Alias.pm
894 find_alias() recursion prevention
896 Checks for the patch above
897 ! t/Encode/Unicode.pm
898 An extra "F" that causes valid_ucs2() return a bogus value fixed
899 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
900 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
901 2 Small Patches from jhi as follows:
903 Encode->encodings() lists in case-insensitve order (as it was)
905 -l option prints avaiable encodings to STDOUT instead of STDERR
906 ! lib/Encode/Aliases.pm
907 s/defintion/definition/
908 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
910 ! lib/Encode/Supported.pod
911 ! lib/Encode/Unicode.pm
912 POD revise by Philip Newton. This adds Philip to AUTHORS list.
913 Thank you for the exact quote of Douglas Adams :)
914 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
916 1.31 2002/04/08 18:08:07
917 ! lib/Encode/Encoder.pm
919 Encode::Encoder, once just a placeholder of an idea, is now much more
920 practical. See t/Encode.t to find how practical it can be.
921 + lib/Encode/Config.pm
923 my false laziness at Encode.pm is fixed. Now %ExtModules are set
924 in Encode::Config and they are all literally, not programatically
925 set. My false laziness was resulting many encodings missing from
927 ! lib/Encode/Unicode.pm
929 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
930 so that it does not rely Encode::Unicode for BOM values
931 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
933 1.30 2002/04/08 02:34:51
934 + lib/Encode/Encoder.pm
935 Object Oriented Encoder. I reckon something like this is in need.
938 ! lib/Encode/Supported.pod
939 * autoloading bug that prevented upper-case canonicals such as UTF-16
940 is fixed. Now even UTF/UCS are autoloaded!
941 * encodings() is now more intuitive.
942 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
944 * Obligatory fixes to the POD.
945 ! lib/Encode/Supported.pod
946 Patch from Anton applied.
947 Message-Id: <66641479.20020408033300@motor.ru>
949 ! lib/Encode/Unicode.pm
950 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
952 1.28 2002/04/07 18:58:42
956 Just a MANIFEST for those missing files.
958 1.26 Date: 2002/04/07 15:22:04
961 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
962 And t/Aliases.t fixed to make sure they all exist.
964 1.25 2002/04/07 15:01:25 (Unreleased)
966 ! lib/Encode/Unicode.pm
969 - lib/Encode/UTF_EBCDIC.pm
970 - lib/Encode/Internal.pm
972 Integrated into Encode.pm as closures. That way "one package, one file"
973 rule is preserved yet less files to require.
975 commented out binmode(STDERR ...
984 ! Encode/Makefile_PL.e2x
985 Schwarn's MM-compliance patch merged
986 Message-Id: <20020406082609.GA28758@blackrider>
988 ! lib/Encode/Unicode.pm
989 + lib/Encode/UTF_EBCDIC.pm
991 - lib/Encode/10646_1.pm
992 - lib/Encode/ucs2_le.pm
993 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
994 per-module based (en|de)code, I saved a number of .pm by
995 reorganizing it as per-object base (Well, this is what Encode::XS
996 does under the hood). See Encode::Unicode for details.
997 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
998 This module is used only on EBCDIC environments.
1000 1.21 2002/04/05 14:46:34 (Not Released)
1006 Are back to make Perl/Tk happy Smile, NI-S.
1008 ! lib/Encode/Alias.pm
1009 ! lib/Encode/Supported.pm
1010 ! lib/Encode/10646_1.pm
1011 ! lib/Encode/ucs2_le.pm
1012 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1013 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1014 updated to reflect changes.
1015 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1016 ! lib/Encode/Alias.pm
1017 ! lib/Encode/Supported.pm
1018 Anton's revision commited. Added Dan's own fixes as well.
1019 Message-Id: <159103166906.20020405161134@motor.ru>
1020 ! lib/Encode/Alias.pm
1022 < qr/^UCS2-le$/i => '"UCS-2"', );
1024 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1025 Sigh. Thank you, Anton.
1026 Message-Id: <14567692196.20020405062020@motor.ru>
1027 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1029 1.20 2002/04/04 19:50:52
1031 the last minute addtion. Just give it a try. Docs remains to be done.
1032 Not installed by default.
1033 ! lib/Encode/Supported.pod
1036 ! lib/Encode/Alias.pm
1037 ! lib/Encode/utf8.pm
1038 ! lib/Encode/10464_1.pm
1039 ! lib/Encode/ucs2_le.pm
1040 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1041 unchanged but UTF-16BE is added as an alias. Implicit aliases
1042 move to Encode::Alias so init_alias() works more as expected.
1043 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1044 Though pedantically wrong, This should make perl mongers happier.
1045 t/Alias.t is enhanced to test all these.
1046 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1048 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1049 of the codepoints are common. CJKT left as is because the saving is
1053 ! EBCDIC/Makefile.PL
1055 ! Encode/Makefile_PL.e2x
1059 ! Symbol/Makefile.PL
1063 All occurance of _def.h replaced with .exh so djgpp works happily
1064 ever after! To credit this amazing discovery, Laszlo is now in
1066 Message-Id: <20020403181424.GA8778@freemail.hu>
1067 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1070 ! Encode/Makefile_PL.skel
1072 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1073 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1075 Two more tests by added jhi
1076 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1079 The showstopper fixed -- Memory reallocation bug was causing
1080 Encode::XS to fall into infinite loop on certain conditions.
1081 t/grow.t tests that.
1082 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1087 ! lib/Encode/Supported.pod
1088 Vendor encodings rebuilt out of original map files at unicode.org.
1089 Indic languages such as MacDevanagali remain unspported do to the
1090 shortcoming of encengine capabilities (they need algorithmical
1091 conversion and I have no knowledge on that!). Pods fixed for added
1093 Oh, macJapan.ucm renamed to macJapanese.ucm.
1094 macROMnn is macRomanian and macRUMnn is macRumanian.
1095 txt2ucm is a crude script that is used to convert them.
1097 Unicode Compound Characters (used extensively on Mac) supported
1099 Typo fixes and improvements by jhi
1100 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1102 1.11 2002/03/31 22:12:13
1106 Missing files from the MANIFEST fixed.
1107 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1108 Version incremented just to make CPAN happy.
1110 1.10 2002/03/31 21:32:42
1113 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1114 if you want. This should make Autrijus happy. Also, piconv
1115 is added to default install.
1118 Here-documented files that enc2xs generates are now exported
1119 to *.e2x. Much cleaner and easier to debug.
1121 encoding enhances so you can make it act more like such
1122 (now prehistoric ) "localized" variations of perl like Jperl.
1124 Further test for encoding.pm. Written in euc-jp
1127 Taken over form jhi.
1128 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1133 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1140 ! Encode/macIceland.ucm
1141 ! lib/Encode/Alias.pm
1142 ! lib/Encode/Supported.pod
1143 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1144 Message-Id: <m3lmcavhjt.fsf@anima.de>
1146 1.01 2002/03/29 20:59:39
1149 s/USE_SCRIPTS/MORE_SCRIPTS/
1151 installs enc2xs by default for external Encode:: modules in CPAN,
1152 such as Encode::HanExtra
1154 More sensible perl core detection via $ENV{PERL_CORE}
1156 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1158 Perl core ditection via $^X =~ m/\bminiperl$/o
1159 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1161 1.00 Wed Mar 29 2002
1163 The version of all files is updated to 1.00 via "ci -f -l1.00",
1164 commemorating version 1.00. All files, including *.ucm are now
1165 under version control.
1168 encode.h moved to Encode/ so it will be installed for the later
1171 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1176 compile renamed to enc2xs.
1177 Affected Makefle.PL updated
1179 "Punt it. HanExtra can take care of that later." -- Autrijus
1180 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1183 ! Encode/ksc5601.ucm
1184 ! lib/Encode/CJKConstants.pm
1185 ! lib/Encode/KR/2022_KR.pm
1186 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1187 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1191 bin/ added for example scripts. They are not installed by default.
1192 to install them, "perl Makefile.PL USE_SCRIPTS".
1193 piconv is iconv reinvented in perl. in addition to all features
1194 of iconv, it also adds perlish features. See L<piconv/1> for more
1196 ! lib/Encode/Alias.pm
1197 qr/^ replaced with qr/\b so it directly matches locale names
1198 such as en_US.US-ASCII
1201 Patch by MJD to fix the following problem applied.
1202 Subject: [PATCH 5.7.3 Encode]
1203 Aliases.t not properly skipped when Encode extension not built
1204 Message-Id: <20020328091850.18677.qmail@plover.com>
1205 ! lib/Encode/KR/2022_KR.pm
1206 ! lib/Encode/CJKConstants.pm
1207 Another patch from Jungshik to make iso-2022-kr actually work
1208 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1209 ! Encode/Encode/euc-kr.ucm
1210 + Encode/Encode/johab.ucm
1211 ! Encode/Encode/ksc5601.ucm
1213 ! Encode/KR/Makefile.PL
1214 ! Encode/lib/Encode/Alias.pm
1216 Johab support and complete revision of Korean Encoding by Jungshik
1217 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1219 Revised to make up with now-dropped Encode::Details.
1220 - lib/Encode/Details.pod
1221 Dropped. Besides being obsolete, the topics are now covered in
1222 respective pods now.
1227 Korean aliases fixed thanks to Jungshik Shin
1228 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1229 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1231 =head1 NAME added to all modules to make buildtoc happy
1232 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1233 - lib/Encode/CJKguide.pod
1234 Too controversial and dropped from the dist. Will be available
1235 separately on the web.
1237 RCS tags added so table debugging gets easier (should that be
1238 needed! I hope they all stay 1.00!)
1239 + lib/Encode/CJKguide.pod
1240 A detailed guide to mainly, but not limited to, CJK multibyte
1243 + Encode/hp-roman8.ucm
1245 ! Encode/Supported.pod
1246 All occurance of "roman8" replaced with "hp-roman8" to avoid
1248 ! Encode/Supported.pod
1251 Mac Encodings now comply the Inside Macintosh
1253 Test for '-raw' conventions added.
1255 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1256 ! Encode/gb12345.ucm
1258 ! Encode/ksc5601.ucm
1259 "-raw" appended to canonical names.
1260 File mames stay unchanged thanks to UCM format.
1261 ! lib/Encode/CN/HZ.pm
1262 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1263 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1265 0.99 Tue Mar 26 2002
1266 - lib/Encode/JP/Const.pm
1267 + lib/Encode/CJKConstants.pm
1268 + lib/Encode/CN/2022_CN.pm
1269 + lib/Encode/KR/2022_KR.pm
1278 * Support for ISO-2022-KR and ISO-2022-CN added.
1280 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1281 * ucm2table autogenerates character table out of UCM files.
1284 - lib/Encode/Supports.pod
1285 + lib/Encode/Supported.pod
1286 Names reverted due to popular demand.
1287 8.3 rule applies only when there is a conflict.
1288 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1293 - lib/Encode/Format/Enc.pod
1295 * Character tables is now 100% ucm.
1296 * All files under Encode/ is now 8.3-compliant
1297 * some of missing encodings added (i.e. gsm0338 and nextstep)
1298 * Vendor mappings aggregated with appropriate national std in
1299 Makefile.PL, resulting smaller *.so especially for CJK.
1300 Following is result on Dan's FreeBSD box.
1302 ---------------------------------------------------------------
1303 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1304 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1305 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1306 blib/arch/auto/Encode/Encode.so 27,791 27,791
1307 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1308 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1309 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1310 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1311 ---------------------------------------------------------------
1312 Total 5,375,297 6,343,819
1314 * As a result of ucm-transition, Encode::Tcl dropped because
1315 Encode::Tcl demands *.enc.
1316 Encode::Tcl will be supplied in a separate tarball with *.enc.
1317 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1322 -lib/Encode/Supported.pod
1323 +lib/Encode/Supports.pod
1324 -lib/Encode/iso10646_1.pm
1325 +lib/Encode/10646_1.pm
1326 -lib/Encode/EncFormat.pod
1327 +lib/Encode/Format/Enc.pod
1328 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1329 - lib/Encode/JP/Constants.pm
1330 + lib/Encode/JP/Consts.pm
1331 ! lib/Encode/JP/JIS.pm
1332 ! lib/Encode/JP/H2Z.pm
1333 Version nit problem and 8.3 rule fix.
1334 > Package namespace installed latest in CPAN file
1335 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1336 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1337 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1338 accordingly. In addition, legacy "use vars qw()..." are replaced with
1340 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1341 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1343 - lib/Encode/JP/ISO_2022_JP.pm
1344 - lib/Encode/JP/ISO_2022_JP_1.pm
1345 + lib/Encode/JP/2022_JP.pm
1346 + lib/Encode/JP/2022_JP1.pm
1348 8.3 naming conflict for vanilla fat addressed by jhi
1349 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1352 Typecast fix addressed by jhi
1353 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1355 0.98 Mon Mar 25 2002
1356 ! lib/Encode/Supported.pod
1358 + lib/Encode/JP/ISO_2022_JP_1.pm
1359 ! lib/Encode/JP/ISO_2022_JP.pm
1360 ! lib/Encode/JP/JIS.pm
1362 Now Encode::JP is more strict on the difference between ISO-2022-JP
1363 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1364 makes Anton happier :) FYI the previous version implements
1365 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1366 ! lib/Encode/Supported.pod
1369 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1370 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1375 ! lib/Encode/Suppoted.pod
1376 pod fixes to replace F<http://...> to L<http://...>,
1377 as suggested by Autrijius in:
1378 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1379 ! lib/Encode/Suppoted.pod
1380 fixes and enhancements by Anton
1381 Message-Id: <10632060120.20020324103753@motor.ru>
1382 ! lib/Encode/Alias.pm
1383 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1384 added. Suggested by Anton then deobfuscated by Autrijius
1385 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1387 Further fix by Nicholas Clark
1388 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1389 - lib/EncodeFormat.pod
1390 + lib/Encode/EncFormat.pod
1392 File renamed as suggested by Autrijius
1394 ! lib/Encode/Details.pod
1395 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1396 ! Encode.pm Sun Mar 24 13:43:47 2002
1397 pod fixes by Autrijius.
1398 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1399 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1401 ! lib/Encode/Alias.pm
1403 now more EBCDIC conscious;
1404 %ExtModules on EBCDIC system excludes CJK so that you don't
1405 have to worry about the matched alias resulting cloaking.
1406 t/Alias.t also revised to reflect changes. Verified by jhi
1407 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1409 0.97 Sun Mar 24 2002
1413 EBCDIC detection mechanism installed as in JP/JP.pm
1414 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1417 ! EBCDIC/Makefile.PL
1420 ! Symbol/Makefile.PL
1422 Now all table files used by compile are postfixed '_t' to avoid
1423 namespace collisions in case insensitive file systems once for all!
1425 Message-ID: <58290227735.20020323195659@familiehaase.de>
1427 Since the Encode::JP is unsupported under EBCDIC we
1428 cannot run this test (aliases as such should work fine) -- jhi
1429 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1431 duplicate occurance of ascii.ucm and 8859-1.ucm
1432 causes MacOS X dlyd to cloak
1438 < chdir 't' if -d 't';
1440 > if (! -d 'blib' and -d 't'){ chdir 't' };
1441 When you are "make test"-ing on Encode/ directory, you must not
1442 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1443 remain unchanced. Also the situation detection was made simpler
1444 in t/JP.t, which was originally;
1445 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1447 "Use of uninitialized value in string eq at Encode.pm line 96."
1448 ! Symbol/Makefile.PL
1449 ! EBCDIC/Makefile.PL
1451 -- Problem on case insensitive file systems
1452 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1453 Message-ID: <88254111953.20020323095503@familiehaase.de>
1456 "So I think it's a bug in gcc, not perl. But it still needs to be
1458 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1459 Message-Id: <20020323170509.C96475@plum.flirble.org>
1461 0.96 Sat Mar 23 2002
1463 ! lib/Encode/Encoding.pm
1464 ! lib/Encode/Alias.pm
1465 ! lib/Encode/Supported.pod
1467 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1468 Message-ID: <20020322073908.GB10539@blackrider>
1471 "...I think we should include ISO 8859-1 as well." -- NI-S
1472 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1477 ! lib/Encode/Alias.pm
1478 alias definitions relocated to Encode::Alias so module autoloading
1479 works for aliases also.
1481 encodings() now accepts args to check ExtModules.
1485 + EBCDIC/Makefile.PL
1486 + Symbol/Makefile.PL
1490 Latin and single byte encodings are reorganized so they are
1491 demand-loaded like Encode::XX. Now only ascii is compiled into
1493 ! lib/Encode/Alias.pm
1494 for my $k (keys %hash){ delete $hash{$k}; }
1495 is depreciated; fixed.
1497 0.95 Fri Mar 22 2002
1498 In this update, pod rewrites and alias fixes are the main issues
1499 + lib/Encode/Supported.pod
1500 Describes supported encodings
1502 streamlined compiled-in encodings.
1503 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1505 + Encode/ibm-125?.ucm
1506 Added from icu distibution with any occurance of
1507 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1508 some respect to icu staff, however.
1509 + lib/Encode/Alias.pm
1511 Alias difinitions in Encode.pm relocated.
1514 packWARN patch from Paul Marquess via jhi
1515 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1516 Paul added to AUTHORS as a result.
1517 ! t/CJKalias.t -> t/Aliases.t
1518 Renamed. Checks even more aliases and alias overloading
1521 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1522 gbk => cp936 relocated to CN.pm
1524 Test::More with plans (by jhi)
1526 0.94 Thu Mar 21 2002
1527 + lib/Encode/Description.pod
1528 ! lib/Encode/Encoding.pm
1529 Now the pod in Encode.pm is abridged as programming references.
1530 lib/Encode/Description.pod contains the original, detailed description
1531 and Encode::Encoding explains how to write your own module to
1532 add new encodings. So far, lib/Encode/Description.pod contains
1533 the whole pod once in Encode.pm. This is intentional.
1535 Pod revisions by Anton Tagunov
1536 Message-Id: <517178431.20020320174824@motor.ru>
1538 all occrance of Encode::Tcl::Extended removed including pod
1540 test now checks $encoding->name only; $encoding->{name} are
1541 no longer check to find the canonical name.
1542 ! lib/Encode/JP/JIS.pm
1543 ! lib/Encode/JP/ISO_2022_JP.pm
1544 ->name() added to be more compliant with API
1550 Patch by Autrijus to add aliases to TW and fixes to POD
1551 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1553 SADAHIRO Tomoyuki added as should. My apologies.
1555 0.93 Wed Mar 20 2002
1556 * First release to be uploaded to CPAN. For prehistoric changes,
1557 please see Changes file of perl distibution as well as
1558 perl-unicode@perl.org archive, available at:
1559 http://archive.develooper.com/perl-unicode@perl.org/
1561 Changes Since 0.92 includes;
1566 + Mention to perl-unicode@perl.org added
1568 + Encoding aliases added so you can feed locale names
1569 and MIME Charset="" directly.
1570 - Mention to JISX0212 removed because it's fixed
1573 + Encoding aliases added. Note TW is left untouched because
1574 euc-tw is not implemented in TW but in Encode::HanExtra.
1575 Autrijus, you may fix Encode::HanExtra.
1577 + to test encode aliases added