1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.97 2003/07/08 21:52:14 dankogai Exp $
5 $Revision: 1.97 $ $Date: 2003/07/08 21:52:14 $
6 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
7 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
8 $DEBUG replaced with DEBUG() so perl optimizes better,
9 by Rafael with further fixes by dankogai
10 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
11 ! lib/Encode/Aliases.pm
12 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
13 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
14 So new hash seeding introduced in bleedperl works.
15 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
17 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
18 automatic utf(8|16|32) guessing -- originally by Autrijus
19 Message-Id: <20030626162731.GA2077@not.autrijus.org>
21 Addressed the following;
22 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
23 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
25 1.96 2003/06/18 09:29:02
26 ! lib/Encode/JP/JP.pm t/guess.t
27 m/(...)/ in void context then $1 is considered a Bad Thing
28 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
30 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
33 More typecast from maintperl@19739
34 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
36 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
37 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
39 Now skips for EBCDIC platform.
40 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
42 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
43 have more than one C<.> in their filenames." -- perlport.
44 Message-Id: <3ED79E01.8050401@mac.com>
46 Found and fixed the back that -p,--perlqq does not work.
47 Induced by the change from Getopt::Std to Getopt::Long.
49 Addressed [cpan #2629] Wrong assumption in numeric comparison
50 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
51 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
52 lib/Encode/Encoding.pm t/perlio.t
53 ! API Change: ->new_sequence() => ->renew()
54 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
55 + Encode::XS and Encode::utf8 now supports ->renew()
56 + Encode::Encoding now documents this with examples
57 - Non-XS (en|de)code stripped out of Encode::Unicode
58 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
60 1.95 2003/05/21 08:41:11
62 Since bogus entries were found in iso-8859-6, all entries are
63 re-generated once again out of
64 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
65 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
66 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
67 + lib/Encode/Unicode/UTF7.pm
68 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
69 lib/Encode/Supported.pod
70 UTF-7 support is now added. With this Encode now has all transcoding
71 methods in Unicode::String.
73 1.94 2003/05/10 18:13:59
74 ! lib/Encode/MIME/Header.pm
75 A more sophisticated solution for double-encoding by dankogai
76 ! lib/Encode/MIME/Header.pm AUTHORS
77 Two bugs fixed by Bjoern Jacke
78 * "Double Encoding" was not possible
79 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
80 * encode("MIME-Q") had UTF-8 flag on
81 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
82 ! lib/Encode/MIME/Header.pm AUTHORS
83 Two occurances of "croak ()" fixed as "croak qq()".
84 Simon Cozens is added to AUTHORS as a result.
85 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
87 POD fixes that reflect enhancements by jhi
89 Two enhancements by jhi.
90 + Now uses Getopt::Long so it accepts long name options
91 (--from for -f, for example)
92 + New option: -r,--resolve
93 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
95 META.yml added upon request of Schwern
96 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
98 Enache Adrian removed upon request -- to live longer than Encode
99 and/or FreeBSD (toy-)?thread :)
100 Message-Id: <20030425015701.GA2069@ratsnest.hole>
102 "close STDOUT unless $^O eq 'freebsd';" once again relocated
103 to keep VMS happy in which case "$^O eq 'freebsd'" is required
104 to keep FreeBSD+thread happy. Sigh.
105 Message-Id: <3EA88ADC.3000300@mac.com>
107 1.93 2003/04/24 17:43:16
109 added "no warnings 'pack'" in for loop to keep bleedperl from
110 complaining "Character in 'C' format wrapped in pack".
112 More elegant perl core detection inspired by Ilya Zakharevich
113 (but further elaborated for general cases).
114 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
117 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
118 are less likely to be confused w/ ISO-8859-*
120 RT tests added (vendor encodings are exemplified)
121 -- that successfully found a flaw on iso-2022-kr before the patch.
122 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
123 decode("ISO-2022-KR") has been buggy but no one ever sited
124 that since no one seems to be using it. Bugs discovered by
126 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
127 ! lib/Encode/CN/HZ.pm t/perlio.t
128 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
129 so it adds test for HZ.
130 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
131 ! lib/Encode/Guess.pm
132 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
133 So long as the string contains \x{00}-\x{ff} it does not fail.
134 See perldoc for details.
135 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
137 1.92 2003/03/31 03:27:27
138 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
139 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
140 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
141 but glibc-BIG5-2.1.2.ucm does.
142 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
143 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
144 Maps regenerated again but this time based upon
145 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
146 (But where is THE DOCUMENT by MICROSOFT?)
147 ! t/enc_module.t AUTHORS
148 failure with threaded Perl on FreeBSD addressed.
149 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
150 Message-Id: <20030322230131.GA813@ratsnest.hole>
151 ! lib/Encode/Guess.pm
154 Change 18989: Make the :bytes conditional on PerlIO.
155 further Modified by Dan Kogai
156 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
158 Chnage 18966: another fix for failing test on windows ("use encoding"
159 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
160 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
162 Change 18970: Hopefully this works also in Win32, by jhi
163 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
164 Change 18965: fix CJKT.t failures on windows due to incorrect
166 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
168 1.91 2003/03/09 20:07:37
170 even more proofread by jhi.
171 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
174 +use lib qw(t ext/Encode/t ../ext/Encode/t);
175 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
177 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
178 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
181 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
183 1.90 2003/03/09 17:32:43
186 Inaba-san has added a patch for perl 5.8.1 or later that makes
187 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
188 that. POD is further revised.
189 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
190 ! encoding.pm t/enc_module.t
191 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
192 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
196 signed vs. unsigned issue discovered by Craig on OpenVM
197 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
198 ! encoding.pm AUTHORS
199 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
200 Because binmode() stacks layers instead of overwrite, you have to
201 ":raw :encoding()" in encoding.pm or your are in trouble when you
202 call encoding.pm multiple times. There are several workarounds
203 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
204 who was the first to address this problem was added to AUTHORS.
205 The test suites was added for this, which is a modified version
206 of SUGAWARA-san's scripts
207 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
209 1.88 2003/02/20 14:42:34
211 one signedness nit for Encode by jhi
212 <200302161933.h1GJX876018710@kosh.hut.fi>
214 VISCII map was incorrect; fixed by Sadahiro-san
215 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
216 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
217 You can't unlink files that are opened in cygwin but the last
218 file handle opened in t/enc_*.t left open. Patch submitted
219 by Yitzchak and he was added to AUTHORS.
220 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
222 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
223 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
225 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
226 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
228 1.87 2003/02/06 01:52:11
230 * Inaba "Sensei" Hirohito added (I thought I have done so a long
231 ago but apparently I did not).
232 * SUZUKI Norio added for verious and useful bug reports.
233 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
234 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
235 podchecked so all warnings are gone except for L<http://>.
236 ! encoding.pm t/enc_eucjp.t
237 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
238 * Filter option overhaul
240 ! Encode.pm Encode.xs encengine.c Encode/encode.h
241 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
242 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
243 without filter. podchecked by Dan Kogai.
244 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
245 ! lib/Encode/Alias.pm
246 decode('alias', $1) went wild because of local $_ in find_alias()
247 the evil local $_ is eradicated but that changes find_alias()
248 format for coderef aliasing. See Encode::Alias for details
249 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
251 1.86 2003/01/22 03:29:07
253 * Don't forget to canonize when you attempt an exact match!
254 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
255 * ${^ENCODING} exception is off for $] > 5.008
256 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
258 $] check commented out so it runs on 5.8.0
260 1.85 2003/01/21 22:19:14
262 ${^ENCODING} exception is now explicit rather than handled by regex.
263 + t/enc_eucjp.t t/enc_utf8.t
264 Test suite for the better "encoding" pragma support for bleedperl.
265 On 5.8.0, they will just be skipped.
267 1.84 2003/01/10 12:00:16
269 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
270 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
271 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
273 %_ fixes saves the resulting .so .05% smaller, by NC
274 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
276 Silence Encode on undef, by Andreas
277 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
278 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
280 s/regognised/recognised/ . British spelling left intact to pay
281 respect to two British Nicks :)
282 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
284 1.83 2002/11/18 17:28:49
285 ! Encode.xs lib/Encode/JIS7.pm
286 Even more patches from Inaba-san has been applied. With this
287 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
288 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
290 1.82 2002/11/14 23:06:12
292 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
293 later found in perls w/ -Dusemymalloc was fixed by NC.
294 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
296 1.81 2002/11/08 18:29:27
297 ! Encode.pm Encode.xs
298 Non-XS version of Encode::utf8 is back (with XS being default).
299 Encode::predefine_encodings(0) to turn off XS.
300 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
301 found that it was Test::More causing the problem, not Encode.
302 But I have already made it configurable so it may be useful in
304 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
306 The ingenious patch by Nicholas Clark that reduces shlib sizes by
307 50% with no penalty and backward compatibility preserved, is in.
308 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
310 1.80 2002/10/21 20:39:09
311 ! Encode.xs t/mime-header.t
312 Even more patches from NI-XS regarding Encode::utf8->decode().
313 And one more test to t/mime-header.t to prove it
314 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
316 1.79 2002/10/21 06:05:37
318 Further patches from NI-XS. Encode::utf8->decode() now checks the
319 value of utf8 flag of the argument. As a result, the fix to
320 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
321 no harm (even speedwise) I'll leave it unreverted.
322 ! ucm/cp949.ucm ucm/cp950.ucm
324 U+00AE REGISTERED SIGN
325 were missing as a result of 1.78. Discovered by Moriyama-san.
326 Moriyama-san has also developed a test script that compares
327 (en|de)coded results to the corresponding Win32 API result and
328 all cp9?? maps are now verified.
329 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
331 1.78 2002/10/20 15:44:00
332 ! lib/Encode/MIME/Header.pm
333 fixed so that it works with new Encode::utf8
334 ! Encode.pm Encode.xs
335 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
336 :encoding(UTF-8) to handle partial chars at end of buffers
338 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
339 ! lib/Encode/Supported.pod
340 More nitpickings applied.
343 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
344 tests were completely useless. To redeem that and get the peace of
345 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
346 ucm/*.ucm Since this script takes too long to finish (30 seconds on
347 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
348 but you can easily run that by either renaming it or:
350 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
351 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
352 Other CJKT cp9?? also updated according to the URI below;
353 http://www.microsoft.com/typography/unicode/cscp.htm
354 + bin/ucmsort MANIFEST
355 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
356 to proper order. intended for hardcore develpers only.
357 ! ucm/cp932.ucm JP/JP.pm AUTHORS
358 CP932 mapping which was based upon the mapping file at unicode.org
359 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
360 has also supplied the patch so he was added to AUTHORS.
361 ! lib/Encode/Supported.pod
362 ISO-8859-11 != TIS 620
363 == TIS 620 + \xA0 ( )
365 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
367 1.77 2002/10/06 03:27:02
369 * Modified to accomodate up and comming patch by Inaba-san that
370 will fix tr/// needing eval qq{}
371 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
373 * pod fixes/enhancements to reflect the changes above
374 ! lib/Encode/Alias.pm
375 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
376 Message-Id: <20021001015648.GB18710@not.autrijus.org>
378 1.76 2002/08/25 15:09:51
380 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
381 but naturally identical to previous version -- dankogai
383 Codepoint fixes -- autrijus
384 Message-Id: <20020805040236.GC5220@not.autrijus.org>
386 copied everything under perl-5.8.0/ext/Encode to make sure Encode
387 is in sync w/ perl core
389 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
390 Now that binmode(FH) does implicit ":bytes" revisit
391 the failing tests. The worrisome one is the Digest::MD5
392 test-- how will it fare in CRLF lands now?
394 From: Radu Greab <radu@netsoft.ro>
395 Date: Mon, 10 Jun 2002 00:40:34 +0300
396 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
397 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
398 ones which I cannot figure out.
399 ! lib/Encode/Alias.pm
400 Subject: [Encode PATCH] spurious warning
401 From: Nicholas Clark <nick@unfortu.net>
402 Date: Sun, 2 Jun 2002 20:26:22 +0100
403 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
405 1.75 2002/06/01 18:07:49
406 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
407 glibc compliance cited by Autrijus.
408 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
409 ! bin/enc2xs bin/piconv
410 Subject: Re: forewarning: usedevel and versiononly
411 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
413 1.74 2002/05/28 18:33:15
414 + ucm/null.ucm ucm/ctrl.ucm
415 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
416 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
417 Subject: Re: unicode -> &# notation
418 Message-ID: <20020518193704.GB40272@not.autrijus.org>
420 1.73 2002/05/28 17:26:18
421 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
422 Chris Nandor has fixed Encode so that it works w/ MacPerl --
423 at least w/ PPC (68k need static linking which does not work due to
424 64k limit). pudge is added to AUTHORS (I'm surprised he was not
425 there in the list). Encode/Makefile_PL.e2x was additionally fixed
426 by dankogai to reflect changes in other Makefile.PL
427 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
429 Subject: Change 16746: -Mutf8 cleanup.
430 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
432 1.72 2002/05/20 15:49:56
434 Subject: [PATCH] Encode should be in perl-core library path
435 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
436 Message-Id: <20020520161201.A11019@alpha.hut.fi>
437 ! lib/Encode/MIME/Header.pm
438 Subject: [PATCH] Encode::MIME::Header
439 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
440 ! Encode/Makefile_PL.e2x
441 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
442 Message-Id: <20020519201031.GA1603@not.autrijus.org>
443 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
444 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
446 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
447 Subject: [PATCH] good day for WinCE port of perl.
448 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
451 Even more linting by Robin via @16532
453 Even more typecast by Sarathy in @16460
455 1.71 2002/05/07 16:22:42
457 even more typecasts by Robin
458 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
460 A very strange bug that was causing a bugus ucm -> C table
461 generation that was revealed by a UCM file that Andreas was
462 working. This is the king of wierdest bug I've encountered
463 in the course of Encode maintenance.
464 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
466 1.70 2002/05/06 10:26:48
468 Made more 'module-safe' with conjunction w/ 'no encoding'.
469 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
470 ! lib/Encode/Encoding.pm
471 'require Encode' because ->Define uses Encode::define_encoding();
472 problem and solution addressed by Miyagawa-kun
473 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
475 Cuts the frill to make djgpp happier, as suggested by Laszlo
476 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
478 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
479 Message-Id: <m3bsbug48n.fsf@anima.de>
481 1.69 2002/05/04 16:41:18
482 ! lib/Encode/MIME/Header
483 Floating-point coerced for UNICOS (in integer arithmetics it folds
484 line one character too early). Verification by Mark is pending.
485 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
487 more doc patch from Elizabeth
488 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
489 ! Encode/Makefile_PL.e2x
490 More platform-independent patch from Benjamin
491 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
492 ! lib/Encode/Guess AUTHORS
493 split regex fix by Graham Barr. Adds him to AUTHORS.
494 Message-Id: <20020504085419.E95940@valueclick.com>
495 ! Encode/Makefile_PL.e2x
496 enc2xs script discovery made smarter and more sensible, first cited
497 by Miyagawa-kun and further suggestions by Rafael and Andreas
498 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
499 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
501 1.68 2002/05/03 12:20:13
502 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
503 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
504 and Supported.pod modified to reflect the change. Elizabeth added
505 to Authors. And H.M. is also added for forwarding her patch among
506 other contributions (I was rather surprised to find his name was not
508 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
510 1.67 2002/05/02 07:33:09
512 Error message now consistent w/ perlqq (\N{U+} -> \x{})
513 done in perl@16308 but Philip linted me further. Now the error
514 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
515 ! lib/Encode/Guess.pm
516 Sanity check for happier -w by Autrijus
518 1.66 2002/05/01 05:41:06
519 ! Encode.xs t/fallback.t
520 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
521 while fallback is in effect. This even came with a welcome side-effect
522 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
523 also modified to test this.
524 And of course, the corresponding varialbles to UV[Xx]f are appropriately
525 cast. This should've concluded NI-XS homework.
527 encode(undef) does warn again! Repented upon suggestion by NI-XS.
528 Document for unless vs. '' added
529 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
531 1.65 2002/04/30 16:13:37
533 encode(undef) no longer warns for C<Use of uninitialized value in
534 subroutine entry>. Suggested by Paul.
535 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
536 ! lib/Encode/Supported.pod
537 Encode::MIME::Header and Encode::Guess mentioned
538 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
539 ! lib/Encode/Guess.pm
540 POD fix by Miyagawa-kun
541 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
543 1.64 2002/04/29 06:54:06
545 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
546 encocde("euc-jisx0213") are still DIFFERENT.
547 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
549 A few white spaces corrected by NI-XS via PerlIO integration to
551 Subject: Change 16247: Integrate perlio;
553 Document fixes by Andreas
554 Message-Id: <m3k7qsf1we.fsf@anima.de>
556 1.63 2002/04/27 18:59:50
557 ! lib/Encode/Encoding.pm
558 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
559 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
560 Make use of the Encode::Encoding base class!
561 And other cleanups in Encode.xs upon NI-XS suggestions
562 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
564 1.62 2002/04/27 11:17:39
566 encodings() now just check %ExtModule instead of eval{require}
567 all of them for ":all" to conserve more memory.
569 more "%x" -> "%" UVxf stuff.
571 s/=over2/=over 2/g # oops.
573 1.61 2002/04/26 03:02:04
575 Now does decent tests besides use_ok()
576 ! lib/Encode/Guess.pm t/guess.t
577 UI streamlined, document added
579 various signed/unsigned mismatch nits (#16173)
580 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
582 POD: utf8-flag-related caveats added. A few sections completely
586 Thou shalt not assume %d works, either!
587 Robin Baker added to AUTHORS for this
588 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
590 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
592 1.60 2002/04/24 20:06:52
594 "Thou shalt not assume %x works." -- jhi
595 Message-Id: <20020424210618.E24347@alpha.hut.fi>
596 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
597 low-memory build machines happy, now *.c is created for each *.ucm
598 (no table aggregation). You can still override this by setting
599 $ENV{AGGREGATE_TABLES}.
600 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
601 + lib/Encode/Guess.pm
602 + lib/Encode/JP/JIS7.pm
603 Encoding-autodetect (mainly for Japanese encoding) added. In a
604 course of development, JIS7.pm was improved.
605 + lib/Encode/HTML/Header.pm
606 + lib/Encode/Config.pm
607 MIME B/Q Header Encoding Added!
608 ! Encode.pm Encode.xs t/fallback.t
609 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
610 Message-Id: <20020424130709.GA14211@tanglefoot>
612 1.59 $ 2002/04/22 23:54:22
613 ! Encode.pm Encode.xs
614 needs_lines() and perlio_ok() are added to Internal encodings such
615 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
617 1.58 2002/04/22 23:54:22
619 s/MacChineseSimp/MacChineseTrad/ # ... oops.
624 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
625 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
626 t/(JP|KR).t are aggregated to t/CJKT.t
627 test data are all remade via bin/ucm2text.
628 And .... They are no longer skipped for -Uuseperlio !
630 1.57 2002/04/22 20:27:30
631 ! t/JP.t t/KR.t t/perlio.t
632 unless (find PerlIO::Layer 'perlio') ... line is back again.
633 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
634 need PerlIO. Sigh....
635 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
636 ->perlio_ok now does eval{ require PerlIO::encoding } there so
637 it correctly returns 1 when PerlIO::encoding is yet loaded.
639 perl-current patch #16072 reflected
641 1.56 2002/04/22 09:48:07
642 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
643 New PerlIO::encoding 0.04 compliance met
645 1.55 2002/04/22 03:43:05
646 ! Encode.pm Encode.xs Unicode/Unicode.pm
647 needs_lines() defined so Encode::Encoding is no longer needed
650 1.54 2002/04/22 02:50:01
651 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
652 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
653 now perlio_ok is true by default if PerlIO::encoding->VERSION is
654 0.03 or larger. POD in Encode::Encoding revised to reflect this.
655 Encode::XS and Encode::Unicode now has perlio_ok() method.
656 ! lib/Encode/Supported.pod
657 s/UP-UX/HP-UX/ by jhi
658 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
659 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
660 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
661 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
662 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
663 ! lib/Encode/Supported.pod
664 Huge document fixes by Philip.
667 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
668 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
670 binmode() after "<:encoding" to make Win32 happy, by Mattia.
671 Mattia added to AUTHORS file
672 Message-Id: <3CC3150F.5798.22A05AE@localhost>
674 1.52 2002/04/20 23:43:47
676 TODO: is now SKIP:, as NI-XS requested. Also adds more
677 eraborate failure analysis added.
679 A note on how to make sure of round-trip safety added to POD
680 section (so Autrijus is happier)
681 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
682 big5-(eten|hkscs) is round-trip safe again!
683 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
685 Typo fixes by Andreas
686 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
687 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
688 PerIO coodination patches from NI-XS.
689 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
691 1.51 2002/04/20 09:58:23
693 Updated test suite by Autrijis so "make test" is happy again
694 Message-Id: <20020420082104.GA25037@not.autrijus.org>
696 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
698 TW/TW.pm TW/Makefile.PL
699 Updates by Autrijus. 'big5' is no longer a canonical but an
700 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
701 Message-Id: <20020419195346.GA19597@not.autrijus.org>
703 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
704 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
706 PerlIO detection a little bit smarter; no longer uses eval qq{}
709 1.50 2002/04/19 06:13:02
710 ! ! Encode.pm Encode.xs Encode/encoding.h
712 New Fallback API imlemented and documented. See "perldoc Encode"
714 ! lib/Encode/JP/JIS7.pm Encode.pm
715 + lib/Encode/PerlIO.pod t/perlio.t
716 API compliance met. However, it still does not work unless perlio
717 implements line buffer. See BUGS section in perldoc Encode::PerlIO
718 As a sensible workaround, perlio_ok() added to Encode.
720 ! lib/Encode/Supported.pod
722 Message-Id: <20020418174647.J8466@alpha.hut.fi>
724 Doc fixes from Autrijus
725 Message-Id: <20020418144131.GA10987@not.autrijus.org>
727 perlqq mode documented
729 + t/jisx0201.euc t/jisx0201.ref
730 ! t/jisx0208.euc t/jisx0208.ref
731 t/JP.t tests more rigorously and with other encodings
732 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
733 in jis0208 (used to contain jisx0201 part).
734 ! Encode/Makefile_PL.e2x
735 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
736 enc2xs and encode.h rather than hard-coded. This allows the resulting
737 module fully CPANizable.
738 ! encoding.pm t/JP.t t/KR.t
739 PerlIO detection simplified (checks %INC instead of eval{})
740 ! Encode.xs Encode/encode.h
741 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
742 - lib/Encode/Unicode.pm
743 (en|de)code_xs relocated to where it belongs. Source reindented
746 Additional (U8 *) cast added as suggested by jhi
747 Message-Id: <20020417165916.A28599@alpha.hut.fi>
749 1.42 Date: 2002/04/17
751 no-op module; Thought of adding a pod there but enc2xs has
756 correct mechanism to detect Perlio::encoding layar installed.
758 PerlIO Layer detached.
760 1.41 2002/04/16 23:35:00
762 binmode(STDIN|STDOUT ...) done iff PerlIO is available
764 Cleaned up PerlIO skip conditions to prepare for the upcoming
765 Encode - PerlIO forking.
767 exported functions are now prototyped.
768 ! lib/Encode/CN/HZ.pm
771 fallback implemented # was /* FIXME */
772 affected programs revised to fit (only HZ was using the try-catch
773 approach which needed to be fixed for API-compliance).
775 ! Encode/KR/2022_KR.pm
777 can find =head1 NAME now, jhi
778 Message-Id: <20020416083059.V30639@alpha.hut.fi>
782 now complies with less warnings with the pickest compilers.
783 Suggested by Craig, fixed by Dan.
784 ! Encode/Makefile_PL.e2x
786 A bug that fails to find *.e2x in certain conditions fixed
788 1.40 2002/04/14 22:27:14
789 + Encode/ConfigLocal_PM.e2x
790 ! lib/Encode/Config.pm
792 "enc2xs -C" now generates/updates Encode::ConfigLocal.
793 ConfigLocal_PM.e2x is a skelton thereof.
794 ! lib/Encode/Config.pm
796 "use Encode::CN::HZ;" was missing.
799 More rigorous tests added to test XS, especially on memory allocation.
801 ! lib/Encode/Unicode.pm
802 NI-S implemented an XS version -- merged
803 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
806 Source filter option added. With this option on, you can write
807 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
808 encodings. t/jperl.t enhanced to test this feature.
810 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
811 apply his suggestion, this degree of nitting is enough to add him
813 Message-Id: <3CB93223.291E5E2E@earthlink.net>
815 + lib/Encode/JP/JIS7.pm
816 - lib/Encode/JP/JIS.pm
817 - lib/Encode/JP/2022_JP.pm
818 - lib/Encode/JP/2022_JP1.pm
819 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
820 JIS7.pm for better maintainability and performance
822 Added caveat for non-ascii identifiers.
824 fixes by jhi, the original author of this pragramtic module.
825 Message-Id: <20020413231527.V1826@alpha.hut.fi>
827 1.34 2002/04/12 20:23:05 (Unreleased)
830 EBCDIC fixes addressed by jhi.
831 Message-Id: <20020412161844.D9383@alpha.hut.fi>
832 ! lib/Encode/Encoder.pm
833 POD fix by Miyagawa-kun
834 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
836 1.33 2002/04/10 22:28:40
838 Philip's mail address corrected.
841 ! lib/Encode/Encoder.pm
842 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
843 This adds Miyagawa-kun to AUTHORS.
844 * encoding() no longer exported by default but on demand
845 * t/Encoder.t updated to test all these
846 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
847 ! lib/Encode/Unicode.pm
848 ! lib/Encode/Supported.pm
849 Further doc fixes by Anton
851 1.32 2002/04/09 20:06:15
854 - ucm/macDevanaga.ucm Unicode Character Map
855 - ucm/macGujarati.ucm Unicode Character Map
856 - ucm/macGurmukhi.ucm Unicode Character Map
857 A utility to check integrity of .ucm files. t/bogus.ucm is a
858 ucm that is deliberately bogus. unused Indic mappings are removed
861 resolve_alias() added as suggested by jhi. Same as
862 find_encoding("alias")->name. For convenience. This one is
863 defined in Encode.pm instead of Alias.pm.
864 Message-Id: <20020409215846.H17022@alpha.hut.fi>
866 Memory Allocate but detected during the devel of ucmlint -- fixed.
867 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
868 ! lib/Encode/Unicode.pm
869 valid_ucs2(0) is false but must be true.
870 3 patches from NI-S as follows. This also has fixed the incident
872 ! lib/Encode/Alias.pm
873 find_alias() recursion prevention
875 Checks for the patch above
876 ! t/Encode/Unicode.pm
877 An extra "F" that causes valid_ucs2() return a bogus value fixed
878 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
879 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
880 2 Small Patches from jhi as follows:
882 Encode->encodings() lists in case-insensitve order (as it was)
884 -l option prints avaiable encodings to STDOUT instead of STDERR
885 ! lib/Encode/Aliases.pm
886 s/defintion/definition/
887 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
889 ! lib/Encode/Supported.pod
890 ! lib/Encode/Unicode.pm
891 POD revise by Philip Newton. This adds Philip to AUTHORS list.
892 Thank you for the exact quote of Douglas Adams :)
893 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
895 1.31 2002/04/08 18:08:07
896 ! lib/Encode/Encoder.pm
898 Encode::Encoder, once just a placeholder of an idea, is now much more
899 practical. See t/Encode.t to find how practical it can be.
900 + lib/Encode/Config.pm
902 my false laziness at Encode.pm is fixed. Now %ExtModules are set
903 in Encode::Config and they are all literally, not programatically
904 set. My false laziness was resulting many encodings missing from
906 ! lib/Encode/Unicode.pm
908 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
909 so that it does not rely Encode::Unicode for BOM values
910 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
912 1.30 2002/04/08 02:34:51
913 + lib/Encode/Encoder.pm
914 Object Oriented Encoder. I reckon something like this is in need.
917 ! lib/Encode/Supported.pod
918 * autoloading bug that prevented upper-case canonicals such as UTF-16
919 is fixed. Now even UTF/UCS are autoloaded!
920 * encodings() is now more intuitive.
921 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
923 * Obligatory fixes to the POD.
924 ! lib/Encode/Supported.pod
925 Patch from Anton applied.
926 Message-Id: <66641479.20020408033300@motor.ru>
928 ! lib/Encode/Unicode.pm
929 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
931 1.28 2002/04/07 18:58:42
935 Just a MANIFEST for those missing files.
937 1.26 Date: 2002/04/07 15:22:04
940 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
941 And t/Aliases.t fixed to make sure they all exist.
943 1.25 2002/04/07 15:01:25 (Unreleased)
945 ! lib/Encode/Unicode.pm
948 - lib/Encode/UTF_EBCDIC.pm
949 - lib/Encode/Internal.pm
951 Integrated into Encode.pm as closures. That way "one package, one file"
952 rule is preserved yet less files to require.
954 commented out binmode(STDERR ...
963 ! Encode/Makefile_PL.e2x
964 Schwarn's MM-compliance patch merged
965 Message-Id: <20020406082609.GA28758@blackrider>
967 ! lib/Encode/Unicode.pm
968 + lib/Encode/UTF_EBCDIC.pm
970 - lib/Encode/10646_1.pm
971 - lib/Encode/ucs2_le.pm
972 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
973 per-module based (en|de)code, I saved a number of .pm by
974 reorganizing it as per-object base (Well, this is what Encode::XS
975 does under the hood). See Encode::Unicode for details.
976 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
977 This module is used only on EBCDIC environments.
979 1.21 2002/04/05 14:46:34 (Not Released)
985 Are back to make Perl/Tk happy Smile, NI-S.
987 ! lib/Encode/Alias.pm
988 ! lib/Encode/Supported.pm
989 ! lib/Encode/10646_1.pm
990 ! lib/Encode/ucs2_le.pm
991 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
992 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
993 updated to reflect changes.
994 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
995 ! lib/Encode/Alias.pm
996 ! lib/Encode/Supported.pm
997 Anton's revision commited. Added Dan's own fixes as well.
998 Message-Id: <159103166906.20020405161134@motor.ru>
999 ! lib/Encode/Alias.pm
1001 < qr/^UCS2-le$/i => '"UCS-2"', );
1003 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1004 Sigh. Thank you, Anton.
1005 Message-Id: <14567692196.20020405062020@motor.ru>
1006 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1008 1.20 2002/04/04 19:50:52
1010 the last minute addtion. Just give it a try. Docs remains to be done.
1011 Not installed by default.
1012 ! lib/Encode/Supported.pod
1015 ! lib/Encode/Alias.pm
1016 ! lib/Encode/utf8.pm
1017 ! lib/Encode/10464_1.pm
1018 ! lib/Encode/ucs2_le.pm
1019 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1020 unchanged but UTF-16BE is added as an alias. Implicit aliases
1021 move to Encode::Alias so init_alias() works more as expected.
1022 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1023 Though pedantically wrong, This should make perl mongers happier.
1024 t/Alias.t is enhanced to test all these.
1025 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1027 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1028 of the codepoints are common. CJKT left as is because the saving is
1032 ! EBCDIC/Makefile.PL
1034 ! Encode/Makefile_PL.e2x
1038 ! Symbol/Makefile.PL
1042 All occurance of _def.h replaced with .exh so djgpp works happily
1043 ever after! To credit this amazing discovery, Laszlo is now in
1045 Message-Id: <20020403181424.GA8778@freemail.hu>
1046 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1049 ! Encode/Makefile_PL.skel
1051 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1052 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1054 Two more tests by added jhi
1055 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1058 The showstopper fixed -- Memory reallocation bug was causing
1059 Encode::XS to fall into infinite loop on certain conditions.
1060 t/grow.t tests that.
1061 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1066 ! lib/Encode/Supported.pod
1067 Vendor encodings rebuilt out of original map files at unicode.org.
1068 Indic languages such as MacDevanagali remain unspported do to the
1069 shortcoming of encengine capabilities (they need algorithmical
1070 conversion and I have no knowledge on that!). Pods fixed for added
1072 Oh, macJapan.ucm renamed to macJapanese.ucm.
1073 macROMnn is macRomanian and macRUMnn is macRumanian.
1074 txt2ucm is a crude script that is used to convert them.
1076 Unicode Compound Characters (used extensively on Mac) supported
1078 Typo fixes and improvements by jhi
1079 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1081 1.11 2002/03/31 22:12:13
1085 Missing files from the MANIFEST fixed.
1086 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1087 Version incremented just to make CPAN happy.
1089 1.10 2002/03/31 21:32:42
1092 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1093 if you want. This should make Autrijus happy. Also, piconv
1094 is added to default install.
1097 Here-documented files that enc2xs generates are now exported
1098 to *.e2x. Much cleaner and easier to debug.
1100 encoding enhances so you can make it act more like such
1101 (now prehistoric ) "localized" variations of perl like Jperl.
1103 Further test for encoding.pm. Written in euc-jp
1106 Taken over form jhi.
1107 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1112 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1119 ! Encode/macIceland.ucm
1120 ! lib/Encode/Alias.pm
1121 ! lib/Encode/Supported.pod
1122 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1123 Message-Id: <m3lmcavhjt.fsf@anima.de>
1125 1.01 2002/03/29 20:59:39
1128 s/USE_SCRIPTS/MORE_SCRIPTS/
1130 installs enc2xs by default for external Encode:: modules in CPAN,
1131 such as Encode::HanExtra
1133 More sensible perl core detection via $ENV{PERL_CORE}
1135 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1137 Perl core ditection via $^X =~ m/\bminiperl$/o
1138 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1140 1.00 Wed Mar 29 2002
1142 The version of all files is updated to 1.00 via "ci -f -l1.00",
1143 commemorating version 1.00. All files, including *.ucm are now
1144 under version control.
1147 encode.h moved to Encode/ so it will be installed for the later
1150 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1155 compile renamed to enc2xs.
1156 Affected Makefle.PL updated
1158 "Punt it. HanExtra can take care of that later." -- Autrijus
1159 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1162 ! Encode/ksc5601.ucm
1163 ! lib/Encode/CJKConstants.pm
1164 ! lib/Encode/KR/2022_KR.pm
1165 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1166 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1170 bin/ added for example scripts. They are not installed by default.
1171 to install them, "perl Makefile.PL USE_SCRIPTS".
1172 piconv is iconv reinvented in perl. in addition to all features
1173 of iconv, it also adds perlish features. See L<piconv/1> for more
1175 ! lib/Encode/Alias.pm
1176 qr/^ replaced with qr/\b so it directly matches locale names
1177 such as en_US.US-ASCII
1180 Patch by MJD to fix the following problem applied.
1181 Subject: [PATCH 5.7.3 Encode]
1182 Aliases.t not properly skipped when Encode extension not built
1183 Message-Id: <20020328091850.18677.qmail@plover.com>
1184 ! lib/Encode/KR/2022_KR.pm
1185 ! lib/Encode/CJKConstants.pm
1186 Another patch from Jungshik to make iso-2022-kr actually work
1187 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1188 ! Encode/Encode/euc-kr.ucm
1189 + Encode/Encode/johab.ucm
1190 ! Encode/Encode/ksc5601.ucm
1192 ! Encode/KR/Makefile.PL
1193 ! Encode/lib/Encode/Alias.pm
1195 Johab support and complete revision of Korean Encoding by Jungshik
1196 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1198 Revised to make up with now-dropped Encode::Details.
1199 - lib/Encode/Details.pod
1200 Dropped. Besides being obsolete, the topics are now covered in
1201 respective pods now.
1206 Korean aliases fixed thanks to Jungshik Shin
1207 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1208 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1210 =head1 NAME added to all modules to make buildtoc happy
1211 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1212 - lib/Encode/CJKguide.pod
1213 Too controversial and dropped from the dist. Will be available
1214 separately on the web.
1216 RCS tags added so table debugging gets easier (should that be
1217 needed! I hope they all stay 1.00!)
1218 + lib/Encode/CJKguide.pod
1219 A detailed guide to mainly, but not limited to, CJK multibyte
1222 + Encode/hp-roman8.ucm
1224 ! Encode/Supported.pod
1225 All occurance of "roman8" replaced with "hp-roman8" to avoid
1227 ! Encode/Supported.pod
1230 Mac Encodings now comply the Inside Macintosh
1232 Test for '-raw' conventions added.
1234 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1235 ! Encode/gb12345.ucm
1237 ! Encode/ksc5601.ucm
1238 "-raw" appended to canonical names.
1239 File mames stay unchanged thanks to UCM format.
1240 ! lib/Encode/CN/HZ.pm
1241 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1242 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1244 0.99 Tue Mar 26 2002
1245 - lib/Encode/JP/Const.pm
1246 + lib/Encode/CJKConstants.pm
1247 + lib/Encode/CN/2022_CN.pm
1248 + lib/Encode/KR/2022_KR.pm
1257 * Support for ISO-2022-KR and ISO-2022-CN added.
1259 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1260 * ucm2table autogenerates character table out of UCM files.
1263 - lib/Encode/Supports.pod
1264 + lib/Encode/Supported.pod
1265 Names reverted due to popular demand.
1266 8.3 rule applies only when there is a conflict.
1267 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1272 - lib/Encode/Format/Enc.pod
1274 * Character tables is now 100% ucm.
1275 * All files under Encode/ is now 8.3-compliant
1276 * some of missing encodings added (i.e. gsm0338 and nextstep)
1277 * Vendor mappings aggregated with appropriate national std in
1278 Makefile.PL, resulting smaller *.so especially for CJK.
1279 Following is result on Dan's FreeBSD box.
1281 ---------------------------------------------------------------
1282 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1283 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1284 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1285 blib/arch/auto/Encode/Encode.so 27,791 27,791
1286 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1287 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1288 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1289 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1290 ---------------------------------------------------------------
1291 Total 5,375,297 6,343,819
1293 * As a result of ucm-transition, Encode::Tcl dropped because
1294 Encode::Tcl demands *.enc.
1295 Encode::Tcl will be supplied in a separate tarball with *.enc.
1296 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1301 -lib/Encode/Supported.pod
1302 +lib/Encode/Supports.pod
1303 -lib/Encode/iso10646_1.pm
1304 +lib/Encode/10646_1.pm
1305 -lib/Encode/EncFormat.pod
1306 +lib/Encode/Format/Enc.pod
1307 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1308 - lib/Encode/JP/Constants.pm
1309 + lib/Encode/JP/Consts.pm
1310 ! lib/Encode/JP/JIS.pm
1311 ! lib/Encode/JP/H2Z.pm
1312 Version nit problem and 8.3 rule fix.
1313 > Package namespace installed latest in CPAN file
1314 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1315 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1316 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1317 accordingly. In addition, legacy "use vars qw()..." are replaced with
1319 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1320 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1322 - lib/Encode/JP/ISO_2022_JP.pm
1323 - lib/Encode/JP/ISO_2022_JP_1.pm
1324 + lib/Encode/JP/2022_JP.pm
1325 + lib/Encode/JP/2022_JP1.pm
1327 8.3 naming conflict for vanilla fat addressed by jhi
1328 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1331 Typecast fix addressed by jhi
1332 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1334 0.98 Mon Mar 25 2002
1335 ! lib/Encode/Supported.pod
1337 + lib/Encode/JP/ISO_2022_JP_1.pm
1338 ! lib/Encode/JP/ISO_2022_JP.pm
1339 ! lib/Encode/JP/JIS.pm
1341 Now Encode::JP is more strict on the difference between ISO-2022-JP
1342 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1343 makes Anton happier :) FYI the previous version implements
1344 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1345 ! lib/Encode/Supported.pod
1348 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1349 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1354 ! lib/Encode/Suppoted.pod
1355 pod fixes to replace F<http://...> to L<http://...>,
1356 as suggested by Autrijius in:
1357 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1358 ! lib/Encode/Suppoted.pod
1359 fixes and enhancements by Anton
1360 Message-Id: <10632060120.20020324103753@motor.ru>
1361 ! lib/Encode/Alias.pm
1362 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1363 added. Suggested by Anton then deobfuscated by Autrijius
1364 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1366 Further fix by Nicholas Clark
1367 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1368 - lib/EncodeFormat.pod
1369 + lib/Encode/EncFormat.pod
1371 File renamed as suggested by Autrijius
1373 ! lib/Encode/Details.pod
1374 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1375 ! Encode.pm Sun Mar 24 13:43:47 2002
1376 pod fixes by Autrijius.
1377 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1378 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1380 ! lib/Encode/Alias.pm
1382 now more EBCDIC conscious;
1383 %ExtModules on EBCDIC system excludes CJK so that you don't
1384 have to worry about the matched alias resulting cloaking.
1385 t/Alias.t also revised to reflect changes. Verified by jhi
1386 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1388 0.97 Sun Mar 24 2002
1392 EBCDIC detection mechanism installed as in JP/JP.pm
1393 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1396 ! EBCDIC/Makefile.PL
1399 ! Symbol/Makefile.PL
1401 Now all table files used by compile are postfixed '_t' to avoid
1402 namespace collisions in case insensitive file systems once for all!
1404 Message-ID: <58290227735.20020323195659@familiehaase.de>
1406 Since the Encode::JP is unsupported under EBCDIC we
1407 cannot run this test (aliases as such should work fine) -- jhi
1408 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1410 duplicate occurance of ascii.ucm and 8859-1.ucm
1411 causes MacOS X dlyd to cloak
1417 < chdir 't' if -d 't';
1419 > if (! -d 'blib' and -d 't'){ chdir 't' };
1420 When you are "make test"-ing on Encode/ directory, you must not
1421 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1422 remain unchanced. Also the situation detection was made simpler
1423 in t/JP.t, which was originally;
1424 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1426 "Use of uninitialized value in string eq at Encode.pm line 96."
1427 ! Symbol/Makefile.PL
1428 ! EBCDIC/Makefile.PL
1430 -- Problem on case insensitive file systems
1431 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1432 Message-ID: <88254111953.20020323095503@familiehaase.de>
1435 "So I think it's a bug in gcc, not perl. But it still needs to be
1437 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1438 Message-Id: <20020323170509.C96475@plum.flirble.org>
1440 0.96 Sat Mar 23 2002
1442 ! lib/Encode/Encoding.pm
1443 ! lib/Encode/Alias.pm
1444 ! lib/Encode/Supported.pod
1446 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1447 Message-ID: <20020322073908.GB10539@blackrider>
1450 "...I think we should include ISO 8859-1 as well." -- NI-S
1451 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1456 ! lib/Encode/Alias.pm
1457 alias definitions relocated to Encode::Alias so module autoloading
1458 works for aliases also.
1460 encodings() now accepts args to check ExtModules.
1464 + EBCDIC/Makefile.PL
1465 + Symbol/Makefile.PL
1469 Latin and single byte encodings are reorganized so they are
1470 demand-loaded like Encode::XX. Now only ascii is compiled into
1472 ! lib/Encode/Alias.pm
1473 for my $k (keys %hash){ delete $hash{$k}; }
1474 is depreciated; fixed.
1476 0.95 Fri Mar 22 2002
1477 In this update, pod rewrites and alias fixes are the main issues
1478 + lib/Encode/Supported.pod
1479 Describes supported encodings
1481 streamlined compiled-in encodings.
1482 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1484 + Encode/ibm-125?.ucm
1485 Added from icu distibution with any occurance of
1486 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1487 some respect to icu staff, however.
1488 + lib/Encode/Alias.pm
1490 Alias difinitions in Encode.pm relocated.
1493 packWARN patch from Paul Marquess via jhi
1494 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1495 Paul added to AUTHORS as a result.
1496 ! t/CJKalias.t -> t/Aliases.t
1497 Renamed. Checks even more aliases and alias overloading
1500 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1501 gbk => cp936 relocated to CN.pm
1503 Test::More with plans (by jhi)
1505 0.94 Thu Mar 21 2002
1506 + lib/Encode/Description.pod
1507 ! lib/Encode/Encoding.pm
1508 Now the pod in Encode.pm is abridged as programming references.
1509 lib/Encode/Description.pod contains the original, detailed description
1510 and Encode::Encoding explains how to write your own module to
1511 add new encodings. So far, lib/Encode/Description.pod contains
1512 the whole pod once in Encode.pm. This is intentional.
1514 Pod revisions by Anton Tagunov
1515 Message-Id: <517178431.20020320174824@motor.ru>
1517 all occrance of Encode::Tcl::Extended removed including pod
1519 test now checks $encoding->name only; $encoding->{name} are
1520 no longer check to find the canonical name.
1521 ! lib/Encode/JP/JIS.pm
1522 ! lib/Encode/JP/ISO_2022_JP.pm
1523 ->name() added to be more compliant with API
1529 Patch by Autrijus to add aliases to TW and fixes to POD
1530 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1532 SADAHIRO Tomoyuki added as should. My apologies.
1534 0.93 Wed Mar 20 2002
1535 * First release to be uploaded to CPAN. For prehistoric changes,
1536 please see Changes file of perl distibution as well as
1537 perl-unicode@perl.org archive, available at:
1538 http://archive.develooper.com/perl-unicode@perl.org/
1540 Changes Since 0.92 includes;
1545 + Mention to perl-unicode@perl.org added
1547 + Encoding aliases added so you can feed locale names
1548 and MIME Charset="" directly.
1549 - Mention to JISX0212 removed because it's fixed
1552 + Encoding aliases added. Note TW is left untouched because
1553 euc-tw is not implemented in TW but in Encode::HanExtra.
1554 Autrijus, you may fix Encode::HanExtra.
1556 + to test encode aliases added