1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.96 2003/06/18 09:29:02 dankogai Exp $
5 $Revision: 1.96 $ $Date: 2003/06/18 09:29:02 $
7 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
8 automatic utf(8|16|32) guessing -- originally by Autrijus
9 Message-Id: <20030626162731.GA2077@not.autrijus.org>
11 Addressed the following;
12 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
13 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
15 1.96 2003/06/18 09:29:02
16 ! lib/Encode/JP/JP.pm t/guess.t
17 m/(...)/ in void context then $1 is considered a Bad Thing
18 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
20 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
23 More typecast from maintperl@19739
24 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
26 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
27 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
29 Now skips for EBCDIC platform.
30 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
32 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
33 have more than one C<.> in their filenames." -- perlport.
34 Message-Id: <3ED79E01.8050401@mac.com>
36 Found and fixed the back that -p,--perlqq does not work.
37 Induced by the change from Getopt::Std to Getopt::Long.
39 Addressed [cpan #2629] Wrong assumption in numeric comparison
40 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
41 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
42 lib/Encode/Encoding.pm t/perlio.t
43 ! API Change: ->new_sequence() => ->renew()
44 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
45 + Encode::XS and Encode::utf8 now supports ->renew()
46 + Encode::Encoding now documents this with examples
47 - Non-XS (en|de)code stripped out of Encode::Unicode
48 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
50 1.95 2003/05/21 08:41:11
52 Since bogus entries were found in iso-8859-6, all entries are
53 re-generated once again out of
54 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
55 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
56 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
57 + lib/Encode/Unicode/UTF7.pm
58 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
59 lib/Encode/Supported.pod
60 UTF-7 support is now added. With this Encode now has all transcoding
61 methods in Unicode::String.
63 1.94 2003/05/10 18:13:59
64 ! lib/Encode/MIME/Header.pm
65 A more sophisticated solution for double-encoding by dankogai
66 ! lib/Encode/MIME/Header.pm AUTHORS
67 Two bugs fixed by Bjoern Jacke
68 * "Double Encoding" was not possible
69 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
70 * encode("MIME-Q") had UTF-8 flag on
71 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
72 ! lib/Encode/MIME/Header.pm AUTHORS
73 Two occurances of "croak ()" fixed as "croak qq()".
74 Simon Cozens is added to AUTHORS as a result.
75 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
77 POD fixes that reflect enhancements by jhi
79 Two enhancements by jhi.
80 + Now uses Getopt::Long so it accepts long name options
81 (--from for -f, for example)
82 + New option: -r,--resolve
83 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
85 META.yml added upon request of Schwern
86 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
88 Enache Adrian removed upon request -- to live longer than Encode
89 and/or FreeBSD (toy-)?thread :)
90 Message-Id: <20030425015701.GA2069@ratsnest.hole>
92 "close STDOUT unless $^O eq 'freebsd';" once again relocated
93 to keep VMS happy in which case "$^O eq 'freebsd'" is required
94 to keep FreeBSD+thread happy. Sigh.
95 Message-Id: <3EA88ADC.3000300@mac.com>
97 1.93 2003/04/24 17:43:16
99 added "no warnings 'pack'" in for loop to keep bleedperl from
100 complaining "Character in 'C' format wrapped in pack".
102 More elegant perl core detection inspired by Ilya Zakharevich
103 (but further elaborated for general cases).
104 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
107 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
108 are less likely to be confused w/ ISO-8859-*
110 RT tests added (vendor encodings are exemplified)
111 -- that successfully found a flaw on iso-2022-kr before the patch.
112 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
113 decode("ISO-2022-KR") has been buggy but no one ever sited
114 that since no one seems to be using it. Bugs discovered by
116 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
117 ! lib/Encode/CN/HZ.pm t/perlio.t
118 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
119 so it adds test for HZ.
120 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
121 ! lib/Encode/Guess.pm
122 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
123 So long as the string contains \x{00}-\x{ff} it does not fail.
124 See perldoc for details.
125 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
127 1.92 2003/03/31 03:27:27
128 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
129 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
130 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
131 but glibc-BIG5-2.1.2.ucm does.
132 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
133 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
134 Maps regenerated again but this time based upon
135 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
136 (But where is THE DOCUMENT by MICROSOFT?)
137 ! t/enc_module.t AUTHORS
138 failure with threaded Perl on FreeBSD addressed.
139 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
140 Message-Id: <20030322230131.GA813@ratsnest.hole>
141 ! lib/Encode/Guess.pm
144 Change 18989: Make the :bytes conditional on PerlIO.
145 further Modified by Dan Kogai
146 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
148 Chnage 18966: another fix for failing test on windows ("use encoding"
149 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
150 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
152 Change 18970: Hopefully this works also in Win32, by jhi
153 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
154 Change 18965: fix CJKT.t failures on windows due to incorrect
156 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
158 1.91 2003/03/09 20:07:37
160 even more proofread by jhi.
161 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
164 +use lib qw(t ext/Encode/t ../ext/Encode/t);
165 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
167 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
168 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
171 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
173 1.90 2003/03/09 17:32:43
176 Inaba-san has added a patch for perl 5.8.1 or later that makes
177 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
178 that. POD is further revised.
179 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
180 ! encoding.pm t/enc_module.t
181 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
182 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
186 signed vs. unsigned issue discovered by Craig on OpenVM
187 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
188 ! encoding.pm AUTHORS
189 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
190 Because binmode() stacks layers instead of overwrite, you have to
191 ":raw :encoding()" in encoding.pm or your are in trouble when you
192 call encoding.pm multiple times. There are several workarounds
193 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
194 who was the first to address this problem was added to AUTHORS.
195 The test suites was added for this, which is a modified version
196 of SUGAWARA-san's scripts
197 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
199 1.88 2003/02/20 14:42:34
201 one signedness nit for Encode by jhi
202 <200302161933.h1GJX876018710@kosh.hut.fi>
204 VISCII map was incorrect; fixed by Sadahiro-san
205 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
206 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
207 You can't unlink files that are opened in cygwin but the last
208 file handle opened in t/enc_*.t left open. Patch submitted
209 by Yitzchak and he was added to AUTHORS.
210 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
212 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
213 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
215 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
216 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
218 1.87 2003/02/06 01:52:11
220 * Inaba "Sensei" Hirohito added (I thought I have done so a long
221 ago but apparently I did not).
222 * SUZUKI Norio added for verious and useful bug reports.
223 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
224 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
225 podchecked so all warnings are gone except for L<http://>.
226 ! encoding.pm t/enc_eucjp.t
227 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
228 * Filter option overhaul
230 ! Encode.pm Encode.xs encengine.c Encode/encode.h
231 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
232 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
233 without filter. podchecked by Dan Kogai.
234 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
235 ! lib/Encode/Alias.pm
236 decode('alias', $1) went wild because of local $_ in find_alias()
237 the evil local $_ is eradicated but that changes find_alias()
238 format for coderef aliasing. See Encode::Alias for details
239 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
241 1.86 2003/01/22 03:29:07
243 * Don't forget to canonize when you attempt an exact match!
244 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
245 * ${^ENCODING} exception is off for $] > 5.008
246 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
248 $] check commented out so it runs on 5.8.0
250 1.85 2003/01/21 22:19:14
252 ${^ENCODING} exception is now explicit rather than handled by regex.
253 + t/enc_eucjp.t t/enc_utf8.t
254 Test suite for the better "encoding" pragma support for bleedperl.
255 On 5.8.0, they will just be skipped.
257 1.84 2003/01/10 12:00:16
259 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
260 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
261 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
263 %_ fixes saves the resulting .so .05% smaller, by NC
264 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
266 Silence Encode on undef, by Andreas
267 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
268 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
270 s/regognised/recognised/ . British spelling left intact to pay
271 respect to two British Nicks :)
272 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
274 1.83 2002/11/18 17:28:49
275 ! Encode.xs lib/Encode/JIS7.pm
276 Even more patches from Inaba-san has been applied. With this
277 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
278 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
280 1.82 2002/11/14 23:06:12
282 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
283 later found in perls w/ -Dusemymalloc was fixed by NC.
284 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
286 1.81 2002/11/08 18:29:27
287 ! Encode.pm Encode.xs
288 Non-XS version of Encode::utf8 is back (with XS being default).
289 Encode::predefine_encodings(0) to turn off XS.
290 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
291 found that it was Test::More causing the problem, not Encode.
292 But I have already made it configurable so it may be useful in
294 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
296 The ingenious patch by Nicholas Clark that reduces shlib sizes by
297 50% with no penalty and backward compatibility preserved, is in.
298 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
300 1.80 2002/10/21 20:39:09
301 ! Encode.xs t/mime-header.t
302 Even more patches from NI-XS regarding Encode::utf8->decode().
303 And one more test to t/mime-header.t to prove it
304 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
306 1.79 2002/10/21 06:05:37
308 Further patches from NI-XS. Encode::utf8->decode() now checks the
309 value of utf8 flag of the argument. As a result, the fix to
310 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
311 no harm (even speedwise) I'll leave it unreverted.
312 ! ucm/cp949.ucm ucm/cp950.ucm
314 U+00AE REGISTERED SIGN
315 were missing as a result of 1.78. Discovered by Moriyama-san.
316 Moriyama-san has also developed a test script that compares
317 (en|de)coded results to the corresponding Win32 API result and
318 all cp9?? maps are now verified.
319 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
321 1.78 2002/10/20 15:44:00
322 ! lib/Encode/MIME/Header.pm
323 fixed so that it works with new Encode::utf8
324 ! Encode.pm Encode.xs
325 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
326 :encoding(UTF-8) to handle partial chars at end of buffers
328 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
329 ! lib/Encode/Supported.pod
330 More nitpickings applied.
333 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
334 tests were completely useless. To redeem that and get the peace of
335 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
336 ucm/*.ucm Since this script takes too long to finish (30 seconds on
337 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
338 but you can easily run that by either renaming it or:
340 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
341 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
342 Other CJKT cp9?? also updated according to the URI below;
343 http://www.microsoft.com/typography/unicode/cscp.htm
344 + bin/ucmsort MANIFEST
345 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
346 to proper order. intended for hardcore develpers only.
347 ! ucm/cp932.ucm JP/JP.pm AUTHORS
348 CP932 mapping which was based upon the mapping file at unicode.org
349 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
350 has also supplied the patch so he was added to AUTHORS.
351 ! lib/Encode/Supported.pod
352 ISO-8859-11 != TIS 620
353 == TIS 620 + \xA0 ( )
355 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
357 1.77 2002/10/06 03:27:02
359 * Modified to accomodate up and comming patch by Inaba-san that
360 will fix tr/// needing eval qq{}
361 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
363 * pod fixes/enhancements to reflect the changes above
364 ! lib/Encode/Alias.pm
365 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
366 Message-Id: <20021001015648.GB18710@not.autrijus.org>
368 1.76 2002/08/25 15:09:51
370 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
371 but naturally identical to previous version -- dankogai
373 Codepoint fixes -- autrijus
374 Message-Id: <20020805040236.GC5220@not.autrijus.org>
376 copied everything under perl-5.8.0/ext/Encode to make sure Encode
377 is in sync w/ perl core
379 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
380 Now that binmode(FH) does implicit ":bytes" revisit
381 the failing tests. The worrisome one is the Digest::MD5
382 test-- how will it fare in CRLF lands now?
384 From: Radu Greab <radu@netsoft.ro>
385 Date: Mon, 10 Jun 2002 00:40:34 +0300
386 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
387 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
388 ones which I cannot figure out.
389 ! lib/Encode/Alias.pm
390 Subject: [Encode PATCH] spurious warning
391 From: Nicholas Clark <nick@unfortu.net>
392 Date: Sun, 2 Jun 2002 20:26:22 +0100
393 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
395 1.75 2002/06/01 18:07:49
396 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
397 glibc compliance cited by Autrijus.
398 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
399 ! bin/enc2xs bin/piconv
400 Subject: Re: forewarning: usedevel and versiononly
401 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
403 1.74 2002/05/28 18:33:15
404 + ucm/null.ucm ucm/ctrl.ucm
405 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
406 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
407 Subject: Re: unicode -> &# notation
408 Message-ID: <20020518193704.GB40272@not.autrijus.org>
410 1.73 2002/05/28 17:26:18
411 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
412 Chris Nandor has fixed Encode so that it works w/ MacPerl --
413 at least w/ PPC (68k need static linking which does not work due to
414 64k limit). pudge is added to AUTHORS (I'm surprised he was not
415 there in the list). Encode/Makefile_PL.e2x was additionally fixed
416 by dankogai to reflect changes in other Makefile.PL
417 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
419 Subject: Change 16746: -Mutf8 cleanup.
420 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
422 1.72 2002/05/20 15:49:56
424 Subject: [PATCH] Encode should be in perl-core library path
425 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
426 Message-Id: <20020520161201.A11019@alpha.hut.fi>
427 ! lib/Encode/MIME/Header.pm
428 Subject: [PATCH] Encode::MIME::Header
429 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
430 ! Encode/Makefile_PL.e2x
431 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
432 Message-Id: <20020519201031.GA1603@not.autrijus.org>
433 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
434 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
436 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
437 Subject: [PATCH] good day for WinCE port of perl.
438 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
441 Even more linting by Robin via @16532
443 Even more typecast by Sarathy in @16460
445 1.71 2002/05/07 16:22:42
447 even more typecasts by Robin
448 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
450 A very strange bug that was causing a bugus ucm -> C table
451 generation that was revealed by a UCM file that Andreas was
452 working. This is the king of wierdest bug I've encountered
453 in the course of Encode maintenance.
454 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
456 1.70 2002/05/06 10:26:48
458 Made more 'module-safe' with conjunction w/ 'no encoding'.
459 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
460 ! lib/Encode/Encoding.pm
461 'require Encode' because ->Define uses Encode::define_encoding();
462 problem and solution addressed by Miyagawa-kun
463 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
465 Cuts the frill to make djgpp happier, as suggested by Laszlo
466 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
468 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
469 Message-Id: <m3bsbug48n.fsf@anima.de>
471 1.69 2002/05/04 16:41:18
472 ! lib/Encode/MIME/Header
473 Floating-point coerced for UNICOS (in integer arithmetics it folds
474 line one character too early). Verification by Mark is pending.
475 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
477 more doc patch from Elizabeth
478 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
479 ! Encode/Makefile_PL.e2x
480 More platform-independent patch from Benjamin
481 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
482 ! lib/Encode/Guess AUTHORS
483 split regex fix by Graham Barr. Adds him to AUTHORS.
484 Message-Id: <20020504085419.E95940@valueclick.com>
485 ! Encode/Makefile_PL.e2x
486 enc2xs script discovery made smarter and more sensible, first cited
487 by Miyagawa-kun and further suggestions by Rafael and Andreas
488 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
489 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
491 1.68 2002/05/03 12:20:13
492 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
493 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
494 and Supported.pod modified to reflect the change. Elizabeth added
495 to Authors. And H.M. is also added for forwarding her patch among
496 other contributions (I was rather surprised to find his name was not
498 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
500 1.67 2002/05/02 07:33:09
502 Error message now consistent w/ perlqq (\N{U+} -> \x{})
503 done in perl@16308 but Philip linted me further. Now the error
504 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
505 ! lib/Encode/Guess.pm
506 Sanity check for happier -w by Autrijus
508 1.66 2002/05/01 05:41:06
509 ! Encode.xs t/fallback.t
510 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
511 while fallback is in effect. This even came with a welcome side-effect
512 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
513 also modified to test this.
514 And of course, the corresponding varialbles to UV[Xx]f are appropriately
515 cast. This should've concluded NI-XS homework.
517 encode(undef) does warn again! Repented upon suggestion by NI-XS.
518 Document for unless vs. '' added
519 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
521 1.65 2002/04/30 16:13:37
523 encode(undef) no longer warns for C<Use of uninitialized value in
524 subroutine entry>. Suggested by Paul.
525 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
526 ! lib/Encode/Supported.pod
527 Encode::MIME::Header and Encode::Guess mentioned
528 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
529 ! lib/Encode/Guess.pm
530 POD fix by Miyagawa-kun
531 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
533 1.64 2002/04/29 06:54:06
535 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
536 encocde("euc-jisx0213") are still DIFFERENT.
537 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
539 A few white spaces corrected by NI-XS via PerlIO integration to
541 Subject: Change 16247: Integrate perlio;
543 Document fixes by Andreas
544 Message-Id: <m3k7qsf1we.fsf@anima.de>
546 1.63 2002/04/27 18:59:50
547 ! lib/Encode/Encoding.pm
548 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
549 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
550 Make use of the Encode::Encoding base class!
551 And other cleanups in Encode.xs upon NI-XS suggestions
552 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
554 1.62 2002/04/27 11:17:39
556 encodings() now just check %ExtModule instead of eval{require}
557 all of them for ":all" to conserve more memory.
559 more "%x" -> "%" UVxf stuff.
561 s/=over2/=over 2/g # oops.
563 1.61 2002/04/26 03:02:04
565 Now does decent tests besides use_ok()
566 ! lib/Encode/Guess.pm t/guess.t
567 UI streamlined, document added
569 various signed/unsigned mismatch nits (#16173)
570 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
572 POD: utf8-flag-related caveats added. A few sections completely
576 Thou shalt not assume %d works, either!
577 Robin Baker added to AUTHORS for this
578 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
580 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
582 1.60 2002/04/24 20:06:52
584 "Thou shalt not assume %x works." -- jhi
585 Message-Id: <20020424210618.E24347@alpha.hut.fi>
586 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
587 low-memory build machines happy, now *.c is created for each *.ucm
588 (no table aggregation). You can still override this by setting
589 $ENV{AGGREGATE_TABLES}.
590 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
591 + lib/Encode/Guess.pm
592 + lib/Encode/JP/JIS7.pm
593 Encoding-autodetect (mainly for Japanese encoding) added. In a
594 course of development, JIS7.pm was improved.
595 + lib/Encode/HTML/Header.pm
596 + lib/Encode/Config.pm
597 MIME B/Q Header Encoding Added!
598 ! Encode.pm Encode.xs t/fallback.t
599 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
600 Message-Id: <20020424130709.GA14211@tanglefoot>
602 1.59 $ 2002/04/22 23:54:22
603 ! Encode.pm Encode.xs
604 needs_lines() and perlio_ok() are added to Internal encodings such
605 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
607 1.58 2002/04/22 23:54:22
609 s/MacChineseSimp/MacChineseTrad/ # ... oops.
614 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
615 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
616 t/(JP|KR).t are aggregated to t/CJKT.t
617 test data are all remade via bin/ucm2text.
618 And .... They are no longer skipped for -Uuseperlio !
620 1.57 2002/04/22 20:27:30
621 ! t/JP.t t/KR.t t/perlio.t
622 unless (find PerlIO::Layer 'perlio') ... line is back again.
623 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
624 need PerlIO. Sigh....
625 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
626 ->perlio_ok now does eval{ require PerlIO::encoding } there so
627 it correctly returns 1 when PerlIO::encoding is yet loaded.
629 perl-current patch #16072 reflected
631 1.56 2002/04/22 09:48:07
632 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
633 New PerlIO::encoding 0.04 compliance met
635 1.55 2002/04/22 03:43:05
636 ! Encode.pm Encode.xs Unicode/Unicode.pm
637 needs_lines() defined so Encode::Encoding is no longer needed
640 1.54 2002/04/22 02:50:01
641 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
642 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
643 now perlio_ok is true by default if PerlIO::encoding->VERSION is
644 0.03 or larger. POD in Encode::Encoding revised to reflect this.
645 Encode::XS and Encode::Unicode now has perlio_ok() method.
646 ! lib/Encode/Supported.pod
647 s/UP-UX/HP-UX/ by jhi
648 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
649 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
650 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
651 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
652 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
653 ! lib/Encode/Supported.pod
654 Huge document fixes by Philip.
657 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
658 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
660 binmode() after "<:encoding" to make Win32 happy, by Mattia.
661 Mattia added to AUTHORS file
662 Message-Id: <3CC3150F.5798.22A05AE@localhost>
664 1.52 2002/04/20 23:43:47
666 TODO: is now SKIP:, as NI-XS requested. Also adds more
667 eraborate failure analysis added.
669 A note on how to make sure of round-trip safety added to POD
670 section (so Autrijus is happier)
671 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
672 big5-(eten|hkscs) is round-trip safe again!
673 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
675 Typo fixes by Andreas
676 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
677 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
678 PerIO coodination patches from NI-XS.
679 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
681 1.51 2002/04/20 09:58:23
683 Updated test suite by Autrijis so "make test" is happy again
684 Message-Id: <20020420082104.GA25037@not.autrijus.org>
686 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
688 TW/TW.pm TW/Makefile.PL
689 Updates by Autrijus. 'big5' is no longer a canonical but an
690 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
691 Message-Id: <20020419195346.GA19597@not.autrijus.org>
693 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
694 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
696 PerlIO detection a little bit smarter; no longer uses eval qq{}
699 1.50 2002/04/19 06:13:02
700 ! ! Encode.pm Encode.xs Encode/encoding.h
702 New Fallback API imlemented and documented. See "perldoc Encode"
704 ! lib/Encode/JP/JIS7.pm Encode.pm
705 + lib/Encode/PerlIO.pod t/perlio.t
706 API compliance met. However, it still does not work unless perlio
707 implements line buffer. See BUGS section in perldoc Encode::PerlIO
708 As a sensible workaround, perlio_ok() added to Encode.
710 ! lib/Encode/Supported.pod
712 Message-Id: <20020418174647.J8466@alpha.hut.fi>
714 Doc fixes from Autrijus
715 Message-Id: <20020418144131.GA10987@not.autrijus.org>
717 perlqq mode documented
719 + t/jisx0201.euc t/jisx0201.ref
720 ! t/jisx0208.euc t/jisx0208.ref
721 t/JP.t tests more rigorously and with other encodings
722 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
723 in jis0208 (used to contain jisx0201 part).
724 ! Encode/Makefile_PL.e2x
725 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
726 enc2xs and encode.h rather than hard-coded. This allows the resulting
727 module fully CPANizable.
728 ! encoding.pm t/JP.t t/KR.t
729 PerlIO detection simplified (checks %INC instead of eval{})
730 ! Encode.xs Encode/encode.h
731 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
732 - lib/Encode/Unicode.pm
733 (en|de)code_xs relocated to where it belongs. Source reindented
736 Additional (U8 *) cast added as suggested by jhi
737 Message-Id: <20020417165916.A28599@alpha.hut.fi>
739 1.42 Date: 2002/04/17
741 no-op module; Thought of adding a pod there but enc2xs has
746 correct mechanism to detect Perlio::encoding layar installed.
748 PerlIO Layer detached.
750 1.41 2002/04/16 23:35:00
752 binmode(STDIN|STDOUT ...) done iff PerlIO is available
754 Cleaned up PerlIO skip conditions to prepare for the upcoming
755 Encode - PerlIO forking.
757 exported functions are now prototyped.
758 ! lib/Encode/CN/HZ.pm
761 fallback implemented # was /* FIXME */
762 affected programs revised to fit (only HZ was using the try-catch
763 approach which needed to be fixed for API-compliance).
765 ! Encode/KR/2022_KR.pm
767 can find =head1 NAME now, jhi
768 Message-Id: <20020416083059.V30639@alpha.hut.fi>
772 now complies with less warnings with the pickest compilers.
773 Suggested by Craig, fixed by Dan.
774 ! Encode/Makefile_PL.e2x
776 A bug that fails to find *.e2x in certain conditions fixed
778 1.40 2002/04/14 22:27:14
779 + Encode/ConfigLocal_PM.e2x
780 ! lib/Encode/Config.pm
782 "enc2xs -C" now generates/updates Encode::ConfigLocal.
783 ConfigLocal_PM.e2x is a skelton thereof.
784 ! lib/Encode/Config.pm
786 "use Encode::CN::HZ;" was missing.
789 More rigorous tests added to test XS, especially on memory allocation.
791 ! lib/Encode/Unicode.pm
792 NI-S implemented an XS version -- merged
793 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
796 Source filter option added. With this option on, you can write
797 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
798 encodings. t/jperl.t enhanced to test this feature.
800 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
801 apply his suggestion, this degree of nitting is enough to add him
803 Message-Id: <3CB93223.291E5E2E@earthlink.net>
805 + lib/Encode/JP/JIS7.pm
806 - lib/Encode/JP/JIS.pm
807 - lib/Encode/JP/2022_JP.pm
808 - lib/Encode/JP/2022_JP1.pm
809 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
810 JIS7.pm for better maintainability and performance
812 Added caveat for non-ascii identifiers.
814 fixes by jhi, the original author of this pragramtic module.
815 Message-Id: <20020413231527.V1826@alpha.hut.fi>
817 1.34 2002/04/12 20:23:05 (Unreleased)
820 EBCDIC fixes addressed by jhi.
821 Message-Id: <20020412161844.D9383@alpha.hut.fi>
822 ! lib/Encode/Encoder.pm
823 POD fix by Miyagawa-kun
824 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
826 1.33 2002/04/10 22:28:40
828 Philip's mail address corrected.
831 ! lib/Encode/Encoder.pm
832 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
833 This adds Miyagawa-kun to AUTHORS.
834 * encoding() no longer exported by default but on demand
835 * t/Encoder.t updated to test all these
836 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
837 ! lib/Encode/Unicode.pm
838 ! lib/Encode/Supported.pm
839 Further doc fixes by Anton
841 1.32 2002/04/09 20:06:15
844 - ucm/macDevanaga.ucm Unicode Character Map
845 - ucm/macGujarati.ucm Unicode Character Map
846 - ucm/macGurmukhi.ucm Unicode Character Map
847 A utility to check integrity of .ucm files. t/bogus.ucm is a
848 ucm that is deliberately bogus. unused Indic mappings are removed
851 resolve_alias() added as suggested by jhi. Same as
852 find_encoding("alias")->name. For convenience. This one is
853 defined in Encode.pm instead of Alias.pm.
854 Message-Id: <20020409215846.H17022@alpha.hut.fi>
856 Memory Allocate but detected during the devel of ucmlint -- fixed.
857 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
858 ! lib/Encode/Unicode.pm
859 valid_ucs2(0) is false but must be true.
860 3 patches from NI-S as follows. This also has fixed the incident
862 ! lib/Encode/Alias.pm
863 find_alias() recursion prevention
865 Checks for the patch above
866 ! t/Encode/Unicode.pm
867 An extra "F" that causes valid_ucs2() return a bogus value fixed
868 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
869 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
870 2 Small Patches from jhi as follows:
872 Encode->encodings() lists in case-insensitve order (as it was)
874 -l option prints avaiable encodings to STDOUT instead of STDERR
875 ! lib/Encode/Aliases.pm
876 s/defintion/definition/
877 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
879 ! lib/Encode/Supported.pod
880 ! lib/Encode/Unicode.pm
881 POD revise by Philip Newton. This adds Philip to AUTHORS list.
882 Thank you for the exact quote of Douglas Adams :)
883 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
885 1.31 2002/04/08 18:08:07
886 ! lib/Encode/Encoder.pm
888 Encode::Encoder, once just a placeholder of an idea, is now much more
889 practical. See t/Encode.t to find how practical it can be.
890 + lib/Encode/Config.pm
892 my false laziness at Encode.pm is fixed. Now %ExtModules are set
893 in Encode::Config and they are all literally, not programatically
894 set. My false laziness was resulting many encodings missing from
896 ! lib/Encode/Unicode.pm
898 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
899 so that it does not rely Encode::Unicode for BOM values
900 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
902 1.30 2002/04/08 02:34:51
903 + lib/Encode/Encoder.pm
904 Object Oriented Encoder. I reckon something like this is in need.
907 ! lib/Encode/Supported.pod
908 * autoloading bug that prevented upper-case canonicals such as UTF-16
909 is fixed. Now even UTF/UCS are autoloaded!
910 * encodings() is now more intuitive.
911 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
913 * Obligatory fixes to the POD.
914 ! lib/Encode/Supported.pod
915 Patch from Anton applied.
916 Message-Id: <66641479.20020408033300@motor.ru>
918 ! lib/Encode/Unicode.pm
919 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
921 1.28 2002/04/07 18:58:42
925 Just a MANIFEST for those missing files.
927 1.26 Date: 2002/04/07 15:22:04
930 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
931 And t/Aliases.t fixed to make sure they all exist.
933 1.25 2002/04/07 15:01:25 (Unreleased)
935 ! lib/Encode/Unicode.pm
938 - lib/Encode/UTF_EBCDIC.pm
939 - lib/Encode/Internal.pm
941 Integrated into Encode.pm as closures. That way "one package, one file"
942 rule is preserved yet less files to require.
944 commented out binmode(STDERR ...
953 ! Encode/Makefile_PL.e2x
954 Schwarn's MM-compliance patch merged
955 Message-Id: <20020406082609.GA28758@blackrider>
957 ! lib/Encode/Unicode.pm
958 + lib/Encode/UTF_EBCDIC.pm
960 - lib/Encode/10646_1.pm
961 - lib/Encode/ucs2_le.pm
962 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
963 per-module based (en|de)code, I saved a number of .pm by
964 reorganizing it as per-object base (Well, this is what Encode::XS
965 does under the hood). See Encode::Unicode for details.
966 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
967 This module is used only on EBCDIC environments.
969 1.21 2002/04/05 14:46:34 (Not Released)
975 Are back to make Perl/Tk happy Smile, NI-S.
977 ! lib/Encode/Alias.pm
978 ! lib/Encode/Supported.pm
979 ! lib/Encode/10646_1.pm
980 ! lib/Encode/ucs2_le.pm
981 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
982 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
983 updated to reflect changes.
984 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
985 ! lib/Encode/Alias.pm
986 ! lib/Encode/Supported.pm
987 Anton's revision commited. Added Dan's own fixes as well.
988 Message-Id: <159103166906.20020405161134@motor.ru>
989 ! lib/Encode/Alias.pm
991 < qr/^UCS2-le$/i => '"UCS-2"', );
993 > qr/^UCS2-LE$/i => '"UTF-16LE"');
994 Sigh. Thank you, Anton.
995 Message-Id: <14567692196.20020405062020@motor.ru>
996 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
998 1.20 2002/04/04 19:50:52
1000 the last minute addtion. Just give it a try. Docs remains to be done.
1001 Not installed by default.
1002 ! lib/Encode/Supported.pod
1005 ! lib/Encode/Alias.pm
1006 ! lib/Encode/utf8.pm
1007 ! lib/Encode/10464_1.pm
1008 ! lib/Encode/ucs2_le.pm
1009 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1010 unchanged but UTF-16BE is added as an alias. Implicit aliases
1011 move to Encode::Alias so init_alias() works more as expected.
1012 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1013 Though pedantically wrong, This should make perl mongers happier.
1014 t/Alias.t is enhanced to test all these.
1015 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1017 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1018 of the codepoints are common. CJKT left as is because the saving is
1022 ! EBCDIC/Makefile.PL
1024 ! Encode/Makefile_PL.e2x
1028 ! Symbol/Makefile.PL
1032 All occurance of _def.h replaced with .exh so djgpp works happily
1033 ever after! To credit this amazing discovery, Laszlo is now in
1035 Message-Id: <20020403181424.GA8778@freemail.hu>
1036 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1039 ! Encode/Makefile_PL.skel
1041 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1042 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1044 Two more tests by added jhi
1045 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1048 The showstopper fixed -- Memory reallocation bug was causing
1049 Encode::XS to fall into infinite loop on certain conditions.
1050 t/grow.t tests that.
1051 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1056 ! lib/Encode/Supported.pod
1057 Vendor encodings rebuilt out of original map files at unicode.org.
1058 Indic languages such as MacDevanagali remain unspported do to the
1059 shortcoming of encengine capabilities (they need algorithmical
1060 conversion and I have no knowledge on that!). Pods fixed for added
1062 Oh, macJapan.ucm renamed to macJapanese.ucm.
1063 macROMnn is macRomanian and macRUMnn is macRumanian.
1064 txt2ucm is a crude script that is used to convert them.
1066 Unicode Compound Characters (used extensively on Mac) supported
1068 Typo fixes and improvements by jhi
1069 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1071 1.11 2002/03/31 22:12:13
1075 Missing files from the MANIFEST fixed.
1076 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1077 Version incremented just to make CPAN happy.
1079 1.10 2002/03/31 21:32:42
1082 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1083 if you want. This should make Autrijus happy. Also, piconv
1084 is added to default install.
1087 Here-documented files that enc2xs generates are now exported
1088 to *.e2x. Much cleaner and easier to debug.
1090 encoding enhances so you can make it act more like such
1091 (now prehistoric ) "localized" variations of perl like Jperl.
1093 Further test for encoding.pm. Written in euc-jp
1096 Taken over form jhi.
1097 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1102 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1109 ! Encode/macIceland.ucm
1110 ! lib/Encode/Alias.pm
1111 ! lib/Encode/Supported.pod
1112 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1113 Message-Id: <m3lmcavhjt.fsf@anima.de>
1115 1.01 2002/03/29 20:59:39
1118 s/USE_SCRIPTS/MORE_SCRIPTS/
1120 installs enc2xs by default for external Encode:: modules in CPAN,
1121 such as Encode::HanExtra
1123 More sensible perl core detection via $ENV{PERL_CORE}
1125 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1127 Perl core ditection via $^X =~ m/\bminiperl$/o
1128 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1130 1.00 Wed Mar 29 2002
1132 The version of all files is updated to 1.00 via "ci -f -l1.00",
1133 commemorating version 1.00. All files, including *.ucm are now
1134 under version control.
1137 encode.h moved to Encode/ so it will be installed for the later
1140 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1145 compile renamed to enc2xs.
1146 Affected Makefle.PL updated
1148 "Punt it. HanExtra can take care of that later." -- Autrijus
1149 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1152 ! Encode/ksc5601.ucm
1153 ! lib/Encode/CJKConstants.pm
1154 ! lib/Encode/KR/2022_KR.pm
1155 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1156 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1160 bin/ added for example scripts. They are not installed by default.
1161 to install them, "perl Makefile.PL USE_SCRIPTS".
1162 piconv is iconv reinvented in perl. in addition to all features
1163 of iconv, it also adds perlish features. See L<piconv/1> for more
1165 ! lib/Encode/Alias.pm
1166 qr/^ replaced with qr/\b so it directly matches locale names
1167 such as en_US.US-ASCII
1170 Patch by MJD to fix the following problem applied.
1171 Subject: [PATCH 5.7.3 Encode]
1172 Aliases.t not properly skipped when Encode extension not built
1173 Message-Id: <20020328091850.18677.qmail@plover.com>
1174 ! lib/Encode/KR/2022_KR.pm
1175 ! lib/Encode/CJKConstants.pm
1176 Another patch from Jungshik to make iso-2022-kr actually work
1177 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1178 ! Encode/Encode/euc-kr.ucm
1179 + Encode/Encode/johab.ucm
1180 ! Encode/Encode/ksc5601.ucm
1182 ! Encode/KR/Makefile.PL
1183 ! Encode/lib/Encode/Alias.pm
1185 Johab support and complete revision of Korean Encoding by Jungshik
1186 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1188 Revised to make up with now-dropped Encode::Details.
1189 - lib/Encode/Details.pod
1190 Dropped. Besides being obsolete, the topics are now covered in
1191 respective pods now.
1196 Korean aliases fixed thanks to Jungshik Shin
1197 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1198 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1200 =head1 NAME added to all modules to make buildtoc happy
1201 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1202 - lib/Encode/CJKguide.pod
1203 Too controversial and dropped from the dist. Will be available
1204 separately on the web.
1206 RCS tags added so table debugging gets easier (should that be
1207 needed! I hope they all stay 1.00!)
1208 + lib/Encode/CJKguide.pod
1209 A detailed guide to mainly, but not limited to, CJK multibyte
1212 + Encode/hp-roman8.ucm
1214 ! Encode/Supported.pod
1215 All occurance of "roman8" replaced with "hp-roman8" to avoid
1217 ! Encode/Supported.pod
1220 Mac Encodings now comply the Inside Macintosh
1222 Test for '-raw' conventions added.
1224 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1225 ! Encode/gb12345.ucm
1227 ! Encode/ksc5601.ucm
1228 "-raw" appended to canonical names.
1229 File mames stay unchanged thanks to UCM format.
1230 ! lib/Encode/CN/HZ.pm
1231 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1232 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1234 0.99 Tue Mar 26 2002
1235 - lib/Encode/JP/Const.pm
1236 + lib/Encode/CJKConstants.pm
1237 + lib/Encode/CN/2022_CN.pm
1238 + lib/Encode/KR/2022_KR.pm
1247 * Support for ISO-2022-KR and ISO-2022-CN added.
1249 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1250 * ucm2table autogenerates character table out of UCM files.
1253 - lib/Encode/Supports.pod
1254 + lib/Encode/Supported.pod
1255 Names reverted due to popular demand.
1256 8.3 rule applies only when there is a conflict.
1257 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1262 - lib/Encode/Format/Enc.pod
1264 * Character tables is now 100% ucm.
1265 * All files under Encode/ is now 8.3-compliant
1266 * some of missing encodings added (i.e. gsm0338 and nextstep)
1267 * Vendor mappings aggregated with appropriate national std in
1268 Makefile.PL, resulting smaller *.so especially for CJK.
1269 Following is result on Dan's FreeBSD box.
1271 ---------------------------------------------------------------
1272 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1273 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1274 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1275 blib/arch/auto/Encode/Encode.so 27,791 27,791
1276 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1277 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1278 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1279 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1280 ---------------------------------------------------------------
1281 Total 5,375,297 6,343,819
1283 * As a result of ucm-transition, Encode::Tcl dropped because
1284 Encode::Tcl demands *.enc.
1285 Encode::Tcl will be supplied in a separate tarball with *.enc.
1286 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1291 -lib/Encode/Supported.pod
1292 +lib/Encode/Supports.pod
1293 -lib/Encode/iso10646_1.pm
1294 +lib/Encode/10646_1.pm
1295 -lib/Encode/EncFormat.pod
1296 +lib/Encode/Format/Enc.pod
1297 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1298 - lib/Encode/JP/Constants.pm
1299 + lib/Encode/JP/Consts.pm
1300 ! lib/Encode/JP/JIS.pm
1301 ! lib/Encode/JP/H2Z.pm
1302 Version nit problem and 8.3 rule fix.
1303 > Package namespace installed latest in CPAN file
1304 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1305 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1306 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1307 accordingly. In addition, legacy "use vars qw()..." are replaced with
1309 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1310 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1312 - lib/Encode/JP/ISO_2022_JP.pm
1313 - lib/Encode/JP/ISO_2022_JP_1.pm
1314 + lib/Encode/JP/2022_JP.pm
1315 + lib/Encode/JP/2022_JP1.pm
1317 8.3 naming conflict for vanilla fat addressed by jhi
1318 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1321 Typecast fix addressed by jhi
1322 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1324 0.98 Mon Mar 25 2002
1325 ! lib/Encode/Supported.pod
1327 + lib/Encode/JP/ISO_2022_JP_1.pm
1328 ! lib/Encode/JP/ISO_2022_JP.pm
1329 ! lib/Encode/JP/JIS.pm
1331 Now Encode::JP is more strict on the difference between ISO-2022-JP
1332 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1333 makes Anton happier :) FYI the previous version implements
1334 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1335 ! lib/Encode/Supported.pod
1338 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1339 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1344 ! lib/Encode/Suppoted.pod
1345 pod fixes to replace F<http://...> to L<http://...>,
1346 as suggested by Autrijius in:
1347 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1348 ! lib/Encode/Suppoted.pod
1349 fixes and enhancements by Anton
1350 Message-Id: <10632060120.20020324103753@motor.ru>
1351 ! lib/Encode/Alias.pm
1352 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1353 added. Suggested by Anton then deobfuscated by Autrijius
1354 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1356 Further fix by Nicholas Clark
1357 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1358 - lib/EncodeFormat.pod
1359 + lib/Encode/EncFormat.pod
1361 File renamed as suggested by Autrijius
1363 ! lib/Encode/Details.pod
1364 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1365 ! Encode.pm Sun Mar 24 13:43:47 2002
1366 pod fixes by Autrijius.
1367 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1368 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1370 ! lib/Encode/Alias.pm
1372 now more EBCDIC conscious;
1373 %ExtModules on EBCDIC system excludes CJK so that you don't
1374 have to worry about the matched alias resulting cloaking.
1375 t/Alias.t also revised to reflect changes. Verified by jhi
1376 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1378 0.97 Sun Mar 24 2002
1382 EBCDIC detection mechanism installed as in JP/JP.pm
1383 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1386 ! EBCDIC/Makefile.PL
1389 ! Symbol/Makefile.PL
1391 Now all table files used by compile are postfixed '_t' to avoid
1392 namespace collisions in case insensitive file systems once for all!
1394 Message-ID: <58290227735.20020323195659@familiehaase.de>
1396 Since the Encode::JP is unsupported under EBCDIC we
1397 cannot run this test (aliases as such should work fine) -- jhi
1398 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1400 duplicate occurance of ascii.ucm and 8859-1.ucm
1401 causes MacOS X dlyd to cloak
1407 < chdir 't' if -d 't';
1409 > if (! -d 'blib' and -d 't'){ chdir 't' };
1410 When you are "make test"-ing on Encode/ directory, you must not
1411 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1412 remain unchanced. Also the situation detection was made simpler
1413 in t/JP.t, which was originally;
1414 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1416 "Use of uninitialized value in string eq at Encode.pm line 96."
1417 ! Symbol/Makefile.PL
1418 ! EBCDIC/Makefile.PL
1420 -- Problem on case insensitive file systems
1421 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1422 Message-ID: <88254111953.20020323095503@familiehaase.de>
1425 "So I think it's a bug in gcc, not perl. But it still needs to be
1427 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1428 Message-Id: <20020323170509.C96475@plum.flirble.org>
1430 0.96 Sat Mar 23 2002
1432 ! lib/Encode/Encoding.pm
1433 ! lib/Encode/Alias.pm
1434 ! lib/Encode/Supported.pod
1436 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1437 Message-ID: <20020322073908.GB10539@blackrider>
1440 "...I think we should include ISO 8859-1 as well." -- NI-S
1441 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1446 ! lib/Encode/Alias.pm
1447 alias definitions relocated to Encode::Alias so module autoloading
1448 works for aliases also.
1450 encodings() now accepts args to check ExtModules.
1454 + EBCDIC/Makefile.PL
1455 + Symbol/Makefile.PL
1459 Latin and single byte encodings are reorganized so they are
1460 demand-loaded like Encode::XX. Now only ascii is compiled into
1462 ! lib/Encode/Alias.pm
1463 for my $k (keys %hash){ delete $hash{$k}; }
1464 is depreciated; fixed.
1466 0.95 Fri Mar 22 2002
1467 In this update, pod rewrites and alias fixes are the main issues
1468 + lib/Encode/Supported.pod
1469 Describes supported encodings
1471 streamlined compiled-in encodings.
1472 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1474 + Encode/ibm-125?.ucm
1475 Added from icu distibution with any occurance of
1476 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1477 some respect to icu staff, however.
1478 + lib/Encode/Alias.pm
1480 Alias difinitions in Encode.pm relocated.
1483 packWARN patch from Paul Marquess via jhi
1484 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1485 Paul added to AUTHORS as a result.
1486 ! t/CJKalias.t -> t/Aliases.t
1487 Renamed. Checks even more aliases and alias overloading
1490 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1491 gbk => cp936 relocated to CN.pm
1493 Test::More with plans (by jhi)
1495 0.94 Thu Mar 21 2002
1496 + lib/Encode/Description.pod
1497 ! lib/Encode/Encoding.pm
1498 Now the pod in Encode.pm is abridged as programming references.
1499 lib/Encode/Description.pod contains the original, detailed description
1500 and Encode::Encoding explains how to write your own module to
1501 add new encodings. So far, lib/Encode/Description.pod contains
1502 the whole pod once in Encode.pm. This is intentional.
1504 Pod revisions by Anton Tagunov
1505 Message-Id: <517178431.20020320174824@motor.ru>
1507 all occrance of Encode::Tcl::Extended removed including pod
1509 test now checks $encoding->name only; $encoding->{name} are
1510 no longer check to find the canonical name.
1511 ! lib/Encode/JP/JIS.pm
1512 ! lib/Encode/JP/ISO_2022_JP.pm
1513 ->name() added to be more compliant with API
1519 Patch by Autrijus to add aliases to TW and fixes to POD
1520 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1522 SADAHIRO Tomoyuki added as should. My apologies.
1524 0.93 Wed Mar 20 2002
1525 * First release to be uploaded to CPAN. For prehistoric changes,
1526 please see Changes file of perl distibution as well as
1527 perl-unicode@perl.org archive, available at:
1528 http://archive.develooper.com/perl-unicode@perl.org/
1530 Changes Since 0.92 includes;
1535 + Mention to perl-unicode@perl.org added
1537 + Encoding aliases added so you can feed locale names
1538 and MIME Charset="" directly.
1539 - Mention to JISX0212 removed because it's fixed
1542 + Encoding aliases added. Note TW is left untouched because
1543 euc-tw is not implemented in TW but in Encode::HanExtra.
1544 Autrijus, you may fix Encode::HanExtra.
1546 + to test encode aliases added