1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.96 2003/06/18 09:29:02 dankogai Exp $
5 $Revision: 1.96 $ $Date: 2003/06/18 09:29:02 $
6 ! lib/Encode/JP/JP.pm t/guess.t
7 m/(...)/ in void context then $1 is considered a Bad Thing
8 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
10 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
13 More typecast from maintperl@19739
14 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
16 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
17 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
19 Now skips for EBCDIC platform.
20 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
22 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
23 have more than one C<.> in their filenames." -- perlport.
24 Message-Id: <3ED79E01.8050401@mac.com>
26 Found and fixed the bug that -p,--perlqq does not work.
27 Induced by the change from Getopt::Std to Getopt::Long.
29 Addressed [cpan #2629] Wrong assumption in numeric comparison
30 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
31 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
32 lib/Encode/Encoding.pm t/perlio.t
33 ! API Change: ->new_sequence() => ->renew()
34 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
35 + Encode::XS and Encode::utf8 now supports ->renew()
36 + Encode::Encoding now documents this with examples
37 - Non-XS (en|de)code stripped out of Encode::Unicode
38 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
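A minimal sketch of the renamed call, assuming a current Encode where find_encoding() returns an object that provides ->renew(). UTF-16 is used here because its byte order depends on the BOM seen in the stream, which is the per-stream state the entry above refers to; the variable names are illustrative only.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# The object returned by find_encoding() is shared by every caller.
my $shared = find_encoding('UTF-16');

# renew() asks for an object meant to serve a single stream, so that
# per-stream state (such as the detected BOM) is not shared.
my $private = $shared->renew;

printf "%s -> %s (%s)\n", $shared->name, $private->name, ref $private;
```

Encodings without per-stream state may simply return themselves from ->renew(), so callers should not rely on getting a distinct object.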
40 1.95 2003/05/21 08:41:11
42 Since bogus entries were found in iso-8859-6, all entries are
43 re-generated once again out of
44 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
45 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
46 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
47 + lib/Encode/Unicode/UTF7.pm
48 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
49 lib/Encode/Supported.pod
50 UTF-7 support is now added. With this Encode now has all transcoding
51 methods in Unicode::String.
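A short round-trip sketch for the new encoding, assuming the canonical name "UTF-7" is accepted by encode() and decode(). The sample string is made up for illustration, and the exact octets produced are deliberately not shown.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

my $text  = "Hi Mom \x{263A}.";        # one character outside ASCII
my $utf7  = encode('UTF-7', $text);    # octets restricted to 7-bit values
my $again = decode('UTF-7', $utf7);    # back to a character string

print "7-bit clean\n"   unless $utf7 =~ /[^\x00-\x7F]/;
print "round trip ok\n" if $again eq $text;
```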
53 1.94 2003/05/10 18:13:59
54 ! lib/Encode/MIME/Header.pm
55 A more sophisticated solution for double-encoding by dankogai
56 ! lib/Encode/MIME/Header.pm AUTHORS
57 Two bugs fixed by Bjoern Jacke
58 * "Double Encoding" was not possible
59 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
60 * encode("MIME-Q") had UTF-8 flag on
61 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
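For illustration, the encoded word quoted in the entry above can be decoded and re-created as follows. This is a sketch that assumes the "MIME-B" and "MIME-Header" encoding names are available in the installed Encode; only the B encoding is shown.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# The encoded word from the bug report decodes to "Apfel" with A-umlaut.
my $word    = '=?UTF-8?B?w4RwZmVs?=';
my $decoded = decode('MIME-Header', $word);

# Going the other way: characters in, an RFC 2047 encoded word out.
my $subject = "\x{c4}pfel";
my $encoded = encode('MIME-B', $subject);

print "$encoded\n";
print "round trip ok\n" if decode('MIME-Header', $encoded) eq $subject;
```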
62 ! lib/Encode/MIME/Header.pm AUTHORS
63 Two occurrences of "croak ()" fixed as "croak qq()".
64 Simon Cozens is added to AUTHORS as a result.
65 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
67 POD fixes that reflect enhancements by jhi
69 Two enhancements by jhi.
70 + Now uses Getopt::Long so it accepts long name options
71 (--from for -f, for example)
72 + New option: -r,--resolve
73 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
75 META.yml added upon request of Schwern
76 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
78 Enache Adrian removed upon request -- to live longer than Encode
79 and/or FreeBSD (toy-)?thread :)
80 Message-Id: <20030425015701.GA2069@ratsnest.hole>
82 "close STDOUT unless $^O eq 'freebsd';" once again relocated
83 to keep VMS happy in which case "$^O eq 'freebsd'" is required
84 to keep FreeBSD+thread happy. Sigh.
85 Message-Id: <3EA88ADC.3000300@mac.com>
87 1.93 2003/04/24 17:43:16
89 added "no warnings 'pack'" in for loop to keep bleedperl from
90 complaining "Character in 'C' format wrapped in pack".
92 More elegant perl core detection inspired by Ilya Zakharevich
93 (but further elaborated for general cases).
94 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
97 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
98 are less likely to be confused w/ ISO-8859-*
100 RT tests added (vendor encodings are exemplified)
101 -- that successfully found a flaw on iso-2022-kr before the patch.
102 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
103 decode("ISO-2022-KR") has been buggy but no one ever cited
104 that since no one seems to be using it. Bugs discovered by
106 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
107 ! lib/Encode/CN/HZ.pm t/perlio.t
108 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
109 so it adds test for HZ.
110 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
111 ! lib/Encode/Guess.pm
112 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
113 So long as the string contains \x{00}-\x{ff} it does not fail.
114 See perldoc for details.
115 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
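A hedged sketch of the guessing behaviour described above, assuming Encode::Guess exports guess_encoding() and that the NUL-byte heuristic is enabled (the default). The input is BOM-less UTF-16LE produced on the spot, so every ASCII character is followed by a NUL byte.

```perl
use strict;
use warnings;
use Encode qw(encode);
use Encode::Guess;    # exports guess_encoding()

# BOM-less UTF-16LE octets: "h\0e\0l\0l\0o\0"
my $octets = encode('UTF-16LE', 'hello');

my $guess = guess_encoding($octets);
if (ref $guess) {
    # Success: $guess is an encoding object.
    print "guessed: ", $guess->name, "\n";
    print "decoded: ", $guess->decode($octets), "\n";
}
else {
    # Failure: $guess is a plain error string.
    print "could not guess: $guess\n";
}
```

Checking ref($guess) before using the result matters because a failed guess is reported as a string, not as an exception.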
117 1.92 2003/03/31 03:27:27
118 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
119 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
120 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
121 but glibc-BIG5-2.1.2.ucm does.
122 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
123 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
124 Maps regenerated again but this time based upon
125 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
126 (But where is THE DOCUMENT by MICROSOFT?)
127 ! t/enc_module.t AUTHORS
128 failure with threaded Perl on FreeBSD addressed.
129 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
130 Message-Id: <20030322230131.GA813@ratsnest.hole>
131 ! lib/Encode/Guess.pm
134 Change 18989: Make the :bytes conditional on PerlIO.
135 further Modified by Dan Kogai
136 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
138 Change 18966: another fix for failing test on windows ("use encoding"
139 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
140 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
142 Change 18970: Hopefully this works also in Win32, by jhi
143 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
144 Change 18965: fix CJKT.t failures on windows due to incorrect
146 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
148 1.91 2003/03/09 20:07:37
150 even more proofread by jhi.
151 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
154 +use lib qw(t ext/Encode/t ../ext/Encode/t);
155 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
157 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
158 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
161 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
163 1.90 2003/03/09 17:32:43
166 Inaba-san has added a patch for perl 5.8.1 or later that makes
167 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
168 that. POD is further revised.
169 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
170 ! encoding.pm t/enc_module.t
171 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
172 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
176 signed vs. unsigned issue discovered by Craig on OpenVMS
177 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
178 ! encoding.pm AUTHORS
179 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
180 Because binmode() stacks layers instead of overwriting them, you have to
181 ":raw :encoding()" in encoding.pm or you are in trouble when you
182 call encoding.pm multiple times. There are several workarounds
183 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
184 who was the first to address this problem was added to AUTHORS.
185 A test suite was added for this, which is a modified version
186 of SUGAWARA-san's scripts
187 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
189 1.88 2003/02/20 14:42:34
191 one signedness nit for Encode by jhi
192 <200302161933.h1GJX876018710@kosh.hut.fi>
194 VISCII map was incorrect; fixed by Sadahiro-san
195 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
196 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
197 You can't unlink files that are opened in cygwin but the last
198 file handle opened in t/enc_*.t left open. Patch submitted
199 by Yitzchak and he was added to AUTHORS.
200 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
202 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
203 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
205 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
206 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
208 1.87 2003/02/06 01:52:11
210 * Inaba "Sensei" Hirohito added (I thought I had done so long
211 ago but apparently I did not).
212 * SUZUKI Norio added for various and useful bug reports.
213 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
214 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
215 podchecked so all warnings are gone except for L<http://>.
216 ! encoding.pm t/enc_eucjp.t
217 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
218 * Filter option overhaul
220 ! Encode.pm Encode.xs encengine.c Encode/encode.h
221 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
222 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
223 without filter. podchecked by Dan Kogai.
224 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
225 ! lib/Encode/Alias.pm
226 decode('alias', $1) went wild because of local $_ in find_alias()
227 the evil local $_ is eradicated but that changes find_alias()
228 format for coderef aliasing. See Encode::Alias for details
229 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
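A minimal sketch of coderef aliasing in the newer format, assuming the coderef receives the requested name as its first argument and returns the canonical name (or undef when it does not apply). The alias "x-demo-latin" is made up for this example.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);
use Encode::Alias;    # exports define_alias()

# Take the name from the argument list instead of relying on $_.
define_alias(sub {
    my ($name) = @_;
    return $name =~ /^x-demo-latin$/i ? 'iso-8859-1' : undef;
});

my $enc = find_encoding('x-demo-latin');
print $enc->name, "\n";
```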
231 1.86 2003/01/22 03:29:07
233 * Don't forget to canonize when you attempt an exact match!
234 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
235 * ${^ENCODING} exception is off for $] > 5.008
236 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
238 $] check commented out so it runs on 5.8.0
240 1.85 2003/01/21 22:19:14
242 ${^ENCODING} exception is now explicit rather than handled by regex.
243 + t/enc_eucjp.t t/enc_utf8.t
244 Test suite for the better "encoding" pragma support for bleedperl.
245 On 5.8.0, they will just be skipped.
247 1.84 2003/01/10 12:00:16
249 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
250 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
251 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
253 %_ fixes saves the resulting .so .05% smaller, by NC
254 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
256 Silence Encode on undef, by Andreas
257 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
258 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
260 s/regognised/recognised/ . British spelling left intact to pay
261 respect to two British Nicks :)
262 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
264 1.83 2002/11/18 17:28:49
265 ! Encode.xs lib/Encode/JIS7.pm
266 Even more patches from Inaba-san have been applied. With this
267 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
268 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
270 1.82 2002/11/14 23:06:12
272 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
273 later found in perls w/ -Dusemymalloc was fixed by NC.
274 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
276 1.81 2002/11/08 18:29:27
277 ! Encode.pm Encode.xs
278 Non-XS version of Encode::utf8 is back (with XS being default).
279 Encode::predefine_encodings(0) to turn off XS.
280 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
281 found that it was Test::More causing the problem, not Encode.
282 But I have already made it configurable so it may be useful in
284 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
286 The ingenious patch by Nicholas Clark that reduces shlib sizes by
287 50% with no penalty and backward compatibility preserved, is in.
288 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
290 1.80 2002/10/21 20:39:09
291 ! Encode.xs t/mime-header.t
292 Even more patches from NI-XS regarding Encode::utf8->decode().
293 And one more test to t/mime-header.t to prove it
294 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
296 1.79 2002/10/21 06:05:37
298 Further patches from NI-XS. Encode::utf8->decode() now checks the
299 value of utf8 flag of the argument. As a result, the fix to
300 lib/Encode/MIME/Header.pm is no longer necessary but since it did
301 no harm (even speedwise) I'll leave it unreverted.
302 ! ucm/cp949.ucm ucm/cp950.ucm
304 U+00AE REGISTERED SIGN
305 were missing as a result of 1.78. Discovered by Moriyama-san.
306 Moriyama-san has also developed a test script that compares
307 (en|de)coded results to the corresponding Win32 API result and
308 all cp9?? maps are now verified.
309 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
311 1.78 2002/10/20 15:44:00
312 ! lib/Encode/MIME/Header.pm
313 fixed so that it works with new Encode::utf8
314 ! Encode.pm Encode.xs
315 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
316 :encoding(UTF-8) to handle partial chars at end of buffers
318 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
319 ! lib/Encode/Supported.pod
320 More nitpickings applied.
323 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
324 tests were completely useless. To redeem that and get the peace of
325 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
326 ucm/*.ucm. Since this script takes too long to finish (30 seconds on
327 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
328 but you can easily run that by either renaming it or:
330 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
331 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
332 Other CJKT cp9?? also updated according to the URI below;
333 http://www.microsoft.com/typography/unicode/cscp.htm
334 + bin/ucmsort MANIFEST
335 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
336 to proper order. Intended for hardcore developers only.
337 ! ucm/cp932.ucm JP/JP.pm AUTHORS
338 CP932 mapping which was based upon the mapping file at unicode.org
339 was found obsolete by MORIYAMA Masayuki <msyk@mtg.biglobe.ne.jp>. He
340 has also supplied the patch so he was added to AUTHORS.
341 ! lib/Encode/Supported.pod
342 ISO-8859-11 != TIS 620
343 == TIS 620 + \xA0 ( )
345 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
347 1.77 2002/10/06 03:27:02
349 * Modified to accommodate the upcoming patch by Inaba-san that
350 will fix tr/// needing eval qq{}
351 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
353 * pod fixes/enhancements to reflect the changes above
354 ! lib/Encode/Alias.pm
355 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
356 Message-Id: <20021001015648.GB18710@not.autrijus.org>
358 1.76 2002/08/25 15:09:51
360 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
361 but naturally identical to previous version -- dankogai
363 Codepoint fixes -- autrijus
364 Message-Id: <20020805040236.GC5220@not.autrijus.org>
366 copied everything under perl-5.8.0/ext/Encode to make sure Encode
367 is in sync w/ perl core
369 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
370 Now that binmode(FH) does implicit ":bytes" revisit
371 the failing tests. The worrisome one is the Digest::MD5
372 test-- how will it fare in CRLF lands now?
374 From: Radu Greab <radu@netsoft.ro>
375 Date: Mon, 10 Jun 2002 00:40:34 +0300
376 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
377 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
378 ones which I cannot figure out.
379 ! lib/Encode/Alias.pm
380 Subject: [Encode PATCH] spurious warning
381 From: Nicholas Clark <nick@unfortu.net>
382 Date: Sun, 2 Jun 2002 20:26:22 +0100
383 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
385 1.75 2002/06/01 18:07:49
386 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
387 glibc compliance cited by Autrijus.
388 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
389 ! bin/enc2xs bin/piconv
390 Subject: Re: forewarning: usedevel and versiononly
391 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
393 1.74 2002/05/28 18:33:15
394 + ucm/null.ucm ucm/ctrl.ucm
395 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
396 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
397 Subject: Re: unicode -> &# notation
398 Message-ID: <20020518193704.GB40272@not.autrijus.org>
400 1.73 2002/05/28 17:26:18
401 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
402 Chris Nandor has fixed Encode so that it works w/ MacPerl --
403 at least w/ PPC (68k need static linking which does not work due to
404 64k limit). pudge is added to AUTHORS (I'm surprised he was not
405 there in the list). Encode/Makefile_PL.e2x was additionally fixed
406 by dankogai to reflect changes in other Makefile.PL
407 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
409 Subject: Change 16746: -Mutf8 cleanup.
410 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
412 1.72 2002/05/20 15:49:56
414 Subject: [PATCH] Encode should be in perl-core library path
415 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
416 Message-Id: <20020520161201.A11019@alpha.hut.fi>
417 ! lib/Encode/MIME/Header.pm
418 Subject: [PATCH] Encode::MIME::Header
419 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
420 ! Encode/Makefile_PL.e2x
421 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
422 Message-Id: <20020519201031.GA1603@not.autrijus.org>
423 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
424 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
426 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
427 Subject: [PATCH] good day for WinCE port of perl.
428 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
431 Even more linting by Robin via @16532
433 Even more typecast by Sarathy in @16460
435 1.71 2002/05/07 16:22:42
437 even more typecasts by Robin
438 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
440 A very strange bug that was causing a bogus ucm -> C table
441 generation that was revealed by a UCM file that Andreas was
442 working on. This is the king of the weirdest bugs I've encountered
443 in the course of Encode maintenance.
444 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
446 1.70 2002/05/06 10:26:48
448 Made more 'module-safe' in conjunction w/ 'no encoding'.
449 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
450 ! lib/Encode/Encoding.pm
451 'require Encode' because ->Define uses Encode::define_encoding();
452 problem and solution addressed by Miyagawa-kun
453 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
455 Cuts the frill to make djgpp happier, as suggested by Laszlo
456 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
458 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
459 Message-Id: <m3bsbug48n.fsf@anima.de>
461 1.69 2002/05/04 16:41:18
462 ! lib/Encode/MIME/Header
463 Floating-point coerced for UNICOS (in integer arithmetics it folds
464 line one character too early). Verification by Mark is pending.
465 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
467 more doc patch from Elizabeth
468 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
469 ! Encode/Makefile_PL.e2x
470 More platform-independent patch from Benjamin
471 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
472 ! lib/Encode/Guess AUTHORS
473 split regex fix by Graham Barr. Adds him to AUTHORS.
474 Message-Id: <20020504085419.E95940@valueclick.com>
475 ! Encode/Makefile_PL.e2x
476 enc2xs script discovery made smarter and more sensible, first cited
477 by Miyagawa-kun and further suggestions by Rafael and Andreas
478 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
479 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
481 1.68 2002/05/03 12:20:13
482 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
483 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
484 and Supported.pod modified to reflect the change. Elizabeth added
485 to Authors. And H.M. is also added for forwarding her patch among
486 other contributions (I was rather surprised to find his name was not
488 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
490 1.67 2002/05/02 07:33:09
492 Error message now consistent w/ perlqq (\N{U+} -> \x{})
493 done in perl@16308 but Philip linted me further. Now the error
494 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
495 ! lib/Encode/Guess.pm
496 Sanity check for happier -w by Autrijus
498 1.66 2002/05/01 05:41:06
499 ! Encode.xs t/fallback.t
500 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
501 while fallback is in effect. This even came with a welcome side-effect
502 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
503 also modified to test this.
504 And of course, the corresponding variables to UV[Xx]f are appropriately
505 cast. This should've concluded NI-XS homework.
507 encode(undef) does warn again! Repented upon suggestion by NI-XS.
508 Document for unless vs. '' added
509 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
511 1.65 2002/04/30 16:13:37
513 encode(undef) no longer warns for C<Use of uninitialized value in
514 subroutine entry>. Suggested by Paul.
515 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
516 ! lib/Encode/Supported.pod
517 Encode::MIME::Header and Encode::Guess mentioned
518 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
519 ! lib/Encode/Guess.pm
520 POD fix by Miyagawa-kun
521 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
523 1.64 2002/04/29 06:54:06
525 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
526 encode("euc-jisx0213") are still DIFFERENT.
527 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
529 A few white spaces corrected by NI-XS via PerlIO integration to
531 Subject: Change 16247: Integrate perlio;
533 Document fixes by Andreas
534 Message-Id: <m3k7qsf1we.fsf@anima.de>
536 1.63 2002/04/27 18:59:50
537 ! lib/Encode/Encoding.pm
538 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
539 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
540 Make use of the Encode::Encoding base class!
541 And other cleanups in Encode.xs upon NI-XS suggestions
542 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
544 1.62 2002/04/27 11:17:39
546 encodings() now just check %ExtModule instead of eval{require}
547 all of them for ":all" to conserve more memory.
549 more "%x" -> "%" UVxf stuff.
551 s/=over2/=over 2/g # oops.
553 1.61 2002/04/26 03:02:04
555 Now does decent tests besides use_ok()
556 ! lib/Encode/Guess.pm t/guess.t
557 UI streamlined, document added
559 various signed/unsigned mismatch nits (#16173)
560 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
562 POD: utf8-flag-related caveats added. A few sections completely
566 Thou shalt not assume %d works, either!
567 Robin Baker added to AUTHORS for this
568 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
570 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
572 1.60 2002/04/24 20:06:52
574 "Thou shalt not assume %x works." -- jhi
575 Message-Id: <20020424210618.E24347@alpha.hut.fi>
576 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
577 low-memory build machines happy, now *.c is created for each *.ucm
578 (no table aggregation). You can still override this by setting
579 $ENV{AGGREGATE_TABLES}.
580 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
581 + lib/Encode/Guess.pm
582 + lib/Encode/JP/JIS7.pm
583 Encoding-autodetect (mainly for Japanese encoding) added. In a
584 course of development, JIS7.pm was improved.
585 + lib/Encode/HTML/Header.pm
586 + lib/Encode/Config.pm
587 MIME B/Q Header Encoding Added!
588 ! Encode.pm Encode.xs t/fallback.t
589 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
590 Message-Id: <20020424130709.GA14211@tanglefoot>
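The two new fallbacks can be compared side by side. This sketch assumes the constants are reachable as Encode::FB_HTMLCREF and Encode::FB_XMLCREF; the sample character is arbitrary.

```perl
use strict;
use warnings;
use Encode qw(encode);

my $char = "\x{3042}";    # HIRAGANA LETTER A, which ASCII cannot represent

# encode() may modify its source argument when a CHECK value is given,
# so each call works on its own copy.
my $for_html = $char;
my $for_xml  = $char;

my $html = encode('ascii', $for_html, Encode::FB_HTMLCREF());  # decimal reference
my $xml  = encode('ascii', $for_xml,  Encode::FB_XMLCREF());   # hexadecimal reference

print "$html\n";    # &#12354;
print "$xml\n";     # &#x3042;
```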
592 1.59 $ 2002/04/22 23:54:22
593 ! Encode.pm Encode.xs
594 needs_lines() and perlio_ok() are added to Internal encodings such
595 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
597 1.58 2002/04/22 23:54:22
599 s/MacChineseSimp/MacChineseTrad/ # ... oops.
604 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
605 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
606 t/(JP|KR).t are aggregated to t/CJKT.t
607 test data are all remade via bin/ucm2text.
608 And .... They are no longer skipped for -Uuseperlio !
610 1.57 2002/04/22 20:27:30
611 ! t/JP.t t/KR.t t/perlio.t
612 unless (find PerlIO::Layer 'perlio') ... line is back again.
613 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
614 need PerlIO. Sigh....
615 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
616 ->perlio_ok now does eval{ require PerlIO::encoding } there so
617 it correctly returns 1 when PerlIO::encoding is yet loaded.
619 perl-current patch #16072 reflected
621 1.56 2002/04/22 09:48:07
622 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
623 New PerlIO::encoding 0.04 compliance met
625 1.55 2002/04/22 03:43:05
626 ! Encode.pm Encode.xs Unicode/Unicode.pm
627 needs_lines() defined so Encode::Encoding is no longer needed
630 1.54 2002/04/22 02:50:01
631 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
632 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
633 now perlio_ok is true by default if PerlIO::encoding->VERSION is
634 0.03 or larger. POD in Encode::Encoding revised to reflect this.
635 Encode::XS and Encode::Unicode now has perlio_ok() method.
636 ! lib/Encode/Supported.pod
637 s/UP-UX/HP-UX/ by jhi
638 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
639 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
640 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
641 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
642 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
643 ! lib/Encode/Supported.pod
644 Huge document fixes by Philip.
647 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
648 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
650 binmode() after "<:encoding" to make Win32 happy, by Mattia.
651 Mattia added to AUTHORS file
652 Message-Id: <3CC3150F.5798.22A05AE@localhost>
654 1.52 2002/04/20 23:43:47
656 TODO: is now SKIP:, as NI-XS requested. Also adds more
657 elaborate failure analysis.
659 A note on how to make sure of round-trip safety added to POD
660 section (so Autrijus is happier)
661 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
662 big5-(eten|hkscs) is round-trip safe again!
663 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
665 Typo fixes by Andreas
666 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
667 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
668 PerlIO coordination patches from NI-XS.
669 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
671 1.51 2002/04/20 09:58:23
673 Updated test suite by Autrijus so "make test" is happy again
674 Message-Id: <20020420082104.GA25037@not.autrijus.org>
676 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
678 TW/TW.pm TW/Makefile.PL
679 Updates by Autrijus. 'big5' is no longer a canonical but an
680 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
681 Message-Id: <20020419195346.GA19597@not.autrijus.org>
683 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
684 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
686 PerlIO detection a little bit smarter; no longer uses eval qq{}
689 1.50 2002/04/19 06:13:02
690 ! Encode.pm Encode.xs Encode/encoding.h
692 New Fallback API implemented and documented. See "perldoc Encode"
694 ! lib/Encode/JP/JIS7.pm Encode.pm
695 + lib/Encode/PerlIO.pod t/perlio.t
696 API compliance met. However, it still does not work unless perlio
697 implements line buffer. See BUGS section in perldoc Encode::PerlIO
698 As a sensible workaround, perlio_ok() added to Encode.
700 ! lib/Encode/Supported.pod
702 Message-Id: <20020418174647.J8466@alpha.hut.fi>
704 Doc fixes from Autrijus
705 Message-Id: <20020418144131.GA10987@not.autrijus.org>
707 perlqq mode documented
709 + t/jisx0201.euc t/jisx0201.ref
710 ! t/jisx0208.euc t/jisx0208.ref
711 t/JP.t tests more rigorously and with other encodings
712 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
713 in jis0208 (used to contain jisx0201 part).
714 ! Encode/Makefile_PL.e2x
715 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
716 enc2xs and encode.h rather than hard-coded. This allows the resulting
717 module fully CPANizable.
718 ! encoding.pm t/JP.t t/KR.t
719 PerlIO detection simplified (checks %INC instead of eval{})
720 ! Encode.xs Encode/encode.h
721 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
722 - lib/Encode/Unicode.pm
723 (en|de)code_xs relocated to where it belongs. Source reindented
726 Additional (U8 *) cast added as suggested by jhi
727 Message-Id: <20020417165916.A28599@alpha.hut.fi>
729 1.42 Date: 2002/04/17
731 no-op module; Thought of adding a pod there but enc2xs has
736 correct mechanism to detect PerlIO::encoding layer installed.
738 PerlIO Layer detached.
740 1.41 2002/04/16 23:35:00
742 binmode(STDIN|STDOUT ...) done iff PerlIO is available
744 Cleaned up PerlIO skip conditions to prepare for the upcoming
745 Encode - PerlIO forking.
747 exported functions are now prototyped.
748 ! lib/Encode/CN/HZ.pm
751 fallback implemented # was /* FIXME */
752 affected programs revised to fit (only HZ was using the try-catch
753 approach which needed to be fixed for API-compliance).
755 ! Encode/KR/2022_KR.pm
757 can find =head1 NAME now, jhi
758 Message-Id: <20020416083059.V30639@alpha.hut.fi>
762 now compiles with fewer warnings with the pickiest compilers.
763 Suggested by Craig, fixed by Dan.
764 ! Encode/Makefile_PL.e2x
766 A bug that fails to find *.e2x in certain conditions fixed
768 1.40 2002/04/14 22:27:14
769 + Encode/ConfigLocal_PM.e2x
770 ! lib/Encode/Config.pm
772 "enc2xs -C" now generates/updates Encode::ConfigLocal.
773 ConfigLocal_PM.e2x is a skeleton thereof.
774 ! lib/Encode/Config.pm
776 "use Encode::CN::HZ;" was missing.
779 More rigorous tests added to test XS, especially on memory allocation.
781 ! lib/Encode/Unicode.pm
782 NI-S implemented an XS version -- merged
783 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
786 Source filter option added. With this option on, you can write
787 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
788 encodings. t/jperl.t enhanced to test this feature.
790 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
791 apply his suggestion, this degree of nitting is enough to add him
793 Message-Id: <3CB93223.291E5E2E@earthlink.net>
795 + lib/Encode/JP/JIS7.pm
796 - lib/Encode/JP/JIS.pm
797 - lib/Encode/JP/2022_JP.pm
798 - lib/Encode/JP/2022_JP1.pm
799 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
800 JIS7.pm for better maintainability and performance
802 Added caveat for non-ascii identifiers.
804 fixes by jhi, the original author of this pragmatic module.
805 Message-Id: <20020413231527.V1826@alpha.hut.fi>
807 1.34 2002/04/12 20:23:05 (Unreleased)
810 EBCDIC fixes addressed by jhi.
811 Message-Id: <20020412161844.D9383@alpha.hut.fi>
812 ! lib/Encode/Encoder.pm
813 POD fix by Miyagawa-kun
814 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
816 1.33 2002/04/10 22:28:40
818 Philip's mail address corrected.
821 ! lib/Encode/Encoder.pm
822 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
823 This adds Miyagawa-kun to AUTHORS.
824 * encoding() no longer exported by default but on demand
825 * t/Encoder.t updated to test all these
826 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
827 ! lib/Encode/Unicode.pm
828 ! lib/Encode/Supported.pm
829 Further doc fixes by Anton
831 1.32 2002/04/09 20:06:15
834 - ucm/macDevanaga.ucm Unicode Character Map
835 - ucm/macGujarati.ucm Unicode Character Map
836 - ucm/macGurmukhi.ucm Unicode Character Map
837 A utility to check integrity of .ucm files. t/bogus.ucm is a
838 ucm that is deliberately bogus. unused Indic mappings are removed
841 resolve_alias() added as suggested by jhi. Same as
842 find_encoding("alias")->name. For convenience. This one is
843 defined in Encode.pm instead of Alias.pm.
844 Message-Id: <20020409215846.H17022@alpha.hut.fi>
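A minimal sketch of the equivalence described in the resolve_alias() entry above. It assumes a current Encode installation, where resolve_alias() can be imported on request; "latin1" is only a sample alias and the variable names are illustrative.

```perl
use strict;
use warnings;
use Encode qw(find_encoding resolve_alias);

# "latin1" is a sample alias chosen for illustration (an assumption);
# any name that find_encoding() accepts will do.
my $alias = "latin1";

# The long way: look up the encoding object, then ask for its name.
my $via_find = find_encoding($alias)->name;

# The shortcut described in the entry above.
my $via_resolve = resolve_alias($alias);

print "find_encoding: $via_find\n";
print "resolve_alias: $via_resolve\n";
```

Both lines should print the same canonical name.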
846 Memory allocation bug detected during the devel of ucmlint -- fixed.
847 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
848 ! lib/Encode/Unicode.pm
849 valid_ucs2(0) is false but must be true.
850 3 patches from NI-S as follows. This also has fixed the incident
852 ! lib/Encode/Alias.pm
853 find_alias() recursion prevention
855 Checks for the patch above
856 ! t/Encode/Unicode.pm
857 An extra "F" that causes valid_ucs2() return a bogus value fixed
858 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
859 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
860 2 Small Patches from jhi as follows:
862 Encode->encodings() lists in case-insensitive order (as it was)
864 -l option prints available encodings to STDOUT instead of STDERR
865 ! lib/Encode/Aliases.pm
866 s/defintion/definition/
867 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
869 ! lib/Encode/Supported.pod
870 ! lib/Encode/Unicode.pm
871 POD revise by Philip Newton. This adds Philip to AUTHORS list.
872 Thank you for the exact quote of Douglas Adams :)
873 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
875 1.31 2002/04/08 18:08:07
876 ! lib/Encode/Encoder.pm
878 Encode::Encoder, once just a placeholder of an idea, is now much more
879 practical. See t/Encode.t to find how practical it can be.
880 + lib/Encode/Config.pm
882 my false laziness at Encode.pm is fixed. Now %ExtModules are set
883 in Encode::Config and they are all literally, not programmatically
884 set. My false laziness was resulting in many encodings missing from
886 ! lib/Encode/Unicode.pm
888 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
889 so that it does not rely on Encode::Unicode for BOM values
890 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
892 1.30 2002/04/08 02:34:51
893 + lib/Encode/Encoder.pm
894 Object Oriented Encoder. I reckon something like this is in need.
897 ! lib/Encode/Supported.pod
898 * autoloading bug that prevented upper-case canonicals such as UTF-16
899 is fixed. Now even UTF/UCS are autoloaded!
900 * encodings() is now more intuitive.
901 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
903 * Obligatory fixes to the POD.
904 ! lib/Encode/Supported.pod
905 Patch from Anton applied.
906 Message-Id: <66641479.20020408033300@motor.ru>
908 ! lib/Encode/Unicode.pm
909 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
911 1.28 2002/04/07 18:58:42
915 Just a MANIFEST for those missing files.
917 1.26 Date: 2002/04/07 15:22:04
920 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
921 And t/Aliases.t fixed to make sure they all exist.
923 1.25 2002/04/07 15:01:25 (Unreleased)
925 ! lib/Encode/Unicode.pm
928 - lib/Encode/UTF_EBCDIC.pm
929 - lib/Encode/Internal.pm
931 Integrated into Encode.pm as closures. That way "one package, one file"
932 rule is preserved yet fewer files to require.
934 commented out binmode(STDERR ...
943 ! Encode/Makefile_PL.e2x
944 Schwern's MM-compliance patch merged
945 Message-Id: <20020406082609.GA28758@blackrider>
947 ! lib/Encode/Unicode.pm
948 + lib/Encode/UTF_EBCDIC.pm
950 - lib/Encode/10646_1.pm
951 - lib/Encode/ucs2_le.pm
952 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
953 per-module based (en|de)code, I saved a number of .pm by
954 reorganizing it as per-object base (Well, this is what Encode::XS
955 does under the hood). See Encode::Unicode for details.
956 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
957 This module is used only on EBCDIC environments.
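A short sketch of how the (UCS-2|UTF-(16|32))(LE|BE)? family behaves when encoding. It reflects the documented behaviour of current Encode releases, which is an assumption with respect to the 1.2x code described here; the sample text and variable names are illustrative.

```perl
use strict;
use warnings;
use Encode qw(encode);

my $text = "A";    # U+0041

# Endianness given explicitly: no BOM is written.
my $be = encode("UTF-16BE", $text);
my $le = encode("UTF-16LE", $text);

# Endianness omitted: current Encode documents that encode() writes
# big-endian data with a BOM prepended.
my $bom = encode("UTF-16", $text);

# Show each result as a hex string.
printf "%-9s %s\n", $_->[0], unpack("H*", $_->[1])
    for ["UTF-16BE", $be], ["UTF-16LE", $le], ["UTF-16", $bom];
```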
959 1.21 2002/04/05 14:46:34 (Not Released)
965 Are back to make Perl/Tk happy. Smile, NI-S.
967 ! lib/Encode/Alias.pm
968 ! lib/Encode/Supported.pm
969 ! lib/Encode/10646_1.pm
970 ! lib/Encode/ucs2_le.pm
971 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
972 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
973 updated to reflect changes.
974 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
975 ! lib/Encode/Alias.pm
976 ! lib/Encode/Supported.pm
977 Anton's revision committed. Added Dan's own fixes as well.
978 Message-Id: <159103166906.20020405161134@motor.ru>
979 ! lib/Encode/Alias.pm
981 < qr/^UCS2-le$/i => '"UCS-2"', );
983 > qr/^UCS2-LE$/i => '"UTF-16LE"');
984 Sigh. Thank you, Anton.
985 Message-Id: <14567692196.20020405062020@motor.ru>
986 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
988 1.20 2002/04/04 19:50:52
990 the last minute addition. Just give it a try. Docs remain to be done.
991 Not installed by default.
992 ! lib/Encode/Supported.pod
995 ! lib/Encode/Alias.pm
997 ! lib/Encode/10646_1.pm
998 ! lib/Encode/ucs2_le.pm
999 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
1000 unchanged but UTF-16BE is added as an alias. Implicit aliases
1001 move to Encode::Alias so init_alias() works more as expected.
1002 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1003 Though pedantically wrong, this should make perl mongers happier.
1004 t/Alias.t is enhanced to test all these.
1005 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1007 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1008 of the codepoints are common. CJKT left as is because the saving is
1012 ! EBCDIC/Makefile.PL
1014 ! Encode/Makefile_PL.e2x
1018 ! Symbol/Makefile.PL
1022 All occurrences of _def.h replaced with .exh so djgpp works happily
1023 ever after! To credit this amazing discovery, Laszlo is now in
1025 Message-Id: <20020403181424.GA8778@freemail.hu>
1026 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1029 ! Encode/Makefile_PL.skel
1031 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1032 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1034 Two more tests added by jhi
1035 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1038 The showstopper fixed -- Memory reallocation bug was causing
1039 Encode::XS to fall into infinite loop on certain conditions.
1040 t/grow.t tests that.
1041 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1046 ! lib/Encode/Supported.pod
1047 Vendor encodings rebuilt out of original map files at unicode.org.
1048 Indic languages such as MacDevanagari remain unsupported due to the
1049 shortcoming of encengine capabilities (they need algorithmic
1050 conversion and I have no knowledge of that!). Pods fixed for added
1052 Oh, macJapan.ucm renamed to macJapanese.ucm.
1053 macROMnn is macRomanian and macRUMnn is macRumanian.
1054 txt2ucm is a crude script that is used to convert them.
1056 Unicode Compound Characters (used extensively on Mac) supported
1058 Typo fixes and improvements by jhi
1059 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1061 1.11 2002/03/31 22:12:13
1065 Missing files from the MANIFEST fixed.
1066 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1067 Version incremented just to make CPAN happy.
1069 1.10 2002/03/31 21:32:42
1072 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1073 if you want. This should make Autrijus happy. Also, piconv
1074 is added to default install.
1077 Here-documented files that enc2xs generates are now exported
1078 to *.e2x. Much cleaner and easier to debug.
1080 encoding enhanced so you can make it act more like such
1081 (now prehistoric) "localized" variations of perl like Jperl.
1083 Further test for encoding.pm. Written in euc-jp
1086 Taken over from jhi.
1087 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1092 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1099 ! Encode/macIceland.ucm
1100 ! lib/Encode/Alias.pm
1101 ! lib/Encode/Supported.pod
1102 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1103 Message-Id: <m3lmcavhjt.fsf@anima.de>
1105 1.01 2002/03/29 20:59:39
1108 s/USE_SCRIPTS/MORE_SCRIPTS/
1110 installs enc2xs by default for external Encode:: modules in CPAN,
1111 such as Encode::HanExtra
1113 More sensible perl core detection via $ENV{PERL_CORE}
1115 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1117 Perl core detection via $^X =~ m/\bminiperl$/o
1118 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1120 1.00 Wed Mar 29 2002
1122 The version of all files is updated to 1.00 via "ci -f -l1.00",
1123 commemorating version 1.00. All files, including *.ucm are now
1124 under version control.
1127 encode.h moved to Encode/ so it will be installed for the later
1130 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1135 compile renamed to enc2xs.
1136 Affected Makefile.PL updated
1138 "Punt it. HanExtra can take care of that later." -- Autrijus
1139 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1142 ! Encode/ksc5601.ucm
1143 ! lib/Encode/CJKConstants.pm
1144 ! lib/Encode/KR/2022_KR.pm
1145 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1146 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1150 bin/ added for example scripts. They are not installed by default.
1151 To install them, "perl Makefile.PL USE_SCRIPTS".
1152 piconv is iconv reinvented in perl. In addition to all features
1153 of iconv, it also adds perlish features. See L<piconv/1> for more
1155 ! lib/Encode/Alias.pm
1156 qr/^ replaced with qr/\b so it directly matches locale names
1157 such as en_US.US-ASCII
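A minimal sketch of why swapping the ^ anchor for \b lets an alias pattern match inside a locale name. The two patterns below are illustrative stand-ins, not the actual entries in Encode::Alias.

```perl
use strict;
use warnings;

my $locale = "en_US.US-ASCII";

# Illustrative patterns only (assumptions); the real alias table is
# not quoted in this entry.
my $anchored = qr/^US-?ASCII$/i;     # must match from the start of the string
my $boundary = qr/\bUS-?ASCII$/i;    # may start at any word boundary

my $anchored_hit = $locale =~ $anchored ? 1 : 0;
my $boundary_hit = $locale =~ $boundary ? 1 : 0;

print "anchored: $anchored_hit, boundary: $boundary_hit\n";
```

The "." before "US-ASCII" is a non-word character, so \b matches there while ^ cannot.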
1160 Patch by MJD to fix the following problem applied.
1161 Subject: [PATCH 5.7.3 Encode]
1162 Aliases.t not properly skipped when Encode extension not built
1163 Message-Id: <20020328091850.18677.qmail@plover.com>
1164 ! lib/Encode/KR/2022_KR.pm
1165 ! lib/Encode/CJKConstants.pm
1166 Another patch from Jungshik to make iso-2022-kr actually work
1167 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1168 ! Encode/Encode/euc-kr.ucm
1169 + Encode/Encode/johab.ucm
1170 ! Encode/Encode/ksc5601.ucm
1172 ! Encode/KR/Makefile.PL
1173 ! Encode/lib/Encode/Alias.pm
1175 Johab support and complete revision of Korean Encoding by Jungshik
1176 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1178 Revised to make up with now-dropped Encode::Details.
1179 - lib/Encode/Details.pod
1180 Dropped. Besides being obsolete, the topics are now covered in
1181 respective pods.
1186 Korean aliases fixed thanks to Jungshik Shin
1187 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1188 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1190 =head1 NAME added to all modules to make buildtoc happy
1191 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1192 - lib/Encode/CJKguide.pod
1193 Too controversial and dropped from the dist. Will be available
1194 separately on the web.
1196 RCS tags added so table debugging gets easier (should that be
1197 needed! I hope they all stay 1.00!)
1198 + lib/Encode/CJKguide.pod
1199 A detailed guide to mainly, but not limited to, CJK multibyte
1202 + Encode/hp-roman8.ucm
1204 ! Encode/Supported.pod
1205 All occurrences of "roman8" replaced with "hp-roman8" to avoid
1207 ! Encode/Supported.pod
1210 Mac Encodings now comply with the Inside Macintosh
1212 Test for '-raw' conventions added.
1214 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1215 ! Encode/gb12345.ucm
1217 ! Encode/ksc5601.ucm
1218 "-raw" appended to canonical names.
1219 File names stay unchanged thanks to UCM format.
1220 ! lib/Encode/CN/HZ.pm
1221 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1222 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1224 0.99 Tue Mar 26 2002
1225 - lib/Encode/JP/Const.pm
1226 + lib/Encode/CJKConstants.pm
1227 + lib/Encode/CN/2022_CN.pm
1228 + lib/Encode/KR/2022_KR.pm
1237 * Support for ISO-2022-KR and ISO-2022-CN added.
1239 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1240 * ucm2table autogenerates character table out of UCM files.
1243 - lib/Encode/Supports.pod
1244 + lib/Encode/Supported.pod
1245 Names reverted due to popular demand.
1246 8.3 rule applies only when there is a conflict.
1247 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1252 - lib/Encode/Format/Enc.pod
1254 * Character tables are now 100% ucm.
1255 * All files under Encode/ are now 8.3-compliant
1256 * some of the missing encodings added (i.e. gsm0338 and nextstep)
1257 * Vendor mappings aggregated with appropriate national std in
1258 Makefile.PL, resulting in smaller *.so especially for CJK.
1259 Following is result on Dan's FreeBSD box.
1261 ---------------------------------------------------------------
1262 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1263 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1264 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1265 blib/arch/auto/Encode/Encode.so 27,791 27,791
1266 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1267 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1268 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1269 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1270 ---------------------------------------------------------------
1271 Total 5,375,297 6,343,819
1273 * As a result of ucm-transition, Encode::Tcl dropped because
1274 Encode::Tcl demands *.enc.
1275 Encode::Tcl will be supplied in a separate tarball with *.enc.
1276 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1281 -lib/Encode/Supported.pod
1282 +lib/Encode/Supports.pod
1283 -lib/Encode/iso10646_1.pm
1284 +lib/Encode/10646_1.pm
1285 -lib/Encode/EncFormat.pod
1286 +lib/Encode/Format/Enc.pod
1287 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
1288 - lib/Encode/JP/Constants.pm
1289 + lib/Encode/JP/Consts.pm
1290 ! lib/Encode/JP/JIS.pm
1291 ! lib/Encode/JP/H2Z.pm
1292 Version nit problem and 8.3 rule fix.
1293 > Package namespace installed latest in CPAN file
1294 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1295 was noted by jhi then Dan discovers "Constants.pm" does not comply with 8.3
1296 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
1297 accordingly. In addition, legacy "use vars qw()..." are replaced with
1299 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1300 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1302 - lib/Encode/JP/ISO_2022_JP.pm
1303 - lib/Encode/JP/ISO_2022_JP_1.pm
1304 + lib/Encode/JP/2022_JP.pm
1305 + lib/Encode/JP/2022_JP1.pm
1307 8.3 naming conflict for vanilla fat addressed by jhi
1308 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1311 Typecast fix addressed by jhi
1312 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1314 0.98 Mon Mar 25 2002
1315 ! lib/Encode/Supported.pod
1317 + lib/Encode/JP/ISO_2022_JP_1.pm
1318 ! lib/Encode/JP/ISO_2022_JP.pm
1319 ! lib/Encode/JP/JIS.pm
1321 Now Encode::JP is more strict on the difference between ISO-2022-JP
1322 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1323 makes Anton happier :) FYI the previous version implements
1324 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1325 ! lib/Encode/Supported.pod
1328 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1329 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1334 ! lib/Encode/Supported.pod
1335 pod fixes to replace F<http://...> with L<http://...>,
1336 as suggested by Autrijus in:
1337 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1338 ! lib/Encode/Supported.pod
1339 fixes and enhancements by Anton
1340 Message-Id: <10632060120.20020324103753@motor.ru>
1341 ! lib/Encode/Alias.pm
1342 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1343 added. Suggested by Anton then deobfuscated by Autrijus
1344 Message-Id: <20020324064455.GA3667@not.autrijus.org>
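A hedged sketch of the regex form of define_alias() used in the entry above, where the right-hand side is a string evaluated after a match so that $1 is available. The "IsoLat" alias family is hypothetical and invented for this example; behaviour is that of current Encode::Alias.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);
use Encode::Alias qw(define_alias);

# Hypothetical alias family (not part of Encode): "isolat1",
# "IsoLat-2", ... map to "iso-8859-1", "iso-8859-2", ...
# The value is a string that is evaluated after the pattern matches,
# so $1 holds the captured digits.
define_alias( qr/^isolat[- ]?(\d+)$/i => '"iso-8859-$1"' );

my $enc = find_encoding("IsoLat-2");
print $enc ? $enc->name : "not found", "\n";
```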
1346 Further fix by Nicholas Clark
1347 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1348 - lib/EncodeFormat.pod
1349 + lib/Encode/EncFormat.pod
1351 File renamed as suggested by Autrijus
1353 ! lib/Encode/Details.pod
1354 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1355 ! Encode.pm Sun Mar 24 13:43:47 2002
1356 pod fixes by Autrijus.
1357 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1358 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1360 ! lib/Encode/Alias.pm
1362 now more EBCDIC conscious;
1363 %ExtModules on EBCDIC system excludes CJK so that you don't
1364 have to worry about the matched alias resulting in croaking.
1365 t/Alias.t also revised to reflect changes. Verified by jhi
1366 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1368 0.97 Sun Mar 24 2002
1372 EBCDIC detection mechanism installed as in JP/JP.pm
1373 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1376 ! EBCDIC/Makefile.PL
1379 ! Symbol/Makefile.PL
1381 Now all table files used by compile are postfixed '_t' to avoid
1382 namespace collisions in case insensitive file systems once and for all!
1384 Message-ID: <58290227735.20020323195659@familiehaase.de>
1386 Since the Encode::JP is unsupported under EBCDIC we
1387 cannot run this test (aliases as such should work fine) -- jhi
1388 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1390 duplicate occurrence of ascii.ucm and 8859-1.ucm
1391 causes MacOS X dyld to croak
1397 < chdir 't' if -d 't';
1399 > if (! -d 'blib' and -d 't'){ chdir 't' };
1400 When you are "make test"-ing on Encode/ directory, you must not
1401 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1402 remain unchanged. Also the situation detection was made simpler
1403 in t/JP.t, which was originally;
1404 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1406 "Use of uninitialized value in string eq at Encode.pm line 96."
1407 ! Symbol/Makefile.PL
1408 ! EBCDIC/Makefile.PL
1410 -- Problem on case insensitive file systems
1411 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1412 Message-ID: <88254111953.20020323095503@familiehaase.de>
1415 "So I think it's a bug in gcc, not perl. But it still needs to be
1417 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1418 Message-Id: <20020323170509.C96475@plum.flirble.org>
1420 0.96 Sat Mar 23 2002
1422 ! lib/Encode/Encoding.pm
1423 ! lib/Encode/Alias.pm
1424 ! lib/Encode/Supported.pod
1426 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1427 Message-ID: <20020322073908.GB10539@blackrider>
1430 "...I think we should include ISO 8859-1 as well." -- NI-S
1431 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1436 ! lib/Encode/Alias.pm
1437 alias definitions relocated to Encode::Alias so module autoloading
1438 works for aliases also.
1440 encodings() now accepts args to check ExtModules.
1444 + EBCDIC/Makefile.PL
1445 + Symbol/Makefile.PL
1449 Latin and single byte encodings are reorganized so they are
1450 demand-loaded like Encode::XX. Now only ascii is compiled into
1452 ! lib/Encode/Alias.pm
1453 for my $k (keys %hash){ delete $hash{$k}; }
1454 is deprecated; fixed.
1456 0.95 Fri Mar 22 2002
1457 In this update, pod rewrites and alias fixes are the main issues
1458 + lib/Encode/Supported.pod
1459 Describes supported encodings
1461 streamlined compiled-in encodings.
1462 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1464 + Encode/ibm-125?.ucm
1465 Added from icu distribution with any occurrence of
1466 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
1467 some respect to icu staff, however.
1468 + lib/Encode/Alias.pm
1470 Alias definitions in Encode.pm relocated.
1473 packWARN patch from Paul Marquess via jhi
1474 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1475 Paul added to AUTHORS as a result.
1476 ! t/CJKalias.t -> t/Aliases.t
1477 Renamed. Checks even more aliases and alias overloading
1480 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1481 gbk => cp936 relocated to CN.pm
1483 Test::More with plans (by jhi)
1485 0.94 Thu Mar 21 2002
1486 + lib/Encode/Description.pod
1487 ! lib/Encode/Encoding.pm
1488 Now the pod in Encode.pm is abridged as programming references.
1489 lib/Encode/Description.pod contains the original, detailed description
1490 and Encode::Encoding explains how to write your own module to
1491 add new encodings. So far, lib/Encode/Description.pod contains
1492 the whole pod once in Encode.pm. This is intentional.
1494 Pod revisions by Anton Tagunov
1495 Message-Id: <517178431.20020320174824@motor.ru>
1497 all occurrences of Encode::Tcl::Extended removed including pod
1499 test now checks $encoding->name only; $encoding->{name} is
1500 no longer checked to find the canonical name.
1501 ! lib/Encode/JP/JIS.pm
1502 ! lib/Encode/JP/ISO_2022_JP.pm
1503 ->name() added to be more compliant with API
1509 Patch by Autrijus to add aliases to TW and fixes to POD
1510 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1512 SADAHIRO Tomoyuki added as should. My apologies.
1514 0.93 Wed Mar 20 2002
1515 * First release to be uploaded to CPAN. For prehistoric changes,
1516 please see Changes file of perl distribution as well as
1517 perl-unicode@perl.org archive, available at:
1518 http://archive.develooper.com/perl-unicode@perl.org/
1520 Changes since 0.92 include:
1525 + Mention to perl-unicode@perl.org added
1527 + Encoding aliases added so you can feed locale names
1528 and MIME Charset="" directly.
1529 - Mention to JISX0212 removed because it's fixed
1532 + Encoding aliases added. Note TW is left untouched because
1533 euc-tw is not implemented in TW but in Encode::HanExtra.
1534 Autrijus, you may fix Encode::HanExtra.
1536 + to test encode aliases added