1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.95 2003/05/21 08:41:11 dankogai Exp $
5 $Revision: 1.95 $ $Date: 2003/05/21 08:41:11 $
7 Since bogus entries were found in iso-8859-6, all entries are
8 re-generated once again out of
9 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
10 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
11 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
12 + lib/Encode/Unicode/UTF7.pm
13 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
14 lib/Encode/Supported.pod
15 UTF-7 support is now added. With this Encode now has all transcoding
16 methods in Unicode::String.
18 1.94 2003/05/10 18:13:59
19 ! lib/Encode/MIME/Header.pm
20 A more sophisticated solution for double-encoding by dankogai
21 ! lib/Encode/MIME/Header.pm AUTHORS
22 Two bugs fixed by Bjoern Jacke
23 * "Double Encoding" was not possible
24 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
25 * encode("MIME-Q") had UTF-8 flag on
26 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
27 ! lib/Encode/MIME/Header.pm AUTHORS
28 Two occurances of "croak ()" fixed as "croak qq()".
29 Simon Cozens is added to AUTHORS as a result.
30 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
32 POD fixes that reflect enhancements by jhi
34 Two enhancements by jhi.
35 + Now uses Getopt::Long so it accepts long name options
36 (--from for -f, for example)
37 + New option: -r,--resolve
38 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
40 META.yml added upon request of Schwern
41 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
43 Enache Adrian removed upon request -- to live longer than Encode
44 and/or FreeBSD (toy-)?thread :)
45 Message-Id: <20030425015701.GA2069@ratsnest.hole>
47 "close STDOUT unless $^O eq 'freebsd';" once again relocated
48 to keep VMS happy in which case "$^O eq 'freebsd'" is required
49 to keep FreeBSD+thread happy. Sigh.
50 Message-Id: <3EA88ADC.3000300@mac.com>
52 1.93 2003/04/24 17:43:16
54 added "no warnings 'pack'" in for loop to keep bleedperl from
55 complaining "Character in 'C' format wrapped in pack".
57 More elegant perl core detection inspired by Ilya Zakharevich
58 (but further elaborated for general cases).
59 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
62 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
63 are less likely to be confused w/ ISO-8859-*
65 RT tests added (vendor encodings are exemplified)
66 -- that successfully found a flaw on iso-2022-kr before the patch.
67 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
68 decode("ISO-2022-KR") has been buggy but no one ever sited
69 that since no one seems to be using it. Bugs discovered by
71 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
72 ! lib/Encode/CN/HZ.pm t/perlio.t
73 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
74 so it adds test for HZ.
75 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
77 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
78 So long as the string contains \x{00}-\x{ff} it does not fail.
79 See perldoc for details.
80 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
82 1.92 2003/03/31 03:27:27
83 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
84 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
85 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
86 but glibc-BIG5-2.1.2.ucm does.
87 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
88 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
89 Maps regenerated again but this time based upon
90 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
91 (But where is THE DOCUMENT by MICROSOFT?)
92 ! t/enc_module.t AUTHORS
93 failure with threaded Perl on FreeBSD addressed.
94 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
95 Message-Id: <20030322230131.GA813@ratsnest.hole>
99 Change 18989: Make the :bytes conditional on PerlIO.
100 further Modified by Dan Kogai
101 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
103 Chnage 18966: another fix for failing test on windows ("use encoding"
104 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
105 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
107 Change 18970: Hopefully this works also in Win32, by jhi
108 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
109 Change 18965: fix CJKT.t failures on windows due to incorrect
111 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
113 1.91 2003/03/09 20:07:37
115 even more proofread by jhi.
116 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
119 +use lib qw(t ext/Encode/t ../ext/Encode/t);
120 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
122 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
123 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
126 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
128 1.90 2003/03/09 17:32:43
131 Inaba-san has added a patch for perl 5.8.1 or later that makes
132 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
133 that. POD is further revised.
134 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
135 ! encoding.pm t/enc_module.t
136 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
137 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
141 signed vs. unsigned issue discovered by Craig on OpenVM
142 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
143 ! encoding.pm AUTHORS
144 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
145 Because binmode() stacks layers instead of overwrite, you have to
146 ":raw :encoding()" in encoding.pm or your are in trouble when you
147 call encoding.pm multiple times. There are several workarounds
148 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
149 who was the first to address this problem was added to AUTHORS.
150 The test suites was added for this, which is a modified version
151 of SUGAWARA-san's scripts
152 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
154 1.88 2003/02/20 14:42:34
156 one signedness nit for Encode by jhi
157 <200302161933.h1GJX876018710@kosh.hut.fi>
159 VISCII map was incorrect; fixed by Sadahiro-san
160 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
161 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
162 You can't unlink files that are opened in cygwin but the last
163 file handle opened in t/enc_*.t left open. Patch submitted
164 by Yitzchak and he was added to AUTHORS.
165 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
167 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
168 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
170 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
171 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
173 1.87 2003/02/06 01:52:11
175 * Inaba "Sensei" Hirohito added (I thought I have done so a long
176 ago but apparently I did not).
177 * SUZUKI Norio added for verious and useful bug reports.
178 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
179 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
180 podchecked so all warnings are gone except for L<http://>.
181 ! encoding.pm t/enc_eucjp.t
182 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
183 * Filter option overhaul
185 ! Encode.pm Encode.xs encengine.c Encode/encode.h
186 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
187 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
188 without filter. podchecked by Dan Kogai.
189 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
190 ! lib/Encode/Alias.pm
191 decode('alias', $1) went wild because of local $_ in find_alias()
192 the evil local $_ is eradicated but that changes find_alias()
193 format for coderef aliasing. See Encode::Alias for details
194 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
196 1.86 2003/01/22 03:29:07
198 * Don't forget to canonize when you attempt an exact match!
199 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
200 * ${^ENCODING} exception is off for $] > 5.008
201 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
203 $] check commented out so it runs on 5.8.0
205 1.85 2003/01/21 22:19:14
207 ${^ENCODING} exception is now explicit rather than handled by regex.
208 + t/enc_eucjp.t t/enc_utf8.t
209 Test suite for the better "encoding" pragma support for bleedperl.
210 On 5.8.0, they will just be skipped.
212 1.84 2003/01/10 12:00:16
214 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
215 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
216 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
218 %_ fixes saves the resulting .so .05% smaller, by NC
219 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
221 Silence Encode on undef, by Andreas
222 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
223 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
225 s/regognised/recognised/ . British spelling left intact to pay
226 respect to two British Nicks :)
227 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
229 1.83 2002/11/18 17:28:49
230 ! Encode.xs lib/Encode/JIS7.pm
231 Even more patches from Inaba-san has been applied. With this
232 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
233 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
235 1.82 2002/11/14 23:06:12
237 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
238 later found in perls w/ -Dusemymalloc was fixed by NC.
239 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
241 1.81 2002/11/08 18:29:27
242 ! Encode.pm Encode.xs
243 Non-XS version of Encode::utf8 is back (with XS being default).
244 Encode::predefine_encodings(0) to turn off XS.
245 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
246 found that it was Test::More causing the problem, not Encode.
247 But I have already made it configurable so it may be useful in
249 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
251 The ingenious patch by Nicholas Clark that reduces shlib sizes by
252 50% with no penalty and backward compatibility preserved, is in.
253 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
255 1.80 2002/10/21 20:39:09
256 ! Encode.xs t/mime-header.t
257 Even more patches from NI-XS regarding Encode::utf8->decode().
258 And one more test to t/mime-header.t to prove it
259 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
261 1.79 2002/10/21 06:05:37
263 Further patches from NI-XS. Encode::utf8->decode() now checks the
264 value of utf8 flag of the argument. As a result, the fix to
265 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
266 no harm (even speedwise) I'll leave it unreverted.
267 ! ucm/cp949.ucm ucm/cp950.ucm
269 U+00AE REGISTERED SIGN
270 were missing as a result of 1.78. Discovered by Moriyama-san.
271 Moriyama-san has also developed a test script that compares
272 (en|de)coded results to the corresponding Win32 API result and
273 all cp9?? maps are now verified.
274 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
276 1.78 2002/10/20 15:44:00
277 ! lib/Encode/MIME/Header.pm
278 fixed so that it works with new Encode::utf8
279 ! Encode.pm Encode.xs
280 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
281 :encoding(UTF-8) to handle partial chars at end of buffers
283 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
284 ! lib/Encode/Supported.pod
285 More nitpickings applied.
288 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
289 tests were completely useless. To redeem that and get the peace of
290 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
291 ucm/*.ucm Since this script takes too long to finish (30 seconds on
292 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
293 but you can easily run that by either renaming it or:
295 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
296 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
297 Other CJKT cp9?? also updated according to the URI below;
298 http://www.microsoft.com/typography/unicode/cscp.htm
299 + bin/ucmsort MANIFEST
300 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
301 to proper order. intended for hardcore develpers only.
302 ! ucm/cp932.ucm JP/JP.pm AUTHORS
303 CP932 mapping which was based upon the mapping file at unicode.org
304 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
305 has also supplied the patch so he was added to AUTHORS.
306 ! lib/Encode/Supported.pod
307 ISO-8859-11 != TIS 620
308 == TIS 620 + \xA0 ( )
310 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
312 1.77 2002/10/06 03:27:02
314 * Modified to accomodate up and comming patch by Inaba-san that
315 will fix tr/// needing eval qq{}
316 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
318 * pod fixes/enhancements to reflect the changes above
319 ! lib/Encode/Alias.pm
320 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
321 Message-Id: <20021001015648.GB18710@not.autrijus.org>
323 1.76 2002/08/25 15:09:51
325 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
326 but naturally identical to previous version -- dankogai
328 Codepoint fixes -- autrijus
329 Message-Id: <20020805040236.GC5220@not.autrijus.org>
331 copied everything under perl-5.8.0/ext/Encode to make sure Encode
332 is in sync w/ perl core
334 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
335 Now that binmode(FH) does implicit ":bytes" revisit
336 the failing tests. The worrisome one is the Digest::MD5
337 test-- how will it fare in CRLF lands now?
339 From: Radu Greab <radu@netsoft.ro>
340 Date: Mon, 10 Jun 2002 00:40:34 +0300
341 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
342 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
343 ones which I cannot figure out.
344 ! lib/Encode/Alias.pm
345 Subject: [Encode PATCH] spurious warning
346 From: Nicholas Clark <nick@unfortu.net>
347 Date: Sun, 2 Jun 2002 20:26:22 +0100
348 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
350 1.75 2002/06/01 18:07:49
351 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
352 glibc compliance cited by Autrijus.
353 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
354 ! bin/enc2xs bin/piconv
355 Subject: Re: forewarning: usedevel and versiononly
356 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
358 1.74 2002/05/28 18:33:15
359 + ucm/null.ucm ucm/ctrl.ucm
360 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
361 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
362 Subject: Re: unicode -> &# notation
363 Message-ID: <20020518193704.GB40272@not.autrijus.org>
365 1.73 2002/05/28 17:26:18
366 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
367 Chris Nandor has fixed Encode so that it works w/ MacPerl --
368 at least w/ PPC (68k need static linking which does not work due to
369 64k limit). pudge is added to AUTHORS (I'm surprised he was not
370 there in the list). Encode/Makefile_PL.e2x was additionally fixed
371 by dankogai to reflect changes in other Makefile.PL
372 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
374 Subject: Change 16746: -Mutf8 cleanup.
375 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
377 1.72 2002/05/20 15:49:56
379 Subject: [PATCH] Encode should be in perl-core library path
380 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
381 Message-Id: <20020520161201.A11019@alpha.hut.fi>
382 ! lib/Encode/MIME/Header.pm
383 Subject: [PATCH] Encode::MIME::Header
384 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
385 ! Encode/Makefile_PL.e2x
386 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
387 Message-Id: <20020519201031.GA1603@not.autrijus.org>
388 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
389 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
391 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
392 Subject: [PATCH] good day for WinCE port of perl.
393 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
396 Even more linting by Robin via @16532
398 Even more typecast by Sarathy in @16460
400 1.71 2002/05/07 16:22:42
402 even more typecasts by Robin
403 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
405 A very strange bug that was causing a bugus ucm -> C table
406 generation that was revealed by a UCM file that Andreas was
407 working. This is the king of wierdest bug I've encountered
408 in the course of Encode maintenance.
409 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
411 1.70 2002/05/06 10:26:48
413 Made more 'module-safe' with conjunction w/ 'no encoding'.
414 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
415 ! lib/Encode/Encoding.pm
416 'require Encode' because ->Define uses Encode::define_encoding();
417 problem and solution addressed by Miyagawa-kun
418 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
420 Cuts the frill to make djgpp happier, as suggested by Laszlo
421 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
423 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
424 Message-Id: <m3bsbug48n.fsf@anima.de>
426 1.69 2002/05/04 16:41:18
427 ! lib/Encode/MIME/Header
428 Floating-point coerced for UNICOS (in integer arithmetics it folds
429 line one character too early). Verification by Mark is pending.
430 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
432 more doc patch from Elizabeth
433 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
434 ! Encode/Makefile_PL.e2x
435 More platform-independent patch from Benjamin
436 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
437 ! lib/Encode/Guess AUTHORS
438 split regex fix by Graham Barr. Adds him to AUTHORS.
439 Message-Id: <20020504085419.E95940@valueclick.com>
440 ! Encode/Makefile_PL.e2x
441 enc2xs script discovery made smarter and more sensible, first cited
442 by Miyagawa-kun and further suggestions by Rafael and Andreas
443 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
444 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
446 1.68 2002/05/03 12:20:13
447 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
448 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
449 and Supported.pod modified to reflect the change. Elizabeth added
450 to Authors. And H.M. is also added for forwarding her patch among
451 other contributions (I was rather surprised to find his name was not
453 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
455 1.67 2002/05/02 07:33:09
457 Error message now consistent w/ perlqq (\N{U+} -> \x{})
458 done in perl@16308 but Philip linted me further. Now the error
459 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
460 ! lib/Encode/Guess.pm
461 Sanity check for happier -w by Autrijus
463 1.66 2002/05/01 05:41:06
464 ! Encode.xs t/fallback.t
465 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
466 while fallback is in effect. This even came with a welcome side-effect
467 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
468 also modified to test this.
469 And of course, the corresponding varialbles to UV[Xx]f are appropriately
470 cast. This should've concluded NI-XS homework.
472 encode(undef) does warn again! Repented upon suggestion by NI-XS.
473 Document for unless vs. '' added
474 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
476 1.65 2002/04/30 16:13:37
478 encode(undef) no longer warns for C<Use of uninitialized value in
479 subroutine entry>. Suggested by Paul.
480 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
481 ! lib/Encode/Supported.pod
482 Encode::MIME::Header and Encode::Guess mentioned
483 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
484 ! lib/Encode/Guess.pm
485 POD fix by Miyagawa-kun
486 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
488 1.64 2002/04/29 06:54:06
490 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
491 encocde("euc-jisx0213") are still DIFFERENT.
492 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
494 A few white spaces corrected by NI-XS via PerlIO integration to
496 Subject: Change 16247: Integrate perlio;
498 Document fixes by Andreas
499 Message-Id: <m3k7qsf1we.fsf@anima.de>
501 1.63 2002/04/27 18:59:50
502 ! lib/Encode/Encoding.pm
503 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
504 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
505 Make use of the Encode::Encoding base class!
506 And other cleanups in Encode.xs upon NI-XS suggestions
507 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
509 1.62 2002/04/27 11:17:39
511 encodings() now just check %ExtModule instead of eval{require}
512 all of them for ":all" to conserve more memory.
514 more "%x" -> "%" UVxf stuff.
516 s/=over2/=over 2/g # oops.
518 1.61 2002/04/26 03:02:04
520 Now does decent tests besides use_ok()
521 ! lib/Encode/Guess.pm t/guess.t
522 UI streamlined, document added
524 various signed/unsigned mismatch nits (#16173)
525 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
527 POD: utf8-flag-related caveats added. A few sections completely
531 Thou shalt not assume %d works, either!
532 Robin Baker added to AUTHORS for this
533 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
535 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
537 1.60 2002/04/24 20:06:52
539 "Thou shalt not assume %x works." -- jhi
540 Message-Id: <20020424210618.E24347@alpha.hut.fi>
541 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
542 low-memory build machines happy, now *.c is created for each *.ucm
543 (no table aggregation). You can still override this by setting
544 $ENV{AGGREGATE_TABLES}.
545 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
546 + lib/Encode/Guess.pm
547 + lib/Encode/JP/JIS7.pm
548 Encoding-autodetect (mainly for Japanese encoding) added. In a
549 course of development, JIS7.pm was improved.
550 + lib/Encode/HTML/Header.pm
551 + lib/Encode/Config.pm
552 MIME B/Q Header Encoding Added!
553 ! Encode.pm Encode.xs t/fallback.t
554 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
555 Message-Id: <20020424130709.GA14211@tanglefoot>
557 1.59 $ 2002/04/22 23:54:22
558 ! Encode.pm Encode.xs
559 needs_lines() and perlio_ok() are added to Internal encodings such
560 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
562 1.58 2002/04/22 23:54:22
564 s/MacChineseSimp/MacChineseTrad/ # ... oops.
569 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
570 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
571 t/(JP|KR).t are aggregated to t/CJKT.t
572 test data are all remade via bin/ucm2text.
573 And .... They are no longer skipped for -Uuseperlio !
575 1.57 2002/04/22 20:27:30
576 ! t/JP.t t/KR.t t/perlio.t
577 unless (find PerlIO::Layer 'perlio') ... line is back again.
578 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
579 need PerlIO. Sigh....
580 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
581 ->perlio_ok now does eval{ require PerlIO::encoding } there so
582 it correctly returns 1 when PerlIO::encoding is yet loaded.
584 perl-current patch #16072 reflected
586 1.56 2002/04/22 09:48:07
587 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
588 New PerlIO::encoding 0.04 compliance met
590 1.55 2002/04/22 03:43:05
591 ! Encode.pm Encode.xs Unicode/Unicode.pm
592 needs_lines() defined so Encode::Encoding is no longer needed
595 1.54 2002/04/22 02:50:01
596 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
597 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
598 now perlio_ok is true by default if PerlIO::encoding->VERSION is
599 0.03 or larger. POD in Encode::Encoding revised to reflect this.
600 Encode::XS and Encode::Unicode now has perlio_ok() method.
601 ! lib/Encode/Supported.pod
602 s/UP-UX/HP-UX/ by jhi
603 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
604 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
605 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
606 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
607 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
608 ! lib/Encode/Supported.pod
609 Huge document fixes by Philip.
612 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
613 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
615 binmode() after "<:encoding" to make Win32 happy, by Mattia.
616 Mattia added to AUTHORS file
617 Message-Id: <3CC3150F.5798.22A05AE@localhost>
619 1.52 2002/04/20 23:43:47
621 TODO: is now SKIP:, as NI-XS requested. Also adds more
622 eraborate failure analysis added.
624 A note on how to make sure of round-trip safety added to POD
625 section (so Autrijus is happier)
626 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
627 big5-(eten|hkscs) is round-trip safe again!
628 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
630 Typo fixes by Andreas
631 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
632 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
633 PerIO coodination patches from NI-XS.
634 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
636 1.51 2002/04/20 09:58:23
638 Updated test suite by Autrijis so "make test" is happy again
639 Message-Id: <20020420082104.GA25037@not.autrijus.org>
641 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
643 TW/TW.pm TW/Makefile.PL
644 Updates by Autrijus. 'big5' is no longer a canonical but an
645 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
646 Message-Id: <20020419195346.GA19597@not.autrijus.org>
648 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
649 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
651 PerlIO detection a little bit smarter; no longer uses eval qq{}
654 1.50 2002/04/19 06:13:02
655 ! ! Encode.pm Encode.xs Encode/encoding.h
657 New Fallback API imlemented and documented. See "perldoc Encode"
659 ! lib/Encode/JP/JIS7.pm Encode.pm
660 + lib/Encode/PerlIO.pod t/perlio.t
661 API compliance met. However, it still does not work unless perlio
662 implements line buffer. See BUGS section in perldoc Encode::PerlIO
663 As a sensible workaround, perlio_ok() added to Encode.
665 ! lib/Encode/Supported.pod
667 Message-Id: <20020418174647.J8466@alpha.hut.fi>
669 Doc fixes from Autrijus
670 Message-Id: <20020418144131.GA10987@not.autrijus.org>
672 perlqq mode documented
674 + t/jisx0201.euc t/jisx0201.ref
675 ! t/jisx0208.euc t/jisx0208.ref
676 t/JP.t tests more rigorously and with other encodings
677 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
678 in jis0208 (used to contain jisx0201 part).
679 ! Encode/Makefile_PL.e2x
680 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
681 enc2xs and encode.h rather than hard-coded. This allows the resulting
682 module fully CPANizable.
683 ! encoding.pm t/JP.t t/KR.t
684 PerlIO detection simplified (checks %INC instead of eval{})
685 ! Encode.xs Encode/encode.h
686 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
687 - lib/Encode/Unicode.pm
688 (en|de)code_xs relocated to where it belongs. Source reindented
691 Additional (U8 *) cast added as suggested by jhi
692 Message-Id: <20020417165916.A28599@alpha.hut.fi>
694 1.42 Date: 2002/04/17
696 no-op module; Thought of adding a pod there but enc2xs has
701 correct mechanism to detect Perlio::encoding layar installed.
703 PerlIO Layer detached.
705 1.41 2002/04/16 23:35:00
707 binmode(STDIN|STDOUT ...) done iff PerlIO is available
709 Cleaned up PerlIO skip conditions to prepare for the upcoming
710 Encode - PerlIO forking.
712 exported functions are now prototyped.
713 ! lib/Encode/CN/HZ.pm
716 fallback implemented # was /* FIXME */
717 affected programs revised to fit (only HZ was using the try-catch
718 approach which needed to be fixed for API-compliance).
720 ! Encode/KR/2022_KR.pm
722 can find =head1 NAME now, jhi
723 Message-Id: <20020416083059.V30639@alpha.hut.fi>
727 now complies with less warnings with the pickest compilers.
728 Suggested by Craig, fixed by Dan.
729 ! Encode/Makefile_PL.e2x
731 A bug that fails to find *.e2x in certain conditions fixed
733 1.40 2002/04/14 22:27:14
734 + Encode/ConfigLocal_PM.e2x
735 ! lib/Encode/Config.pm
737 "enc2xs -C" now generates/updates Encode::ConfigLocal.
738 ConfigLocal_PM.e2x is a skelton thereof.
739 ! lib/Encode/Config.pm
741 "use Encode::CN::HZ;" was missing.
744 More rigorous tests added to test XS, especially on memory allocation.
746 ! lib/Encode/Unicode.pm
747 NI-S implemented an XS version -- merged
748 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
751 Source filter option added. With this option on, you can write
752 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
753 encodings. t/jperl.t enhanced to test this feature.
755 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
756 apply his suggestion, this degree of nitting is enough to add him
758 Message-Id: <3CB93223.291E5E2E@earthlink.net>
760 + lib/Encode/JP/JIS7.pm
761 - lib/Encode/JP/JIS.pm
762 - lib/Encode/JP/2022_JP.pm
763 - lib/Encode/JP/2022_JP1.pm
764 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
765 JIS7.pm for better maintainability and performance
767 Added caveat for non-ascii identifiers.
769 fixes by jhi, the original author of this pragramtic module.
770 Message-Id: <20020413231527.V1826@alpha.hut.fi>
772 1.34 2002/04/12 20:23:05 (Unreleased)
775 EBCDIC fixes addressed by jhi.
776 Message-Id: <20020412161844.D9383@alpha.hut.fi>
777 ! lib/Encode/Encoder.pm
778 POD fix by Miyagawa-kun
779 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
781 1.33 2002/04/10 22:28:40
783 Philip's mail address corrected.
786 ! lib/Encode/Encoder.pm
787 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
788 This adds Miyagawa-kun to AUTHORS.
789 * encoding() no longer exported by default but on demand
790 * t/Encoder.t updated to test all these
791 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
792 ! lib/Encode/Unicode.pm
793 ! lib/Encode/Supported.pm
794 Further doc fixes by Anton
796 1.32 2002/04/09 20:06:15
799 - ucm/macDevanaga.ucm Unicode Character Map
800 - ucm/macGujarati.ucm Unicode Character Map
801 - ucm/macGurmukhi.ucm Unicode Character Map
802 A utility to check integrity of .ucm files. t/bogus.ucm is a
803 ucm that is deliberately bogus. unused Indic mappings are removed
806 resolve_alias() added as suggested by jhi. Same as
807 find_encoding("alias")->name. For convenience. This one is
808 defined in Encode.pm instead of Alias.pm.
809 Message-Id: <20020409215846.H17022@alpha.hut.fi>
811 Memory Allocate but detected during the devel of ucmlint -- fixed.
812 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
813 ! lib/Encode/Unicode.pm
814 valid_ucs2(0) is false but must be true.
815 3 patches from NI-S as follows. This also has fixed the incident
817 ! lib/Encode/Alias.pm
818 find_alias() recursion prevention
820 Checks for the patch above
821 ! t/Encode/Unicode.pm
822 An extra "F" that causes valid_ucs2() return a bogus value fixed
823 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
824 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
825 2 Small Patches from jhi as follows:
827 Encode->encodings() lists in case-insensitve order (as it was)
829 -l option prints avaiable encodings to STDOUT instead of STDERR
830 ! lib/Encode/Aliases.pm
831 s/defintion/definition/
832 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
834 ! lib/Encode/Supported.pod
835 ! lib/Encode/Unicode.pm
836 POD revise by Philip Newton. This adds Philip to AUTHORS list.
837 Thank you for the exact quote of Douglas Adams :)
838 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
840 1.31 2002/04/08 18:08:07
841 ! lib/Encode/Encoder.pm
843 Encode::Encoder, once just a placeholder of an idea, is now much more
844 practical. See t/Encode.t to find how practical it can be.
845 + lib/Encode/Config.pm
847 my false laziness at Encode.pm is fixed. Now %ExtModules are set
848 in Encode::Config and they are all literally, not programatically
849 set. My false laziness was resulting many encodings missing from
851 ! lib/Encode/Unicode.pm
853 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
854 so that it does not rely Encode::Unicode for BOM values
855 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
857 1.30 2002/04/08 02:34:51
858 + lib/Encode/Encoder.pm
859 Object Oriented Encoder. I reckon something like this is in need.
862 ! lib/Encode/Supported.pod
863 * autoloading bug that prevented upper-case canonicals such as UTF-16
864 is fixed. Now even UTF/UCS are autoloaded!
865 * encodings() is now more intuitive.
866 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
868 * Obligatory fixes to the POD.
869 ! lib/Encode/Supported.pod
870 Patch from Anton applied.
871 Message-Id: <66641479.20020408033300@motor.ru>
873 ! lib/Encode/Unicode.pm
874 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
876 1.28 2002/04/07 18:58:42
880 Just a MANIFEST for those missing files.
882 1.26 Date: 2002/04/07 15:22:04
885 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
886 And t/Aliases.t fixed to make sure they all exist.
888 1.25 2002/04/07 15:01:25 (Unreleased)
890 ! lib/Encode/Unicode.pm
893 - lib/Encode/UTF_EBCDIC.pm
894 - lib/Encode/Internal.pm
896 Integrated into Encode.pm as closures. That way "one package, one file"
897 rule is preserved yet less files to require.
899 commented out binmode(STDERR ...
908 ! Encode/Makefile_PL.e2x
909 Schwarn's MM-compliance patch merged
910 Message-Id: <20020406082609.GA28758@blackrider>
912 ! lib/Encode/Unicode.pm
913 + lib/Encode/UTF_EBCDIC.pm
915 - lib/Encode/10646_1.pm
916 - lib/Encode/ucs2_le.pm
917 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
918 per-module based (en|de)code, I saved a number of .pm by
919 reorganizing it as per-object base (Well, this is what Encode::XS
920 does under the hood). See Encode::Unicode for details.
921 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
922 This module is used only on EBCDIC environments.
924 1.21 2002/04/05 14:46:34 (Not Released)
930 Are back to make Perl/Tk happy Smile, NI-S.
932 ! lib/Encode/Alias.pm
933 ! lib/Encode/Supported.pm
934 ! lib/Encode/10646_1.pm
935 ! lib/Encode/ucs2_le.pm
936 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
937 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
938 updated to reflect changes.
939 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
940 ! lib/Encode/Alias.pm
941 ! lib/Encode/Supported.pm
942 Anton's revision commited. Added Dan's own fixes as well.
943 Message-Id: <159103166906.20020405161134@motor.ru>
944 ! lib/Encode/Alias.pm
946 < qr/^UCS2-le$/i => '"UCS-2"', );
948 > qr/^UCS2-LE$/i => '"UTF-16LE"');
949 Sigh. Thank you, Anton.
950 Message-Id: <14567692196.20020405062020@motor.ru>
951 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
953 1.20 2002/04/04 19:50:52
955 the last minute addtion. Just give it a try. Docs remains to be done.
956 Not installed by default.
957 ! lib/Encode/Supported.pod
960 ! lib/Encode/Alias.pm
962 ! lib/Encode/10464_1.pm
963 ! lib/Encode/ucs2_le.pm
964 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
965 unchanged but UTF-16BE is added as an alias. Implicit aliases
966 move to Encode::Alias so init_alias() works more as expected.
967 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
968 Though pedantically wrong, This should make perl mongers happier.
969 t/Alias.t is enhanced to test all these.
970 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
972 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
973 of the codepoints are common. CJKT left as is because the saving is
979 ! Encode/Makefile_PL.e2x
987 All occurance of _def.h replaced with .exh so djgpp works happily
988 ever after! To credit this amazing discovery, Laszlo is now in
990 Message-Id: <20020403181424.GA8778@freemail.hu>
991 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
994 ! Encode/Makefile_PL.skel
996 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
997 Message-Id: <20020401222744.GX2000@blackrider>, et al.
999 Two more tests by added jhi
1000 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1003 The showstopper fixed -- Memory reallocation bug was causing
1004 Encode::XS to fall into infinite loop on certain conditions.
1005 t/grow.t tests that.
1006 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1011 ! lib/Encode/Supported.pod
1012 Vendor encodings rebuilt out of original map files at unicode.org.
1013 Indic languages such as MacDevanagali remain unspported do to the
1014 shortcoming of encengine capabilities (they need algorithmical
1015 conversion and I have no knowledge on that!). Pods fixed for added
1017 Oh, macJapan.ucm renamed to macJapanese.ucm.
1018 macROMnn is macRomanian and macRUMnn is macRumanian.
1019 txt2ucm is a crude script that is used to convert them.
1021 Unicode Compound Characters (used extensively on Mac) supported
1023 Typo fixes and improvements by jhi
1024 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1026 1.11 2002/03/31 22:12:13
1030 Missing files from the MANIFEST fixed.
1031 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1032 Version incremented just to make CPAN happy.
1034 1.10 2002/03/31 21:32:42
1037 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1038 if you want. This should make Autrijus happy. Also, piconv
1039 is added to default install.
1042 Here-documented files that enc2xs generates are now exported
1043 to *.e2x. Much cleaner and easier to debug.
1045 encoding enhances so you can make it act more like such
1046 (now prehistoric ) "localized" variations of perl like Jperl.
1048 Further test for encoding.pm. Written in euc-jp
1051 Taken over form jhi.
1052 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1057 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1064 ! Encode/macIceland.ucm
1065 ! lib/Encode/Alias.pm
1066 ! lib/Encode/Supported.pod
1067 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1068 Message-Id: <m3lmcavhjt.fsf@anima.de>
1070 1.01 2002/03/29 20:59:39
1073 s/USE_SCRIPTS/MORE_SCRIPTS/
1075 installs enc2xs by default for external Encode:: modules in CPAN,
1076 such as Encode::HanExtra
1078 More sensible perl core detection via $ENV{PERL_CORE}
1080 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1082 Perl core ditection via $^X =~ m/\bminiperl$/o
1083 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1085 1.00 Wed Mar 29 2002
1087 The version of all files is updated to 1.00 via "ci -f -l1.00",
1088 commemorating version 1.00. All files, including *.ucm are now
1089 under version control.
1092 encode.h moved to Encode/ so it will be installed for the later
1095 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1100 compile renamed to enc2xs.
1101 Affected Makefle.PL updated
1103 "Punt it. HanExtra can take care of that later." -- Autrijus
1104 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1107 ! Encode/ksc5601.ucm
1108 ! lib/Encode/CJKConstants.pm
1109 ! lib/Encode/KR/2022_KR.pm
1110 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1111 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1115 bin/ added for example scripts. They are not installed by default.
1116 to install them, "perl Makefile.PL USE_SCRIPTS".
1117 piconv is iconv reinvented in perl. in addition to all features
1118 of iconv, it also adds perlish features. See L<piconv/1> for more
1120 ! lib/Encode/Alias.pm
1121 qr/^ replaced with qr/\b so it directly matches locale names
1122 such as en_US.US-ASCII
1125 Patch by MJD to fix the following problem applied.
1126 Subject: [PATCH 5.7.3 Encode]
1127 Aliases.t not properly skipped when Encode extension not built
1128 Message-Id: <20020328091850.18677.qmail@plover.com>
1129 ! lib/Encode/KR/2022_KR.pm
1130 ! lib/Encode/CJKConstants.pm
1131 Another patch from Jungshik to make iso-2022-kr actually work
1132 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1133 ! Encode/Encode/euc-kr.ucm
1134 + Encode/Encode/johab.ucm
1135 ! Encode/Encode/ksc5601.ucm
1137 ! Encode/KR/Makefile.PL
1138 ! Encode/lib/Encode/Alias.pm
1140 Johab support and complete revision of Korean Encoding by Jungshik
1141 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1143 Revised to make up with now-dropped Encode::Details.
1144 - lib/Encode/Details.pod
1145 Dropped. Besides being obsolete, the topics are now covered in
1146 respective pods now.
1151 Korean aliases fixed thanks to Jungshik Shin
1152 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1153 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1155 =head1 NAME added to all modules to make buildtoc happy
1156 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1157 - lib/Encode/CJKguide.pod
1158 Too controversial and dropped from the dist. Will be available
1159 separately on the web.
1161 RCS tags added so table debugging gets easier (should that be
1162 needed! I hope they all stay 1.00!)
1163 + lib/Encode/CJKguide.pod
1164 A detailed guide to mainly, but not limited to, CJK multibyte
1167 + Encode/hp-roman8.ucm
1169 ! Encode/Supported.pod
1170 All occurance of "roman8" replaced with "hp-roman8" to avoid
1172 ! Encode/Supported.pod
1175 Mac Encodings now comply the Inside Macintosh
1177 Test for '-raw' conventions added.
1179 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1180 ! Encode/gb12345.ucm
1182 ! Encode/ksc5601.ucm
1183 "-raw" appended to canonical names.
1184 File mames stay unchanged thanks to UCM format.
1185 ! lib/Encode/CN/HZ.pm
1186 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1187 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1189 0.99 Tue Mar 26 2002
1190 - lib/Encode/JP/Const.pm
1191 + lib/Encode/CJKConstants.pm
1192 + lib/Encode/CN/2022_CN.pm
1193 + lib/Encode/KR/2022_KR.pm
1202 * Support for ISO-2022-KR and ISO-2022-CN added.
1204 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1205 * ucm2table autogenerates character table out of UCM files.
1208 - lib/Encode/Supports.pod
1209 + lib/Encode/Supported.pod
1210 Names reverted due to popular demand.
1211 8.3 rule applies only when there is a conflict.
1212 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1217 - lib/Encode/Format/Enc.pod
1219 * Character tables is now 100% ucm.
1220 * All files under Encode/ is now 8.3-compliant
1221 * some of missing encodings added (i.e. gsm0338 and nextstep)
1222 * Vendor mappings aggregated with appropriate national std in
1223 Makefile.PL, resulting smaller *.so especially for CJK.
1224 Following is result on Dan's FreeBSD box.
1226 ---------------------------------------------------------------
1227 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1228 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1229 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1230 blib/arch/auto/Encode/Encode.so 27,791 27,791
1231 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1232 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1233 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1234 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1235 ---------------------------------------------------------------
1236 Total 5,375,297 6,343,819
1238 * As a result of ucm-transition, Encode::Tcl dropped because
1239 Encode::Tcl demands *.enc.
1240 Encode::Tcl will be supplied in a separate tarball with *.enc.
1241 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1246 -lib/Encode/Supported.pod
1247 +lib/Encode/Supports.pod
1248 -lib/Encode/iso10646_1.pm
1249 +lib/Encode/10646_1.pm
1250 -lib/Encode/EncFormat.pod
1251 +lib/Encode/Format/Enc.pod
1252 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1253 - lib/Encode/JP/Constants.pm
1254 + lib/Encode/JP/Consts.pm
1255 ! lib/Encode/JP/JIS.pm
1256 ! lib/Encode/JP/H2Z.pm
1257 Version nit problem and 8.3 rule fix.
1258 > Package namespace installed latest in CPAN file
1259 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1260 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1261 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1262 accordingly. In addition, legacy "use vars qw()..." are replaced with
1264 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1265 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1267 - lib/Encode/JP/ISO_2022_JP.pm
1268 - lib/Encode/JP/ISO_2022_JP_1.pm
1269 + lib/Encode/JP/2022_JP.pm
1270 + lib/Encode/JP/2022_JP1.pm
1272 8.3 naming conflict for vanilla fat addressed by jhi
1273 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1276 Typecast fix addressed by jhi
1277 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1279 0.98 Mon Mar 25 2002
1280 ! lib/Encode/Supported.pod
1282 + lib/Encode/JP/ISO_2022_JP_1.pm
1283 ! lib/Encode/JP/ISO_2022_JP.pm
1284 ! lib/Encode/JP/JIS.pm
1286 Now Encode::JP is more strict on the difference between ISO-2022-JP
1287 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1288 makes Anton happier :) FYI the previous version implements
1289 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1290 ! lib/Encode/Supported.pod
1293 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1294 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1299 ! lib/Encode/Suppoted.pod
1300 pod fixes to replace F<http://...> to L<http://...>,
1301 as suggested by Autrijius in:
1302 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1303 ! lib/Encode/Suppoted.pod
1304 fixes and enhancements by Anton
1305 Message-Id: <10632060120.20020324103753@motor.ru>
1306 ! lib/Encode/Alias.pm
1307 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1308 added. Suggested by Anton then deobfuscated by Autrijius
1309 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1311 Further fix by Nicholas Clark
1312 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1313 - lib/EncodeFormat.pod
1314 + lib/Encode/EncFormat.pod
1316 File renamed as suggested by Autrijius
1318 ! lib/Encode/Details.pod
1319 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1320 ! Encode.pm Sun Mar 24 13:43:47 2002
1321 pod fixes by Autrijius.
1322 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1323 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1325 ! lib/Encode/Alias.pm
1327 now more EBCDIC conscious;
1328 %ExtModules on EBCDIC system excludes CJK so that you don't
1329 have to worry about the matched alias resulting cloaking.
1330 t/Alias.t also revised to reflect changes. Verified by jhi
1331 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1333 0.97 Sun Mar 24 2002
1337 EBCDIC detection mechanism installed as in JP/JP.pm
1338 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1341 ! EBCDIC/Makefile.PL
1344 ! Symbol/Makefile.PL
1346 Now all table files used by compile are postfixed '_t' to avoid
1347 namespace collisions in case insensitive file systems once for all!
1349 Message-ID: <58290227735.20020323195659@familiehaase.de>
1351 Since the Encode::JP is unsupported under EBCDIC we
1352 cannot run this test (aliases as such should work fine) -- jhi
1353 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1355 duplicate occurance of ascii.ucm and 8859-1.ucm
1356 causes MacOS X dlyd to cloak
1362 < chdir 't' if -d 't';
1364 > if (! -d 'blib' and -d 't'){ chdir 't' };
1365 When you are "make test"-ing on Encode/ directory, you must not
1366 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1367 remain unchanced. Also the situation detection was made simpler
1368 in t/JP.t, which was originally;
1369 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1371 "Use of uninitialized value in string eq at Encode.pm line 96."
1372 ! Symbol/Makefile.PL
1373 ! EBCDIC/Makefile.PL
1375 -- Problem on case insensitive file systems
1376 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1377 Message-ID: <88254111953.20020323095503@familiehaase.de>
1380 "So I think it's a bug in gcc, not perl. But it still needs to be
1382 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1383 Message-Id: <20020323170509.C96475@plum.flirble.org>
1385 0.96 Sat Mar 23 2002
1387 ! lib/Encode/Encoding.pm
1388 ! lib/Encode/Alias.pm
1389 ! lib/Encode/Supported.pod
1391 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1392 Message-ID: <20020322073908.GB10539@blackrider>
1395 "...I think we should include ISO 8859-1 as well." -- NI-S
1396 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1401 ! lib/Encode/Alias.pm
1402 alias definitions relocated to Encode::Alias so module autoloading
1403 works for aliases also.
1405 encodings() now accepts args to check ExtModules.
1409 + EBCDIC/Makefile.PL
1410 + Symbol/Makefile.PL
1414 Latin and single byte encodings are reorganized so they are
1415 demand-loaded like Encode::XX. Now only ascii is compiled into
1417 ! lib/Encode/Alias.pm
1418 for my $k (keys %hash){ delete $hash{$k}; }
1419 is depreciated; fixed.
1421 0.95 Fri Mar 22 2002
1422 In this update, pod rewrites and alias fixes are the main issues
1423 + lib/Encode/Supported.pod
1424 Describes supported encodings
1426 streamlined compiled-in encodings.
1427 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1429 + Encode/ibm-125?.ucm
1430 Added from icu distibution with any occurance of
1431 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1432 some respect to icu staff, however.
1433 + lib/Encode/Alias.pm
1435 Alias difinitions in Encode.pm relocated.
1438 packWARN patch from Paul Marquess via jhi
1439 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1440 Paul added to AUTHORS as a result.
1441 ! t/CJKalias.t -> t/Aliases.t
1442 Renamed. Checks even more aliases and alias overloading
1445 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1446 gbk => cp936 relocated to CN.pm
1448 Test::More with plans (by jhi)
1450 0.94 Thu Mar 21 2002
1451 + lib/Encode/Description.pod
1452 ! lib/Encode/Encoding.pm
1453 Now the pod in Encode.pm is abridged as programming references.
1454 lib/Encode/Description.pod contains the original, detailed description
1455 and Encode::Encoding explains how to write your own module to
1456 add new encodings. So far, lib/Encode/Description.pod contains
1457 the whole pod once in Encode.pm. This is intentional.
1459 Pod revisions by Anton Tagunov
1460 Message-Id: <517178431.20020320174824@motor.ru>
1462 all occrance of Encode::Tcl::Extended removed including pod
1464 test now checks $encoding->name only; $encoding->{name} are
1465 no longer check to find the canonical name.
1466 ! lib/Encode/JP/JIS.pm
1467 ! lib/Encode/JP/ISO_2022_JP.pm
1468 ->name() added to be more compliant with API
1474 Patch by Autrijus to add aliases to TW and fixes to POD
1475 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1477 SADAHIRO Tomoyuki added as should. My apologies.
1479 0.93 Wed Mar 20 2002
1480 * First release to be uploaded to CPAN. For prehistoric changes,
1481 please see Changes file of perl distibution as well as
1482 perl-unicode@perl.org archive, available at:
1483 http://archive.develooper.com/perl-unicode@perl.org/
1485 Changes Since 0.92 includes;
1490 + Mention to perl-unicode@perl.org added
1492 + Encoding aliases added so you can feed locale names
1493 and MIME Charset="" directly.
1494 - Mention to JISX0212 removed because it's fixed
1497 + Encoding aliases added. Note TW is left untouched because
1498 euc-tw is not implemented in TW but in Encode::HanExtra.
1499 Autrijus, you may fix Encode::HanExtra.
1501 + to test encode aliases added