Upgrade to Encode 1.93.
[p5sagit/p5-mst-13.2.git] / ext / Encode / Changes
CommitLineData
fab31126 1# Revision history for Perl extension Encode.
2#
8676e7d3 3# $Id: Changes,v 1.93 2003/04/24 17:43:16 dankogai Exp $
fab31126 4#
8676e7d3 5$Revision: 1.93 $ $Date: 2003/04/24 17:43:16 $
6! t/enc_eucjp.t
7 added "no warnings 'pack'" in for loop to keep bleedperl from
8 complaining "Character in 'C' format wrapped in pack".
9! Makefile.PL
10 More elegant perl core detection inspired by Ilya Zakharevich
11 (but further elaborated for general cases).
12! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
13 POD fixes.
14! t/euc-jp.ucm
15 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
16 are less likely to be confused w/ ISO-8859-*
17! t/CJKT.t
18 RT tests added (vendor encodings are exemplified)
19 -- that successfully found a flaw on iso-2022-kr before the patch.
20! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
21 decode("ISO-2022-KR") has been buggy but no one ever sited
22 that since no one seems to be using it. Bugs discovered by
23 SADAHIRO-san
24 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
25! lib/Encode/CN/HZ.pm t/perlio.t
26 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
27 so it adds test for HZ.
28 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
29! lib/Encode/Guess.pm
30 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
31 So long as the string contains \x{00}-\x{ff} it does not fail.
32 See perldoc for details.
33 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
34
351.92 2003/03/31 03:27:27
9735c3fc 36! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
37 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
38 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
39 but glibc-BIG5-2.1.2.ucm does.
40 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
41! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
42 Maps regenerated again but this time based upon
43 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
44 (But where is THE DOCUMENT by MICROSOFT?)
45! t/enc_module.t AUTHORS
46 failure with threaded Perl on FreeBSD addressed.
47 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
48 Message-Id: <20030322230131.GA813@ratsnest.hole>
49! lib/Encode/Guess.pm
50 Some POD fixes.
51! t/CJKT.t
52 Change 18989: Make the :bytes conditional on PerlIO.
53 further Modified by Dan Kogai
54 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
55! t/enc_module.t
56 Chnage 18966: another fix for failing test on windows ("use encoding"
57 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
58 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
59! t/CJKT.t
60 Change 18970: Hopefully this works also in Win32, by jhi
61 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
62 Change 18965: fix CJKT.t failures on windows due to incorrect
63 binmode(), by gsar
64 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
65
661.91 2003/03/09 20:07:37
0f29a567 67! encoding.pm
68 even more proofread by jhi.
69 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
70! t/enc_module.t
71 -use lib 't';
72 +use lib qw(t ext/Encode/t ../ext/Encode/t);
73 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
74! AUTHORS
75 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
76 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
77! encoding.pm
78 s/logner/longer/
79 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
80
811.90 2003/03/09 17:32:43
05ef2f67 82! encoding.pm
83+ t/enc_data.t
84 Inaba-san has added a patch for perl 5.8.1 or later that makes
85 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
86 that. POD is further revised.
87 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
88! encoding.pm t/enc_module.t
89 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
90 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
91
921.89 2003/02/28
6be7c101 93! Encode.xs
94 signed vs. unsigned issue discovered by Craig on OpenVM
95 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
96! encoding.pm AUTHORS
97+ t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
98 Because binmode() stacks layers instead of overwrite, you have to
99 ":raw :encoding()" in encoding.pm or your are in trouble when you
100 call encoding.pm multiple times. There are several workarounds
101 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
102 who was the first to address this problem was added to AUTHORS.
103 The test suites was added for this, which is a modified version
104 of SUGAWARA-san's scripts
105 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
106
1071.88 2003/02/20 14:42:34
cc7dbc11 108! Encode.xs
109 one signedness nit for Encode by jhi
110 <200302161933.h1GJX876018710@kosh.hut.fi>
111! ucm/viscii.ucm
112 VISCII map was incorrect; fixed by Sadahiro-san
113 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
114! t/enc_eucjp.t t/enc_utf8.t AUTHORS
115 You can't unlink files that are opened in cygwin but the last
116 file handle opened in t/enc_*.t left open. Patch submitted
117 by Yitzchak and he was added to AUTHORS.
118 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
119! t/CJKT.t
120 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
121 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
122! Unicode/Unicode.xs
123 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
124 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
b0b300a3 125
cc7dbc11 1261.87 2003/02/06 01:52:11
151b5d36 127! AUTHORS
128 * Inaba "Sensei" Hirohito added (I thought I have done so a long
129 ago but apparently I did not).
130 * SUZUKI Norio added for verious and useful bug reports.
131! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
132 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
133 podchecked so all warnings are gone except for L<http://>.
134! encoding.pm t/enc_eucjp.t
135 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
136 * Filter option overhaul
137 * POD revision
138! Encode.pm Encode.xs encengine.c Encode/encode.h
139 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
140 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
141 without filter. podchecked by Dan Kogai.
142 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
143! lib/Encode/Alias.pm
144 decode('alias', $1) went wild because of local $_ in find_alias()
145 the evil local $_ is eradicated but that changes find_alias()
146 format for coderef aliasing. See Encode::Alias for details
147 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
148
1491.86 2003/01/22 03:29:07
fa6f41cf 150! encoding.pm
151 * Don't forget to canonize when you attempt an exact match!
152 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
153 * ${^ENCODING} exception is off for $] > 5.008
154 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
155! t/enc_utf8.t
156 $] check commented out so it runs on 5.8.0
157
1581.85 2003/01/21 22:19:14
159! encoding.pm
160 ${^ENCODING} exception is now explicit rather than handled by regex.
161+ t/enc_eucjp.t t/enc_utf8.t
162 Test suite for the better "encoding" pragma support for bleedperl.
163 On 5.8.0, they will just be skipped.
164
1651.84 2003/01/10 12:00:16
0f7c507f 166! encoding.pm
167 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
168 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
169 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
170! Unicode/Unicode.xs
171 %_ fixes saves the resulting .so .05% smaller, by NC
172 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
173! Encode.pm
174 Silence Encode on undef, by Andreas
175 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
176 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
177! Unicode/Unicode.xs
178 s/regognised/recognised/ . British spelling left intact to pay
179 respect to two British Nicks :)
180 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
181
1821.83 2002/11/18 17:28:49
b536bf57 183! Encode.xs lib/Encode/JIS7.pm
184 Even more patches from Inaba-san has been applied. With this
185 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
186 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
187
1881.82 2002/11/14 23:06:12
189! Encode.xs
190 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
191 later found in perls w/ -Dusemymalloc was fixed by NC.
192 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
193
1941.81 2002/11/08 18:29:27
195! Encode.pm Encode.xs
196 Non-XS version of Encode::utf8 is back (with XS being default).
197 Encode::predefine_encodings(0) to turn off XS.
198 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
199 found that it was Test::More causing the problem, not Encode.
200 But I have already made it configurable so it may be useful in
201 some rare cases....
202 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
203! bin/enc2xs
204 The ingenious patch by Nicholas Clark that reduces shlib sizes by
205 50% with no penalty and backward compatibility preserved, is in.
206 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
207
2081.80 2002/10/21 20:39:09
ab3374e4 209! Encode.xs t/mime-header.t
210 Even more patches from NI-XS regarding Encode::utf8->decode().
211 And one more test to t/mime-header.t to prove it
212 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
213
2141.79 2002/10/21 06:05:37
215! Encode.xs
216 Further patches from NI-XS. Encode::utf8->decode() now checks the
217 value of utf8 flag of the argument. As a result, the fix to
218 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
219 no harm (even speedwise) I'll leave it unreverted.
220! ucm/cp949.ucm ucm/cp950.ucm
221 U+20AC EURO SIGN
222 U+00AE REGISTERED SIGN
223 were missing as a result of 1.78. Discovered by Moriyama-san.
224 Moriyama-san has also developed a test script that compares
225 (en|de)coded results to the corresponding Win32 API result and
226 all cp9?? maps are now verified.
227 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
228
2291.78 2002/10/20 15:44:00
230! lib/Encode/MIME/Header.pm
231 fixed so that it works with new Encode::utf8
232! Encode.pm Encode.xs
233 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
234 :encoding(UTF-8) to handle partial chars at end of buffers
235 correctly.
236 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
237! lib/Encode/Supported.pod
238 More nitpickings applied.
239+ t/rt.pl MANIFEST
240! t/CJKT.t
241 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
242 tests were completely useless. To redeem that and get the peace of
243 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
244 ucm/*.ucm Since this script takes too long to finish (30 seconds on
245 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
246 but you can easily run that by either renaming it or:
247 perl -Mblib t/rt.pl
248 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
249! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
250 Other CJKT cp9?? also updated according to the URI below;
251 http://www.microsoft.com/typography/unicode/cscp.htm
252+ bin/ucmsort MANIFEST
253 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
254 to proper order. intended for hardcore develpers only.
255! ucm/cp932.ucm JP/JP.pm AUTHORS
256 CP932 mapping which was based upon the mapping file at unicode.org
257 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
258 has also supplied the patch so he was added to AUTHORS.
259! lib/Encode/Supported.pod
260 ISO-8859-11 != TIS 620
261 == TIS 620 + \xA0 (&nbsp;)
262 Message-Id:
263 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
264
2651.77 2002/10/06 03:27:02
4b291ae6 266! t/jperl.t
267 * Modified to accomodate up and comming patch by Inaba-san that
268 will fix tr/// needing eval qq{}
269 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
270! encoding.pm
271 * pod fixes/enhancements to reflect the changes above
272! lib/Encode/Alias.pm
273 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
274 Message-Id: <20021001015648.GB18710@not.autrijus.org>
275
2761.76 2002/08/25 15:09:51
35e20135 277! t/big5-eten.utf
278 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
279 but naturally identical to previous version -- dankogai
280! ucm/big5-eten.ucm
281 Codepoint fixes -- autrijus
282 Message-Id: <20020805040236.GC5220@not.autrijus.org>
283= *
284 copied everything under perl-5.8.0/ext/Encode to make sure Encode
285 is in sync w/ perl core
286! t/CJKT.t t/guess.t
287 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
288 Now that binmode(FH) does implicit ":bytes" revisit
289 the failing tests. The worrisome one is the Digest::MD5
290 test-- how will it fare in CRLF lands now?
291! t/CJKT.t t/guess.t
292 From: Radu Greab <radu@netsoft.ro>
293 Date: Mon, 10 Jun 2002 00:40:34 +0300
294 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
295 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
296 ones which I cannot figure out.
297! lib/Encode/Alias.pm
298 Subject: [Encode PATCH] spurious warning
299 From: Nicholas Clark <nick@unfortu.net>
300 Date: Sun, 2 Jun 2002 20:26:22 +0100
301 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
302
3031.75 2002/06/01 18:07:49
2d06ad02 304! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
305 glibc compliance cited by Autrijus.
306 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
307! bin/enc2xs bin/piconv
308 Subject: Re: forewarning: usedevel and versiononly
309 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
310
3111.74 2002/05/28 18:33:15
f0a41339 312+ ucm/null.ucm ucm/ctrl.ucm
313! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
314 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
315 Subject: Re: unicode -> &# notation
316 Message-ID: <20020518193704.GB40272@not.autrijus.org>
317
3181.73 2002/05/28 17:26:18
319! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
320 Chris Nandor has fixed Encode so that it works w/ MacPerl --
321 at least w/ PPC (68k need static linking which does not work due to
322 64k limit). pudge is added to AUTHORS (I'm surprised he was not
323 there in the list). Encode/Makefile_PL.e2x was additionally fixed
324 by dankogai to reflect changes in other Makefile.PL
325 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
326! t/mime-header.t
327 Subject: Change 16746: -Mutf8 cleanup.
328 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
329
3301.72 2002/05/20 15:49:56
f74b3917 331! Makefile.PL
332 Subject: [PATCH] Encode should be in perl-core library path
333 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
334 Message-Id: <20020520161201.A11019@alpha.hut.fi>
335! lib/Encode/MIME/Header.pm
336 Subject: [PATCH] Encode::MIME::Header
337 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
338! Encode/Makefile_PL.e2x
339 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
340 Message-Id: <20020519201031.GA1603@not.autrijus.org>
341! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
342 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
343 AUTHORS
344 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
345 Subject: [PATCH] good day for WinCE port of perl.
346 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
347! Encode.xs
348! Unicode/Unicode.xs
349 Even more linting by Robin via @16532
350! Encode.xs
351 Even more typecast by Sarathy in @16460
352
3531.71 2002/05/07 16:22:42
d6f02b51 354! Encode.xs
355 even more typecasts by Robin
356 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
357! bin/enc2xs
358 A very strange bug that was causing a bugus ucm -> C table
359 generation that was revealed by a UCM file that Andreas was
360 working. This is the king of wierdest bug I've encountered
f74b3917 361 in the course of Encode maintenance.
d6f02b51 362 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
363
3641.70 2002/05/06 10:26:48
621b0f8d 365! encoding.pm
366 Made more 'module-safe' with conjunction w/ 'no encoding'.
367 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
368! lib/Encode/Encoding.pm
369 'require Encode' because ->Define uses Encode::define_encoding();
370 problem and solution addressed by Miyagawa-kun
371 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
372! t/Unicode.t
373 Cuts the frill to make djgpp happier, as suggested by Laszlo
374 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
375! bin/enc2xs
376 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
377 Message-Id: <m3bsbug48n.fsf@anima.de>
378
3791.69 2002/05/04 16:41:18
11067275 380! lib/Encode/MIME/Header
381 Floating-point coerced for UNICOS (in integer arithmetics it folds
382 line one character too early). Verification by Mark is pending.
383 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
384! Unicode/Unicode.pm
385 more doc patch from Elizabeth
386 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
387! Encode/Makefile_PL.e2x
388 More platform-independent patch from Benjamin
389 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
390! lib/Encode/Guess AUTHORS
391 split regex fix by Graham Barr. Adds him to AUTHORS.
392 Message-Id: <20020504085419.E95940@valueclick.com>
393! Encode/Makefile_PL.e2x
394 enc2xs script discovery made smarter and more sensible, first cited
395 by Miyagawa-kun and further suggestions by Rafael and Andreas
396! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
397 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
398
3991.68 2002/05/03 12:20:13
126bf8bf 400! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
401 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
402 and Supported.pod modified to reflect the change. Elizabeth added
403 to Authors. And H.M. is also added for forwarding her patch among
404 other contributions (I was rather surprised to find his name was not
405 there yet!)
406 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
407
4081.67 2002/05/02 07:33:09
2fc614e0 409! Encode.xs
410 Error message now consistent w/ perlqq (\N{U+} -> \x{})
411 done in perl@16308 but Philip linted me further. Now the error
412 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
413! lib/Encode/Guess.pm
414 Sanity check for happier -w by Autrijus
415
4161.66 2002/05/01 05:41:06
4089adc4 417! Encode.xs t/fallback.t
418 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
419 while fallback is in effect. This even came with a welcome side-effect
420 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
421 also modified to test this.
422 And of course, the corresponding varialbles to UV[Xx]f are appropriately
423 cast. This should've concluded NI-XS homework.
424! Encode.pm
425 encode(undef) does warn again! Repented upon suggestion by NI-XS.
426 Document for unless vs. '' added
427 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
428
4291.65 2002/04/30 16:13:37
e8c86ba6 430! Encode.pm
431 encode(undef) no longer warns for C<Use of uninitialized value in
432 subroutine entry>. Suggested by Paul.
433 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
434! lib/Encode/Supported.pod
435 Encode::MIME::Header and Encode::Guess mentioned
436 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
437! lib/Encode/Guess.pm
438 POD fix by Miyagawa-kun
439 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
440
4411.64 2002/04/29 06:54:06
b7a5c9de 442! ucm/euc-jp.ucm
443 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
444 encocde("euc-jisx0213") are still DIFFERENT.
445 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
446! Encode.xs
447 A few white spaces corrected by NI-XS via PerlIO integration to
448 Mainline
449 Subject: Change 16247: Integrate perlio;
450! Encode.pm
451 Document fixes by Andreas
452 Message-Id: <m3k7qsf1we.fsf@anima.de>
453
4541.63 2002/04/27 18:59:50
10c5ecbb 455! lib/Encode/Encoding.pm
456! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
457! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
458 Make use of the Encode::Encoding base class!
459 And other cleanups in Encode.xs upon NI-XS suggestions
460 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
461
4621.62 2002/04/27 11:17:39
fc17bd48 463! Encode.pm
464 encodings() now just check %ExtModule instead of eval{require}
465 all of them for ":all" to conserve more memory.
466! Encode.xs
467 more "%x" -> "%" UVxf stuff.
468! Encode.pm
469 s/=over2/=over 2/g # oops.
470
4711.61 2002/04/26 03:02:04
7e19fb92 472! t/mime-header.t
473 Now does decent tests besides use_ok()
474! lib/Encode/Guess.pm t/guess.t
475 UI streamlined, document added
476! Unicode/Unicode.xs
477 various signed/unsigned mismatch nits (#16173)
478 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
479! Encode.pm
480 POD: utf8-flag-related caveats added. A few sections completely
481 rewritten.
482! Encode.xs
483! AUTHORS
484 Thou shalt not assume %d works, either!
485 Robin Baker added to AUTHORS for this
486 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
487! t/CJKT.t
488 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
489
4901.60 2002/04/24 20:06:52
af1f55d9 491! Encode.xs
492 "Thou shalt not assume %x works." -- jhi
493 Message-Id: <20020424210618.E24347@alpha.hut.fi>
494! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
495 low-memory build machines happy, now *.c is created for each *.ucm
496 (no table aggregation). You can still override this by setting
497 $ENV{AGGREGATE_TABLES}.
498 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
499+ lib/Encode/Guess.pm
500+ lib/Encode/JP/JIS7.pm
501 Encoding-autodetect (mainly for Japanese encoding) added. In a
502 course of development, JIS7.pm was improved.
503+ lib/Encode/HTML/Header.pm
504+ lib/Encode/Config.pm
505 MIME B/Q Header Encoding Added!
506! Encode.pm Encode.xs t/fallback.t
507 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
508 Message-Id: <20020424130709.GA14211@tanglefoot>
509
5101.59 $ 2002/04/22 23:54:22
511! Encode.pm Encode.xs
512 needs_lines() and perlio_ok() are added to Internal encodings such
513 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
514
5151.58 2002/04/22 23:54:22
ef175861 516! TW/TW.pm
517 s/MacChineseSimp/MacChineseTrad/ # ... oops.
518! bin/ucm2text
519! t/*.t
520- t/*.euc t/*.ref
521+ t/*.enc t/*.utf
522 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
523 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
524 t/(JP|KR).t are aggregated to t/CJKT.t
525 test data are all remade via bin/ucm2text.
526 And .... They are no longer skipped for -Uuseperlio !
527
5281.57 2002/04/22 20:27:30
011b2d2f 529! t/JP.t t/KR.t t/perlio.t
530 unless (find PerlIO::Layer 'perlio') ... line is back again.
531 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
532 need PerlIO. Sigh....
533! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
534 ->perlio_ok now does eval{ require PerlIO::encoding } there so
535 it correctly returns 1 when PerlIO::encoding is yet loaded.
536! Encode.xs
537 perl-current patch #16072 reflected
538
5391.56 2002/04/22 09:48:07
0ab8f81e 540! Encode.pm encoding.pm t/perlio.t t/jperl.t
541 New PerlIO::encoding 0.04 compliance met
542
5431.55 2002/04/22 03:43:05
544! Encode.pm Encode.xs Unicode/Unicode.pm
545 needs_lines() defined so Encode::Encoding is no longer needed
546 for perlio
547
5481.54 2002/04/22 02:50:01
549! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
550! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
551 now perlio_ok is true by default if PerlIO::encoding->VERSION is
552 0.03 or larger. POD in Encode::Encoding revised to reflect this.
553 Encode::XS and Encode::Unicode now has perlio_ok() method.
554! lib/Encode/Supported.pod
555 s/UP-UX/HP-UX/ by jhi
556! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
557! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
558! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
559! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
560! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
561! lib/Encode/Supported.pod
562 Huge document fixes by Philip.
563! AUTHORS
564! t/JP.t
565 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
566 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
567! t/perlio.t
568 binmode() after "<:encoding" to make Win32 happy, by Mattia.
569 Mattia added to AUTHORS file
570 Message-Id: <3CC3150F.5798.22A05AE@localhost>
571
5721.52 2002/04/20 23:43:47
6d1c0808 573! t/perlio.t
574 TODO: is now SKIP:, as NI-XS requested. Also adds more
575 eraborate failure analysis added.
576! bin/enc2xs
577 A note on how to make sure of round-trip safety added to POD
578 section (so Autrijus is happier)
579! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
580 big5-(eten|hkscs) is round-trip safe again!
581 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
582! encoding.pm
583 Typo fixes by Andreas
584! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
585! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
586 PerIO coodination patches from NI-XS.
587 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
588
5891.51 2002/04/20 09:58:23
b0b300a3 590! t/TW.t
591 Updated test suite by Autrijis so "make test" is happy again
592 Message-Id: <20020420082104.GA25037@not.autrijus.org>
593+ ucm/big5-eten.ucm
594! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
595- ucm/big5.ucm
596 TW/TW.pm TW/Makefile.PL
597 Updates by Autrijus. 'big5' is no longer a canonical but an
598 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
599 Message-Id: <20020419195346.GA19597@not.autrijus.org>
600! Encode.xs
601 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
602 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
603! Encode.pm
604 PerlIO detection a little bit smarter; no longer uses eval qq{}
605 but eval {}.
606
6071.50 2002/04/19 06:13:02
85982a32 608! ! Encode.pm Encode.xs Encode/encoding.h
609+ t/fallback.pm
610 New Fallback API imlemented and documented. See "perldoc Encode"
611 for details
612! lib/Encode/JP/JIS7.pm Encode.pm
613+ lib/Encode/PerlIO.pod t/perlio.t
614 API compliance met. However, it still does not work unless perlio
615 implements line buffer. See BUGS section in perldoc Encode::PerlIO
616 As a sensible workaround, perlio_ok() added to Encode.
617! encoding.pm
618! lib/Encode/Supported.pod
619 Doc fixes from jhi
620 Message-Id: <20020418174647.J8466@alpha.hut.fi>
621! CN/CN.pm
622 Doc fixes from Autrijus
623 Message-Id: <20020418144131.GA10987@not.autrijus.org>
624! Encode.pm
625 perlqq mode documented
626! t/JP.t
627+ t/jisx0201.euc t/jisx0201.ref
628! t/jisx0208.euc t/jisx0208.ref
629 t/JP.t tests more rigorously and with other encodings
630 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
631 in jis0208 (used to contain jisx0201 part).
632! Encode/Makefile_PL.e2x
633 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
634 enc2xs and encode.h rather than hard-coded. This allows the resulting
635 module fully CPANizable.
636! encoding.pm t/JP.t t/KR.t
637 PerlIO detection simplified (checks %INC instead of eval{})
638! Encode.xs Encode/encode.h
639+ Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
640- lib/Encode/Unicode.pm
641 (en|de)code_xs relocated to where it belongs. Source reindented
642 to my taste
643! bin/enc2xs
644 Additional (U8 *) cast added as suggested by jhi
645 Message-Id: <20020417165916.A28599@alpha.hut.fi>
f2a2953c 646
85982a32 6471.42 Date: 2002/04/17
b2704119 648- lib/Encode/XS.pm
649 no-op module; Thought of adding a pod there but enc2xs has
650 one so gone.
651! encoding.pm
652! t/JP.pm
653! t/KR.pm
654 correct mechanism to detect Perlio::encoding layar installed.
655! Encode.xs
656 PerlIO Layer detached.
657
6581.41 2002/04/16 23:35:00
659! encoding.pm
660 binmode(STDIN|STDOUT ...) done iff PerlIO is available
661! t/*.t
662 Cleaned up PerlIO skip conditions to prepare for the upcoming
663 Encode - PerlIO forking.
664! Encode.pm
665 exported functions are now prototyped.
666! lib/Encode/CN/HZ.pm
667! bin/enc2xs
668! Encode.xs
669 fallback implemented # was /* FIXME */
670 affected programs revised to fit (only HZ was using the try-catch
671 approach which needed to be fixed for API-compliance).
672! Encode/Config.pm
673! Encode/KR/2022_KR.pm
674! Encode/KR/KR.pm
675 can find =head1 NAME now, jhi
676 Message-Id: <20020416083059.V30639@alpha.hut.fi>
677! encoding.pm
678 s/\{h\}/{$h}/g ;)
679! Encode.xs
680 now complies with less warnings with the pickest compilers.
681 Suggested by Craig, fixed by Dan.
682 ! Encode/Makefile_PL.e2x
683! bin/enc2xs
684 A bug that fails to find *.e2x in certain conditions fixed
685
6861.40 2002/04/14 22:27:14
aae85ceb 687+ Encode/ConfigLocal_PM.e2x
688! lib/Encode/Config.pm
689! bin/enc2xs
690 "enc2xs -C" now generates/updates Encode::ConfigLocal.
691 ConfigLocal_PM.e2x is a skelton thereof.
692! lib/Encode/Config.pm
693! CN/CN.pm
694 "use Encode::CN::HZ;" was missing.
695! t/Unicode.t
696! t/unibench.t
697 More rigorous tests added to test XS, especially on memory allocation.
698! Encode.xs
699! lib/Encode/Unicode.pm
700 NI-S implemented an XS version -- merged
701 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
702! encoding.pm
703! t/jperl.t
704 Source filter option added. With this option on, you can write
705 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
706 encodings. t/jperl.t enhanced to test this feature.
707! t/Unicode.t
708 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
709 apply his suggestion, this degree of nitting is enough to add him
710 to AUTHORS list.
711 Message-Id: <3CB93223.291E5E2E@earthlink.net>
712! JP/JP.pm
713+ lib/Encode/JP/JIS7.pm
714- lib/Encode/JP/JIS.pm
715- lib/Encode/JP/2022_JP.pm
716- lib/Encode/JP/2022_JP1.pm
717 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
718 JIS7.pm for better maintainability and performance
719! encoding.pm
720 Added caveat for non-ascii identifiers.
721! encoding.pm
722 fixes by jhi, the original author of this pragramtic module.
723 Message-Id: <20020413231527.V1826@alpha.hut.fi>
724
7251.34 2002/04/12 20:23:05 (Unreleased)
77ea6967 726! Encode.pm
727! t/Unicode.t
728 EBCDIC fixes addressed by jhi.
729 Message-Id: <20020412161844.D9383@alpha.hut.fi>
730! lib/Encode/Encoder.pm
731 POD fix by Miyagawa-kun
732 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
733
7341.33 2002/04/10 22:28:40
448e90bb 735! AUTHORS
736 Philip's mail address corrected.
737! AUTHORS
738! t/Encoder.t
739! lib/Encode/Encoder.pm
aae85ceb 740 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
448e90bb 741 This adds Miyagawa-kun to AUTHORS.
742 * encoding() no longer exported by default but on demand
743 * t/Encoder.t updated to test all these
744 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
77ea6967 745! lib/Encode/Unicode.pm
746! lib/Encode/Supported.pm
747 Further doc fixes by Anton
448e90bb 748
7491.32 2002/04/09 20:06:15
fcb875d4 750+ bin/ucmlint
751+ t/bogus.ucm
752- ucm/macDevanaga.ucm Unicode Character Map
753- ucm/macGujarati.ucm Unicode Character Map
754- ucm/macGurmukhi.ucm Unicode Character Map
755 A utility to check integrity of .ucm files. t/bogus.ucm is a
756 ucm that is deliberately bogus. unused Indic mappings are removed
757 for the time being.
758! Encode.pm
759 resolve_alias() added as suggested by jhi. Same as
760 find_encoding("alias")->name. For convenience. This one is
761 defined in Encode.pm instead of Alias.pm.
762 Message-Id: <20020409215846.H17022@alpha.hut.fi>
763! Encode.xs
764 Memory Allocate but detected during the devel of ucmlint -- fixed.
765 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
766! lib/Encode/Unicode.pm
767 valid_ucs2(0) is false but must be true.
768 3 patches from NI-S as follows. This also has fixed the incident
769 Andy has reported.
770! lib/Encode/Alias.pm
771 find_alias() recursion prevention
772! t/Aliases.t
773 Checks for the patch above
774! t/Encode/Unicode.pm
775 An extra "F" that causes valid_ucs2() return a bogus value fixed
776 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
777 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
778 2 Small Patches from jhi as follows:
779! Encode.pm
780 Encode->encodings() lists in case-insensitve order (as it was)
781! bin/piconv
782 -l option prints avaiable encodings to STDOUT instead of STDERR
783! lib/Encode/Aliases.pm
784 s/defintion/definition/
785 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
786! AUTHORS
787! lib/Encode/Supported.pod
788! lib/Encode/Unicode.pm
789 POD revise by Philip Newton. This adds Philip to AUTHORS list.
790 Thank you for the exact quote of Douglas Adams :)
791 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
792
7931.31 2002/04/08 18:08:07
fdd579e2 794! lib/Encode/Encoder.pm
795+ t/Encoder.t
796 Encode::Encoder, once just a placeholder of an idea, is now much more
797 practical. See t/Encode.t to find how practical it can be.
798+ lib/Encode/Config.pm
799! Encode.pm
800 my false laziness at Encode.pm is fixed. Now %ExtModules are set
801 in Encode::Config and they are all literally, not programatically
802 set. My false laziness was resulting many encodings missing from
803 %ExtModules.
804! lib/Encode/Unicode.pm
805! t/Unicode.t
806 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
807 so that it does not rely Encode::Unicode for BOM values
808 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
809
8101.30 2002/04/08 02:34:51
c731e18e 811+ lib/Encode/Encoder.pm
812 Object Oriented Encoder. I reckon something like this is in need.
813! Encode.pm
814! t/Unicode.pm
815! lib/Encode/Supported.pod
fdd579e2 816 * autoloading bug that prevented upper-case canonicals such as UTF-16
c731e18e 817 is fixed. Now even UTF/UCS are autoloaded!
818 * encodings() is now more intuitive.
819 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
820 stored therein.
821 * Obligatory fixes to the POD.
822! lib/Encode/Supported.pod
823 Patch from Anton applied.
824 Message-Id: <66641479.20020408033300@motor.ru>
825! Encode.pm
826! lib/Encode/Unicode.pm
827 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
828
8291.28 2002/04/07 18:58:42
830! MANIFEST
831+ t/Unicode.t
832+ t/grow.t
833 Just a MANIFEST for those missing files.
834
8351.26 Date: 2002/04/07 15:22:04
f2a2953c 836! JP/Makefile.PL
837! t/Aliases.PL
838 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
839 And t/Aliases.t fixed to make sure they all exist.
840
8411.25 2002/04/07 15:01:25 (Unreleased)
842! Encode.pm
843! lib/Encode/Unicode.pm
844 More POD fixes....
845! Encode.pm
846- lib/Encode/UTF_EBCDIC.pm
847- lib/Encode/Internal.pm
848- lib/Encode/utf8.pm
849 Integrated into Encode.pm as closures. That way "one package, one file"
850 rule is preserved yet less files to require.
851! encoding.pm
852 commented out binmode(STDERR ...
853! Makefile.PL
854! Byte/Makefile.PL
855! CN/Makefile.PL
856! EBCDIC/Makefile.PL
857! JP/Makefile.PL
858! KR/Makefile.PL
859! Symbol/Makefile.PL
860! TW/Makefile.PL
861! Encode/Makefile_PL.e2x
862 Schwarn's MM-compliance patch merged
863 Message-Id: <20020406082609.GA28758@blackrider>
864! Encode.pm
865! lib/Encode/Unicode.pm
866+ lib/Encode/UTF_EBCDIC.pm
867+ t/Unicode.t
868- lib/Encode/10646_1.pm
869- lib/Encode/ucs2_le.pm
870 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
871 per-module based (en|de)code, I saved a number of .pm by
872 reorganizing it as per-object base (Well, this is what Encode::XS
873 does under the hood). See Encode::Unicode for details.
874 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
875 This module is used only on EBCDIC environments.
876
8771.21 2002/04/05 14:46:34 (Not Released)
878! JP/JP.pm
879! Encode.pm
880+ ucm/jis0201.ucm
881+ ucm/jis0208.ucm
882+ ucm/jis0212.ucm
883 Are back to make Perl/Tk happy Smile, NI-S.
884! t/Alias.pm
885! lib/Encode/Alias.pm
886! lib/Encode/Supported.pm
887! lib/Encode/10646_1.pm
888! lib/Encode/ucs2_le.pm
889 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
890 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
891 updated to reflect changes.
892 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
893! lib/Encode/Alias.pm
894! lib/Encode/Supported.pm
895 Anton's revision commited. Added Dan's own fixes as well.
896 Message-Id: <159103166906.20020405161134@motor.ru>
897! lib/Encode/Alias.pm
898 134c134
899 < qr/^UCS2-le$/i => '"UCS-2"', );
900 ---
901 > qr/^UCS2-LE$/i => '"UTF-16LE"');
902 Sigh. Thank you, Anton.
903 Message-Id: <14567692196.20020405062020@motor.ru>
904 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
905
9061.20 2002/04/04 19:50:52
a999c27c 907+ bin/unidump
908 the last minute addtion. Just give it a try. Docs remains to be done.
909 Not installed by default.
910! lib/Encode/Supported.pod
911 Enhanced Greatly.
912! t/Alias.t
913! lib/Encode/Alias.pm
914! lib/Encode/utf8.pm
915! lib/Encode/10464_1.pm
916! lib/Encode/ucs2_le.pm
917 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
918 unchanged but UTF-16BE is added as an alias. Implicit aliases
919 move to Encode::Alias so init_alias() works more as expected.
920 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
921 Though pedantically wrong, This should make perl mongers happier.
922 t/Alias.t is enhanced to test all these.
923 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
924! Byte/Makefile.PL
925 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
926 of the codepoints are common. CJKT left as is because the saving is
927 not significant.
928! Byte/Makefile.PL
929! CN/Makefile.PL
930! EBCDIC/Makefile.PL
931! Encode.xs
932! Encode/Makefile_PL.e2x
933! JP/Makefile.PL
934! KR/Makefile.PL
935! Makefile.PL
936! Symbol/Makefile.PL
937! TW/Makefile.PL
938! bin/enc2xs
939! AUTHORS
940 All occurance of _def.h replaced with .exh so djgpp works happily
941 ever after! To credit this amazing discovery, Laszlo is now in
942 AUTHORS list
943 Message-Id: <20020403181424.GA8778@freemail.hu>
944 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
945! Makefile.PL
946! */Makefile.PL
947! Encode/Makefile_PL.skel
948 bin/enc2xs
949 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
950 Message-Id: <20020401222744.GX2000@blackrider>, et al.
951! t/encoding.t
952 Two more tests by added jhi
953 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
954+ t/grow.t
955! Encode.xs
956 The showstopper fixed -- Memory reallocation bug was causing
957 Encode::XS to fall into infinite loop on certain conditions.
958 t/grow.t tests that.
959 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
960+ bin/txt2ucm
961! */Makefile.PL
962! */*.ucm
963! */XX.pm
964! lib/Encode/Supported.pod
965 Vendor encodings rebuilt out of original map files at unicode.org.
966 Indic languages such as MacDevanagali remain unspported do to the
967 shortcoming of encengine capabilities (they need algorithmical
968 conversion and I have no knowledge on that!). Pods fixed for added
969 encodings.
970 Oh, macJapan.ucm renamed to macJapanese.ucm.
971 macROMnn is macRomanian and macRUMnn is macRumanian.
972 txt2ucm is a crude script that is used to convert them.
973! bin/enc2xs
974 Unicode Compound Characters (used extensively on Mac) supported
975! bin/piconv
976 Typo fixes and improvements by jhi
977 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
3ef515df 978
b536bf57 9791.11 2002/03/31 22:12:13
3ef515df 980+ t/encoding.t
981+ t/jperl.t
982! MANIFEST
983 Missing files from the MANIFEST fixed.
984 Message-Id: <20020401010156.H10509@alpha.hut.fi>
985 Version incremented just to make CPAN happy.
986
9871.10 2002/03/31 21:32:42
988! Makefile.PL
989! README
990 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
991 if you want. This should make Autrijus happy. Also, piconv
992 is added to default install.
993+ Encode/*.e2x
994! bin/enc2xs
995 Here-documented files that enc2xs generates are now exported
996 to *.e2x. Much cleaner and easier to debug.
997! encoding.pm
998 encoding enhances so you can make it act more like such
999 (now prehistoric ) "localized" variations of perl like Jperl.
1000+ t/jperl.t
1001 Further test for encoding.pm. Written in euc-jp
1002+ encoding.pm
1003+ t/encoding.t
1004 Taken over form jhi.
1005 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1006- Encode/*.ucm
1007+ ucm/*.ucm
1008! Makefile.PL
1009! */Makefile.PL
1010 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1011- ucm2table
1012+ bin/ucm2table
1013 ***
1014! AUTHORS
1015! Byte/Byte.pm
1016! Encode.pm
1017! Encode/macIceland.ucm
1018! lib/Encode/Alias.pm
1019! lib/Encode/Supported.pod
1020 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1021 Message-Id: <m3lmcavhjt.fsf@anima.de>
1022
10231.01 2002/03/29 20:59:39
037b88d6 1024! Makefile.PL
1025! README
1026 s/USE_SCRIPTS/MORE_SCRIPTS/
1027! Makefile.PL
1028 installs enc2xs by default for external Encode:: modules in CPAN,
1029 such as Encode::HanExtra
1030! t/*.t
1031 More sensible perl core detection via $ENV{PERL_CORE}
1032 suggested by Spider
1033 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1034! bin/enc2xs
1035 Perl core ditection via $^X =~ m/\bminiperl$/o
1036 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1037
10381.00 Wed Mar 29 2002
67d7b5ef 1039! *
1040 The version of all files is updated to 1.00 via "ci -f -l1.00",
1041 commemorating version 1.00. All files, including *.ucm are now
1042 under version control.
1043- encode.h
1044+ Encode/encode.h
1045 encode.h moved to Encode/ so it will be installed for the later
1046 use by enc2xs
1047! enc2xs
1048 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1049! Makefile.PL
1050! */Makefile.PL
1051- compile
1052+ bin/enc2xs
1053 compile renamed to enc2xs.
1054 Affected Makefle.PL updated
1055- lib/CN/2022_CN.pm
1056 "Punt it. HanExtra can take care of that later." -- Autrijus
1057 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1058! Encode/johab.ucm
1059! Encode/euc-kr.ucm
1060! Encode/ksc5601.ucm
1061! lib/Encode/CJKConstants.pm
1062! lib/Encode/KR/2022_KR.pm
1063 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1064 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1065! README
1066! Makefile.PL
1067+ bin/piconv
1068 bin/ added for example scripts. They are not installed by default.
1069 to install them, "perl Makefile.PL USE_SCRIPTS".
1070 piconv is iconv reinvented in perl. in addition to all features
1071 of iconv, it also adds perlish features. See L<piconv/1> for more
1072 details.
1073! lib/Encode/Alias.pm
1074 qr/^ replaced with qr/\b so it directly matches locale names
1075 such as en_US.US-ASCII
1076! AUTHORS
1077! t/Aliases.t
1078 Patch by MJD to fix the following problem applied.
1079 Subject: [PATCH 5.7.3 Encode]
1080 Aliases.t not properly skipped when Encode extension not built
1081 Message-Id: <20020328091850.18677.qmail@plover.com>
1082! lib/Encode/KR/2022_KR.pm
1083! lib/Encode/CJKConstants.pm
1084 Another patch from Jungshik to make iso-2022-kr actually work
1085 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1086! Encode/Encode/euc-kr.ucm
1087+ Encode/Encode/johab.ucm
1088! Encode/Encode/ksc5601.ucm
1089! Encode/KR/KR.pm
1090! Encode/KR/Makefile.PL
1091! Encode/lib/Encode/Alias.pm
1092! t/Alias.t
1093 Johab support and complete revision of Korean Encoding by Jungshik
1094 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1095+ Encode.pm
1096 Revised to make up with now-dropped Encode::Details.
1097- lib/Encode/Details.pod
1098 Dropped. Besides being obsolete, the topics are now covered in
1099 respective pods now.
1100! AUTHORS
1101! t/Alias.t
1102 KR/KR.pm
1103 lib/Encode/Alias.pm
1104 Korean aliases fixed thanks to Jungshik Shin
1105 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1106 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1107! *.pm
1108 =head1 NAME added to all modules to make buildtoc happy
1109 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1110- lib/Encode/CJKguide.pod
1111 Too controversial and dropped from the dist. Will be available
1112 separately on the web.
1113! Encode/*.ucm
1114 RCS tags added so table debugging gets easier (should that be
1115 needed! I hope they all stay 1.00!)
1116+ lib/Encode/CJKguide.pod
1117 A detailed guide to mainly, but not limited to, CJK multibyte
1118 encodings.
1119- Encode/roman8.ucm
1120+ Encode/hp-roman8.ucm
1121! Byte/Makefile.PL
1122! Encode/Supported.pod
1123 All occurance of "roman8" replaced with "hp-roman8" to avoid
1124 confusion
1125! Encode/Supported.pod
1126! Encode/mac*.ucm
1127! t/Alias.t
1128 Mac Encodings now comply the Inside Macintosh
1129! t/Alias.t
1130 Test for '-raw' conventions added.
1131! Encode/Alias.pm
1132 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1133! Encode/gb12345.ucm
1134! Encode/gb2312.ucm
1135! Encode/ksc5601.ucm
1136 "-raw" appended to canonical names.
1137 File mames stay unchanged thanks to UCM format.
1138! lib/Encode/CN/HZ.pm
1139 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1140 Message-Id: <20020326035210.GA2091@not.autrijus.org>
5129552c 1141
64ffdd5e 11420.99 Tue Mar 26 2002
1143- lib/Encode/JP/Const.pm
1144+ lib/Encode/CJKConstants.pm
1145+ lib/Encode/CN/2022_CN.pm
1146+ lib/Encode/KR/2022_KR.pm
1147+ t/KR.t
1148+ t/gb2312.euc
1149+ t/gb2312.ref
1150+ t/ksc5601.euc
1151+ t/ksc5601.ref
1152+ t/table.euc
1153+ t/table.ref
1154+ ucm2table
1155 * Support for ISO-2022-KR and ISO-2022-CN added.
1156 * t/KR.t added!
1157 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1158 * ucm2table autogenerates character table out of UCM files.
1159- engine.c
1160+ encengine.c
1161- lib/Encode/Supports.pod
1162+ lib/Encode/Supported.pod
1163 Names reverted due to popular demand.
1164 8.3 rule applies only when there is a conflict.
1165 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1166! */Makefile.PL
1167- Encode/*.enc
1168+ Encode/*.ucm
1169- lib/Tcl*
1170- lib/Encode/Format/Enc.pod
1171- t/Tcl.t
1172 * Character tables is now 100% ucm.
1173 * All files under Encode/ is now 8.3-compliant
1174 * some of missing encodings added (i.e. gsm0338 and nextstep)
1175 * Vendor mappings aggregated with appropriate national std in
1176 Makefile.PL, resulting smaller *.so especially for CJK.
1177 Following is result on Dan's FreeBSD box.
1178 Now Then
1179 ---------------------------------------------------------------
1180 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1181 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1182 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1183 blib/arch/auto/Encode/Encode.so 27,791 27,791
1184 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1185 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1186 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1187 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1188 ---------------------------------------------------------------
1189 Total 5,375,297 6,343,819
1190 Saving 968,522
1191 * As a result of ucm-transition, Encode::Tcl dropped because
1192 Encode::Tcl demands *.enc.
1193 Encode::Tcl will be supplied in a separate tarball with *.enc.
1194 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1195!compile
1196-encengine.c
1197+encode.c
1198!Encode.pm
1199-lib/Encode/Supported.pod
1200+lib/Encode/Supports.pod
1201-lib/Encode/iso10646_1.pm
1202+lib/Encode/10646_1.pm
1203-lib/Encode/EncFormat.pod
1204+lib/Encode/Format/Enc.pod
1205 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1206- lib/Encode/JP/Constants.pm
1207+ lib/Encode/JP/Consts.pm
1208! lib/Encode/JP/JIS.pm
1209! lib/Encode/JP/H2Z.pm
1210 Version nit problem and 8.3 rule fix.
1211 > Package namespace installed latest in CPAN file
1212 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1213 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1214 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1215 accordingly. In addition, legacy "use vars qw()..." are replaced with
1216 "our";
1217 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1218 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1219! JP/JP.pm
1220- lib/Encode/JP/ISO_2022_JP.pm
1221- lib/Encode/JP/ISO_2022_JP_1.pm
1222+ lib/Encode/JP/2022_JP.pm
1223+ lib/Encode/JP/2022_JP1.pm
1224 01234567.012
1225 8.3 naming conflict for vanilla fat addressed by jhi
1226 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1227
1228! Encode.xs
1229 Typecast fix addressed by jhi
1230 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1231
a63c962f 12320.98 Mon Mar 25 2002
1233! lib/Encode/Supported.pod
1234 Further pod fixes
1235+ lib/Encode/JP/ISO_2022_JP_1.pm
1236! lib/Encode/JP/ISO_2022_JP.pm
1237! lib/Encode/JP/JIS.pm
1238! JP/JP.pm
1239 Now Encode::JP is more strict on the difference between ISO-2022-JP
1240 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1241 makes Anton happier :) FYI the previous version implements
1242 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1243! lib/Encode/Supported.pod
1244 Further pod fixes
1245! Encode.xs
1246 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1247 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1248! CN/CN.pm
1249! JP/JP.pm
1250! KR/KR.pm
1251! TW/TW.pm
1252! lib/Encode/Suppoted.pod
1253 pod fixes to replace F<http://...> to L<http://...>,
1254 as suggested by Autrijius in:
1255 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1256! lib/Encode/Suppoted.pod
1257 fixes and enhancements by Anton
1258 Message-Id: <10632060120.20020324103753@motor.ru>
1259! lib/Encode/Alias.pm
1260 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1261 added. Suggested by Anton then deobfuscated by Autrijius
1262 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1263! compile
1264 Further fix by Nicholas Clark
1265 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1266- lib/EncodeFormat.pod
1267+ lib/Encode/EncFormat.pod
1268! MANIFEST
1269 File renamed as suggested by Autrijius
1270! Encode.pm
1271! lib/Encode/Details.pod
1272! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1273! Encode.pm Sun Mar 24 13:43:47 2002
1274 pod fixes by Autrijius.
1275 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1276 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1277! t/Alias.t
1278! lib/Encode/Alias.pm
1279! Encode.pm
1280 now more EBCDIC conscious;
1281 %ExtModules on EBCDIC system excludes CJK so that you don't
1282 have to worry about the matched alias resulting cloaking.
1283 t/Alias.t also revised to reflect changes. Verified by jhi
1284 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1285
071db25d 12860.97 Sun Mar 24 2002
1287! CN/CN.pm
1288! KR/KR.pm
1289! TW/TW.pm
1290 EBCDIC detection mechanism installed as in JP/JP.pm
1291 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1292! Byte/Makefile.PL
1293! CN/Makefile.PL
1294! EBCDIC/Makefile.PL
1295! JP/Makefile.PL
1296! KR/Makefile.PL
1297! Symbol/Makefile.PL
1298! TW/Makefile.PL
1299 Now all table files used by compile are postfixed '_t' to avoid
1300 namespace collisions in case insensitive file systems once for all!
1301 inspired by:
1302 Message-ID: <58290227735.20020323195659@familiehaase.de>
1303! t/Aliases.t
1304 Since the Encode::JP is unsupported under EBCDIC we
1305 cannot run this test (aliases as such should work fine) -- jhi
1306 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1307! Byte/Makefile.PL
1308 duplicate occurance of ascii.ucm and 8859-1.ucm
1309 causes MacOS X dlyd to cloak
1310! t/CN.t
1311! t/Encode.t
1312! t/JP.t
1313! t/TW.t
1314! t/Tcl.t
1315 < chdir 't' if -d 't';
1316 ---
1317 > if (! -d 'blib' and -d 't'){ chdir 't' };
1318 When you are "make test"-ing on Encode/ directory, you must not
1319 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1320 remain unchanced. Also the situation detection was made simpler
1321 in t/JP.t, which was originally;
1322 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1323! Encode.pm
1324 "Use of uninitialized value in string eq at Encode.pm line 96."
1325! Symbol/Makefile.PL
1326! EBCDIC/Makefile.PL
1327! AUTHOR
1328 -- Problem on case insensitive file systems
1329 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1330 Message-ID: <88254111953.20020323095503@familiehaase.de>
1331! compile
1332! AUTHOR
1333 "So I think it's a bug in gcc, not perl. But it still needs to be
1334 worked around."
1335 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1336 Message-Id: <20020323170509.C96475@plum.flirble.org>
1337
5129552c 13380.96 Sat Mar 23 2002
1339! TW/TW.pm
1340! lib/Encode/Encoding.pm
1341! lib/Encode/Alias.pm
1342! lib/Encode/Supported.pod
1343! KR/KR.pm
1344 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1345 Message-ID: <20020322073908.GB10539@blackrider>
1346! Makefile.PL
1347! Encode.pm
1348 "...I think we should include ISO 8859-1 as well." -- NI-S
1349 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1350! JP/JP.pm
1351! CN/CN.pm
1352! KR/KR.pm
1353! TW/TW.pm
1354! lib/Encode/Alias.pm
1355 alias definitions relocated to Encode::Alias so module autoloading
1356 works for aliases also.
1357! Encode.pm
1358 encodings() now accepts args to check ExtModules.
1359+ Byte/Byte.pm
1360+ Byte/Makefile.PL
1361+ EBCDIC/EBCDIC.pm
1362+ EBCDIC/Makefile.PL
1363+ Symbol/Makefile.PL
1364+ Symbol/Symbol.pm
1365! Encode.pm
1366! Encode.xs
1367 Latin and single byte encodings are reorganized so they are
1368 demand-loaded like Encode::XX. Now only ascii is compiled into
1369 Encode itself.
1370! lib/Encode/Alias.pm
1371 for my $k (keys %hash){ delete $hash{$k}; }
1372 is depreciated; fixed.
1373
13740.95 Fri Mar 22 2002
5d030b67 1375 In this update, pod rewrites and alias fixes are the main issues
1376+ lib/Encode/Supported.pod
1377 Describes supported encodings
1378! Makefile.PL
1379 streamlined compiled-in encodings.
1380! lib/Encode/Description.pod -> lib/Encode/Details.pod
1381 Renamed.
1382+ Encode/ibm-125?.ucm
1383 Added from icu distibution with any occurance of
1384 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1385 some respect to icu staff, however.
1386+ lib/Encode/Alias.pm
1387! Encode.pm
1388 Alias difinitions in Encode.pm relocated.
1389! AUTHORS
1390! Encode.xs
1391 packWARN patch from Paul Marquess via jhi
1392 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1393 Paul added to AUTHORS as a result.
1394! t/CJKalias.t -> t/Aliases.t
1395 Renamed. Checks even more aliases and alias overloading
1396! Encode.pm
1397! CN/CN.pm
1398 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1399 gbk => cp936 relocated to CN.pm
1400! t/CJKalias.t
1401 Test::More with plans (by jhi)
5129552c 1402
1b2c56c8 14030.94 Thu Mar 21 2002
1404+ lib/Encode/Description.pod
1405! lib/Encode/Encoding.pm
1406 Now the pod in Encode.pm is abridged as programming references.
1407 lib/Encode/Description.pod contains the original, detailed description
1408 and Encode::Encoding explains how to write your own module to
1409 add new encodings. So far, lib/Encode/Description.pod contains
1410 the whole pod once in Encode.pm. This is intentional.
1411! Encode.pm
1412 Pod revisions by Anton Tagunov
1413 Message-Id: <517178431.20020320174824@motor.ru>
1414! lib/Encode/Tcl.pm
1415 all occrance of Encode::Tcl::Extended removed including pod
1416! t/CJKalias.t
1417 test now checks $encoding->name only; $encoding->{name} are
1418 no longer check to find the canonical name.
1419! lib/Encode/JP/JIS.pm
1420! lib/Encode/JP/ISO_2022_JP.pm
1421 ->name() added to be more compliant with API
1422! CN/CN.pm
1423! JP/JP.pm
1424! KR/KR.pm
1425! TW/TW.pm
1426! t/CJKalias.t
1427 Patch by Autrijus to add aliases to TW and fixes to POD
1428 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1429! AUTHORS
1430 SADAHIRO Tomoyuki added as should. My apologies.
1431
fab31126 14320.93 Wed Mar 20 2002
1433* First release to be uploaded to CPAN. For prehistoric changes,
1434 please see Changes file of perl distibution as well as
1435 perl-unicode@perl.org archive, available at:
1436 http://archive.develooper.com/perl-unicode@perl.org/
1437
1438 Changes Since 0.92 includes;
1439+ Changes
1440+ AUTHORS
1441! Encode.pm
1442! README
1443 + Mention to perl-unicode@perl.org added
1444! JP/JP.pm
1445 + Encoding aliases added so you can feed locale names
1446 and MIME Charset="" directly.
1447 - Mention to JISX0212 removed because it's fixed
1448! CN/CN.pm
1449! KR/KR.pm
1450 + Encoding aliases added. Note TW is left untouched because
1451 euc-tw is not implemented in TW but in Encode::HanExtra.
1452 Autrijus, you may fix Encode::HanExtra.
1453+ t/CJKalias.t
1454 + to test encode aliases added