1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.99 2003/12/29 02:47:16 dankogai Exp dankogai $
6 $Revision: 1.99 $ $Date: 2003/12/29 02:47:16 $
8 find_encoding("UTF-16BE")->encode("abc") now null terminates
9 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html
11 prototype bug in decode_utf8() fixed
12 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp>
13 ! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod
14 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm
16 Merged from maintperl@21987
18 1.98 2003/08/20 11:15:31
19 ! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t
20 Dave Evans has found and corrected a bug in Encode::MIME::Header.
21 Test suite added by Dan Kogai.
22 Message-Id: <3F43440B.7060606@rudolf.org.uk>
24 Typo fixes rolled back in from bleedperl
26 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro
27 Message-Id: <20030805002313.9880.BQW10602@nifty.com>
29 argv case nit for VMS by Craig
30 Message-ID: <3F2B02DE.10207@mac.com>
31 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
32 Encode test fixes for VMS by Peter Prymmer
33 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com>
34 ! lib/Encode/Alias.pm t/Aliases.t
35 koi-8 aliases bug detected and patched by sadahiro.
36 Further fix and test suite by dankogai
37 Message-Id: <20030713102228.C76A.BQW10602@nifty.com>
39 1.97 2003/07/08 21:52:14
40 ! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm
41 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm
42 $DEBUG replaced with DEBUG() so perl optimizes better,
43 by Rafael with further fixes by dankogai
44 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr>
45 ! lib/Encode/Aliases.pm
46 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' );
47 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' );
48 So new hash seeding introduced in bleedperl works.
49 Message-Id: <20030629100937.GD20285@vipunen.hut.fi>
51 $Encode::Guess::NoUTFAutoGuess is added so you can turn off
52 automatic utf(8|16|32) guessing -- originally by Autrijus
53 Message-Id: <20030626162731.GA2077@not.autrijus.org>
55 Addressed the following;
56 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode
57 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org>
59 1.96 2003/06/18 09:29:02
60 ! lib/Encode/JP/JP.pm t/guess.t
61 m/(...)/ in void context then $1 is considered a Bad Thing
62 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp>
64 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is
67 More typecast from maintperl@19739
68 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com>
70 Tests 37 & 38 failed on Win32 -- yet another CRLF issue
71 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com>
73 Now skips for EBCDIC platform.
74 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com>
76 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot
77 have more than one C<.> in their filenames." -- perlport.
78 Message-Id: <3ED79E01.8050401@mac.com>
80 Found and fixed the back that -p,--perlqq does not work.
81 Induced by the change from Getopt::Std to Getopt::Long.
83 Addressed [cpan #2629] Wrong assumption in numeric comparison
84 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org>
85 ! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs
86 lib/Encode/Encoding.pm t/perlio.t
87 ! API Change: ->new_sequence() => ->renew()
88 + Encode::Unicode makes use of it so it can handle BOM on PerlIO
89 + Encode::XS and Encode::utf8 now supports ->renew()
90 + Encode::Encoding now documents this with examples
91 - Non-XS (en|de)code stripped out of Encode::Unicode
92 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp>
94 1.95 2003/05/21 08:41:11
96 Since bogus entries were found in iso-8859-6, all entries are
97 re-generated once again out of
98 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT
99 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery
100 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu>
101 + lib/Encode/Unicode/UTF7.pm
102 ! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t
103 lib/Encode/Supported.pod
104 UTF-7 support is now added. With this Encode now has all transcoding
105 methods in Unicode::String.
107 1.94 2003/05/10 18:13:59
108 ! lib/Encode/MIME/Header.pm
109 A more sophisticated solution for double-encoding by dankogai
110 ! lib/Encode/MIME/Header.pm AUTHORS
111 Two bugs fixed by Bjoern Jacke
112 * "Double Encoding" was not possible
113 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=")
114 * encode("MIME-Q") had UTF-8 flag on
115 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org>
116 ! lib/Encode/MIME/Header.pm AUTHORS
117 Two occurances of "croak ()" fixed as "croak qq()".
118 Simon Cozens is added to AUTHORS as a result.
119 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad>
121 POD fixes that reflect enhancements by jhi
123 Two enhancements by jhi.
124 + Now uses Getopt::Long so it accepts long name options
125 (--from for -f, for example)
126 + New option: -r,--resolve
127 Message-Id: <20030505114149.GA227075@kosh.hut.fi>
129 META.yml added upon request of Schwern
130 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp>
132 Enache Adrian removed upon request -- to live longer than Encode
133 and/or FreeBSD (toy-)?thread :)
134 Message-Id: <20030425015701.GA2069@ratsnest.hole>
136 "close STDOUT unless $^O eq 'freebsd';" once again relocated
137 to keep VMS happy in which case "$^O eq 'freebsd'" is required
138 to keep FreeBSD+thread happy. Sigh.
139 Message-Id: <3EA88ADC.3000300@mac.com>
141 1.93 2003/04/24 17:43:16
143 added "no warnings 'pack'" in for loop to keep bleedperl from
144 complaining "Character in 'C' format wrapped in pack".
146 More elegant perl core detection inspired by Ilya Zakharevich
147 (but further elaborated for general cases).
148 ! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod
151 like cp9??, \x80-\x9F (control + 0x80) are zapped so they
152 are less likely to be confused w/ ISO-8859-*
154 RT tests added (vendor encodings are exemplified)
155 -- that successfully found a flaw on iso-2022-kr before the patch.
156 ! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm
157 decode("ISO-2022-KR") has been buggy but no one ever sited
158 that since no one seems to be using it. Bugs discovered by
160 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
161 ! lib/Encode/CN/HZ.pm t/perlio.t
162 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified
163 so it adds test for HZ.
164 Message-Id: <20030416231757.A545.BQW10602@nifty.com>
165 ! lib/Encode/Guess.pm
166 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00.
167 So long as the string contains \x{00}-\x{ff} it does not fail.
168 See perldoc for details.
169 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp>
171 1.92 2003/03/31 03:27:27
172 ! ucm/big5-eten.ucm ucm/big5-hkscs.ucm
173 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF
174 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm
175 but glibc-BIG5-2.1.2.ucm does.
176 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com>
177 ! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
178 Maps regenerated again but this time based upon
179 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/
180 (But where is THE DOCUMENT by MICROSOFT?)
181 ! t/enc_module.t AUTHORS
182 failure with threaded Perl on FreeBSD addressed.
183 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this.
184 Message-Id: <20030322230131.GA813@ratsnest.hole>
185 ! lib/Encode/Guess.pm
188 Change 18989: Make the :bytes conditional on PerlIO.
189 further Modified by Dan Kogai
190 <200303161730.h2GHU5B16265@smtp3.ActiveState.com>
192 Chnage 18966: another fix for failing test on windows ("use encoding"
193 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar
194 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com>
196 Change 18970: Hopefully this works also in Win32, by jhi
197 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com>
198 Change 18965: fix CJKT.t failures on windows due to incorrect
200 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com>
202 1.91 2003/03/09 20:07:37
204 even more proofread by jhi.
205 Message-Id: <20030309194323.GT20843@kosh.hut.fi>
208 +use lib qw(t ext/Encode/t ../ext/Encode/t);
209 Message-Id: <20030309182057.GR20843@kosh.hut.fi>
211 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san.
212 Message-Id: <20030309181748.GP20843@kosh.hut.fi>
215 Message-Id: <20030309181907.GQ20843@kosh.hut.fi>
217 1.90 2003/03/09 17:32:43
220 Inaba-san has added a patch for perl 5.8.1 or later that makes
221 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test
222 that. POD is further revised.
223 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com>
224 ! encoding.pm t/enc_module.t
225 encoding vs. ${^UNICODE} resolved. POD revised accordingly.
226 Message-Id: <20030306112940.GN20652@kosh.hut.fi>
230 signed vs. unsigned issue discovered by Craig on OpenVM
231 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]>
232 ! encoding.pm AUTHORS
233 + t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t
234 Because binmode() stacks layers instead of overwrite, you have to
235 ":raw :encoding()" in encoding.pm or your are in trouble when you
236 call encoding.pm multiple times. There are several workarounds
237 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>,
238 who was the first to address this problem was added to AUTHORS.
239 The test suites was added for this, which is a modified version
240 of SUGAWARA-san's scripts
241 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp>
243 1.88 2003/02/20 14:42:34
245 one signedness nit for Encode by jhi
246 <200302161933.h1GJX876018710@kosh.hut.fi>
248 VISCII map was incorrect; fixed by Sadahiro-san
249 Message-Id: <20030216120828.47D3.BQW10602@nifty.com>
250 ! t/enc_eucjp.t t/enc_utf8.t AUTHORS
251 You can't unlink files that are opened in cygwin but the last
252 file handle opened in t/enc_*.t left open. Patch submitted
253 by Yitzchak and he was added to AUTHORS.
254 Message-Id: <iN0Q+gzkgmZN092yn@efn.org>
256 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1'
257 Message-Id: <20030206104513.GA11081@kosh.hut.fi>
259 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi
260 Message-Id: <20030206045153.GA6826@kosh.hut.fi>
262 1.87 2003/02/06 01:52:11
264 * Inaba "Sensei" Hirohito added (I thought I have done so a long
265 ago but apparently I did not).
266 * SUZUKI Norio added for verious and useful bug reports.
267 ! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm
268 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm
269 podchecked so all warnings are gone except for L<http://>.
270 ! encoding.pm t/enc_eucjp.t
271 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi)
272 * Filter option overhaul
274 ! Encode.pm Encode.xs encengine.c Encode/encode.h
275 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm
276 Merged inaba-san's patch that fixes "use encoding 'shiftjis'"
277 without filter. podchecked by Dan Kogai.
278 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp>
279 ! lib/Encode/Alias.pm
280 decode('alias', $1) went wild because of local $_ in find_alias()
281 the evil local $_ is eradicated but that changes find_alias()
282 format for coderef aliasing. See Encode::Alias for details
283 Message-Id: <200302051704.AA00042@kipp0.nifty.com>
285 1.86 2003/01/22 03:29:07
287 * Don't forget to canonize when you attempt an exact match!
288 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp>
289 * ${^ENCODING} exception is off for $] > 5.008
290 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp>
292 $] check commented out so it runs on 5.8.0
294 1.85 2003/01/21 22:19:14
296 ${^ENCODING} exception is now explicit rather than handled by regex.
297 + t/enc_eucjp.t t/enc_utf8.t
298 Test suite for the better "encoding" pragma support for bleedperl.
299 On 5.8.0, they will just be skipped.
301 1.84 2003/01/10 12:00:16
303 ${^ENCODING} is no longer set for utf so encoding is no longer fun :)
304 (That is to prevent duplicate encoding first by IO then ${^ENCODING})
305 Message-Id: <20030108213737.GK331043@lyta.hut.fi>
307 %_ fixes saves the resulting .so .05% smaller, by NC
308 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net>
310 Silence Encode on undef, by Andreas
311 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus>
312 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus>
314 s/regognised/recognised/ . British spelling left intact to pay
315 respect to two British Nicks :)
316 Message-Id: <20021203020454.GK2274@kosh.hut.fi>
318 1.83 2002/11/18 17:28:49
319 ! Encode.xs lib/Encode/JIS7.pm
320 Even more patches from Inaba-san has been applied. With this
321 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work.
322 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp>
324 1.82 2002/11/14 23:06:12
326 Encode::utf8 (XS Version) assertion botch first found in Cygwin,
327 later found in perls w/ -Dusemymalloc was fixed by NC.
328 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net>
330 1.81 2002/11/08 18:29:27
331 ! Encode.pm Encode.xs
332 Non-XS version of Encode::utf8 is back (with XS being default).
333 Encode::predefine_encodings(0) to turn off XS.
334 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has
335 found that it was Test::More causing the problem, not Encode.
336 But I have already made it configurable so it may be useful in
338 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al.
340 The ingenious patch by Nicholas Clark that reduces shlib sizes by
341 50% with no penalty and backward compatibility preserved, is in.
342 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net>
344 1.80 2002/10/21 20:39:09
345 ! Encode.xs t/mime-header.t
346 Even more patches from NI-XS regarding Encode::utf8->decode().
347 And one more test to t/mime-header.t to prove it
348 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk>
350 1.79 2002/10/21 06:05:37
352 Further patches from NI-XS. Encode::utf8->decode() now checks the
353 value of utf8 flag of the argument. As a result, the fix to
354 lib/Encode/MIME/Header.pm is no longer neccessary but since it did
355 no harm (even speedwise) I'll leave it unreverted.
356 ! ucm/cp949.ucm ucm/cp950.ucm
358 U+00AE REGISTERED SIGN
359 were missing as a result of 1.78. Discovered by Moriyama-san.
360 Moriyama-san has also developed a test script that compares
361 (en|de)coded results to the corresponding Win32 API result and
362 all cp9?? maps are now verified.
363 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp>
365 1.78 2002/10/20 15:44:00
366 ! lib/Encode/MIME/Header.pm
367 fixed so that it works with new Encode::utf8
368 ! Encode.pm Encode.xs
369 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows
370 :encoding(UTF-8) to handle partial chars at end of buffers
372 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com>
373 ! lib/Encode/Supported.pod
374 More nitpickings applied.
377 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip
378 tests were completely useless. To redeem that and get the peace of
379 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all
380 ucm/*.ucm Since this script takes too long to finish (30 seconds on
381 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test'
382 but you can easily run that by either renaming it or:
384 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp>
385 ! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm
386 Other CJKT cp9?? also updated according to the URI below;
387 http://www.microsoft.com/typography/unicode/cscp.htm
388 + bin/ucmsort MANIFEST
389 ucmsort is a crude utility that sorts CHARMAP entries in UCM files
390 to proper order. intended for hardcore develpers only.
391 ! ucm/cp932.ucm JP/JP.pm AUTHORS
392 CP932 mapping which was based upon the mapping file at unicode.org
393 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He
394 has also supplied the patch so he was added to AUTHORS.
395 ! lib/Encode/Supported.pod
396 ISO-8859-11 != TIS 620
397 == TIS 620 + \xA0 ( )
399 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com>
401 1.77 2002/10/06 03:27:02
403 * Modified to accomodate up and comming patch by Inaba-san that
404 will fix tr/// needing eval qq{}
405 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp>
407 * pod fixes/enhancements to reflect the changes above
408 ! lib/Encode/Alias.pm
409 "Encode::TW is correct, Encode::Alias not." - /Autrijus/
410 Message-Id: <20021001015648.GB18710@not.autrijus.org>
412 1.76 2002/08/25 15:09:51
414 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
415 but naturally identical to previous version -- dankogai
417 Codepoint fixes -- autrijus
418 Message-Id: <20020805040236.GC5220@not.autrijus.org>
420 copied everything under perl-5.8.0/ext/Encode to make sure Encode
421 is in sync w/ perl core
423 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
424 Now that binmode(FH) does implicit ":bytes" revisit
425 the failing tests. The worrisome one is the Digest::MD5
426 test-- how will it fare in CRLF lands now?
428 From: Radu Greab <radu@netsoft.ro>
429 Date: Mon, 10 Jun 2002 00:40:34 +0300
430 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
431 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
432 ones which I cannot figure out.
433 ! lib/Encode/Alias.pm
434 Subject: [Encode PATCH] spurious warning
435 From: Nicholas Clark <nick@unfortu.net>
436 Date: Sun, 2 Jun 2002 20:26:22 +0100
437 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
439 1.75 2002/06/01 18:07:49
440 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
441 glibc compliance cited by Autrijus.
442 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
443 ! bin/enc2xs bin/piconv
444 Subject: Re: forewarning: usedevel and versiononly
445 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
447 1.74 2002/05/28 18:33:15
448 + ucm/null.ucm ucm/ctrl.ucm
449 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
450 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
451 Subject: Re: unicode -> &# notation
452 Message-ID: <20020518193704.GB40272@not.autrijus.org>
454 1.73 2002/05/28 17:26:18
455 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
456 Chris Nandor has fixed Encode so that it works w/ MacPerl --
457 at least w/ PPC (68k need static linking which does not work due to
458 64k limit). pudge is added to AUTHORS (I'm surprised he was not
459 there in the list). Encode/Makefile_PL.e2x was additionally fixed
460 by dankogai to reflect changes in other Makefile.PL
461 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
463 Subject: Change 16746: -Mutf8 cleanup.
464 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
466 1.72 2002/05/20 15:49:56
468 Subject: [PATCH] Encode should be in perl-core library path
469 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
470 Message-Id: <20020520161201.A11019@alpha.hut.fi>
471 ! lib/Encode/MIME/Header.pm
472 Subject: [PATCH] Encode::MIME::Header
473 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
474 ! Encode/Makefile_PL.e2x
475 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
476 Message-Id: <20020519201031.GA1603@not.autrijus.org>
477 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
478 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
480 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
481 Subject: [PATCH] good day for WinCE port of perl.
482 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
485 Even more linting by Robin via @16532
487 Even more typecast by Sarathy in @16460
489 1.71 2002/05/07 16:22:42
491 even more typecasts by Robin
492 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
494 A very strange bug that was causing a bugus ucm -> C table
495 generation that was revealed by a UCM file that Andreas was
496 working. This is the king of wierdest bug I've encountered
497 in the course of Encode maintenance.
498 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
500 1.70 2002/05/06 10:26:48
502 Made more 'module-safe' with conjunction w/ 'no encoding'.
503 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
504 ! lib/Encode/Encoding.pm
505 'require Encode' because ->Define uses Encode::define_encoding();
506 problem and solution addressed by Miyagawa-kun
507 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
509 Cuts the frill to make djgpp happier, as suggested by Laszlo
510 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
512 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
513 Message-Id: <m3bsbug48n.fsf@anima.de>
515 1.69 2002/05/04 16:41:18
516 ! lib/Encode/MIME/Header
517 Floating-point coerced for UNICOS (in integer arithmetics it folds
518 line one character too early). Verification by Mark is pending.
519 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
521 more doc patch from Elizabeth
522 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
523 ! Encode/Makefile_PL.e2x
524 More platform-independent patch from Benjamin
525 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
526 ! lib/Encode/Guess AUTHORS
527 split regex fix by Graham Barr. Adds him to AUTHORS.
528 Message-Id: <20020504085419.E95940@valueclick.com>
529 ! Encode/Makefile_PL.e2x
530 enc2xs script discovery made smarter and more sensible, first cited
531 by Miyagawa-kun and further suggestions by Rafael and Andreas
532 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
533 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
535 1.68 2002/05/03 12:20:13
536 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
537 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
538 and Supported.pod modified to reflect the change. Elizabeth added
539 to Authors. And H.M. is also added for forwarding her patch among
540 other contributions (I was rather surprised to find his name was not
542 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
544 1.67 2002/05/02 07:33:09
546 Error message now consistent w/ perlqq (\N{U+} -> \x{})
547 done in perl@16308 but Philip linted me further. Now the error
548 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
549 ! lib/Encode/Guess.pm
550 Sanity check for happier -w by Autrijus
552 1.66 2002/05/01 05:41:06
553 ! Encode.xs t/fallback.t
554 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
555 while fallback is in effect. This even came with a welcome side-effect
556 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
557 also modified to test this.
558 And of course, the corresponding varialbles to UV[Xx]f are appropriately
559 cast. This should've concluded NI-XS homework.
561 encode(undef) does warn again! Repented upon suggestion by NI-XS.
562 Document for unless vs. '' added
563 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
565 1.65 2002/04/30 16:13:37
567 encode(undef) no longer warns for C<Use of uninitialized value in
568 subroutine entry>. Suggested by Paul.
569 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
570 ! lib/Encode/Supported.pod
571 Encode::MIME::Header and Encode::Guess mentioned
572 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
573 ! lib/Encode/Guess.pm
574 POD fix by Miyagawa-kun
575 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
577 1.64 2002/04/29 06:54:06
579 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
580 encocde("euc-jisx0213") are still DIFFERENT.
581 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
583 A few white spaces corrected by NI-XS via PerlIO integration to
585 Subject: Change 16247: Integrate perlio;
587 Document fixes by Andreas
588 Message-Id: <m3k7qsf1we.fsf@anima.de>
590 1.63 2002/04/27 18:59:50
591 ! lib/Encode/Encoding.pm
592 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
593 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
594 Make use of the Encode::Encoding base class!
595 And other cleanups in Encode.xs upon NI-XS suggestions
596 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
598 1.62 2002/04/27 11:17:39
600 encodings() now just check %ExtModule instead of eval{require}
601 all of them for ":all" to conserve more memory.
603 more "%x" -> "%" UVxf stuff.
605 s/=over2/=over 2/g # oops.
607 1.61 2002/04/26 03:02:04
609 Now does decent tests besides use_ok()
610 ! lib/Encode/Guess.pm t/guess.t
611 UI streamlined, document added
613 various signed/unsigned mismatch nits (#16173)
614 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
616 POD: utf8-flag-related caveats added. A few sections completely
620 Thou shalt not assume %d works, either!
621 Robin Baker added to AUTHORS for this
622 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
624 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
626 1.60 2002/04/24 20:06:52
628 "Thou shalt not assume %x works." -- jhi
629 Message-Id: <20020424210618.E24347@alpha.hut.fi>
630 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
631 low-memory build machines happy, now *.c is created for each *.ucm
632 (no table aggregation). You can still override this by setting
633 $ENV{AGGREGATE_TABLES}.
634 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
635 + lib/Encode/Guess.pm
636 + lib/Encode/JP/JIS7.pm
637 Encoding-autodetect (mainly for Japanese encoding) added. In a
638 course of development, JIS7.pm was improved.
639 + lib/Encode/HTML/Header.pm
640 + lib/Encode/Config.pm
641 MIME B/Q Header Encoding Added!
642 ! Encode.pm Encode.xs t/fallback.t
643 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
644 Message-Id: <20020424130709.GA14211@tanglefoot>
646 1.59 $ 2002/04/22 23:54:22
647 ! Encode.pm Encode.xs
648 needs_lines() and perlio_ok() are added to Internal encodings such
649 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
651 1.58 2002/04/22 23:54:22
653 s/MacChineseSimp/MacChineseTrad/ # ... oops.
658 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
659 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
660 t/(JP|KR).t are aggregated to t/CJKT.t
661 test data are all remade via bin/ucm2text.
662 And .... They are no longer skipped for -Uuseperlio !
664 1.57 2002/04/22 20:27:30
665 ! t/JP.t t/KR.t t/perlio.t
666 unless (find PerlIO::Layer 'perlio') ... line is back again.
667 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
668 need PerlIO. Sigh....
669 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
670 ->perlio_ok now does eval{ require PerlIO::encoding } there so
671 it correctly returns 1 when PerlIO::encoding is yet loaded.
673 perl-current patch #16072 reflected
675 1.56 2002/04/22 09:48:07
676 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
677 New PerlIO::encoding 0.04 compliance met
679 1.55 2002/04/22 03:43:05
680 ! Encode.pm Encode.xs Unicode/Unicode.pm
681 needs_lines() defined so Encode::Encoding is no longer needed
684 1.54 2002/04/22 02:50:01
685 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
686 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
687 now perlio_ok is true by default if PerlIO::encoding->VERSION is
688 0.03 or larger. POD in Encode::Encoding revised to reflect this.
689 Encode::XS and Encode::Unicode now has perlio_ok() method.
690 ! lib/Encode/Supported.pod
691 s/UP-UX/HP-UX/ by jhi
692 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
693 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
694 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
695 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
696 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
697 ! lib/Encode/Supported.pod
698 Huge document fixes by Philip.
701 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
702 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
704 binmode() after "<:encoding" to make Win32 happy, by Mattia.
705 Mattia added to AUTHORS file
706 Message-Id: <3CC3150F.5798.22A05AE@localhost>
708 1.52 2002/04/20 23:43:47
710 TODO: is now SKIP:, as NI-XS requested. Also adds more
711 eraborate failure analysis added.
713 A note on how to make sure of round-trip safety added to POD
714 section (so Autrijus is happier)
715 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
716 big5-(eten|hkscs) is round-trip safe again!
717 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
719 Typo fixes by Andreas
720 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
721 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
722 PerIO coodination patches from NI-XS.
723 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
725 1.51 2002/04/20 09:58:23
727 Updated test suite by Autrijis so "make test" is happy again
728 Message-Id: <20020420082104.GA25037@not.autrijus.org>
730 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
732 TW/TW.pm TW/Makefile.PL
733 Updates by Autrijus. 'big5' is no longer a canonical but an
734 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
735 Message-Id: <20020419195346.GA19597@not.autrijus.org>
737 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
738 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
740 PerlIO detection a little bit smarter; no longer uses eval qq{}
743 1.50 2002/04/19 06:13:02
744 ! ! Encode.pm Encode.xs Encode/encoding.h
746 New Fallback API imlemented and documented. See "perldoc Encode"
748 ! lib/Encode/JP/JIS7.pm Encode.pm
749 + lib/Encode/PerlIO.pod t/perlio.t
750 API compliance met. However, it still does not work unless perlio
751 implements line buffer. See BUGS section in perldoc Encode::PerlIO
752 As a sensible workaround, perlio_ok() added to Encode.
754 ! lib/Encode/Supported.pod
756 Message-Id: <20020418174647.J8466@alpha.hut.fi>
758 Doc fixes from Autrijus
759 Message-Id: <20020418144131.GA10987@not.autrijus.org>
761 perlqq mode documented
763 + t/jisx0201.euc t/jisx0201.ref
764 ! t/jisx0208.euc t/jisx0208.ref
765 t/JP.t tests more rigorously and with other encodings
766 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
767 in jis0208 (used to contain jisx0201 part).
768 ! Encode/Makefile_PL.e2x
769 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
770 enc2xs and encode.h rather than hard-coded. This allows the resulting
771 module fully CPANizable.
772 ! encoding.pm t/JP.t t/KR.t
773 PerlIO detection simplified (checks %INC instead of eval{})
774 ! Encode.xs Encode/encode.h
775 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
776 - lib/Encode/Unicode.pm
777 (en|de)code_xs relocated to where it belongs. Source reindented
780 Additional (U8 *) cast added as suggested by jhi
781 Message-Id: <20020417165916.A28599@alpha.hut.fi>
783 1.42 Date: 2002/04/17
785 no-op module; Thought of adding a pod there but enc2xs has
790 correct mechanism to detect Perlio::encoding layar installed.
792 PerlIO Layer detached.
794 1.41 2002/04/16 23:35:00
796 binmode(STDIN|STDOUT ...) done iff PerlIO is available
798 Cleaned up PerlIO skip conditions to prepare for the upcoming
799 Encode - PerlIO forking.
801 exported functions are now prototyped.
802 ! lib/Encode/CN/HZ.pm
805 fallback implemented # was /* FIXME */
806 affected programs revised to fit (only HZ was using the try-catch
807 approach which needed to be fixed for API-compliance).
809 ! Encode/KR/2022_KR.pm
811 can find =head1 NAME now, jhi
812 Message-Id: <20020416083059.V30639@alpha.hut.fi>
816 now complies with less warnings with the pickest compilers.
817 Suggested by Craig, fixed by Dan.
818 ! Encode/Makefile_PL.e2x
820 A bug that fails to find *.e2x in certain conditions fixed
822 1.40 2002/04/14 22:27:14
823 + Encode/ConfigLocal_PM.e2x
824 ! lib/Encode/Config.pm
826 "enc2xs -C" now generates/updates Encode::ConfigLocal.
827 ConfigLocal_PM.e2x is a skelton thereof.
828 ! lib/Encode/Config.pm
830 "use Encode::CN::HZ;" was missing.
833 More rigorous tests added to test XS, especially on memory allocation.
835 ! lib/Encode/Unicode.pm
836 NI-S implemented an XS version -- merged
837 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
840 Source filter option added. With this option on, you can write
841 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
842 encodings. t/jperl.t enhanced to test this feature.
844 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
845 apply his suggestion, this degree of nitting is enough to add him
847 Message-Id: <3CB93223.291E5E2E@earthlink.net>
849 + lib/Encode/JP/JIS7.pm
850 - lib/Encode/JP/JIS.pm
851 - lib/Encode/JP/2022_JP.pm
852 - lib/Encode/JP/2022_JP1.pm
853 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
854 JIS7.pm for better maintainability and performance
856 Added caveat for non-ascii identifiers.
858 fixes by jhi, the original author of this pragramtic module.
859 Message-Id: <20020413231527.V1826@alpha.hut.fi>
861 1.34 2002/04/12 20:23:05 (Unreleased)
864 EBCDIC fixes addressed by jhi.
865 Message-Id: <20020412161844.D9383@alpha.hut.fi>
866 ! lib/Encode/Encoder.pm
867 POD fix by Miyagawa-kun
868 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
870 1.33 2002/04/10 22:28:40
872 Philip's mail address corrected.
875 ! lib/Encode/Encoder.pm
876 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
877 This adds Miyagawa-kun to AUTHORS.
878 * encoding() no longer exported by default but on demand
879 * t/Encoder.t updated to test all these
880 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
881 ! lib/Encode/Unicode.pm
882 ! lib/Encode/Supported.pm
883 Further doc fixes by Anton
885 1.32 2002/04/09 20:06:15
888 - ucm/macDevanaga.ucm Unicode Character Map
889 - ucm/macGujarati.ucm Unicode Character Map
890 - ucm/macGurmukhi.ucm Unicode Character Map
891 A utility to check integrity of .ucm files. t/bogus.ucm is a
892 ucm that is deliberately bogus. unused Indic mappings are removed
895 resolve_alias() added as suggested by jhi. Same as
896 find_encoding("alias")->name. For convenience. This one is
897 defined in Encode.pm instead of Alias.pm.
898 Message-Id: <20020409215846.H17022@alpha.hut.fi>
900 Memory Allocate but detected during the devel of ucmlint -- fixed.
901 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
902 ! lib/Encode/Unicode.pm
903 valid_ucs2(0) is false but must be true.
904 3 patches from NI-S as follows. This also has fixed the incident
906 ! lib/Encode/Alias.pm
907 find_alias() recursion prevention
909 Checks for the patch above
910 ! t/Encode/Unicode.pm
911 An extra "F" that causes valid_ucs2() return a bogus value fixed
912 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
913 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
914 2 Small Patches from jhi as follows:
916 Encode->encodings() lists in case-insensitve order (as it was)
918 -l option prints avaiable encodings to STDOUT instead of STDERR
919 ! lib/Encode/Aliases.pm
920 s/defintion/definition/
921 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
923 ! lib/Encode/Supported.pod
924 ! lib/Encode/Unicode.pm
925 POD revise by Philip Newton. This adds Philip to AUTHORS list.
926 Thank you for the exact quote of Douglas Adams :)
927 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
929 1.31 2002/04/08 18:08:07
930 ! lib/Encode/Encoder.pm
932 Encode::Encoder, once just a placeholder of an idea, is now much more
933 practical. See t/Encode.t to find how practical it can be.
934 + lib/Encode/Config.pm
936 my false laziness at Encode.pm is fixed. Now %ExtModules are set
937 in Encode::Config and they are all literally, not programatically
938 set. My false laziness was resulting many encodings missing from
940 ! lib/Encode/Unicode.pm
942 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
943 so that it does not rely Encode::Unicode for BOM values
944 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
946 1.30 2002/04/08 02:34:51
947 + lib/Encode/Encoder.pm
948 Object Oriented Encoder. I reckon something like this is in need.
951 ! lib/Encode/Supported.pod
952 * autoloading bug that prevented upper-case canonicals such as UTF-16
953 is fixed. Now even UTF/UCS are autoloaded!
954 * encodings() is now more intuitive.
955 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
957 * Obligatory fixes to the POD.
958 ! lib/Encode/Supported.pod
959 Patch from Anton applied.
960 Message-Id: <66641479.20020408033300@motor.ru>
962 ! lib/Encode/Unicode.pm
963 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
965 1.28 2002/04/07 18:58:42
969 Just a MANIFEST for those missing files.
971 1.26 Date: 2002/04/07 15:22:04
974 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
975 And t/Aliases.t fixed to make sure they all exist.
977 1.25 2002/04/07 15:01:25 (Unreleased)
979 ! lib/Encode/Unicode.pm
982 - lib/Encode/UTF_EBCDIC.pm
983 - lib/Encode/Internal.pm
985 Integrated into Encode.pm as closures. That way "one package, one file"
986 rule is preserved yet less files to require.
988 commented out binmode(STDERR ...
997 ! Encode/Makefile_PL.e2x
998 Schwarn's MM-compliance patch merged
999 Message-Id: <20020406082609.GA28758@blackrider>
1001 ! lib/Encode/Unicode.pm
1002 + lib/Encode/UTF_EBCDIC.pm
1004 - lib/Encode/10646_1.pm
1005 - lib/Encode/ucs2_le.pm
1006 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
1007 per-module based (en|de)code, I saved a number of .pm by
1008 reorganizing it as per-object base (Well, this is what Encode::XS
1009 does under the hood). See Encode::Unicode for details.
1010 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
1011 This module is used only on EBCDIC environments.
1013 1.21 2002/04/05 14:46:34 (Not Released)
1019 Are back to make Perl/Tk happy Smile, NI-S.
1021 ! lib/Encode/Alias.pm
1022 ! lib/Encode/Supported.pm
1023 ! lib/Encode/10646_1.pm
1024 ! lib/Encode/ucs2_le.pm
1025 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
1026 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
1027 updated to reflect changes.
1028 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
1029 ! lib/Encode/Alias.pm
1030 ! lib/Encode/Supported.pm
1031 Anton's revision commited. Added Dan's own fixes as well.
1032 Message-Id: <159103166906.20020405161134@motor.ru>
1033 ! lib/Encode/Alias.pm
1035 < qr/^UCS2-le$/i => '"UCS-2"', );
1037 > qr/^UCS2-LE$/i => '"UTF-16LE"');
1038 Sigh. Thank you, Anton.
1039 Message-Id: <14567692196.20020405062020@motor.ru>
1040 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
1042 1.20 2002/04/04 19:50:52
1044 the last minute addtion. Just give it a try. Docs remains to be done.
1045 Not installed by default.
1046 ! lib/Encode/Supported.pod
1049 ! lib/Encode/Alias.pm
1050 ! lib/Encode/utf8.pm
1051 ! lib/Encode/10464_1.pm
1052 ! lib/Encode/ucs2_le.pm
1053 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
1054 unchanged but UTF-16BE is added as an alias. Implicit aliases
1055 move to Encode::Alias so init_alias() works more as expected.
1056 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
1057 Though pedantically wrong, This should make perl mongers happier.
1058 t/Alias.t is enhanced to test all these.
1059 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
1061 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
1062 of the codepoints are common. CJKT left as is because the saving is
1066 ! EBCDIC/Makefile.PL
1068 ! Encode/Makefile_PL.e2x
1072 ! Symbol/Makefile.PL
1076 All occurance of _def.h replaced with .exh so djgpp works happily
1077 ever after! To credit this amazing discovery, Laszlo is now in
1079 Message-Id: <20020403181424.GA8778@freemail.hu>
1080 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
1083 ! Encode/Makefile_PL.skel
1085 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
1086 Message-Id: <20020401222744.GX2000@blackrider>, et al.
1088 Two more tests by added jhi
1089 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
1092 The showstopper fixed -- Memory reallocation bug was causing
1093 Encode::XS to fall into infinite loop on certain conditions.
1094 t/grow.t tests that.
1095 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
1100 ! lib/Encode/Supported.pod
1101 Vendor encodings rebuilt out of original map files at unicode.org.
1102 Indic languages such as MacDevanagali remain unspported do to the
1103 shortcoming of encengine capabilities (they need algorithmical
1104 conversion and I have no knowledge on that!). Pods fixed for added
1106 Oh, macJapan.ucm renamed to macJapanese.ucm.
1107 macROMnn is macRomanian and macRUMnn is macRumanian.
1108 txt2ucm is a crude script that is used to convert them.
1110 Unicode Compound Characters (used extensively on Mac) supported
1112 Typo fixes and improvements by jhi
1113 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
1115 1.11 2002/03/31 22:12:13
1119 Missing files from the MANIFEST fixed.
1120 Message-Id: <20020401010156.H10509@alpha.hut.fi>
1121 Version incremented just to make CPAN happy.
1123 1.10 2002/03/31 21:32:42
1126 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
1127 if you want. This should make Autrijus happy. Also, piconv
1128 is added to default install.
1131 Here-documented files that enc2xs generates are now exported
1132 to *.e2x. Much cleaner and easier to debug.
1134 encoding enhances so you can make it act more like such
1135 (now prehistoric ) "localized" variations of perl like Jperl.
1137 Further test for encoding.pm. Written in euc-jp
1140 Taken over form jhi.
1141 Message-Id: <20020330174618.B10154@alpha.hut.fi>
1146 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
1153 ! Encode/macIceland.ucm
1154 ! lib/Encode/Alias.pm
1155 ! lib/Encode/Supported.pod
1156 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
1157 Message-Id: <m3lmcavhjt.fsf@anima.de>
1159 1.01 2002/03/29 20:59:39
1162 s/USE_SCRIPTS/MORE_SCRIPTS/
1164 installs enc2xs by default for external Encode:: modules in CPAN,
1165 such as Encode::HanExtra
1167 More sensible perl core detection via $ENV{PERL_CORE}
1169 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
1171 Perl core ditection via $^X =~ m/\bminiperl$/o
1172 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
1174 1.00 Wed Mar 29 2002
1176 The version of all files is updated to 1.00 via "ci -f -l1.00",
1177 commemorating version 1.00. All files, including *.ucm are now
1178 under version control.
1181 encode.h moved to Encode/ so it will be installed for the later
1184 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
1189 compile renamed to enc2xs.
1190 Affected Makefle.PL updated
1192 "Punt it. HanExtra can take care of that later." -- Autrijus
1193 Message-Id: <20020328154338.GA7351@not.autrijus.org>
1196 ! Encode/ksc5601.ucm
1197 ! lib/Encode/CJKConstants.pm
1198 ! lib/Encode/KR/2022_KR.pm
1199 Table patches for Euro Signs, 2022-KR fixups by Jungshik
1200 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
1204 bin/ added for example scripts. They are not installed by default.
1205 to install them, "perl Makefile.PL USE_SCRIPTS".
1206 piconv is iconv reinvented in perl. in addition to all features
1207 of iconv, it also adds perlish features. See L<piconv/1> for more
1209 ! lib/Encode/Alias.pm
1210 qr/^ replaced with qr/\b so it directly matches locale names
1211 such as en_US.US-ASCII
1214 Patch by MJD to fix the following problem applied.
1215 Subject: [PATCH 5.7.3 Encode]
1216 Aliases.t not properly skipped when Encode extension not built
1217 Message-Id: <20020328091850.18677.qmail@plover.com>
1218 ! lib/Encode/KR/2022_KR.pm
1219 ! lib/Encode/CJKConstants.pm
1220 Another patch from Jungshik to make iso-2022-kr actually work
1221 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
1222 ! Encode/Encode/euc-kr.ucm
1223 + Encode/Encode/johab.ucm
1224 ! Encode/Encode/ksc5601.ucm
1226 ! Encode/KR/Makefile.PL
1227 ! Encode/lib/Encode/Alias.pm
1229 Johab support and complete revision of Korean Encoding by Jungshik
1230 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
1232 Revised to make up with now-dropped Encode::Details.
1233 - lib/Encode/Details.pod
1234 Dropped. Besides being obsolete, the topics are now covered in
1235 respective pods now.
1240 Korean aliases fixed thanks to Jungshik Shin
1241 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
1242 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
1244 =head1 NAME added to all modules to make buildtoc happy
1245 Message-Id: <20020327041151.A10618@alpha.hut.fi>
1246 - lib/Encode/CJKguide.pod
1247 Too controversial and dropped from the dist. Will be available
1248 separately on the web.
1250 RCS tags added so table debugging gets easier (should that be
1251 needed! I hope they all stay 1.00!)
1252 + lib/Encode/CJKguide.pod
1253 A detailed guide to mainly, but not limited to, CJK multibyte
1256 + Encode/hp-roman8.ucm
1258 ! Encode/Supported.pod
1259 All occurance of "roman8" replaced with "hp-roman8" to avoid
1261 ! Encode/Supported.pod
1264 Mac Encodings now comply the Inside Macintosh
1266 Test for '-raw' conventions added.
1268 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
1269 ! Encode/gb12345.ucm
1271 ! Encode/ksc5601.ucm
1272 "-raw" appended to canonical names.
1273 File mames stay unchanged thanks to UCM format.
1274 ! lib/Encode/CN/HZ.pm
1275 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
1276 Message-Id: <20020326035210.GA2091@not.autrijus.org>
1278 0.99 Tue Mar 26 2002
1279 - lib/Encode/JP/Const.pm
1280 + lib/Encode/CJKConstants.pm
1281 + lib/Encode/CN/2022_CN.pm
1282 + lib/Encode/KR/2022_KR.pm
1291 * Support for ISO-2022-KR and ISO-2022-CN added.
1293 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
1294 * ucm2table autogenerates character table out of UCM files.
1297 - lib/Encode/Supports.pod
1298 + lib/Encode/Supported.pod
1299 Names reverted due to popular demand.
1300 8.3 rule applies only when there is a conflict.
1301 Message-Id: <20020325095924.GD44120@not.autrijus.org>
1306 - lib/Encode/Format/Enc.pod
1308 * Character tables is now 100% ucm.
1309 * All files under Encode/ is now 8.3-compliant
1310 * some of missing encodings added (i.e. gsm0338 and nextstep)
1311 * Vendor mappings aggregated with appropriate national std in
1312 Makefile.PL, resulting smaller *.so especially for CJK.
1313 Following is result on Dan's FreeBSD box.
1315 ---------------------------------------------------------------
1316 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
1317 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
1318 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
1319 blib/arch/auto/Encode/Encode.so 27,791 27,791
1320 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
1321 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
1322 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
1323 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
1324 ---------------------------------------------------------------
1325 Total 5,375,297 6,343,819
1327 * As a result of ucm-transition, Encode::Tcl dropped because
1328 Encode::Tcl demands *.enc.
1329 Encode::Tcl will be supplied in a separate tarball with *.enc.
1330 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
1335 -lib/Encode/Supported.pod
1336 +lib/Encode/Supports.pod
1337 -lib/Encode/iso10646_1.pm
1338 +lib/Encode/10646_1.pm
1339 -lib/Encode/EncFormat.pod
1340 +lib/Encode/Format/Enc.pod
1341 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
1342 - lib/Encode/JP/Constants.pm
1343 + lib/Encode/JP/Consts.pm
1344 ! lib/Encode/JP/JIS.pm
1345 ! lib/Encode/JP/H2Z.pm
1346 Version nit problem and 8.3 rule fix.
1347 > Package namespace installed latest in CPAN file
1348 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
1349 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
1350 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
1351 accordingly. In addition, legacy "use vars qw()..." are replaced with
1353 Message-Id: <20020325011248.D1561@alpha.hut.fi>
1354 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
1356 - lib/Encode/JP/ISO_2022_JP.pm
1357 - lib/Encode/JP/ISO_2022_JP_1.pm
1358 + lib/Encode/JP/2022_JP.pm
1359 + lib/Encode/JP/2022_JP1.pm
1361 8.3 naming conflict for vanilla fat addressed by jhi
1362 Message-Id: <20020324201931.V22596@alpha.hut.fi>
1365 Typecast fix addressed by jhi
1366 Message-Id: <20020324185540.T22596@alpha.hut.fi>
1368 0.98 Mon Mar 25 2002
1369 ! lib/Encode/Supported.pod
1371 + lib/Encode/JP/ISO_2022_JP_1.pm
1372 ! lib/Encode/JP/ISO_2022_JP.pm
1373 ! lib/Encode/JP/JIS.pm
1375 Now Encode::JP is more strict on the difference between ISO-2022-JP
1376 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
1377 makes Anton happier :) FYI the previous version implements
1378 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
1379 ! lib/Encode/Supported.pod
1382 Avoid core-dump in Encode with PERLIO=mmap by NI-S
1383 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
1388 ! lib/Encode/Suppoted.pod
1389 pod fixes to replace F<http://...> to L<http://...>,
1390 as suggested by Autrijius in:
1391 Message-Id: <20020324083943.GA14901@not.autrijus.org>
1392 ! lib/Encode/Suppoted.pod
1393 fixes and enhancements by Anton
1394 Message-Id: <10632060120.20020324103753@motor.ru>
1395 ! lib/Encode/Alias.pm
1396 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
1397 added. Suggested by Anton then deobfuscated by Autrijius
1398 Message-Id: <20020324064455.GA3667@not.autrijus.org>
1400 Further fix by Nicholas Clark
1401 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1402 - lib/EncodeFormat.pod
1403 + lib/Encode/EncFormat.pod
1405 File renamed as suggested by Autrijius
1407 ! lib/Encode/Details.pod
1408 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1409 ! Encode.pm Sun Mar 24 13:43:47 2002
1410 pod fixes by Autrijius.
1411 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1412 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1414 ! lib/Encode/Alias.pm
1416 now more EBCDIC conscious;
1417 %ExtModules on EBCDIC system excludes CJK so that you don't
1418 have to worry about the matched alias resulting cloaking.
1419 t/Alias.t also revised to reflect changes. Verified by jhi
1420 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1422 0.97 Sun Mar 24 2002
1426 EBCDIC detection mechanism installed as in JP/JP.pm
1427 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1430 ! EBCDIC/Makefile.PL
1433 ! Symbol/Makefile.PL
1435 Now all table files used by compile are postfixed '_t' to avoid
1436 namespace collisions in case insensitive file systems once for all!
1438 Message-ID: <58290227735.20020323195659@familiehaase.de>
1440 Since the Encode::JP is unsupported under EBCDIC we
1441 cannot run this test (aliases as such should work fine) -- jhi
1442 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1444 duplicate occurance of ascii.ucm and 8859-1.ucm
1445 causes MacOS X dlyd to cloak
1451 < chdir 't' if -d 't';
1453 > if (! -d 'blib' and -d 't'){ chdir 't' };
1454 When you are "make test"-ing on Encode/ directory, you must not
1455 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1456 remain unchanced. Also the situation detection was made simpler
1457 in t/JP.t, which was originally;
1458 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1460 "Use of uninitialized value in string eq at Encode.pm line 96."
1461 ! Symbol/Makefile.PL
1462 ! EBCDIC/Makefile.PL
1464 -- Problem on case insensitive file systems
1465 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1466 Message-ID: <88254111953.20020323095503@familiehaase.de>
1469 "So I think it's a bug in gcc, not perl. But it still needs to be
1471 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1472 Message-Id: <20020323170509.C96475@plum.flirble.org>
1474 0.96 Sat Mar 23 2002
1476 ! lib/Encode/Encoding.pm
1477 ! lib/Encode/Alias.pm
1478 ! lib/Encode/Supported.pod
1480 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1481 Message-ID: <20020322073908.GB10539@blackrider>
1484 "...I think we should include ISO 8859-1 as well." -- NI-S
1485 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1490 ! lib/Encode/Alias.pm
1491 alias definitions relocated to Encode::Alias so module autoloading
1492 works for aliases also.
1494 encodings() now accepts args to check ExtModules.
1498 + EBCDIC/Makefile.PL
1499 + Symbol/Makefile.PL
1503 Latin and single byte encodings are reorganized so they are
1504 demand-loaded like Encode::XX. Now only ascii is compiled into
1506 ! lib/Encode/Alias.pm
1507 for my $k (keys %hash){ delete $hash{$k}; }
1508 is depreciated; fixed.
1510 0.95 Fri Mar 22 2002
1511 In this update, pod rewrites and alias fixes are the main issues
1512 + lib/Encode/Supported.pod
1513 Describes supported encodings
1515 streamlined compiled-in encodings.
1516 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1518 + Encode/ibm-125?.ucm
1519 Added from icu distibution with any occurance of
1520 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1521 some respect to icu staff, however.
1522 + lib/Encode/Alias.pm
1524 Alias difinitions in Encode.pm relocated.
1527 packWARN patch from Paul Marquess via jhi
1528 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1529 Paul added to AUTHORS as a result.
1530 ! t/CJKalias.t -> t/Aliases.t
1531 Renamed. Checks even more aliases and alias overloading
1534 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1535 gbk => cp936 relocated to CN.pm
1537 Test::More with plans (by jhi)
1539 0.94 Thu Mar 21 2002
1540 + lib/Encode/Description.pod
1541 ! lib/Encode/Encoding.pm
1542 Now the pod in Encode.pm is abridged as programming references.
1543 lib/Encode/Description.pod contains the original, detailed description
1544 and Encode::Encoding explains how to write your own module to
1545 add new encodings. So far, lib/Encode/Description.pod contains
1546 the whole pod once in Encode.pm. This is intentional.
1548 Pod revisions by Anton Tagunov
1549 Message-Id: <517178431.20020320174824@motor.ru>
1551 all occrance of Encode::Tcl::Extended removed including pod
1553 test now checks $encoding->name only; $encoding->{name} are
1554 no longer check to find the canonical name.
1555 ! lib/Encode/JP/JIS.pm
1556 ! lib/Encode/JP/ISO_2022_JP.pm
1557 ->name() added to be more compliant with API
1563 Patch by Autrijus to add aliases to TW and fixes to POD
1564 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1566 SADAHIRO Tomoyuki added as should. My apologies.
1568 0.93 Wed Mar 20 2002
1569 * First release to be uploaded to CPAN. For prehistoric changes,
1570 please see Changes file of perl distibution as well as
1571 perl-unicode@perl.org archive, available at:
1572 http://archive.develooper.com/perl-unicode@perl.org/
1574 Changes Since 0.92 includes;
1579 + Mention to perl-unicode@perl.org added
1581 + Encoding aliases added so you can feed locale names
1582 and MIME Charset="" directly.
1583 - Mention to JISX0212 removed because it's fixed
1586 + Encoding aliases added. Note TW is left untouched because
1587 euc-tw is not implemented in TW but in Encode::HanExtra.
1588 Autrijus, you may fix Encode::HanExtra.
1590 + to test encode aliases added