1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.76 2002/08/25 15:09:51 dankogai Exp dankogai $
6 $Revision: 1.76 $ $Date: 2002/08/25 15:09:51 $
8 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated
9 but naturally identical to previous version -- dankogai
11 Codepoint fixes -- autrijus
12 Message-Id: <20020805040236.GC5220@not.autrijus.org>
14 copied everything under perl-5.8.0/ext/Encode to make sure Encode
15 is in sync w/ perl core
17 Change 17175 by jhi@alpha on 2002/06/10 23:24:42
18 Now that binmode(FH) does implicit ":bytes" revisit
19 the failing tests. The worrisome one is the Digest::MD5
20 test-- how will it fare in CRLF lands now?
22 From: Radu Greab <radu@netsoft.ro>
23 Date: Mon, 10 Jun 2002 00:40:34 +0300
24 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro>
25 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t
26 ones which I cannot figure out.
28 Subject: [Encode PATCH] spurious warning
29 From: Nicholas Clark <nick@unfortu.net>
30 Date: Sun, 2 Jun 2002 20:26:22 +0100
31 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net>
33 1.75 2002/06/01 18:07:49
34 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
35 glibc compliance cited by Autrijus.
36 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
37 ! bin/enc2xs bin/piconv
38 Subject: Re: forewarning: usedevel and versiononly
39 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
41 1.74 2002/05/28 18:33:15
42 + ucm/null.ucm ucm/ctrl.ucm
43 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
44 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
45 Subject: Re: unicode -> &# notation
46 Message-ID: <20020518193704.GB40272@not.autrijus.org>
48 1.73 2002/05/28 17:26:18
49 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
50 Chris Nandor has fixed Encode so that it works w/ MacPerl --
51 at least w/ PPC (68k need static linking which does not work due to
52 64k limit). pudge is added to AUTHORS (I'm surprised he was not
53 there in the list). Encode/Makefile_PL.e2x was additionally fixed
54 by dankogai to reflect changes in other Makefile.PL
55 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
57 Subject: Change 16746: -Mutf8 cleanup.
58 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
60 1.72 2002/05/20 15:49:56
62 Subject: [PATCH] Encode should be in perl-core library path
63 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
64 Message-Id: <20020520161201.A11019@alpha.hut.fi>
65 ! lib/Encode/MIME/Header.pm
66 Subject: [PATCH] Encode::MIME::Header
67 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
68 ! Encode/Makefile_PL.e2x
69 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
70 Message-Id: <20020519201031.GA1603@not.autrijus.org>
71 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
72 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
74 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
75 Subject: [PATCH] good day for WinCE port of perl.
76 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
79 Even more linting by Robin via @16532
81 Even more typecast by Sarathy in @16460
83 1.71 2002/05/07 16:22:42
85 even more typecasts by Robin
86 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
88 A very strange bug that was causing a bugus ucm -> C table
89 generation that was revealed by a UCM file that Andreas was
90 working. This is the king of wierdest bug I've encountered
91 in the course of Encode maintenance.
92 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
94 1.70 2002/05/06 10:26:48
96 Made more 'module-safe' with conjunction w/ 'no encoding'.
97 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
98 ! lib/Encode/Encoding.pm
99 'require Encode' because ->Define uses Encode::define_encoding();
100 problem and solution addressed by Miyagawa-kun
101 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
103 Cuts the frill to make djgpp happier, as suggested by Laszlo
104 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
106 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
107 Message-Id: <m3bsbug48n.fsf@anima.de>
109 1.69 2002/05/04 16:41:18
110 ! lib/Encode/MIME/Header
111 Floating-point coerced for UNICOS (in integer arithmetics it folds
112 line one character too early). Verification by Mark is pending.
113 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
115 more doc patch from Elizabeth
116 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
117 ! Encode/Makefile_PL.e2x
118 More platform-independent patch from Benjamin
119 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
120 ! lib/Encode/Guess AUTHORS
121 split regex fix by Graham Barr. Adds him to AUTHORS.
122 Message-Id: <20020504085419.E95940@valueclick.com>
123 ! Encode/Makefile_PL.e2x
124 enc2xs script discovery made smarter and more sensible, first cited
125 by Miyagawa-kun and further suggestions by Rafael and Andreas
126 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
127 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
129 1.68 2002/05/03 12:20:13
130 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
131 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
132 and Supported.pod modified to reflect the change. Elizabeth added
133 to Authors. And H.M. is also added for forwarding her patch among
134 other contributions (I was rather surprised to find his name was not
136 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
138 1.67 2002/05/02 07:33:09
140 Error message now consistent w/ perlqq (\N{U+} -> \x{})
141 done in perl@16308 but Philip linted me further. Now the error
142 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
143 ! lib/Encode/Guess.pm
144 Sanity check for happier -w by Autrijus
146 1.66 2002/05/01 05:41:06
147 ! Encode.xs t/fallback.t
148 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
149 while fallback is in effect. This even came with a welcome side-effect
150 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
151 also modified to test this.
152 And of course, the corresponding varialbles to UV[Xx]f are appropriately
153 cast. This should've concluded NI-XS homework.
155 encode(undef) does warn again! Repented upon suggestion by NI-XS.
156 Document for unless vs. '' added
157 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
159 1.65 2002/04/30 16:13:37
161 encode(undef) no longer warns for C<Use of uninitialized value in
162 subroutine entry>. Suggested by Paul.
163 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
164 ! lib/Encode/Supported.pod
165 Encode::MIME::Header and Encode::Guess mentioned
166 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
167 ! lib/Encode/Guess.pm
168 POD fix by Miyagawa-kun
169 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
171 1.64 2002/04/29 06:54:06
173 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
174 encocde("euc-jisx0213") are still DIFFERENT.
175 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
177 A few white spaces corrected by NI-XS via PerlIO integration to
179 Subject: Change 16247: Integrate perlio;
181 Document fixes by Andreas
182 Message-Id: <m3k7qsf1we.fsf@anima.de>
184 1.63 2002/04/27 18:59:50
185 ! lib/Encode/Encoding.pm
186 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
187 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
188 Make use of the Encode::Encoding base class!
189 And other cleanups in Encode.xs upon NI-XS suggestions
190 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
192 1.62 2002/04/27 11:17:39
194 encodings() now just check %ExtModule instead of eval{require}
195 all of them for ":all" to conserve more memory.
197 more "%x" -> "%" UVxf stuff.
199 s/=over2/=over 2/g # oops.
201 1.61 2002/04/26 03:02:04
203 Now does decent tests besides use_ok()
204 ! lib/Encode/Guess.pm t/guess.t
205 UI streamlined, document added
207 various signed/unsigned mismatch nits (#16173)
208 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
210 POD: utf8-flag-related caveats added. A few sections completely
214 Thou shalt not assume %d works, either!
215 Robin Baker added to AUTHORS for this
216 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
218 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
220 1.60 2002/04/24 20:06:52
222 "Thou shalt not assume %x works." -- jhi
223 Message-Id: <20020424210618.E24347@alpha.hut.fi>
224 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
225 low-memory build machines happy, now *.c is created for each *.ucm
226 (no table aggregation). You can still override this by setting
227 $ENV{AGGREGATE_TABLES}.
228 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
229 + lib/Encode/Guess.pm
230 + lib/Encode/JP/JIS7.pm
231 Encoding-autodetect (mainly for Japanese encoding) added. In a
232 course of development, JIS7.pm was improved.
233 + lib/Encode/HTML/Header.pm
234 + lib/Encode/Config.pm
235 MIME B/Q Header Encoding Added!
236 ! Encode.pm Encode.xs t/fallback.t
237 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
238 Message-Id: <20020424130709.GA14211@tanglefoot>
240 1.59 $ 2002/04/22 23:54:22
241 ! Encode.pm Encode.xs
242 needs_lines() and perlio_ok() are added to Internal encodings such
243 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
245 1.58 2002/04/22 23:54:22
247 s/MacChineseSimp/MacChineseTrad/ # ... oops.
252 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
253 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
254 t/(JP|KR).t are aggregated to t/CJKT.t
255 test data are all remade via bin/ucm2text.
256 And .... They are no longer skipped for -Uuseperlio !
258 1.57 2002/04/22 20:27:30
259 ! t/JP.t t/KR.t t/perlio.t
260 unless (find PerlIO::Layer 'perlio') ... line is back again.
261 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
262 need PerlIO. Sigh....
263 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
264 ->perlio_ok now does eval{ require PerlIO::encoding } there so
265 it correctly returns 1 when PerlIO::encoding is yet loaded.
267 perl-current patch #16072 reflected
269 1.56 2002/04/22 09:48:07
270 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
271 New PerlIO::encoding 0.04 compliance met
273 1.55 2002/04/22 03:43:05
274 ! Encode.pm Encode.xs Unicode/Unicode.pm
275 needs_lines() defined so Encode::Encoding is no longer needed
278 1.54 2002/04/22 02:50:01
279 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
280 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
281 now perlio_ok is true by default if PerlIO::encoding->VERSION is
282 0.03 or larger. POD in Encode::Encoding revised to reflect this.
283 Encode::XS and Encode::Unicode now has perlio_ok() method.
284 ! lib/Encode/Supported.pod
285 s/UP-UX/HP-UX/ by jhi
286 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
287 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
288 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
289 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
290 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
291 ! lib/Encode/Supported.pod
292 Huge document fixes by Philip.
295 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
296 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
298 binmode() after "<:encoding" to make Win32 happy, by Mattia.
299 Mattia added to AUTHORS file
300 Message-Id: <3CC3150F.5798.22A05AE@localhost>
302 1.52 2002/04/20 23:43:47
304 TODO: is now SKIP:, as NI-XS requested. Also adds more
305 eraborate failure analysis added.
307 A note on how to make sure of round-trip safety added to POD
308 section (so Autrijus is happier)
309 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
310 big5-(eten|hkscs) is round-trip safe again!
311 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
313 Typo fixes by Andreas
314 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
315 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
316 PerIO coodination patches from NI-XS.
317 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
319 1.51 2002/04/20 09:58:23
321 Updated test suite by Autrijis so "make test" is happy again
322 Message-Id: <20020420082104.GA25037@not.autrijus.org>
324 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
326 TW/TW.pm TW/Makefile.PL
327 Updates by Autrijus. 'big5' is no longer a canonical but an
328 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
329 Message-Id: <20020419195346.GA19597@not.autrijus.org>
331 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
332 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
334 PerlIO detection a little bit smarter; no longer uses eval qq{}
337 1.50 2002/04/19 06:13:02
338 ! ! Encode.pm Encode.xs Encode/encoding.h
340 New Fallback API imlemented and documented. See "perldoc Encode"
342 ! lib/Encode/JP/JIS7.pm Encode.pm
343 + lib/Encode/PerlIO.pod t/perlio.t
344 API compliance met. However, it still does not work unless perlio
345 implements line buffer. See BUGS section in perldoc Encode::PerlIO
346 As a sensible workaround, perlio_ok() added to Encode.
348 ! lib/Encode/Supported.pod
350 Message-Id: <20020418174647.J8466@alpha.hut.fi>
352 Doc fixes from Autrijus
353 Message-Id: <20020418144131.GA10987@not.autrijus.org>
355 perlqq mode documented
357 + t/jisx0201.euc t/jisx0201.ref
358 ! t/jisx0208.euc t/jisx0208.ref
359 t/JP.t tests more rigorously and with other encodings
360 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
361 in jis0208 (used to contain jisx0201 part).
362 ! Encode/Makefile_PL.e2x
363 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
364 enc2xs and encode.h rather than hard-coded. This allows the resulting
365 module fully CPANizable.
366 ! encoding.pm t/JP.t t/KR.t
367 PerlIO detection simplified (checks %INC instead of eval{})
368 ! Encode.xs Encode/encode.h
369 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
370 - lib/Encode/Unicode.pm
371 (en|de)code_xs relocated to where it belongs. Source reindented
374 Additional (U8 *) cast added as suggested by jhi
375 Message-Id: <20020417165916.A28599@alpha.hut.fi>
377 1.42 Date: 2002/04/17
379 no-op module; Thought of adding a pod there but enc2xs has
384 correct mechanism to detect Perlio::encoding layar installed.
386 PerlIO Layer detached.
388 1.41 2002/04/16 23:35:00
390 binmode(STDIN|STDOUT ...) done iff PerlIO is available
392 Cleaned up PerlIO skip conditions to prepare for the upcoming
393 Encode - PerlIO forking.
395 exported functions are now prototyped.
396 ! lib/Encode/CN/HZ.pm
399 fallback implemented # was /* FIXME */
400 affected programs revised to fit (only HZ was using the try-catch
401 approach which needed to be fixed for API-compliance).
403 ! Encode/KR/2022_KR.pm
405 can find =head1 NAME now, jhi
406 Message-Id: <20020416083059.V30639@alpha.hut.fi>
410 now complies with less warnings with the pickest compilers.
411 Suggested by Craig, fixed by Dan.
412 ! Encode/Makefile_PL.e2x
414 A bug that fails to find *.e2x in certain conditions fixed
416 1.40 2002/04/14 22:27:14
417 + Encode/ConfigLocal_PM.e2x
418 ! lib/Encode/Config.pm
420 "enc2xs -C" now generates/updates Encode::ConfigLocal.
421 ConfigLocal_PM.e2x is a skelton thereof.
422 ! lib/Encode/Config.pm
424 "use Encode::CN::HZ;" was missing.
427 More rigorous tests added to test XS, especially on memory allocation.
429 ! lib/Encode/Unicode.pm
430 NI-S implemented an XS version -- merged
431 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
434 Source filter option added. With this option on, you can write
435 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
436 encodings. t/jperl.t enhanced to test this feature.
438 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
439 apply his suggestion, this degree of nitting is enough to add him
441 Message-Id: <3CB93223.291E5E2E@earthlink.net>
443 + lib/Encode/JP/JIS7.pm
444 - lib/Encode/JP/JIS.pm
445 - lib/Encode/JP/2022_JP.pm
446 - lib/Encode/JP/2022_JP1.pm
447 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
448 JIS7.pm for better maintainability and performance
450 Added caveat for non-ascii identifiers.
452 fixes by jhi, the original author of this pragramtic module.
453 Message-Id: <20020413231527.V1826@alpha.hut.fi>
455 1.34 2002/04/12 20:23:05 (Unreleased)
458 EBCDIC fixes addressed by jhi.
459 Message-Id: <20020412161844.D9383@alpha.hut.fi>
460 ! lib/Encode/Encoder.pm
461 POD fix by Miyagawa-kun
462 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
464 1.33 2002/04/10 22:28:40
466 Philip's mail address corrected.
469 ! lib/Encode/Encoder.pm
470 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
471 This adds Miyagawa-kun to AUTHORS.
472 * encoding() no longer exported by default but on demand
473 * t/Encoder.t updated to test all these
474 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
475 ! lib/Encode/Unicode.pm
476 ! lib/Encode/Supported.pm
477 Further doc fixes by Anton
479 1.32 2002/04/09 20:06:15
482 - ucm/macDevanaga.ucm Unicode Character Map
483 - ucm/macGujarati.ucm Unicode Character Map
484 - ucm/macGurmukhi.ucm Unicode Character Map
485 A utility to check integrity of .ucm files. t/bogus.ucm is a
486 ucm that is deliberately bogus. unused Indic mappings are removed
489 resolve_alias() added as suggested by jhi. Same as
490 find_encoding("alias")->name. For convenience. This one is
491 defined in Encode.pm instead of Alias.pm.
492 Message-Id: <20020409215846.H17022@alpha.hut.fi>
494 Memory Allocate but detected during the devel of ucmlint -- fixed.
495 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
496 ! lib/Encode/Unicode.pm
497 valid_ucs2(0) is false but must be true.
498 3 patches from NI-S as follows. This also has fixed the incident
500 ! lib/Encode/Alias.pm
501 find_alias() recursion prevention
503 Checks for the patch above
504 ! t/Encode/Unicode.pm
505 An extra "F" that causes valid_ucs2() return a bogus value fixed
506 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
507 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
508 2 Small Patches from jhi as follows:
510 Encode->encodings() lists in case-insensitve order (as it was)
512 -l option prints avaiable encodings to STDOUT instead of STDERR
513 ! lib/Encode/Aliases.pm
514 s/defintion/definition/
515 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
517 ! lib/Encode/Supported.pod
518 ! lib/Encode/Unicode.pm
519 POD revise by Philip Newton. This adds Philip to AUTHORS list.
520 Thank you for the exact quote of Douglas Adams :)
521 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
523 1.31 2002/04/08 18:08:07
524 ! lib/Encode/Encoder.pm
526 Encode::Encoder, once just a placeholder of an idea, is now much more
527 practical. See t/Encode.t to find how practical it can be.
528 + lib/Encode/Config.pm
530 my false laziness at Encode.pm is fixed. Now %ExtModules are set
531 in Encode::Config and they are all literally, not programatically
532 set. My false laziness was resulting many encodings missing from
534 ! lib/Encode/Unicode.pm
536 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
537 so that it does not rely Encode::Unicode for BOM values
538 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
540 1.30 2002/04/08 02:34:51
541 + lib/Encode/Encoder.pm
542 Object Oriented Encoder. I reckon something like this is in need.
545 ! lib/Encode/Supported.pod
546 * autoloading bug that prevented upper-case canonicals such as UTF-16
547 is fixed. Now even UTF/UCS are autoloaded!
548 * encodings() is now more intuitive.
549 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
551 * Obligatory fixes to the POD.
552 ! lib/Encode/Supported.pod
553 Patch from Anton applied.
554 Message-Id: <66641479.20020408033300@motor.ru>
556 ! lib/Encode/Unicode.pm
557 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
559 1.28 2002/04/07 18:58:42
563 Just a MANIFEST for those missing files.
565 1.26 Date: 2002/04/07 15:22:04
568 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
569 And t/Aliases.t fixed to make sure they all exist.
571 1.25 2002/04/07 15:01:25 (Unreleased)
573 ! lib/Encode/Unicode.pm
576 - lib/Encode/UTF_EBCDIC.pm
577 - lib/Encode/Internal.pm
579 Integrated into Encode.pm as closures. That way "one package, one file"
580 rule is preserved yet less files to require.
582 commented out binmode(STDERR ...
591 ! Encode/Makefile_PL.e2x
592 Schwarn's MM-compliance patch merged
593 Message-Id: <20020406082609.GA28758@blackrider>
595 ! lib/Encode/Unicode.pm
596 + lib/Encode/UTF_EBCDIC.pm
598 - lib/Encode/10646_1.pm
599 - lib/Encode/ucs2_le.pm
600 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
601 per-module based (en|de)code, I saved a number of .pm by
602 reorganizing it as per-object base (Well, this is what Encode::XS
603 does under the hood). See Encode::Unicode for details.
604 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
605 This module is used only on EBCDIC environments.
607 1.21 2002/04/05 14:46:34 (Not Released)
613 Are back to make Perl/Tk happy Smile, NI-S.
615 ! lib/Encode/Alias.pm
616 ! lib/Encode/Supported.pm
617 ! lib/Encode/10646_1.pm
618 ! lib/Encode/ucs2_le.pm
619 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
620 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
621 updated to reflect changes.
622 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
623 ! lib/Encode/Alias.pm
624 ! lib/Encode/Supported.pm
625 Anton's revision commited. Added Dan's own fixes as well.
626 Message-Id: <159103166906.20020405161134@motor.ru>
627 ! lib/Encode/Alias.pm
629 < qr/^UCS2-le$/i => '"UCS-2"', );
631 > qr/^UCS2-LE$/i => '"UTF-16LE"');
632 Sigh. Thank you, Anton.
633 Message-Id: <14567692196.20020405062020@motor.ru>
634 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
636 1.20 2002/04/04 19:50:52
638 the last minute addtion. Just give it a try. Docs remains to be done.
639 Not installed by default.
640 ! lib/Encode/Supported.pod
643 ! lib/Encode/Alias.pm
645 ! lib/Encode/10464_1.pm
646 ! lib/Encode/ucs2_le.pm
647 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
648 unchanged but UTF-16BE is added as an alias. Implicit aliases
649 move to Encode::Alias so init_alias() works more as expected.
650 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
651 Though pedantically wrong, This should make perl mongers happier.
652 t/Alias.t is enhanced to test all these.
653 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
655 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
656 of the codepoints are common. CJKT left as is because the saving is
662 ! Encode/Makefile_PL.e2x
670 All occurance of _def.h replaced with .exh so djgpp works happily
671 ever after! To credit this amazing discovery, Laszlo is now in
673 Message-Id: <20020403181424.GA8778@freemail.hu>
674 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
677 ! Encode/Makefile_PL.skel
679 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
680 Message-Id: <20020401222744.GX2000@blackrider>, et al.
682 Two more tests by added jhi
683 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
686 The showstopper fixed -- Memory reallocation bug was causing
687 Encode::XS to fall into infinite loop on certain conditions.
689 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
694 ! lib/Encode/Supported.pod
695 Vendor encodings rebuilt out of original map files at unicode.org.
696 Indic languages such as MacDevanagali remain unspported do to the
697 shortcoming of encengine capabilities (they need algorithmical
698 conversion and I have no knowledge on that!). Pods fixed for added
700 Oh, macJapan.ucm renamed to macJapanese.ucm.
701 macROMnn is macRomanian and macRUMnn is macRumanian.
702 txt2ucm is a crude script that is used to convert them.
704 Unicode Compound Characters (used extensively on Mac) supported
706 Typo fixes and improvements by jhi
707 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
709 1.11 $Date: 2002/08/25 15:09:51 $
713 Missing files from the MANIFEST fixed.
714 Message-Id: <20020401010156.H10509@alpha.hut.fi>
715 Version incremented just to make CPAN happy.
717 1.10 2002/03/31 21:32:42
720 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
721 if you want. This should make Autrijus happy. Also, piconv
722 is added to default install.
725 Here-documented files that enc2xs generates are now exported
726 to *.e2x. Much cleaner and easier to debug.
728 encoding enhances so you can make it act more like such
729 (now prehistoric ) "localized" variations of perl like Jperl.
731 Further test for encoding.pm. Written in euc-jp
735 Message-Id: <20020330174618.B10154@alpha.hut.fi>
740 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
747 ! Encode/macIceland.ucm
748 ! lib/Encode/Alias.pm
749 ! lib/Encode/Supported.pod
750 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
751 Message-Id: <m3lmcavhjt.fsf@anima.de>
753 1.01 2002/03/29 20:59:39
756 s/USE_SCRIPTS/MORE_SCRIPTS/
758 installs enc2xs by default for external Encode:: modules in CPAN,
759 such as Encode::HanExtra
761 More sensible perl core detection via $ENV{PERL_CORE}
763 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
765 Perl core ditection via $^X =~ m/\bminiperl$/o
766 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
770 The version of all files is updated to 1.00 via "ci -f -l1.00",
771 commemorating version 1.00. All files, including *.ucm are now
772 under version control.
775 encode.h moved to Encode/ so it will be installed for the later
778 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
783 compile renamed to enc2xs.
784 Affected Makefle.PL updated
786 "Punt it. HanExtra can take care of that later." -- Autrijus
787 Message-Id: <20020328154338.GA7351@not.autrijus.org>
791 ! lib/Encode/CJKConstants.pm
792 ! lib/Encode/KR/2022_KR.pm
793 Table patches for Euro Signs, 2022-KR fixups by Jungshik
794 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
798 bin/ added for example scripts. They are not installed by default.
799 to install them, "perl Makefile.PL USE_SCRIPTS".
800 piconv is iconv reinvented in perl. in addition to all features
801 of iconv, it also adds perlish features. See L<piconv/1> for more
803 ! lib/Encode/Alias.pm
804 qr/^ replaced with qr/\b so it directly matches locale names
805 such as en_US.US-ASCII
808 Patch by MJD to fix the following problem applied.
809 Subject: [PATCH 5.7.3 Encode]
810 Aliases.t not properly skipped when Encode extension not built
811 Message-Id: <20020328091850.18677.qmail@plover.com>
812 ! lib/Encode/KR/2022_KR.pm
813 ! lib/Encode/CJKConstants.pm
814 Another patch from Jungshik to make iso-2022-kr actually work
815 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
816 ! Encode/Encode/euc-kr.ucm
817 + Encode/Encode/johab.ucm
818 ! Encode/Encode/ksc5601.ucm
820 ! Encode/KR/Makefile.PL
821 ! Encode/lib/Encode/Alias.pm
823 Johab support and complete revision of Korean Encoding by Jungshik
824 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
826 Revised to make up with now-dropped Encode::Details.
827 - lib/Encode/Details.pod
828 Dropped. Besides being obsolete, the topics are now covered in
834 Korean aliases fixed thanks to Jungshik Shin
835 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
836 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
838 =head1 NAME added to all modules to make buildtoc happy
839 Message-Id: <20020327041151.A10618@alpha.hut.fi>
840 - lib/Encode/CJKguide.pod
841 Too controversial and dropped from the dist. Will be available
842 separately on the web.
844 RCS tags added so table debugging gets easier (should that be
845 needed! I hope they all stay 1.00!)
846 + lib/Encode/CJKguide.pod
847 A detailed guide to mainly, but not limited to, CJK multibyte
850 + Encode/hp-roman8.ucm
852 ! Encode/Supported.pod
853 All occurance of "roman8" replaced with "hp-roman8" to avoid
855 ! Encode/Supported.pod
858 Mac Encodings now comply the Inside Macintosh
860 Test for '-raw' conventions added.
862 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
866 "-raw" appended to canonical names.
867 File mames stay unchanged thanks to UCM format.
868 ! lib/Encode/CN/HZ.pm
869 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
870 Message-Id: <20020326035210.GA2091@not.autrijus.org>
873 - lib/Encode/JP/Const.pm
874 + lib/Encode/CJKConstants.pm
875 + lib/Encode/CN/2022_CN.pm
876 + lib/Encode/KR/2022_KR.pm
885 * Support for ISO-2022-KR and ISO-2022-CN added.
887 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
888 * ucm2table autogenerates character table out of UCM files.
891 - lib/Encode/Supports.pod
892 + lib/Encode/Supported.pod
893 Names reverted due to popular demand.
894 8.3 rule applies only when there is a conflict.
895 Message-Id: <20020325095924.GD44120@not.autrijus.org>
900 - lib/Encode/Format/Enc.pod
902 * Character tables is now 100% ucm.
903 * All files under Encode/ is now 8.3-compliant
904 * some of missing encodings added (i.e. gsm0338 and nextstep)
905 * Vendor mappings aggregated with appropriate national std in
906 Makefile.PL, resulting smaller *.so especially for CJK.
907 Following is result on Dan's FreeBSD box.
909 ---------------------------------------------------------------
910 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
911 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
912 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
913 blib/arch/auto/Encode/Encode.so 27,791 27,791
914 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
915 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
916 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
917 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
918 ---------------------------------------------------------------
919 Total 5,375,297 6,343,819
921 * As a result of ucm-transition, Encode::Tcl dropped because
922 Encode::Tcl demands *.enc.
923 Encode::Tcl will be supplied in a separate tarball with *.enc.
924 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
929 -lib/Encode/Supported.pod
930 +lib/Encode/Supports.pod
931 -lib/Encode/iso10646_1.pm
932 +lib/Encode/10646_1.pm
933 -lib/Encode/EncFormat.pod
934 +lib/Encode/Format/Enc.pod
935 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
936 - lib/Encode/JP/Constants.pm
937 + lib/Encode/JP/Consts.pm
938 ! lib/Encode/JP/JIS.pm
939 ! lib/Encode/JP/H2Z.pm
940 Version nit problem and 8.3 rule fix.
941 > Package namespace installed latest in CPAN file
942 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
943 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
944 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
945 accordingly. In addition, legacy "use vars qw()..." are replaced with
947 Message-Id: <20020325011248.D1561@alpha.hut.fi>
948 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
950 - lib/Encode/JP/ISO_2022_JP.pm
951 - lib/Encode/JP/ISO_2022_JP_1.pm
952 + lib/Encode/JP/2022_JP.pm
953 + lib/Encode/JP/2022_JP1.pm
955 8.3 naming conflict for vanilla fat addressed by jhi
956 Message-Id: <20020324201931.V22596@alpha.hut.fi>
959 Typecast fix addressed by jhi
960 Message-Id: <20020324185540.T22596@alpha.hut.fi>
963 ! lib/Encode/Supported.pod
965 + lib/Encode/JP/ISO_2022_JP_1.pm
966 ! lib/Encode/JP/ISO_2022_JP.pm
967 ! lib/Encode/JP/JIS.pm
969 Now Encode::JP is more strict on the difference between ISO-2022-JP
970 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
971 makes Anton happier :) FYI the previous version implements
972 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
973 ! lib/Encode/Supported.pod
976 Avoid core-dump in Encode with PERLIO=mmap by NI-S
977 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
982 ! lib/Encode/Suppoted.pod
983 pod fixes to replace F<http://...> to L<http://...>,
984 as suggested by Autrijius in:
985 Message-Id: <20020324083943.GA14901@not.autrijus.org>
986 ! lib/Encode/Suppoted.pod
987 fixes and enhancements by Anton
988 Message-Id: <10632060120.20020324103753@motor.ru>
989 ! lib/Encode/Alias.pm
990 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
991 added. Suggested by Anton then deobfuscated by Autrijius
992 Message-Id: <20020324064455.GA3667@not.autrijus.org>
994 Further fix by Nicholas Clark
995 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
996 - lib/EncodeFormat.pod
997 + lib/Encode/EncFormat.pod
999 File renamed as suggested by Autrijius
1001 ! lib/Encode/Details.pod
1002 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
1003 ! Encode.pm Sun Mar 24 13:43:47 2002
1004 pod fixes by Autrijius.
1005 Message-Id: <20020324062804.GA3595@not.autrijus.org>
1006 Message-Id: <20020324075627.GB11986@not.autrijus.org>
1008 ! lib/Encode/Alias.pm
1010 now more EBCDIC conscious;
1011 %ExtModules on EBCDIC system excludes CJK so that you don't
1012 have to worry about the matched alias resulting cloaking.
1013 t/Alias.t also revised to reflect changes. Verified by jhi
1014 Message-Id: <20020324022929.D22596@alpha.hut.fi>
1016 0.97 Sun Mar 24 2002
1020 EBCDIC detection mechanism installed as in JP/JP.pm
1021 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1024 ! EBCDIC/Makefile.PL
1027 ! Symbol/Makefile.PL
1029 Now all table files used by compile are postfixed '_t' to avoid
1030 namespace collisions in case insensitive file systems once for all!
1032 Message-ID: <58290227735.20020323195659@familiehaase.de>
1034 Since the Encode::JP is unsupported under EBCDIC we
1035 cannot run this test (aliases as such should work fine) -- jhi
1036 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1038 duplicate occurance of ascii.ucm and 8859-1.ucm
1039 causes MacOS X dlyd to cloak
1045 < chdir 't' if -d 't';
1047 > if (! -d 'blib' and -d 't'){ chdir 't' };
1048 When you are "make test"-ing on Encode/ directory, you must not
1049 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1050 remain unchanced. Also the situation detection was made simpler
1051 in t/JP.t, which was originally;
1052 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1054 "Use of uninitialized value in string eq at Encode.pm line 96."
1055 ! Symbol/Makefile.PL
1056 ! EBCDIC/Makefile.PL
1058 -- Problem on case insensitive file systems
1059 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1060 Message-ID: <88254111953.20020323095503@familiehaase.de>
1063 "So I think it's a bug in gcc, not perl. But it still needs to be
1065 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1066 Message-Id: <20020323170509.C96475@plum.flirble.org>
1068 0.96 Sat Mar 23 2002
1070 ! lib/Encode/Encoding.pm
1071 ! lib/Encode/Alias.pm
1072 ! lib/Encode/Supported.pod
1074 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1075 Message-ID: <20020322073908.GB10539@blackrider>
1078 "...I think we should include ISO 8859-1 as well." -- NI-S
1079 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1084 ! lib/Encode/Alias.pm
1085 alias definitions relocated to Encode::Alias so module autoloading
1086 works for aliases also.
1088 encodings() now accepts args to check ExtModules.
1092 + EBCDIC/Makefile.PL
1093 + Symbol/Makefile.PL
1097 Latin and single byte encodings are reorganized so they are
1098 demand-loaded like Encode::XX. Now only ascii is compiled into
1100 ! lib/Encode/Alias.pm
1101 for my $k (keys %hash){ delete $hash{$k}; }
1102 is depreciated; fixed.
1104 0.95 Fri Mar 22 2002
1105 In this update, pod rewrites and alias fixes are the main issues
1106 + lib/Encode/Supported.pod
1107 Describes supported encodings
1109 streamlined compiled-in encodings.
1110 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1112 + Encode/ibm-125?.ucm
1113 Added from icu distibution with any occurance of
1114 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1115 some respect to icu staff, however.
1116 + lib/Encode/Alias.pm
1118 Alias difinitions in Encode.pm relocated.
1121 packWARN patch from Paul Marquess via jhi
1122 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1123 Paul added to AUTHORS as a result.
1124 ! t/CJKalias.t -> t/Aliases.t
1125 Renamed. Checks even more aliases and alias overloading
1128 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1129 gbk => cp936 relocated to CN.pm
1131 Test::More with plans (by jhi)
1133 0.94 Thu Mar 21 2002
1134 + lib/Encode/Description.pod
1135 ! lib/Encode/Encoding.pm
1136 Now the pod in Encode.pm is abridged as programming references.
1137 lib/Encode/Description.pod contains the original, detailed description
1138 and Encode::Encoding explains how to write your own module to
1139 add new encodings. So far, lib/Encode/Description.pod contains
1140 the whole pod once in Encode.pm. This is intentional.
1142 Pod revisions by Anton Tagunov
1143 Message-Id: <517178431.20020320174824@motor.ru>
1145 all occrance of Encode::Tcl::Extended removed including pod
1147 test now checks $encoding->name only; $encoding->{name} are
1148 no longer check to find the canonical name.
1149 ! lib/Encode/JP/JIS.pm
1150 ! lib/Encode/JP/ISO_2022_JP.pm
1151 ->name() added to be more compliant with API
1157 Patch by Autrijus to add aliases to TW and fixes to POD
1158 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1160 SADAHIRO Tomoyuki added as should. My apologies.
1162 0.93 Wed Mar 20 2002
1163 * First release to be uploaded to CPAN. For prehistoric changes,
1164 please see Changes file of perl distibution as well as
1165 perl-unicode@perl.org archive, available at:
1166 http://archive.develooper.com/perl-unicode@perl.org/
1168 Changes Since 0.92 includes;
1173 + Mention to perl-unicode@perl.org added
1175 + Encoding aliases added so you can feed locale names
1176 and MIME Charset="" directly.
1177 - Mention to JISX0212 removed because it's fixed
1180 + Encoding aliases added. Note TW is left untouched because
1181 euc-tw is not implemented in TW but in Encode::HanExtra.
1182 Autrijus, you may fix Encode::HanExtra.
1184 + to test encode aliases added