1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.31 2002/04/08 18:08:07 dankogai Exp dankogai $
6 1.31 $Date: 2002/04/08 18:08:07 $
7 ! lib/Encode/Encoder.pm
9 Encode::Encoder, once just a placeholder of an idea, is now much more
10 practical. See t/Encode.t to find how practical it can be.
11 + lib/Encode/Config.pm
13 my false laziness at Encode.pm is fixed. Now %ExtModules are set
14 in Encode::Config and they are all literally, not programatically
15 set. My false laziness was resulting many encodings missing from
17 ! lib/Encode/Unicode.pm
19 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
20 so that it does not rely Encode::Unicode for BOM values
21 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
23 1.30 2002/04/08 02:34:51
24 + lib/Encode/Encoder.pm
25 Object Oriented Encoder. I reckon something like this is in need.
28 ! lib/Encode/Supported.pod
29 * autoloading bug that prevented upper-case canonicals such as UTF-16
30 is fixed. Now even UTF/UCS are autoloaded!
31 * encodings() is now more intuitive.
32 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
34 * Obligatory fixes to the POD.
35 ! lib/Encode/Supported.pod
36 Patch from Anton applied.
37 Message-Id: <66641479.20020408033300@motor.ru>
39 ! lib/Encode/Unicode.pm
40 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
42 1.28 2002/04/07 18:58:42
46 Just a MANIFEST for those missing files.
48 1.26 Date: 2002/04/07 15:22:04
51 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
52 And t/Aliases.t fixed to make sure they all exist.
54 1.25 2002/04/07 15:01:25 (Unreleased)
56 ! lib/Encode/Unicode.pm
59 - lib/Encode/UTF_EBCDIC.pm
60 - lib/Encode/Internal.pm
62 Integrated into Encode.pm as closures. That way "one package, one file"
63 rule is preserved yet less files to require.
65 commented out binmode(STDERR ...
74 ! Encode/Makefile_PL.e2x
75 Schwarn's MM-compliance patch merged
76 Message-Id: <20020406082609.GA28758@blackrider>
78 ! lib/Encode/Unicode.pm
79 + lib/Encode/UTF_EBCDIC.pm
81 - lib/Encode/10646_1.pm
82 - lib/Encode/ucs2_le.pm
83 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
84 per-module based (en|de)code, I saved a number of .pm by
85 reorganizing it as per-object base (Well, this is what Encode::XS
86 does under the hood). See Encode::Unicode for details.
87 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
88 This module is used only on EBCDIC environments.
90 1.21 2002/04/05 14:46:34 (Not Released)
96 Are back to make Perl/Tk happy Smile, NI-S.
99 ! lib/Encode/Supported.pm
100 ! lib/Encode/10646_1.pm
101 ! lib/Encode/ucs2_le.pm
102 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
103 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
104 updated to reflect changes.
105 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
106 ! lib/Encode/Alias.pm
107 ! lib/Encode/Supported.pm
108 Anton's revision commited. Added Dan's own fixes as well.
109 Message-Id: <159103166906.20020405161134@motor.ru>
110 ! lib/Encode/Alias.pm
112 < qr/^UCS2-le$/i => '"UCS-2"', );
114 > qr/^UCS2-LE$/i => '"UTF-16LE"');
115 Sigh. Thank you, Anton.
116 Message-Id: <14567692196.20020405062020@motor.ru>
117 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
119 1.20 2002/04/04 19:50:52
121 the last minute addtion. Just give it a try. Docs remains to be done.
122 Not installed by default.
123 ! lib/Encode/Supported.pod
126 ! lib/Encode/Alias.pm
128 ! lib/Encode/10464_1.pm
129 ! lib/Encode/ucs2_le.pm
130 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
131 unchanged but UTF-16BE is added as an alias. Implicit aliases
132 move to Encode::Alias so init_alias() works more as expected.
133 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
134 Though pedantically wrong, This should make perl mongers happier.
135 t/Alias.t is enhanced to test all these.
136 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
138 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
139 of the codepoints are common. CJKT left as is because the saving is
145 ! Encode/Makefile_PL.e2x
153 All occurance of _def.h replaced with .exh so djgpp works happily
154 ever after! To credit this amazing discovery, Laszlo is now in
156 Message-Id: <20020403181424.GA8778@freemail.hu>
157 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
160 ! Encode/Makefile_PL.skel
162 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
163 Message-Id: <20020401222744.GX2000@blackrider>, et al.
165 Two more tests by added jhi
166 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
169 The showstopper fixed -- Memory reallocation bug was causing
170 Encode::XS to fall into infinite loop on certain conditions.
172 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
177 ! lib/Encode/Supported.pod
178 Vendor encodings rebuilt out of original map files at unicode.org.
179 Indic languages such as MacDevanagali remain unspported do to the
180 shortcoming of encengine capabilities (they need algorithmical
181 conversion and I have no knowledge on that!). Pods fixed for added
183 Oh, macJapan.ucm renamed to macJapanese.ucm.
184 macROMnn is macRomanian and macRUMnn is macRumanian.
185 txt2ucm is a crude script that is used to convert them.
187 Unicode Compound Characters (used extensively on Mac) supported
189 Typo fixes and improvements by jhi
190 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
192 1.11 $Date: 2002/04/08 18:08:07 $
196 Missing files from the MANIFEST fixed.
197 Message-Id: <20020401010156.H10509@alpha.hut.fi>
198 Version incremented just to make CPAN happy.
200 1.10 2002/03/31 21:32:42
203 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
204 if you want. This should make Autrijus happy. Also, piconv
205 is added to default install.
208 Here-documented files that enc2xs generates are now exported
209 to *.e2x. Much cleaner and easier to debug.
211 encoding enhances so you can make it act more like such
212 (now prehistoric ) "localized" variations of perl like Jperl.
214 Further test for encoding.pm. Written in euc-jp
218 Message-Id: <20020330174618.B10154@alpha.hut.fi>
223 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
230 ! Encode/macIceland.ucm
231 ! lib/Encode/Alias.pm
232 ! lib/Encode/Supported.pod
233 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
234 Message-Id: <m3lmcavhjt.fsf@anima.de>
236 1.01 2002/03/29 20:59:39
239 s/USE_SCRIPTS/MORE_SCRIPTS/
241 installs enc2xs by default for external Encode:: modules in CPAN,
242 such as Encode::HanExtra
244 More sensible perl core detection via $ENV{PERL_CORE}
246 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
248 Perl core ditection via $^X =~ m/\bminiperl$/o
249 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
253 The version of all files is updated to 1.00 via "ci -f -l1.00",
254 commemorating version 1.00. All files, including *.ucm are now
255 under version control.
258 encode.h moved to Encode/ so it will be installed for the later
261 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
266 compile renamed to enc2xs.
267 Affected Makefle.PL updated
269 "Punt it. HanExtra can take care of that later." -- Autrijus
270 Message-Id: <20020328154338.GA7351@not.autrijus.org>
274 ! lib/Encode/CJKConstants.pm
275 ! lib/Encode/KR/2022_KR.pm
276 Table patches for Euro Signs, 2022-KR fixups by Jungshik
277 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
281 bin/ added for example scripts. They are not installed by default.
282 to install them, "perl Makefile.PL USE_SCRIPTS".
283 piconv is iconv reinvented in perl. in addition to all features
284 of iconv, it also adds perlish features. See L<piconv/1> for more
286 ! lib/Encode/Alias.pm
287 qr/^ replaced with qr/\b so it directly matches locale names
288 such as en_US.US-ASCII
291 Patch by MJD to fix the following problem applied.
292 Subject: [PATCH 5.7.3 Encode]
293 Aliases.t not properly skipped when Encode extension not built
294 Message-Id: <20020328091850.18677.qmail@plover.com>
295 ! lib/Encode/KR/2022_KR.pm
296 ! lib/Encode/CJKConstants.pm
297 Another patch from Jungshik to make iso-2022-kr actually work
298 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
299 ! Encode/Encode/euc-kr.ucm
300 + Encode/Encode/johab.ucm
301 ! Encode/Encode/ksc5601.ucm
303 ! Encode/KR/Makefile.PL
304 ! Encode/lib/Encode/Alias.pm
306 Johab support and complete revision of Korean Encoding by Jungshik
307 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
309 Revised to make up with now-dropped Encode::Details.
310 - lib/Encode/Details.pod
311 Dropped. Besides being obsolete, the topics are now covered in
317 Korean aliases fixed thanks to Jungshik Shin
318 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
319 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
321 =head1 NAME added to all modules to make buildtoc happy
322 Message-Id: <20020327041151.A10618@alpha.hut.fi>
323 - lib/Encode/CJKguide.pod
324 Too controversial and dropped from the dist. Will be available
325 separately on the web.
327 RCS tags added so table debugging gets easier (should that be
328 needed! I hope they all stay 1.00!)
329 + lib/Encode/CJKguide.pod
330 A detailed guide to mainly, but not limited to, CJK multibyte
333 + Encode/hp-roman8.ucm
335 ! Encode/Supported.pod
336 All occurance of "roman8" replaced with "hp-roman8" to avoid
338 ! Encode/Supported.pod
341 Mac Encodings now comply the Inside Macintosh
343 Test for '-raw' conventions added.
345 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
349 "-raw" appended to canonical names.
350 File mames stay unchanged thanks to UCM format.
351 ! lib/Encode/CN/HZ.pm
352 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
353 Message-Id: <20020326035210.GA2091@not.autrijus.org>
356 - lib/Encode/JP/Const.pm
357 + lib/Encode/CJKConstants.pm
358 + lib/Encode/CN/2022_CN.pm
359 + lib/Encode/KR/2022_KR.pm
368 * Support for ISO-2022-KR and ISO-2022-CN added.
370 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
371 * ucm2table autogenerates character table out of UCM files.
374 - lib/Encode/Supports.pod
375 + lib/Encode/Supported.pod
376 Names reverted due to popular demand.
377 8.3 rule applies only when there is a conflict.
378 Message-Id: <20020325095924.GD44120@not.autrijus.org>
383 - lib/Encode/Format/Enc.pod
385 * Character tables is now 100% ucm.
386 * All files under Encode/ is now 8.3-compliant
387 * some of missing encodings added (i.e. gsm0338 and nextstep)
388 * Vendor mappings aggregated with appropriate national std in
389 Makefile.PL, resulting smaller *.so especially for CJK.
390 Following is result on Dan's FreeBSD box.
392 ---------------------------------------------------------------
393 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
394 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
395 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
396 blib/arch/auto/Encode/Encode.so 27,791 27,791
397 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
398 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
399 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
400 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
401 ---------------------------------------------------------------
402 Total 5,375,297 6,343,819
404 * As a result of ucm-transition, Encode::Tcl dropped because
405 Encode::Tcl demands *.enc.
406 Encode::Tcl will be supplied in a separate tarball with *.enc.
407 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
412 -lib/Encode/Supported.pod
413 +lib/Encode/Supports.pod
414 -lib/Encode/iso10646_1.pm
415 +lib/Encode/10646_1.pm
416 -lib/Encode/EncFormat.pod
417 +lib/Encode/Format/Enc.pod
418 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
419 - lib/Encode/JP/Constants.pm
420 + lib/Encode/JP/Consts.pm
421 ! lib/Encode/JP/JIS.pm
422 ! lib/Encode/JP/H2Z.pm
423 Version nit problem and 8.3 rule fix.
424 > Package namespace installed latest in CPAN file
425 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
426 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
427 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
428 accordingly. In addition, legacy "use vars qw()..." are replaced with
430 Message-Id: <20020325011248.D1561@alpha.hut.fi>
431 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
433 - lib/Encode/JP/ISO_2022_JP.pm
434 - lib/Encode/JP/ISO_2022_JP_1.pm
435 + lib/Encode/JP/2022_JP.pm
436 + lib/Encode/JP/2022_JP1.pm
438 8.3 naming conflict for vanilla fat addressed by jhi
439 Message-Id: <20020324201931.V22596@alpha.hut.fi>
442 Typecast fix addressed by jhi
443 Message-Id: <20020324185540.T22596@alpha.hut.fi>
446 ! lib/Encode/Supported.pod
448 + lib/Encode/JP/ISO_2022_JP_1.pm
449 ! lib/Encode/JP/ISO_2022_JP.pm
450 ! lib/Encode/JP/JIS.pm
452 Now Encode::JP is more strict on the difference between ISO-2022-JP
453 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
454 makes Anton happier :) FYI the previous version implements
455 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
456 ! lib/Encode/Supported.pod
459 Avoid core-dump in Encode with PERLIO=mmap by NI-S
460 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
465 ! lib/Encode/Suppoted.pod
466 pod fixes to replace F<http://...> to L<http://...>,
467 as suggested by Autrijius in:
468 Message-Id: <20020324083943.GA14901@not.autrijus.org>
469 ! lib/Encode/Suppoted.pod
470 fixes and enhancements by Anton
471 Message-Id: <10632060120.20020324103753@motor.ru>
472 ! lib/Encode/Alias.pm
473 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
474 added. Suggested by Anton then deobfuscated by Autrijius
475 Message-Id: <20020324064455.GA3667@not.autrijus.org>
477 Further fix by Nicholas Clark
478 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
479 - lib/EncodeFormat.pod
480 + lib/Encode/EncFormat.pod
482 File renamed as suggested by Autrijius
484 ! lib/Encode/Details.pod
485 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
486 ! Encode.pm Sun Mar 24 13:43:47 2002
487 pod fixes by Autrijius.
488 Message-Id: <20020324062804.GA3595@not.autrijus.org>
489 Message-Id: <20020324075627.GB11986@not.autrijus.org>
491 ! lib/Encode/Alias.pm
493 now more EBCDIC conscious;
494 %ExtModules on EBCDIC system excludes CJK so that you don't
495 have to worry about the matched alias resulting cloaking.
496 t/Alias.t also revised to reflect changes. Verified by jhi
497 Message-Id: <20020324022929.D22596@alpha.hut.fi>
503 EBCDIC detection mechanism installed as in JP/JP.pm
504 Message-Id: <20020323211847.G19148@alpha.hut.fi>
512 Now all table files used by compile are postfixed '_t' to avoid
513 namespace collisions in case insensitive file systems once for all!
515 Message-ID: <58290227735.20020323195659@familiehaase.de>
517 Since the Encode::JP is unsupported under EBCDIC we
518 cannot run this test (aliases as such should work fine) -- jhi
519 Message-Id: <20020323202119.D19148@alpha.hut.fi>
521 duplicate occurance of ascii.ucm and 8859-1.ucm
522 causes MacOS X dlyd to cloak
528 < chdir 't' if -d 't';
530 > if (! -d 'blib' and -d 't'){ chdir 't' };
531 When you are "make test"-ing on Encode/ directory, you must not
532 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
533 remain unchanced. Also the situation detection was made simpler
534 in t/JP.t, which was originally;
535 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
537 "Use of uninitialized value in string eq at Encode.pm line 96."
541 -- Problem on case insensitive file systems
542 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
543 Message-ID: <88254111953.20020323095503@familiehaase.de>
546 "So I think it's a bug in gcc, not perl. But it still needs to be
548 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
549 Message-Id: <20020323170509.C96475@plum.flirble.org>
553 ! lib/Encode/Encoding.pm
554 ! lib/Encode/Alias.pm
555 ! lib/Encode/Supported.pod
557 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
558 Message-ID: <20020322073908.GB10539@blackrider>
561 "...I think we should include ISO 8859-1 as well." -- NI-S
562 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
567 ! lib/Encode/Alias.pm
568 alias definitions relocated to Encode::Alias so module autoloading
569 works for aliases also.
571 encodings() now accepts args to check ExtModules.
580 Latin and single byte encodings are reorganized so they are
581 demand-loaded like Encode::XX. Now only ascii is compiled into
583 ! lib/Encode/Alias.pm
584 for my $k (keys %hash){ delete $hash{$k}; }
585 is depreciated; fixed.
588 In this update, pod rewrites and alias fixes are the main issues
589 + lib/Encode/Supported.pod
590 Describes supported encodings
592 streamlined compiled-in encodings.
593 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
595 + Encode/ibm-125?.ucm
596 Added from icu distibution with any occurance of
597 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
598 some respect to icu staff, however.
599 + lib/Encode/Alias.pm
601 Alias difinitions in Encode.pm relocated.
604 packWARN patch from Paul Marquess via jhi
605 Message-Id: <20020321010101.O28978@alpha.hut.fi>
606 Paul added to AUTHORS as a result.
607 ! t/CJKalias.t -> t/Aliases.t
608 Renamed. Checks even more aliases and alias overloading
611 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
612 gbk => cp936 relocated to CN.pm
614 Test::More with plans (by jhi)
617 + lib/Encode/Description.pod
618 ! lib/Encode/Encoding.pm
619 Now the pod in Encode.pm is abridged as programming references.
620 lib/Encode/Description.pod contains the original, detailed description
621 and Encode::Encoding explains how to write your own module to
622 add new encodings. So far, lib/Encode/Description.pod contains
623 the whole pod once in Encode.pm. This is intentional.
625 Pod revisions by Anton Tagunov
626 Message-Id: <517178431.20020320174824@motor.ru>
628 all occrance of Encode::Tcl::Extended removed including pod
630 test now checks $encoding->name only; $encoding->{name} are
631 no longer check to find the canonical name.
632 ! lib/Encode/JP/JIS.pm
633 ! lib/Encode/JP/ISO_2022_JP.pm
634 ->name() added to be more compliant with API
640 Patch by Autrijus to add aliases to TW and fixes to POD
641 Message-Id: <20020320090619.GA24774@not.autrijus.org>
643 SADAHIRO Tomoyuki added as should. My apologies.
646 * First release to be uploaded to CPAN. For prehistoric changes,
647 please see Changes file of perl distibution as well as
648 perl-unicode@perl.org archive, available at:
649 http://archive.develooper.com/perl-unicode@perl.org/
651 Changes Since 0.92 includes;
656 + Mention to perl-unicode@perl.org added
658 + Encoding aliases added so you can feed locale names
659 and MIME Charset="" directly.
660 - Mention to JISX0212 removed because it's fixed
663 + Encoding aliases added. Note TW is left untouched because
664 euc-tw is not implemented in TW but in Encode::HanExtra.
665 Autrijus, you may fix Encode::HanExtra.
667 + to test encode aliases added