1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.26 2002/04/07 15:22:04 dankogai Exp $
6 1.26 $Date: 2002/04/07 15:22:04 $
9 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
10 And t/Aliases.t fixed to make sure they all exist.
12 1.25 2002/04/07 15:01:25 (Unreleased)
14 ! lib/Encode/Unicode.pm
17 - lib/Encode/UTF_EBCDIC.pm
18 - lib/Encode/Internal.pm
20 Integrated into Encode.pm as closures. That way "one package, one file"
21 rule is preserved yet less files to require.
23 commented out binmode(STDERR ...
32 ! Encode/Makefile_PL.e2x
33 Schwarn's MM-compliance patch merged
34 Message-Id: <20020406082609.GA28758@blackrider>
36 ! lib/Encode/Unicode.pm
37 + lib/Encode/UTF_EBCDIC.pm
39 - lib/Encode/10646_1.pm
40 - lib/Encode/ucs2_le.pm
41 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
42 per-module based (en|de)code, I saved a number of .pm by
43 reorganizing it as per-object base (Well, this is what Encode::XS
44 does under the hood). See Encode::Unicode for details.
45 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
46 This module is used only on EBCDIC environments.
48 1.21 2002/04/05 14:46:34 (Not Released)
54 Are back to make Perl/Tk happy Smile, NI-S.
57 ! lib/Encode/Supported.pm
58 ! lib/Encode/10646_1.pm
59 ! lib/Encode/ucs2_le.pm
60 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
61 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
62 updated to reflect changes.
63 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
65 ! lib/Encode/Supported.pm
66 Anton's revision commited. Added Dan's own fixes as well.
67 Message-Id: <159103166906.20020405161134@motor.ru>
70 < qr/^UCS2-le$/i => '"UCS-2"', );
72 > qr/^UCS2-LE$/i => '"UTF-16LE"');
73 Sigh. Thank you, Anton.
74 Message-Id: <14567692196.20020405062020@motor.ru>
75 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
77 1.20 2002/04/04 19:50:52
79 the last minute addtion. Just give it a try. Docs remains to be done.
80 Not installed by default.
81 ! lib/Encode/Supported.pod
86 ! lib/Encode/10464_1.pm
87 ! lib/Encode/ucs2_le.pm
88 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
89 unchanged but UTF-16BE is added as an alias. Implicit aliases
90 move to Encode::Alias so init_alias() works more as expected.
91 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
92 Though pedantically wrong, This should make perl mongers happier.
93 t/Alias.t is enhanced to test all these.
94 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
96 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
97 of the codepoints are common. CJKT left as is because the saving is
103 ! Encode/Makefile_PL.e2x
111 All occurance of _def.h replaced with .exh so djgpp works happily
112 ever after! To credit this amazing discovery, Laszlo is now in
114 Message-Id: <20020403181424.GA8778@freemail.hu>
115 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
118 ! Encode/Makefile_PL.skel
120 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
121 Message-Id: <20020401222744.GX2000@blackrider>, et al.
123 Two more tests by added jhi
124 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
127 The showstopper fixed -- Memory reallocation bug was causing
128 Encode::XS to fall into infinite loop on certain conditions.
130 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
135 ! lib/Encode/Supported.pod
136 Vendor encodings rebuilt out of original map files at unicode.org.
137 Indic languages such as MacDevanagali remain unspported do to the
138 shortcoming of encengine capabilities (they need algorithmical
139 conversion and I have no knowledge on that!). Pods fixed for added
141 Oh, macJapan.ucm renamed to macJapanese.ucm.
142 macROMnn is macRomanian and macRUMnn is macRumanian.
143 txt2ucm is a crude script that is used to convert them.
145 Unicode Compound Characters (used extensively on Mac) supported
147 Typo fixes and improvements by jhi
148 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
150 1.11 $Date: 2002/04/07 15:22:04 $
154 Missing files from the MANIFEST fixed.
155 Message-Id: <20020401010156.H10509@alpha.hut.fi>
156 Version incremented just to make CPAN happy.
158 1.10 2002/03/31 21:32:42
161 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
162 if you want. This should make Autrijus happy. Also, piconv
163 is added to default install.
166 Here-documented files that enc2xs generates are now exported
167 to *.e2x. Much cleaner and easier to debug.
169 encoding enhances so you can make it act more like such
170 (now prehistoric ) "localized" variations of perl like Jperl.
172 Further test for encoding.pm. Written in euc-jp
176 Message-Id: <20020330174618.B10154@alpha.hut.fi>
181 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
188 ! Encode/macIceland.ucm
189 ! lib/Encode/Alias.pm
190 ! lib/Encode/Supported.pod
191 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
192 Message-Id: <m3lmcavhjt.fsf@anima.de>
194 1.01 2002/03/29 20:59:39
197 s/USE_SCRIPTS/MORE_SCRIPTS/
199 installs enc2xs by default for external Encode:: modules in CPAN,
200 such as Encode::HanExtra
202 More sensible perl core detection via $ENV{PERL_CORE}
204 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
206 Perl core ditection via $^X =~ m/\bminiperl$/o
207 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
211 The version of all files is updated to 1.00 via "ci -f -l1.00",
212 commemorating version 1.00. All files, including *.ucm are now
213 under version control.
216 encode.h moved to Encode/ so it will be installed for the later
219 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
224 compile renamed to enc2xs.
225 Affected Makefle.PL updated
227 "Punt it. HanExtra can take care of that later." -- Autrijus
228 Message-Id: <20020328154338.GA7351@not.autrijus.org>
232 ! lib/Encode/CJKConstants.pm
233 ! lib/Encode/KR/2022_KR.pm
234 Table patches for Euro Signs, 2022-KR fixups by Jungshik
235 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
239 bin/ added for example scripts. They are not installed by default.
240 to install them, "perl Makefile.PL USE_SCRIPTS".
241 piconv is iconv reinvented in perl. in addition to all features
242 of iconv, it also adds perlish features. See L<piconv/1> for more
244 ! lib/Encode/Alias.pm
245 qr/^ replaced with qr/\b so it directly matches locale names
246 such as en_US.US-ASCII
249 Patch by MJD to fix the following problem applied.
250 Subject: [PATCH 5.7.3 Encode]
251 Aliases.t not properly skipped when Encode extension not built
252 Message-Id: <20020328091850.18677.qmail@plover.com>
253 ! lib/Encode/KR/2022_KR.pm
254 ! lib/Encode/CJKConstants.pm
255 Another patch from Jungshik to make iso-2022-kr actually work
256 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
257 ! Encode/Encode/euc-kr.ucm
258 + Encode/Encode/johab.ucm
259 ! Encode/Encode/ksc5601.ucm
261 ! Encode/KR/Makefile.PL
262 ! Encode/lib/Encode/Alias.pm
264 Johab support and complete revision of Korean Encoding by Jungshik
265 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
267 Revised to make up with now-dropped Encode::Details.
268 - lib/Encode/Details.pod
269 Dropped. Besides being obsolete, the topics are now covered in
275 Korean aliases fixed thanks to Jungshik Shin
276 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
277 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
279 =head1 NAME added to all modules to make buildtoc happy
280 Message-Id: <20020327041151.A10618@alpha.hut.fi>
281 - lib/Encode/CJKguide.pod
282 Too controversial and dropped from the dist. Will be available
283 separately on the web.
285 RCS tags added so table debugging gets easier (should that be
286 needed! I hope they all stay 1.00!)
287 + lib/Encode/CJKguide.pod
288 A detailed guide to mainly, but not limited to, CJK multibyte
291 + Encode/hp-roman8.ucm
293 ! Encode/Supported.pod
294 All occurance of "roman8" replaced with "hp-roman8" to avoid
296 ! Encode/Supported.pod
299 Mac Encodings now comply the Inside Macintosh
301 Test for '-raw' conventions added.
303 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
307 "-raw" appended to canonical names.
308 File mames stay unchanged thanks to UCM format.
309 ! lib/Encode/CN/HZ.pm
310 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
311 Message-Id: <20020326035210.GA2091@not.autrijus.org>
314 - lib/Encode/JP/Const.pm
315 + lib/Encode/CJKConstants.pm
316 + lib/Encode/CN/2022_CN.pm
317 + lib/Encode/KR/2022_KR.pm
326 * Support for ISO-2022-KR and ISO-2022-CN added.
328 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
329 * ucm2table autogenerates character table out of UCM files.
332 - lib/Encode/Supports.pod
333 + lib/Encode/Supported.pod
334 Names reverted due to popular demand.
335 8.3 rule applies only when there is a conflict.
336 Message-Id: <20020325095924.GD44120@not.autrijus.org>
341 - lib/Encode/Format/Enc.pod
343 * Character tables is now 100% ucm.
344 * All files under Encode/ is now 8.3-compliant
345 * some of missing encodings added (i.e. gsm0338 and nextstep)
346 * Vendor mappings aggregated with appropriate national std in
347 Makefile.PL, resulting smaller *.so especially for CJK.
348 Following is result on Dan's FreeBSD box.
350 ---------------------------------------------------------------
351 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
352 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
353 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
354 blib/arch/auto/Encode/Encode.so 27,791 27,791
355 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
356 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
357 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
358 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
359 ---------------------------------------------------------------
360 Total 5,375,297 6,343,819
362 * As a result of ucm-transition, Encode::Tcl dropped because
363 Encode::Tcl demands *.enc.
364 Encode::Tcl will be supplied in a separate tarball with *.enc.
365 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
370 -lib/Encode/Supported.pod
371 +lib/Encode/Supports.pod
372 -lib/Encode/iso10646_1.pm
373 +lib/Encode/10646_1.pm
374 -lib/Encode/EncFormat.pod
375 +lib/Encode/Format/Enc.pod
376 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
377 - lib/Encode/JP/Constants.pm
378 + lib/Encode/JP/Consts.pm
379 ! lib/Encode/JP/JIS.pm
380 ! lib/Encode/JP/H2Z.pm
381 Version nit problem and 8.3 rule fix.
382 > Package namespace installed latest in CPAN file
383 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
384 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
385 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
386 accordingly. In addition, legacy "use vars qw()..." are replaced with
388 Message-Id: <20020325011248.D1561@alpha.hut.fi>
389 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
391 - lib/Encode/JP/ISO_2022_JP.pm
392 - lib/Encode/JP/ISO_2022_JP_1.pm
393 + lib/Encode/JP/2022_JP.pm
394 + lib/Encode/JP/2022_JP1.pm
396 8.3 naming conflict for vanilla fat addressed by jhi
397 Message-Id: <20020324201931.V22596@alpha.hut.fi>
400 Typecast fix addressed by jhi
401 Message-Id: <20020324185540.T22596@alpha.hut.fi>
404 ! lib/Encode/Supported.pod
406 + lib/Encode/JP/ISO_2022_JP_1.pm
407 ! lib/Encode/JP/ISO_2022_JP.pm
408 ! lib/Encode/JP/JIS.pm
410 Now Encode::JP is more strict on the difference between ISO-2022-JP
411 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
412 makes Anton happier :) FYI the previous version implements
413 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
414 ! lib/Encode/Supported.pod
417 Avoid core-dump in Encode with PERLIO=mmap by NI-S
418 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
423 ! lib/Encode/Suppoted.pod
424 pod fixes to replace F<http://...> to L<http://...>,
425 as suggested by Autrijius in:
426 Message-Id: <20020324083943.GA14901@not.autrijus.org>
427 ! lib/Encode/Suppoted.pod
428 fixes and enhancements by Anton
429 Message-Id: <10632060120.20020324103753@motor.ru>
430 ! lib/Encode/Alias.pm
431 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
432 added. Suggested by Anton then deobfuscated by Autrijius
433 Message-Id: <20020324064455.GA3667@not.autrijus.org>
435 Further fix by Nicholas Clark
436 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
437 - lib/EncodeFormat.pod
438 + lib/Encode/EncFormat.pod
440 File renamed as suggested by Autrijius
442 ! lib/Encode/Details.pod
443 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
444 ! Encode.pm Sun Mar 24 13:43:47 2002
445 pod fixes by Autrijius.
446 Message-Id: <20020324062804.GA3595@not.autrijus.org>
447 Message-Id: <20020324075627.GB11986@not.autrijus.org>
449 ! lib/Encode/Alias.pm
451 now more EBCDIC conscious;
452 %ExtModules on EBCDIC system excludes CJK so that you don't
453 have to worry about the matched alias resulting cloaking.
454 t/Alias.t also revised to reflect changes. Verified by jhi
455 Message-Id: <20020324022929.D22596@alpha.hut.fi>
461 EBCDIC detection mechanism installed as in JP/JP.pm
462 Message-Id: <20020323211847.G19148@alpha.hut.fi>
470 Now all table files used by compile are postfixed '_t' to avoid
471 namespace collisions in case insensitive file systems once for all!
473 Message-ID: <58290227735.20020323195659@familiehaase.de>
475 Since the Encode::JP is unsupported under EBCDIC we
476 cannot run this test (aliases as such should work fine) -- jhi
477 Message-Id: <20020323202119.D19148@alpha.hut.fi>
479 duplicate occurance of ascii.ucm and 8859-1.ucm
480 causes MacOS X dlyd to cloak
486 < chdir 't' if -d 't';
488 > if (! -d 'blib' and -d 't'){ chdir 't' };
489 When you are "make test"-ing on Encode/ directory, you must not
490 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
491 remain unchanced. Also the situation detection was made simpler
492 in t/JP.t, which was originally;
493 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
495 "Use of uninitialized value in string eq at Encode.pm line 96."
499 -- Problem on case insensitive file systems
500 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
501 Message-ID: <88254111953.20020323095503@familiehaase.de>
504 "So I think it's a bug in gcc, not perl. But it still needs to be
506 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
507 Message-Id: <20020323170509.C96475@plum.flirble.org>
511 ! lib/Encode/Encoding.pm
512 ! lib/Encode/Alias.pm
513 ! lib/Encode/Supported.pod
515 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
516 Message-ID: <20020322073908.GB10539@blackrider>
519 "...I think we should include ISO 8859-1 as well." -- NI-S
520 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
525 ! lib/Encode/Alias.pm
526 alias definitions relocated to Encode::Alias so module autoloading
527 works for aliases also.
529 encodings() now accepts args to check ExtModules.
538 Latin and single byte encodings are reorganized so they are
539 demand-loaded like Encode::XX. Now only ascii is compiled into
541 ! lib/Encode/Alias.pm
542 for my $k (keys %hash){ delete $hash{$k}; }
543 is depreciated; fixed.
546 In this update, pod rewrites and alias fixes are the main issues
547 + lib/Encode/Supported.pod
548 Describes supported encodings
550 streamlined compiled-in encodings.
551 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
553 + Encode/ibm-125?.ucm
554 Added from icu distibution with any occurance of
555 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
556 some respect to icu staff, however.
557 + lib/Encode/Alias.pm
559 Alias difinitions in Encode.pm relocated.
562 packWARN patch from Paul Marquess via jhi
563 Message-Id: <20020321010101.O28978@alpha.hut.fi>
564 Paul added to AUTHORS as a result.
565 ! t/CJKalias.t -> t/Aliases.t
566 Renamed. Checks even more aliases and alias overloading
569 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
570 gbk => cp936 relocated to CN.pm
572 Test::More with plans (by jhi)
575 + lib/Encode/Description.pod
576 ! lib/Encode/Encoding.pm
577 Now the pod in Encode.pm is abridged as programming references.
578 lib/Encode/Description.pod contains the original, detailed description
579 and Encode::Encoding explains how to write your own module to
580 add new encodings. So far, lib/Encode/Description.pod contains
581 the whole pod once in Encode.pm. This is intentional.
583 Pod revisions by Anton Tagunov
584 Message-Id: <517178431.20020320174824@motor.ru>
586 all occrance of Encode::Tcl::Extended removed including pod
588 test now checks $encoding->name only; $encoding->{name} are
589 no longer check to find the canonical name.
590 ! lib/Encode/JP/JIS.pm
591 ! lib/Encode/JP/ISO_2022_JP.pm
592 ->name() added to be more compliant with API
598 Patch by Autrijus to add aliases to TW and fixes to POD
599 Message-Id: <20020320090619.GA24774@not.autrijus.org>
601 SADAHIRO Tomoyuki added as should. My apologies.
604 * First release to be uploaded to CPAN. For prehistoric changes,
605 please see Changes file of perl distibution as well as
606 perl-unicode@perl.org archive, available at:
607 http://archive.develooper.com/perl-unicode@perl.org/
609 Changes Since 0.92 includes;
614 + Mention to perl-unicode@perl.org added
616 + Encoding aliases added so you can feed locale names
617 and MIME Charset="" directly.
618 - Mention to JISX0212 removed because it's fixed
621 + Encoding aliases added. Note TW is left untouched because
622 euc-tw is not implemented in TW but in Encode::HanExtra.
623 Autrijus, you may fix Encode::HanExtra.
625 + to test encode aliases added