1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.20 2002/04/04 19:50:52 dankogai Exp dankogai $
5 1.20 $Date: 2002/04/04 19:50:52 $
7 the last minute addtion. Just give it a try. Docs remains to be done.
8 Not installed by default.
9 ! lib/Encode/Supported.pod
14 ! lib/Encode/10464_1.pm
15 ! lib/Encode/ucs2_le.pm
16 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
17 unchanged but UTF-16BE is added as an alias. Implicit aliases
18 move to Encode::Alias so init_alias() works more as expected.
19 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
20 Though pedantically wrong, This should make perl mongers happier.
21 t/Alias.t is enhanced to test all these.
22 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
24 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
25 of the codepoints are common. CJKT left as is because the saving is
31 ! Encode/Makefile_PL.e2x
39 All occurance of _def.h replaced with .exh so djgpp works happily
40 ever after! To credit this amazing discovery, Laszlo is now in
42 Message-Id: <20020403181424.GA8778@freemail.hu>
43 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
46 ! Encode/Makefile_PL.skel
48 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
49 Message-Id: <20020401222744.GX2000@blackrider>, et al.
51 Two more tests by added jhi
52 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
55 The showstopper fixed -- Memory reallocation bug was causing
56 Encode::XS to fall into infinite loop on certain conditions.
58 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
63 ! lib/Encode/Supported.pod
64 Vendor encodings rebuilt out of original map files at unicode.org.
65 Indic languages such as MacDevanagali remain unspported do to the
66 shortcoming of encengine capabilities (they need algorithmical
67 conversion and I have no knowledge on that!). Pods fixed for added
69 Oh, macJapan.ucm renamed to macJapanese.ucm.
70 macROMnn is macRomanian and macRUMnn is macRumanian.
71 txt2ucm is a crude script that is used to convert them.
73 Unicode Compound Characters (used extensively on Mac) supported
75 Typo fixes and improvements by jhi
76 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
78 1.11 $Date: 2002/04/04 19:50:52 $
82 Missing files from the MANIFEST fixed.
83 Message-Id: <20020401010156.H10509@alpha.hut.fi>
84 Version incremented just to make CPAN happy.
86 1.10 2002/03/31 21:32:42
89 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
90 if you want. This should make Autrijus happy. Also, piconv
91 is added to default install.
94 Here-documented files that enc2xs generates are now exported
95 to *.e2x. Much cleaner and easier to debug.
97 encoding enhances so you can make it act more like such
98 (now prehistoric ) "localized" variations of perl like Jperl.
100 Further test for encoding.pm. Written in euc-jp
104 Message-Id: <20020330174618.B10154@alpha.hut.fi>
109 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
116 ! Encode/macIceland.ucm
117 ! lib/Encode/Alias.pm
118 ! lib/Encode/Supported.pod
119 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
120 Message-Id: <m3lmcavhjt.fsf@anima.de>
122 1.01 2002/03/29 20:59:39
125 s/USE_SCRIPTS/MORE_SCRIPTS/
127 installs enc2xs by default for external Encode:: modules in CPAN,
128 such as Encode::HanExtra
130 More sensible perl core detection via $ENV{PERL_CORE}
132 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
134 Perl core ditection via $^X =~ m/\bminiperl$/o
135 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
139 The version of all files is updated to 1.00 via "ci -f -l1.00",
140 commemorating version 1.00. All files, including *.ucm are now
141 under version control.
144 encode.h moved to Encode/ so it will be installed for the later
147 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
152 compile renamed to enc2xs.
153 Affected Makefle.PL updated
155 "Punt it. HanExtra can take care of that later." -- Autrijus
156 Message-Id: <20020328154338.GA7351@not.autrijus.org>
160 ! lib/Encode/CJKConstants.pm
161 ! lib/Encode/KR/2022_KR.pm
162 Table patches for Euro Signs, 2022-KR fixups by Jungshik
163 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
167 bin/ added for example scripts. They are not installed by default.
168 to install them, "perl Makefile.PL USE_SCRIPTS".
169 piconv is iconv reinvented in perl. in addition to all features
170 of iconv, it also adds perlish features. See L<piconv/1> for more
172 ! lib/Encode/Alias.pm
173 qr/^ replaced with qr/\b so it directly matches locale names
174 such as en_US.US-ASCII
177 Patch by MJD to fix the following problem applied.
178 Subject: [PATCH 5.7.3 Encode]
179 Aliases.t not properly skipped when Encode extension not built
180 Message-Id: <20020328091850.18677.qmail@plover.com>
181 ! lib/Encode/KR/2022_KR.pm
182 ! lib/Encode/CJKConstants.pm
183 Another patch from Jungshik to make iso-2022-kr actually work
184 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
185 ! Encode/Encode/euc-kr.ucm
186 + Encode/Encode/johab.ucm
187 ! Encode/Encode/ksc5601.ucm
189 ! Encode/KR/Makefile.PL
190 ! Encode/lib/Encode/Alias.pm
192 Johab support and complete revision of Korean Encoding by Jungshik
193 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
195 Revised to make up with now-dropped Encode::Details.
196 - lib/Encode/Details.pod
197 Dropped. Besides being obsolete, the topics are now covered in
203 Korean aliases fixed thanks to Jungshik Shin
204 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
205 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
207 =head1 NAME added to all modules to make buildtoc happy
208 Message-Id: <20020327041151.A10618@alpha.hut.fi>
209 - lib/Encode/CJKguide.pod
210 Too controversial and dropped from the dist. Will be available
211 separately on the web.
213 RCS tags added so table debugging gets easier (should that be
214 needed! I hope they all stay 1.00!)
215 + lib/Encode/CJKguide.pod
216 A detailed guide to mainly, but not limited to, CJK multibyte
219 + Encode/hp-roman8.ucm
221 ! Encode/Supported.pod
222 All occurance of "roman8" replaced with "hp-roman8" to avoid
224 ! Encode/Supported.pod
227 Mac Encodings now comply the Inside Macintosh
229 Test for '-raw' conventions added.
231 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
235 "-raw" appended to canonical names.
236 File mames stay unchanged thanks to UCM format.
237 ! lib/Encode/CN/HZ.pm
238 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
239 Message-Id: <20020326035210.GA2091@not.autrijus.org>
242 - lib/Encode/JP/Const.pm
243 + lib/Encode/CJKConstants.pm
244 + lib/Encode/CN/2022_CN.pm
245 + lib/Encode/KR/2022_KR.pm
254 * Support for ISO-2022-KR and ISO-2022-CN added.
256 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
257 * ucm2table autogenerates character table out of UCM files.
260 - lib/Encode/Supports.pod
261 + lib/Encode/Supported.pod
262 Names reverted due to popular demand.
263 8.3 rule applies only when there is a conflict.
264 Message-Id: <20020325095924.GD44120@not.autrijus.org>
269 - lib/Encode/Format/Enc.pod
271 * Character tables is now 100% ucm.
272 * All files under Encode/ is now 8.3-compliant
273 * some of missing encodings added (i.e. gsm0338 and nextstep)
274 * Vendor mappings aggregated with appropriate national std in
275 Makefile.PL, resulting smaller *.so especially for CJK.
276 Following is result on Dan's FreeBSD box.
278 ---------------------------------------------------------------
279 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
280 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
281 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
282 blib/arch/auto/Encode/Encode.so 27,791 27,791
283 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
284 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
285 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
286 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
287 ---------------------------------------------------------------
288 Total 5,375,297 6,343,819
290 * As a result of ucm-transition, Encode::Tcl dropped because
291 Encode::Tcl demands *.enc.
292 Encode::Tcl will be supplied in a separate tarball with *.enc.
293 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
298 -lib/Encode/Supported.pod
299 +lib/Encode/Supports.pod
300 -lib/Encode/iso10646_1.pm
301 +lib/Encode/10646_1.pm
302 -lib/Encode/EncFormat.pod
303 +lib/Encode/Format/Enc.pod
304 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
305 - lib/Encode/JP/Constants.pm
306 + lib/Encode/JP/Consts.pm
307 ! lib/Encode/JP/JIS.pm
308 ! lib/Encode/JP/H2Z.pm
309 Version nit problem and 8.3 rule fix.
310 > Package namespace installed latest in CPAN file
311 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
312 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
313 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
314 accordingly. In addition, legacy "use vars qw()..." are replaced with
316 Message-Id: <20020325011248.D1561@alpha.hut.fi>
317 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
319 - lib/Encode/JP/ISO_2022_JP.pm
320 - lib/Encode/JP/ISO_2022_JP_1.pm
321 + lib/Encode/JP/2022_JP.pm
322 + lib/Encode/JP/2022_JP1.pm
324 8.3 naming conflict for vanilla fat addressed by jhi
325 Message-Id: <20020324201931.V22596@alpha.hut.fi>
328 Typecast fix addressed by jhi
329 Message-Id: <20020324185540.T22596@alpha.hut.fi>
332 ! lib/Encode/Supported.pod
334 + lib/Encode/JP/ISO_2022_JP_1.pm
335 ! lib/Encode/JP/ISO_2022_JP.pm
336 ! lib/Encode/JP/JIS.pm
338 Now Encode::JP is more strict on the difference between ISO-2022-JP
339 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
340 makes Anton happier :) FYI the previous version implements
341 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
342 ! lib/Encode/Supported.pod
345 Avoid core-dump in Encode with PERLIO=mmap by NI-S
346 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
351 ! lib/Encode/Suppoted.pod
352 pod fixes to replace F<http://...> to L<http://...>,
353 as suggested by Autrijius in:
354 Message-Id: <20020324083943.GA14901@not.autrijus.org>
355 ! lib/Encode/Suppoted.pod
356 fixes and enhancements by Anton
357 Message-Id: <10632060120.20020324103753@motor.ru>
358 ! lib/Encode/Alias.pm
359 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
360 added. Suggested by Anton then deobfuscated by Autrijius
361 Message-Id: <20020324064455.GA3667@not.autrijus.org>
363 Further fix by Nicholas Clark
364 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
365 - lib/EncodeFormat.pod
366 + lib/Encode/EncFormat.pod
368 File renamed as suggested by Autrijius
370 ! lib/Encode/Details.pod
371 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
372 ! Encode.pm Sun Mar 24 13:43:47 2002
373 pod fixes by Autrijius.
374 Message-Id: <20020324062804.GA3595@not.autrijus.org>
375 Message-Id: <20020324075627.GB11986@not.autrijus.org>
377 ! lib/Encode/Alias.pm
379 now more EBCDIC conscious;
380 %ExtModules on EBCDIC system excludes CJK so that you don't
381 have to worry about the matched alias resulting cloaking.
382 t/Alias.t also revised to reflect changes. Verified by jhi
383 Message-Id: <20020324022929.D22596@alpha.hut.fi>
389 EBCDIC detection mechanism installed as in JP/JP.pm
390 Message-Id: <20020323211847.G19148@alpha.hut.fi>
398 Now all table files used by compile are postfixed '_t' to avoid
399 namespace collisions in case insensitive file systems once for all!
401 Message-ID: <58290227735.20020323195659@familiehaase.de>
403 Since the Encode::JP is unsupported under EBCDIC we
404 cannot run this test (aliases as such should work fine) -- jhi
405 Message-Id: <20020323202119.D19148@alpha.hut.fi>
407 duplicate occurance of ascii.ucm and 8859-1.ucm
408 causes MacOS X dlyd to cloak
414 < chdir 't' if -d 't';
416 > if (! -d 'blib' and -d 't'){ chdir 't' };
417 When you are "make test"-ing on Encode/ directory, you must not
418 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
419 remain unchanced. Also the situation detection was made simpler
420 in t/JP.t, which was originally;
421 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
423 "Use of uninitialized value in string eq at Encode.pm line 96."
427 -- Problem on case insensitive file systems
428 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
429 Message-ID: <88254111953.20020323095503@familiehaase.de>
432 "So I think it's a bug in gcc, not perl. But it still needs to be
434 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
435 Message-Id: <20020323170509.C96475@plum.flirble.org>
439 ! lib/Encode/Encoding.pm
440 ! lib/Encode/Alias.pm
441 ! lib/Encode/Supported.pod
443 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
444 Message-ID: <20020322073908.GB10539@blackrider>
447 "...I think we should include ISO 8859-1 as well." -- NI-S
448 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
453 ! lib/Encode/Alias.pm
454 alias definitions relocated to Encode::Alias so module autoloading
455 works for aliases also.
457 encodings() now accepts args to check ExtModules.
466 Latin and single byte encodings are reorganized so they are
467 demand-loaded like Encode::XX. Now only ascii is compiled into
469 ! lib/Encode/Alias.pm
470 for my $k (keys %hash){ delete $hash{$k}; }
471 is depreciated; fixed.
474 In this update, pod rewrites and alias fixes are the main issues
475 + lib/Encode/Supported.pod
476 Describes supported encodings
478 streamlined compiled-in encodings.
479 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
481 + Encode/ibm-125?.ucm
482 Added from icu distibution with any occurance of
483 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
484 some respect to icu staff, however.
485 + lib/Encode/Alias.pm
487 Alias difinitions in Encode.pm relocated.
490 packWARN patch from Paul Marquess via jhi
491 Message-Id: <20020321010101.O28978@alpha.hut.fi>
492 Paul added to AUTHORS as a result.
493 ! t/CJKalias.t -> t/Aliases.t
494 Renamed. Checks even more aliases and alias overloading
497 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
498 gbk => cp936 relocated to CN.pm
500 Test::More with plans (by jhi)
503 + lib/Encode/Description.pod
504 ! lib/Encode/Encoding.pm
505 Now the pod in Encode.pm is abridged as programming references.
506 lib/Encode/Description.pod contains the original, detailed description
507 and Encode::Encoding explains how to write your own module to
508 add new encodings. So far, lib/Encode/Description.pod contains
509 the whole pod once in Encode.pm. This is intentional.
511 Pod revisions by Anton Tagunov
512 Message-Id: <517178431.20020320174824@motor.ru>
514 all occrance of Encode::Tcl::Extended removed including pod
516 test now checks $encoding->name only; $encoding->{name} are
517 no longer check to find the canonical name.
518 ! lib/Encode/JP/JIS.pm
519 ! lib/Encode/JP/ISO_2022_JP.pm
520 ->name() added to be more compliant with API
526 Patch by Autrijus to add aliases to TW and fixes to POD
527 Message-Id: <20020320090619.GA24774@not.autrijus.org>
529 SADAHIRO Tomoyuki added as should. My apologies.
532 * First release to be uploaded to CPAN. For prehistoric changes,
533 please see Changes file of perl distibution as well as
534 perl-unicode@perl.org archive, available at:
535 http://archive.develooper.com/perl-unicode@perl.org/
537 Changes Since 0.92 includes;
542 + Mention to perl-unicode@perl.org added
544 + Encoding aliases added so you can feed locale names
545 and MIME Charset="" directly.
546 - Mention to JISX0212 removed because it's fixed
549 + Encoding aliases added. Note TW is left untouched because
550 euc-tw is not implemented in TW but in Encode::HanExtra.
551 Autrijus, you may fix Encode::HanExtra.
553 + to test encode aliases added