1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.60 2002/04/24 20:06:52 dankogai Exp $
6 $Revision: 1.60 $ $Date: 2002/04/24 20:06:52 $
8 "Thou shalt not assume %x works." -- jhi
9 Message-Id: <20020424210618.E24347@alpha.hut.fi>
10 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
11 low-memory build machines happy, now *.c is created for each *.ucm
12 (no table aggregation). You can still override this by setting
13 $ENV{AGGREGATE_TABLES}.
14 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
16 + lib/Encode/JP/JIS7.pm
17 Encoding-autodetect (mainly for Japanese encoding) added. In the
18 course of development, JIS7.pm was improved.
19 + lib/Encode/HTML/Header.pm
20 + lib/Encode/Config.pm
21 MIME B/Q Header Encoding Added!
22 ! Encode.pm Encode.xs t/fallback.t
23 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
24 Message-Id: <20020424130709.GA14211@tanglefoot>
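A minimal sketch of the two new fallbacks, assuming the constant names
Encode::FB_XMLCREF and Encode::FB_HTMLCREF documented in "perldoc Encode".
The sample string is made up for illustration. Because encode() may modify
its source argument when a CHECK value is given, each call gets its own copy.

```perl
use strict;
use warnings;
use Encode qw(encode);

# U+263A (WHITE SMILING FACE) cannot be represented in ASCII.
my $text = "smile: \x{263A}";

# encode() may clobber the source string when CHECK is set,
# so work on copies (illustrative variable names).
my $for_xml  = $text;
my $for_html = $text;

# XMLCREF: unmappable characters become hexadecimal references.
my $xml  = encode("ascii", $for_xml,  Encode::FB_XMLCREF());

# HTMLCREF: unmappable characters become decimal references.
my $html = encode("ascii", $for_html, Encode::FB_HTMLCREF());

print "$xml\n$html\n";
```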
26 1.59 $ 2002/04/22 23:54:22
28 needs_lines() and perlio_ok() are added to Internal encodings such
29 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
31 1.58 2002/04/22 23:54:22
33 s/MacChineseSimp/MacChineseTrad/ # ... oops.
38 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
39 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
40 t/(JP|KR).t are aggregated to t/CJKT.t
41 test data are all remade via bin/ucm2text.
42 And .... They are no longer skipped for -Uuseperlio !
44 1.57 2002/04/22 20:27:30
45 ! t/JP.t t/KR.t t/perlio.t
46 unless (find PerlIO::Layer 'perlio') ... line is back again.
47 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
49 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
50 ->perlio_ok now does eval{ require PerlIO::encoding } there so
51 it correctly returns 1 when PerlIO::encoding is yet loaded.
53 perl-current patch #16072 reflected
55 1.56 2002/04/22 09:48:07
56 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
57 New PerlIO::encoding 0.04 compliance met
59 1.55 2002/04/22 03:43:05
60 ! Encode.pm Encode.xs Unicode/Unicode.pm
61 needs_lines() defined so Encode::Encoding is no longer needed
64 1.54 2002/04/22 02:50:01
65 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
66 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
67 now perlio_ok is true by default if PerlIO::encoding->VERSION is
68 0.03 or larger. POD in Encode::Encoding revised to reflect this.
69 Encode::XS and Encode::Unicode now have a perlio_ok() method.
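A small sketch of how a caller might consult perlio_ok() before relying on
an :encoding layer. The choice of iso-8859-1 is only for illustration; the
value returned depends on the installed PerlIO::encoding.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Look up an encoding object (iso-8859-1 is an XS-based encoding).
my $enc = find_encoding("iso-8859-1")
    or die "encoding not available";

# Normalize the method's answer to 0 or 1 for printing.
my $ok = $enc->perlio_ok ? 1 : 0;

printf "%s: perlio_ok=%d\n", $enc->name, $ok;
```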
70 ! lib/Encode/Supported.pod
72 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
73 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
74 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
75 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
76 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
77 ! lib/Encode/Supported.pod
78 Huge document fixes by Philip.
81 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
82 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
84 binmode() after "<:encoding" to make Win32 happy, by Mattia.
85 Mattia added to AUTHORS file
86 Message-Id: <3CC3150F.5798.22A05AE@localhost>
88 1.52 2002/04/20 23:43:47
90 TODO: is now SKIP:, as NI-XS requested. Also adds a more
91 elaborate failure analysis.
93 A note on how to make sure of round-trip safety added to POD
94 section (so Autrijus is happier)
95 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
96 big5-(eten|hkscs) is round-trip safe again!
97 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
100 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
101 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
102 PerlIO coordination patches from NI-XS.
103 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
105 1.51 2002/04/20 09:58:23
107 Updated test suite by Autrijus so "make test" is happy again
108 Message-Id: <20020420082104.GA25037@not.autrijus.org>
110 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
112 TW/TW.pm TW/Makefile.PL
113 Updates by Autrijus. 'big5' is no longer a canonical but an
114 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
115 Message-Id: <20020419195346.GA19597@not.autrijus.org>
117 Fix by NI-XS that fallback may cause SEGV w/ Perl/Tk
118 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
120 PerlIO detection a little bit smarter; no longer uses eval qq{}
123 1.50 2002/04/19 06:13:02
124 ! Encode.pm Encode.xs Encode/encoding.h
126 New Fallback API implemented and documented. See "perldoc Encode"
128 ! lib/Encode/JP/JIS7.pm Encode.pm
129 + lib/Encode/PerlIO.pod t/perlio.t
130 API compliance met. However, it still does not work unless perlio
131 implements line buffer. See BUGS section in perldoc Encode::PerlIO
132 As a sensible workaround, perlio_ok() added to Encode.
134 ! lib/Encode/Supported.pod
136 Message-Id: <20020418174647.J8466@alpha.hut.fi>
138 Doc fixes from Autrijus
139 Message-Id: <20020418144131.GA10987@not.autrijus.org>
141 perlqq mode documented
143 + t/jisx0201.euc t/jisx0201.ref
144 ! t/jisx0208.euc t/jisx0208.ref
145 t/JP.t tests more rigorously and with other encodings
146 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
147 in jis0208 (used to contain jisx0201 part).
148 ! Encode/Makefile_PL.e2x
149 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
150 enc2xs and encode.h rather than hard-coding them. This makes the resulting
151 module fully CPANizable.
152 ! encoding.pm t/JP.t t/KR.t
153 PerlIO detection simplified (checks %INC instead of eval{})
154 ! Encode.xs Encode/encode.h
155 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
156 - lib/Encode/Unicode.pm
157 (en|de)code_xs relocated to where it belongs. Source reindented
160 Additional (U8 *) cast added as suggested by jhi
161 Message-Id: <20020417165916.A28599@alpha.hut.fi>
163 1.42 Date: 2002/04/17
165 no-op module; Thought of adding a pod there but enc2xs has
170 correct mechanism to detect PerlIO::encoding layer installed.
172 PerlIO Layer detached.
174 1.41 2002/04/16 23:35:00
176 binmode(STDIN|STDOUT ...) done iff PerlIO is available
178 Cleaned up PerlIO skip conditions to prepare for the upcoming
179 Encode - PerlIO forking.
181 exported functions are now prototyped.
182 ! lib/Encode/CN/HZ.pm
185 fallback implemented # was /* FIXME */
186 affected programs revised to fit (only HZ was using the try-catch
187 approach which needed to be fixed for API-compliance).
189 ! Encode/KR/2022_KR.pm
191 can find =head1 NAME now, jhi
192 Message-Id: <20020416083059.V30639@alpha.hut.fi>
196 now compiles with fewer warnings with the pickiest compilers.
197 Suggested by Craig, fixed by Dan.
198 ! Encode/Makefile_PL.e2x
200 A bug that fails to find *.e2x in certain conditions fixed
202 1.40 2002/04/14 22:27:14
203 + Encode/ConfigLocal_PM.e2x
204 ! lib/Encode/Config.pm
206 "enc2xs -C" now generates/updates Encode::ConfigLocal.
207 ConfigLocal_PM.e2x is a skeleton thereof.
208 ! lib/Encode/Config.pm
210 "use Encode::CN::HZ;" was missing.
213 More rigorous tests added to test XS, especially on memory allocation.
215 ! lib/Encode/Unicode.pm
216 NI-S implemented an XS version -- merged
217 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
220 Source filter option added. With this option on, you can write
221 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
222 encodings. t/jperl.t enhanced to test this feature.
224 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
225 apply his suggestion, this degree of nitting is enough to add him
227 Message-Id: <3CB93223.291E5E2E@earthlink.net>
229 + lib/Encode/JP/JIS7.pm
230 - lib/Encode/JP/JIS.pm
231 - lib/Encode/JP/2022_JP.pm
232 - lib/Encode/JP/2022_JP1.pm
233 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
234 JIS7.pm for better maintainability and performance
236 Added caveat for non-ascii identifiers.
238 fixes by jhi, the original author of this pragmatic module.
239 Message-Id: <20020413231527.V1826@alpha.hut.fi>
241 1.34 2002/04/12 20:23:05 (Unreleased)
244 EBCDIC fixes addressed by jhi.
245 Message-Id: <20020412161844.D9383@alpha.hut.fi>
246 ! lib/Encode/Encoder.pm
247 POD fix by Miyagawa-kun
248 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
250 1.33 2002/04/10 22:28:40
252 Philip's mail address corrected.
255 ! lib/Encode/Encoder.pm
256 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
257 This adds Miyagawa-kun to AUTHORS.
258 * encoding() no longer exported by default but on demand
259 * t/Encoder.t updated to test all these
260 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
261 ! lib/Encode/Unicode.pm
262 ! lib/Encode/Supported.pm
263 Further doc fixes by Anton
265 1.32 2002/04/09 20:06:15
268 - ucm/macDevanaga.ucm Unicode Character Map
269 - ucm/macGujarati.ucm Unicode Character Map
270 - ucm/macGurmukhi.ucm Unicode Character Map
271 A utility to check integrity of .ucm files. t/bogus.ucm is a
272 ucm that is deliberately bogus. unused Indic mappings are removed
275 resolve_alias() added as suggested by jhi. Same as
276 find_encoding("alias")->name. For convenience. This one is
277 defined in Encode.pm instead of Alias.pm.
278 Message-Id: <20020409215846.H17022@alpha.hut.fi>
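A short sketch of the equivalence described for resolve_alias(), using the
well-known alias "latin1" as an illustrative input.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

my $alias = "latin1";

# resolve_alias() returns the canonical name for an alias ...
my $via_resolve = Encode::resolve_alias($alias);

# ... which is the same value as asking the encoding object for its name.
my $via_find = find_encoding($alias)->name;

print "$alias => $via_resolve\n";
```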
280 Memory allocation bug detected during the devel of ucmlint -- fixed.
281 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
282 ! lib/Encode/Unicode.pm
283 valid_ucs2(0) is false but must be true.
284 3 patches from NI-S as follows. This also has fixed the incident
286 ! lib/Encode/Alias.pm
287 find_alias() recursion prevention
289 Checks for the patch above
290 ! t/Encode/Unicode.pm
291 An extra "F" that causes valid_ucs2() return a bogus value fixed
292 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
293 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
294 2 Small Patches from jhi as follows:
296 Encode->encodings() lists in case-insensitive order (as it was)
298 -l option prints available encodings to STDOUT instead of STDERR
299 ! lib/Encode/Aliases.pm
300 s/defintion/definition/
301 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
303 ! lib/Encode/Supported.pod
304 ! lib/Encode/Unicode.pm
305 POD revise by Philip Newton. This adds Philip to AUTHORS list.
306 Thank you for the exact quote of Douglas Adams :)
307 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
309 1.31 2002/04/08 18:08:07
310 ! lib/Encode/Encoder.pm
312 Encode::Encoder, once just a placeholder of an idea, is now much more
313 practical. See t/Encode.t to find how practical it can be.
314 + lib/Encode/Config.pm
316 my false laziness at Encode.pm is fixed. Now %ExtModules are set
317 in Encode::Config and they are all literally, not programmatically
318 set. My false laziness was resulting in many encodings missing from
320 ! lib/Encode/Unicode.pm
322 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
323 so that it does not rely on Encode::Unicode for BOM values
324 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
326 1.30 2002/04/08 02:34:51
327 + lib/Encode/Encoder.pm
328 Object Oriented Encoder. I reckon something like this is in need.
331 ! lib/Encode/Supported.pod
332 * autoloading bug that prevented upper-case canonicals such as UTF-16
333 is fixed. Now even UTF/UCS are autoloaded!
334 * encodings() is now more intuitive.
335 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
337 * Obligatory fixes to the POD.
338 ! lib/Encode/Supported.pod
339 Patch from Anton applied.
340 Message-Id: <66641479.20020408033300@motor.ru>
342 ! lib/Encode/Unicode.pm
343 Cosmetic changes: "bless $obj, $class" => "bless $obj => $class"
345 1.28 2002/04/07 18:58:42
349 Just a MANIFEST for those missing files.
351 1.26 Date: 2002/04/07 15:22:04
354 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
355 And t/Aliases.t fixed to make sure they all exist.
357 1.25 2002/04/07 15:01:25 (Unreleased)
359 ! lib/Encode/Unicode.pm
362 - lib/Encode/UTF_EBCDIC.pm
363 - lib/Encode/Internal.pm
365 Integrated into Encode.pm as closures. That way "one package, one file"
366 rule is preserved yet less files to require.
368 commented out binmode(STDERR ...
377 ! Encode/Makefile_PL.e2x
378 Schwern's MM-compliance patch merged
379 Message-Id: <20020406082609.GA28758@blackrider>
381 ! lib/Encode/Unicode.pm
382 + lib/Encode/UTF_EBCDIC.pm
384 - lib/Encode/10646_1.pm
385 - lib/Encode/ucs2_le.pm
386 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
387 per-module based (en|de)code, I saved a number of .pm by
388 reorganizing it as per-object base (Well, this is what Encode::XS
389 does under the hood). See Encode::Unicode for details.
390 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
391 This module is used only on EBCDIC environments.
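A brief sketch of the byte-order variants handled by Encode::Unicode,
assuming the canonical names "UTF-16LE", "UTF-16BE" and "UTF-16" as listed
in the Encode::Unicode documentation. The single-character input is only for
illustration.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# Explicit endianness: no BOM is written.
my $le = encode("UTF-16LE", "A");
my $be = encode("UTF-16BE", "A");

# No endianness given: big-endian output with a BOM prepended.
my $with_bom = encode("UTF-16", "A");

# Show the raw octets as hex strings.
printf "LE:  %s\n", unpack("H*", $le);
printf "BE:  %s\n", unpack("H*", $be);
printf "BOM: %s\n", unpack("H*", $with_bom);

# Decoding the BOM-prefixed form recovers the original character.
my $round_trip = decode("UTF-16", $with_bom);
```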
393 1.21 2002/04/05 14:46:34 (Not Released)
399 Are back to make Perl/Tk happy Smile, NI-S.
401 ! lib/Encode/Alias.pm
402 ! lib/Encode/Supported.pm
403 ! lib/Encode/10646_1.pm
404 ! lib/Encode/ucs2_le.pm
405 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
406 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
407 updated to reflect changes.
408 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
409 ! lib/Encode/Alias.pm
410 ! lib/Encode/Supported.pm
411 Anton's revision commited. Added Dan's own fixes as well.
412 Message-Id: <159103166906.20020405161134@motor.ru>
413 ! lib/Encode/Alias.pm
415 < qr/^UCS2-le$/i => '"UCS-2"', );
417 > qr/^UCS2-LE$/i => '"UTF-16LE"');
418 Sigh. Thank you, Anton.
419 Message-Id: <14567692196.20020405062020@motor.ru>
420 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
422 1.20 2002/04/04 19:50:52
424 the last minute addition. Just give it a try. Docs remain to be done.
425 Not installed by default.
426 ! lib/Encode/Supported.pod
429 ! lib/Encode/Alias.pm
431 ! lib/Encode/10646_1.pm
432 ! lib/Encode/ucs2_le.pm
433 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
434 unchanged but UTF-16BE is added as an alias. Implicit aliases
435 move to Encode::Alias so init_alias() works more as expected.
436 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
437 Though pedantically wrong, this should make perl mongers happier.
438 t/Alias.t is enhanced to test all these.
439 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
441 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
442 of the codepoints are common. CJKT left as is because the saving is
448 ! Encode/Makefile_PL.e2x
456 All occurrences of _def.h replaced with .exh so djgpp works happily
457 ever after! To credit this amazing discovery, Laszlo is now in
459 Message-Id: <20020403181424.GA8778@freemail.hu>
460 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
463 ! Encode/Makefile_PL.skel
465 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
466 Message-Id: <20020401222744.GX2000@blackrider>, et al.
468 Two more tests added by jhi
469 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
472 The showstopper fixed -- Memory reallocation bug was causing
473 Encode::XS to fall into infinite loop on certain conditions.
475 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
480 ! lib/Encode/Supported.pod
481 Vendor encodings rebuilt out of original map files at unicode.org.
482 Indic languages such as MacDevanagari remain unsupported due to the
483 shortcoming of encengine capabilities (they need algorithmical
484 conversion and I have no knowledge on that!). Pods fixed for added
486 Oh, macJapan.ucm renamed to macJapanese.ucm.
487 macROMnn is macRomanian and macRUMnn is macRumanian.
488 txt2ucm is a crude script that is used to convert them.
490 Unicode Compound Characters (used extensively on Mac) supported
492 Typo fixes and improvements by jhi
493 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
495 1.11 $Date: 2002/04/24 20:06:52 $
499 Missing files from the MANIFEST fixed.
500 Message-Id: <20020401010156.H10509@alpha.hut.fi>
501 Version incremented just to make CPAN happy.
503 1.10 2002/03/31 21:32:42
506 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
507 if you want. This should make Autrijus happy. Also, piconv
508 is added to default install.
511 Here-documented files that enc2xs generates are now exported
512 to *.e2x. Much cleaner and easier to debug.
514 encoding enhanced so you can make it act more like such
515 (now prehistoric ) "localized" variations of perl like Jperl.
517 Further test for encoding.pm. Written in euc-jp
521 Message-Id: <20020330174618.B10154@alpha.hut.fi>
526 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
533 ! Encode/macIceland.ucm
534 ! lib/Encode/Alias.pm
535 ! lib/Encode/Supported.pod
536 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
537 Message-Id: <m3lmcavhjt.fsf@anima.de>
539 1.01 2002/03/29 20:59:39
542 s/USE_SCRIPTS/MORE_SCRIPTS/
544 installs enc2xs by default for external Encode:: modules in CPAN,
545 such as Encode::HanExtra
547 More sensible perl core detection via $ENV{PERL_CORE}
549 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
551 Perl core detection via $^X =~ m/\bminiperl$/o
552 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
556 The version of all files is updated to 1.00 via "ci -f -l1.00",
557 commemorating version 1.00. All files, including *.ucm are now
558 under version control.
561 encode.h moved to Encode/ so it will be installed for the later
564 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
569 compile renamed to enc2xs.
570 Affected Makefile.PL updated
572 "Punt it. HanExtra can take care of that later." -- Autrijus
573 Message-Id: <20020328154338.GA7351@not.autrijus.org>
577 ! lib/Encode/CJKConstants.pm
578 ! lib/Encode/KR/2022_KR.pm
579 Table patches for Euro Signs, 2022-KR fixups by Jungshik
580 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
584 bin/ added for example scripts. They are not installed by default.
585 To install them, "perl Makefile.PL USE_SCRIPTS".
586 piconv is iconv reinvented in perl. In addition to all features
587 of iconv, it also adds perlish features. See L<piconv/1> for more
589 ! lib/Encode/Alias.pm
590 qr/^ replaced with qr/\b so it directly matches locale names
591 such as en_US.US-ASCII
594 Patch by MJD to fix the following problem applied.
595 Subject: [PATCH 5.7.3 Encode]
596 Aliases.t not properly skipped when Encode extension not built
597 Message-Id: <20020328091850.18677.qmail@plover.com>
598 ! lib/Encode/KR/2022_KR.pm
599 ! lib/Encode/CJKConstants.pm
600 Another patch from Jungshik to make iso-2022-kr actually work
601 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
602 ! Encode/Encode/euc-kr.ucm
603 + Encode/Encode/johab.ucm
604 ! Encode/Encode/ksc5601.ucm
606 ! Encode/KR/Makefile.PL
607 ! Encode/lib/Encode/Alias.pm
609 Johab support and complete revision of Korean Encoding by Jungshik
610 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
612 Revised to make up with now-dropped Encode::Details.
613 - lib/Encode/Details.pod
614 Dropped. Besides being obsolete, the topics are now covered in
620 Korean aliases fixed thanks to Jungshik Shin
621 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
622 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
624 =head1 NAME added to all modules to make buildtoc happy
625 Message-Id: <20020327041151.A10618@alpha.hut.fi>
626 - lib/Encode/CJKguide.pod
627 Too controversial and dropped from the dist. Will be available
628 separately on the web.
630 RCS tags added so table debugging gets easier (should that be
631 needed! I hope they all stay 1.00!)
632 + lib/Encode/CJKguide.pod
633 A detailed guide to mainly, but not limited to, CJK multibyte
636 + Encode/hp-roman8.ucm
638 ! Encode/Supported.pod
639 All occurrences of "roman8" replaced with "hp-roman8" to avoid
641 ! Encode/Supported.pod
644 Mac Encodings now comply with Inside Macintosh
646 Test for '-raw' conventions added.
648 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
652 "-raw" appended to canonical names.
653 File names stay unchanged thanks to UCM format.
654 ! lib/Encode/CN/HZ.pm
655 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
656 Message-Id: <20020326035210.GA2091@not.autrijus.org>
659 - lib/Encode/JP/Const.pm
660 + lib/Encode/CJKConstants.pm
661 + lib/Encode/CN/2022_CN.pm
662 + lib/Encode/KR/2022_KR.pm
671 * Support for ISO-2022-KR and ISO-2022-CN added.
673 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
674 * ucm2table autogenerates character table out of UCM files.
677 - lib/Encode/Supports.pod
678 + lib/Encode/Supported.pod
679 Names reverted due to popular demand.
680 8.3 rule applies only when there is a conflict.
681 Message-Id: <20020325095924.GD44120@not.autrijus.org>
686 - lib/Encode/Format/Enc.pod
688 * Character tables are now 100% ucm.
689 * All files under Encode/ are now 8.3-compliant
690 * some of missing encodings added (i.e. gsm0338 and nextstep)
691 * Vendor mappings aggregated with appropriate national std in
692 Makefile.PL, resulting in smaller *.so especially for CJK.
693 Following is result on Dan's FreeBSD box.
695 ---------------------------------------------------------------
696 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
697 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
698 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
699 blib/arch/auto/Encode/Encode.so 27,791 27,791
700 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
701 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
702 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
703 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
704 ---------------------------------------------------------------
705 Total 5,375,297 6,343,819
707 * As a result of ucm-transition, Encode::Tcl dropped because
708 Encode::Tcl demands *.enc.
709 Encode::Tcl will be supplied in a separate tarball with *.enc.
710 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
715 -lib/Encode/Supported.pod
716 +lib/Encode/Supports.pod
717 -lib/Encode/iso10646_1.pm
718 +lib/Encode/10646_1.pm
719 -lib/Encode/EncFormat.pod
720 +lib/Encode/Format/Enc.pod
721 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
722 - lib/Encode/JP/Constants.pm
723 + lib/Encode/JP/Consts.pm
724 ! lib/Encode/JP/JIS.pm
725 ! lib/Encode/JP/H2Z.pm
726 Version nit problem and 8.3 rule fix.
727 > Package namespace installed latest in CPAN file
728 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
729 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
730 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
731 accordingly. In addition, legacy "use vars qw()..." are replaced with
733 Message-Id: <20020325011248.D1561@alpha.hut.fi>
734 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
736 - lib/Encode/JP/ISO_2022_JP.pm
737 - lib/Encode/JP/ISO_2022_JP_1.pm
738 + lib/Encode/JP/2022_JP.pm
739 + lib/Encode/JP/2022_JP1.pm
741 8.3 naming conflict for vanilla fat addressed by jhi
742 Message-Id: <20020324201931.V22596@alpha.hut.fi>
745 Typecast fix addressed by jhi
746 Message-Id: <20020324185540.T22596@alpha.hut.fi>
749 ! lib/Encode/Supported.pod
751 + lib/Encode/JP/ISO_2022_JP_1.pm
752 ! lib/Encode/JP/ISO_2022_JP.pm
753 ! lib/Encode/JP/JIS.pm
755 Now Encode::JP is more strict on the difference between ISO-2022-JP
756 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
757 makes Anton happier :) FYI the previous version implements
758 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
759 ! lib/Encode/Supported.pod
762 Avoid core-dump in Encode with PERLIO=mmap by NI-S
763 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
768 ! lib/Encode/Supported.pod
769 pod fixes to replace F<http://...> to L<http://...>,
770 as suggested by Autrijus in:
771 Message-Id: <20020324083943.GA14901@not.autrijus.org>
772 ! lib/Encode/Supported.pod
773 fixes and enhancements by Anton
774 Message-Id: <10632060120.20020324103753@motor.ru>
775 ! lib/Encode/Alias.pm
776 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
777 added. Suggested by Anton then deobfuscated by Autrijus
778 Message-Id: <20020324064455.GA3667@not.autrijus.org>
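A minimal sketch of the two alias forms that define_alias() accepts, a plain
string and a regex whose replacement is an eval'ed string as in the GB entry
above. The alias names "MyWestern" and "my-latinN" are hypothetical and exist
only for this example.

```perl
use strict;
use warnings;
use Encode;
use Encode::Alias;    # exports define_alias()

# Plain string alias (hypothetical name).
define_alias( MyWestern => "iso-8859-1" );

# Regex alias: the replacement string is eval'ed, so $1 carries the
# captured digits, in the same style as the GB alias above.
define_alias( qr/^my-latin(\d+)$/i => '"iso-8859-$1"' );

my $plain   = Encode::resolve_alias("MyWestern");
my $pattern = Encode::resolve_alias("My-Latin2");

print "MyWestern => $plain\n";
print "My-Latin2 => $pattern\n";
```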
780 Further fix by Nicholas Clark
781 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
782 - lib/EncodeFormat.pod
783 + lib/Encode/EncFormat.pod
785 File renamed as suggested by Autrijus
787 ! lib/Encode/Details.pod
788 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
789 ! Encode.pm Sun Mar 24 13:43:47 2002
790 pod fixes by Autrijus.
791 Message-Id: <20020324062804.GA3595@not.autrijus.org>
792 Message-Id: <20020324075627.GB11986@not.autrijus.org>
794 ! lib/Encode/Alias.pm
796 now more EBCDIC conscious;
797 %ExtModules on EBCDIC system excludes CJK so that you don't
798 have to worry about the matched alias resulting cloaking.
799 t/Alias.t also revised to reflect changes. Verified by jhi
800 Message-Id: <20020324022929.D22596@alpha.hut.fi>
806 EBCDIC detection mechanism installed as in JP/JP.pm
807 Message-Id: <20020323211847.G19148@alpha.hut.fi>
815 Now all table files used by compile are postfixed '_t' to avoid
816 namespace collisions in case insensitive file systems once for all!
818 Message-ID: <58290227735.20020323195659@familiehaase.de>
820 Since the Encode::JP is unsupported under EBCDIC we
821 cannot run this test (aliases as such should work fine) -- jhi
822 Message-Id: <20020323202119.D19148@alpha.hut.fi>
824 duplicate occurrence of ascii.ucm and 8859-1.ucm
825 causes MacOS X dyld to cloak
831 < chdir 't' if -d 't';
833 > if (! -d 'blib' and -d 't'){ chdir 't' };
834 When you are "make test"-ing on Encode/ directory, you must not
835 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
836 remain unchanged. Also the situation detection was made simpler
837 in t/JP.t, which was originally;
838 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
840 "Use of uninitialized value in string eq at Encode.pm line 96."
844 -- Problem on case insensitive file systems
845 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
846 Message-ID: <88254111953.20020323095503@familiehaase.de>
849 "So I think it's a bug in gcc, not perl. But it still needs to be
851 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
852 Message-Id: <20020323170509.C96475@plum.flirble.org>
856 ! lib/Encode/Encoding.pm
857 ! lib/Encode/Alias.pm
858 ! lib/Encode/Supported.pod
860 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
861 Message-ID: <20020322073908.GB10539@blackrider>
864 "...I think we should include ISO 8859-1 as well." -- NI-S
865 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
870 ! lib/Encode/Alias.pm
871 alias definitions relocated to Encode::Alias so module autoloading
872 works for aliases also.
874 encodings() now accepts args to check ExtModules.
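A small sketch of calling encodings() with and without an argument. The
":all" argument is taken from the current Encode documentation; whether this
release accepted exactly that spelling is an assumption.

```perl
use strict;
use warnings;
use Encode;

# Without arguments: only the encodings that are already loaded.
my @loaded = Encode->encodings();

# With ":all": also consult the demand-loaded modules (%ExtModules).
my @all = Encode->encodings(":all");

printf "%d loaded, %d available in total\n", scalar @loaded, scalar @all;
```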
883 Latin and single byte encodings are reorganized so they are
884 demand-loaded like Encode::XX. Now only ascii is compiled into
886 ! lib/Encode/Alias.pm
887 for my $k (keys %hash){ delete $hash{$k}; }
888 is deprecated; fixed.
891 In this update, pod rewrites and alias fixes are the main issues
892 + lib/Encode/Supported.pod
893 Describes supported encodings
895 streamlined compiled-in encodings.
896 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
898 + Encode/ibm-125?.ucm
899 Added from icu distribution with every occurrence of
900 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
901 some respect to icu staff, however.
902 + lib/Encode/Alias.pm
904 Alias definitions in Encode.pm relocated.
907 packWARN patch from Paul Marquess via jhi
908 Message-Id: <20020321010101.O28978@alpha.hut.fi>
909 Paul added to AUTHORS as a result.
910 ! t/CJKalias.t -> t/Aliases.t
911 Renamed. Checks even more aliases and alias overloading
914 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
915 gbk => cp936 relocated to CN.pm
917 Test::More with plans (by jhi)
920 + lib/Encode/Description.pod
921 ! lib/Encode/Encoding.pm
922 Now the pod in Encode.pm is abridged as programming references.
923 lib/Encode/Description.pod contains the original, detailed description
924 and Encode::Encoding explains how to write your own module to
925 add new encodings. So far, lib/Encode/Description.pod contains
926 the whole pod once in Encode.pm. This is intentional.
928 Pod revisions by Anton Tagunov
929 Message-Id: <517178431.20020320174824@motor.ru>
931 all occurrences of Encode::Tcl::Extended removed including pod
933 test now checks $encoding->name only; $encoding->{name} are
934 no longer checked to find the canonical name.
935 ! lib/Encode/JP/JIS.pm
936 ! lib/Encode/JP/ISO_2022_JP.pm
937 ->name() added to be more compliant with API
943 Patch by Autrijus to add aliases to TW and fixes to POD
944 Message-Id: <20020320090619.GA24774@not.autrijus.org>
946 SADAHIRO Tomoyuki added as should. My apologies.
949 * First release to be uploaded to CPAN. For prehistoric changes,
950 please see Changes file of perl distribution as well as
951 perl-unicode@perl.org archive, available at:
952 http://archive.develooper.com/perl-unicode@perl.org/
954 Changes since 0.92 include:
959 + Mention to perl-unicode@perl.org added
961 + Encoding aliases added so you can feed locale names
962 and MIME Charset="" directly.
963 - Mention to JISX0212 removed because it's fixed
966 + Encoding aliases added. Note TW is left untouched because
967 euc-tw is not implemented in TW but in Encode::HanExtra.
968 Autrijus, you may fix Encode::HanExtra.
970 + to test encode aliases added