1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.63 2002/04/27 18:59:50 dankogai Exp $
6 $Revision: 1.63 $ $Date: 2002/04/27 18:59:50 $
7 ! lib/Encode/Encoding.pm
8 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
9 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
10 Make use of the Encode::Encoding base class!
11 And other cleanups in Encode.xs upon NI-XS suggestions
12 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
14 1.62 2002/04/27 11:17:39
16 encodings() now just check %ExtModule instead of eval{require}
17 all of them for ":all" to conserve more memory.
19 more "%x" -> "%" UVxf stuff.
21 s/=over2/=over 2/g # oops.
23 1.61 2002/04/26 03:02:04
25 Now does decent tests besides use_ok()
26 ! lib/Encode/Guess.pm t/guess.t
27 UI streamlined, document added
29 various signed/unsigned mismatch nits (#16173)
30 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
32 POD: utf8-flag-related caveats added. A few sections completely
36 Thou shalt not assume %d works, either!
37 Robin Baker added to AUTHORS for this
38 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
40 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
42 1.60 2002/04/24 20:06:52
44 "Thou shalt not assume %x works." -- jhi
45 Message-Id: <20020424210618.E24347@alpha.hut.fi>
46 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
47 low-memory build machines happy, now *.c is created for each *.ucm
48 (no table aggregation). You can still override this by setting
49 $ENV{AGGREGATE_TABLES}.
50 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
52 + lib/Encode/JP/JIS7.pm
53 Encoding-autodetect (mainly for Japanese encoding) added. In a
54 course of development, JIS7.pm was improved.
55 + lib/Encode/HTML/Header.pm
56 + lib/Encode/Config.pm
57 MIME B/Q Header Encoding Added!
58 ! Encode.pm Encode.xs t/fallback.t
59 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
60 Message-Id: <20020424130709.GA14211@tanglefoot>
62 1.59 $ 2002/04/22 23:54:22
64 needs_lines() and perlio_ok() are added to Internal encodings such
65 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
67 1.58 2002/04/22 23:54:22
69 s/MacChineseSimp/MacChineseTrad/ # ... oops.
74 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
75 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
76 t/(JP|KR).t are aggregated to t/CJKT.t
77 test data are all remade via bin/ucm2text.
78 And .... They are no longer skipped for -Uuseperlio !
80 1.57 2002/04/22 20:27:30
81 ! t/JP.t t/KR.t t/perlio.t
82 unless (find PerlIO::Layer 'perlio') ... line is back again.
83 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
85 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
86 ->perlio_ok now does eval{ require PerlIO::encoding } there so
87 it correctly returns 1 when PerlIO::encoding is yet loaded.
89 perl-current patch #16072 reflected
91 1.56 2002/04/22 09:48:07
92 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
93 New PerlIO::encoding 0.04 compliance met
95 1.55 2002/04/22 03:43:05
96 ! Encode.pm Encode.xs Unicode/Unicode.pm
97 needs_lines() defined so Encode::Encoding is no longer needed
100 1.54 2002/04/22 02:50:01
101 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
102 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
103 now perlio_ok is true by default if PerlIO::encoding->VERSION is
104 0.03 or larger. POD in Encode::Encoding revised to reflect this.
105 Encode::XS and Encode::Unicode now has perlio_ok() method.
106 ! lib/Encode/Supported.pod
107 s/UP-UX/HP-UX/ by jhi
108 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
109 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
110 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
111 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
112 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
113 ! lib/Encode/Supported.pod
114 Huge document fixes by Philip.
117 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
118 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
120 binmode() after "<:encoding" to make Win32 happy, by Mattia.
121 Mattia added to AUTHORS file
122 Message-Id: <3CC3150F.5798.22A05AE@localhost>
124 1.52 2002/04/20 23:43:47
126 TODO: is now SKIP:, as NI-XS requested. Also adds more
127 eraborate failure analysis added.
129 A note on how to make sure of round-trip safety added to POD
130 section (so Autrijus is happier)
131 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
132 big5-(eten|hkscs) is round-trip safe again!
133 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
135 Typo fixes by Andreas
136 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
137 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
138 PerIO coodination patches from NI-XS.
139 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
141 1.51 2002/04/20 09:58:23
143 Updated test suite by Autrijis so "make test" is happy again
144 Message-Id: <20020420082104.GA25037@not.autrijus.org>
146 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
148 TW/TW.pm TW/Makefile.PL
149 Updates by Autrijus. 'big5' is no longer a canonical but an
150 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
151 Message-Id: <20020419195346.GA19597@not.autrijus.org>
153 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
154 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
156 PerlIO detection a little bit smarter; no longer uses eval qq{}
159 1.50 2002/04/19 06:13:02
160 ! ! Encode.pm Encode.xs Encode/encoding.h
162 New Fallback API imlemented and documented. See "perldoc Encode"
164 ! lib/Encode/JP/JIS7.pm Encode.pm
165 + lib/Encode/PerlIO.pod t/perlio.t
166 API compliance met. However, it still does not work unless perlio
167 implements line buffer. See BUGS section in perldoc Encode::PerlIO
168 As a sensible workaround, perlio_ok() added to Encode.
170 ! lib/Encode/Supported.pod
172 Message-Id: <20020418174647.J8466@alpha.hut.fi>
174 Doc fixes from Autrijus
175 Message-Id: <20020418144131.GA10987@not.autrijus.org>
177 perlqq mode documented
179 + t/jisx0201.euc t/jisx0201.ref
180 ! t/jisx0208.euc t/jisx0208.ref
181 t/JP.t tests more rigorously and with other encodings
182 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
183 in jis0208 (used to contain jisx0201 part).
184 ! Encode/Makefile_PL.e2x
185 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
186 enc2xs and encode.h rather than hard-coded. This allows the resulting
187 module fully CPANizable.
188 ! encoding.pm t/JP.t t/KR.t
189 PerlIO detection simplified (checks %INC instead of eval{})
190 ! Encode.xs Encode/encode.h
191 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
192 - lib/Encode/Unicode.pm
193 (en|de)code_xs relocated to where it belongs. Source reindented
196 Additional (U8 *) cast added as suggested by jhi
197 Message-Id: <20020417165916.A28599@alpha.hut.fi>
199 1.42 Date: 2002/04/17
201 no-op module; Thought of adding a pod there but enc2xs has
206 correct mechanism to detect Perlio::encoding layar installed.
208 PerlIO Layer detached.
210 1.41 2002/04/16 23:35:00
212 binmode(STDIN|STDOUT ...) done iff PerlIO is available
214 Cleaned up PerlIO skip conditions to prepare for the upcoming
215 Encode - PerlIO forking.
217 exported functions are now prototyped.
218 ! lib/Encode/CN/HZ.pm
221 fallback implemented # was /* FIXME */
222 affected programs revised to fit (only HZ was using the try-catch
223 approach which needed to be fixed for API-compliance).
225 ! Encode/KR/2022_KR.pm
227 can find =head1 NAME now, jhi
228 Message-Id: <20020416083059.V30639@alpha.hut.fi>
232 now complies with less warnings with the pickest compilers.
233 Suggested by Craig, fixed by Dan.
234 ! Encode/Makefile_PL.e2x
236 A bug that fails to find *.e2x in certain conditions fixed
238 1.40 2002/04/14 22:27:14
239 + Encode/ConfigLocal_PM.e2x
240 ! lib/Encode/Config.pm
242 "enc2xs -C" now generates/updates Encode::ConfigLocal.
243 ConfigLocal_PM.e2x is a skelton thereof.
244 ! lib/Encode/Config.pm
246 "use Encode::CN::HZ;" was missing.
249 More rigorous tests added to test XS, especially on memory allocation.
251 ! lib/Encode/Unicode.pm
252 NI-S implemented an XS version -- merged
253 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
256 Source filter option added. With this option on, you can write
257 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
258 encodings. t/jperl.t enhanced to test this feature.
260 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
261 apply his suggestion, this degree of nitting is enough to add him
263 Message-Id: <3CB93223.291E5E2E@earthlink.net>
265 + lib/Encode/JP/JIS7.pm
266 - lib/Encode/JP/JIS.pm
267 - lib/Encode/JP/2022_JP.pm
268 - lib/Encode/JP/2022_JP1.pm
269 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
270 JIS7.pm for better maintainability and performance
272 Added caveat for non-ascii identifiers.
274 fixes by jhi, the original author of this pragramtic module.
275 Message-Id: <20020413231527.V1826@alpha.hut.fi>
277 1.34 2002/04/12 20:23:05 (Unreleased)
280 EBCDIC fixes addressed by jhi.
281 Message-Id: <20020412161844.D9383@alpha.hut.fi>
282 ! lib/Encode/Encoder.pm
283 POD fix by Miyagawa-kun
284 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
286 1.33 2002/04/10 22:28:40
288 Philip's mail address corrected.
291 ! lib/Encode/Encoder.pm
292 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
293 This adds Miyagawa-kun to AUTHORS.
294 * encoding() no longer exported by default but on demand
295 * t/Encoder.t updated to test all these
296 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
297 ! lib/Encode/Unicode.pm
298 ! lib/Encode/Supported.pm
299 Further doc fixes by Anton
301 1.32 2002/04/09 20:06:15
304 - ucm/macDevanaga.ucm Unicode Character Map
305 - ucm/macGujarati.ucm Unicode Character Map
306 - ucm/macGurmukhi.ucm Unicode Character Map
307 A utility to check integrity of .ucm files. t/bogus.ucm is a
308 ucm that is deliberately bogus. unused Indic mappings are removed
311 resolve_alias() added as suggested by jhi. Same as
312 find_encoding("alias")->name. For convenience. This one is
313 defined in Encode.pm instead of Alias.pm.
314 Message-Id: <20020409215846.H17022@alpha.hut.fi>
316 Memory Allocate but detected during the devel of ucmlint -- fixed.
317 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
318 ! lib/Encode/Unicode.pm
319 valid_ucs2(0) is false but must be true.
320 3 patches from NI-S as follows. This also has fixed the incident
322 ! lib/Encode/Alias.pm
323 find_alias() recursion prevention
325 Checks for the patch above
326 ! t/Encode/Unicode.pm
327 An extra "F" that causes valid_ucs2() return a bogus value fixed
328 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
329 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
330 2 Small Patches from jhi as follows:
332 Encode->encodings() lists in case-insensitve order (as it was)
334 -l option prints avaiable encodings to STDOUT instead of STDERR
335 ! lib/Encode/Aliases.pm
336 s/defintion/definition/
337 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
339 ! lib/Encode/Supported.pod
340 ! lib/Encode/Unicode.pm
341 POD revise by Philip Newton. This adds Philip to AUTHORS list.
342 Thank you for the exact quote of Douglas Adams :)
343 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
345 1.31 2002/04/08 18:08:07
346 ! lib/Encode/Encoder.pm
348 Encode::Encoder, once just a placeholder of an idea, is now much more
349 practical. See t/Encode.t to find how practical it can be.
350 + lib/Encode/Config.pm
352 my false laziness at Encode.pm is fixed. Now %ExtModules are set
353 in Encode::Config and they are all literally, not programatically
354 set. My false laziness was resulting many encodings missing from
356 ! lib/Encode/Unicode.pm
358 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
359 so that it does not rely Encode::Unicode for BOM values
360 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
362 1.30 2002/04/08 02:34:51
363 + lib/Encode/Encoder.pm
364 Object Oriented Encoder. I reckon something like this is in need.
367 ! lib/Encode/Supported.pod
368 * autoloading bug that prevented upper-case canonicals such as UTF-16
369 is fixed. Now even UTF/UCS are autoloaded!
370 * encodings() is now more intuitive.
371 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
373 * Obligatory fixes to the POD.
374 ! lib/Encode/Supported.pod
375 Patch from Anton applied.
376 Message-Id: <66641479.20020408033300@motor.ru>
378 ! lib/Encode/Unicode.pm
379 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
381 1.28 2002/04/07 18:58:42
385 Just a MANIFEST for those missing files.
387 1.26 Date: 2002/04/07 15:22:04
390 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
391 And t/Aliases.t fixed to make sure they all exist.
393 1.25 2002/04/07 15:01:25 (Unreleased)
395 ! lib/Encode/Unicode.pm
398 - lib/Encode/UTF_EBCDIC.pm
399 - lib/Encode/Internal.pm
401 Integrated into Encode.pm as closures. That way "one package, one file"
402 rule is preserved yet less files to require.
404 commented out binmode(STDERR ...
413 ! Encode/Makefile_PL.e2x
414 Schwarn's MM-compliance patch merged
415 Message-Id: <20020406082609.GA28758@blackrider>
417 ! lib/Encode/Unicode.pm
418 + lib/Encode/UTF_EBCDIC.pm
420 - lib/Encode/10646_1.pm
421 - lib/Encode/ucs2_le.pm
422 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
423 per-module based (en|de)code, I saved a number of .pm by
424 reorganizing it as per-object base (Well, this is what Encode::XS
425 does under the hood). See Encode::Unicode for details.
426 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
427 This module is used only on EBCDIC environments.
429 1.21 2002/04/05 14:46:34 (Not Released)
435 Are back to make Perl/Tk happy Smile, NI-S.
437 ! lib/Encode/Alias.pm
438 ! lib/Encode/Supported.pm
439 ! lib/Encode/10646_1.pm
440 ! lib/Encode/ucs2_le.pm
441 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
442 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
443 updated to reflect changes.
444 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
445 ! lib/Encode/Alias.pm
446 ! lib/Encode/Supported.pm
447 Anton's revision commited. Added Dan's own fixes as well.
448 Message-Id: <159103166906.20020405161134@motor.ru>
449 ! lib/Encode/Alias.pm
451 < qr/^UCS2-le$/i => '"UCS-2"', );
453 > qr/^UCS2-LE$/i => '"UTF-16LE"');
454 Sigh. Thank you, Anton.
455 Message-Id: <14567692196.20020405062020@motor.ru>
456 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
458 1.20 2002/04/04 19:50:52
460 the last minute addtion. Just give it a try. Docs remains to be done.
461 Not installed by default.
462 ! lib/Encode/Supported.pod
465 ! lib/Encode/Alias.pm
467 ! lib/Encode/10464_1.pm
468 ! lib/Encode/ucs2_le.pm
469 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
470 unchanged but UTF-16BE is added as an alias. Implicit aliases
471 move to Encode::Alias so init_alias() works more as expected.
472 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
473 Though pedantically wrong, This should make perl mongers happier.
474 t/Alias.t is enhanced to test all these.
475 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
477 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
478 of the codepoints are common. CJKT left as is because the saving is
484 ! Encode/Makefile_PL.e2x
492 All occurance of _def.h replaced with .exh so djgpp works happily
493 ever after! To credit this amazing discovery, Laszlo is now in
495 Message-Id: <20020403181424.GA8778@freemail.hu>
496 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
499 ! Encode/Makefile_PL.skel
501 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
502 Message-Id: <20020401222744.GX2000@blackrider>, et al.
504 Two more tests by added jhi
505 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
508 The showstopper fixed -- Memory reallocation bug was causing
509 Encode::XS to fall into infinite loop on certain conditions.
511 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
516 ! lib/Encode/Supported.pod
517 Vendor encodings rebuilt out of original map files at unicode.org.
518 Indic languages such as MacDevanagali remain unspported do to the
519 shortcoming of encengine capabilities (they need algorithmical
520 conversion and I have no knowledge on that!). Pods fixed for added
522 Oh, macJapan.ucm renamed to macJapanese.ucm.
523 macROMnn is macRomanian and macRUMnn is macRumanian.
524 txt2ucm is a crude script that is used to convert them.
526 Unicode Compound Characters (used extensively on Mac) supported
528 Typo fixes and improvements by jhi
529 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
531 1.11 $Date: 2002/04/27 18:59:50 $
535 Missing files from the MANIFEST fixed.
536 Message-Id: <20020401010156.H10509@alpha.hut.fi>
537 Version incremented just to make CPAN happy.
539 1.10 2002/03/31 21:32:42
542 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
543 if you want. This should make Autrijus happy. Also, piconv
544 is added to default install.
547 Here-documented files that enc2xs generates are now exported
548 to *.e2x. Much cleaner and easier to debug.
550 encoding enhances so you can make it act more like such
551 (now prehistoric ) "localized" variations of perl like Jperl.
553 Further test for encoding.pm. Written in euc-jp
557 Message-Id: <20020330174618.B10154@alpha.hut.fi>
562 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
569 ! Encode/macIceland.ucm
570 ! lib/Encode/Alias.pm
571 ! lib/Encode/Supported.pod
572 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
573 Message-Id: <m3lmcavhjt.fsf@anima.de>
575 1.01 2002/03/29 20:59:39
578 s/USE_SCRIPTS/MORE_SCRIPTS/
580 installs enc2xs by default for external Encode:: modules in CPAN,
581 such as Encode::HanExtra
583 More sensible perl core detection via $ENV{PERL_CORE}
585 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
587 Perl core ditection via $^X =~ m/\bminiperl$/o
588 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
592 The version of all files is updated to 1.00 via "ci -f -l1.00",
593 commemorating version 1.00. All files, including *.ucm are now
594 under version control.
597 encode.h moved to Encode/ so it will be installed for the later
600 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
605 compile renamed to enc2xs.
606 Affected Makefle.PL updated
608 "Punt it. HanExtra can take care of that later." -- Autrijus
609 Message-Id: <20020328154338.GA7351@not.autrijus.org>
613 ! lib/Encode/CJKConstants.pm
614 ! lib/Encode/KR/2022_KR.pm
615 Table patches for Euro Signs, 2022-KR fixups by Jungshik
616 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
620 bin/ added for example scripts. They are not installed by default.
621 to install them, "perl Makefile.PL USE_SCRIPTS".
622 piconv is iconv reinvented in perl. in addition to all features
623 of iconv, it also adds perlish features. See L<piconv/1> for more
625 ! lib/Encode/Alias.pm
626 qr/^ replaced with qr/\b so it directly matches locale names
627 such as en_US.US-ASCII
630 Patch by MJD to fix the following problem applied.
631 Subject: [PATCH 5.7.3 Encode]
632 Aliases.t not properly skipped when Encode extension not built
633 Message-Id: <20020328091850.18677.qmail@plover.com>
634 ! lib/Encode/KR/2022_KR.pm
635 ! lib/Encode/CJKConstants.pm
636 Another patch from Jungshik to make iso-2022-kr actually work
637 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
638 ! Encode/Encode/euc-kr.ucm
639 + Encode/Encode/johab.ucm
640 ! Encode/Encode/ksc5601.ucm
642 ! Encode/KR/Makefile.PL
643 ! Encode/lib/Encode/Alias.pm
645 Johab support and complete revision of Korean Encoding by Jungshik
646 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
648 Revised to make up with now-dropped Encode::Details.
649 - lib/Encode/Details.pod
650 Dropped. Besides being obsolete, the topics are now covered in
656 Korean aliases fixed thanks to Jungshik Shin
657 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
658 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
660 =head1 NAME added to all modules to make buildtoc happy
661 Message-Id: <20020327041151.A10618@alpha.hut.fi>
662 - lib/Encode/CJKguide.pod
663 Too controversial and dropped from the dist. Will be available
664 separately on the web.
666 RCS tags added so table debugging gets easier (should that be
667 needed! I hope they all stay 1.00!)
668 + lib/Encode/CJKguide.pod
669 A detailed guide to mainly, but not limited to, CJK multibyte
672 + Encode/hp-roman8.ucm
674 ! Encode/Supported.pod
675 All occurance of "roman8" replaced with "hp-roman8" to avoid
677 ! Encode/Supported.pod
680 Mac Encodings now comply the Inside Macintosh
682 Test for '-raw' conventions added.
684 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
688 "-raw" appended to canonical names.
689 File mames stay unchanged thanks to UCM format.
690 ! lib/Encode/CN/HZ.pm
691 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
692 Message-Id: <20020326035210.GA2091@not.autrijus.org>
695 - lib/Encode/JP/Const.pm
696 + lib/Encode/CJKConstants.pm
697 + lib/Encode/CN/2022_CN.pm
698 + lib/Encode/KR/2022_KR.pm
707 * Support for ISO-2022-KR and ISO-2022-CN added.
709 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
710 * ucm2table autogenerates character table out of UCM files.
713 - lib/Encode/Supports.pod
714 + lib/Encode/Supported.pod
715 Names reverted due to popular demand.
716 8.3 rule applies only when there is a conflict.
717 Message-Id: <20020325095924.GD44120@not.autrijus.org>
722 - lib/Encode/Format/Enc.pod
724 * Character tables is now 100% ucm.
725 * All files under Encode/ is now 8.3-compliant
726 * some of missing encodings added (i.e. gsm0338 and nextstep)
727 * Vendor mappings aggregated with appropriate national std in
728 Makefile.PL, resulting smaller *.so especially for CJK.
729 Following is result on Dan's FreeBSD box.
731 ---------------------------------------------------------------
732 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
733 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
734 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
735 blib/arch/auto/Encode/Encode.so 27,791 27,791
736 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
737 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
738 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
739 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
740 ---------------------------------------------------------------
741 Total 5,375,297 6,343,819
743 * As a result of ucm-transition, Encode::Tcl dropped because
744 Encode::Tcl demands *.enc.
745 Encode::Tcl will be supplied in a separate tarball with *.enc.
746 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
751 -lib/Encode/Supported.pod
752 +lib/Encode/Supports.pod
753 -lib/Encode/iso10646_1.pm
754 +lib/Encode/10646_1.pm
755 -lib/Encode/EncFormat.pod
756 +lib/Encode/Format/Enc.pod
757 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
758 - lib/Encode/JP/Constants.pm
759 + lib/Encode/JP/Consts.pm
760 ! lib/Encode/JP/JIS.pm
761 ! lib/Encode/JP/H2Z.pm
762 Version nit problem and 8.3 rule fix.
763 > Package namespace installed latest in CPAN file
764 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
765 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
766 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
767 accordingly. In addition, legacy "use vars qw()..." are replaced with
769 Message-Id: <20020325011248.D1561@alpha.hut.fi>
770 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
772 - lib/Encode/JP/ISO_2022_JP.pm
773 - lib/Encode/JP/ISO_2022_JP_1.pm
774 + lib/Encode/JP/2022_JP.pm
775 + lib/Encode/JP/2022_JP1.pm
777 8.3 naming conflict for vanilla fat addressed by jhi
778 Message-Id: <20020324201931.V22596@alpha.hut.fi>
781 Typecast fix addressed by jhi
782 Message-Id: <20020324185540.T22596@alpha.hut.fi>
785 ! lib/Encode/Supported.pod
787 + lib/Encode/JP/ISO_2022_JP_1.pm
788 ! lib/Encode/JP/ISO_2022_JP.pm
789 ! lib/Encode/JP/JIS.pm
791 Now Encode::JP is more strict on the difference between ISO-2022-JP
792 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
793 makes Anton happier :) FYI the previous version implements
794 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
795 ! lib/Encode/Supported.pod
798 Avoid core-dump in Encode with PERLIO=mmap by NI-S
799 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
804 ! lib/Encode/Suppoted.pod
805 pod fixes to replace F<http://...> to L<http://...>,
806 as suggested by Autrijius in:
807 Message-Id: <20020324083943.GA14901@not.autrijus.org>
808 ! lib/Encode/Suppoted.pod
809 fixes and enhancements by Anton
810 Message-Id: <10632060120.20020324103753@motor.ru>
811 ! lib/Encode/Alias.pm
812 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
813 added. Suggested by Anton then deobfuscated by Autrijius
814 Message-Id: <20020324064455.GA3667@not.autrijus.org>
816 Further fix by Nicholas Clark
817 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
818 - lib/EncodeFormat.pod
819 + lib/Encode/EncFormat.pod
821 File renamed as suggested by Autrijius
823 ! lib/Encode/Details.pod
824 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
825 ! Encode.pm Sun Mar 24 13:43:47 2002
826 pod fixes by Autrijius.
827 Message-Id: <20020324062804.GA3595@not.autrijus.org>
828 Message-Id: <20020324075627.GB11986@not.autrijus.org>
830 ! lib/Encode/Alias.pm
832 now more EBCDIC conscious;
833 %ExtModules on EBCDIC system excludes CJK so that you don't
834 have to worry about the matched alias resulting cloaking.
835 t/Alias.t also revised to reflect changes. Verified by jhi
836 Message-Id: <20020324022929.D22596@alpha.hut.fi>
842 EBCDIC detection mechanism installed as in JP/JP.pm
843 Message-Id: <20020323211847.G19148@alpha.hut.fi>
851 Now all table files used by compile are postfixed '_t' to avoid
852 namespace collisions in case insensitive file systems once for all!
854 Message-ID: <58290227735.20020323195659@familiehaase.de>
856 Since the Encode::JP is unsupported under EBCDIC we
857 cannot run this test (aliases as such should work fine) -- jhi
858 Message-Id: <20020323202119.D19148@alpha.hut.fi>
860 duplicate occurance of ascii.ucm and 8859-1.ucm
861 causes MacOS X dlyd to cloak
867 < chdir 't' if -d 't';
869 > if (! -d 'blib' and -d 't'){ chdir 't' };
870 When you are "make test"-ing on Encode/ directory, you must not
871 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
872 remain unchanced. Also the situation detection was made simpler
873 in t/JP.t, which was originally;
874 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
876 "Use of uninitialized value in string eq at Encode.pm line 96."
880 -- Problem on case insensitive file systems
881 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
882 Message-ID: <88254111953.20020323095503@familiehaase.de>
885 "So I think it's a bug in gcc, not perl. But it still needs to be
887 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
888 Message-Id: <20020323170509.C96475@plum.flirble.org>
892 ! lib/Encode/Encoding.pm
893 ! lib/Encode/Alias.pm
894 ! lib/Encode/Supported.pod
896 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
897 Message-ID: <20020322073908.GB10539@blackrider>
900 "...I think we should include ISO 8859-1 as well." -- NI-S
901 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
906 ! lib/Encode/Alias.pm
907 alias definitions relocated to Encode::Alias so module autoloading
908 works for aliases also.
910 encodings() now accepts args to check ExtModules.
919 Latin and single byte encodings are reorganized so they are
920 demand-loaded like Encode::XX. Now only ascii is compiled into
922 ! lib/Encode/Alias.pm
923 for my $k (keys %hash){ delete $hash{$k}; }
924 is depreciated; fixed.
927 In this update, pod rewrites and alias fixes are the main issues
928 + lib/Encode/Supported.pod
929 Describes supported encodings
931 streamlined compiled-in encodings.
932 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
934 + Encode/ibm-125?.ucm
935 Added from icu distibution with any occurance of
936 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
937 some respect to icu staff, however.
938 + lib/Encode/Alias.pm
940 Alias difinitions in Encode.pm relocated.
943 packWARN patch from Paul Marquess via jhi
944 Message-Id: <20020321010101.O28978@alpha.hut.fi>
945 Paul added to AUTHORS as a result.
946 ! t/CJKalias.t -> t/Aliases.t
947 Renamed. Checks even more aliases and alias overloading
950 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
951 gbk => cp936 relocated to CN.pm
953 Test::More with plans (by jhi)
956 + lib/Encode/Description.pod
957 ! lib/Encode/Encoding.pm
958 Now the pod in Encode.pm is abridged as programming references.
959 lib/Encode/Description.pod contains the original, detailed description
960 and Encode::Encoding explains how to write your own module to
961 add new encodings. So far, lib/Encode/Description.pod contains
962 the whole pod once in Encode.pm. This is intentional.
964 Pod revisions by Anton Tagunov
965 Message-Id: <517178431.20020320174824@motor.ru>
967 all occrance of Encode::Tcl::Extended removed including pod
969 test now checks $encoding->name only; $encoding->{name} are
970 no longer check to find the canonical name.
971 ! lib/Encode/JP/JIS.pm
972 ! lib/Encode/JP/ISO_2022_JP.pm
973 ->name() added to be more compliant with API
979 Patch by Autrijus to add aliases to TW and fixes to POD
980 Message-Id: <20020320090619.GA24774@not.autrijus.org>
982 SADAHIRO Tomoyuki added as should. My apologies.
985 * First release to be uploaded to CPAN. For prehistoric changes,
986 please see Changes file of perl distibution as well as
987 perl-unicode@perl.org archive, available at:
988 http://archive.develooper.com/perl-unicode@perl.org/
990 Changes Since 0.92 includes;
995 + Mention to perl-unicode@perl.org added
997 + Encoding aliases added so you can feed locale names
998 and MIME Charset="" directly.
999 - Mention to JISX0212 removed because it's fixed
1002 + Encoding aliases added. Note TW is left untouched because
1003 euc-tw is not implemented in TW but in Encode::HanExtra.
1004 Autrijus, you may fix Encode::HanExtra.
1006 + to test encode aliases added