1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.62 2002/04/27 11:17:39 dankogai Exp dankogai $
6 $Revision: 1.62 $ $Date: 2002/04/27 11:17:39 $
8 encodings() now just check %ExtModule instead of eval{require}
9 all of them for ":all" to conserve more memory.
11 more "%x" -> "%" UVxf stuff.
13 s/=over2/=over 2/g # oops.
15 1.61 2002/04/26 03:02:04
17 Now does decent tests besides use_ok()
18 ! lib/Encode/Guess.pm t/guess.t
19 UI streamlined, document added
21 various signed/unsigned mismatch nits (#16173)
22 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
24 POD: utf8-flag-related caveats added. A few sections completely
28 Thou shalt not assume %d works, either!
29 Robin Baker added to AUTHORS for this
30 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
32 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
34 1.60 2002/04/24 20:06:52
36 "Thou shalt not assume %x works." -- jhi
37 Message-Id: <20020424210618.E24347@alpha.hut.fi>
38 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
39 low-memory build machines happy, now *.c is created for each *.ucm
40 (no table aggregation). You can still override this by setting
41 $ENV{AGGREGATE_TABLES}.
42 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
44 + lib/Encode/JP/JIS7.pm
45 Encoding-autodetect (mainly for Japanese encoding) added. In a
46 course of development, JIS7.pm was improved.
47 + lib/Encode/HTML/Header.pm
48 + lib/Encode/Config.pm
49 MIME B/Q Header Encoding Added!
50 ! Encode.pm Encode.xs t/fallback.t
51 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
52 Message-Id: <20020424130709.GA14211@tanglefoot>
54 1.59 $ 2002/04/22 23:54:22
56 needs_lines() and perlio_ok() are added to Internal encodings such
57 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
59 1.58 2002/04/22 23:54:22
61 s/MacChineseSimp/MacChineseTrad/ # ... oops.
66 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
67 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
68 t/(JP|KR).t are aggregated to t/CJKT.t
69 test data are all remade via bin/ucm2text.
70 And .... They are no longer skipped for -Uuseperlio !
72 1.57 2002/04/22 20:27:30
73 ! t/JP.t t/KR.t t/perlio.t
74 unless (find PerlIO::Layer 'perlio') ... line is back again.
75 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
77 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
78 ->perlio_ok now does eval{ require PerlIO::encoding } there so
79 it correctly returns 1 when PerlIO::encoding is yet loaded.
81 perl-current patch #16072 reflected
83 1.56 2002/04/22 09:48:07
84 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
85 New PerlIO::encoding 0.04 compliance met
87 1.55 2002/04/22 03:43:05
88 ! Encode.pm Encode.xs Unicode/Unicode.pm
89 needs_lines() defined so Encode::Encoding is no longer needed
92 1.54 2002/04/22 02:50:01
93 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
94 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
95 now perlio_ok is true by default if PerlIO::encoding->VERSION is
96 0.03 or larger. POD in Encode::Encoding revised to reflect this.
97 Encode::XS and Encode::Unicode now has perlio_ok() method.
98 ! lib/Encode/Supported.pod
100 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
101 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
102 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
103 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
104 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
105 ! lib/Encode/Supported.pod
106 Huge document fixes by Philip.
109 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
110 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
112 binmode() after "<:encoding" to make Win32 happy, by Mattia.
113 Mattia added to AUTHORS file
114 Message-Id: <3CC3150F.5798.22A05AE@localhost>
116 1.52 2002/04/20 23:43:47
118 TODO: is now SKIP:, as NI-XS requested. Also adds more
119 eraborate failure analysis added.
121 A note on how to make sure of round-trip safety added to POD
122 section (so Autrijus is happier)
123 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
124 big5-(eten|hkscs) is round-trip safe again!
125 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
127 Typo fixes by Andreas
128 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
129 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
130 PerIO coodination patches from NI-XS.
131 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
133 1.51 2002/04/20 09:58:23
135 Updated test suite by Autrijis so "make test" is happy again
136 Message-Id: <20020420082104.GA25037@not.autrijus.org>
138 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
140 TW/TW.pm TW/Makefile.PL
141 Updates by Autrijus. 'big5' is no longer a canonical but an
142 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
143 Message-Id: <20020419195346.GA19597@not.autrijus.org>
145 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
146 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
148 PerlIO detection a little bit smarter; no longer uses eval qq{}
151 1.50 2002/04/19 06:13:02
152 ! ! Encode.pm Encode.xs Encode/encoding.h
154 New Fallback API imlemented and documented. See "perldoc Encode"
156 ! lib/Encode/JP/JIS7.pm Encode.pm
157 + lib/Encode/PerlIO.pod t/perlio.t
158 API compliance met. However, it still does not work unless perlio
159 implements line buffer. See BUGS section in perldoc Encode::PerlIO
160 As a sensible workaround, perlio_ok() added to Encode.
162 ! lib/Encode/Supported.pod
164 Message-Id: <20020418174647.J8466@alpha.hut.fi>
166 Doc fixes from Autrijus
167 Message-Id: <20020418144131.GA10987@not.autrijus.org>
169 perlqq mode documented
171 + t/jisx0201.euc t/jisx0201.ref
172 ! t/jisx0208.euc t/jisx0208.ref
173 t/JP.t tests more rigorously and with other encodings
174 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
175 in jis0208 (used to contain jisx0201 part).
176 ! Encode/Makefile_PL.e2x
177 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
178 enc2xs and encode.h rather than hard-coded. This allows the resulting
179 module fully CPANizable.
180 ! encoding.pm t/JP.t t/KR.t
181 PerlIO detection simplified (checks %INC instead of eval{})
182 ! Encode.xs Encode/encode.h
183 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
184 - lib/Encode/Unicode.pm
185 (en|de)code_xs relocated to where it belongs. Source reindented
188 Additional (U8 *) cast added as suggested by jhi
189 Message-Id: <20020417165916.A28599@alpha.hut.fi>
191 1.42 Date: 2002/04/17
193 no-op module; Thought of adding a pod there but enc2xs has
198 correct mechanism to detect Perlio::encoding layar installed.
200 PerlIO Layer detached.
202 1.41 2002/04/16 23:35:00
204 binmode(STDIN|STDOUT ...) done iff PerlIO is available
206 Cleaned up PerlIO skip conditions to prepare for the upcoming
207 Encode - PerlIO forking.
209 exported functions are now prototyped.
210 ! lib/Encode/CN/HZ.pm
213 fallback implemented # was /* FIXME */
214 affected programs revised to fit (only HZ was using the try-catch
215 approach which needed to be fixed for API-compliance).
217 ! Encode/KR/2022_KR.pm
219 can find =head1 NAME now, jhi
220 Message-Id: <20020416083059.V30639@alpha.hut.fi>
224 now complies with less warnings with the pickest compilers.
225 Suggested by Craig, fixed by Dan.
226 ! Encode/Makefile_PL.e2x
228 A bug that fails to find *.e2x in certain conditions fixed
230 1.40 2002/04/14 22:27:14
231 + Encode/ConfigLocal_PM.e2x
232 ! lib/Encode/Config.pm
234 "enc2xs -C" now generates/updates Encode::ConfigLocal.
235 ConfigLocal_PM.e2x is a skelton thereof.
236 ! lib/Encode/Config.pm
238 "use Encode::CN::HZ;" was missing.
241 More rigorous tests added to test XS, especially on memory allocation.
243 ! lib/Encode/Unicode.pm
244 NI-S implemented an XS version -- merged
245 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
248 Source filter option added. With this option on, you can write
249 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
250 encodings. t/jperl.t enhanced to test this feature.
252 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
253 apply his suggestion, this degree of nitting is enough to add him
255 Message-Id: <3CB93223.291E5E2E@earthlink.net>
257 + lib/Encode/JP/JIS7.pm
258 - lib/Encode/JP/JIS.pm
259 - lib/Encode/JP/2022_JP.pm
260 - lib/Encode/JP/2022_JP1.pm
261 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
262 JIS7.pm for better maintainability and performance
264 Added caveat for non-ascii identifiers.
266 fixes by jhi, the original author of this pragramtic module.
267 Message-Id: <20020413231527.V1826@alpha.hut.fi>
269 1.34 2002/04/12 20:23:05 (Unreleased)
272 EBCDIC fixes addressed by jhi.
273 Message-Id: <20020412161844.D9383@alpha.hut.fi>
274 ! lib/Encode/Encoder.pm
275 POD fix by Miyagawa-kun
276 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
278 1.33 2002/04/10 22:28:40
280 Philip's mail address corrected.
283 ! lib/Encode/Encoder.pm
284 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
285 This adds Miyagawa-kun to AUTHORS.
286 * encoding() no longer exported by default but on demand
287 * t/Encoder.t updated to test all these
288 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
289 ! lib/Encode/Unicode.pm
290 ! lib/Encode/Supported.pm
291 Further doc fixes by Anton
293 1.32 2002/04/09 20:06:15
296 - ucm/macDevanaga.ucm Unicode Character Map
297 - ucm/macGujarati.ucm Unicode Character Map
298 - ucm/macGurmukhi.ucm Unicode Character Map
299 A utility to check integrity of .ucm files. t/bogus.ucm is a
300 ucm that is deliberately bogus. unused Indic mappings are removed
303 resolve_alias() added as suggested by jhi. Same as
304 find_encoding("alias")->name. For convenience. This one is
305 defined in Encode.pm instead of Alias.pm.
306 Message-Id: <20020409215846.H17022@alpha.hut.fi>
308 Memory Allocate but detected during the devel of ucmlint -- fixed.
309 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
310 ! lib/Encode/Unicode.pm
311 valid_ucs2(0) is false but must be true.
312 3 patches from NI-S as follows. This also has fixed the incident
314 ! lib/Encode/Alias.pm
315 find_alias() recursion prevention
317 Checks for the patch above
318 ! t/Encode/Unicode.pm
319 An extra "F" that causes valid_ucs2() return a bogus value fixed
320 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
321 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
322 2 Small Patches from jhi as follows:
324 Encode->encodings() lists in case-insensitve order (as it was)
326 -l option prints avaiable encodings to STDOUT instead of STDERR
327 ! lib/Encode/Aliases.pm
328 s/defintion/definition/
329 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
331 ! lib/Encode/Supported.pod
332 ! lib/Encode/Unicode.pm
333 POD revise by Philip Newton. This adds Philip to AUTHORS list.
334 Thank you for the exact quote of Douglas Adams :)
335 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
337 1.31 2002/04/08 18:08:07
338 ! lib/Encode/Encoder.pm
340 Encode::Encoder, once just a placeholder of an idea, is now much more
341 practical. See t/Encode.t to find how practical it can be.
342 + lib/Encode/Config.pm
344 my false laziness at Encode.pm is fixed. Now %ExtModules are set
345 in Encode::Config and they are all literally, not programatically
346 set. My false laziness was resulting many encodings missing from
348 ! lib/Encode/Unicode.pm
350 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
351 so that it does not rely Encode::Unicode for BOM values
352 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
354 1.30 2002/04/08 02:34:51
355 + lib/Encode/Encoder.pm
356 Object Oriented Encoder. I reckon something like this is in need.
359 ! lib/Encode/Supported.pod
360 * autoloading bug that prevented upper-case canonicals such as UTF-16
361 is fixed. Now even UTF/UCS are autoloaded!
362 * encodings() is now more intuitive.
363 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
365 * Obligatory fixes to the POD.
366 ! lib/Encode/Supported.pod
367 Patch from Anton applied.
368 Message-Id: <66641479.20020408033300@motor.ru>
370 ! lib/Encode/Unicode.pm
371 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
373 1.28 2002/04/07 18:58:42
377 Just a MANIFEST for those missing files.
379 1.26 Date: 2002/04/07 15:22:04
382 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
383 And t/Aliases.t fixed to make sure they all exist.
385 1.25 2002/04/07 15:01:25 (Unreleased)
387 ! lib/Encode/Unicode.pm
390 - lib/Encode/UTF_EBCDIC.pm
391 - lib/Encode/Internal.pm
393 Integrated into Encode.pm as closures. That way "one package, one file"
394 rule is preserved yet less files to require.
396 commented out binmode(STDERR ...
405 ! Encode/Makefile_PL.e2x
406 Schwarn's MM-compliance patch merged
407 Message-Id: <20020406082609.GA28758@blackrider>
409 ! lib/Encode/Unicode.pm
410 + lib/Encode/UTF_EBCDIC.pm
412 - lib/Encode/10646_1.pm
413 - lib/Encode/ucs2_le.pm
414 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
415 per-module based (en|de)code, I saved a number of .pm by
416 reorganizing it as per-object base (Well, this is what Encode::XS
417 does under the hood). See Encode::Unicode for details.
418 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
419 This module is used only on EBCDIC environments.
421 1.21 2002/04/05 14:46:34 (Not Released)
427 Are back to make Perl/Tk happy Smile, NI-S.
429 ! lib/Encode/Alias.pm
430 ! lib/Encode/Supported.pm
431 ! lib/Encode/10646_1.pm
432 ! lib/Encode/ucs2_le.pm
433 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
434 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
435 updated to reflect changes.
436 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
437 ! lib/Encode/Alias.pm
438 ! lib/Encode/Supported.pm
439 Anton's revision commited. Added Dan's own fixes as well.
440 Message-Id: <159103166906.20020405161134@motor.ru>
441 ! lib/Encode/Alias.pm
443 < qr/^UCS2-le$/i => '"UCS-2"', );
445 > qr/^UCS2-LE$/i => '"UTF-16LE"');
446 Sigh. Thank you, Anton.
447 Message-Id: <14567692196.20020405062020@motor.ru>
448 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
450 1.20 2002/04/04 19:50:52
452 the last minute addtion. Just give it a try. Docs remains to be done.
453 Not installed by default.
454 ! lib/Encode/Supported.pod
457 ! lib/Encode/Alias.pm
459 ! lib/Encode/10464_1.pm
460 ! lib/Encode/ucs2_le.pm
461 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
462 unchanged but UTF-16BE is added as an alias. Implicit aliases
463 move to Encode::Alias so init_alias() works more as expected.
464 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
465 Though pedantically wrong, This should make perl mongers happier.
466 t/Alias.t is enhanced to test all these.
467 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
469 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
470 of the codepoints are common. CJKT left as is because the saving is
476 ! Encode/Makefile_PL.e2x
484 All occurance of _def.h replaced with .exh so djgpp works happily
485 ever after! To credit this amazing discovery, Laszlo is now in
487 Message-Id: <20020403181424.GA8778@freemail.hu>
488 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
491 ! Encode/Makefile_PL.skel
493 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
494 Message-Id: <20020401222744.GX2000@blackrider>, et al.
496 Two more tests by added jhi
497 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
500 The showstopper fixed -- Memory reallocation bug was causing
501 Encode::XS to fall into infinite loop on certain conditions.
503 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
508 ! lib/Encode/Supported.pod
509 Vendor encodings rebuilt out of original map files at unicode.org.
510 Indic languages such as MacDevanagali remain unspported do to the
511 shortcoming of encengine capabilities (they need algorithmical
512 conversion and I have no knowledge on that!). Pods fixed for added
514 Oh, macJapan.ucm renamed to macJapanese.ucm.
515 macROMnn is macRomanian and macRUMnn is macRumanian.
516 txt2ucm is a crude script that is used to convert them.
518 Unicode Compound Characters (used extensively on Mac) supported
520 Typo fixes and improvements by jhi
521 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
523 1.11 $Date: 2002/04/27 11:17:39 $
527 Missing files from the MANIFEST fixed.
528 Message-Id: <20020401010156.H10509@alpha.hut.fi>
529 Version incremented just to make CPAN happy.
531 1.10 2002/03/31 21:32:42
534 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
535 if you want. This should make Autrijus happy. Also, piconv
536 is added to default install.
539 Here-documented files that enc2xs generates are now exported
540 to *.e2x. Much cleaner and easier to debug.
542 encoding enhances so you can make it act more like such
543 (now prehistoric ) "localized" variations of perl like Jperl.
545 Further test for encoding.pm. Written in euc-jp
549 Message-Id: <20020330174618.B10154@alpha.hut.fi>
554 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
561 ! Encode/macIceland.ucm
562 ! lib/Encode/Alias.pm
563 ! lib/Encode/Supported.pod
564 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
565 Message-Id: <m3lmcavhjt.fsf@anima.de>
567 1.01 2002/03/29 20:59:39
570 s/USE_SCRIPTS/MORE_SCRIPTS/
572 installs enc2xs by default for external Encode:: modules in CPAN,
573 such as Encode::HanExtra
575 More sensible perl core detection via $ENV{PERL_CORE}
577 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
579 Perl core ditection via $^X =~ m/\bminiperl$/o
580 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
584 The version of all files is updated to 1.00 via "ci -f -l1.00",
585 commemorating version 1.00. All files, including *.ucm are now
586 under version control.
589 encode.h moved to Encode/ so it will be installed for the later
592 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
597 compile renamed to enc2xs.
598 Affected Makefle.PL updated
600 "Punt it. HanExtra can take care of that later." -- Autrijus
601 Message-Id: <20020328154338.GA7351@not.autrijus.org>
605 ! lib/Encode/CJKConstants.pm
606 ! lib/Encode/KR/2022_KR.pm
607 Table patches for Euro Signs, 2022-KR fixups by Jungshik
608 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
612 bin/ added for example scripts. They are not installed by default.
613 to install them, "perl Makefile.PL USE_SCRIPTS".
614 piconv is iconv reinvented in perl. in addition to all features
615 of iconv, it also adds perlish features. See L<piconv/1> for more
617 ! lib/Encode/Alias.pm
618 qr/^ replaced with qr/\b so it directly matches locale names
619 such as en_US.US-ASCII
622 Patch by MJD to fix the following problem applied.
623 Subject: [PATCH 5.7.3 Encode]
624 Aliases.t not properly skipped when Encode extension not built
625 Message-Id: <20020328091850.18677.qmail@plover.com>
626 ! lib/Encode/KR/2022_KR.pm
627 ! lib/Encode/CJKConstants.pm
628 Another patch from Jungshik to make iso-2022-kr actually work
629 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
630 ! Encode/Encode/euc-kr.ucm
631 + Encode/Encode/johab.ucm
632 ! Encode/Encode/ksc5601.ucm
634 ! Encode/KR/Makefile.PL
635 ! Encode/lib/Encode/Alias.pm
637 Johab support and complete revision of Korean Encoding by Jungshik
638 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
640 Revised to make up with now-dropped Encode::Details.
641 - lib/Encode/Details.pod
642 Dropped. Besides being obsolete, the topics are now covered in
648 Korean aliases fixed thanks to Jungshik Shin
649 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
650 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
652 =head1 NAME added to all modules to make buildtoc happy
653 Message-Id: <20020327041151.A10618@alpha.hut.fi>
654 - lib/Encode/CJKguide.pod
655 Too controversial and dropped from the dist. Will be available
656 separately on the web.
658 RCS tags added so table debugging gets easier (should that be
659 needed! I hope they all stay 1.00!)
660 + lib/Encode/CJKguide.pod
661 A detailed guide to mainly, but not limited to, CJK multibyte
664 + Encode/hp-roman8.ucm
666 ! Encode/Supported.pod
667 All occurance of "roman8" replaced with "hp-roman8" to avoid
669 ! Encode/Supported.pod
672 Mac Encodings now comply the Inside Macintosh
674 Test for '-raw' conventions added.
676 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
680 "-raw" appended to canonical names.
681 File mames stay unchanged thanks to UCM format.
682 ! lib/Encode/CN/HZ.pm
683 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
684 Message-Id: <20020326035210.GA2091@not.autrijus.org>
687 - lib/Encode/JP/Const.pm
688 + lib/Encode/CJKConstants.pm
689 + lib/Encode/CN/2022_CN.pm
690 + lib/Encode/KR/2022_KR.pm
699 * Support for ISO-2022-KR and ISO-2022-CN added.
701 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
702 * ucm2table autogenerates character table out of UCM files.
705 - lib/Encode/Supports.pod
706 + lib/Encode/Supported.pod
707 Names reverted due to popular demand.
708 8.3 rule applies only when there is a conflict.
709 Message-Id: <20020325095924.GD44120@not.autrijus.org>
714 - lib/Encode/Format/Enc.pod
716 * Character tables is now 100% ucm.
717 * All files under Encode/ is now 8.3-compliant
718 * some of missing encodings added (i.e. gsm0338 and nextstep)
719 * Vendor mappings aggregated with appropriate national std in
720 Makefile.PL, resulting smaller *.so especially for CJK.
721 Following is result on Dan's FreeBSD box.
723 ---------------------------------------------------------------
724 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
725 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
726 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
727 blib/arch/auto/Encode/Encode.so 27,791 27,791
728 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
729 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
730 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
731 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
732 ---------------------------------------------------------------
733 Total 5,375,297 6,343,819
735 * As a result of ucm-transition, Encode::Tcl dropped because
736 Encode::Tcl demands *.enc.
737 Encode::Tcl will be supplied in a separate tarball with *.enc.
738 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
743 -lib/Encode/Supported.pod
744 +lib/Encode/Supports.pod
745 -lib/Encode/iso10646_1.pm
746 +lib/Encode/10646_1.pm
747 -lib/Encode/EncFormat.pod
748 +lib/Encode/Format/Enc.pod
749 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
750 - lib/Encode/JP/Constants.pm
751 + lib/Encode/JP/Consts.pm
752 ! lib/Encode/JP/JIS.pm
753 ! lib/Encode/JP/H2Z.pm
754 Version nit problem and 8.3 rule fix.
755 > Package namespace installed latest in CPAN file
756 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
757 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
758 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
759 accordingly. In addition, legacy "use vars qw()..." are replaced with
761 Message-Id: <20020325011248.D1561@alpha.hut.fi>
762 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
764 - lib/Encode/JP/ISO_2022_JP.pm
765 - lib/Encode/JP/ISO_2022_JP_1.pm
766 + lib/Encode/JP/2022_JP.pm
767 + lib/Encode/JP/2022_JP1.pm
769 8.3 naming conflict for vanilla fat addressed by jhi
770 Message-Id: <20020324201931.V22596@alpha.hut.fi>
773 Typecast fix addressed by jhi
774 Message-Id: <20020324185540.T22596@alpha.hut.fi>
777 ! lib/Encode/Supported.pod
779 + lib/Encode/JP/ISO_2022_JP_1.pm
780 ! lib/Encode/JP/ISO_2022_JP.pm
781 ! lib/Encode/JP/JIS.pm
783 Now Encode::JP is more strict on the difference between ISO-2022-JP
784 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
785 makes Anton happier :) FYI the previous version implements
786 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
787 ! lib/Encode/Supported.pod
790 Avoid core-dump in Encode with PERLIO=mmap by NI-S
791 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
796 ! lib/Encode/Suppoted.pod
797 pod fixes to replace F<http://...> to L<http://...>,
798 as suggested by Autrijius in:
799 Message-Id: <20020324083943.GA14901@not.autrijus.org>
800 ! lib/Encode/Suppoted.pod
801 fixes and enhancements by Anton
802 Message-Id: <10632060120.20020324103753@motor.ru>
803 ! lib/Encode/Alias.pm
804 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
805 added. Suggested by Anton then deobfuscated by Autrijius
806 Message-Id: <20020324064455.GA3667@not.autrijus.org>
808 Further fix by Nicholas Clark
809 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
810 - lib/EncodeFormat.pod
811 + lib/Encode/EncFormat.pod
813 File renamed as suggested by Autrijius
815 ! lib/Encode/Details.pod
816 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
817 ! Encode.pm Sun Mar 24 13:43:47 2002
818 pod fixes by Autrijius.
819 Message-Id: <20020324062804.GA3595@not.autrijus.org>
820 Message-Id: <20020324075627.GB11986@not.autrijus.org>
822 ! lib/Encode/Alias.pm
824 now more EBCDIC conscious;
825 %ExtModules on EBCDIC system excludes CJK so that you don't
826 have to worry about the matched alias resulting cloaking.
827 t/Alias.t also revised to reflect changes. Verified by jhi
828 Message-Id: <20020324022929.D22596@alpha.hut.fi>
834 EBCDIC detection mechanism installed as in JP/JP.pm
835 Message-Id: <20020323211847.G19148@alpha.hut.fi>
843 Now all table files used by compile are postfixed '_t' to avoid
844 namespace collisions in case insensitive file systems once for all!
846 Message-ID: <58290227735.20020323195659@familiehaase.de>
848 Since the Encode::JP is unsupported under EBCDIC we
849 cannot run this test (aliases as such should work fine) -- jhi
850 Message-Id: <20020323202119.D19148@alpha.hut.fi>
852 duplicate occurance of ascii.ucm and 8859-1.ucm
853 causes MacOS X dlyd to cloak
859 < chdir 't' if -d 't';
861 > if (! -d 'blib' and -d 't'){ chdir 't' };
862 When you are "make test"-ing on Encode/ directory, you must not
863 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
864 remain unchanced. Also the situation detection was made simpler
865 in t/JP.t, which was originally;
866 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
868 "Use of uninitialized value in string eq at Encode.pm line 96."
872 -- Problem on case insensitive file systems
873 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
874 Message-ID: <88254111953.20020323095503@familiehaase.de>
877 "So I think it's a bug in gcc, not perl. But it still needs to be
879 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
880 Message-Id: <20020323170509.C96475@plum.flirble.org>
884 ! lib/Encode/Encoding.pm
885 ! lib/Encode/Alias.pm
886 ! lib/Encode/Supported.pod
888 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
889 Message-ID: <20020322073908.GB10539@blackrider>
892 "...I think we should include ISO 8859-1 as well." -- NI-S
893 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
898 ! lib/Encode/Alias.pm
899 alias definitions relocated to Encode::Alias so module autoloading
900 works for aliases also.
902 encodings() now accepts args to check ExtModules.
911 Latin and single byte encodings are reorganized so they are
912 demand-loaded like Encode::XX. Now only ascii is compiled into
914 ! lib/Encode/Alias.pm
915 for my $k (keys %hash){ delete $hash{$k}; }
916 is depreciated; fixed.
919 In this update, pod rewrites and alias fixes are the main issues
920 + lib/Encode/Supported.pod
921 Describes supported encodings
923 streamlined compiled-in encodings.
924 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
926 + Encode/ibm-125?.ucm
927 Added from icu distibution with any occurance of
928 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
929 some respect to icu staff, however.
930 + lib/Encode/Alias.pm
932 Alias difinitions in Encode.pm relocated.
935 packWARN patch from Paul Marquess via jhi
936 Message-Id: <20020321010101.O28978@alpha.hut.fi>
937 Paul added to AUTHORS as a result.
938 ! t/CJKalias.t -> t/Aliases.t
939 Renamed. Checks even more aliases and alias overloading
942 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
943 gbk => cp936 relocated to CN.pm
945 Test::More with plans (by jhi)
948 + lib/Encode/Description.pod
949 ! lib/Encode/Encoding.pm
950 Now the pod in Encode.pm is abridged as programming references.
951 lib/Encode/Description.pod contains the original, detailed description
952 and Encode::Encoding explains how to write your own module to
953 add new encodings. So far, lib/Encode/Description.pod contains
954 the whole pod once in Encode.pm. This is intentional.
956 Pod revisions by Anton Tagunov
957 Message-Id: <517178431.20020320174824@motor.ru>
959 all occrance of Encode::Tcl::Extended removed including pod
961 test now checks $encoding->name only; $encoding->{name} are
962 no longer check to find the canonical name.
963 ! lib/Encode/JP/JIS.pm
964 ! lib/Encode/JP/ISO_2022_JP.pm
965 ->name() added to be more compliant with API
971 Patch by Autrijus to add aliases to TW and fixes to POD
972 Message-Id: <20020320090619.GA24774@not.autrijus.org>
974 SADAHIRO Tomoyuki added as should. My apologies.
977 * First release to be uploaded to CPAN. For prehistoric changes,
978 please see Changes file of perl distibution as well as
979 perl-unicode@perl.org archive, available at:
980 http://archive.develooper.com/perl-unicode@perl.org/
982 Changes Since 0.92 includes;
987 + Mention to perl-unicode@perl.org added
989 + Encoding aliases added so you can feed locale names
990 and MIME Charset="" directly.
991 - Mention to JISX0212 removed because it's fixed
994 + Encoding aliases added. Note TW is left untouched because
995 euc-tw is not implemented in TW but in Encode::HanExtra.
996 Autrijus, you may fix Encode::HanExtra.
998 + to test encode aliases added