1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.67 2002/05/02 07:33:09 dankogai Exp $
6 $Revision: 1.67 $ $Date: 2002/05/02 07:33:09 $
8 Error message now consistent w/ perlqq (\N{U+} -> \x{})
9 done in perl@16308 but Philip linted me further. Now the error
10 messages are defined as the macros ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
12 Sanity check for happier -w by Autrijus
14 1.66 2002/05/01 05:41:06
15 ! Encode.xs t/fallback.t
16 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
17 while fallback is in effect. This even came with a welcome side-effect
18 of cleaner code with less nesting! Thank you, NI-XS. t/fallback.t is
19 also modified to test this.
20 And of course, the corresponding variables to UV[Xx]f are appropriately
21 cast. This should've concluded NI-XS homework.
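A minimal sketch of the behaviour described for 1.66, assuming a current Encode install: WARN_ON_ERR is combined with the PERLQQ fallback and RETURN_ON_ERR is left unset, so a warning is issued while the substitution still takes place. The warning collector and variable names are illustrative only.

```perl
use strict;
use warnings;
use Encode ();

# Collect warnings instead of printing them, so the effect is visible.
my @warnings;
local $SIG{__WARN__} = sub { push @warnings, @_ };

# U+3042 has no mapping in ascii. PERLQQ asks for \x{HHHH} substitution,
# WARN_ON_ERR asks for a warning; RETURN_ON_ERR is deliberately not set.
my $source = "a\x{3042}b";    # encode() may modify its input when CHECK is set
my $octets = Encode::encode("ascii", $source,
    Encode::PERLQQ() | Encode::WARN_ON_ERR());

print "result:   $octets\n";
print "warnings: ", scalar(@warnings), "\n";
```

Because encoding continues past the unmappable character, the trailing "b" is still present in the result.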
23 encode(undef) does warn again! Repented upon suggestion by NI-XS.
24 Document for unless vs. '' added
25 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
27 1.65 2002/04/30 16:13:37
29 encode(undef) no longer warns for C<Use of uninitialized value in
30 subroutine entry>. Suggested by Paul.
31 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
32 ! lib/Encode/Supported.pod
33 Encode::MIME::Header and Encode::Guess mentioned
34 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
36 POD fix by Miyagawa-kun
37 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
39 1.64 2002/04/29 06:54:06
41 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
42 encode("euc-jisx0213") are still DIFFERENT.
43 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
45 A few white spaces corrected by NI-XS via PerlIO integration to
47 Subject: Change 16247: Integrate perlio;
49 Document fixes by Andreas
50 Message-Id: <m3k7qsf1we.fsf@anima.de>
52 1.63 2002/04/27 18:59:50
53 ! lib/Encode/Encoding.pm
54 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
55 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
56 Make use of the Encode::Encoding base class!
57 And other cleanups in Encode.xs upon NI-XS suggestions
58 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
60 1.62 2002/04/27 11:17:39
62 encodings() now just checks %ExtModule instead of eval{require}
63 all of them for ":all" to conserve more memory.
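As a rough illustration of the ":all" form mentioned above (a sketch against a current Encode install, not the 1.62 internals), the class-method call lists every encoding Encode can load on demand, while the argument-less call lists only what is already loaded:

```perl
use strict;
use warnings;
use Encode ();

# Encodings that are already loaded in this process.
my @loaded = Encode->encodings();

# Every encoding Encode knows how to load on demand, listed without
# requiring each module first.
my @all = Encode->encodings(":all");

printf "%d loaded, %d available\n", scalar @loaded, scalar @all;
```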
65 more "%x" -> "%" UVxf stuff.
67 s/=over2/=over 2/g # oops.
69 1.61 2002/04/26 03:02:04
71 Now does decent tests besides use_ok()
72 ! lib/Encode/Guess.pm t/guess.t
73 UI streamlined, document added
75 various signed/unsigned mismatch nits (#16173)
76 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
78 POD: utf8-flag-related caveats added. A few sections completely
82 Thou shalt not assume %d works, either!
83 Robin Baker added to AUTHORS for this
84 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
86 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
88 1.60 2002/04/24 20:06:52
90 "Thou shalt not assume %x works." -- jhi
91 Message-Id: <20020424210618.E24347@alpha.hut.fi>
92 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
93 low-memory build machines happy, now *.c is created for each *.ucm
94 (no table aggregation). You can still override this by setting
95 $ENV{AGGREGATE_TABLES}.
96 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
98 + lib/Encode/JP/JIS7.pm
99 Encoding-autodetect (mainly for Japanese encoding) added. In the
100 course of development, JIS7.pm was improved.
101 + lib/Encode/HTML/Header.pm
102 + lib/Encode/Config.pm
103 MIME B/Q Header Encoding Added!
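A small sketch of the MIME header encoding as it is exposed today under the encoding name "MIME-Header" (assumed here to be the same name this entry introduced). The sample header is a hand-built base64 encoded word for U+3042.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# A base64 ("B") encoded word carrying U+3042 (HIRAGANA LETTER A) in UTF-8.
my $header = '=?UTF-8?B?44GC?=';
my $text   = decode('MIME-Header', $header);

printf "decoded to U+%04X\n", ord $text;

# Round trip: encode() chooses its own line layout, so compare after decoding.
my $again = decode('MIME-Header', encode('MIME-Header', $text));
```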
104 ! Encode.pm Encode.xs t/fallback.t
105 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
106 Message-Id: <20020424130709.GA14211@tanglefoot>
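The two fallbacks can be tried as follows. This is a sketch against a current Encode install; the variable names are illustrative. FB_XMLCREF substitutes a hexadecimal character reference and FB_HTMLCREF a decimal one.

```perl
use strict;
use warnings;
use Encode ();

# U+3042 cannot be represented in ascii, so the fallback is triggered.
# Separate copies are used because encode() may modify its input
# when a CHECK value is given.
my $for_xml  = "\x{3042}";
my $for_html = "\x{3042}";

my $xml  = Encode::encode("ascii", $for_xml,  Encode::FB_XMLCREF());
my $html = Encode::encode("ascii", $for_html, Encode::FB_HTMLCREF());

print "XMLCREF:  $xml\n";     # &#x3042;
print "HTMLCREF: $html\n";    # &#12354;
```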
108 1.59 2002/04/22 23:54:22
109 ! Encode.pm Encode.xs
110 needs_lines() and perlio_ok() are added to Internal encodings such
111 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
113 1.58 2002/04/22 23:54:22
115 s/MacChineseSimp/MacChineseTrad/ # ... oops.
120 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
121 t/(CN|TW).t by Autrijus are renamed to at-(cn|tw).t
122 t/(JP|KR).t are aggregated to t/CJKT.t
123 test data are all remade via bin/ucm2text.
124 And .... They are no longer skipped for -Uuseperlio !
126 1.57 2002/04/22 20:27:30
127 ! t/JP.t t/KR.t t/perlio.t
128 unless (find PerlIO::Layer 'perlio') ... line is back again.
129 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
130 need PerlIO. Sigh....
131 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
132 ->perlio_ok now does eval{ require PerlIO::encoding } there so
133 it correctly returns 1 even when PerlIO::encoding is not yet loaded.
135 perl-current patch #16072 reflected
137 1.56 2002/04/22 09:48:07
138 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
139 New PerlIO::encoding 0.04 compliance met
141 1.55 2002/04/22 03:43:05
142 ! Encode.pm Encode.xs Unicode/Unicode.pm
143 needs_lines() defined so Encode::Encoding is no longer needed
146 1.54 2002/04/22 02:50:01
147 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
148 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
149 now perlio_ok is true by default if PerlIO::encoding->VERSION is
150 0.03 or larger. POD in Encode::Encoding revised to reflect this.
151 Encode::XS and Encode::Unicode now have perlio_ok() method.
152 ! lib/Encode/Supported.pod
153 s/UP-UX/HP-UX/ by jhi
154 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
155 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
156 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
157 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
158 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
159 ! lib/Encode/Supported.pod
160 Huge document fixes by Philip.
163 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
164 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
166 binmode() after "<:encoding" to make Win32 happy, by Mattia.
167 Mattia added to AUTHORS file
168 Message-Id: <3CC3150F.5798.22A05AE@localhost>
170 1.52 2002/04/20 23:43:47
172 TODO: is now SKIP:, as NI-XS requested. Also, more
173 elaborate failure analysis added.
175 A note on how to make sure of round-trip safety added to POD
176 section (so Autrijus is happier)
177 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
178 big5-(eten|hkscs) is round-trip safe again!
179 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
181 Typo fixes by Andreas
182 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
183 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
184 PerlIO coordination patches from NI-XS.
185 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
187 1.51 2002/04/20 09:58:23
189 Updated test suite by Autrijus so "make test" is happy again
190 Message-Id: <20020420082104.GA25037@not.autrijus.org>
192 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
194 TW/TW.pm TW/Makefile.PL
195 Updates by Autrijus. 'big5' is no longer a canonical but an
196 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
197 Message-Id: <20020419195346.GA19597@not.autrijus.org>
199 Fix by NI-XS that fallback may cause SEGV w/ Perl/Tk
200 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
202 PerlIO detection a little bit smarter; no longer uses eval qq{}
205 1.50 2002/04/19 06:13:02
206 ! Encode.pm Encode.xs Encode/encoding.h
208 New Fallback API implemented and documented. See "perldoc Encode"
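For orientation, a hedged sketch of the two ends of the fallback API as documented in current "perldoc Encode": with no CHECK argument an unmappable character becomes the substitution character, while FB_CROAK turns the same situation into an exception. Variable names are illustrative.

```perl
use strict;
use warnings;
use Encode ();

# Default behaviour (CHECK omitted): substitute and carry on.
my $default = Encode::encode("ascii", "\x{3042}");
print "default: $default\n";    # ?

# FB_CROAK: die on the first unmappable character.
my $input = "\x{3042}";
my $ok    = eval { Encode::encode("ascii", $input, Encode::FB_CROAK()); 1 };
my $error = $@;
print "croaked: $error" unless $ok;
```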
210 ! lib/Encode/JP/JIS7.pm Encode.pm
211 + lib/Encode/PerlIO.pod t/perlio.t
212 API compliance met. However, it still does not work unless perlio
213 implements line buffer. See BUGS section in perldoc Encode::PerlIO
214 As a sensible workaround, perlio_ok() added to Encode.
216 ! lib/Encode/Supported.pod
218 Message-Id: <20020418174647.J8466@alpha.hut.fi>
220 Doc fixes from Autrijus
221 Message-Id: <20020418144131.GA10987@not.autrijus.org>
223 perlqq mode documented
225 + t/jisx0201.euc t/jisx0201.ref
226 ! t/jisx0208.euc t/jisx0208.ref
227 t/JP.t tests more rigorously and with other encodings
228 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
229 in jis0208 (used to contain jisx0201 part).
230 ! Encode/Makefile_PL.e2x
231 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
232 enc2xs and encode.h rather than hard-coding them. This makes the resulting
233 module fully CPANizable.
234 ! encoding.pm t/JP.t t/KR.t
235 PerlIO detection simplified (checks %INC instead of eval{})
236 ! Encode.xs Encode/encode.h
237 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
238 - lib/Encode/Unicode.pm
239 (en|de)code_xs relocated to where it belongs. Source reindented
242 Additional (U8 *) cast added as suggested by jhi
243 Message-Id: <20020417165916.A28599@alpha.hut.fi>
245 1.42 2002/04/17
247 no-op module; Thought of adding a pod there but enc2xs has
252 correct mechanism to detect PerlIO::encoding layer installed.
254 PerlIO Layer detached.
256 1.41 2002/04/16 23:35:00
258 binmode(STDIN|STDOUT ...) done iff PerlIO is available
260 Cleaned up PerlIO skip conditions to prepare for the upcoming
261 Encode - PerlIO forking.
263 exported functions are now prototyped.
264 ! lib/Encode/CN/HZ.pm
267 fallback implemented # was /* FIXME */
268 affected programs revised to fit (only HZ was using the try-catch
269 approach which needed to be fixed for API-compliance).
271 ! Encode/KR/2022_KR.pm
273 can find =head1 NAME now, jhi
274 Message-Id: <20020416083059.V30639@alpha.hut.fi>
278 now compiles with fewer warnings with the pickiest compilers.
279 Suggested by Craig, fixed by Dan.
280 ! Encode/Makefile_PL.e2x
282 A bug that failed to find *.e2x under certain conditions fixed
284 1.40 2002/04/14 22:27:14
285 + Encode/ConfigLocal_PM.e2x
286 ! lib/Encode/Config.pm
288 "enc2xs -C" now generates/updates Encode::ConfigLocal.
289 ConfigLocal_PM.e2x is a skeleton thereof.
290 ! lib/Encode/Config.pm
292 "use Encode::CN::HZ;" was missing.
295 More rigorous tests added to test XS, especially on memory allocation.
297 ! lib/Encode/Unicode.pm
298 NI-S implemented an XS version -- merged
299 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
302 Source filter option added. With this option on, you can write
303 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
304 encodings. t/jperl.t enhanced to test this feature.
306 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
307 apply his suggestion, this degree of nitting is enough to add him
309 Message-Id: <3CB93223.291E5E2E@earthlink.net>
311 + lib/Encode/JP/JIS7.pm
312 - lib/Encode/JP/JIS.pm
313 - lib/Encode/JP/2022_JP.pm
314 - lib/Encode/JP/2022_JP1.pm
315 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
316 JIS7.pm for better maintainability and performance
318 Added caveat for non-ascii identifiers.
320 fixes by jhi, the original author of this pragmatic module.
321 Message-Id: <20020413231527.V1826@alpha.hut.fi>
323 1.34 2002/04/12 20:23:05 (Unreleased)
326 EBCDIC fixes addressed by jhi.
327 Message-Id: <20020412161844.D9383@alpha.hut.fi>
328 ! lib/Encode/Encoder.pm
329 POD fix by Miyagawa-kun
330 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
332 1.33 2002/04/10 22:28:40
334 Philip's mail address corrected.
337 ! lib/Encode/Encoder.pm
338 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
339 This adds Miyagawa-kun to AUTHORS.
340 * encoding() no longer exported by default but on demand
341 * t/Encoder.t updated to test all these
342 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
343 ! lib/Encode/Unicode.pm
344 ! lib/Encode/Supported.pm
345 Further doc fixes by Anton
347 1.32 2002/04/09 20:06:15
350 - ucm/macDevanaga.ucm Unicode Character Map
351 - ucm/macGujarati.ucm Unicode Character Map
352 - ucm/macGurmukhi.ucm Unicode Character Map
353 A utility to check integrity of .ucm files. t/bogus.ucm is a
354 ucm that is deliberately bogus. unused Indic mappings are removed
357 resolve_alias() added as suggested by jhi. Same as
358 find_encoding("alias")->name. For convenience. This one is
359 defined in Encode.pm instead of Alias.pm.
360 Message-Id: <20020409215846.H17022@alpha.hut.fi>
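The equivalence stated in this entry can be checked directly. The sketch below uses "latin1" as the alias and a made-up name for the unknown case; both calls are fully qualified so nothing needs to be imported.

```perl
use strict;
use warnings;
use Encode ();

# resolve_alias() returns the canonical name for an alias ...
my $via_helper = Encode::resolve_alias("latin1");

# ... which is the same as asking the encoding object for its name.
my $via_object = Encode::find_encoding("latin1")->name;

# An unknown name (hypothetical, chosen not to exist) yields a false value.
my $unknown = Encode::resolve_alias("no-such-encoding-demo");

print "$via_helper / $via_object\n";    # iso-8859-1 / iso-8859-1
```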
362 Memory allocation bug detected during the development of ucmlint -- fixed.
363 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
364 ! lib/Encode/Unicode.pm
365 valid_ucs2(0) is false but must be true.
366 3 patches from NI-S as follows. This also has fixed the incident
368 ! lib/Encode/Alias.pm
369 find_alias() recursion prevention
371 Checks for the patch above
372 ! t/Encode/Unicode.pm
373 An extra "F" that caused valid_ucs2() to return a bogus value fixed
374 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
375 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
376 2 Small Patches from jhi as follows:
378 Encode->encodings() lists in case-insensitive order (as it was)
380 -l option prints available encodings to STDOUT instead of STDERR
381 ! lib/Encode/Aliases.pm
382 s/defintion/definition/
383 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
385 ! lib/Encode/Supported.pod
386 ! lib/Encode/Unicode.pm
387 POD revise by Philip Newton. This adds Philip to AUTHORS list.
388 Thank you for the exact quote of Douglas Adams :)
389 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
391 1.31 2002/04/08 18:08:07
392 ! lib/Encode/Encoder.pm
394 Encode::Encoder, once just a placeholder of an idea, is now much more
395 practical. See t/Encode.t to find how practical it can be.
396 + lib/Encode/Config.pm
398 my false laziness at Encode.pm is fixed. Now %ExtModules are set
399 in Encode::Config and they are all set literally, not programmatically.
400 My false laziness was resulting in many encodings missing from
402 ! lib/Encode/Unicode.pm
404 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
405 so that it does not rely on Encode::Unicode for BOM values
406 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
408 1.30 2002/04/08 02:34:51
409 + lib/Encode/Encoder.pm
410 Object Oriented Encoder. I reckon something like this is in need.
413 ! lib/Encode/Supported.pod
414 * autoloading bug that prevented upper-case canonicals such as UTF-16
415 is fixed. Now even UTF/UCS are autoloaded!
416 * encodings() is now more intuitive.
417 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
419 * Obligatory fixes to the POD.
420 ! lib/Encode/Supported.pod
421 Patch from Anton applied.
422 Message-Id: <66641479.20020408033300@motor.ru>
424 ! lib/Encode/Unicode.pm
425 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
427 1.28 2002/04/07 18:58:42
431 Just a MANIFEST for those missing files.
433 1.26 2002/04/07 15:22:04
436 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
437 And t/Aliases.t fixed to make sure they all exist.
439 1.25 2002/04/07 15:01:25 (Unreleased)
441 ! lib/Encode/Unicode.pm
444 - lib/Encode/UTF_EBCDIC.pm
445 - lib/Encode/Internal.pm
447 Integrated into Encode.pm as closures. That way "one package, one file"
448 rule is preserved yet fewer files to require.
450 commented out binmode(STDERR ...
459 ! Encode/Makefile_PL.e2x
460 Schwern's MM-compliance patch merged
461 Message-Id: <20020406082609.GA28758@blackrider>
463 ! lib/Encode/Unicode.pm
464 + lib/Encode/UTF_EBCDIC.pm
466 - lib/Encode/10646_1.pm
467 - lib/Encode/ucs2_le.pm
468 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
469 per-module based (en|de)code, I saved a number of .pm by
470 reorganizing it as per-object base (Well, this is what Encode::XS
471 does under the hood). See Encode::Unicode for details.
472 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
473 This module is used only on EBCDIC environments.
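To make the (LE|BE) variants concrete, here is a short sketch using the encoding names as they exist in a current Encode::Unicode. The explicit-endian forms emit no byte order mark, while plain "UTF-16" prepends a big-endian BOM when encoding.

```perl
use strict;
use warnings;
use Encode ();

my $char = "\x{41}";    # LATIN CAPITAL LETTER A

my $be  = Encode::encode("UTF-16BE", $char);
my $le  = Encode::encode("UTF-16LE", $char);
my $bom = Encode::encode("UTF-16",   $char);

# Show each result as hex octets.
print join(" ", map { unpack("H*", $_) } $be, $le, $bom), "\n";
# prints: 0041 4100 feff0041
```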
475 1.21 2002/04/05 14:46:34 (Not Released)
481 Are back to make Perl/Tk happy Smile, NI-S.
483 ! lib/Encode/Alias.pm
484 ! lib/Encode/Supported.pm
485 ! lib/Encode/10646_1.pm
486 ! lib/Encode/ucs2_le.pm
487 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
488 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
489 updated to reflect changes.
490 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
491 ! lib/Encode/Alias.pm
492 ! lib/Encode/Supported.pm
493 Anton's revision committed. Added Dan's own fixes as well.
494 Message-Id: <159103166906.20020405161134@motor.ru>
495 ! lib/Encode/Alias.pm
497 < qr/^UCS2-le$/i => '"UCS-2"', );
499 > qr/^UCS2-LE$/i => '"UTF-16LE"');
500 Sigh. Thank you, Anton.
501 Message-Id: <14567692196.20020405062020@motor.ru>
502 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
504 1.20 2002/04/04 19:50:52
506 the last minute addition. Just give it a try. Docs remain to be done.
507 Not installed by default.
508 ! lib/Encode/Supported.pod
511 ! lib/Encode/Alias.pm
513 ! lib/Encode/10646_1.pm
514 ! lib/Encode/ucs2_le.pm
515 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
516 unchanged but UTF-16BE is added as an alias. Implicit aliases
517 move to Encode::Alias so init_alias() works more as expected.
518 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
519 Though pedantically wrong, this should make perl mongers happier.
520 t/Alias.t is enhanced to test all these.
521 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
523 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
524 of the codepoints are common. CJKT left as is because the saving is
530 ! Encode/Makefile_PL.e2x
538 All occurrences of _def.h replaced with .exh so djgpp works happily
539 ever after! To credit this amazing discovery, Laszlo is now in
541 Message-Id: <20020403181424.GA8778@freemail.hu>
542 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
545 ! Encode/Makefile_PL.skel
547 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
548 Message-Id: <20020401222744.GX2000@blackrider>, et al.
550 Two more tests added by jhi
551 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
554 The showstopper fixed -- Memory reallocation bug was causing
555 Encode::XS to fall into an infinite loop under certain conditions.
557 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
562 ! lib/Encode/Supported.pod
563 Vendor encodings rebuilt out of original map files at unicode.org.
564 Indic languages such as MacDevanagari remain unsupported due to the
565 shortcomings of encengine capabilities (they need algorithmic
566 conversion and I have no knowledge of that!). Pods fixed for added
568 Oh, macJapan.ucm renamed to macJapanese.ucm.
569 macROMnn is macRomanian and macRUMnn is macRumanian.
570 txt2ucm is a crude script that is used to convert them.
572 Unicode Compound Characters (used extensively on Mac) supported
574 Typo fixes and improvements by jhi
575 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
577 1.11 $Date: 2002/05/02 07:33:09 $
581 Missing files from the MANIFEST fixed.
582 Message-Id: <20020401010156.H10509@alpha.hut.fi>
583 Version incremented just to make CPAN happy.
585 1.10 2002/03/31 21:32:42
588 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
589 if you want. This should make Autrijus happy. Also, piconv
590 is added to default install.
593 Here-documented files that enc2xs generates are now exported
594 to *.e2x. Much cleaner and easier to debug.
596 encoding enhanced so you can make it act more like such
597 (now prehistoric) "localized" variations of perl as Jperl.
599 Further test for encoding.pm. Written in euc-jp
603 Message-Id: <20020330174618.B10154@alpha.hut.fi>
608 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
615 ! Encode/macIceland.ucm
616 ! lib/Encode/Alias.pm
617 ! lib/Encode/Supported.pod
618 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
619 Message-Id: <m3lmcavhjt.fsf@anima.de>
621 1.01 2002/03/29 20:59:39
624 s/USE_SCRIPTS/MORE_SCRIPTS/
626 installs enc2xs by default for external Encode:: modules in CPAN,
627 such as Encode::HanExtra
629 More sensible perl core detection via $ENV{PERL_CORE}
631 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
633 Perl core detection via $^X =~ m/\bminiperl$/o
634 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
638 The version of all files is updated to 1.00 via "ci -f -l1.00",
639 commemorating version 1.00. All files, including *.ucm are now
640 under version control.
643 encode.h moved to Encode/ so it will be installed for the later
646 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
651 compile renamed to enc2xs.
652 Affected Makefile.PL updated
654 "Punt it. HanExtra can take care of that later." -- Autrijus
655 Message-Id: <20020328154338.GA7351@not.autrijus.org>
659 ! lib/Encode/CJKConstants.pm
660 ! lib/Encode/KR/2022_KR.pm
661 Table patches for Euro Signs, 2022-KR fixups by Jungshik
662 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
666 bin/ added for example scripts. They are not installed by default.
667 To install them, run "perl Makefile.PL USE_SCRIPTS".
668 piconv is iconv reinvented in perl. In addition to all features
669 of iconv, it also adds perlish features. See L<piconv/1> for more
671 ! lib/Encode/Alias.pm
672 qr/^ replaced with qr/\b so it directly matches locale names
673 such as en_US.US-ASCII
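Why the anchor change matters can be seen with plain regular expressions. The two patterns below are illustrative stand-ins, not the actual entries from Alias.pm: a pattern anchored with ^ rejects a locale name, while one that starts at a word boundary accepts the charset part after the dot.

```perl
use strict;
use warnings;

my $locale = "en_US.US-ASCII";

# Hypothetical alias patterns, for illustration only.
my $anchored = qr/^US-?ascii$/i;     # must match the whole string
my $boundary = qr/\bUS-?ascii$/i;    # may start after a non-word char such as "."

my $anchored_hit = $locale =~ $anchored ? 1 : 0;
my $boundary_hit = $locale =~ $boundary ? 1 : 0;

print "anchored: $anchored_hit, boundary: $boundary_hit\n";
# prints: anchored: 0, boundary: 1
```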
676 Patch by MJD to fix the following problem applied.
677 Subject: [PATCH 5.7.3 Encode]
678 Aliases.t not properly skipped when Encode extension not built
679 Message-Id: <20020328091850.18677.qmail@plover.com>
680 ! lib/Encode/KR/2022_KR.pm
681 ! lib/Encode/CJKConstants.pm
682 Another patch from Jungshik to make iso-2022-kr actually work
683 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
684 ! Encode/Encode/euc-kr.ucm
685 + Encode/Encode/johab.ucm
686 ! Encode/Encode/ksc5601.ucm
688 ! Encode/KR/Makefile.PL
689 ! Encode/lib/Encode/Alias.pm
691 Johab support and complete revision of Korean Encoding by Jungshik
692 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
694 Revised to make up for the now-dropped Encode::Details.
695 - lib/Encode/Details.pod
696 Dropped. Besides being obsolete, the topics are now covered in
702 Korean aliases fixed thanks to Jungshik Shin
703 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
704 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
706 =head1 NAME added to all modules to make buildtoc happy
707 Message-Id: <20020327041151.A10618@alpha.hut.fi>
708 - lib/Encode/CJKguide.pod
709 Too controversial and dropped from the dist. Will be available
710 separately on the web.
712 RCS tags added so table debugging gets easier (should that be
713 needed! I hope they all stay 1.00!)
714 + lib/Encode/CJKguide.pod
715 A detailed guide to mainly, but not limited to, CJK multibyte
718 + Encode/hp-roman8.ucm
720 ! Encode/Supported.pod
721 All occurrences of "roman8" replaced with "hp-roman8" to avoid
723 ! Encode/Supported.pod
726 Mac Encodings now comply with Inside Macintosh
728 Test for '-raw' conventions added.
730 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
734 "-raw" appended to canonical names.
735 File names stay unchanged thanks to UCM format.
736 ! lib/Encode/CN/HZ.pm
737 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
738 Message-Id: <20020326035210.GA2091@not.autrijus.org>
741 - lib/Encode/JP/Const.pm
742 + lib/Encode/CJKConstants.pm
743 + lib/Encode/CN/2022_CN.pm
744 + lib/Encode/KR/2022_KR.pm
753 * Support for ISO-2022-KR and ISO-2022-CN added.
755 * more t/*.{euc,ref} added, which were autogenerated by ucm2table
756 * ucm2table autogenerates character table out of UCM files.
759 - lib/Encode/Supports.pod
760 + lib/Encode/Supported.pod
761 Names reverted due to popular demand.
762 8.3 rule applies only when there is a conflict.
763 Message-Id: <20020325095924.GD44120@not.autrijus.org>
768 - lib/Encode/Format/Enc.pod
770 * Character tables are now 100% ucm.
771 * All files under Encode/ are now 8.3-compliant
772 * some of missing encodings added (i.e. gsm0338 and nextstep)
773 * Vendor mappings aggregated with appropriate national std in
774 Makefile.PL, resulting in smaller *.so especially for CJK.
775 Following is result on Dan's FreeBSD box.
777 ---------------------------------------------------------------
778 blib/arch/auto/Encode/Byte/Byte.so          157,279     171,042
779 blib/arch/auto/Encode/CN/CN.so            1,634,476   1,626,685
780 blib/arch/auto/Encode/EBCDIC/EBCDIC.so       18,476      18,476
781 blib/arch/auto/Encode/Encode.so              27,791      27,791
782 blib/arch/auto/Encode/JP/JP.so            1,408,056   1,832,811
783 blib/arch/auto/Encode/KR/KR.so            1,156,518   1,329,587
784 blib/arch/auto/Encode/Symbol/Symbol.so       23,940      20,990
785 blib/arch/auto/Encode/TW/TW.so*             948,761   1,316,437
786 ---------------------------------------------------------------
787 Total                                     5,375,297   6,343,819
789 * As a result of ucm-transition, Encode::Tcl dropped because
790 Encode::Tcl demands *.enc.
791 Encode::Tcl will be supplied in a separate tarball with *.enc.
792 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
797 -lib/Encode/Supported.pod
798 +lib/Encode/Supports.pod
799 -lib/Encode/iso10646_1.pm
800 +lib/Encode/10646_1.pm
801 -lib/Encode/EncFormat.pod
802 +lib/Encode/Format/Enc.pod
803 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
804 - lib/Encode/JP/Constants.pm
805 + lib/Encode/JP/Consts.pm
806 ! lib/Encode/JP/JIS.pm
807 ! lib/Encode/JP/H2Z.pm
808 Version nit problem and 8.3 rule fix.
809 > Package namespace installed latest in CPAN file
810 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
811 was noted by jhi; then Dan discovered "Constants.pm" does not comply with the 8.3
812 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
813 accordingly. In addition, legacy "use vars qw()..." are replaced with
815 Message-Id: <20020325011248.D1561@alpha.hut.fi>
816 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
818 - lib/Encode/JP/ISO_2022_JP.pm
819 - lib/Encode/JP/ISO_2022_JP_1.pm
820 + lib/Encode/JP/2022_JP.pm
821 + lib/Encode/JP/2022_JP1.pm
823 8.3 naming conflict for vanilla fat addressed by jhi
824 Message-Id: <20020324201931.V22596@alpha.hut.fi>
827 Typecast fix addressed by jhi
828 Message-Id: <20020324185540.T22596@alpha.hut.fi>
831 ! lib/Encode/Supported.pod
833 + lib/Encode/JP/ISO_2022_JP_1.pm
834 ! lib/Encode/JP/ISO_2022_JP.pm
835 ! lib/Encode/JP/JIS.pm
837 Now Encode::JP is more strict on the difference between ISO-2022-JP
838 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
839 makes Anton happier :) FYI the previous version implements
840 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
841 ! lib/Encode/Supported.pod
844 Avoid core-dump in Encode with PERLIO=mmap by NI-S
845 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
850 ! lib/Encode/Supported.pod
851 pod fixes to replace F<http://...> with L<http://...>,
852 as suggested by Autrijus in:
853 Message-Id: <20020324083943.GA14901@not.autrijus.org>
854 ! lib/Encode/Supported.pod
855 fixes and enhancements by Anton
856 Message-Id: <10632060120.20020324103753@motor.ru>
857 ! lib/Encode/Alias.pm
858 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
859 added. Suggested by Anton then deobfuscated by Autrijus
860 Message-Id: <20020324064455.GA3667@not.autrijus.org>
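The regex form of define_alias() shown in this entry evaluates its second argument as Perl code after a successful match, which is why the replacement is a quoted string inside a string. A minimal sketch with a made-up alias family ("x-demo-latinN" is hypothetical and not shipped with Encode):

```perl
use strict;
use warnings;
use Encode ();
use Encode::Alias;    # exports define_alias

# Hypothetical alias family: "x-demo-latin<N>" resolves to "iso-8859-<N>".
# The right-hand side is eval'ed after the match, so $1 is available.
define_alias( qr/^x-demo-latin(\d+)$/i => '"iso-8859-$1"' );

my $enc  = Encode::find_encoding("x-demo-latin2");
my $name = $enc ? $enc->name : undef;

print "resolved to: $name\n";    # iso-8859-2
```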
862 Further fix by Nicholas Clark
863 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
864 - lib/EncodeFormat.pod
865 + lib/Encode/EncFormat.pod
867 File renamed as suggested by Autrijus
869 ! lib/Encode/Details.pod
870 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
871 ! Encode.pm Sun Mar 24 13:43:47 2002
872 pod fixes by Autrijus.
873 Message-Id: <20020324062804.GA3595@not.autrijus.org>
874 Message-Id: <20020324075627.GB11986@not.autrijus.org>
876 ! lib/Encode/Alias.pm
878 now more EBCDIC conscious;
879 %ExtModules on EBCDIC systems excludes CJK so that you don't
880 have to worry about a matched alias resulting in a croak.
881 t/Alias.t also revised to reflect changes. Verified by jhi
882 Message-Id: <20020324022929.D22596@alpha.hut.fi>
888 EBCDIC detection mechanism installed as in JP/JP.pm
889 Message-Id: <20020323211847.G19148@alpha.hut.fi>
897 Now all table files used by compile are postfixed '_t' to avoid
898 namespace collisions in case insensitive file systems once and for all!
900 Message-ID: <58290227735.20020323195659@familiehaase.de>
902 Since the Encode::JP is unsupported under EBCDIC we
903 cannot run this test (aliases as such should work fine) -- jhi
904 Message-Id: <20020323202119.D19148@alpha.hut.fi>
906 duplicate occurrences of ascii.ucm and 8859-1.ucm
907 cause Mac OS X dyld to croak
913 < chdir 't' if -d 't';
915 > if (! -d 'blib' and -d 't'){ chdir 't' };
916 When you are "make test"-ing in the Encode/ directory, you must not
917 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
918 remained unchanged. Also the situation detection was made simpler
919 in t/JP.t, which was originally:
920 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
922 "Use of uninitialized value in string eq at Encode.pm line 96."
926 -- Problem on case insensitive file systems
927 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
928 Message-ID: <88254111953.20020323095503@familiehaase.de>
931 "So I think it's a bug in gcc, not perl. But it still needs to be
933 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
934 Message-Id: <20020323170509.C96475@plum.flirble.org>
938 ! lib/Encode/Encoding.pm
939 ! lib/Encode/Alias.pm
940 ! lib/Encode/Supported.pod
942 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
943 Message-ID: <20020322073908.GB10539@blackrider>
946 "...I think we should include ISO 8859-1 as well." -- NI-S
947 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
952 ! lib/Encode/Alias.pm
953 alias definitions relocated to Encode::Alias so module autoloading
954 works for aliases also.
956 encodings() now accepts args to check ExtModules.
965 Latin and single byte encodings are reorganized so they are
966 demand-loaded like Encode::XX. Now only ascii is compiled into
968 ! lib/Encode/Alias.pm
969 for my $k (keys %hash){ delete $hash{$k}; }
970 is deprecated; fixed.
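The entry does not say what replaced the loop. As an assumption only, the usual Perl idioms for emptying a hash without iterating over its keys look like this:

```perl
use strict;
use warnings;

my %first  = ( a => 1, b => 2, c => 3 );
my %second = ( a => 1, b => 2, c => 3 );

# Assign the empty list: clears every key in one step.
%first = ();

# Hash-slice delete: removes all current keys at once.
delete @second{ keys %second };

printf "first: %d keys, second: %d keys\n",
    scalar( keys %first ), scalar( keys %second );
# prints: first: 0 keys, second: 0 keys
```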
973 In this update, pod rewrites and alias fixes are the main issues
974 + lib/Encode/Supported.pod
975 Describes supported encodings
977 streamlined compiled-in encodings.
978 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
980 + Encode/ibm-125?.ucm
981 Added from the ICU distribution, with every occurrence of
982 "IBM-125?" changed to "cp125?". Filenames remain unchanged to pay
983 some respect to icu staff, however.
984 + lib/Encode/Alias.pm
986 Alias definitions in Encode.pm relocated.
989 packWARN patch from Paul Marquess via jhi
990 Message-Id: <20020321010101.O28978@alpha.hut.fi>
991 Paul added to AUTHORS as a result.
992 ! t/CJKalias.t -> t/Aliases.t
993 Renamed. Checks even more aliases and alias overloading
996 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
997 gbk => cp936 relocated to CN.pm
999 Test::More with plans (by jhi)
1001 0.94 Thu Mar 21 2002
1002 + lib/Encode/Description.pod
1003 ! lib/Encode/Encoding.pm
1004 Now the pod in Encode.pm is abridged as programming references.
1005 lib/Encode/Description.pod contains the original, detailed description
1006 and Encode::Encoding explains how to write your own module to
1007 add new encodings. So far, lib/Encode/Description.pod contains
1008 the whole pod once in Encode.pm. This is intentional.
1010 Pod revisions by Anton Tagunov
1011 Message-Id: <517178431.20020320174824@motor.ru>
1013 all occurrences of Encode::Tcl::Extended removed including pod
1015 test now checks $encoding->name only; $encoding->{name} is
1016 no longer checked to find the canonical name.
1017 ! lib/Encode/JP/JIS.pm
1018 ! lib/Encode/JP/ISO_2022_JP.pm
1019 ->name() added to be more compliant with API
1025 Patch by Autrijus to add aliases to TW and fixes to POD
1026 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1028 SADAHIRO Tomoyuki added, as he should have been. My apologies.
1030 0.93 Wed Mar 20 2002
1031 * First release to be uploaded to CPAN. For prehistoric changes,
1032 please see Changes file of perl distribution as well as
1033 perl-unicode@perl.org archive, available at:
1034 http://archive.develooper.com/perl-unicode@perl.org/
1036 Changes since 0.92 include:
1041 + Mention of perl-unicode@perl.org added
1043 + Encoding aliases added so you can feed locale names
1044 and MIME Charset="" directly.
1045 - Mention of JISX0212 removed because it's fixed
1048 + Encoding aliases added. Note TW is left untouched because
1049 euc-tw is not implemented in TW but in Encode::HanExtra.
1050 Autrijus, you may fix Encode::HanExtra.
1052 + to test encode aliases added