1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.68 2002/05/03 12:20:13 dankogai Exp $
6 $Revision: 1.68 $ $Date: 2002/05/03 12:20:13 $
7 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
8 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
9 and Supported.pod modified to reflect the change. Elizabeth added
10 to Authors. And H.M. is also added for forwarding her patch among
11 other contributions (I was rather surprised to find his name was not
13 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
15 1.67 2002/05/02 07:33:09
17 Error message now consistent w/ perlqq (\N{U+} -> \x{})
18 done in perl@16308 but Philip linted me further. Now the error
19 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
21 Sanity check for happier -w by Autrijus
23 1.66 2002/05/01 05:41:06
24 ! Encode.xs t/fallback.t
25 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
26 while fallback is in effect. This even came with a welcome side-effect
27 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
28 also modified to test this.
29 And of course, the corresponding variables to UV[Xx]f are appropriately
30 cast. This should've concluded NI-XS homework.
32 encode(undef) does warn again! Repented upon suggestion by NI-XS.
33 Document for unless vs. '' added
34 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
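To illustrate the 1.66 change above, here is a minimal sketch that combines WARN_ON_ERR with the PERLQQ fallback, so that encode() both warns and substitutes the unmappable character. The sample string and the warning handler are assumptions made for illustration; the behaviour is that of current Encode releases and may differ in detail from 1.66.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Collect warnings instead of printing them (illustrative only).
my @warnings;
local $SIG{__WARN__} = sub { push @warnings, @_ };

# U+263A has no mapping in ISO-8859-1.
my $text = "smile \x{263a}";

# WARN_ON_ERR without RETURN_ON_ERR: warn, then carry on with the fallback.
my $check = Encode::PERLQQ() | Encode::WARN_ON_ERR() | Encode::LEAVE_SRC();
my $bytes = encode("iso-8859-1", $text, $check);

print "encoded:  $bytes\n";
print "warnings: ", scalar(@warnings), "\n";
```

The point of the sketch is that the return value is still the complete encoded string, with the fallback text in place of the unmappable character, even though a warning was raised.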
36 1.65 2002/04/30 16:13:37
38 encode(undef) no longer warns for C<Use of uninitialized value in
39 subroutine entry>. Suggested by Paul.
40 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
41 ! lib/Encode/Supported.pod
42 Encode::MIME::Header and Encode::Guess mentioned
43 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
45 POD fix by Miyagawa-kun
46 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
48 1.64 2002/04/29 06:54:06
50 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
51 encode("euc-jisx0213") are still DIFFERENT.
52 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
54 A few white spaces corrected by NI-XS via PerlIO integration to
56 Subject: Change 16247: Integrate perlio;
58 Document fixes by Andreas
59 Message-Id: <m3k7qsf1we.fsf@anima.de>
61 1.63 2002/04/27 18:59:50
62 ! lib/Encode/Encoding.pm
63 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
64 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
65 Make use of the Encode::Encoding base class!
66 And other cleanups in Encode.xs upon NI-XS suggestions
67 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
69 1.62 2002/04/27 11:17:39
71 encodings() now just checks %ExtModule instead of eval{require}
72 all of them for ":all" to conserve more memory.
74 more "%x" -> "%" UVxf stuff.
76 s/=over2/=over 2/g # oops.
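As a rough sketch of the encodings() behaviour described in 1.62, the snippet below compares the list of already-loaded encodings with the ":all" list. It assumes a current Encode installation; the counts depend on the build, so none are claimed here.

```perl
use strict;
use warnings;
use Encode;

# Encodings that are already loaded into this process.
my @loaded = Encode->encodings();

# Everything Encode knows about, including encodings whose modules
# are registered but have not been require'd yet.
my @all = Encode->encodings(":all");

printf "%d loaded, %d available\n", scalar @loaded, scalar @all;
```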
78 1.61 2002/04/26 03:02:04
80 Now does decent tests besides use_ok()
81 ! lib/Encode/Guess.pm t/guess.t
82 UI streamlined, document added
84 various signed/unsigned mismatch nits (#16173)
85 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
87 POD: utf8-flag-related caveats added. A few sections completely
91 Thou shalt not assume %d works, either!
92 Robin Baker added to AUTHORS for this
93 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
95 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
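The streamlined Encode::Guess interface mentioned in 1.61 can be sketched as follows. The octet string is sample data chosen for illustration (three kanji in EUC-JP), and the suspect list is an assumption; guess_encoding() returns an encoding object on success and an error string otherwise.

```perl
use strict;
use warnings;
use Encode::Guess;

# Three kanji encoded as EUC-JP octets (illustrative sample data).
my $octets = "\xC6\xFC\xCB\xDC\xB8\xEC";

# Ask which of the suspect encodings can decode the data.
my $enc = guess_encoding($octets, qw(euc-jp shiftjis 7bit-jis));
ref $enc or die "could not guess: $enc";

# The returned object is an ordinary encoding object.
my $string = $enc->decode($octets);
printf "guessed %s, %d characters\n", $enc->name, length $string;
```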
97 1.60 2002/04/24 20:06:52
99 "Thou shalt not assume %x works." -- jhi
100 Message-Id: <20020424210618.E24347@alpha.hut.fi>
101 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
102 low-memory build machines happy, now *.c is created for each *.ucm
103 (no table aggregation). You can still override this by setting
104 $ENV{AGGREGATE_TABLES}.
105 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
106 + lib/Encode/Guess.pm
107 + lib/Encode/JP/JIS7.pm
108 Encoding-autodetect (mainly for Japanese encoding) added. In a
109 course of development, JIS7.pm was improved.
110 + lib/Encode/HTML/Header.pm
111 + lib/Encode/Config.pm
112 MIME B/Q Header Encoding Added!
113 ! Encode.pm Encode.xs t/fallback.t
114 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
115 Message-Id: <20020424130709.GA14211@tanglefoot>
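A minimal sketch of the XMLCREF and HTMLCREF fallbacks added in 1.60, assuming a current Encode release. The sample string is an assumption; the copies are defensive, since encode() may consume its source when a CHECK value is given without LEAVE_SRC.

```perl
use strict;
use warnings;
use Encode qw(encode);

# U+263A cannot be represented in ISO-8859-1, so each fallback kicks in.
my $text = "smile \x{263a}";

# Work on copies in case the source is consumed by encode().
my ($for_xml, $for_html) = ($text, $text);

my $xml  = encode("iso-8859-1", $for_xml,  Encode::FB_XMLCREF());
my $html = encode("iso-8859-1", $for_html, Encode::FB_HTMLCREF());

print "XMLCREF:  $xml\n";   # hexadecimal character reference
print "HTMLCREF: $html\n";  # decimal character reference
```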
117 1.59 2002/04/22 23:54:22
118 ! Encode.pm Encode.xs
119 needs_lines() and perlio_ok() are added to Internal encodings such
120 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
122 1.58 2002/04/22 23:54:22
124 s/MacChineseSimp/MacChineseTrad/ # ... oops.
129 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
130 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
131 t/(JP|KR).t are aggregated to t/CJKT.t
132 test data are all remade via bin/ucm2text.
133 And .... They are no longer skipped for -Uuseperlio !
135 1.57 2002/04/22 20:27:30
136 ! t/JP.t t/KR.t t/perlio.t
137 unless (find PerlIO::Layer 'perlio') ... line is back again.
138 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
139 need PerlIO. Sigh....
140 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
141 ->perlio_ok now does eval{ require PerlIO::encoding } there so
142 it correctly returns 1 when PerlIO::encoding is yet loaded.
144 perl-current patch #16072 reflected
146 1.56 2002/04/22 09:48:07
147 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
148 New PerlIO::encoding 0.04 compliance met
150 1.55 2002/04/22 03:43:05
151 ! Encode.pm Encode.xs Unicode/Unicode.pm
152 needs_lines() defined so Encode::Encoding is no longer needed
155 1.54 2002/04/22 02:50:01
156 ! Encode.pm Encode.xs Unicode/Unicode.pm t/perlio.t
157 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
158 now perlio_ok is true by default if PerlIO::encoding->VERSION is
159 0.03 or larger. POD in Encode::Encoding revised to reflect this.
160 Encode::XS and Encode::Unicode now have perlio_ok() method.
161 ! lib/Encode/Supported.pod
162 s/UP-UX/HP-UX/ by jhi
163 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
164 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
165 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
166 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
167 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
168 ! lib/Encode/Supported.pod
169 Huge document fixes by Philip.
172 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
173 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
175 binmode() after "<:encoding" to make Win32 happy, by Mattia.
176 Mattia added to AUTHORS file
177 Message-Id: <3CC3150F.5798.22A05AE@localhost>
179 1.52 2002/04/20 23:43:47
181 TODO: is now SKIP:, as NI-XS requested. Also adds more
182 elaborate failure analysis.
184 A note on how to make sure of round-trip safety added to POD
185 section (so Autrijus is happier)
186 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
187 big5-(eten|hkscs) is round-trip safe again!
188 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
190 Typo fixes by Andreas
191 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
192 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
193 PerlIO coordination patches from NI-XS.
194 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
196 1.51 2002/04/20 09:58:23
198 Updated test suite by Autrijus so "make test" is happy again
199 Message-Id: <20020420082104.GA25037@not.autrijus.org>
201 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
203 TW/TW.pm TW/Makefile.PL
204 Updates by Autrijus. 'big5' is no longer a canonical but an
205 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
206 Message-Id: <20020419195346.GA19597@not.autrijus.org>
208 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
209 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
211 PerlIO detection a little bit smarter; no longer uses eval qq{}
214 1.50 2002/04/19 06:13:02
215 ! Encode.pm Encode.xs Encode/encoding.h
217 New Fallback API implemented and documented. See "perldoc Encode"
219 ! lib/Encode/JP/JIS7.pm Encode.pm
220 + lib/Encode/PerlIO.pod t/perlio.t
221 API compliance met. However, it still does not work unless perlio
222 implements line buffer. See BUGS section in perldoc Encode::PerlIO
223 As a sensible workaround, perlio_ok() added to Encode.
225 ! lib/Encode/Supported.pod
227 Message-Id: <20020418174647.J8466@alpha.hut.fi>
229 Doc fixes from Autrijus
230 Message-Id: <20020418144131.GA10987@not.autrijus.org>
232 perlqq mode documented
234 + t/jisx0201.euc t/jisx0201.ref
235 ! t/jisx0208.euc t/jisx0208.ref
236 t/JP.t tests more rigorously and with other encodings
237 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
238 in jis0208 (used to contain jisx0201 part).
239 ! Encode/Makefile_PL.e2x
240 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
241 enc2xs and encode.h rather than hard-coding them. This makes the resulting
242 module fully CPANizable.
243 ! encoding.pm t/JP.t t/KR.t
244 PerlIO detection simplified (checks %INC instead of eval{})
245 ! Encode.xs Encode/encode.h
246 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
247 - lib/Encode/Unicode.pm
248 (en|de)code_xs relocated to where it belongs. Source reindented
251 Additional (U8 *) cast added as suggested by jhi
252 Message-Id: <20020417165916.A28599@alpha.hut.fi>
254 1.42 Date: 2002/04/17
256 no-op module; Thought of adding a pod there but enc2xs has
261 correct mechanism to detect PerlIO::encoding layer installed.
263 PerlIO Layer detached.
265 1.41 2002/04/16 23:35:00
267 binmode(STDIN|STDOUT ...) done iff PerlIO is available
269 Cleaned up PerlIO skip conditions to prepare for the upcoming
270 Encode - PerlIO forking.
272 exported functions are now prototyped.
273 ! lib/Encode/CN/HZ.pm
276 fallback implemented # was /* FIXME */
277 affected programs revised to fit (only HZ was using the try-catch
278 approach which needed to be fixed for API-compliance).
280 ! Encode/KR/2022_KR.pm
282 can find =head1 NAME now, jhi
283 Message-Id: <20020416083059.V30639@alpha.hut.fi>
287 now compiles with fewer warnings with the pickiest compilers.
288 Suggested by Craig, fixed by Dan.
289 ! Encode/Makefile_PL.e2x
291 A bug that fails to find *.e2x in certain conditions fixed
293 1.40 2002/04/14 22:27:14
294 + Encode/ConfigLocal_PM.e2x
295 ! lib/Encode/Config.pm
297 "enc2xs -C" now generates/updates Encode::ConfigLocal.
298 ConfigLocal_PM.e2x is a skeleton thereof.
299 ! lib/Encode/Config.pm
301 "use Encode::CN::HZ;" was missing.
304 More rigorous tests added to test XS, especially on memory allocation.
306 ! lib/Encode/Unicode.pm
307 NI-S implemented an XS version -- merged
308 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
311 Source filter option added. With this option on, you can write
312 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
313 encodings. t/jperl.t enhanced to test this feature.
315 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
316 apply his suggestion, this degree of nitting is enough to add him
318 Message-Id: <3CB93223.291E5E2E@earthlink.net>
320 + lib/Encode/JP/JIS7.pm
321 - lib/Encode/JP/JIS.pm
322 - lib/Encode/JP/2022_JP.pm
323 - lib/Encode/JP/2022_JP1.pm
324 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
325 JIS7.pm for better maintainability and performance
327 Added caveat for non-ascii identifiers.
329 fixes by jhi, the original author of this pragmatic module.
330 Message-Id: <20020413231527.V1826@alpha.hut.fi>
332 1.34 2002/04/12 20:23:05 (Unreleased)
335 EBCDIC fixes addressed by jhi.
336 Message-Id: <20020412161844.D9383@alpha.hut.fi>
337 ! lib/Encode/Encoder.pm
338 POD fix by Miyagawa-kun
339 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
341 1.33 2002/04/10 22:28:40
343 Philip's mail address corrected.
346 ! lib/Encode/Encoder.pm
347 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
348 This adds Miyagawa-kun to AUTHORS.
349 * encoding() no longer exported by default but on demand
350 * t/Encoder.t updated to test all these
351 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
352 ! lib/Encode/Unicode.pm
353 ! lib/Encode/Supported.pm
354 Further doc fixes by Anton
356 1.32 2002/04/09 20:06:15
359 - ucm/macDevanaga.ucm Unicode Character Map
360 - ucm/macGujarati.ucm Unicode Character Map
361 - ucm/macGurmukhi.ucm Unicode Character Map
362 A utility to check integrity of .ucm files. t/bogus.ucm is a
363 ucm that is deliberately bogus. unused Indic mappings are removed
366 resolve_alias() added as suggested by jhi. Same as
367 find_encoding("alias")->name. For convenience. This one is
368 defined in Encode.pm instead of Alias.pm.
369 Message-Id: <20020409215846.H17022@alpha.hut.fi>
371 Memory allocation bug detected during the devel of ucmlint -- fixed.
372 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
373 ! lib/Encode/Unicode.pm
374 valid_ucs2(0) is false but must be true.
375 3 patches from NI-S as follows. This also has fixed the incident
377 ! lib/Encode/Alias.pm
378 find_alias() recursion prevention
380 Checks for the patch above
381 ! t/Encode/Unicode.pm
382 An extra "F" that causes valid_ucs2() to return a bogus value fixed
383 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
384 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
385 2 Small Patches from jhi as follows:
387 Encode->encodings() lists in case-insensitive order (as it was)
389 -l option prints available encodings to STDOUT instead of STDERR
390 ! lib/Encode/Aliases.pm
391 s/defintion/definition/
392 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
394 ! lib/Encode/Supported.pod
395 ! lib/Encode/Unicode.pm
396 POD revise by Philip Newton. This adds Philip to AUTHORS list.
397 Thank you for the exact quote of Douglas Adams :)
398 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
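The resolve_alias() entry in 1.32 can be illustrated with a short sketch. It assumes a current Encode release; "no-such-encoding" is a hypothetical name used only to show the failure case.

```perl
use strict;
use warnings;
use Encode qw(find_encoding resolve_alias);

# resolve_alias() is shorthand for find_encoding($alias)->name.
my $canonical = resolve_alias("latin1");
my $by_object = find_encoding("latin1")->name;

# An unknown alias yields a false value instead of dying.
my $unknown = resolve_alias("no-such-encoding");   # hypothetical name

print "latin1 resolves to $canonical\n";
print "unknown alias resolves to a false value\n" unless $unknown;
```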
400 1.31 2002/04/08 18:08:07
401 ! lib/Encode/Encoder.pm
403 Encode::Encoder, once just a placeholder of an idea, is now much more
404 practical. See t/Encode.t to find how practical it can be.
405 + lib/Encode/Config.pm
407 my false laziness at Encode.pm is fixed. Now %ExtModules are set
408 in Encode::Config and they are all literally, not programmatically
409 set. My false laziness was resulting in many encodings missing from
411 ! lib/Encode/Unicode.pm
413 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
414 so that it does not rely on Encode::Unicode for BOM values
415 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
417 1.30 2002/04/08 02:34:51
418 + lib/Encode/Encoder.pm
419 Object Oriented Encoder. I reckon something like this is in need.
422 ! lib/Encode/Supported.pod
423 * autoloading bug that prevented upper-case canonicals such as UTF-16
424 is fixed. Now even UTF/UCS are autoloaded!
425 * encodings() is now more intuitive.
426 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
428 * Obligatory fixes to the POD.
429 ! lib/Encode/Supported.pod
430 Patch from Anton applied.
431 Message-Id: <66641479.20020408033300@motor.ru>
433 ! lib/Encode/Unicode.pm
434 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
436 1.28 2002/04/07 18:58:42
440 Just a MANIFEST for those missing files.
442 1.26 Date: 2002/04/07 15:22:04
445 Schwern's patches against Makefile.PL have zapped jis*.ucm. Restored.
446 And t/Aliases.t fixed to make sure they all exist.
448 1.25 2002/04/07 15:01:25 (Unreleased)
450 ! lib/Encode/Unicode.pm
453 - lib/Encode/UTF_EBCDIC.pm
454 - lib/Encode/Internal.pm
456 Integrated into Encode.pm as closures. That way "one package, one file"
457 rule is preserved yet less files to require.
459 commented out binmode(STDERR ...
468 ! Encode/Makefile_PL.e2x
469 Schwern's MM-compliance patch merged
470 Message-Id: <20020406082609.GA28758@blackrider>
472 ! lib/Encode/Unicode.pm
473 + lib/Encode/UTF_EBCDIC.pm
475 - lib/Encode/10646_1.pm
476 - lib/Encode/ucs2_le.pm
477 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
478 per-module based (en|de)code, I saved a number of .pm by
479 reorganizing it as per-object base (Well, this is what Encode::XS
480 does under the hood). See Encode::Unicode for details.
481 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
482 This module is used only on EBCDIC environments.
484 1.21 2002/04/05 14:46:34 (Not Released)
490 Are back to make Perl/Tk happy. Smile, NI-S.
492 ! lib/Encode/Alias.pm
493 ! lib/Encode/Supported.pm
494 ! lib/Encode/10646_1.pm
495 ! lib/Encode/ucs2_le.pm
496 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
497 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
498 updated to reflect changes.
499 Message-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
500 ! lib/Encode/Alias.pm
501 ! lib/Encode/Supported.pm
502 Anton's revision committed. Added Dan's own fixes as well.
503 Message-Id: <159103166906.20020405161134@motor.ru>
504 ! lib/Encode/Alias.pm
506 < qr/^UCS2-le$/i => '"UCS-2"', );
508 > qr/^UCS2-LE$/i => '"UTF-16LE"');
509 Sigh. Thank you, Anton.
510 Message-Id: <14567692196.20020405062020@motor.ru>
511 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
513 1.20 2002/04/04 19:50:52
515 the last minute addition. Just give it a try. Docs remain to be done.
516 Not installed by default.
517 ! lib/Encode/Supported.pod
520 ! lib/Encode/Alias.pm
522 ! lib/Encode/10646_1.pm
523 ! lib/Encode/ucs2_le.pm
524 Canonical name for "UCS-2le" is now "UTF-16LE". UCS-2 left
525 unchanged but UTF-16BE is added as an alias. Implicit aliases
526 move to Encode::Alias so init_alias() works more as expected.
527 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
528 Though pedantically wrong, this should make perl mongers happier.
529 t/Alias.t is enhanced to test all these.
530 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
532 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
533 of the codepoints are common. CJKT left as is because the saving is
539 ! Encode/Makefile_PL.e2x
547 All occurrences of _def.h replaced with .exh so djgpp works happily
548 ever after! To credit this amazing discovery, Laszlo is now in
550 Message-Id: <20020403181424.GA8778@freemail.hu>
551 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
554 ! Encode/Makefile_PL.skel
556 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
557 Message-Id: <20020401222744.GX2000@blackrider>, et al.
559 Two more tests added by jhi
560 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
563 The showstopper fixed -- Memory reallocation bug was causing
564 Encode::XS to fall into infinite loop on certain conditions.
566 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
571 ! lib/Encode/Supported.pod
572 Vendor encodings rebuilt out of original map files at unicode.org.
573 Indic languages such as MacDevanagari remain unsupported due to the
574 shortcomings of encengine capabilities (they need algorithmic
575 conversion and I have no knowledge of that!). Pods fixed for added
577 Oh, macJapan.ucm renamed to macJapanese.ucm.
578 macROMnn is macRomanian and macRUMnn is macRumanian.
579 txt2ucm is a crude script that is used to convert them.
581 Unicode Compound Characters (used extensively on Mac) supported
583 Typo fixes and improvements by jhi
584 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
586 1.11 $Date: 2002/05/03 12:20:13 $
590 Missing files from the MANIFEST fixed.
591 Message-Id: <20020401010156.H10509@alpha.hut.fi>
592 Version incremented just to make CPAN happy.
594 1.10 2002/03/31 21:32:42
597 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
598 if you want. This should make Autrijus happy. Also, piconv
599 is added to default install.
602 Here-documented files that enc2xs generates are now exported
603 to *.e2x. Much cleaner and easier to debug.
605 encoding enhanced so you can make it act more like such
606 (now prehistoric) "localized" variations of perl as Jperl.
608 Further test for encoding.pm. Written in euc-jp
612 Message-Id: <20020330174618.B10154@alpha.hut.fi>
617 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
624 ! Encode/macIceland.ucm
625 ! lib/Encode/Alias.pm
626 ! lib/Encode/Supported.pod
627 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
628 Message-Id: <m3lmcavhjt.fsf@anima.de>
630 1.01 2002/03/29 20:59:39
633 s/USE_SCRIPTS/MORE_SCRIPTS/
635 installs enc2xs by default for external Encode:: modules in CPAN,
636 such as Encode::HanExtra
638 More sensible perl core detection via $ENV{PERL_CORE}
640 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
642 Perl core detection via $^X =~ m/\bminiperl$/o
643 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
647 The version of all files is updated to 1.00 via "ci -f -l1.00",
648 commemorating version 1.00. All files, including *.ucm are now
649 under version control.
652 encode.h moved to Encode/ so it will be installed for the later
655 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
660 compile renamed to enc2xs.
661 Affected Makefile.PL updated
663 "Punt it. HanExtra can take care of that later." -- Autrijus
664 Message-Id: <20020328154338.GA7351@not.autrijus.org>
668 ! lib/Encode/CJKConstants.pm
669 ! lib/Encode/KR/2022_KR.pm
670 Table patches for Euro Signs, 2022-KR fixups by Jungshik
671 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
675 bin/ added for example scripts. They are not installed by default.
676 To install them, use "perl Makefile.PL USE_SCRIPTS".
677 piconv is iconv reinvented in perl. In addition to all features
678 of iconv, it also adds perlish features. See L<piconv/1> for more
680 ! lib/Encode/Alias.pm
681 qr/^ replaced with qr/\b so it directly matches locale names
682 such as en_US.US-ASCII
685 Patch by MJD to fix the following problem applied.
686 Subject: [PATCH 5.7.3 Encode]
687 Aliases.t not properly skipped when Encode extension not built
688 Message-Id: <20020328091850.18677.qmail@plover.com>
689 ! lib/Encode/KR/2022_KR.pm
690 ! lib/Encode/CJKConstants.pm
691 Another patch from Jungshik to make iso-2022-kr actually work
692 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
693 ! Encode/Encode/euc-kr.ucm
694 + Encode/Encode/johab.ucm
695 ! Encode/Encode/ksc5601.ucm
697 ! Encode/KR/Makefile.PL
698 ! Encode/lib/Encode/Alias.pm
700 Johab support and complete revision of Korean Encoding by Jungshik
701 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
703 Revised to make up with now-dropped Encode::Details.
704 - lib/Encode/Details.pod
705 Dropped. Besides being obsolete, the topics are now covered in
711 Korean aliases fixed thanks to Jungshik Shin
712 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
713 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
715 =head1 NAME added to all modules to make buildtoc happy
716 Message-Id: <20020327041151.A10618@alpha.hut.fi>
717 - lib/Encode/CJKguide.pod
718 Too controversial and dropped from the dist. Will be available
719 separately on the web.
721 RCS tags added so table debugging gets easier (should that be
722 needed! I hope they all stay 1.00!)
723 + lib/Encode/CJKguide.pod
724 A detailed guide to mainly, but not limited to, CJK multibyte
727 + Encode/hp-roman8.ucm
729 ! Encode/Supported.pod
730 All occurrences of "roman8" replaced with "hp-roman8" to avoid
732 ! Encode/Supported.pod
735 Mac Encodings now comply with Inside Macintosh
737 Test for '-raw' conventions added.
739 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
743 "-raw" appended to canonical names.
744 File names stay unchanged thanks to UCM format.
745 ! lib/Encode/CN/HZ.pm
746 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
747 Message-Id: <20020326035210.GA2091@not.autrijus.org>
750 - lib/Encode/JP/Const.pm
751 + lib/Encode/CJKConstants.pm
752 + lib/Encode/CN/2022_CN.pm
753 + lib/Encode/KR/2022_KR.pm
762 * Support for ISO-2022-KR and ISO-2022-CN added.
764 * more t/*.{euc,ref} added, which were autogenerated by ucm2table
765 * ucm2table autogenerates character table out of UCM files.
768 - lib/Encode/Supports.pod
769 + lib/Encode/Supported.pod
770 Names reverted due to popular demand.
771 8.3 rule applies only when there is a conflict.
772 Message-Id: <20020325095924.GD44120@not.autrijus.org>
777 - lib/Encode/Format/Enc.pod
779 * Character tables are now 100% ucm.
780 * All files under Encode/ are now 8.3-compliant
781 * some missing encodings added (i.e. gsm0338 and nextstep)
782 * Vendor mappings aggregated with appropriate national std in
783 Makefile.PL, resulting in smaller *.so especially for CJK.
784 Following is result on Dan's FreeBSD box.
786 ---------------------------------------------------------------
787 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
788 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
789 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
790 blib/arch/auto/Encode/Encode.so 27,791 27,791
791 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
792 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
793 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
794 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
795 ---------------------------------------------------------------
796 Total 5,375,297 6,343,819
798 * As a result of ucm-transition, Encode::Tcl dropped because
799 Encode::Tcl demands *.enc.
800 Encode::Tcl will be supplied in a separate tarball with *.enc.
801 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
806 -lib/Encode/Supported.pod
807 +lib/Encode/Supports.pod
808 -lib/Encode/iso10646_1.pm
809 +lib/Encode/10646_1.pm
810 -lib/Encode/EncFormat.pod
811 +lib/Encode/Format/Enc.pod
812 Files renamed for 8.3 filename compliance. Affected modules/scripts revised.
813 - lib/Encode/JP/Constants.pm
814 + lib/Encode/JP/Consts.pm
815 ! lib/Encode/JP/JIS.pm
816 ! lib/Encode/JP/H2Z.pm
817 Version nit problem and 8.3 rule fix.
818 > Package namespace installed latest in CPAN file
819 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
820 was noted by jhi, then Dan discovered "Constants.pm" does not comply with the 8.3
821 rule. Constants.pm renamed to Consts.pm and affected modules are fixed
822 accordingly. In addition, legacy "use vars qw()..." are replaced with
824 Message-Id: <20020325011248.D1561@alpha.hut.fi>
825 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
827 - lib/Encode/JP/ISO_2022_JP.pm
828 - lib/Encode/JP/ISO_2022_JP_1.pm
829 + lib/Encode/JP/2022_JP.pm
830 + lib/Encode/JP/2022_JP1.pm
832 8.3 naming conflict for vanilla fat addressed by jhi
833 Message-Id: <20020324201931.V22596@alpha.hut.fi>
836 Typecast fix addressed by jhi
837 Message-Id: <20020324185540.T22596@alpha.hut.fi>
840 ! lib/Encode/Supported.pod
842 + lib/Encode/JP/ISO_2022_JP_1.pm
843 ! lib/Encode/JP/ISO_2022_JP.pm
844 ! lib/Encode/JP/JIS.pm
846 Now Encode::JP is more strict on the difference between ISO-2022-JP
847 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
848 makes Anton happier :) FYI the previous version implements
849 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
850 ! lib/Encode/Supported.pod
853 Avoid core-dump in Encode with PERLIO=mmap by NI-S
854 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
859 ! lib/Encode/Supported.pod
860 pod fixes to replace F<http://...> with L<http://...>,
861 as suggested by Autrijus in:
862 Message-Id: <20020324083943.GA14901@not.autrijus.org>
863 ! lib/Encode/Supported.pod
864 fixes and enhancements by Anton
865 Message-Id: <10632060120.20020324103753@motor.ru>
866 ! lib/Encode/Alias.pm
867 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
868 added. Suggested by Anton then deobfuscated by Autrijus
869 Message-Id: <20020324064455.GA3667@not.autrijus.org>
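The define_alias() call quoted above uses a regex on the left and a string on the right that is evaluated after a successful match. The following sketch shows both the plain and the regex form, assuming a current Encode release. The alias names "MyWestern" and "myiso" are hypothetical and exist only for illustration.

```perl
use strict;
use warnings;
use Encode;
use Encode::Alias;

# Plain alias: a hypothetical name mapped to a canonical one.
define_alias("MyWestern" => "iso-8859-1");

# Regex alias: the right-hand side is a string that is eval'ed after a
# match, so $1 refers to the captured digits. "myiso" is a made-up prefix.
define_alias(qr/^myiso[-_]?(\d+)$/i => '"iso-8859-$1"');

print find_encoding("MyWestern")->name, "\n";
print find_encoding("myiso-2")->name, "\n";
```

The single quotes around the right-hand side matter: they keep "$1" from being interpolated when the alias is defined, so it is filled in only when a name actually matches.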
871 Further fix by Nicholas Clark
872 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
873 - lib/EncodeFormat.pod
874 + lib/Encode/EncFormat.pod
876 File renamed as suggested by Autrijus
878 ! lib/Encode/Details.pod
879 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
880 ! Encode.pm Sun Mar 24 13:43:47 2002
881 pod fixes by Autrijus.
882 Message-Id: <20020324062804.GA3595@not.autrijus.org>
883 Message-Id: <20020324075627.GB11986@not.autrijus.org>
885 ! lib/Encode/Alias.pm
887 now more EBCDIC conscious;
888 %ExtModules on EBCDIC system excludes CJK so that you don't
889 have to worry about the matched alias resulting cloaking.
890 t/Alias.t also revised to reflect changes. Verified by jhi
891 Message-Id: <20020324022929.D22596@alpha.hut.fi>
897 EBCDIC detection mechanism installed as in JP/JP.pm
898 Message-Id: <20020323211847.G19148@alpha.hut.fi>
906 Now all table files used by compile are postfixed '_t' to avoid
907 namespace collisions in case insensitive file systems once for all!
909 Message-ID: <58290227735.20020323195659@familiehaase.de>
911 Since the Encode::JP is unsupported under EBCDIC we
912 cannot run this test (aliases as such should work fine) -- jhi
913 Message-Id: <20020323202119.D19148@alpha.hut.fi>
915 duplicate occurrences of ascii.ucm and 8859-1.ucm
916 cause MacOS X dyld to croak
922 < chdir 't' if -d 't';
924 > if (! -d 'blib' and -d 't'){ chdir 't' };
925 When you are "make test"-ing on Encode/ directory, you must not
926 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
927 remain unchanged. Also the situation detection was made simpler
928 in t/JP.t, which was originally;
929 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
931 "Use of uninitialized value in string eq at Encode.pm line 96."
935 -- Problem on case insensitive file systems
936 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
937 Message-ID: <88254111953.20020323095503@familiehaase.de>
940 "So I think it's a bug in gcc, not perl. But it still needs to be
942 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
943 Message-Id: <20020323170509.C96475@plum.flirble.org>
947 ! lib/Encode/Encoding.pm
948 ! lib/Encode/Alias.pm
949 ! lib/Encode/Supported.pod
951 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
952 Message-ID: <20020322073908.GB10539@blackrider>
955 "...I think we should include ISO 8859-1 as well." -- NI-S
956 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
961 ! lib/Encode/Alias.pm
962 alias definitions relocated to Encode::Alias so module autoloading
963 works for aliases also.
965 encodings() now accepts args to check ExtModules.
974 Latin and single byte encodings are reorganized so they are
975 demand-loaded like Encode::XX. Now only ascii is compiled into
977 ! lib/Encode/Alias.pm
978 for my $k (keys %hash){ delete $hash{$k}; }
979 is deprecated; fixed.
982 In this update, pod rewrites and alias fixes are the main issues
983 + lib/Encode/Supported.pod
984 Describes supported encodings
986 streamlined compiled-in encodings.
987 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
989 + Encode/ibm-125?.ucm
990 Added from the icu distribution with every occurrence of
991 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
992 some respect to icu staff, however.
993 + lib/Encode/Alias.pm
995 Alias definitions in Encode.pm relocated.
998 packWARN patch from Paul Marquess via jhi
999 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1000 Paul added to AUTHORS as a result.
1001 ! t/CJKalias.t -> t/Aliases.t
1002 Renamed. Checks even more aliases and alias overloading
1005 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1006 gbk => cp936 relocated to CN.pm
1008 Test::More with plans (by jhi)
1010 0.94 Thu Mar 21 2002
1011 + lib/Encode/Description.pod
1012 ! lib/Encode/Encoding.pm
1013 Now the pod in Encode.pm is abridged as programming references.
1014 lib/Encode/Description.pod contains the original, detailed description
1015 and Encode::Encoding explains how to write your own module to
1016 add new encodings. So far, lib/Encode/Description.pod contains
1017 the whole pod once in Encode.pm. This is intentional.
1019 Pod revisions by Anton Tagunov
1020 Message-Id: <517178431.20020320174824@motor.ru>
1022 all occurrences of Encode::Tcl::Extended removed including pod
1024 test now checks $encoding->name only; $encoding->{name} is
1025 no longer checked to find the canonical name.
1026 ! lib/Encode/JP/JIS.pm
1027 ! lib/Encode/JP/ISO_2022_JP.pm
1028 ->name() added to be more compliant with API
1034 Patch by Autrijus to add aliases to TW and fixes to POD
1035 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1037 SADAHIRO Tomoyuki added as should. My apologies.
1039 0.93 Wed Mar 20 2002
1040 * First release to be uploaded to CPAN. For prehistoric changes,
1041 please see Changes file of perl distribution as well as
1042 perl-unicode@perl.org archive, available at:
1043 http://archive.develooper.com/perl-unicode@perl.org/
1045 Changes since 0.92 include:
1050 + Mention to perl-unicode@perl.org added
1052 + Encoding aliases added so you can feed locale names
1053 and MIME Charset="" directly.
1054 - Mention to JISX0212 removed because it's fixed
1057 + Encoding aliases added. Note TW is left untouched because
1058 euc-tw is not implemented in TW but in Encode::HanExtra.
1059 Autrijus, you may fix Encode::HanExtra.
1061 + to test encode aliases added