1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.70 2002/05/06 10:26:48 dankogai Exp dankogai $
6 $Revision: 1.70 $ $Date: 2002/05/06 10:26:48 $
8 Made more 'module-safe' with conjunction w/ 'no encoding'.
9 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
10 ! lib/Encode/Encoding.pm
11 'require Encode' because ->Define uses Encode::define_encoding();
12 problem and solution addressed by Miyagawa-kun
13 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
15 Cuts the frill to make djgpp happier, as suggested by Laszlo
16 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
18 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
19 Message-Id: <m3bsbug48n.fsf@anima.de>
21 1.69 2002/05/04 16:41:18
22 ! lib/Encode/MIME/Header
23 Floating-point coerced for UNICOS (in integer arithmetics it folds
24 line one character too early). Verification by Mark is pending.
25 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
27 more doc patch from Elizabeth
28 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
29 ! Encode/Makefile_PL.e2x
30 More platform-independent patch from Benjamin
31 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
32 ! lib/Encode/Guess AUTHORS
33 split regex fix by Graham Barr. Adds him to AUTHORS.
34 Message-Id: <20020504085419.E95940@valueclick.com>
35 ! Encode/Makefile_PL.e2x
36 enc2xs script discovery made smarter and more sensible, first cited
37 by Miyagawa-kun and further suggestions by Rafael and Andreas
38 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
39 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
41 1.68 2002/05/03 12:20:13
42 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
43 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
44 and Supported.pod modified to reflect the change. Elizabeth added
45 to Authors. And H.M. is also added for forwarding her patch among
46 other contributions (I was rather surprised to find his name was not
48 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
50 1.67 2002/05/02 07:33:09
52 Error message now consistent w/ perlqq (\N{U+} -> \x{})
53 done in perl@16308 but Philip linted me further. Now the error
54 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
56 Sanity check for happier -w by Autrijus
58 1.66 2002/05/01 05:41:06
59 ! Encode.xs t/fallback.t
60 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
61 while fallback is in effect. This even came with a welcome side-effect
62 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
63 also modified to test this.
64 And of course, the corresponding varialbles to UV[Xx]f are appropriately
65 cast. This should've concluded NI-XS homework.
67 encode(undef) does warn again! Repented upon suggestion by NI-XS.
68 Document for unless vs. '' added
69 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
71 1.65 2002/04/30 16:13:37
73 encode(undef) no longer warns for C<Use of uninitialized value in
74 subroutine entry>. Suggested by Paul.
75 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
76 ! lib/Encode/Supported.pod
77 Encode::MIME::Header and Encode::Guess mentioned
78 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
80 POD fix by Miyagawa-kun
81 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
83 1.64 2002/04/29 06:54:06
85 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
86 encocde("euc-jisx0213") are still DIFFERENT.
87 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
89 A few white spaces corrected by NI-XS via PerlIO integration to
91 Subject: Change 16247: Integrate perlio;
93 Document fixes by Andreas
94 Message-Id: <m3k7qsf1we.fsf@anima.de>
96 1.63 2002/04/27 18:59:50
97 ! lib/Encode/Encoding.pm
98 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
99 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
100 Make use of the Encode::Encoding base class!
101 And other cleanups in Encode.xs upon NI-XS suggestions
102 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
104 1.62 2002/04/27 11:17:39
106 encodings() now just check %ExtModule instead of eval{require}
107 all of them for ":all" to conserve more memory.
109 more "%x" -> "%" UVxf stuff.
111 s/=over2/=over 2/g # oops.
113 1.61 2002/04/26 03:02:04
115 Now does decent tests besides use_ok()
116 ! lib/Encode/Guess.pm t/guess.t
117 UI streamlined, document added
119 various signed/unsigned mismatch nits (#16173)
120 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
122 POD: utf8-flag-related caveats added. A few sections completely
126 Thou shalt not assume %d works, either!
127 Robin Baker added to AUTHORS for this
128 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
130 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
132 1.60 2002/04/24 20:06:52
134 "Thou shalt not assume %x works." -- jhi
135 Message-Id: <20020424210618.E24347@alpha.hut.fi>
136 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
137 low-memory build machines happy, now *.c is created for each *.ucm
138 (no table aggregation). You can still override this by setting
139 $ENV{AGGREGATE_TABLES}.
140 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
141 + lib/Encode/Guess.pm
142 + lib/Encode/JP/JIS7.pm
143 Encoding-autodetect (mainly for Japanese encoding) added. In a
144 course of development, JIS7.pm was improved.
145 + lib/Encode/HTML/Header.pm
146 + lib/Encode/Config.pm
147 MIME B/Q Header Encoding Added!
148 ! Encode.pm Encode.xs t/fallback.t
149 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
150 Message-Id: <20020424130709.GA14211@tanglefoot>
152 1.59 $ 2002/04/22 23:54:22
153 ! Encode.pm Encode.xs
154 needs_lines() and perlio_ok() are added to Internal encodings such
155 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
157 1.58 2002/04/22 23:54:22
159 s/MacChineseSimp/MacChineseTrad/ # ... oops.
164 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
165 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
166 t/(JP|KR).t are aggregated to t/CJKT.t
167 test data are all remade via bin/ucm2text.
168 And .... They are no longer skipped for -Uuseperlio !
170 1.57 2002/04/22 20:27:30
171 ! t/JP.t t/KR.t t/perlio.t
172 unless (find PerlIO::Layer 'perlio') ... line is back again.
173 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
174 need PerlIO. Sigh....
175 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
176 ->perlio_ok now does eval{ require PerlIO::encoding } there so
177 it correctly returns 1 when PerlIO::encoding is yet loaded.
179 perl-current patch #16072 reflected
181 1.56 2002/04/22 09:48:07
182 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
183 New PerlIO::encoding 0.04 compliance met
185 1.55 2002/04/22 03:43:05
186 ! Encode.pm Encode.xs Unicode/Unicode.pm
187 needs_lines() defined so Encode::Encoding is no longer needed
190 1.54 2002/04/22 02:50:01
191 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
192 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
193 now perlio_ok is true by default if PerlIO::encoding->VERSION is
194 0.03 or larger. POD in Encode::Encoding revised to reflect this.
195 Encode::XS and Encode::Unicode now has perlio_ok() method.
196 ! lib/Encode/Supported.pod
197 s/UP-UX/HP-UX/ by jhi
198 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
199 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
200 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
201 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
202 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
203 ! lib/Encode/Supported.pod
204 Huge document fixes by Philip.
207 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
208 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
210 binmode() after "<:encoding" to make Win32 happy, by Mattia.
211 Mattia added to AUTHORS file
212 Message-Id: <3CC3150F.5798.22A05AE@localhost>
214 1.52 2002/04/20 23:43:47
216 TODO: is now SKIP:, as NI-XS requested. Also adds more
217 eraborate failure analysis added.
219 A note on how to make sure of round-trip safety added to POD
220 section (so Autrijus is happier)
221 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
222 big5-(eten|hkscs) is round-trip safe again!
223 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
225 Typo fixes by Andreas
226 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
227 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
228 PerIO coodination patches from NI-XS.
229 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
231 1.51 2002/04/20 09:58:23
233 Updated test suite by Autrijis so "make test" is happy again
234 Message-Id: <20020420082104.GA25037@not.autrijus.org>
236 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
238 TW/TW.pm TW/Makefile.PL
239 Updates by Autrijus. 'big5' is no longer a canonical but an
240 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
241 Message-Id: <20020419195346.GA19597@not.autrijus.org>
243 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
244 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
246 PerlIO detection a little bit smarter; no longer uses eval qq{}
249 1.50 2002/04/19 06:13:02
250 ! ! Encode.pm Encode.xs Encode/encoding.h
252 New Fallback API imlemented and documented. See "perldoc Encode"
254 ! lib/Encode/JP/JIS7.pm Encode.pm
255 + lib/Encode/PerlIO.pod t/perlio.t
256 API compliance met. However, it still does not work unless perlio
257 implements line buffer. See BUGS section in perldoc Encode::PerlIO
258 As a sensible workaround, perlio_ok() added to Encode.
260 ! lib/Encode/Supported.pod
262 Message-Id: <20020418174647.J8466@alpha.hut.fi>
264 Doc fixes from Autrijus
265 Message-Id: <20020418144131.GA10987@not.autrijus.org>
267 perlqq mode documented
269 + t/jisx0201.euc t/jisx0201.ref
270 ! t/jisx0208.euc t/jisx0208.ref
271 t/JP.t tests more rigorously and with other encodings
272 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
273 in jis0208 (used to contain jisx0201 part).
274 ! Encode/Makefile_PL.e2x
275 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
276 enc2xs and encode.h rather than hard-coded. This allows the resulting
277 module fully CPANizable.
278 ! encoding.pm t/JP.t t/KR.t
279 PerlIO detection simplified (checks %INC instead of eval{})
280 ! Encode.xs Encode/encode.h
281 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
282 - lib/Encode/Unicode.pm
283 (en|de)code_xs relocated to where it belongs. Source reindented
286 Additional (U8 *) cast added as suggested by jhi
287 Message-Id: <20020417165916.A28599@alpha.hut.fi>
289 1.42 Date: 2002/04/17
291 no-op module; Thought of adding a pod there but enc2xs has
296 correct mechanism to detect Perlio::encoding layar installed.
298 PerlIO Layer detached.
300 1.41 2002/04/16 23:35:00
302 binmode(STDIN|STDOUT ...) done iff PerlIO is available
304 Cleaned up PerlIO skip conditions to prepare for the upcoming
305 Encode - PerlIO forking.
307 exported functions are now prototyped.
308 ! lib/Encode/CN/HZ.pm
311 fallback implemented # was /* FIXME */
312 affected programs revised to fit (only HZ was using the try-catch
313 approach which needed to be fixed for API-compliance).
315 ! Encode/KR/2022_KR.pm
317 can find =head1 NAME now, jhi
318 Message-Id: <20020416083059.V30639@alpha.hut.fi>
322 now complies with less warnings with the pickest compilers.
323 Suggested by Craig, fixed by Dan.
324 ! Encode/Makefile_PL.e2x
326 A bug that fails to find *.e2x in certain conditions fixed
328 1.40 2002/04/14 22:27:14
329 + Encode/ConfigLocal_PM.e2x
330 ! lib/Encode/Config.pm
332 "enc2xs -C" now generates/updates Encode::ConfigLocal.
333 ConfigLocal_PM.e2x is a skelton thereof.
334 ! lib/Encode/Config.pm
336 "use Encode::CN::HZ;" was missing.
339 More rigorous tests added to test XS, especially on memory allocation.
341 ! lib/Encode/Unicode.pm
342 NI-S implemented an XS version -- merged
343 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
346 Source filter option added. With this option on, you can write
347 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
348 encodings. t/jperl.t enhanced to test this feature.
350 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
351 apply his suggestion, this degree of nitting is enough to add him
353 Message-Id: <3CB93223.291E5E2E@earthlink.net>
355 + lib/Encode/JP/JIS7.pm
356 - lib/Encode/JP/JIS.pm
357 - lib/Encode/JP/2022_JP.pm
358 - lib/Encode/JP/2022_JP1.pm
359 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
360 JIS7.pm for better maintainability and performance
362 Added caveat for non-ascii identifiers.
364 fixes by jhi, the original author of this pragramtic module.
365 Message-Id: <20020413231527.V1826@alpha.hut.fi>
367 1.34 2002/04/12 20:23:05 (Unreleased)
370 EBCDIC fixes addressed by jhi.
371 Message-Id: <20020412161844.D9383@alpha.hut.fi>
372 ! lib/Encode/Encoder.pm
373 POD fix by Miyagawa-kun
374 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
376 1.33 2002/04/10 22:28:40
378 Philip's mail address corrected.
381 ! lib/Encode/Encoder.pm
382 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
383 This adds Miyagawa-kun to AUTHORS.
384 * encoding() no longer exported by default but on demand
385 * t/Encoder.t updated to test all these
386 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
387 ! lib/Encode/Unicode.pm
388 ! lib/Encode/Supported.pm
389 Further doc fixes by Anton
391 1.32 2002/04/09 20:06:15
394 - ucm/macDevanaga.ucm Unicode Character Map
395 - ucm/macGujarati.ucm Unicode Character Map
396 - ucm/macGurmukhi.ucm Unicode Character Map
397 A utility to check integrity of .ucm files. t/bogus.ucm is a
398 ucm that is deliberately bogus. unused Indic mappings are removed
401 resolve_alias() added as suggested by jhi. Same as
402 find_encoding("alias")->name. For convenience. This one is
403 defined in Encode.pm instead of Alias.pm.
404 Message-Id: <20020409215846.H17022@alpha.hut.fi>
406 Memory Allocate but detected during the devel of ucmlint -- fixed.
407 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
408 ! lib/Encode/Unicode.pm
409 valid_ucs2(0) is false but must be true.
410 3 patches from NI-S as follows. This also has fixed the incident
412 ! lib/Encode/Alias.pm
413 find_alias() recursion prevention
415 Checks for the patch above
416 ! t/Encode/Unicode.pm
417 An extra "F" that causes valid_ucs2() return a bogus value fixed
418 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
419 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
420 2 Small Patches from jhi as follows:
422 Encode->encodings() lists in case-insensitve order (as it was)
424 -l option prints avaiable encodings to STDOUT instead of STDERR
425 ! lib/Encode/Aliases.pm
426 s/defintion/definition/
427 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
429 ! lib/Encode/Supported.pod
430 ! lib/Encode/Unicode.pm
431 POD revise by Philip Newton. This adds Philip to AUTHORS list.
432 Thank you for the exact quote of Douglas Adams :)
433 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
435 1.31 2002/04/08 18:08:07
436 ! lib/Encode/Encoder.pm
438 Encode::Encoder, once just a placeholder of an idea, is now much more
439 practical. See t/Encode.t to find how practical it can be.
440 + lib/Encode/Config.pm
442 my false laziness at Encode.pm is fixed. Now %ExtModules are set
443 in Encode::Config and they are all literally, not programatically
444 set. My false laziness was resulting many encodings missing from
446 ! lib/Encode/Unicode.pm
448 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
449 so that it does not rely Encode::Unicode for BOM values
450 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
452 1.30 2002/04/08 02:34:51
453 + lib/Encode/Encoder.pm
454 Object Oriented Encoder. I reckon something like this is in need.
457 ! lib/Encode/Supported.pod
458 * autoloading bug that prevented upper-case canonicals such as UTF-16
459 is fixed. Now even UTF/UCS are autoloaded!
460 * encodings() is now more intuitive.
461 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
463 * Obligatory fixes to the POD.
464 ! lib/Encode/Supported.pod
465 Patch from Anton applied.
466 Message-Id: <66641479.20020408033300@motor.ru>
468 ! lib/Encode/Unicode.pm
469 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
471 1.28 2002/04/07 18:58:42
475 Just a MANIFEST for those missing files.
477 1.26 Date: 2002/04/07 15:22:04
480 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
481 And t/Aliases.t fixed to make sure they all exist.
483 1.25 2002/04/07 15:01:25 (Unreleased)
485 ! lib/Encode/Unicode.pm
488 - lib/Encode/UTF_EBCDIC.pm
489 - lib/Encode/Internal.pm
491 Integrated into Encode.pm as closures. That way "one package, one file"
492 rule is preserved yet less files to require.
494 commented out binmode(STDERR ...
503 ! Encode/Makefile_PL.e2x
504 Schwarn's MM-compliance patch merged
505 Message-Id: <20020406082609.GA28758@blackrider>
507 ! lib/Encode/Unicode.pm
508 + lib/Encode/UTF_EBCDIC.pm
510 - lib/Encode/10646_1.pm
511 - lib/Encode/ucs2_le.pm
512 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
513 per-module based (en|de)code, I saved a number of .pm by
514 reorganizing it as per-object base (Well, this is what Encode::XS
515 does under the hood). See Encode::Unicode for details.
516 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
517 This module is used only on EBCDIC environments.
519 1.21 2002/04/05 14:46:34 (Not Released)
525 Are back to make Perl/Tk happy Smile, NI-S.
527 ! lib/Encode/Alias.pm
528 ! lib/Encode/Supported.pm
529 ! lib/Encode/10646_1.pm
530 ! lib/Encode/ucs2_le.pm
531 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
532 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
533 updated to reflect changes.
534 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
535 ! lib/Encode/Alias.pm
536 ! lib/Encode/Supported.pm
537 Anton's revision commited. Added Dan's own fixes as well.
538 Message-Id: <159103166906.20020405161134@motor.ru>
539 ! lib/Encode/Alias.pm
541 < qr/^UCS2-le$/i => '"UCS-2"', );
543 > qr/^UCS2-LE$/i => '"UTF-16LE"');
544 Sigh. Thank you, Anton.
545 Message-Id: <14567692196.20020405062020@motor.ru>
546 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
548 1.20 2002/04/04 19:50:52
550 the last minute addtion. Just give it a try. Docs remains to be done.
551 Not installed by default.
552 ! lib/Encode/Supported.pod
555 ! lib/Encode/Alias.pm
557 ! lib/Encode/10464_1.pm
558 ! lib/Encode/ucs2_le.pm
559 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
560 unchanged but UTF-16BE is added as an alias. Implicit aliases
561 move to Encode::Alias so init_alias() works more as expected.
562 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
563 Though pedantically wrong, This should make perl mongers happier.
564 t/Alias.t is enhanced to test all these.
565 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
567 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
568 of the codepoints are common. CJKT left as is because the saving is
574 ! Encode/Makefile_PL.e2x
582 All occurance of _def.h replaced with .exh so djgpp works happily
583 ever after! To credit this amazing discovery, Laszlo is now in
585 Message-Id: <20020403181424.GA8778@freemail.hu>
586 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
589 ! Encode/Makefile_PL.skel
591 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
592 Message-Id: <20020401222744.GX2000@blackrider>, et al.
594 Two more tests by added jhi
595 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
598 The showstopper fixed -- Memory reallocation bug was causing
599 Encode::XS to fall into infinite loop on certain conditions.
601 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
606 ! lib/Encode/Supported.pod
607 Vendor encodings rebuilt out of original map files at unicode.org.
608 Indic languages such as MacDevanagali remain unspported do to the
609 shortcoming of encengine capabilities (they need algorithmical
610 conversion and I have no knowledge on that!). Pods fixed for added
612 Oh, macJapan.ucm renamed to macJapanese.ucm.
613 macROMnn is macRomanian and macRUMnn is macRumanian.
614 txt2ucm is a crude script that is used to convert them.
616 Unicode Compound Characters (used extensively on Mac) supported
618 Typo fixes and improvements by jhi
619 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
621 1.11 $Date: 2002/05/06 10:26:48 $
625 Missing files from the MANIFEST fixed.
626 Message-Id: <20020401010156.H10509@alpha.hut.fi>
627 Version incremented just to make CPAN happy.
629 1.10 2002/03/31 21:32:42
632 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
633 if you want. This should make Autrijus happy. Also, piconv
634 is added to default install.
637 Here-documented files that enc2xs generates are now exported
638 to *.e2x. Much cleaner and easier to debug.
640 encoding enhances so you can make it act more like such
641 (now prehistoric ) "localized" variations of perl like Jperl.
643 Further test for encoding.pm. Written in euc-jp
647 Message-Id: <20020330174618.B10154@alpha.hut.fi>
652 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
659 ! Encode/macIceland.ucm
660 ! lib/Encode/Alias.pm
661 ! lib/Encode/Supported.pod
662 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
663 Message-Id: <m3lmcavhjt.fsf@anima.de>
665 1.01 2002/03/29 20:59:39
668 s/USE_SCRIPTS/MORE_SCRIPTS/
670 installs enc2xs by default for external Encode:: modules in CPAN,
671 such as Encode::HanExtra
673 More sensible perl core detection via $ENV{PERL_CORE}
675 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
677 Perl core ditection via $^X =~ m/\bminiperl$/o
678 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
682 The version of all files is updated to 1.00 via "ci -f -l1.00",
683 commemorating version 1.00. All files, including *.ucm are now
684 under version control.
687 encode.h moved to Encode/ so it will be installed for the later
690 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
695 compile renamed to enc2xs.
696 Affected Makefle.PL updated
698 "Punt it. HanExtra can take care of that later." -- Autrijus
699 Message-Id: <20020328154338.GA7351@not.autrijus.org>
703 ! lib/Encode/CJKConstants.pm
704 ! lib/Encode/KR/2022_KR.pm
705 Table patches for Euro Signs, 2022-KR fixups by Jungshik
706 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
710 bin/ added for example scripts. They are not installed by default.
711 to install them, "perl Makefile.PL USE_SCRIPTS".
712 piconv is iconv reinvented in perl. in addition to all features
713 of iconv, it also adds perlish features. See L<piconv/1> for more
715 ! lib/Encode/Alias.pm
716 qr/^ replaced with qr/\b so it directly matches locale names
717 such as en_US.US-ASCII
720 Patch by MJD to fix the following problem applied.
721 Subject: [PATCH 5.7.3 Encode]
722 Aliases.t not properly skipped when Encode extension not built
723 Message-Id: <20020328091850.18677.qmail@plover.com>
724 ! lib/Encode/KR/2022_KR.pm
725 ! lib/Encode/CJKConstants.pm
726 Another patch from Jungshik to make iso-2022-kr actually work
727 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
728 ! Encode/Encode/euc-kr.ucm
729 + Encode/Encode/johab.ucm
730 ! Encode/Encode/ksc5601.ucm
732 ! Encode/KR/Makefile.PL
733 ! Encode/lib/Encode/Alias.pm
735 Johab support and complete revision of Korean Encoding by Jungshik
736 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
738 Revised to make up with now-dropped Encode::Details.
739 - lib/Encode/Details.pod
740 Dropped. Besides being obsolete, the topics are now covered in
746 Korean aliases fixed thanks to Jungshik Shin
747 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
748 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
750 =head1 NAME added to all modules to make buildtoc happy
751 Message-Id: <20020327041151.A10618@alpha.hut.fi>
752 - lib/Encode/CJKguide.pod
753 Too controversial and dropped from the dist. Will be available
754 separately on the web.
756 RCS tags added so table debugging gets easier (should that be
757 needed! I hope they all stay 1.00!)
758 + lib/Encode/CJKguide.pod
759 A detailed guide to mainly, but not limited to, CJK multibyte
762 + Encode/hp-roman8.ucm
764 ! Encode/Supported.pod
765 All occurance of "roman8" replaced with "hp-roman8" to avoid
767 ! Encode/Supported.pod
770 Mac Encodings now comply the Inside Macintosh
772 Test for '-raw' conventions added.
774 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
778 "-raw" appended to canonical names.
779 File mames stay unchanged thanks to UCM format.
780 ! lib/Encode/CN/HZ.pm
781 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
782 Message-Id: <20020326035210.GA2091@not.autrijus.org>
785 - lib/Encode/JP/Const.pm
786 + lib/Encode/CJKConstants.pm
787 + lib/Encode/CN/2022_CN.pm
788 + lib/Encode/KR/2022_KR.pm
797 * Support for ISO-2022-KR and ISO-2022-CN added.
799 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
800 * ucm2table autogenerates character table out of UCM files.
803 - lib/Encode/Supports.pod
804 + lib/Encode/Supported.pod
805 Names reverted due to popular demand.
806 8.3 rule applies only when there is a conflict.
807 Message-Id: <20020325095924.GD44120@not.autrijus.org>
812 - lib/Encode/Format/Enc.pod
814 * Character tables is now 100% ucm.
815 * All files under Encode/ is now 8.3-compliant
816 * some of missing encodings added (i.e. gsm0338 and nextstep)
817 * Vendor mappings aggregated with appropriate national std in
818 Makefile.PL, resulting smaller *.so especially for CJK.
819 Following is result on Dan's FreeBSD box.
821 ---------------------------------------------------------------
822 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
823 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
824 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
825 blib/arch/auto/Encode/Encode.so 27,791 27,791
826 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
827 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
828 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
829 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
830 ---------------------------------------------------------------
831 Total 5,375,297 6,343,819
833 * As a result of ucm-transition, Encode::Tcl dropped because
834 Encode::Tcl demands *.enc.
835 Encode::Tcl will be supplied in a separate tarball with *.enc.
836 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
841 -lib/Encode/Supported.pod
842 +lib/Encode/Supports.pod
843 -lib/Encode/iso10646_1.pm
844 +lib/Encode/10646_1.pm
845 -lib/Encode/EncFormat.pod
846 +lib/Encode/Format/Enc.pod
847 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
848 - lib/Encode/JP/Constants.pm
849 + lib/Encode/JP/Consts.pm
850 ! lib/Encode/JP/JIS.pm
851 ! lib/Encode/JP/H2Z.pm
852 Version nit problem and 8.3 rule fix.
853 > Package namespace installed latest in CPAN file
854 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
855 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
856 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
857 accordingly. In addition, legacy "use vars qw()..." are replaced with
859 Message-Id: <20020325011248.D1561@alpha.hut.fi>
860 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
862 - lib/Encode/JP/ISO_2022_JP.pm
863 - lib/Encode/JP/ISO_2022_JP_1.pm
864 + lib/Encode/JP/2022_JP.pm
865 + lib/Encode/JP/2022_JP1.pm
867 8.3 naming conflict for vanilla fat addressed by jhi
868 Message-Id: <20020324201931.V22596@alpha.hut.fi>
871 Typecast fix addressed by jhi
872 Message-Id: <20020324185540.T22596@alpha.hut.fi>
875 ! lib/Encode/Supported.pod
877 + lib/Encode/JP/ISO_2022_JP_1.pm
878 ! lib/Encode/JP/ISO_2022_JP.pm
879 ! lib/Encode/JP/JIS.pm
881 Now Encode::JP is more strict on the difference between ISO-2022-JP
882 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
883 makes Anton happier :) FYI the previous version implements
884 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
885 ! lib/Encode/Supported.pod
888 Avoid core-dump in Encode with PERLIO=mmap by NI-S
889 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
894 ! lib/Encode/Suppoted.pod
895 pod fixes to replace F<http://...> to L<http://...>,
896 as suggested by Autrijius in:
897 Message-Id: <20020324083943.GA14901@not.autrijus.org>
898 ! lib/Encode/Suppoted.pod
899 fixes and enhancements by Anton
900 Message-Id: <10632060120.20020324103753@motor.ru>
901 ! lib/Encode/Alias.pm
902 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
903 added. Suggested by Anton then deobfuscated by Autrijius
904 Message-Id: <20020324064455.GA3667@not.autrijus.org>
906 Further fix by Nicholas Clark
907 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
908 - lib/EncodeFormat.pod
909 + lib/Encode/EncFormat.pod
911 File renamed as suggested by Autrijius
913 ! lib/Encode/Details.pod
914 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
915 ! Encode.pm Sun Mar 24 13:43:47 2002
916 pod fixes by Autrijius.
917 Message-Id: <20020324062804.GA3595@not.autrijus.org>
918 Message-Id: <20020324075627.GB11986@not.autrijus.org>
920 ! lib/Encode/Alias.pm
922 now more EBCDIC conscious;
923 %ExtModules on EBCDIC system excludes CJK so that you don't
924 have to worry about the matched alias resulting cloaking.
925 t/Alias.t also revised to reflect changes. Verified by jhi
926 Message-Id: <20020324022929.D22596@alpha.hut.fi>
932 EBCDIC detection mechanism installed as in JP/JP.pm
933 Message-Id: <20020323211847.G19148@alpha.hut.fi>
941 Now all table files used by compile are postfixed '_t' to avoid
942 namespace collisions in case insensitive file systems once for all!
944 Message-ID: <58290227735.20020323195659@familiehaase.de>
946 Since the Encode::JP is unsupported under EBCDIC we
947 cannot run this test (aliases as such should work fine) -- jhi
948 Message-Id: <20020323202119.D19148@alpha.hut.fi>
950 duplicate occurance of ascii.ucm and 8859-1.ucm
951 causes MacOS X dlyd to cloak
957 < chdir 't' if -d 't';
959 > if (! -d 'blib' and -d 't'){ chdir 't' };
960 When you are "make test"-ing on Encode/ directory, you must not
961 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
962 remain unchanced. Also the situation detection was made simpler
963 in t/JP.t, which was originally;
964 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
966 "Use of uninitialized value in string eq at Encode.pm line 96."
970 -- Problem on case insensitive file systems
971 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
972 Message-ID: <88254111953.20020323095503@familiehaase.de>
975 "So I think it's a bug in gcc, not perl. But it still needs to be
977 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
978 Message-Id: <20020323170509.C96475@plum.flirble.org>
982 ! lib/Encode/Encoding.pm
983 ! lib/Encode/Alias.pm
984 ! lib/Encode/Supported.pod
986 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
987 Message-ID: <20020322073908.GB10539@blackrider>
990 "...I think we should include ISO 8859-1 as well." -- NI-S
991 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
996 ! lib/Encode/Alias.pm
997 alias definitions relocated to Encode::Alias so module autoloading
998 works for aliases also.
1000 encodings() now accepts args to check ExtModules.
1004 + EBCDIC/Makefile.PL
1005 + Symbol/Makefile.PL
1009 Latin and single byte encodings are reorganized so they are
1010 demand-loaded like Encode::XX. Now only ascii is compiled into
1012 ! lib/Encode/Alias.pm
1013 for my $k (keys %hash){ delete $hash{$k}; }
1014 is depreciated; fixed.
1016 0.95 Fri Mar 22 2002
1017 In this update, pod rewrites and alias fixes are the main issues
1018 + lib/Encode/Supported.pod
1019 Describes supported encodings
1021 streamlined compiled-in encodings.
1022 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1024 + Encode/ibm-125?.ucm
1025 Added from icu distibution with any occurance of
1026 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1027 some respect to icu staff, however.
1028 + lib/Encode/Alias.pm
1030 Alias difinitions in Encode.pm relocated.
1033 packWARN patch from Paul Marquess via jhi
1034 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1035 Paul added to AUTHORS as a result.
1036 ! t/CJKalias.t -> t/Aliases.t
1037 Renamed. Checks even more aliases and alias overloading
1040 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1041 gbk => cp936 relocated to CN.pm
1043 Test::More with plans (by jhi)
1045 0.94 Thu Mar 21 2002
1046 + lib/Encode/Description.pod
1047 ! lib/Encode/Encoding.pm
1048 Now the pod in Encode.pm is abridged as programming references.
1049 lib/Encode/Description.pod contains the original, detailed description
1050 and Encode::Encoding explains how to write your own module to
1051 add new encodings. So far, lib/Encode/Description.pod contains
1052 the whole pod once in Encode.pm. This is intentional.
1054 Pod revisions by Anton Tagunov
1055 Message-Id: <517178431.20020320174824@motor.ru>
1057 all occrance of Encode::Tcl::Extended removed including pod
1059 test now checks $encoding->name only; $encoding->{name} are
1060 no longer check to find the canonical name.
1061 ! lib/Encode/JP/JIS.pm
1062 ! lib/Encode/JP/ISO_2022_JP.pm
1063 ->name() added to be more compliant with API
1069 Patch by Autrijus to add aliases to TW and fixes to POD
1070 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1072 SADAHIRO Tomoyuki added as should. My apologies.
1074 0.93 Wed Mar 20 2002
1075 * First release to be uploaded to CPAN. For prehistoric changes,
1076 please see Changes file of perl distibution as well as
1077 perl-unicode@perl.org archive, available at:
1078 http://archive.develooper.com/perl-unicode@perl.org/
1080 Changes Since 0.92 includes;
1085 + Mention to perl-unicode@perl.org added
1087 + Encoding aliases added so you can feed locale names
1088 and MIME Charset="" directly.
1089 - Mention to JISX0212 removed because it's fixed
1092 + Encoding aliases added. Note TW is left untouched because
1093 euc-tw is not implemented in TW but in Encode::HanExtra.
1094 Autrijus, you may fix Encode::HanExtra.
1096 + to test encode aliases added