1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.72 2002/05/20 15:49:56 dankogai Exp dankogai $
6 $Revision: 1.72 $ $Date: 2002/05/20 15:49:56 $
8 Subject: [PATCH] Encode should be in perl-core library path
9 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
10 Message-Id: <20020520161201.A11019@alpha.hut.fi>
11 ! lib/Encode/MIME/Header.pm
12 Subject: [PATCH] Encode::MIME::Header
13 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
14 ! Encode/Makefile_PL.e2x
15 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
16 Message-Id: <20020519201031.GA1603@not.autrijus.org>
17 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
18 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
20 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
21 Subject: [PATCH] good day for WinCE port of perl.
22 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
25 Even more linting by Robin via @16532
27 Even more typecast by Sarathy in @16460
29 1.71 2002/05/07 16:22:42
31 even more typecasts by Robin
32 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
34 A very strange bug that was causing a bugus ucm -> C table
35 generation that was revealed by a UCM file that Andreas was
36 working. This is the king of wierdest bug I've encountered
37 in the course of Encode maintenance.
38 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
40 1.70 2002/05/06 10:26:48
42 Made more 'module-safe' with conjunction w/ 'no encoding'.
43 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
44 ! lib/Encode/Encoding.pm
45 'require Encode' because ->Define uses Encode::define_encoding();
46 problem and solution addressed by Miyagawa-kun
47 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
49 Cuts the frill to make djgpp happier, as suggested by Laszlo
50 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
52 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
53 Message-Id: <m3bsbug48n.fsf@anima.de>
55 1.69 2002/05/04 16:41:18
56 ! lib/Encode/MIME/Header
57 Floating-point coerced for UNICOS (in integer arithmetics it folds
58 line one character too early). Verification by Mark is pending.
59 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
61 more doc patch from Elizabeth
62 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
63 ! Encode/Makefile_PL.e2x
64 More platform-independent patch from Benjamin
65 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
66 ! lib/Encode/Guess AUTHORS
67 split regex fix by Graham Barr. Adds him to AUTHORS.
68 Message-Id: <20020504085419.E95940@valueclick.com>
69 ! Encode/Makefile_PL.e2x
70 enc2xs script discovery made smarter and more sensible, first cited
71 by Miyagawa-kun and further suggestions by Rafael and Andreas
72 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
73 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
75 1.68 2002/05/03 12:20:13
76 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
77 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
78 and Supported.pod modified to reflect the change. Elizabeth added
79 to Authors. And H.M. is also added for forwarding her patch among
80 other contributions (I was rather surprised to find his name was not
82 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
84 1.67 2002/05/02 07:33:09
86 Error message now consistent w/ perlqq (\N{U+} -> \x{})
87 done in perl@16308 but Philip linted me further. Now the error
88 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
90 Sanity check for happier -w by Autrijus
92 1.66 2002/05/01 05:41:06
93 ! Encode.xs t/fallback.t
94 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
95 while fallback is in effect. This even came with a welcome side-effect
96 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
97 also modified to test this.
98 And of course, the corresponding varialbles to UV[Xx]f are appropriately
99 cast. This should've concluded NI-XS homework.
101 encode(undef) does warn again! Repented upon suggestion by NI-XS.
102 Document for unless vs. '' added
103 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
105 1.65 2002/04/30 16:13:37
107 encode(undef) no longer warns for C<Use of uninitialized value in
108 subroutine entry>. Suggested by Paul.
109 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
110 ! lib/Encode/Supported.pod
111 Encode::MIME::Header and Encode::Guess mentioned
112 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
113 ! lib/Encode/Guess.pm
114 POD fix by Miyagawa-kun
115 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
117 1.64 2002/04/29 06:54:06
119 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
120 encocde("euc-jisx0213") are still DIFFERENT.
121 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
123 A few white spaces corrected by NI-XS via PerlIO integration to
125 Subject: Change 16247: Integrate perlio;
127 Document fixes by Andreas
128 Message-Id: <m3k7qsf1we.fsf@anima.de>
130 1.63 2002/04/27 18:59:50
131 ! lib/Encode/Encoding.pm
132 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
133 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
134 Make use of the Encode::Encoding base class!
135 And other cleanups in Encode.xs upon NI-XS suggestions
136 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
138 1.62 2002/04/27 11:17:39
140 encodings() now just check %ExtModule instead of eval{require}
141 all of them for ":all" to conserve more memory.
143 more "%x" -> "%" UVxf stuff.
145 s/=over2/=over 2/g # oops.
147 1.61 2002/04/26 03:02:04
149 Now does decent tests besides use_ok()
150 ! lib/Encode/Guess.pm t/guess.t
151 UI streamlined, document added
153 various signed/unsigned mismatch nits (#16173)
154 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
156 POD: utf8-flag-related caveats added. A few sections completely
160 Thou shalt not assume %d works, either!
161 Robin Baker added to AUTHORS for this
162 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
164 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
166 1.60 2002/04/24 20:06:52
168 "Thou shalt not assume %x works." -- jhi
169 Message-Id: <20020424210618.E24347@alpha.hut.fi>
170 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
171 low-memory build machines happy, now *.c is created for each *.ucm
172 (no table aggregation). You can still override this by setting
173 $ENV{AGGREGATE_TABLES}.
174 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
175 + lib/Encode/Guess.pm
176 + lib/Encode/JP/JIS7.pm
177 Encoding-autodetect (mainly for Japanese encoding) added. In a
178 course of development, JIS7.pm was improved.
179 + lib/Encode/HTML/Header.pm
180 + lib/Encode/Config.pm
181 MIME B/Q Header Encoding Added!
182 ! Encode.pm Encode.xs t/fallback.t
183 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
184 Message-Id: <20020424130709.GA14211@tanglefoot>
186 1.59 $ 2002/04/22 23:54:22
187 ! Encode.pm Encode.xs
188 needs_lines() and perlio_ok() are added to Internal encodings such
189 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
191 1.58 2002/04/22 23:54:22
193 s/MacChineseSimp/MacChineseTrad/ # ... oops.
198 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
199 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
200 t/(JP|KR).t are aggregated to t/CJKT.t
201 test data are all remade via bin/ucm2text.
202 And .... They are no longer skipped for -Uuseperlio !
204 1.57 2002/04/22 20:27:30
205 ! t/JP.t t/KR.t t/perlio.t
206 unless (find PerlIO::Layer 'perlio') ... line is back again.
207 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
208 need PerlIO. Sigh....
209 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
210 ->perlio_ok now does eval{ require PerlIO::encoding } there so
211 it correctly returns 1 when PerlIO::encoding is yet loaded.
213 perl-current patch #16072 reflected
215 1.56 2002/04/22 09:48:07
216 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
217 New PerlIO::encoding 0.04 compliance met
219 1.55 2002/04/22 03:43:05
220 ! Encode.pm Encode.xs Unicode/Unicode.pm
221 needs_lines() defined so Encode::Encoding is no longer needed
224 1.54 2002/04/22 02:50:01
225 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
226 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
227 now perlio_ok is true by default if PerlIO::encoding->VERSION is
228 0.03 or larger. POD in Encode::Encoding revised to reflect this.
229 Encode::XS and Encode::Unicode now has perlio_ok() method.
230 ! lib/Encode/Supported.pod
231 s/UP-UX/HP-UX/ by jhi
232 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
233 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
234 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
235 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
236 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
237 ! lib/Encode/Supported.pod
238 Huge document fixes by Philip.
241 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
242 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
244 binmode() after "<:encoding" to make Win32 happy, by Mattia.
245 Mattia added to AUTHORS file
246 Message-Id: <3CC3150F.5798.22A05AE@localhost>
248 1.52 2002/04/20 23:43:47
250 TODO: is now SKIP:, as NI-XS requested. Also adds more
251 eraborate failure analysis added.
253 A note on how to make sure of round-trip safety added to POD
254 section (so Autrijus is happier)
255 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
256 big5-(eten|hkscs) is round-trip safe again!
257 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
259 Typo fixes by Andreas
260 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
261 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
262 PerIO coodination patches from NI-XS.
263 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
265 1.51 2002/04/20 09:58:23
267 Updated test suite by Autrijis so "make test" is happy again
268 Message-Id: <20020420082104.GA25037@not.autrijus.org>
270 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
272 TW/TW.pm TW/Makefile.PL
273 Updates by Autrijus. 'big5' is no longer a canonical but an
274 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
275 Message-Id: <20020419195346.GA19597@not.autrijus.org>
277 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
278 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
280 PerlIO detection a little bit smarter; no longer uses eval qq{}
283 1.50 2002/04/19 06:13:02
284 ! ! Encode.pm Encode.xs Encode/encoding.h
286 New Fallback API imlemented and documented. See "perldoc Encode"
288 ! lib/Encode/JP/JIS7.pm Encode.pm
289 + lib/Encode/PerlIO.pod t/perlio.t
290 API compliance met. However, it still does not work unless perlio
291 implements line buffer. See BUGS section in perldoc Encode::PerlIO
292 As a sensible workaround, perlio_ok() added to Encode.
294 ! lib/Encode/Supported.pod
296 Message-Id: <20020418174647.J8466@alpha.hut.fi>
298 Doc fixes from Autrijus
299 Message-Id: <20020418144131.GA10987@not.autrijus.org>
301 perlqq mode documented
303 + t/jisx0201.euc t/jisx0201.ref
304 ! t/jisx0208.euc t/jisx0208.ref
305 t/JP.t tests more rigorously and with other encodings
306 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
307 in jis0208 (used to contain jisx0201 part).
308 ! Encode/Makefile_PL.e2x
309 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
310 enc2xs and encode.h rather than hard-coded. This allows the resulting
311 module fully CPANizable.
312 ! encoding.pm t/JP.t t/KR.t
313 PerlIO detection simplified (checks %INC instead of eval{})
314 ! Encode.xs Encode/encode.h
315 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
316 - lib/Encode/Unicode.pm
317 (en|de)code_xs relocated to where it belongs. Source reindented
320 Additional (U8 *) cast added as suggested by jhi
321 Message-Id: <20020417165916.A28599@alpha.hut.fi>
323 1.42 Date: 2002/04/17
325 no-op module; Thought of adding a pod there but enc2xs has
330 correct mechanism to detect Perlio::encoding layar installed.
332 PerlIO Layer detached.
334 1.41 2002/04/16 23:35:00
336 binmode(STDIN|STDOUT ...) done iff PerlIO is available
338 Cleaned up PerlIO skip conditions to prepare for the upcoming
339 Encode - PerlIO forking.
341 exported functions are now prototyped.
342 ! lib/Encode/CN/HZ.pm
345 fallback implemented # was /* FIXME */
346 affected programs revised to fit (only HZ was using the try-catch
347 approach which needed to be fixed for API-compliance).
349 ! Encode/KR/2022_KR.pm
351 can find =head1 NAME now, jhi
352 Message-Id: <20020416083059.V30639@alpha.hut.fi>
356 now complies with less warnings with the pickest compilers.
357 Suggested by Craig, fixed by Dan.
358 ! Encode/Makefile_PL.e2x
360 A bug that fails to find *.e2x in certain conditions fixed
362 1.40 2002/04/14 22:27:14
363 + Encode/ConfigLocal_PM.e2x
364 ! lib/Encode/Config.pm
366 "enc2xs -C" now generates/updates Encode::ConfigLocal.
367 ConfigLocal_PM.e2x is a skelton thereof.
368 ! lib/Encode/Config.pm
370 "use Encode::CN::HZ;" was missing.
373 More rigorous tests added to test XS, especially on memory allocation.
375 ! lib/Encode/Unicode.pm
376 NI-S implemented an XS version -- merged
377 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
380 Source filter option added. With this option on, you can write
381 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
382 encodings. t/jperl.t enhanced to test this feature.
384 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
385 apply his suggestion, this degree of nitting is enough to add him
387 Message-Id: <3CB93223.291E5E2E@earthlink.net>
389 + lib/Encode/JP/JIS7.pm
390 - lib/Encode/JP/JIS.pm
391 - lib/Encode/JP/2022_JP.pm
392 - lib/Encode/JP/2022_JP1.pm
393 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
394 JIS7.pm for better maintainability and performance
396 Added caveat for non-ascii identifiers.
398 fixes by jhi, the original author of this pragramtic module.
399 Message-Id: <20020413231527.V1826@alpha.hut.fi>
401 1.34 2002/04/12 20:23:05 (Unreleased)
404 EBCDIC fixes addressed by jhi.
405 Message-Id: <20020412161844.D9383@alpha.hut.fi>
406 ! lib/Encode/Encoder.pm
407 POD fix by Miyagawa-kun
408 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
410 1.33 2002/04/10 22:28:40
412 Philip's mail address corrected.
415 ! lib/Encode/Encoder.pm
416 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
417 This adds Miyagawa-kun to AUTHORS.
418 * encoding() no longer exported by default but on demand
419 * t/Encoder.t updated to test all these
420 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
421 ! lib/Encode/Unicode.pm
422 ! lib/Encode/Supported.pm
423 Further doc fixes by Anton
425 1.32 2002/04/09 20:06:15
428 - ucm/macDevanaga.ucm Unicode Character Map
429 - ucm/macGujarati.ucm Unicode Character Map
430 - ucm/macGurmukhi.ucm Unicode Character Map
431 A utility to check integrity of .ucm files. t/bogus.ucm is a
432 ucm that is deliberately bogus. unused Indic mappings are removed
435 resolve_alias() added as suggested by jhi. Same as
436 find_encoding("alias")->name. For convenience. This one is
437 defined in Encode.pm instead of Alias.pm.
438 Message-Id: <20020409215846.H17022@alpha.hut.fi>
440 Memory Allocate but detected during the devel of ucmlint -- fixed.
441 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
442 ! lib/Encode/Unicode.pm
443 valid_ucs2(0) is false but must be true.
444 3 patches from NI-S as follows. This also has fixed the incident
446 ! lib/Encode/Alias.pm
447 find_alias() recursion prevention
449 Checks for the patch above
450 ! t/Encode/Unicode.pm
451 An extra "F" that causes valid_ucs2() return a bogus value fixed
452 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
453 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
454 2 Small Patches from jhi as follows:
456 Encode->encodings() lists in case-insensitve order (as it was)
458 -l option prints avaiable encodings to STDOUT instead of STDERR
459 ! lib/Encode/Aliases.pm
460 s/defintion/definition/
461 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
463 ! lib/Encode/Supported.pod
464 ! lib/Encode/Unicode.pm
465 POD revise by Philip Newton. This adds Philip to AUTHORS list.
466 Thank you for the exact quote of Douglas Adams :)
467 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
469 1.31 2002/04/08 18:08:07
470 ! lib/Encode/Encoder.pm
472 Encode::Encoder, once just a placeholder of an idea, is now much more
473 practical. See t/Encode.t to find how practical it can be.
474 + lib/Encode/Config.pm
476 my false laziness at Encode.pm is fixed. Now %ExtModules are set
477 in Encode::Config and they are all literally, not programatically
478 set. My false laziness was resulting many encodings missing from
480 ! lib/Encode/Unicode.pm
482 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
483 so that it does not rely Encode::Unicode for BOM values
484 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
486 1.30 2002/04/08 02:34:51
487 + lib/Encode/Encoder.pm
488 Object Oriented Encoder. I reckon something like this is in need.
491 ! lib/Encode/Supported.pod
492 * autoloading bug that prevented upper-case canonicals such as UTF-16
493 is fixed. Now even UTF/UCS are autoloaded!
494 * encodings() is now more intuitive.
495 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
497 * Obligatory fixes to the POD.
498 ! lib/Encode/Supported.pod
499 Patch from Anton applied.
500 Message-Id: <66641479.20020408033300@motor.ru>
502 ! lib/Encode/Unicode.pm
503 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
505 1.28 2002/04/07 18:58:42
509 Just a MANIFEST for those missing files.
511 1.26 Date: 2002/04/07 15:22:04
514 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
515 And t/Aliases.t fixed to make sure they all exist.
517 1.25 2002/04/07 15:01:25 (Unreleased)
519 ! lib/Encode/Unicode.pm
522 - lib/Encode/UTF_EBCDIC.pm
523 - lib/Encode/Internal.pm
525 Integrated into Encode.pm as closures. That way "one package, one file"
526 rule is preserved yet less files to require.
528 commented out binmode(STDERR ...
537 ! Encode/Makefile_PL.e2x
538 Schwarn's MM-compliance patch merged
539 Message-Id: <20020406082609.GA28758@blackrider>
541 ! lib/Encode/Unicode.pm
542 + lib/Encode/UTF_EBCDIC.pm
544 - lib/Encode/10646_1.pm
545 - lib/Encode/ucs2_le.pm
546 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
547 per-module based (en|de)code, I saved a number of .pm by
548 reorganizing it as per-object base (Well, this is what Encode::XS
549 does under the hood). See Encode::Unicode for details.
550 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
551 This module is used only on EBCDIC environments.
553 1.21 2002/04/05 14:46:34 (Not Released)
559 Are back to make Perl/Tk happy Smile, NI-S.
561 ! lib/Encode/Alias.pm
562 ! lib/Encode/Supported.pm
563 ! lib/Encode/10646_1.pm
564 ! lib/Encode/ucs2_le.pm
565 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
566 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
567 updated to reflect changes.
568 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
569 ! lib/Encode/Alias.pm
570 ! lib/Encode/Supported.pm
571 Anton's revision commited. Added Dan's own fixes as well.
572 Message-Id: <159103166906.20020405161134@motor.ru>
573 ! lib/Encode/Alias.pm
575 < qr/^UCS2-le$/i => '"UCS-2"', );
577 > qr/^UCS2-LE$/i => '"UTF-16LE"');
578 Sigh. Thank you, Anton.
579 Message-Id: <14567692196.20020405062020@motor.ru>
580 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
582 1.20 2002/04/04 19:50:52
584 the last minute addtion. Just give it a try. Docs remains to be done.
585 Not installed by default.
586 ! lib/Encode/Supported.pod
589 ! lib/Encode/Alias.pm
591 ! lib/Encode/10464_1.pm
592 ! lib/Encode/ucs2_le.pm
593 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
594 unchanged but UTF-16BE is added as an alias. Implicit aliases
595 move to Encode::Alias so init_alias() works more as expected.
596 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
597 Though pedantically wrong, This should make perl mongers happier.
598 t/Alias.t is enhanced to test all these.
599 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
601 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
602 of the codepoints are common. CJKT left as is because the saving is
608 ! Encode/Makefile_PL.e2x
616 All occurance of _def.h replaced with .exh so djgpp works happily
617 ever after! To credit this amazing discovery, Laszlo is now in
619 Message-Id: <20020403181424.GA8778@freemail.hu>
620 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
623 ! Encode/Makefile_PL.skel
625 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
626 Message-Id: <20020401222744.GX2000@blackrider>, et al.
628 Two more tests by added jhi
629 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
632 The showstopper fixed -- Memory reallocation bug was causing
633 Encode::XS to fall into infinite loop on certain conditions.
635 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
640 ! lib/Encode/Supported.pod
641 Vendor encodings rebuilt out of original map files at unicode.org.
642 Indic languages such as MacDevanagali remain unspported do to the
643 shortcoming of encengine capabilities (they need algorithmical
644 conversion and I have no knowledge on that!). Pods fixed for added
646 Oh, macJapan.ucm renamed to macJapanese.ucm.
647 macROMnn is macRomanian and macRUMnn is macRumanian.
648 txt2ucm is a crude script that is used to convert them.
650 Unicode Compound Characters (used extensively on Mac) supported
652 Typo fixes and improvements by jhi
653 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
655 1.11 $Date: 2002/05/20 15:49:56 $
659 Missing files from the MANIFEST fixed.
660 Message-Id: <20020401010156.H10509@alpha.hut.fi>
661 Version incremented just to make CPAN happy.
663 1.10 2002/03/31 21:32:42
666 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
667 if you want. This should make Autrijus happy. Also, piconv
668 is added to default install.
671 Here-documented files that enc2xs generates are now exported
672 to *.e2x. Much cleaner and easier to debug.
674 encoding enhances so you can make it act more like such
675 (now prehistoric ) "localized" variations of perl like Jperl.
677 Further test for encoding.pm. Written in euc-jp
681 Message-Id: <20020330174618.B10154@alpha.hut.fi>
686 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
693 ! Encode/macIceland.ucm
694 ! lib/Encode/Alias.pm
695 ! lib/Encode/Supported.pod
696 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
697 Message-Id: <m3lmcavhjt.fsf@anima.de>
699 1.01 2002/03/29 20:59:39
702 s/USE_SCRIPTS/MORE_SCRIPTS/
704 installs enc2xs by default for external Encode:: modules in CPAN,
705 such as Encode::HanExtra
707 More sensible perl core detection via $ENV{PERL_CORE}
709 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
711 Perl core ditection via $^X =~ m/\bminiperl$/o
712 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
716 The version of all files is updated to 1.00 via "ci -f -l1.00",
717 commemorating version 1.00. All files, including *.ucm are now
718 under version control.
721 encode.h moved to Encode/ so it will be installed for the later
724 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
729 compile renamed to enc2xs.
730 Affected Makefle.PL updated
732 "Punt it. HanExtra can take care of that later." -- Autrijus
733 Message-Id: <20020328154338.GA7351@not.autrijus.org>
737 ! lib/Encode/CJKConstants.pm
738 ! lib/Encode/KR/2022_KR.pm
739 Table patches for Euro Signs, 2022-KR fixups by Jungshik
740 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
744 bin/ added for example scripts. They are not installed by default.
745 to install them, "perl Makefile.PL USE_SCRIPTS".
746 piconv is iconv reinvented in perl. in addition to all features
747 of iconv, it also adds perlish features. See L<piconv/1> for more
749 ! lib/Encode/Alias.pm
750 qr/^ replaced with qr/\b so it directly matches locale names
751 such as en_US.US-ASCII
754 Patch by MJD to fix the following problem applied.
755 Subject: [PATCH 5.7.3 Encode]
756 Aliases.t not properly skipped when Encode extension not built
757 Message-Id: <20020328091850.18677.qmail@plover.com>
758 ! lib/Encode/KR/2022_KR.pm
759 ! lib/Encode/CJKConstants.pm
760 Another patch from Jungshik to make iso-2022-kr actually work
761 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
762 ! Encode/Encode/euc-kr.ucm
763 + Encode/Encode/johab.ucm
764 ! Encode/Encode/ksc5601.ucm
766 ! Encode/KR/Makefile.PL
767 ! Encode/lib/Encode/Alias.pm
769 Johab support and complete revision of Korean Encoding by Jungshik
770 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
772 Revised to make up with now-dropped Encode::Details.
773 - lib/Encode/Details.pod
774 Dropped. Besides being obsolete, the topics are now covered in
780 Korean aliases fixed thanks to Jungshik Shin
781 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
782 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
784 =head1 NAME added to all modules to make buildtoc happy
785 Message-Id: <20020327041151.A10618@alpha.hut.fi>
786 - lib/Encode/CJKguide.pod
787 Too controversial and dropped from the dist. Will be available
788 separately on the web.
790 RCS tags added so table debugging gets easier (should that be
791 needed! I hope they all stay 1.00!)
792 + lib/Encode/CJKguide.pod
793 A detailed guide to mainly, but not limited to, CJK multibyte
796 + Encode/hp-roman8.ucm
798 ! Encode/Supported.pod
799 All occurance of "roman8" replaced with "hp-roman8" to avoid
801 ! Encode/Supported.pod
804 Mac Encodings now comply the Inside Macintosh
806 Test for '-raw' conventions added.
808 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
812 "-raw" appended to canonical names.
813 File mames stay unchanged thanks to UCM format.
814 ! lib/Encode/CN/HZ.pm
815 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
816 Message-Id: <20020326035210.GA2091@not.autrijus.org>
819 - lib/Encode/JP/Const.pm
820 + lib/Encode/CJKConstants.pm
821 + lib/Encode/CN/2022_CN.pm
822 + lib/Encode/KR/2022_KR.pm
831 * Support for ISO-2022-KR and ISO-2022-CN added.
833 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
834 * ucm2table autogenerates character table out of UCM files.
837 - lib/Encode/Supports.pod
838 + lib/Encode/Supported.pod
839 Names reverted due to popular demand.
840 8.3 rule applies only when there is a conflict.
841 Message-Id: <20020325095924.GD44120@not.autrijus.org>
846 - lib/Encode/Format/Enc.pod
848 * Character tables is now 100% ucm.
849 * All files under Encode/ is now 8.3-compliant
850 * some of missing encodings added (i.e. gsm0338 and nextstep)
851 * Vendor mappings aggregated with appropriate national std in
852 Makefile.PL, resulting smaller *.so especially for CJK.
853 Following is result on Dan's FreeBSD box.
855 ---------------------------------------------------------------
856 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
857 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
858 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
859 blib/arch/auto/Encode/Encode.so 27,791 27,791
860 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
861 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
862 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
863 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
864 ---------------------------------------------------------------
865 Total 5,375,297 6,343,819
867 * As a result of ucm-transition, Encode::Tcl dropped because
868 Encode::Tcl demands *.enc.
869 Encode::Tcl will be supplied in a separate tarball with *.enc.
870 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
875 -lib/Encode/Supported.pod
876 +lib/Encode/Supports.pod
877 -lib/Encode/iso10646_1.pm
878 +lib/Encode/10646_1.pm
879 -lib/Encode/EncFormat.pod
880 +lib/Encode/Format/Enc.pod
881 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
882 - lib/Encode/JP/Constants.pm
883 + lib/Encode/JP/Consts.pm
884 ! lib/Encode/JP/JIS.pm
885 ! lib/Encode/JP/H2Z.pm
886 Version nit problem and 8.3 rule fix.
887 > Package namespace installed latest in CPAN file
888 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
889 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
890 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
891 accordingly. In addition, legacy "use vars qw()..." are replaced with
893 Message-Id: <20020325011248.D1561@alpha.hut.fi>
894 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
896 - lib/Encode/JP/ISO_2022_JP.pm
897 - lib/Encode/JP/ISO_2022_JP_1.pm
898 + lib/Encode/JP/2022_JP.pm
899 + lib/Encode/JP/2022_JP1.pm
901 8.3 naming conflict for vanilla fat addressed by jhi
902 Message-Id: <20020324201931.V22596@alpha.hut.fi>
905 Typecast fix addressed by jhi
906 Message-Id: <20020324185540.T22596@alpha.hut.fi>
909 ! lib/Encode/Supported.pod
911 + lib/Encode/JP/ISO_2022_JP_1.pm
912 ! lib/Encode/JP/ISO_2022_JP.pm
913 ! lib/Encode/JP/JIS.pm
915 Now Encode::JP is more strict on the difference between ISO-2022-JP
916 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
917 makes Anton happier :) FYI the previous version implements
918 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
919 ! lib/Encode/Supported.pod
922 Avoid core-dump in Encode with PERLIO=mmap by NI-S
923 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
928 ! lib/Encode/Suppoted.pod
929 pod fixes to replace F<http://...> to L<http://...>,
930 as suggested by Autrijius in:
931 Message-Id: <20020324083943.GA14901@not.autrijus.org>
932 ! lib/Encode/Suppoted.pod
933 fixes and enhancements by Anton
934 Message-Id: <10632060120.20020324103753@motor.ru>
935 ! lib/Encode/Alias.pm
936 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
937 added. Suggested by Anton then deobfuscated by Autrijius
938 Message-Id: <20020324064455.GA3667@not.autrijus.org>
940 Further fix by Nicholas Clark
941 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
942 - lib/EncodeFormat.pod
943 + lib/Encode/EncFormat.pod
945 File renamed as suggested by Autrijius
947 ! lib/Encode/Details.pod
948 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
949 ! Encode.pm Sun Mar 24 13:43:47 2002
950 pod fixes by Autrijius.
951 Message-Id: <20020324062804.GA3595@not.autrijus.org>
952 Message-Id: <20020324075627.GB11986@not.autrijus.org>
954 ! lib/Encode/Alias.pm
956 now more EBCDIC conscious;
957 %ExtModules on EBCDIC system excludes CJK so that you don't
958 have to worry about the matched alias resulting cloaking.
959 t/Alias.t also revised to reflect changes. Verified by jhi
960 Message-Id: <20020324022929.D22596@alpha.hut.fi>
966 EBCDIC detection mechanism installed as in JP/JP.pm
967 Message-Id: <20020323211847.G19148@alpha.hut.fi>
975 Now all table files used by compile are postfixed '_t' to avoid
976 namespace collisions in case insensitive file systems once for all!
978 Message-ID: <58290227735.20020323195659@familiehaase.de>
980 Since the Encode::JP is unsupported under EBCDIC we
981 cannot run this test (aliases as such should work fine) -- jhi
982 Message-Id: <20020323202119.D19148@alpha.hut.fi>
984 duplicate occurance of ascii.ucm and 8859-1.ucm
985 causes MacOS X dlyd to cloak
991 < chdir 't' if -d 't';
993 > if (! -d 'blib' and -d 't'){ chdir 't' };
994 When you are "make test"-ing on Encode/ directory, you must not
995 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
996 remain unchanced. Also the situation detection was made simpler
997 in t/JP.t, which was originally;
998 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1000 "Use of uninitialized value in string eq at Encode.pm line 96."
1001 ! Symbol/Makefile.PL
1002 ! EBCDIC/Makefile.PL
1004 -- Problem on case insensitive file systems
1005 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1006 Message-ID: <88254111953.20020323095503@familiehaase.de>
1009 "So I think it's a bug in gcc, not perl. But it still needs to be
1011 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1012 Message-Id: <20020323170509.C96475@plum.flirble.org>
1014 0.96 Sat Mar 23 2002
1016 ! lib/Encode/Encoding.pm
1017 ! lib/Encode/Alias.pm
1018 ! lib/Encode/Supported.pod
1020 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1021 Message-ID: <20020322073908.GB10539@blackrider>
1024 "...I think we should include ISO 8859-1 as well." -- NI-S
1025 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1030 ! lib/Encode/Alias.pm
1031 alias definitions relocated to Encode::Alias so module autoloading
1032 works for aliases also.
1034 encodings() now accepts args to check ExtModules.
1038 + EBCDIC/Makefile.PL
1039 + Symbol/Makefile.PL
1043 Latin and single byte encodings are reorganized so they are
1044 demand-loaded like Encode::XX. Now only ascii is compiled into
1046 ! lib/Encode/Alias.pm
1047 for my $k (keys %hash){ delete $hash{$k}; }
1048 is depreciated; fixed.
1050 0.95 Fri Mar 22 2002
1051 In this update, pod rewrites and alias fixes are the main issues
1052 + lib/Encode/Supported.pod
1053 Describes supported encodings
1055 streamlined compiled-in encodings.
1056 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1058 + Encode/ibm-125?.ucm
1059 Added from icu distibution with any occurance of
1060 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1061 some respect to icu staff, however.
1062 + lib/Encode/Alias.pm
1064 Alias difinitions in Encode.pm relocated.
1067 packWARN patch from Paul Marquess via jhi
1068 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1069 Paul added to AUTHORS as a result.
1070 ! t/CJKalias.t -> t/Aliases.t
1071 Renamed. Checks even more aliases and alias overloading
1074 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1075 gbk => cp936 relocated to CN.pm
1077 Test::More with plans (by jhi)
1079 0.94 Thu Mar 21 2002
1080 + lib/Encode/Description.pod
1081 ! lib/Encode/Encoding.pm
1082 Now the pod in Encode.pm is abridged as programming references.
1083 lib/Encode/Description.pod contains the original, detailed description
1084 and Encode::Encoding explains how to write your own module to
1085 add new encodings. So far, lib/Encode/Description.pod contains
1086 the whole pod once in Encode.pm. This is intentional.
1088 Pod revisions by Anton Tagunov
1089 Message-Id: <517178431.20020320174824@motor.ru>
1091 all occrance of Encode::Tcl::Extended removed including pod
1093 test now checks $encoding->name only; $encoding->{name} are
1094 no longer check to find the canonical name.
1095 ! lib/Encode/JP/JIS.pm
1096 ! lib/Encode/JP/ISO_2022_JP.pm
1097 ->name() added to be more compliant with API
1103 Patch by Autrijus to add aliases to TW and fixes to POD
1104 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1106 SADAHIRO Tomoyuki added as should. My apologies.
1108 0.93 Wed Mar 20 2002
1109 * First release to be uploaded to CPAN. For prehistoric changes,
1110 please see Changes file of perl distibution as well as
1111 perl-unicode@perl.org archive, available at:
1112 http://archive.develooper.com/perl-unicode@perl.org/
1114 Changes Since 0.92 includes;
1119 + Mention to perl-unicode@perl.org added
1121 + Encoding aliases added so you can feed locale names
1122 and MIME Charset="" directly.
1123 - Mention to JISX0212 removed because it's fixed
1126 + Encoding aliases added. Note TW is left untouched because
1127 euc-tw is not implemented in TW but in Encode::HanExtra.
1128 Autrijus, you may fix Encode::HanExtra.
1130 + to test encode aliases added