1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.69 2002/05/04 16:41:18 dankogai Exp dankogai $
6 $Revision: 1.69 $ $Date: 2002/05/04 16:41:18 $
7 ! lib/Encode/MIME/Header
8 Floating-point coerced for UNICOS (in integer arithmetics it folds
9 line one character too early). Verification by Mark is pending.
10 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
12 more doc patch from Elizabeth
13 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
14 ! Encode/Makefile_PL.e2x
15 More platform-independent patch from Benjamin
16 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
17 ! lib/Encode/Guess AUTHORS
18 split regex fix by Graham Barr. Adds him to AUTHORS.
19 Message-Id: <20020504085419.E95940@valueclick.com>
20 ! Encode/Makefile_PL.e2x
21 enc2xs script discovery made smarter and more sensible, first cited
22 by Miyagawa-kun and further suggestions by Rafael and Andreas
23 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
24 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
26 1.68 2002/05/03 12:20:13
27 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
28 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
29 and Supported.pod modified to reflect the change. Elizabeth added
30 to Authors. And H.M. is also added for forwarding her patch among
31 other contributions (I was rather surprised to find his name was not
33 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
35 1.67 2002/05/02 07:33:09
37 Error message now consistent w/ perlqq (\N{U+} -> \x{})
38 done in perl@16308 but Philip linted me further. Now the error
39 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
41 Sanity check for happier -w by Autrijus
43 1.66 2002/05/01 05:41:06
44 ! Encode.xs t/fallback.t
45 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
46 while fallback is in effect. This even came with a welcome side-effect
47 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
48 also modified to test this.
49 And of course, the corresponding varialbles to UV[Xx]f are appropriately
50 cast. This should've concluded NI-XS homework.
52 encode(undef) does warn again! Repented upon suggestion by NI-XS.
53 Document for unless vs. '' added
54 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
56 1.65 2002/04/30 16:13:37
58 encode(undef) no longer warns for C<Use of uninitialized value in
59 subroutine entry>. Suggested by Paul.
60 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
61 ! lib/Encode/Supported.pod
62 Encode::MIME::Header and Encode::Guess mentioned
63 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
65 POD fix by Miyagawa-kun
66 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
68 1.64 2002/04/29 06:54:06
70 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
71 encocde("euc-jisx0213") are still DIFFERENT.
72 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
74 A few white spaces corrected by NI-XS via PerlIO integration to
76 Subject: Change 16247: Integrate perlio;
78 Document fixes by Andreas
79 Message-Id: <m3k7qsf1we.fsf@anima.de>
81 1.63 2002/04/27 18:59:50
82 ! lib/Encode/Encoding.pm
83 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
84 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
85 Make use of the Encode::Encoding base class!
86 And other cleanups in Encode.xs upon NI-XS suggestions
87 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
89 1.62 2002/04/27 11:17:39
91 encodings() now just check %ExtModule instead of eval{require}
92 all of them for ":all" to conserve more memory.
94 more "%x" -> "%" UVxf stuff.
96 s/=over2/=over 2/g # oops.
98 1.61 2002/04/26 03:02:04
100 Now does decent tests besides use_ok()
101 ! lib/Encode/Guess.pm t/guess.t
102 UI streamlined, document added
104 various signed/unsigned mismatch nits (#16173)
105 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
107 POD: utf8-flag-related caveats added. A few sections completely
111 Thou shalt not assume %d works, either!
112 Robin Baker added to AUTHORS for this
113 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
115 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
117 1.60 2002/04/24 20:06:52
119 "Thou shalt not assume %x works." -- jhi
120 Message-Id: <20020424210618.E24347@alpha.hut.fi>
121 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
122 low-memory build machines happy, now *.c is created for each *.ucm
123 (no table aggregation). You can still override this by setting
124 $ENV{AGGREGATE_TABLES}.
125 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
126 + lib/Encode/Guess.pm
127 + lib/Encode/JP/JIS7.pm
128 Encoding-autodetect (mainly for Japanese encoding) added. In a
129 course of development, JIS7.pm was improved.
130 + lib/Encode/HTML/Header.pm
131 + lib/Encode/Config.pm
132 MIME B/Q Header Encoding Added!
133 ! Encode.pm Encode.xs t/fallback.t
134 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
135 Message-Id: <20020424130709.GA14211@tanglefoot>
137 1.59 $ 2002/04/22 23:54:22
138 ! Encode.pm Encode.xs
139 needs_lines() and perlio_ok() are added to Internal encodings such
140 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
142 1.58 2002/04/22 23:54:22
144 s/MacChineseSimp/MacChineseTrad/ # ... oops.
149 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
150 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
151 t/(JP|KR).t are aggregated to t/CJKT.t
152 test data are all remade via bin/ucm2text.
153 And .... They are no longer skipped for -Uuseperlio !
155 1.57 2002/04/22 20:27:30
156 ! t/JP.t t/KR.t t/perlio.t
157 unless (find PerlIO::Layer 'perlio') ... line is back again.
158 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
159 need PerlIO. Sigh....
160 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
161 ->perlio_ok now does eval{ require PerlIO::encoding } there so
162 it correctly returns 1 when PerlIO::encoding is yet loaded.
164 perl-current patch #16072 reflected
166 1.56 2002/04/22 09:48:07
167 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
168 New PerlIO::encoding 0.04 compliance met
170 1.55 2002/04/22 03:43:05
171 ! Encode.pm Encode.xs Unicode/Unicode.pm
172 needs_lines() defined so Encode::Encoding is no longer needed
175 1.54 2002/04/22 02:50:01
176 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
177 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
178 now perlio_ok is true by default if PerlIO::encoding->VERSION is
179 0.03 or larger. POD in Encode::Encoding revised to reflect this.
180 Encode::XS and Encode::Unicode now has perlio_ok() method.
181 ! lib/Encode/Supported.pod
182 s/UP-UX/HP-UX/ by jhi
183 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
184 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
185 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
186 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
187 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
188 ! lib/Encode/Supported.pod
189 Huge document fixes by Philip.
192 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
193 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
195 binmode() after "<:encoding" to make Win32 happy, by Mattia.
196 Mattia added to AUTHORS file
197 Message-Id: <3CC3150F.5798.22A05AE@localhost>
199 1.52 2002/04/20 23:43:47
201 TODO: is now SKIP:, as NI-XS requested. Also adds more
202 eraborate failure analysis added.
204 A note on how to make sure of round-trip safety added to POD
205 section (so Autrijus is happier)
206 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
207 big5-(eten|hkscs) is round-trip safe again!
208 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
210 Typo fixes by Andreas
211 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
212 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
213 PerIO coodination patches from NI-XS.
214 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
216 1.51 2002/04/20 09:58:23
218 Updated test suite by Autrijis so "make test" is happy again
219 Message-Id: <20020420082104.GA25037@not.autrijus.org>
221 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
223 TW/TW.pm TW/Makefile.PL
224 Updates by Autrijus. 'big5' is no longer a canonical but an
225 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
226 Message-Id: <20020419195346.GA19597@not.autrijus.org>
228 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
229 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
231 PerlIO detection a little bit smarter; no longer uses eval qq{}
234 1.50 2002/04/19 06:13:02
235 ! ! Encode.pm Encode.xs Encode/encoding.h
237 New Fallback API imlemented and documented. See "perldoc Encode"
239 ! lib/Encode/JP/JIS7.pm Encode.pm
240 + lib/Encode/PerlIO.pod t/perlio.t
241 API compliance met. However, it still does not work unless perlio
242 implements line buffer. See BUGS section in perldoc Encode::PerlIO
243 As a sensible workaround, perlio_ok() added to Encode.
245 ! lib/Encode/Supported.pod
247 Message-Id: <20020418174647.J8466@alpha.hut.fi>
249 Doc fixes from Autrijus
250 Message-Id: <20020418144131.GA10987@not.autrijus.org>
252 perlqq mode documented
254 + t/jisx0201.euc t/jisx0201.ref
255 ! t/jisx0208.euc t/jisx0208.ref
256 t/JP.t tests more rigorously and with other encodings
257 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
258 in jis0208 (used to contain jisx0201 part).
259 ! Encode/Makefile_PL.e2x
260 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
261 enc2xs and encode.h rather than hard-coded. This allows the resulting
262 module fully CPANizable.
263 ! encoding.pm t/JP.t t/KR.t
264 PerlIO detection simplified (checks %INC instead of eval{})
265 ! Encode.xs Encode/encode.h
266 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
267 - lib/Encode/Unicode.pm
268 (en|de)code_xs relocated to where it belongs. Source reindented
271 Additional (U8 *) cast added as suggested by jhi
272 Message-Id: <20020417165916.A28599@alpha.hut.fi>
274 1.42 Date: 2002/04/17
276 no-op module; Thought of adding a pod there but enc2xs has
281 correct mechanism to detect Perlio::encoding layar installed.
283 PerlIO Layer detached.
285 1.41 2002/04/16 23:35:00
287 binmode(STDIN|STDOUT ...) done iff PerlIO is available
289 Cleaned up PerlIO skip conditions to prepare for the upcoming
290 Encode - PerlIO forking.
292 exported functions are now prototyped.
293 ! lib/Encode/CN/HZ.pm
296 fallback implemented # was /* FIXME */
297 affected programs revised to fit (only HZ was using the try-catch
298 approach which needed to be fixed for API-compliance).
300 ! Encode/KR/2022_KR.pm
302 can find =head1 NAME now, jhi
303 Message-Id: <20020416083059.V30639@alpha.hut.fi>
307 now complies with less warnings with the pickest compilers.
308 Suggested by Craig, fixed by Dan.
309 ! Encode/Makefile_PL.e2x
311 A bug that fails to find *.e2x in certain conditions fixed
313 1.40 2002/04/14 22:27:14
314 + Encode/ConfigLocal_PM.e2x
315 ! lib/Encode/Config.pm
317 "enc2xs -C" now generates/updates Encode::ConfigLocal.
318 ConfigLocal_PM.e2x is a skelton thereof.
319 ! lib/Encode/Config.pm
321 "use Encode::CN::HZ;" was missing.
324 More rigorous tests added to test XS, especially on memory allocation.
326 ! lib/Encode/Unicode.pm
327 NI-S implemented an XS version -- merged
328 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
331 Source filter option added. With this option on, you can write
332 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
333 encodings. t/jperl.t enhanced to test this feature.
335 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
336 apply his suggestion, this degree of nitting is enough to add him
338 Message-Id: <3CB93223.291E5E2E@earthlink.net>
340 + lib/Encode/JP/JIS7.pm
341 - lib/Encode/JP/JIS.pm
342 - lib/Encode/JP/2022_JP.pm
343 - lib/Encode/JP/2022_JP1.pm
344 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
345 JIS7.pm for better maintainability and performance
347 Added caveat for non-ascii identifiers.
349 fixes by jhi, the original author of this pragramtic module.
350 Message-Id: <20020413231527.V1826@alpha.hut.fi>
352 1.34 2002/04/12 20:23:05 (Unreleased)
355 EBCDIC fixes addressed by jhi.
356 Message-Id: <20020412161844.D9383@alpha.hut.fi>
357 ! lib/Encode/Encoder.pm
358 POD fix by Miyagawa-kun
359 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
361 1.33 2002/04/10 22:28:40
363 Philip's mail address corrected.
366 ! lib/Encode/Encoder.pm
367 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
368 This adds Miyagawa-kun to AUTHORS.
369 * encoding() no longer exported by default but on demand
370 * t/Encoder.t updated to test all these
371 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
372 ! lib/Encode/Unicode.pm
373 ! lib/Encode/Supported.pm
374 Further doc fixes by Anton
376 1.32 2002/04/09 20:06:15
379 - ucm/macDevanaga.ucm Unicode Character Map
380 - ucm/macGujarati.ucm Unicode Character Map
381 - ucm/macGurmukhi.ucm Unicode Character Map
382 A utility to check integrity of .ucm files. t/bogus.ucm is a
383 ucm that is deliberately bogus. unused Indic mappings are removed
386 resolve_alias() added as suggested by jhi. Same as
387 find_encoding("alias")->name. For convenience. This one is
388 defined in Encode.pm instead of Alias.pm.
389 Message-Id: <20020409215846.H17022@alpha.hut.fi>
391 Memory Allocate but detected during the devel of ucmlint -- fixed.
392 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
393 ! lib/Encode/Unicode.pm
394 valid_ucs2(0) is false but must be true.
395 3 patches from NI-S as follows. This also has fixed the incident
397 ! lib/Encode/Alias.pm
398 find_alias() recursion prevention
400 Checks for the patch above
401 ! t/Encode/Unicode.pm
402 An extra "F" that causes valid_ucs2() return a bogus value fixed
403 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
404 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
405 2 Small Patches from jhi as follows:
407 Encode->encodings() lists in case-insensitve order (as it was)
409 -l option prints avaiable encodings to STDOUT instead of STDERR
410 ! lib/Encode/Aliases.pm
411 s/defintion/definition/
412 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
414 ! lib/Encode/Supported.pod
415 ! lib/Encode/Unicode.pm
416 POD revise by Philip Newton. This adds Philip to AUTHORS list.
417 Thank you for the exact quote of Douglas Adams :)
418 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
420 1.31 2002/04/08 18:08:07
421 ! lib/Encode/Encoder.pm
423 Encode::Encoder, once just a placeholder of an idea, is now much more
424 practical. See t/Encode.t to find how practical it can be.
425 + lib/Encode/Config.pm
427 my false laziness at Encode.pm is fixed. Now %ExtModules are set
428 in Encode::Config and they are all literally, not programatically
429 set. My false laziness was resulting many encodings missing from
431 ! lib/Encode/Unicode.pm
433 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
434 so that it does not rely Encode::Unicode for BOM values
435 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
437 1.30 2002/04/08 02:34:51
438 + lib/Encode/Encoder.pm
439 Object Oriented Encoder. I reckon something like this is in need.
442 ! lib/Encode/Supported.pod
443 * autoloading bug that prevented upper-case canonicals such as UTF-16
444 is fixed. Now even UTF/UCS are autoloaded!
445 * encodings() is now more intuitive.
446 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
448 * Obligatory fixes to the POD.
449 ! lib/Encode/Supported.pod
450 Patch from Anton applied.
451 Message-Id: <66641479.20020408033300@motor.ru>
453 ! lib/Encode/Unicode.pm
454 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
456 1.28 2002/04/07 18:58:42
460 Just a MANIFEST for those missing files.
462 1.26 Date: 2002/04/07 15:22:04
465 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
466 And t/Aliases.t fixed to make sure they all exist.
468 1.25 2002/04/07 15:01:25 (Unreleased)
470 ! lib/Encode/Unicode.pm
473 - lib/Encode/UTF_EBCDIC.pm
474 - lib/Encode/Internal.pm
476 Integrated into Encode.pm as closures. That way "one package, one file"
477 rule is preserved yet less files to require.
479 commented out binmode(STDERR ...
488 ! Encode/Makefile_PL.e2x
489 Schwarn's MM-compliance patch merged
490 Message-Id: <20020406082609.GA28758@blackrider>
492 ! lib/Encode/Unicode.pm
493 + lib/Encode/UTF_EBCDIC.pm
495 - lib/Encode/10646_1.pm
496 - lib/Encode/ucs2_le.pm
497 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
498 per-module based (en|de)code, I saved a number of .pm by
499 reorganizing it as per-object base (Well, this is what Encode::XS
500 does under the hood). See Encode::Unicode for details.
501 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
502 This module is used only on EBCDIC environments.
504 1.21 2002/04/05 14:46:34 (Not Released)
510 Are back to make Perl/Tk happy Smile, NI-S.
512 ! lib/Encode/Alias.pm
513 ! lib/Encode/Supported.pm
514 ! lib/Encode/10646_1.pm
515 ! lib/Encode/ucs2_le.pm
516 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
517 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
518 updated to reflect changes.
519 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
520 ! lib/Encode/Alias.pm
521 ! lib/Encode/Supported.pm
522 Anton's revision commited. Added Dan's own fixes as well.
523 Message-Id: <159103166906.20020405161134@motor.ru>
524 ! lib/Encode/Alias.pm
526 < qr/^UCS2-le$/i => '"UCS-2"', );
528 > qr/^UCS2-LE$/i => '"UTF-16LE"');
529 Sigh. Thank you, Anton.
530 Message-Id: <14567692196.20020405062020@motor.ru>
531 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
533 1.20 2002/04/04 19:50:52
535 the last minute addtion. Just give it a try. Docs remains to be done.
536 Not installed by default.
537 ! lib/Encode/Supported.pod
540 ! lib/Encode/Alias.pm
542 ! lib/Encode/10464_1.pm
543 ! lib/Encode/ucs2_le.pm
544 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
545 unchanged but UTF-16BE is added as an alias. Implicit aliases
546 move to Encode::Alias so init_alias() works more as expected.
547 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
548 Though pedantically wrong, This should make perl mongers happier.
549 t/Alias.t is enhanced to test all these.
550 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
552 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
553 of the codepoints are common. CJKT left as is because the saving is
559 ! Encode/Makefile_PL.e2x
567 All occurance of _def.h replaced with .exh so djgpp works happily
568 ever after! To credit this amazing discovery, Laszlo is now in
570 Message-Id: <20020403181424.GA8778@freemail.hu>
571 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
574 ! Encode/Makefile_PL.skel
576 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
577 Message-Id: <20020401222744.GX2000@blackrider>, et al.
579 Two more tests by added jhi
580 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
583 The showstopper fixed -- Memory reallocation bug was causing
584 Encode::XS to fall into infinite loop on certain conditions.
586 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
591 ! lib/Encode/Supported.pod
592 Vendor encodings rebuilt out of original map files at unicode.org.
593 Indic languages such as MacDevanagali remain unspported do to the
594 shortcoming of encengine capabilities (they need algorithmical
595 conversion and I have no knowledge on that!). Pods fixed for added
597 Oh, macJapan.ucm renamed to macJapanese.ucm.
598 macROMnn is macRomanian and macRUMnn is macRumanian.
599 txt2ucm is a crude script that is used to convert them.
601 Unicode Compound Characters (used extensively on Mac) supported
603 Typo fixes and improvements by jhi
604 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
606 1.11 $Date: 2002/05/04 16:41:18 $
610 Missing files from the MANIFEST fixed.
611 Message-Id: <20020401010156.H10509@alpha.hut.fi>
612 Version incremented just to make CPAN happy.
614 1.10 2002/03/31 21:32:42
617 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
618 if you want. This should make Autrijus happy. Also, piconv
619 is added to default install.
622 Here-documented files that enc2xs generates are now exported
623 to *.e2x. Much cleaner and easier to debug.
625 encoding enhances so you can make it act more like such
626 (now prehistoric ) "localized" variations of perl like Jperl.
628 Further test for encoding.pm. Written in euc-jp
632 Message-Id: <20020330174618.B10154@alpha.hut.fi>
637 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
644 ! Encode/macIceland.ucm
645 ! lib/Encode/Alias.pm
646 ! lib/Encode/Supported.pod
647 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
648 Message-Id: <m3lmcavhjt.fsf@anima.de>
650 1.01 2002/03/29 20:59:39
653 s/USE_SCRIPTS/MORE_SCRIPTS/
655 installs enc2xs by default for external Encode:: modules in CPAN,
656 such as Encode::HanExtra
658 More sensible perl core detection via $ENV{PERL_CORE}
660 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
662 Perl core ditection via $^X =~ m/\bminiperl$/o
663 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
667 The version of all files is updated to 1.00 via "ci -f -l1.00",
668 commemorating version 1.00. All files, including *.ucm are now
669 under version control.
672 encode.h moved to Encode/ so it will be installed for the later
675 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
680 compile renamed to enc2xs.
681 Affected Makefle.PL updated
683 "Punt it. HanExtra can take care of that later." -- Autrijus
684 Message-Id: <20020328154338.GA7351@not.autrijus.org>
688 ! lib/Encode/CJKConstants.pm
689 ! lib/Encode/KR/2022_KR.pm
690 Table patches for Euro Signs, 2022-KR fixups by Jungshik
691 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
695 bin/ added for example scripts. They are not installed by default.
696 to install them, "perl Makefile.PL USE_SCRIPTS".
697 piconv is iconv reinvented in perl. in addition to all features
698 of iconv, it also adds perlish features. See L<piconv/1> for more
700 ! lib/Encode/Alias.pm
701 qr/^ replaced with qr/\b so it directly matches locale names
702 such as en_US.US-ASCII
705 Patch by MJD to fix the following problem applied.
706 Subject: [PATCH 5.7.3 Encode]
707 Aliases.t not properly skipped when Encode extension not built
708 Message-Id: <20020328091850.18677.qmail@plover.com>
709 ! lib/Encode/KR/2022_KR.pm
710 ! lib/Encode/CJKConstants.pm
711 Another patch from Jungshik to make iso-2022-kr actually work
712 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
713 ! Encode/Encode/euc-kr.ucm
714 + Encode/Encode/johab.ucm
715 ! Encode/Encode/ksc5601.ucm
717 ! Encode/KR/Makefile.PL
718 ! Encode/lib/Encode/Alias.pm
720 Johab support and complete revision of Korean Encoding by Jungshik
721 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
723 Revised to make up with now-dropped Encode::Details.
724 - lib/Encode/Details.pod
725 Dropped. Besides being obsolete, the topics are now covered in
731 Korean aliases fixed thanks to Jungshik Shin
732 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
733 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
735 =head1 NAME added to all modules to make buildtoc happy
736 Message-Id: <20020327041151.A10618@alpha.hut.fi>
737 - lib/Encode/CJKguide.pod
738 Too controversial and dropped from the dist. Will be available
739 separately on the web.
741 RCS tags added so table debugging gets easier (should that be
742 needed! I hope they all stay 1.00!)
743 + lib/Encode/CJKguide.pod
744 A detailed guide to mainly, but not limited to, CJK multibyte
747 + Encode/hp-roman8.ucm
749 ! Encode/Supported.pod
750 All occurance of "roman8" replaced with "hp-roman8" to avoid
752 ! Encode/Supported.pod
755 Mac Encodings now comply the Inside Macintosh
757 Test for '-raw' conventions added.
759 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
763 "-raw" appended to canonical names.
764 File mames stay unchanged thanks to UCM format.
765 ! lib/Encode/CN/HZ.pm
766 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
767 Message-Id: <20020326035210.GA2091@not.autrijus.org>
770 - lib/Encode/JP/Const.pm
771 + lib/Encode/CJKConstants.pm
772 + lib/Encode/CN/2022_CN.pm
773 + lib/Encode/KR/2022_KR.pm
782 * Support for ISO-2022-KR and ISO-2022-CN added.
784 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
785 * ucm2table autogenerates character table out of UCM files.
788 - lib/Encode/Supports.pod
789 + lib/Encode/Supported.pod
790 Names reverted due to popular demand.
791 8.3 rule applies only when there is a conflict.
792 Message-Id: <20020325095924.GD44120@not.autrijus.org>
797 - lib/Encode/Format/Enc.pod
799 * Character tables is now 100% ucm.
800 * All files under Encode/ is now 8.3-compliant
801 * some of missing encodings added (i.e. gsm0338 and nextstep)
802 * Vendor mappings aggregated with appropriate national std in
803 Makefile.PL, resulting smaller *.so especially for CJK.
804 Following is result on Dan's FreeBSD box.
806 ---------------------------------------------------------------
807 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
808 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
809 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
810 blib/arch/auto/Encode/Encode.so 27,791 27,791
811 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
812 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
813 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
814 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
815 ---------------------------------------------------------------
816 Total 5,375,297 6,343,819
818 * As a result of ucm-transition, Encode::Tcl dropped because
819 Encode::Tcl demands *.enc.
820 Encode::Tcl will be supplied in a separate tarball with *.enc.
821 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
826 -lib/Encode/Supported.pod
827 +lib/Encode/Supports.pod
828 -lib/Encode/iso10646_1.pm
829 +lib/Encode/10646_1.pm
830 -lib/Encode/EncFormat.pod
831 +lib/Encode/Format/Enc.pod
832 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
833 - lib/Encode/JP/Constants.pm
834 + lib/Encode/JP/Consts.pm
835 ! lib/Encode/JP/JIS.pm
836 ! lib/Encode/JP/H2Z.pm
837 Version nit problem and 8.3 rule fix.
838 > Package namespace installed latest in CPAN file
839 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
840 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
841 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
842 accordingly. In addition, legacy "use vars qw()..." are replaced with
844 Message-Id: <20020325011248.D1561@alpha.hut.fi>
845 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
847 - lib/Encode/JP/ISO_2022_JP.pm
848 - lib/Encode/JP/ISO_2022_JP_1.pm
849 + lib/Encode/JP/2022_JP.pm
850 + lib/Encode/JP/2022_JP1.pm
852 8.3 naming conflict for vanilla fat addressed by jhi
853 Message-Id: <20020324201931.V22596@alpha.hut.fi>
856 Typecast fix addressed by jhi
857 Message-Id: <20020324185540.T22596@alpha.hut.fi>
860 ! lib/Encode/Supported.pod
862 + lib/Encode/JP/ISO_2022_JP_1.pm
863 ! lib/Encode/JP/ISO_2022_JP.pm
864 ! lib/Encode/JP/JIS.pm
866 Now Encode::JP is more strict on the difference between ISO-2022-JP
867 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
868 makes Anton happier :) FYI the previous version implements
869 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
870 ! lib/Encode/Supported.pod
873 Avoid core-dump in Encode with PERLIO=mmap by NI-S
874 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
879 ! lib/Encode/Suppoted.pod
880 pod fixes to replace F<http://...> to L<http://...>,
881 as suggested by Autrijius in:
882 Message-Id: <20020324083943.GA14901@not.autrijus.org>
883 ! lib/Encode/Suppoted.pod
884 fixes and enhancements by Anton
885 Message-Id: <10632060120.20020324103753@motor.ru>
886 ! lib/Encode/Alias.pm
887 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
888 added. Suggested by Anton then deobfuscated by Autrijius
889 Message-Id: <20020324064455.GA3667@not.autrijus.org>
891 Further fix by Nicholas Clark
892 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
893 - lib/EncodeFormat.pod
894 + lib/Encode/EncFormat.pod
896 File renamed as suggested by Autrijius
898 ! lib/Encode/Details.pod
899 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
900 ! Encode.pm Sun Mar 24 13:43:47 2002
901 pod fixes by Autrijius.
902 Message-Id: <20020324062804.GA3595@not.autrijus.org>
903 Message-Id: <20020324075627.GB11986@not.autrijus.org>
905 ! lib/Encode/Alias.pm
907 now more EBCDIC conscious;
908 %ExtModules on EBCDIC system excludes CJK so that you don't
909 have to worry about the matched alias resulting cloaking.
910 t/Alias.t also revised to reflect changes. Verified by jhi
911 Message-Id: <20020324022929.D22596@alpha.hut.fi>
917 EBCDIC detection mechanism installed as in JP/JP.pm
918 Message-Id: <20020323211847.G19148@alpha.hut.fi>
926 Now all table files used by compile are postfixed '_t' to avoid
927 namespace collisions in case insensitive file systems once for all!
929 Message-ID: <58290227735.20020323195659@familiehaase.de>
931 Since the Encode::JP is unsupported under EBCDIC we
932 cannot run this test (aliases as such should work fine) -- jhi
933 Message-Id: <20020323202119.D19148@alpha.hut.fi>
935 duplicate occurance of ascii.ucm and 8859-1.ucm
936 causes MacOS X dlyd to cloak
942 < chdir 't' if -d 't';
944 > if (! -d 'blib' and -d 't'){ chdir 't' };
945 When you are "make test"-ing on Encode/ directory, you must not
946 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
947 remain unchanced. Also the situation detection was made simpler
948 in t/JP.t, which was originally;
949 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
951 "Use of uninitialized value in string eq at Encode.pm line 96."
955 -- Problem on case insensitive file systems
956 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
957 Message-ID: <88254111953.20020323095503@familiehaase.de>
960 "So I think it's a bug in gcc, not perl. But it still needs to be
962 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
963 Message-Id: <20020323170509.C96475@plum.flirble.org>
967 ! lib/Encode/Encoding.pm
968 ! lib/Encode/Alias.pm
969 ! lib/Encode/Supported.pod
971 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
972 Message-ID: <20020322073908.GB10539@blackrider>
975 "...I think we should include ISO 8859-1 as well." -- NI-S
976 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
981 ! lib/Encode/Alias.pm
982 alias definitions relocated to Encode::Alias so module autoloading
983 works for aliases also.
985 encodings() now accepts args to check ExtModules.
994 Latin and single byte encodings are reorganized so they are
995 demand-loaded like Encode::XX. Now only ascii is compiled into
997 ! lib/Encode/Alias.pm
998 for my $k (keys %hash){ delete $hash{$k}; }
999 is depreciated; fixed.
1001 0.95 Fri Mar 22 2002
1002 In this update, pod rewrites and alias fixes are the main issues
1003 + lib/Encode/Supported.pod
1004 Describes supported encodings
1006 streamlined compiled-in encodings.
1007 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1009 + Encode/ibm-125?.ucm
1010 Added from icu distibution with any occurance of
1011 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1012 some respect to icu staff, however.
1013 + lib/Encode/Alias.pm
1015 Alias difinitions in Encode.pm relocated.
1018 packWARN patch from Paul Marquess via jhi
1019 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1020 Paul added to AUTHORS as a result.
1021 ! t/CJKalias.t -> t/Aliases.t
1022 Renamed. Checks even more aliases and alias overloading
1025 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1026 gbk => cp936 relocated to CN.pm
1028 Test::More with plans (by jhi)
1030 0.94 Thu Mar 21 2002
1031 + lib/Encode/Description.pod
1032 ! lib/Encode/Encoding.pm
1033 Now the pod in Encode.pm is abridged as programming references.
1034 lib/Encode/Description.pod contains the original, detailed description
1035 and Encode::Encoding explains how to write your own module to
1036 add new encodings. So far, lib/Encode/Description.pod contains
1037 the whole pod once in Encode.pm. This is intentional.
1039 Pod revisions by Anton Tagunov
1040 Message-Id: <517178431.20020320174824@motor.ru>
1042 all occrance of Encode::Tcl::Extended removed including pod
1044 test now checks $encoding->name only; $encoding->{name} are
1045 no longer check to find the canonical name.
1046 ! lib/Encode/JP/JIS.pm
1047 ! lib/Encode/JP/ISO_2022_JP.pm
1048 ->name() added to be more compliant with API
1054 Patch by Autrijus to add aliases to TW and fixes to POD
1055 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1057 SADAHIRO Tomoyuki added as should. My apologies.
1059 0.93 Wed Mar 20 2002
1060 * First release to be uploaded to CPAN. For prehistoric changes,
1061 please see Changes file of perl distibution as well as
1062 perl-unicode@perl.org archive, available at:
1063 http://archive.develooper.com/perl-unicode@perl.org/
1065 Changes Since 0.92 includes;
1070 + Mention to perl-unicode@perl.org added
1072 + Encoding aliases added so you can feed locale names
1073 and MIME Charset="" directly.
1074 - Mention to JISX0212 removed because it's fixed
1077 + Encoding aliases added. Note TW is left untouched because
1078 euc-tw is not implemented in TW but in Encode::HanExtra.
1079 Autrijus, you may fix Encode::HanExtra.
1081 + to test encode aliases added