1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.74 2002/05/28 18:33:15 dankogai Exp dankogai $
6 $Revision: 1.74 $ $Date: 2002/05/28 18:33:15 $
7 + ucm/null.ucm ucm/ctrl.ucm
8 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
9 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
10 Subject: Re: unicode -> &# notation
11 Message-ID: <20020518193704.GB40272@not.autrijus.org>
13 1.73 2002/05/28 17:26:18
14 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
15 Chris Nandor has fixed Encode so that it works w/ MacPerl --
16 at least w/ PPC (68k need static linking which does not work due to
17 64k limit). pudge is added to AUTHORS (I'm surprised he was not
18 there in the list). Encode/Makefile_PL.e2x was additionally fixed
19 by dankogai to reflect changes in other Makefile.PL
20 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
22 Subject: Change 16746: -Mutf8 cleanup.
23 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
25 1.72 2002/05/20 15:49:56
27 Subject: [PATCH] Encode should be in perl-core library path
28 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
29 Message-Id: <20020520161201.A11019@alpha.hut.fi>
30 ! lib/Encode/MIME/Header.pm
31 Subject: [PATCH] Encode::MIME::Header
32 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
33 ! Encode/Makefile_PL.e2x
34 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
35 Message-Id: <20020519201031.GA1603@not.autrijus.org>
36 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
37 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
39 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
40 Subject: [PATCH] good day for WinCE port of perl.
41 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
44 Even more linting by Robin via @16532
46 Even more typecast by Sarathy in @16460
48 1.71 2002/05/07 16:22:42
50 even more typecasts by Robin
51 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
53 A very strange bug that was causing a bugus ucm -> C table
54 generation that was revealed by a UCM file that Andreas was
55 working. This is the king of wierdest bug I've encountered
56 in the course of Encode maintenance.
57 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
59 1.70 2002/05/06 10:26:48
61 Made more 'module-safe' with conjunction w/ 'no encoding'.
62 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
63 ! lib/Encode/Encoding.pm
64 'require Encode' because ->Define uses Encode::define_encoding();
65 problem and solution addressed by Miyagawa-kun
66 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
68 Cuts the frill to make djgpp happier, as suggested by Laszlo
69 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
71 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
72 Message-Id: <m3bsbug48n.fsf@anima.de>
74 1.69 2002/05/04 16:41:18
75 ! lib/Encode/MIME/Header
76 Floating-point coerced for UNICOS (in integer arithmetics it folds
77 line one character too early). Verification by Mark is pending.
78 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
80 more doc patch from Elizabeth
81 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
82 ! Encode/Makefile_PL.e2x
83 More platform-independent patch from Benjamin
84 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
85 ! lib/Encode/Guess AUTHORS
86 split regex fix by Graham Barr. Adds him to AUTHORS.
87 Message-Id: <20020504085419.E95940@valueclick.com>
88 ! Encode/Makefile_PL.e2x
89 enc2xs script discovery made smarter and more sensible, first cited
90 by Miyagawa-kun and further suggestions by Rafael and Andreas
91 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
92 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
94 1.68 2002/05/03 12:20:13
95 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
96 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
97 and Supported.pod modified to reflect the change. Elizabeth added
98 to Authors. And H.M. is also added for forwarding her patch among
99 other contributions (I was rather surprised to find his name was not
101 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
103 1.67 2002/05/02 07:33:09
105 Error message now consistent w/ perlqq (\N{U+} -> \x{})
106 done in perl@16308 but Philip linted me further. Now the error
107 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
108 ! lib/Encode/Guess.pm
109 Sanity check for happier -w by Autrijus
111 1.66 2002/05/01 05:41:06
112 ! Encode.xs t/fallback.t
113 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
114 while fallback is in effect. This even came with a welcome side-effect
115 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
116 also modified to test this.
117 And of course, the corresponding varialbles to UV[Xx]f are appropriately
118 cast. This should've concluded NI-XS homework.
120 encode(undef) does warn again! Repented upon suggestion by NI-XS.
121 Document for unless vs. '' added
122 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
124 1.65 2002/04/30 16:13:37
126 encode(undef) no longer warns for C<Use of uninitialized value in
127 subroutine entry>. Suggested by Paul.
128 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
129 ! lib/Encode/Supported.pod
130 Encode::MIME::Header and Encode::Guess mentioned
131 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
132 ! lib/Encode/Guess.pm
133 POD fix by Miyagawa-kun
134 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
136 1.64 2002/04/29 06:54:06
138 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
139 encocde("euc-jisx0213") are still DIFFERENT.
140 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
142 A few white spaces corrected by NI-XS via PerlIO integration to
144 Subject: Change 16247: Integrate perlio;
146 Document fixes by Andreas
147 Message-Id: <m3k7qsf1we.fsf@anima.de>
149 1.63 2002/04/27 18:59:50
150 ! lib/Encode/Encoding.pm
151 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
152 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
153 Make use of the Encode::Encoding base class!
154 And other cleanups in Encode.xs upon NI-XS suggestions
155 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
157 1.62 2002/04/27 11:17:39
159 encodings() now just check %ExtModule instead of eval{require}
160 all of them for ":all" to conserve more memory.
162 more "%x" -> "%" UVxf stuff.
164 s/=over2/=over 2/g # oops.
166 1.61 2002/04/26 03:02:04
168 Now does decent tests besides use_ok()
169 ! lib/Encode/Guess.pm t/guess.t
170 UI streamlined, document added
172 various signed/unsigned mismatch nits (#16173)
173 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
175 POD: utf8-flag-related caveats added. A few sections completely
179 Thou shalt not assume %d works, either!
180 Robin Baker added to AUTHORS for this
181 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
183 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
185 1.60 2002/04/24 20:06:52
187 "Thou shalt not assume %x works." -- jhi
188 Message-Id: <20020424210618.E24347@alpha.hut.fi>
189 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
190 low-memory build machines happy, now *.c is created for each *.ucm
191 (no table aggregation). You can still override this by setting
192 $ENV{AGGREGATE_TABLES}.
193 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
194 + lib/Encode/Guess.pm
195 + lib/Encode/JP/JIS7.pm
196 Encoding-autodetect (mainly for Japanese encoding) added. In a
197 course of development, JIS7.pm was improved.
198 + lib/Encode/HTML/Header.pm
199 + lib/Encode/Config.pm
200 MIME B/Q Header Encoding Added!
201 ! Encode.pm Encode.xs t/fallback.t
202 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
203 Message-Id: <20020424130709.GA14211@tanglefoot>
205 1.59 $ 2002/04/22 23:54:22
206 ! Encode.pm Encode.xs
207 needs_lines() and perlio_ok() are added to Internal encodings such
208 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
210 1.58 2002/04/22 23:54:22
212 s/MacChineseSimp/MacChineseTrad/ # ... oops.
217 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
218 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
219 t/(JP|KR).t are aggregated to t/CJKT.t
220 test data are all remade via bin/ucm2text.
221 And .... They are no longer skipped for -Uuseperlio !
223 1.57 2002/04/22 20:27:30
224 ! t/JP.t t/KR.t t/perlio.t
225 unless (find PerlIO::Layer 'perlio') ... line is back again.
226 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
227 need PerlIO. Sigh....
228 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
229 ->perlio_ok now does eval{ require PerlIO::encoding } there so
230 it correctly returns 1 when PerlIO::encoding is yet loaded.
232 perl-current patch #16072 reflected
234 1.56 2002/04/22 09:48:07
235 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
236 New PerlIO::encoding 0.04 compliance met
238 1.55 2002/04/22 03:43:05
239 ! Encode.pm Encode.xs Unicode/Unicode.pm
240 needs_lines() defined so Encode::Encoding is no longer needed
243 1.54 2002/04/22 02:50:01
244 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
245 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
246 now perlio_ok is true by default if PerlIO::encoding->VERSION is
247 0.03 or larger. POD in Encode::Encoding revised to reflect this.
248 Encode::XS and Encode::Unicode now has perlio_ok() method.
249 ! lib/Encode/Supported.pod
250 s/UP-UX/HP-UX/ by jhi
251 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
252 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
253 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
254 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
255 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
256 ! lib/Encode/Supported.pod
257 Huge document fixes by Philip.
260 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
261 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
263 binmode() after "<:encoding" to make Win32 happy, by Mattia.
264 Mattia added to AUTHORS file
265 Message-Id: <3CC3150F.5798.22A05AE@localhost>
267 1.52 2002/04/20 23:43:47
269 TODO: is now SKIP:, as NI-XS requested. Also adds more
270 eraborate failure analysis added.
272 A note on how to make sure of round-trip safety added to POD
273 section (so Autrijus is happier)
274 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
275 big5-(eten|hkscs) is round-trip safe again!
276 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
278 Typo fixes by Andreas
279 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
280 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
281 PerIO coodination patches from NI-XS.
282 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
284 1.51 2002/04/20 09:58:23
286 Updated test suite by Autrijis so "make test" is happy again
287 Message-Id: <20020420082104.GA25037@not.autrijus.org>
289 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
291 TW/TW.pm TW/Makefile.PL
292 Updates by Autrijus. 'big5' is no longer a canonical but an
293 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
294 Message-Id: <20020419195346.GA19597@not.autrijus.org>
296 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
297 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
299 PerlIO detection a little bit smarter; no longer uses eval qq{}
302 1.50 2002/04/19 06:13:02
303 ! ! Encode.pm Encode.xs Encode/encoding.h
305 New Fallback API imlemented and documented. See "perldoc Encode"
307 ! lib/Encode/JP/JIS7.pm Encode.pm
308 + lib/Encode/PerlIO.pod t/perlio.t
309 API compliance met. However, it still does not work unless perlio
310 implements line buffer. See BUGS section in perldoc Encode::PerlIO
311 As a sensible workaround, perlio_ok() added to Encode.
313 ! lib/Encode/Supported.pod
315 Message-Id: <20020418174647.J8466@alpha.hut.fi>
317 Doc fixes from Autrijus
318 Message-Id: <20020418144131.GA10987@not.autrijus.org>
320 perlqq mode documented
322 + t/jisx0201.euc t/jisx0201.ref
323 ! t/jisx0208.euc t/jisx0208.ref
324 t/JP.t tests more rigorously and with other encodings
325 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
326 in jis0208 (used to contain jisx0201 part).
327 ! Encode/Makefile_PL.e2x
328 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
329 enc2xs and encode.h rather than hard-coded. This allows the resulting
330 module fully CPANizable.
331 ! encoding.pm t/JP.t t/KR.t
332 PerlIO detection simplified (checks %INC instead of eval{})
333 ! Encode.xs Encode/encode.h
334 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
335 - lib/Encode/Unicode.pm
336 (en|de)code_xs relocated to where it belongs. Source reindented
339 Additional (U8 *) cast added as suggested by jhi
340 Message-Id: <20020417165916.A28599@alpha.hut.fi>
342 1.42 Date: 2002/04/17
344 no-op module; Thought of adding a pod there but enc2xs has
349 correct mechanism to detect Perlio::encoding layar installed.
351 PerlIO Layer detached.
353 1.41 2002/04/16 23:35:00
355 binmode(STDIN|STDOUT ...) done iff PerlIO is available
357 Cleaned up PerlIO skip conditions to prepare for the upcoming
358 Encode - PerlIO forking.
360 exported functions are now prototyped.
361 ! lib/Encode/CN/HZ.pm
364 fallback implemented # was /* FIXME */
365 affected programs revised to fit (only HZ was using the try-catch
366 approach which needed to be fixed for API-compliance).
368 ! Encode/KR/2022_KR.pm
370 can find =head1 NAME now, jhi
371 Message-Id: <20020416083059.V30639@alpha.hut.fi>
375 now complies with less warnings with the pickest compilers.
376 Suggested by Craig, fixed by Dan.
377 ! Encode/Makefile_PL.e2x
379 A bug that fails to find *.e2x in certain conditions fixed
381 1.40 2002/04/14 22:27:14
382 + Encode/ConfigLocal_PM.e2x
383 ! lib/Encode/Config.pm
385 "enc2xs -C" now generates/updates Encode::ConfigLocal.
386 ConfigLocal_PM.e2x is a skelton thereof.
387 ! lib/Encode/Config.pm
389 "use Encode::CN::HZ;" was missing.
392 More rigorous tests added to test XS, especially on memory allocation.
394 ! lib/Encode/Unicode.pm
395 NI-S implemented an XS version -- merged
396 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
399 Source filter option added. With this option on, you can write
400 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
401 encodings. t/jperl.t enhanced to test this feature.
403 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
404 apply his suggestion, this degree of nitting is enough to add him
406 Message-Id: <3CB93223.291E5E2E@earthlink.net>
408 + lib/Encode/JP/JIS7.pm
409 - lib/Encode/JP/JIS.pm
410 - lib/Encode/JP/2022_JP.pm
411 - lib/Encode/JP/2022_JP1.pm
412 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
413 JIS7.pm for better maintainability and performance
415 Added caveat for non-ascii identifiers.
417 fixes by jhi, the original author of this pragramtic module.
418 Message-Id: <20020413231527.V1826@alpha.hut.fi>
420 1.34 2002/04/12 20:23:05 (Unreleased)
423 EBCDIC fixes addressed by jhi.
424 Message-Id: <20020412161844.D9383@alpha.hut.fi>
425 ! lib/Encode/Encoder.pm
426 POD fix by Miyagawa-kun
427 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
429 1.33 2002/04/10 22:28:40
431 Philip's mail address corrected.
434 ! lib/Encode/Encoder.pm
435 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
436 This adds Miyagawa-kun to AUTHORS.
437 * encoding() no longer exported by default but on demand
438 * t/Encoder.t updated to test all these
439 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
440 ! lib/Encode/Unicode.pm
441 ! lib/Encode/Supported.pm
442 Further doc fixes by Anton
444 1.32 2002/04/09 20:06:15
447 - ucm/macDevanaga.ucm Unicode Character Map
448 - ucm/macGujarati.ucm Unicode Character Map
449 - ucm/macGurmukhi.ucm Unicode Character Map
450 A utility to check integrity of .ucm files. t/bogus.ucm is a
451 ucm that is deliberately bogus. unused Indic mappings are removed
454 resolve_alias() added as suggested by jhi. Same as
455 find_encoding("alias")->name. For convenience. This one is
456 defined in Encode.pm instead of Alias.pm.
457 Message-Id: <20020409215846.H17022@alpha.hut.fi>
459 Memory Allocate but detected during the devel of ucmlint -- fixed.
460 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
461 ! lib/Encode/Unicode.pm
462 valid_ucs2(0) is false but must be true.
463 3 patches from NI-S as follows. This also has fixed the incident
465 ! lib/Encode/Alias.pm
466 find_alias() recursion prevention
468 Checks for the patch above
469 ! t/Encode/Unicode.pm
470 An extra "F" that causes valid_ucs2() return a bogus value fixed
471 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
472 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
473 2 Small Patches from jhi as follows:
475 Encode->encodings() lists in case-insensitve order (as it was)
477 -l option prints avaiable encodings to STDOUT instead of STDERR
478 ! lib/Encode/Aliases.pm
479 s/defintion/definition/
480 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
482 ! lib/Encode/Supported.pod
483 ! lib/Encode/Unicode.pm
484 POD revise by Philip Newton. This adds Philip to AUTHORS list.
485 Thank you for the exact quote of Douglas Adams :)
486 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
488 1.31 2002/04/08 18:08:07
489 ! lib/Encode/Encoder.pm
491 Encode::Encoder, once just a placeholder of an idea, is now much more
492 practical. See t/Encode.t to find how practical it can be.
493 + lib/Encode/Config.pm
495 my false laziness at Encode.pm is fixed. Now %ExtModules are set
496 in Encode::Config and they are all literally, not programatically
497 set. My false laziness was resulting many encodings missing from
499 ! lib/Encode/Unicode.pm
501 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
502 so that it does not rely Encode::Unicode for BOM values
503 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
505 1.30 2002/04/08 02:34:51
506 + lib/Encode/Encoder.pm
507 Object Oriented Encoder. I reckon something like this is in need.
510 ! lib/Encode/Supported.pod
511 * autoloading bug that prevented upper-case canonicals such as UTF-16
512 is fixed. Now even UTF/UCS are autoloaded!
513 * encodings() is now more intuitive.
514 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
516 * Obligatory fixes to the POD.
517 ! lib/Encode/Supported.pod
518 Patch from Anton applied.
519 Message-Id: <66641479.20020408033300@motor.ru>
521 ! lib/Encode/Unicode.pm
522 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
524 1.28 2002/04/07 18:58:42
528 Just a MANIFEST for those missing files.
530 1.26 Date: 2002/04/07 15:22:04
533 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
534 And t/Aliases.t fixed to make sure they all exist.
536 1.25 2002/04/07 15:01:25 (Unreleased)
538 ! lib/Encode/Unicode.pm
541 - lib/Encode/UTF_EBCDIC.pm
542 - lib/Encode/Internal.pm
544 Integrated into Encode.pm as closures. That way "one package, one file"
545 rule is preserved yet less files to require.
547 commented out binmode(STDERR ...
556 ! Encode/Makefile_PL.e2x
557 Schwarn's MM-compliance patch merged
558 Message-Id: <20020406082609.GA28758@blackrider>
560 ! lib/Encode/Unicode.pm
561 + lib/Encode/UTF_EBCDIC.pm
563 - lib/Encode/10646_1.pm
564 - lib/Encode/ucs2_le.pm
565 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
566 per-module based (en|de)code, I saved a number of .pm by
567 reorganizing it as per-object base (Well, this is what Encode::XS
568 does under the hood). See Encode::Unicode for details.
569 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
570 This module is used only on EBCDIC environments.
572 1.21 2002/04/05 14:46:34 (Not Released)
578 Are back to make Perl/Tk happy Smile, NI-S.
580 ! lib/Encode/Alias.pm
581 ! lib/Encode/Supported.pm
582 ! lib/Encode/10646_1.pm
583 ! lib/Encode/ucs2_le.pm
584 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
585 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
586 updated to reflect changes.
587 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
588 ! lib/Encode/Alias.pm
589 ! lib/Encode/Supported.pm
590 Anton's revision commited. Added Dan's own fixes as well.
591 Message-Id: <159103166906.20020405161134@motor.ru>
592 ! lib/Encode/Alias.pm
594 < qr/^UCS2-le$/i => '"UCS-2"', );
596 > qr/^UCS2-LE$/i => '"UTF-16LE"');
597 Sigh. Thank you, Anton.
598 Message-Id: <14567692196.20020405062020@motor.ru>
599 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
601 1.20 2002/04/04 19:50:52
603 the last minute addtion. Just give it a try. Docs remains to be done.
604 Not installed by default.
605 ! lib/Encode/Supported.pod
608 ! lib/Encode/Alias.pm
610 ! lib/Encode/10464_1.pm
611 ! lib/Encode/ucs2_le.pm
612 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
613 unchanged but UTF-16BE is added as an alias. Implicit aliases
614 move to Encode::Alias so init_alias() works more as expected.
615 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
616 Though pedantically wrong, This should make perl mongers happier.
617 t/Alias.t is enhanced to test all these.
618 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
620 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
621 of the codepoints are common. CJKT left as is because the saving is
627 ! Encode/Makefile_PL.e2x
635 All occurance of _def.h replaced with .exh so djgpp works happily
636 ever after! To credit this amazing discovery, Laszlo is now in
638 Message-Id: <20020403181424.GA8778@freemail.hu>
639 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
642 ! Encode/Makefile_PL.skel
644 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
645 Message-Id: <20020401222744.GX2000@blackrider>, et al.
647 Two more tests by added jhi
648 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
651 The showstopper fixed -- Memory reallocation bug was causing
652 Encode::XS to fall into infinite loop on certain conditions.
654 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
659 ! lib/Encode/Supported.pod
660 Vendor encodings rebuilt out of original map files at unicode.org.
661 Indic languages such as MacDevanagali remain unspported do to the
662 shortcoming of encengine capabilities (they need algorithmical
663 conversion and I have no knowledge on that!). Pods fixed for added
665 Oh, macJapan.ucm renamed to macJapanese.ucm.
666 macROMnn is macRomanian and macRUMnn is macRumanian.
667 txt2ucm is a crude script that is used to convert them.
669 Unicode Compound Characters (used extensively on Mac) supported
671 Typo fixes and improvements by jhi
672 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
674 1.11 $Date: 2002/05/28 18:33:15 $
678 Missing files from the MANIFEST fixed.
679 Message-Id: <20020401010156.H10509@alpha.hut.fi>
680 Version incremented just to make CPAN happy.
682 1.10 2002/03/31 21:32:42
685 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
686 if you want. This should make Autrijus happy. Also, piconv
687 is added to default install.
690 Here-documented files that enc2xs generates are now exported
691 to *.e2x. Much cleaner and easier to debug.
693 encoding enhances so you can make it act more like such
694 (now prehistoric ) "localized" variations of perl like Jperl.
696 Further test for encoding.pm. Written in euc-jp
700 Message-Id: <20020330174618.B10154@alpha.hut.fi>
705 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
712 ! Encode/macIceland.ucm
713 ! lib/Encode/Alias.pm
714 ! lib/Encode/Supported.pod
715 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
716 Message-Id: <m3lmcavhjt.fsf@anima.de>
718 1.01 2002/03/29 20:59:39
721 s/USE_SCRIPTS/MORE_SCRIPTS/
723 installs enc2xs by default for external Encode:: modules in CPAN,
724 such as Encode::HanExtra
726 More sensible perl core detection via $ENV{PERL_CORE}
728 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
730 Perl core ditection via $^X =~ m/\bminiperl$/o
731 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
735 The version of all files is updated to 1.00 via "ci -f -l1.00",
736 commemorating version 1.00. All files, including *.ucm are now
737 under version control.
740 encode.h moved to Encode/ so it will be installed for the later
743 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
748 compile renamed to enc2xs.
749 Affected Makefle.PL updated
751 "Punt it. HanExtra can take care of that later." -- Autrijus
752 Message-Id: <20020328154338.GA7351@not.autrijus.org>
756 ! lib/Encode/CJKConstants.pm
757 ! lib/Encode/KR/2022_KR.pm
758 Table patches for Euro Signs, 2022-KR fixups by Jungshik
759 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
763 bin/ added for example scripts. They are not installed by default.
764 to install them, "perl Makefile.PL USE_SCRIPTS".
765 piconv is iconv reinvented in perl. in addition to all features
766 of iconv, it also adds perlish features. See L<piconv/1> for more
768 ! lib/Encode/Alias.pm
769 qr/^ replaced with qr/\b so it directly matches locale names
770 such as en_US.US-ASCII
773 Patch by MJD to fix the following problem applied.
774 Subject: [PATCH 5.7.3 Encode]
775 Aliases.t not properly skipped when Encode extension not built
776 Message-Id: <20020328091850.18677.qmail@plover.com>
777 ! lib/Encode/KR/2022_KR.pm
778 ! lib/Encode/CJKConstants.pm
779 Another patch from Jungshik to make iso-2022-kr actually work
780 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
781 ! Encode/Encode/euc-kr.ucm
782 + Encode/Encode/johab.ucm
783 ! Encode/Encode/ksc5601.ucm
785 ! Encode/KR/Makefile.PL
786 ! Encode/lib/Encode/Alias.pm
788 Johab support and complete revision of Korean Encoding by Jungshik
789 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
791 Revised to make up with now-dropped Encode::Details.
792 - lib/Encode/Details.pod
793 Dropped. Besides being obsolete, the topics are now covered in
799 Korean aliases fixed thanks to Jungshik Shin
800 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
801 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
803 =head1 NAME added to all modules to make buildtoc happy
804 Message-Id: <20020327041151.A10618@alpha.hut.fi>
805 - lib/Encode/CJKguide.pod
806 Too controversial and dropped from the dist. Will be available
807 separately on the web.
809 RCS tags added so table debugging gets easier (should that be
810 needed! I hope they all stay 1.00!)
811 + lib/Encode/CJKguide.pod
812 A detailed guide to mainly, but not limited to, CJK multibyte
815 + Encode/hp-roman8.ucm
817 ! Encode/Supported.pod
818 All occurance of "roman8" replaced with "hp-roman8" to avoid
820 ! Encode/Supported.pod
823 Mac Encodings now comply the Inside Macintosh
825 Test for '-raw' conventions added.
827 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
831 "-raw" appended to canonical names.
832 File mames stay unchanged thanks to UCM format.
833 ! lib/Encode/CN/HZ.pm
834 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
835 Message-Id: <20020326035210.GA2091@not.autrijus.org>
838 - lib/Encode/JP/Const.pm
839 + lib/Encode/CJKConstants.pm
840 + lib/Encode/CN/2022_CN.pm
841 + lib/Encode/KR/2022_KR.pm
850 * Support for ISO-2022-KR and ISO-2022-CN added.
852 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
853 * ucm2table autogenerates character table out of UCM files.
856 - lib/Encode/Supports.pod
857 + lib/Encode/Supported.pod
858 Names reverted due to popular demand.
859 8.3 rule applies only when there is a conflict.
860 Message-Id: <20020325095924.GD44120@not.autrijus.org>
865 - lib/Encode/Format/Enc.pod
867 * Character tables is now 100% ucm.
868 * All files under Encode/ is now 8.3-compliant
869 * some of missing encodings added (i.e. gsm0338 and nextstep)
870 * Vendor mappings aggregated with appropriate national std in
871 Makefile.PL, resulting smaller *.so especially for CJK.
872 Following is result on Dan's FreeBSD box.
874 ---------------------------------------------------------------
875 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
876 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
877 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
878 blib/arch/auto/Encode/Encode.so 27,791 27,791
879 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
880 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
881 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
882 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
883 ---------------------------------------------------------------
884 Total 5,375,297 6,343,819
886 * As a result of ucm-transition, Encode::Tcl dropped because
887 Encode::Tcl demands *.enc.
888 Encode::Tcl will be supplied in a separate tarball with *.enc.
889 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
894 -lib/Encode/Supported.pod
895 +lib/Encode/Supports.pod
896 -lib/Encode/iso10646_1.pm
897 +lib/Encode/10646_1.pm
898 -lib/Encode/EncFormat.pod
899 +lib/Encode/Format/Enc.pod
900 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
901 - lib/Encode/JP/Constants.pm
902 + lib/Encode/JP/Consts.pm
903 ! lib/Encode/JP/JIS.pm
904 ! lib/Encode/JP/H2Z.pm
905 Version nit problem and 8.3 rule fix.
906 > Package namespace installed latest in CPAN file
907 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
908 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
909 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
910 accordingly. In addition, legacy "use vars qw()..." are replaced with
912 Message-Id: <20020325011248.D1561@alpha.hut.fi>
913 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
915 - lib/Encode/JP/ISO_2022_JP.pm
916 - lib/Encode/JP/ISO_2022_JP_1.pm
917 + lib/Encode/JP/2022_JP.pm
918 + lib/Encode/JP/2022_JP1.pm
920 8.3 naming conflict for vanilla fat addressed by jhi
921 Message-Id: <20020324201931.V22596@alpha.hut.fi>
924 Typecast fix addressed by jhi
925 Message-Id: <20020324185540.T22596@alpha.hut.fi>
928 ! lib/Encode/Supported.pod
930 + lib/Encode/JP/ISO_2022_JP_1.pm
931 ! lib/Encode/JP/ISO_2022_JP.pm
932 ! lib/Encode/JP/JIS.pm
934 Now Encode::JP is more strict on the difference between ISO-2022-JP
935 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
936 makes Anton happier :) FYI the previous version implements
937 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
938 ! lib/Encode/Supported.pod
941 Avoid core-dump in Encode with PERLIO=mmap by NI-S
942 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
947 ! lib/Encode/Suppoted.pod
948 pod fixes to replace F<http://...> to L<http://...>,
949 as suggested by Autrijius in:
950 Message-Id: <20020324083943.GA14901@not.autrijus.org>
951 ! lib/Encode/Suppoted.pod
952 fixes and enhancements by Anton
953 Message-Id: <10632060120.20020324103753@motor.ru>
954 ! lib/Encode/Alias.pm
955 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
956 added. Suggested by Anton then deobfuscated by Autrijius
957 Message-Id: <20020324064455.GA3667@not.autrijus.org>
959 Further fix by Nicholas Clark
960 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
961 - lib/EncodeFormat.pod
962 + lib/Encode/EncFormat.pod
964 File renamed as suggested by Autrijius
966 ! lib/Encode/Details.pod
967 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
968 ! Encode.pm Sun Mar 24 13:43:47 2002
969 pod fixes by Autrijius.
970 Message-Id: <20020324062804.GA3595@not.autrijus.org>
971 Message-Id: <20020324075627.GB11986@not.autrijus.org>
973 ! lib/Encode/Alias.pm
975 now more EBCDIC conscious;
976 %ExtModules on EBCDIC system excludes CJK so that you don't
977 have to worry about the matched alias resulting cloaking.
978 t/Alias.t also revised to reflect changes. Verified by jhi
979 Message-Id: <20020324022929.D22596@alpha.hut.fi>
985 EBCDIC detection mechanism installed as in JP/JP.pm
986 Message-Id: <20020323211847.G19148@alpha.hut.fi>
994 Now all table files used by compile are postfixed '_t' to avoid
995 namespace collisions in case insensitive file systems once for all!
997 Message-ID: <58290227735.20020323195659@familiehaase.de>
999 Since the Encode::JP is unsupported under EBCDIC we
1000 cannot run this test (aliases as such should work fine) -- jhi
1001 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1003 duplicate occurance of ascii.ucm and 8859-1.ucm
1004 causes MacOS X dlyd to cloak
1010 < chdir 't' if -d 't';
1012 > if (! -d 'blib' and -d 't'){ chdir 't' };
1013 When you are "make test"-ing on Encode/ directory, you must not
1014 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1015 remain unchanced. Also the situation detection was made simpler
1016 in t/JP.t, which was originally;
1017 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1019 "Use of uninitialized value in string eq at Encode.pm line 96."
1020 ! Symbol/Makefile.PL
1021 ! EBCDIC/Makefile.PL
1023 -- Problem on case insensitive file systems
1024 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1025 Message-ID: <88254111953.20020323095503@familiehaase.de>
1028 "So I think it's a bug in gcc, not perl. But it still needs to be
1030 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1031 Message-Id: <20020323170509.C96475@plum.flirble.org>
1033 0.96 Sat Mar 23 2002
1035 ! lib/Encode/Encoding.pm
1036 ! lib/Encode/Alias.pm
1037 ! lib/Encode/Supported.pod
1039 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1040 Message-ID: <20020322073908.GB10539@blackrider>
1043 "...I think we should include ISO 8859-1 as well." -- NI-S
1044 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1049 ! lib/Encode/Alias.pm
1050 alias definitions relocated to Encode::Alias so module autoloading
1051 works for aliases also.
1053 encodings() now accepts args to check ExtModules.
1057 + EBCDIC/Makefile.PL
1058 + Symbol/Makefile.PL
1062 Latin and single byte encodings are reorganized so they are
1063 demand-loaded like Encode::XX. Now only ascii is compiled into
1065 ! lib/Encode/Alias.pm
1066 for my $k (keys %hash){ delete $hash{$k}; }
1067 is depreciated; fixed.
1069 0.95 Fri Mar 22 2002
1070 In this update, pod rewrites and alias fixes are the main issues
1071 + lib/Encode/Supported.pod
1072 Describes supported encodings
1074 streamlined compiled-in encodings.
1075 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1077 + Encode/ibm-125?.ucm
1078 Added from icu distibution with any occurance of
1079 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1080 some respect to icu staff, however.
1081 + lib/Encode/Alias.pm
1083 Alias difinitions in Encode.pm relocated.
1086 packWARN patch from Paul Marquess via jhi
1087 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1088 Paul added to AUTHORS as a result.
1089 ! t/CJKalias.t -> t/Aliases.t
1090 Renamed. Checks even more aliases and alias overloading
1093 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1094 gbk => cp936 relocated to CN.pm
1096 Test::More with plans (by jhi)
1098 0.94 Thu Mar 21 2002
1099 + lib/Encode/Description.pod
1100 ! lib/Encode/Encoding.pm
1101 Now the pod in Encode.pm is abridged as programming references.
1102 lib/Encode/Description.pod contains the original, detailed description
1103 and Encode::Encoding explains how to write your own module to
1104 add new encodings. So far, lib/Encode/Description.pod contains
1105 the whole pod once in Encode.pm. This is intentional.
1107 Pod revisions by Anton Tagunov
1108 Message-Id: <517178431.20020320174824@motor.ru>
1110 all occrance of Encode::Tcl::Extended removed including pod
1112 test now checks $encoding->name only; $encoding->{name} are
1113 no longer check to find the canonical name.
1114 ! lib/Encode/JP/JIS.pm
1115 ! lib/Encode/JP/ISO_2022_JP.pm
1116 ->name() added to be more compliant with API
1122 Patch by Autrijus to add aliases to TW and fixes to POD
1123 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1125 SADAHIRO Tomoyuki added as should. My apologies.
1127 0.93 Wed Mar 20 2002
1128 * First release to be uploaded to CPAN. For prehistoric changes,
1129 please see Changes file of perl distibution as well as
1130 perl-unicode@perl.org archive, available at:
1131 http://archive.develooper.com/perl-unicode@perl.org/
1133 Changes Since 0.92 includes;
1138 + Mention to perl-unicode@perl.org added
1140 + Encoding aliases added so you can feed locale names
1141 and MIME Charset="" directly.
1142 - Mention to JISX0212 removed because it's fixed
1145 + Encoding aliases added. Note TW is left untouched because
1146 euc-tw is not implemented in TW but in Encode::HanExtra.
1147 Autrijus, you may fix Encode::HanExtra.
1149 + to test encode aliases added