1 # Revision history for Perl extension Encode.
3 # $Id: Changes,v 1.75 2002/06/01 18:07:49 dankogai Exp dankogai $
6 $Revision: 1.75 $ $Date: 2002/06/01 18:07:49 $
7 ! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm
8 glibc compliance cited by Autrijus.
9 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html
10 ! bin/enc2xs bin/piconv
11 Subject: Re: forewarning: usedevel and versiononly
12 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl>
14 1.74 2002/05/28 18:33:15
15 + ucm/null.ucm ucm/ctrl.ucm
16 ! Makefile.PL bin/enc2xs lib/Encode/Supported.pod
17 "null" and "ascii-ctrl" encodings added upon the request of Autrijus
18 Subject: Re: unicode -> &# notation
19 Message-ID: <20020518193704.GB40272@not.autrijus.org>
21 1.73 2002/05/28 17:26:18
22 ! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS
23 Chris Nandor has fixed Encode so that it works w/ MacPerl --
24 at least w/ PPC (68k need static linking which does not work due to
25 64k limit). pudge is added to AUTHORS (I'm surprised he was not
26 there in the list). Encode/Makefile_PL.e2x was additionally fixed
27 by dankogai to reflect changes in other Makefile.PL
28 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]>
30 Subject: Change 16746: -Mutf8 cleanup.
31 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com>
33 1.72 2002/05/20 15:49:56
35 Subject: [PATCH] Encode should be in perl-core library path
36 Message-Id: <86r8k7h738.wl@mail.edge.co.jp>
37 Message-Id: <20020520161201.A11019@alpha.hut.fi>
38 ! lib/Encode/MIME/Header.pm
39 Subject: [PATCH] Encode::MIME::Header
40 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp>
41 ! Encode/Makefile_PL.e2x
42 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32
43 Message-Id: <20020519201031.GA1603@not.autrijus.org>
44 ! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL
45 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL
47 @16628 and @16652 from Vadim. Vadim was added to AUTHORS.
48 Subject: [PATCH] good day for WinCE port of perl.
49 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad>
52 Even more linting by Robin via @16532
54 Even more typecast by Sarathy in @16460
56 1.71 2002/05/07 16:22:42
58 even more typecasts by Robin
59 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk>
61 A very strange bug that was causing a bugus ucm -> C table
62 generation that was revealed by a UCM file that Andreas was
63 working. This is the king of wierdest bug I've encountered
64 in the course of Encode maintenance.
65 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp>
67 1.70 2002/05/06 10:26:48
69 Made more 'module-safe' with conjunction w/ 'no encoding'.
70 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp>
71 ! lib/Encode/Encoding.pm
72 'require Encode' because ->Define uses Encode::define_encoding();
73 problem and solution addressed by Miyagawa-kun
74 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp>
76 Cuts the frill to make djgpp happier, as suggested by Laszlo
77 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se>
79 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas
80 Message-Id: <m3bsbug48n.fsf@anima.de>
82 1.69 2002/05/04 16:41:18
83 ! lib/Encode/MIME/Header
84 Floating-point coerced for UNICOS (in integer arithmetics it folds
85 line one character too early). Verification by Mark is pending.
86 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp>
88 more doc patch from Elizabeth
89 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl>
90 ! Encode/Makefile_PL.e2x
91 More platform-independent patch from Benjamin
92 Message-Id: <3CD31BE0.69F79B06@earthlink.net>
93 ! lib/Encode/Guess AUTHORS
94 split regex fix by Graham Barr. Adds him to AUTHORS.
95 Message-Id: <20020504085419.E95940@valueclick.com>
96 ! Encode/Makefile_PL.e2x
97 enc2xs script discovery made smarter and more sensible, first cited
98 by Miyagawa-kun and further suggestions by Rafael and Andreas
99 ! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t
100 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi
102 1.68 2002/05/03 12:20:13
103 ! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS
104 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t
105 and Supported.pod modified to reflect the change. Elizabeth added
106 to Authors. And H.M. is also added for forwarding her patch among
107 other contributions (I was rather surprised to find his name was not
109 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl>
111 1.67 2002/05/02 07:33:09
113 Error message now consistent w/ perlqq (\N{U+} -> \x{})
114 done in perl@16308 but Philip linted me further. Now the error
115 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP
116 ! lib/Encode/Guess.pm
117 Sanity check for happier -w by Autrijus
119 1.66 2002/05/01 05:41:06
120 ! Encode.xs t/fallback.t
121 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning
122 while fallback is in effect. This even came with a welcome side-effect
123 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is
124 also modified to test this.
125 And of course, the corresponding varialbles to UV[Xx]f are appropriately
126 cast. This should've concluded NI-XS homework.
128 encode(undef) does warn again! Repented upon suggestion by NI-XS.
129 Document for unless vs. '' added
130 Message-Id: <20020430171547.3322.13@bactrian.elixent.com>
132 1.65 2002/04/30 16:13:37
134 encode(undef) no longer warns for C<Use of uninitialized value in
135 subroutine entry>. Suggested by Paul.
136 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com>
137 ! lib/Encode/Supported.pod
138 Encode::MIME::Header and Encode::Guess mentioned
139 Updated for Encode::HanExtra 0.05 and Encode::JIS2K
140 ! lib/Encode/Guess.pm
141 POD fix by Miyagawa-kun
142 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp>
144 1.64 2002/04/29 06:54:06
146 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and
147 encocde("euc-jisx0213") are still DIFFERENT.
148 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp>
150 A few white spaces corrected by NI-XS via PerlIO integration to
152 Subject: Change 16247: Integrate perlio;
154 Document fixes by Andreas
155 Message-Id: <m3k7qsf1we.fsf@anima.de>
157 1.63 2002/04/27 18:59:50
158 ! lib/Encode/Encoding.pm
159 ! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm
160 ! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm
161 Make use of the Encode::Encoding base class!
162 And other cleanups in Encode.xs upon NI-XS suggestions
163 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com>
165 1.62 2002/04/27 11:17:39
167 encodings() now just check %ExtModule instead of eval{require}
168 all of them for ":all" to conserve more memory.
170 more "%x" -> "%" UVxf stuff.
172 s/=over2/=over 2/g # oops.
174 1.61 2002/04/26 03:02:04
176 Now does decent tests besides use_ok()
177 ! lib/Encode/Guess.pm t/guess.t
178 UI streamlined, document added
180 various signed/unsigned mismatch nits (#16173)
181 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173
183 POD: utf8-flag-related caveats added. A few sections completely
187 Thou shalt not assume %d works, either!
188 Robin Baker added to AUTHORS for this
189 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk>
191 "Change 16144 by gsar@onru on 2002/04/24 18:59:05"
193 1.60 2002/04/24 20:06:52
195 "Thou shalt not assume %x works." -- jhi
196 Message-Id: <20020424210618.E24347@alpha.hut.fi>
197 ! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make
198 low-memory build machines happy, now *.c is created for each *.ucm
199 (no table aggregation). You can still override this by setting
200 $ENV{AGGREGATE_TABLES}.
201 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp>
202 + lib/Encode/Guess.pm
203 + lib/Encode/JP/JIS7.pm
204 Encoding-autodetect (mainly for Japanese encoding) added. In a
205 course of development, JIS7.pm was improved.
206 + lib/Encode/HTML/Header.pm
207 + lib/Encode/Config.pm
208 MIME B/Q Header Encoding Added!
209 ! Encode.pm Encode.xs t/fallback.t
210 new fallbacks; XMLCREF and HTMLCREF upon Bart's request.
211 Message-Id: <20020424130709.GA14211@tanglefoot>
213 1.59 $ 2002/04/22 23:54:22
214 ! Encode.pm Encode.xs
215 needs_lines() and perlio_ok() are added to Internal encodings such
216 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped.
218 1.58 2002/04/22 23:54:22
220 s/MacChineseSimp/MacChineseTrad/ # ... oops.
225 Now all CJKT encodings go thru round-trip test via t/CJKT.t.
226 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t
227 t/(JP|KR).t are aggregated to t/CJKT.t
228 test data are all remade via bin/ucm2text.
229 And .... They are no longer skipped for -Uuseperlio !
231 1.57 2002/04/22 20:27:30
232 ! t/JP.t t/KR.t t/perlio.t
233 unless (find PerlIO::Layer 'perlio') ... line is back again.
234 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines
235 need PerlIO. Sigh....
236 ! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t
237 ->perlio_ok now does eval{ require PerlIO::encoding } there so
238 it correctly returns 1 when PerlIO::encoding is yet loaded.
240 perl-current patch #16072 reflected
242 1.56 2002/04/22 09:48:07
243 ! Encode.pm encoding.pm t/perlio.t t/jperl.t
244 New PerlIO::encoding 0.04 compliance met
246 1.55 2002/04/22 03:43:05
247 ! Encode.pm Encode.xs Unicode/Unicode.pm
248 needs_lines() defined so Encode::Encoding is no longer needed
251 1.54 2002/04/22 02:50:01
252 ! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t
253 ! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm
254 now perlio_ok is true by default if PerlIO::encoding->VERSION is
255 0.03 or larger. POD in Encode::Encoding revised to reflect this.
256 Encode::XS and Encode::Unicode now has perlio_ok() method.
257 ! lib/Encode/Supported.pod
258 s/UP-UX/HP-UX/ by jhi
259 ! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README
260 ! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv
261 ! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm
262 ! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm
263 ! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod
264 ! lib/Encode/Supported.pod
265 Huge document fixes by Philip.
268 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS
269 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049
271 binmode() after "<:encoding" to make Win32 happy, by Mattia.
272 Mattia added to AUTHORS file
273 Message-Id: <3CC3150F.5798.22A05AE@localhost>
275 1.52 2002/04/20 23:43:47
277 TODO: is now SKIP:, as NI-XS requested. Also adds more
278 eraborate failure analysis added.
280 A note on how to make sure of round-trip safety added to POD
281 section (so Autrijus is happier)
282 ! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm
283 big5-(eten|hkscs) is round-trip safe again!
284 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp>
286 Typo fixes by Andreas
287 ! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm
288 ! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t
289 PerIO coodination patches from NI-XS.
290 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp>
292 1.51 2002/04/20 09:58:23
294 Updated test suite by Autrijis so "make test" is happy again
295 Message-Id: <20020420082104.GA25037@not.autrijus.org>
297 ! ucm/big5-hkscs.ucm lib/Encode/Alias.pm
299 TW/TW.pm TW/Makefile.PL
300 Updates by Autrijus. 'big5' is no longer a canonical but an
301 alias to 'big5-eten'. big5-hkscs is now in 2001 edition.
302 Message-Id: <20020419195346.GA19597@not.autrijus.org>
304 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK
305 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com>
307 PerlIO detection a little bit smarter; no longer uses eval qq{}
310 1.50 2002/04/19 06:13:02
311 ! ! Encode.pm Encode.xs Encode/encoding.h
313 New Fallback API imlemented and documented. See "perldoc Encode"
315 ! lib/Encode/JP/JIS7.pm Encode.pm
316 + lib/Encode/PerlIO.pod t/perlio.t
317 API compliance met. However, it still does not work unless perlio
318 implements line buffer. See BUGS section in perldoc Encode::PerlIO
319 As a sensible workaround, perlio_ok() added to Encode.
321 ! lib/Encode/Supported.pod
323 Message-Id: <20020418174647.J8466@alpha.hut.fi>
325 Doc fixes from Autrijus
326 Message-Id: <20020418144131.GA10987@not.autrijus.org>
328 perlqq mode documented
330 + t/jisx0201.euc t/jisx0201.ref
331 ! t/jisx0208.euc t/jisx0208.ref
332 t/JP.t tests more rigorously and with other encodings
333 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY
334 in jis0208 (used to contain jisx0201 part).
335 ! Encode/Makefile_PL.e2x
336 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers
337 enc2xs and encode.h rather than hard-coded. This allows the resulting
338 module fully CPANizable.
339 ! encoding.pm t/JP.t t/KR.t
340 PerlIO detection simplified (checks %INC instead of eval{})
341 ! Encode.xs Encode/encode.h
342 + Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs
343 - lib/Encode/Unicode.pm
344 (en|de)code_xs relocated to where it belongs. Source reindented
347 Additional (U8 *) cast added as suggested by jhi
348 Message-Id: <20020417165916.A28599@alpha.hut.fi>
350 1.42 Date: 2002/04/17
352 no-op module; Thought of adding a pod there but enc2xs has
357 correct mechanism to detect Perlio::encoding layar installed.
359 PerlIO Layer detached.
361 1.41 2002/04/16 23:35:00
363 binmode(STDIN|STDOUT ...) done iff PerlIO is available
365 Cleaned up PerlIO skip conditions to prepare for the upcoming
366 Encode - PerlIO forking.
368 exported functions are now prototyped.
369 ! lib/Encode/CN/HZ.pm
372 fallback implemented # was /* FIXME */
373 affected programs revised to fit (only HZ was using the try-catch
374 approach which needed to be fixed for API-compliance).
376 ! Encode/KR/2022_KR.pm
378 can find =head1 NAME now, jhi
379 Message-Id: <20020416083059.V30639@alpha.hut.fi>
383 now complies with less warnings with the pickest compilers.
384 Suggested by Craig, fixed by Dan.
385 ! Encode/Makefile_PL.e2x
387 A bug that fails to find *.e2x in certain conditions fixed
389 1.40 2002/04/14 22:27:14
390 + Encode/ConfigLocal_PM.e2x
391 ! lib/Encode/Config.pm
393 "enc2xs -C" now generates/updates Encode::ConfigLocal.
394 ConfigLocal_PM.e2x is a skelton thereof.
395 ! lib/Encode/Config.pm
397 "use Encode::CN::HZ;" was missing.
400 More rigorous tests added to test XS, especially on memory allocation.
402 ! lib/Encode/Unicode.pm
403 NI-S implemented an XS version -- merged
404 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com>
407 Source filter option added. With this option on, you can write
408 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy
409 encodings. t/jperl.t enhanced to test this feature.
411 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly
412 apply his suggestion, this degree of nitting is enough to add him
414 Message-Id: <3CB93223.291E5E2E@earthlink.net>
416 + lib/Encode/JP/JIS7.pm
417 - lib/Encode/JP/JIS.pm
418 - lib/Encode/JP/2022_JP.pm
419 - lib/Encode/JP/2022_JP1.pm
420 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to
421 JIS7.pm for better maintainability and performance
423 Added caveat for non-ascii identifiers.
425 fixes by jhi, the original author of this pragramtic module.
426 Message-Id: <20020413231527.V1826@alpha.hut.fi>
428 1.34 2002/04/12 20:23:05 (Unreleased)
431 EBCDIC fixes addressed by jhi.
432 Message-Id: <20020412161844.D9383@alpha.hut.fi>
433 ! lib/Encode/Encoder.pm
434 POD fix by Miyagawa-kun
435 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp>
437 1.33 2002/04/10 22:28:40
439 Philip's mail address corrected.
442 ! lib/Encode/Encoder.pm
443 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :)
444 This adds Miyagawa-kun to AUTHORS.
445 * encoding() no longer exported by default but on demand
446 * t/Encoder.t updated to test all these
447 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp>
448 ! lib/Encode/Unicode.pm
449 ! lib/Encode/Supported.pm
450 Further doc fixes by Anton
452 1.32 2002/04/09 20:06:15
455 - ucm/macDevanaga.ucm Unicode Character Map
456 - ucm/macGujarati.ucm Unicode Character Map
457 - ucm/macGurmukhi.ucm Unicode Character Map
458 A utility to check integrity of .ucm files. t/bogus.ucm is a
459 ucm that is deliberately bogus. unused Indic mappings are removed
462 resolve_alias() added as suggested by jhi. Same as
463 find_encoding("alias")->name. For convenience. This one is
464 defined in Encode.pm instead of Alias.pm.
465 Message-Id: <20020409215846.H17022@alpha.hut.fi>
467 Memory Allocate but detected during the devel of ucmlint -- fixed.
468 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp>
469 ! lib/Encode/Unicode.pm
470 valid_ucs2(0) is false but must be true.
471 3 patches from NI-S as follows. This also has fixed the incident
473 ! lib/Encode/Alias.pm
474 find_alias() recursion prevention
476 Checks for the patch above
477 ! t/Encode/Unicode.pm
478 An extra "F" that causes valid_ucs2() return a bogus value fixed
479 Message-Id: <20020409133927.17803.1@bactrian.elixent.com>
480 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu>
481 2 Small Patches from jhi as follows:
483 Encode->encodings() lists in case-insensitve order (as it was)
485 -l option prints avaiable encodings to STDOUT instead of STDERR
486 ! lib/Encode/Aliases.pm
487 s/defintion/definition/
488 Message-Id: <200204082306.CAA21033@alpha.hut.fi>
490 ! lib/Encode/Supported.pod
491 ! lib/Encode/Unicode.pm
492 POD revise by Philip Newton. This adds Philip to AUTHORS list.
493 Thank you for the exact quote of Douglas Adams :)
494 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com>
496 1.31 2002/04/08 18:08:07
497 ! lib/Encode/Encoder.pm
499 Encode::Encoder, once just a placeholder of an idea, is now much more
500 practical. See t/Encode.t to find how practical it can be.
501 + lib/Encode/Config.pm
503 my false laziness at Encode.pm is fixed. Now %ExtModules are set
504 in Encode::Config and they are all literally, not programatically
505 set. My false laziness was resulting many encodings missing from
507 ! lib/Encode/Unicode.pm
509 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed
510 so that it does not rely Encode::Unicode for BOM values
511 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp>
513 1.30 2002/04/08 02:34:51
514 + lib/Encode/Encoder.pm
515 Object Oriented Encoder. I reckon something like this is in need.
518 ! lib/Encode/Supported.pod
519 * autoloading bug that prevented upper-case canonicals such as UTF-16
520 is fixed. Now even UTF/UCS are autoloaded!
521 * encodings() is now more intuitive.
522 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are
524 * Obligatory fixes to the POD.
525 ! lib/Encode/Supported.pod
526 Patch from Anton applied.
527 Message-Id: <66641479.20020408033300@motor.ru>
529 ! lib/Encode/Unicode.pm
530 Cosmetic changes: "bless $obj, $class" => "bless $obj => class"
532 1.28 2002/04/07 18:58:42
536 Just a MANIFEST for those missing files.
538 1.26 Date: 2002/04/07 15:22:04
541 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored.
542 And t/Aliases.t fixed to make sure they all exist.
544 1.25 2002/04/07 15:01:25 (Unreleased)
546 ! lib/Encode/Unicode.pm
549 - lib/Encode/UTF_EBCDIC.pm
550 - lib/Encode/Internal.pm
552 Integrated into Encode.pm as closures. That way "one package, one file"
553 rule is preserved yet less files to require.
555 commented out binmode(STDERR ...
564 ! Encode/Makefile_PL.e2x
565 Schwarn's MM-compliance patch merged
566 Message-Id: <20020406082609.GA28758@blackrider>
568 ! lib/Encode/Unicode.pm
569 + lib/Encode/UTF_EBCDIC.pm
571 - lib/Encode/10646_1.pm
572 - lib/Encode/ucs2_le.pm
573 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of
574 per-module based (en|de)code, I saved a number of .pm by
575 reorganizing it as per-object base (Well, this is what Encode::XS
576 does under the hood). See Encode::Unicode for details.
577 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm.
578 This module is used only on EBCDIC environments.
580 1.21 2002/04/05 14:46:34 (Not Released)
586 Are back to make Perl/Tk happy Smile, NI-S.
588 ! lib/Encode/Alias.pm
589 ! lib/Encode/Supported.pm
590 ! lib/Encode/10646_1.pm
591 ! lib/Encode/ucs2_le.pm
592 UCS-16BE is now canonical for UCS-2/ISO-10646-1.
593 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents
594 updated to reflect changes.
595 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com>
596 ! lib/Encode/Alias.pm
597 ! lib/Encode/Supported.pm
598 Anton's revision commited. Added Dan's own fixes as well.
599 Message-Id: <159103166906.20020405161134@motor.ru>
600 ! lib/Encode/Alias.pm
602 < qr/^UCS2-le$/i => '"UCS-2"', );
604 > qr/^UCS2-LE$/i => '"UTF-16LE"');
605 Sigh. Thank you, Anton.
606 Message-Id: <14567692196.20020405062020@motor.ru>
607 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp>
609 1.20 2002/04/04 19:50:52
611 the last minute addtion. Just give it a try. Docs remains to be done.
612 Not installed by default.
613 ! lib/Encode/Supported.pod
616 ! lib/Encode/Alias.pm
618 ! lib/Encode/10464_1.pm
619 ! lib/Encode/ucs2_le.pm
620 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left
621 unchanged but UTF-16BE is added as an alias. Implicit aliases
622 move to Encode::Alias so init_alias() works more as expected.
623 Also, 'utf8' is now canonical with 'UTF-8' being an alias.
624 Though pedantically wrong, This should make perl mongers happier.
625 t/Alias.t is enhanced to test all these.
626 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp>
628 Now all .ucm are stacked in byte_t; They all share ascii part so 50%
629 of the codepoints are common. CJKT left as is because the saving is
635 ! Encode/Makefile_PL.e2x
643 All occurance of _def.h replaced with .exh so djgpp works happily
644 ever after! To credit this amazing discovery, Laszlo is now in
646 Message-Id: <20020403181424.GA8778@freemail.hu>
647 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp>
650 ! Encode/Makefile_PL.skel
652 No more @INC fiddling! Uses $ENV{PERL_CORE} instead
653 Message-Id: <20020401222744.GX2000@blackrider>, et al.
655 Two more tests by added jhi
656 Message-Id: <200204020000.DAA25121@alpha.hut.fi>
659 The showstopper fixed -- Memory reallocation bug was causing
660 Encode::XS to fall into infinite loop on certain conditions.
662 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al
667 ! lib/Encode/Supported.pod
668 Vendor encodings rebuilt out of original map files at unicode.org.
669 Indic languages such as MacDevanagali remain unspported do to the
670 shortcoming of encengine capabilities (they need algorithmical
671 conversion and I have no knowledge on that!). Pods fixed for added
673 Oh, macJapan.ucm renamed to macJapanese.ucm.
674 macROMnn is macRomanian and macRUMnn is macRumanian.
675 txt2ucm is a crude script that is used to convert them.
677 Unicode Compound Characters (used extensively on Mac) supported
679 Typo fixes and improvements by jhi
680 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al.
682 1.11 $Date: 2002/06/01 18:07:49 $
686 Missing files from the MANIFEST fixed.
687 Message-Id: <20020401010156.H10509@alpha.hut.fi>
688 Version incremented just to make CPAN happy.
690 1.10 2002/03/31 21:32:42
693 INSTALL_UCM option added to Makefile.PL so you can install *.ucm
694 if you want. This should make Autrijus happy. Also, piconv
695 is added to default install.
698 Here-documented files that enc2xs generates are now exported
699 to *.e2x. Much cleaner and easier to debug.
701 encoding enhances so you can make it act more like such
702 (now prehistoric ) "localized" variations of perl like Jperl.
704 Further test for encoding.pm. Written in euc-jp
708 Message-Id: <20020330174618.B10154@alpha.hut.fi>
713 *.ucm relocated to ucm/ so MakeMaker will not install'em by default.
720 ! Encode/macIceland.ucm
721 ! lib/Encode/Alias.pm
722 ! lib/Encode/Supported.pod
723 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS.
724 Message-Id: <m3lmcavhjt.fsf@anima.de>
726 1.01 2002/03/29 20:59:39
729 s/USE_SCRIPTS/MORE_SCRIPTS/
731 installs enc2xs by default for external Encode:: modules in CPAN,
732 such as Encode::HanExtra
734 More sensible perl core detection via $ENV{PERL_CORE}
736 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US>
738 Perl core ditection via $^X =~ m/\bminiperl$/o
739 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp>
743 The version of all files is updated to 1.00 via "ci -f -l1.00",
744 commemorating version 1.00. All files, including *.ucm are now
745 under version control.
748 encode.h moved to Encode/ so it will be installed for the later
751 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)"
756 compile renamed to enc2xs.
757 Affected Makefle.PL updated
759 "Punt it. HanExtra can take care of that later." -- Autrijus
760 Message-Id: <20020328154338.GA7351@not.autrijus.org>
764 ! lib/Encode/CJKConstants.pm
765 ! lib/Encode/KR/2022_KR.pm
766 Table patches for Euro Signs, 2022-KR fixups by Jungshik
767 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org>
771 bin/ added for example scripts. They are not installed by default.
772 to install them, "perl Makefile.PL USE_SCRIPTS".
773 piconv is iconv reinvented in perl. in addition to all features
774 of iconv, it also adds perlish features. See L<piconv/1> for more
776 ! lib/Encode/Alias.pm
777 qr/^ replaced with qr/\b so it directly matches locale names
778 such as en_US.US-ASCII
781 Patch by MJD to fix the following problem applied.
782 Subject: [PATCH 5.7.3 Encode]
783 Aliases.t not properly skipped when Encode extension not built
784 Message-Id: <20020328091850.18677.qmail@plover.com>
785 ! lib/Encode/KR/2022_KR.pm
786 ! lib/Encode/CJKConstants.pm
787 Another patch from Jungshik to make iso-2022-kr actually work
788 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org>
789 ! Encode/Encode/euc-kr.ucm
790 + Encode/Encode/johab.ucm
791 ! Encode/Encode/ksc5601.ucm
793 ! Encode/KR/Makefile.PL
794 ! Encode/lib/Encode/Alias.pm
796 Johab support and complete revision of Korean Encoding by Jungshik
797 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org>
799 Revised to make up with now-dropped Encode::Details.
800 - lib/Encode/Details.pod
801 Dropped. Besides being obsolete, the topics are now covered in
807 Korean aliases fixed thanks to Jungshik Shin
808 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936
809 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org>
811 =head1 NAME added to all modules to make buildtoc happy
812 Message-Id: <20020327041151.A10618@alpha.hut.fi>
813 - lib/Encode/CJKguide.pod
814 Too controversial and dropped from the dist. Will be available
815 separately on the web.
817 RCS tags added so table debugging gets easier (should that be
818 needed! I hope they all stay 1.00!)
819 + lib/Encode/CJKguide.pod
820 A detailed guide to mainly, but not limited to, CJK multibyte
823 + Encode/hp-roman8.ucm
825 ! Encode/Supported.pod
826 All occurance of "roman8" replaced with "hp-roman8" to avoid
828 ! Encode/Supported.pod
831 Mac Encodings now comply the Inside Macintosh
833 Test for '-raw' conventions added.
835 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr
839 "-raw" appended to canonical names.
840 File mames stay unchanged thanks to UCM format.
841 ! lib/Encode/CN/HZ.pm
842 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting
843 Message-Id: <20020326035210.GA2091@not.autrijus.org>
846 - lib/Encode/JP/Const.pm
847 + lib/Encode/CJKConstants.pm
848 + lib/Encode/CN/2022_CN.pm
849 + lib/Encode/KR/2022_KR.pm
858 * Support for ISO-2022-KR and ISO-2022-CN added.
860 * more t/*.{euc,ref} added, which was autogenerated from ucm2table
861 * ucm2table autogenerates character table out of UCM files.
864 - lib/Encode/Supports.pod
865 + lib/Encode/Supported.pod
866 Names reverted due to popular demand.
867 8.3 rule applies only when there is a conflict.
868 Message-Id: <20020325095924.GD44120@not.autrijus.org>
873 - lib/Encode/Format/Enc.pod
875 * Character tables is now 100% ucm.
876 * All files under Encode/ is now 8.3-compliant
877 * some of missing encodings added (i.e. gsm0338 and nextstep)
878 * Vendor mappings aggregated with appropriate national std in
879 Makefile.PL, resulting smaller *.so especially for CJK.
880 Following is result on Dan's FreeBSD box.
882 ---------------------------------------------------------------
883 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042
884 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685
885 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476
886 blib/arch/auto/Encode/Encode.so 27,791 27,791
887 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811
888 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587
889 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990
890 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437
891 ---------------------------------------------------------------
892 Total 5,375,297 6,343,819
894 * As a result of ucm-transition, Encode::Tcl dropped because
895 Encode::Tcl demands *.enc.
896 Encode::Tcl will be supplied in a separate tarball with *.enc.
897 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp>
902 -lib/Encode/Supported.pod
903 +lib/Encode/Supports.pod
904 -lib/Encode/iso10646_1.pm
905 +lib/Encode/10646_1.pm
906 -lib/Encode/EncFormat.pod
907 +lib/Encode/Format/Enc.pod
908 Files renamed 8.3 filename compliance. Affected modules/scripts revised.
909 - lib/Encode/JP/Constants.pm
910 + lib/Encode/JP/Consts.pm
911 ! lib/Encode/JP/JIS.pm
912 ! lib/Encode/JP/H2Z.pm
913 Version nit problem and 8.3 rule fix.
914 > Package namespace installed latest in CPAN file
915 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz
916 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3
917 rule. Contants.pm renamed to Consts.pm and affected modules are fixed
918 accordingly. In addition, legacy "use vars qw()..." are replaced with
920 Message-Id: <20020325011248.D1561@alpha.hut.fi>
921 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp>
923 - lib/Encode/JP/ISO_2022_JP.pm
924 - lib/Encode/JP/ISO_2022_JP_1.pm
925 + lib/Encode/JP/2022_JP.pm
926 + lib/Encode/JP/2022_JP1.pm
928 8.3 naming conflict for vanilla fat addressed by jhi
929 Message-Id: <20020324201931.V22596@alpha.hut.fi>
932 Typecast fix addressed by jhi
933 Message-Id: <20020324185540.T22596@alpha.hut.fi>
936 ! lib/Encode/Supported.pod
938 + lib/Encode/JP/ISO_2022_JP_1.pm
939 ! lib/Encode/JP/ISO_2022_JP.pm
940 ! lib/Encode/JP/JIS.pm
942 Now Encode::JP is more strict on the difference between ISO-2022-JP
943 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move
944 makes Anton happier :) FYI the previous version implements
945 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support.
946 ! lib/Encode/Supported.pod
949 Avoid core-dump in Encode with PERLIO=mmap by NI-S
950 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com>
955 ! lib/Encode/Suppoted.pod
956 pod fixes to replace F<http://...> to L<http://...>,
957 as suggested by Autrijius in:
958 Message-Id: <20020324083943.GA14901@not.autrijus.org>
959 ! lib/Encode/Suppoted.pod
960 fixes and enhancements by Anton
961 Message-Id: <10632060120.20020324103753@motor.ru>
962 ! lib/Encode/Alias.pm
963 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' );
964 added. Suggested by Anton then deobfuscated by Autrijius
965 Message-Id: <20020324064455.GA3667@not.autrijus.org>
967 Further fix by Nicholas Clark
968 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
969 - lib/EncodeFormat.pod
970 + lib/Encode/EncFormat.pod
972 File renamed as suggested by Autrijius
974 ! lib/Encode/Details.pod
975 ! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002
976 ! Encode.pm Sun Mar 24 13:43:47 2002
977 pod fixes by Autrijius.
978 Message-Id: <20020324062804.GA3595@not.autrijus.org>
979 Message-Id: <20020324075627.GB11986@not.autrijus.org>
981 ! lib/Encode/Alias.pm
983 now more EBCDIC conscious;
984 %ExtModules on EBCDIC system excludes CJK so that you don't
985 have to worry about the matched alias resulting cloaking.
986 t/Alias.t also revised to reflect changes. Verified by jhi
987 Message-Id: <20020324022929.D22596@alpha.hut.fi>
993 EBCDIC detection mechanism installed as in JP/JP.pm
994 Message-Id: <20020323211847.G19148@alpha.hut.fi>
1000 ! Symbol/Makefile.PL
1002 Now all table files used by compile are postfixed '_t' to avoid
1003 namespace collisions in case insensitive file systems once for all!
1005 Message-ID: <58290227735.20020323195659@familiehaase.de>
1007 Since the Encode::JP is unsupported under EBCDIC we
1008 cannot run this test (aliases as such should work fine) -- jhi
1009 Message-Id: <20020323202119.D19148@alpha.hut.fi>
1011 duplicate occurance of ascii.ucm and 8859-1.ucm
1012 causes MacOS X dlyd to cloak
1018 < chdir 't' if -d 't';
1020 > if (! -d 'blib' and -d 't'){ chdir 't' };
1021 When you are "make test"-ing on Encode/ directory, you must not
1022 change $ENV{PWD}. t/JP.t has been fixed before but others somehow
1023 remain unchanced. Also the situation detection was made simpler
1024 in t/JP.t, which was originally;
1025 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o;
1027 "Use of uninitialized value in string eq at Encode.pm line 96."
1028 ! Symbol/Makefile.PL
1029 ! EBCDIC/Makefile.PL
1031 -- Problem on case insensitive file systems
1032 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible"
1033 Message-ID: <88254111953.20020323095503@familiehaase.de>
1036 "So I think it's a bug in gcc, not perl. But it still needs to be
1038 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net>
1039 Message-Id: <20020323170509.C96475@plum.flirble.org>
1041 0.96 Sat Mar 23 2002
1043 ! lib/Encode/Encoding.pm
1044 ! lib/Encode/Alias.pm
1045 ! lib/Encode/Supported.pod
1047 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi
1048 Message-ID: <20020322073908.GB10539@blackrider>
1051 "...I think we should include ISO 8859-1 as well." -- NI-S
1052 Message-Id: <20020322120230.1332.8@bactrian.elixent.com>
1057 ! lib/Encode/Alias.pm
1058 alias definitions relocated to Encode::Alias so module autoloading
1059 works for aliases also.
1061 encodings() now accepts args to check ExtModules.
1065 + EBCDIC/Makefile.PL
1066 + Symbol/Makefile.PL
1070 Latin and single byte encodings are reorganized so they are
1071 demand-loaded like Encode::XX. Now only ascii is compiled into
1073 ! lib/Encode/Alias.pm
1074 for my $k (keys %hash){ delete $hash{$k}; }
1075 is depreciated; fixed.
1077 0.95 Fri Mar 22 2002
1078 In this update, pod rewrites and alias fixes are the main issues
1079 + lib/Encode/Supported.pod
1080 Describes supported encodings
1082 streamlined compiled-in encodings.
1083 ! lib/Encode/Description.pod -> lib/Encode/Details.pod
1085 + Encode/ibm-125?.ucm
1086 Added from icu distibution with any occurance of
1087 "IBM-125?" to "cp125?". Filenames remain unchanged to pay
1088 some respect to icu staff, however.
1089 + lib/Encode/Alias.pm
1091 Alias difinitions in Encode.pm relocated.
1094 packWARN patch from Paul Marquess via jhi
1095 Message-Id: <20020321010101.O28978@alpha.hut.fi>
1096 Paul added to AUTHORS as a result.
1097 ! t/CJKalias.t -> t/Aliases.t
1098 Renamed. Checks even more aliases and alias overloading
1101 duplicate alias for ujis => euc-jp removed (Encode::JP has one)
1102 gbk => cp936 relocated to CN.pm
1104 Test::More with plans (by jhi)
1106 0.94 Thu Mar 21 2002
1107 + lib/Encode/Description.pod
1108 ! lib/Encode/Encoding.pm
1109 Now the pod in Encode.pm is abridged as programming references.
1110 lib/Encode/Description.pod contains the original, detailed description
1111 and Encode::Encoding explains how to write your own module to
1112 add new encodings. So far, lib/Encode/Description.pod contains
1113 the whole pod once in Encode.pm. This is intentional.
1115 Pod revisions by Anton Tagunov
1116 Message-Id: <517178431.20020320174824@motor.ru>
1118 all occrance of Encode::Tcl::Extended removed including pod
1120 test now checks $encoding->name only; $encoding->{name} are
1121 no longer check to find the canonical name.
1122 ! lib/Encode/JP/JIS.pm
1123 ! lib/Encode/JP/ISO_2022_JP.pm
1124 ->name() added to be more compliant with API
1130 Patch by Autrijus to add aliases to TW and fixes to POD
1131 Message-Id: <20020320090619.GA24774@not.autrijus.org>
1133 SADAHIRO Tomoyuki added as should. My apologies.
1135 0.93 Wed Mar 20 2002
1136 * First release to be uploaded to CPAN. For prehistoric changes,
1137 please see Changes file of perl distibution as well as
1138 perl-unicode@perl.org archive, available at:
1139 http://archive.develooper.com/perl-unicode@perl.org/
1141 Changes Since 0.92 includes;
1146 + Mention to perl-unicode@perl.org added
1148 + Encoding aliases added so you can feed locale names
1149 and MIME Charset="" directly.
1150 - Mention to JISX0212 removed because it's fixed
1153 + Encoding aliases added. Note TW is left untouched because
1154 euc-tw is not implemented in TW but in Encode::HanExtra.
1155 Autrijus, you may fix Encode::HanExtra.
1157 + to test encode aliases added