Unicode properties: support \p{(?:Is)?L&} as an alias for \pL.
[p5sagit/p5-mst-13.2.git] / pod / perltodo.pod
CommitLineData
7711098a 1=head1 NAME
2
3perltodo - Perl TO-DO List
4
5=head1 DESCRIPTION
e50bb9a1 6
722d2a37 7This is a list of wishes for Perl. Send updates to
e50bb9a1 8I<perl5-porters@perl.org>. If you want to work on any of these
9projects, be sure to check the perl5-porters archives for past ideas,
10flames, and propaganda. This will save you time and also prevent you
11from implementing something that Larry has already vetoed. One set
12of archives may be found at:
13
14 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/
15
722d2a37 16=head1 To do during 5.6.x
e50bb9a1 17
722d2a37 18=head2 Support for I/O disciplines
e50bb9a1 19
722d2a37 20C<perlio> provides this, but the interface could be a lot more
21straightforward.
e50bb9a1 22
4b3b956a 23=head2 Autoload bytes.pm
e50bb9a1 24
4b3b956a 25When the lexer sees, for instance, C<bytes::length>, it should
26automatically load the C<bytes> pragma.
27
28=head2 Make "\u{XXXX}" et al work
29
30Danger, Will Robinson! Discussing the semantics of C<"\x{F00}">,
31C<"\xF00"> and C<"\U{F00}"> on P5P I<will> lead to a long and boring
32flamewar.
e50bb9a1 33
c6287c21 34=head2 Create a char *sv_pvprintify(sv, STRLEN *lenp, UV flags)
0562c0e3 35
36For displaying PVs with control characters, embedded nulls, and Unicode.
37This would be useful for printing warnings, or data and regex dumping,
38not_a_number(), and so on.
39
f35392ae 40Requirements: should handle both byte and UTF8 strings. isPRINT()
41characters printed as-is, character less than 256 as \xHH, Unicode
0661e9a4 42characters as \x{HHH}. Don't assume ASCII-like, either, get somebody
43on EBCDIC to test the output.
f35392ae 44
45Possible options, controlled by the flags:
0661e9a4 46- whitespace (other than ' ' of isPRINT()) printed as-is
f35392ae 47- use isPRINT_LC() instead of isPRINT()
48- print control characters like this: "\cA"
49- print control characters like this: "^A"
0661e9a4 50- non-PRINTables printed as '.' instead of \xHH
51- use \OOO instead of \xHH
52- use the C/Perl-metacharacters like \n, \t
f35392ae 53- have a maximum length for the produced string (read it from *lenp)
54- append a "..." to the produced string if the maximum length is exceeded
0661e9a4 55- really fancy: print unicode characters as \N{...}
f35392ae 56
722d2a37 57=head2 Overloadable regex assertions
e50bb9a1 58
722d2a37 59This may or may not be possible with the current regular expression
60engine. The idea is that, for instance, C<\b> needs to be
61algorithmically computed if you're dealing with Thai text. Hence, the
62B<\b> assertion wants to be overloaded by a function.
e50bb9a1 63
776f8809 64=head2 Unicode
65
66=over 4
67
68=item *
e50bb9a1 69
f34dec15 70Allow for long form of the General Category Properties, e.g
71C<\p{IsOpenPunctuation}>, not just the abbreviated form, e.g.
72C<\p{IsPs}>.
73
74=item *
75
76Allow for the metaproperties C<Any> and C<Assigned>, and C<Common>;
77C<Alphabetic>, C<Ideographic>, C<Lowercase>, C<Uppercase> (note that
78are large classes than the general categories C<Lu> and C<Ll>),
79C<White Space>, C<Bidi Control>, C<Join Control>, C<ASCII Hex Digit>,
80C<Hex Digit>, <Noncharacter Code Point>, C<ID Start>, C<ID Continue>,
81C<XID Start>, C<XID Continue>, C<NF*_NO>, C<NF*_MAYBE>.
82
83There are also enumerated properties: C<Decomposition Type>,
84C<Numeric Type>, C<East Asian Width>, C<Line Break>. These
85properties have multiple values: for uniqueness the property
86value should be appended. For example, C<\p{IsAlphabetic}>
87wouldbe the binary property, while C<\p{AlphabeticLineBreak}>
88would mean the enumerated property.
89
90=item *
91
722d2a37 92 Case Mappings? http://www.unicode.org/unicode/reports/tr21/
e50bb9a1 93
6f16a292 94lc(), uc(), lcfirst(), and ucfirst() work only for some of the
95simplest cases, where the mapping goes from a single Unicode character
96to another single Unicode character. See lib/unicore/SpecCase.txt
97(and CaseFold.txt).
ac1256e8 98
776f8809 99=item *
e50bb9a1 100
c6287c21 101They have some tricks Perl doesn't yet implement like character
102class subtraction.
e50bb9a1 103
722d2a37 104 http://www.unicode.org/unicode/reports/tr18/
e50bb9a1 105
776f8809 106=back
107
108See L<perlunicode/UNICODE REGULAR EXPRESSION SUPPORT LEVEL> for what's
f34dec15 109there and what's missing. Almost all of Levels 2 and 3 is missing,
110and as of 5.8.0 not even all of Level 1 is there.
776f8809 111
722d2a37 112=head2 use Thread for iThreads
e50bb9a1 113
722d2a37 114Artur Bergman's C<iThreads> module is a start on this, but needs to
115be more mature.
e50bb9a1 116
dd0afe54 117=head2 make perl_clone optionally clone ops
118
119So that pseudoforking, mod_perl, iThreads and nvi will work properly
120(but not as efficiently) until the regex engine is fixed to be threadsafe.
121
722d2a37 122=head2 Work out exit/die semantics for threads
e50bb9a1 123
722d2a37 124=head2 Typed lexicals for compiler
e50bb9a1 125
722d2a37 126=head2 Compiler workarounds for Win32
e50bb9a1 127
722d2a37 128=head2 AUTOLOADing in the compiler
e50bb9a1 129
722d2a37 130=head2 Fixing comppadlist when compiling
e50bb9a1 131
722d2a37 132=head2 Cleaning up exported namespace
e50bb9a1 133
722d2a37 134=head2 Complete signal handling
e50bb9a1 135
722d2a37 136Add C<PERL_ASYNC_CHECK> to opcodes which loop; replace C<sigsetjmp> with
137C<sigjmp>; check C<wait> for signal safety.
e50bb9a1 138
722d2a37 139=head2 Out-of-source builds
e50bb9a1 140
722d2a37 141This was done for 5.6.0, but needs reworking for 5.7.x
e50bb9a1 142
722d2a37 143=head2 POSIX realtime support
e50bb9a1 144
722d2a37 145POSIX 1003.1 1996 Edition support--realtime stuff: POSIX semaphores,
146message queues, shared memory, realtime clocks, timers, signals (the
147metaconfig units mostly already exist for these)
e50bb9a1 148
722d2a37 149=head2 UNIX98 support
e50bb9a1 150
722d2a37 151Reader-writer locks, realtime/asynchronous IO
e50bb9a1 152
722d2a37 153=head2 IPv6 Support
e50bb9a1 154
722d2a37 155There are non-core modules, such as C<Net::IPv6>, but these will need
156integrating when IPv6 actually starts to really happen. See RFC 2292
157and RFC 2553.
e50bb9a1 158
722d2a37 159=head2 Long double conversion
e50bb9a1 160
722d2a37 161Floating point formatting is still causing some weird test failures.
e50bb9a1 162
722d2a37 163=head2 Locales
e50bb9a1 164
722d2a37 165Locales and Unicode interact with each other in unpleasant ways.
166One possible solution would be to adopt/support ICU:
e50bb9a1 167
722d2a37 168 http://oss.software.ibm.com/developerworks/opensource/icu/project/
e50bb9a1 169
722d2a37 170=head2 Thread-safe regexes
e50bb9a1 171
722d2a37 172The regular expression engine is currently non-threadsafe.
e50bb9a1 173
722d2a37 174=head2 Arithmetic on non-Arabic numerals
e50bb9a1 175
722d2a37 176C<[1234567890]> aren't the only numerals any more.
e50bb9a1 177
722d2a37 178=head2 POSIX Unicode character classes
e50bb9a1 179
722d2a37 180([=a=] for equivalance classes, [.ch.] for collation.)
181These are dependent on Unicode normalization and collation.
e50bb9a1 182
722d2a37 183=head2 Factoring out common suffices/prefices in regexps (trie optimization)
c47ff5f1 184
722d2a37 185Currently, the user has to optimize C<foo|far> and C<foo|goo> into
186C<f(?:oo|ar)> and C<[fg]oo> by hand; this could be done automatically.
e50bb9a1 187
722d2a37 188=head2 Security audit shipped utilities
e50bb9a1 189
722d2a37 190All the code we ship with Perl needs to be sensible about temporary file
191handling, locking, input validation, and so on.
e50bb9a1 192
722d2a37 193=head2 Custom opcodes
e50bb9a1 194
722d2a37 195Have a way to introduce user-defined opcodes without the subroutine call
196overhead of an XSUB; the user should be able to create PP code. Simon
197Cozens has some ideas on this.
e50bb9a1 198
722d2a37 199=head2 spawnvp() on Win32
e50bb9a1 200
722d2a37 201Win32 has problems spawning processes, particularly when the arguments
202to the child process contain spaces, quotes or tab characters.
e50bb9a1 203
722d2a37 204=head2 DLL Versioning
e50bb9a1 205
722d2a37 206Windows needs a way to know what version of a XS or C<libperl> DLL it's
207loading.
e50bb9a1 208
722d2a37 209=head2 Introduce @( and @)
e50bb9a1 210
722d2a37 211C<$(> may return "foo bar baz". Unfortunately, since groups can
212theoretically have spaces in their names, this could be one, two or
213three groups.
e50bb9a1 214
722d2a37 215=head2 Floating point handling
e50bb9a1 216
722d2a37 217C<NaN> and C<inf> support is particularly troublesome.
218(fp_classify(), fp_class(), fp_class_d(), class(), isinf(),
219isfinite(), finite(), isnormal(), unordered(), <ieeefp.h>,
220<fp_class.h> (there are metaconfig units for all these) (I think),
221fp_setmask(), fp_getmask(), fp_setround(), fp_getround()
222(no metaconfig units yet for these). Don't forget finitel(), fp_classl(),
223fp_class_l(), (yes, both do, unfortunately, exist), and unorderedl().)
e50bb9a1 224
722d2a37 225As of Perl 5.6.1 is a Perl macro, Perl_isnan().
e50bb9a1 226
722d2a37 227=head2 IV/UV preservation
e50bb9a1 228
722d2a37 229Nicholas Clark has done a lot of work on this, but work is continuing.
230C<+>, C<-> and C<*> work, but guards need to be in place for C<%>, C</>,
231C<&>, C<oct>, C<hex> and C<pack>.
e50bb9a1 232
722d2a37 233=head2 Replace pod2html with something using Pod::Parser
83df6a1d 234
722d2a37 235The CPAN module C<Malik::Pod::Html> may be a more suitable basis for a
236C<pod2html> convertor; the current one duplicates the functionality
237abstracted in C<Pod::Parser>, which makes updating the POD language
238difficult.
e50bb9a1 239
722d2a37 240=head2 Automate module testing on CPAN
e50bb9a1 241
722d2a37 242When a new Perl is being beta tested, porters have to manually grab
243their favourite CPAN modules and test them - this should be done
244automatically.
e50bb9a1 245
722d2a37 246=head2 sendmsg and recvmsg
83df6a1d 247
722d2a37 248We have all the other BSD socket functions but these. There are
249metaconfig units for these functions which can be added. To avoid these
250being new opcodes, a solution similar to the way C<sockatmark> was added
251would be preferable. (Autoload the C<IO::whatever> module.)
e50bb9a1 252
722d2a37 253=head2 Rewrite perlre documentation
e50bb9a1 254
722d2a37 255The new-style patterns need full documentation, and the whole document
256needs to be a lot clearer.
e50bb9a1 257
722d2a37 258=head2 Convert example code to IO::Handle filehandles
e50bb9a1 259
722d2a37 260=head2 Document Win32 choices
e50bb9a1 261
722d2a37 262=head2 Check new modules
e50bb9a1 263
722d2a37 264=head2 Make roffitall find pods and libs itself
e50bb9a1 265
722d2a37 266Simon Cozens has done some work on this but it needs a rethink.
e50bb9a1 267
722d2a37 268=head1 To do at some point
e50bb9a1 269
722d2a37 270These are ideas that have been regularly tossed around, that most
271people believe should be done maybe during 5.8.x
e50bb9a1 272
722d2a37 273=head2 Remove regular expression recursion
e50bb9a1 274
722d2a37 275Because the regular expression engine is recursive, badly designed
276expressions can lead to lots of recursion filling up the stack. Ilya
277claims that it is easy to convert the engine to being iterative, but
278this has still not yet been done. There may be a regular expression
279engine hit squad meeting at TPC5.
e50bb9a1 280
722d2a37 281=head2 Memory leaks after failed eval
e50bb9a1 282
722d2a37 283Perl will leak memory if you C<eval "hlagh hlagh hlagh hlagh">. This is
284partially because it attempts to build up an op tree for that code and
285doesn't properly free it. The same goes for non-syntactically-correct
286regular expressions. Hugo looked into this, but decided it needed a
287mark-and-sweep GC implementation.
e50bb9a1 288
722d2a37 289Alan notes that: The basic idea was to extend the parser token stack
290(C<YYSTYPE>) to include a type field so we knew what sort of thing each
291element of the stack was. The F<<perly.c> code would then have to be
292postprocessed to record the type of each entry on the stack as it was
293created, and the parser patched so that it could unroll the stack
294properly on error.
e50bb9a1 295
722d2a37 296This is possible to do, but would be pretty messy to implement, as it
297would rely on even more sed hackery in F<perly.fixer>.
e50bb9a1 298
722d2a37 299=head2 pack "(stuff)*"
e50bb9a1 300
722d2a37 301That's to say, C<pack "(sI)40"> would be the same as C<pack "sI"x40>
e50bb9a1 302
722d2a37 303=head2 bitfields in pack
e50bb9a1 304
722d2a37 305=head2 Cross compilation
e50bb9a1 306
722d2a37 307Make Perl buildable with a cross-compiler. This will play havoc with
308Configure, which needs to how how the target system will respond to
309its tests; maybe C<microperl> will be a good starting point here.
310(Indeed, Bart Schuller reports that he compiled up C<microperl> for
311the Agenda PDA and it works fine.) A really big spanner in the works
312is the bootstrapping build process of Perl: if the filesystem the
313target systems sees is not the same what the build host sees, various
314input, output, and (Perl) library files need to be copied back and forth.
e50bb9a1 315
f86a8bc5 316As of 5.8.0 Configure mostly works for cross-compilation
317(used successfully for iPAQ Linux), miniperl gets built,
318but then building DynaLoader (and other extensions) fails
319since MakeMaker knows nothing of cross-compilation.
320(See INSTALL/Cross-compilation for the state of things.)
321
722d2a37 322=head2 Perl preprocessor / macros
e50bb9a1 323
722d2a37 324Source filters help with this, but do not get us all the way. For
325instance, it should be possible to implement the C<??> operator somehow;
326source filters don't (quite) cut it.
e50bb9a1 327
722d2a37 328=head2 Perl lexer in Perl
a45bd81d 329
722d2a37 330Damian Conway is planning to work on this, but it hasn't happened yet.
e50bb9a1 331
722d2a37 332=head2 Using POSIX calls internally
e50bb9a1 333
722d2a37 334When faced with a BSD vs. SySV -style interface to some library or
335system function, perl's roots show in that it typically prefers the BSD
336interface (but falls back to the SysV one). One example is getpgrp().
337Other examples include C<memcpy> vs. C<bcopy>. There are others, mostly in
338F<<pp_sys.c>.
e50bb9a1 339
722d2a37 340Mostly, this item is a suggestion for which way to start a journey into
341an C<#ifdef> forest. It is not primarily a suggestion to eliminate any of
342the C<#ifdef> forests.
e50bb9a1 343
722d2a37 344POSIX calls are perhaps more likely to be portable to unexpected
345architectures. They are also perhaps more likely to be actively
346maintained by a current vendor. They are also perhaps more likely to be
347available in thread-safe versions, if appropriate.
e50bb9a1 348
722d2a37 349=head2 -i rename file when changed
e50bb9a1 350
722d2a37 351It's only necessary to rename a file when inplace editing when the file
352has changed. Detecting a change is perhaps the difficult bit.
e50bb9a1 353
722d2a37 354=head2 All ARGV input should act like E<lt>E<gt>
e50bb9a1 355
2d84a16a 356eg C<read(ARGV, ...)> doesn't currently read across multiple files.
357
722d2a37 358=head2 Support for rerunning debugger
e50bb9a1 359
722d2a37 360There should be a way of restarting the debugger on demand.
e50bb9a1 361
c6287c21 362=head2 Test Suite for the Debugger
363
364The debugger is a complex piece of software and fixing something
365here may inadvertently break something else over there. To tame
366this chaotic behaviour, a test suite is necessary.
367
722d2a37 368=head2 my sub foo { }
c47ff5f1 369
722d2a37 370The basic principle is sound, but there are problems with the semantics
371of self-referential and mutually referential lexical subs: how to
372declare the subs?
c47ff5f1 373
722d2a37 374=head2 One-pass global destruction
c47ff5f1 375
722d2a37 376Sweeping away all the allocated memory in one go is a laudable goal, but
377it's difficult and in most cases, it's easier to let the memory get
378freed by exiting.
e50bb9a1 379
722d2a37 380=head2 Rewrite regexp parser
e50bb9a1 381
722d2a37 382There has been talk recently of rewriting the regular expression parser
383to produce an optree instead of a chain of opcodes; it's unclear whether
384or not this would be a win.
e50bb9a1 385
722d2a37 386=head2 Cache recently used regexps
e50bb9a1 387
722d2a37 388This is to speed up
e50bb9a1 389
722d2a37 390 for my $re (@regexps) {
391 $matched++ if /$re/
392 }
e50bb9a1 393
722d2a37 394C<qr//> already gives us a way of saving compiled regexps, but it should
395be done automatically.
e50bb9a1 396
722d2a37 397=head2 Re-entrant functions
e50bb9a1 398
722d2a37 399Add configure probes for C<_r> forms of system calls and fit them to the
400core. Unfortunately, calling conventions for these functions and not
401standardised.
04c70446 402
722d2a37 403=head2 Cross-compilation support
04c70446 404
722d2a37 405Bart Schuller reports that using C<microperl> and a cross-compiler, he
406got Perl working on the Agenda PDA. However, one cannot build a full
407Perl because Configure needs to get the results for the target platform,
408for the host.
e50bb9a1 409
722d2a37 410=head2 Bit-shifting bitvectors
e50bb9a1 411
722d2a37 412Given:
e50bb9a1 413
722d2a37 414 vec($v, 1000, 1) = 1;
e50bb9a1 415
722d2a37 416One should be able to do
e50bb9a1 417
722d2a37 418 $v <<= 1;
e50bb9a1 419
722d2a37 420and have the 999'th bit set.
e50bb9a1 421
722d2a37 422Currently if you try with shift bitvectors you shift the NV/UV, instead
423of the bits in the PV. Not very logical.
e50bb9a1 424
722d2a37 425=head2 debugger pragma
e50bb9a1 426
722d2a37 427The debugger is implemented in Perl in F<perl5db.pl>; turning it into a
428pragma should be easy, but making it work lexically might be more
429difficult. Fiddling with C<$^P> would be necessary.
e50bb9a1 430
722d2a37 431=head2 use less pragma
e50bb9a1 432
722d2a37 433Identify areas where speed/memory tradeoffs can be made and have a hint
434to switch between them.
e50bb9a1 435
722d2a37 436=head2 switch structures
e50bb9a1 437
722d2a37 438Although we have C<Switch.pm> in core, Larry points to the dormant
439C<nswitch> and C<cswitch> ops in F<pp.c>; using these opcodes would be
440much faster.
e50bb9a1 441
722d2a37 442=head2 Cache eval tree
e50bb9a1 443
722d2a37 444=head2 rcatmaybe
e50bb9a1 445
722d2a37 446=head2 Shrink opcode tables
e50bb9a1 447
722d2a37 448=head2 Optimize away @_
e50bb9a1 449
722d2a37 450Look at the "reification" code in C<av.c>
e50bb9a1 451
722d2a37 452=head2 Prototypes versus indirect objects
e50bb9a1 453
722d2a37 454Currently, indirect object syntax bypasses prototype checks.
e50bb9a1 455
722d2a37 456=head2 Install HMTL
e50bb9a1 457
722d2a37 458HTML versions of the documentation need to be installed by default; a
459call to C<installhtml> from C<installperl> may be all that's necessary.
e50bb9a1 460
722d2a37 461=head2 Prototype method calls
e50bb9a1 462
722d2a37 463=head2 Return context prototype declarations
e50bb9a1 464
722d2a37 465=head2 magic_setisa
e50bb9a1 466
722d2a37 467=head2 Garbage collection
e50bb9a1 468
722d2a37 469There have been persistent mumblings about putting a mark-and-sweep
470garbage detector into Perl; Alan Burlison has some ideas about this.
e50bb9a1 471
722d2a37 472=head2 IO tutorial
e50bb9a1 473
722d2a37 474Mark-Jason Dominus has the beginnings of one of these.
e50bb9a1 475
722d2a37 476=head2 pack/unpack tutorial
e50bb9a1 477
722d2a37 478Simon Cozens has the beginnings of one of these.
e50bb9a1 479
722d2a37 480=head2 Rewrite perldoc
e50bb9a1 481
722d2a37 482There are a few suggestions for what to do with C<perldoc>: maybe a
483full-text search, an index function, locating pages on a particular
484high-level subject, and so on.
e50bb9a1 485
3958b146 486=head2 Install .3p manpages
e50bb9a1 487
3958b146 488This is a bone of contention; we can create C<.3p> manpages for each
722d2a37 489built-in function, but should we install them by default? Tcl does this,
490and it clutters up C<apropos>.
e50bb9a1 491
722d2a37 492=head2 Unicode tutorial
e50bb9a1 493
722d2a37 494Simon Cozens promises to do this before he gets old.
e50bb9a1 495
722d2a37 496=head2 Update POSIX.pm for 1003.1-2
3958b146 497
722d2a37 498=head2 Retargetable installation
e50bb9a1 499
722d2a37 500Allow C<@INC> to be changed after Perl is built.
e50bb9a1 501
722d2a37 502=head2 POSIX emulation on non-POSIX systems
e50bb9a1 503
722d2a37 504Make C<POSIX.pm> behave as POSIXly as possible everywhere, meaning we
505have to implement POSIX equivalents for some functions if necessary.
e50bb9a1 506
722d2a37 507=head2 Rename Win32 headers
e50bb9a1 508
722d2a37 509=head2 Finish off lvalue functions
510
511They don't work in the debugger, and they don't work for list or hash
512slices.
e50bb9a1 513
722d2a37 514=head2 Update sprintf documentation
e50bb9a1 515
722d2a37 516Hugo van der Sanden plans to look at this.
e50bb9a1 517
722d2a37 518=head2 Use fchown/fchmod internally
e50bb9a1 519
722d2a37 520This has been done in places, but needs a thorough code review.
521Also fchdir is available in some platforms.
e50bb9a1 522
722d2a37 523=head1 Vague ideas
e50bb9a1 524
722d2a37 525Ideas which have been discussed, and which may or may not happen.
e50bb9a1 526
722d2a37 527=head2 ref() in list context
e50bb9a1 528
722d2a37 529It's unclear what this should do or how to do it without breaking old
530code.
e50bb9a1 531
f86a8bc5 532=head2 Make tr/// return histogram of characters in list context
e50bb9a1 533
722d2a37 534There is a patch for this, but it may require Unicodification.
e50bb9a1 535
722d2a37 536=head2 Compile to real threaded code
3958b146 537
722d2a37 538=head2 Structured types
3958b146 539
722d2a37 540=head2 Modifiable $1 et al.
e50bb9a1 541
722d2a37 542 ($x = "elephant") =~ /e(ph)/;
543 $1 = "g"; # $x = "elegant"
e50bb9a1 544
722d2a37 545What happens if there are multiple (nested?) brackets? What if the
546string changes between the match and the assignment?
e50bb9a1 547
722d2a37 548=head2 Procedural interfaces for IO::*, etc.
e50bb9a1 549
722d2a37 550Some core modules have been accused of being overly-OO. Adding
551procedural interfaces could demystify them.
e50bb9a1 552
722d2a37 553=head2 RPC modules
e50bb9a1 554
722d2a37 555=head2 Attach/detach debugger from running program
e50bb9a1 556
722d2a37 557With C<gdb>, you can attach the debugger to a running program if you
558pass the process ID. It would be good to do this with the Perl debugger
559on a running Perl program, although I'm not sure how it would be done.
e50bb9a1 560
722d2a37 561=head2 Alternative RE syntax module
e50bb9a1 562
722d2a37 563 use Regex::Newbie;
564 $re = Regex::Newbie->new
565 ->start
566 ->match("foo")
567 ->repeat(Regex::Newbie->class("char"),3)
568 ->end;
569 /$re/;
e50bb9a1 570
722d2a37 571=head2 GUI::Native
e50bb9a1 572
722d2a37 573A non-core module that would use "native" GUI to create graphical
574applications.
e50bb9a1 575
722d2a37 576=head2 foreach(reverse ...)
e50bb9a1 577
722d2a37 578Currently
e50bb9a1 579
722d2a37 580 foreach (reverse @_) { ... }
e50bb9a1 581
722d2a37 582puts C<@_> on the stack, reverses it putting the reversed version on the
583stack, then iterates forwards. Instead, it could be special-cased to put
584C<@_> on the stack then iterate backwards.
e50bb9a1 585
722d2a37 586=head2 Constant function cache
e50bb9a1 587
722d2a37 588=head2 Approximate regular expression matching
e50bb9a1 589
722d2a37 590=head1 Ongoing
e50bb9a1 591
722d2a37 592These items B<always> need doing:
e50bb9a1 593
722d2a37 594=head2 Update guts documentation
e50bb9a1 595
722d2a37 596Simon Cozens tries to do this when possible, and contributions to the
597C<perlapi> documentation is welcome.
e50bb9a1 598
722d2a37 599=head2 Add more tests
e50bb9a1 600
722d2a37 601Michael Schwern will donate $500 to Yet Another Society when all core
602modules have tests.
e50bb9a1 603
722d2a37 604=head2 Update auxiliary tools
e50bb9a1 605
722d2a37 606The code we ship with Perl should look like good Perl 5.
e50bb9a1 607
722d2a37 608=head1 Recently done things
e50bb9a1 609
722d2a37 610These are things which have been on the todo lists in previous releases
611but have recently been completed.
e50bb9a1 612
722d2a37 613=head2 Safe signal handling
e50bb9a1 614
722d2a37 615A new signal model went into 5.7.1 without much fanfare. Operations and
616C<malloc>s are no longer interrupted by signals, which are handled
617between opcodes. This means that C<PERL_ASYNC_CHECK> now actually does
618something. However, there are still a few things that need to be done.
e50bb9a1 619
722d2a37 620=head2 Tie Modules
e50bb9a1 621
722d2a37 622Modules which implement arrays in terms of strings, substrings or files
623can be found on the CPAN.
e50bb9a1 624
722d2a37 625=head2 gettimeofday
e50bb9a1 626
722d2a37 627C<Time::Hires> has been integrated into the core.
e50bb9a1 628
722d2a37 629=head2 setitimer and getimiter
e50bb9a1 630
722d2a37 631Adding C<Time::Hires> got us this too.
e50bb9a1 632
722d2a37 633=head2 Testing __DIE__ hook
634
635Tests have been added.
636
637=head2 CPP equivalent in Perl
e50bb9a1 638
722d2a37 639A C Yardley will probably have done this by the time you can read this.
640This allows for a generalization of the C constant detection used in
641building C<Errno.pm>.
e50bb9a1 642
722d2a37 643=head2 Explicit switch statements
e50bb9a1 644
722d2a37 645C<Switch.pm> has been integrated into the core to give you all manner of
646C<switch...case> semantics.
e50bb9a1 647
722d2a37 648=head2 autocroak
e50bb9a1 649
722d2a37 650This is C<Fatal.pm>.
e50bb9a1 651
722d2a37 652=head2 UTF/EBCDIC
e50bb9a1 653
722d2a37 654Nick Ing-Simmons has made UTF-EBCDIC (UTR13) work with Perl.
e50bb9a1 655
722d2a37 656 EBCDIC? http://www.unicode.org/unicode/reports/tr16/
e50bb9a1 657
722d2a37 658=head2 UTF Regexes
e50bb9a1 659
722d2a37 660Although there are probably some small bugs to be rooted out, Jarkko
661Hietaniemi has made regular expressions polymorphic between bytes and
662characters.
e50bb9a1 663
722d2a37 664=head2 perlcc to produce executable
e50bb9a1 665
722d2a37 666C<perlcc> was recently rewritten, and can now produce standalone
667executables.
e50bb9a1 668
722d2a37 669=head2 END blocks saved in compiled output
e50bb9a1 670
722d2a37 671=head2 Secure temporary file module
e50bb9a1 672
722d2a37 673Tim Jenness' C<File::Temp> is now in core.
e50bb9a1 674
722d2a37 675=head2 Integrate Time::HiRes
e50bb9a1 676
722d2a37 677This module is now part of core.
e50bb9a1 678
722d2a37 679=head2 Turn Cwd into XS
e50bb9a1 680
722d2a37 681Benjamin Sugars has done this.
e50bb9a1 682
722d2a37 683=head2 Mmap for input
e50bb9a1 684
722d2a37 685Nick Ing-Simmons' C<perlio> supports an C<mmap> IO method.
e50bb9a1 686
722d2a37 687=head2 Byte to/from UTF8 and UTF8 to/from local conversion
e50bb9a1 688
722d2a37 689C<Encode> provides this.
e50bb9a1 690
722d2a37 691=head2 Add sockatmark support
e50bb9a1 692
722d2a37 693Added in 5.7.1
e50bb9a1 694
722d2a37 695=head2 Mailing list archives
696
697http://lists.perl.org/, http://archive.develooper.com/
698
699=head2 Bug tracking
700
701Richard Foley has written the bug tracking system at http://bugs.perl.org/
e50bb9a1 702
722d2a37 703=head2 Integrate MacPerl
e50bb9a1 704
722d2a37 705Chris Nandor and Matthias Neeracher have integrated the MacPerl changes
706into 5.6.0.
e50bb9a1 707
722d2a37 708=head2 Web "nerve center" for Perl
e50bb9a1 709
722d2a37 710http://use.perl.org/ is what you're looking for.
e50bb9a1 711
722d2a37 712=head2 Regular expression tutorial
e50bb9a1 713
722d2a37 714C<perlretut>, provided by Mark Kvale.
e50bb9a1 715
722d2a37 716=head2 Debugging Tutorial
e50bb9a1 717
722d2a37 718C<perldebtut>, written by Richard Foley.
e50bb9a1 719
722d2a37 720=head2 Integrate new modules
e50bb9a1 721
722d2a37 722Jarkko has been integrating madly into 5.7.x
e50bb9a1 723
722d2a37 724=head2 Integrate profiler
e50bb9a1 725
722d2a37 726C<Devel::DProf> is now a core module.
e50bb9a1 727
722d2a37 728=head2 Y2K error detection
e50bb9a1 729
722d2a37 730There's a configure option to detect unsafe concatenation with "19", and
731a CPAN module. (C<D'oh::Year>)
e50bb9a1 732
722d2a37 733=head2 Regular expression debugger
e50bb9a1 734
722d2a37 735While not part of core, Mark-Jason Dominus has written C<Rx> and has
736also come up with a generalised strategy for regular expression
737debugging.
e50bb9a1 738
722d2a37 739=head2 POD checker
e50bb9a1 740
722d2a37 741That's, uh, F<podchecker>
e50bb9a1 742
722d2a37 743=head2 "Dynamic" lexicals
e50bb9a1 744
722d2a37 745=head2 Cache precompiled modules
e50bb9a1 746
722d2a37 747=head1 Deprecated Wishes
e50bb9a1 748
722d2a37 749These are items which used to be in the todo file, but have been
750deprecated for some reason.
e50bb9a1 751
722d2a37 752=head2 Loop control on do{}
e50bb9a1 753
722d2a37 754This would break old code; use C<do{{ }}> instead.
e50bb9a1 755
722d2a37 756=head2 Lexically scoped typeglobs
e50bb9a1 757
722d2a37 758Not needed now we have lexical IO handles.
e50bb9a1 759
722d2a37 760=head2 format BOTTOM
3958b146 761
722d2a37 762=head2 report HANDLE
e50bb9a1 763
722d2a37 764Damian Conway's text formatting modules seem to be the Way To Go.
e50bb9a1 765
722d2a37 766=head2 Generalised want()/caller())
3958b146 767
722d2a37 768=head2 Named prototypes
e50bb9a1 769
722d2a37 770These both seem to be delayed until Perl 6.
e50bb9a1 771
722d2a37 772=head2 Built-in globbing
e50bb9a1 773
722d2a37 774The C<File::Glob> module has been used to replace the C<glob> function.
e50bb9a1 775
722d2a37 776=head2 Regression tests for suidperl
e50bb9a1 777
722d2a37 778C<suidperl> is deprecated in favour of common sense.
e50bb9a1 779
722d2a37 780=head2 Cached hash values
e50bb9a1 781
722d2a37 782We have shared hash keys, which perform the same job.
e50bb9a1 783
722d2a37 784=head2 Add compression modules
e50bb9a1 785
722d2a37 786The compression modules are a little heavy; meanwhile, Nick Clark is
787working on experimental pragmata to do transparent decompression on
788input.
e50bb9a1 789
722d2a37 790=head2 Reorganise documentation into tutorials/references
e50bb9a1 791
722d2a37 792Could not get consensus on P5P about this.
e50bb9a1 793
722d2a37 794=head2 Remove distinction between functions and operators
795
796Caution: highly flammable.
797
798=head2 Make XS easier to use
e50bb9a1 799
722d2a37 800Use C<Inline> instead, or SWIG.
e50bb9a1 801
722d2a37 802=head2 Make embedding easier to use
e50bb9a1 803
722d2a37 804Use C<Inline::CPR>.
e50bb9a1 805
722d2a37 806=head2 man for perl
04c70446 807
722d2a37 808See the Perl Power Tools. (http://language.perl.com/ppt/)
04c70446 809
722d2a37 810=head2 my $Package::variable
04c70446 811
722d2a37 812Use C<our> instead.
04c70446 813
722d2a37 814=head2 "or" tests defined, not truth
04c70446 815
722d2a37 816Suggesting this on P5P B<will> cause a boring and interminable flamewar.
04c70446 817
722d2a37 818=head2 "class"-based lexicals
04c70446 819
cbb3fa72 820Use flyweight objects, secure hashes or, dare I say it, pseudo-hashes instead.
f86a8bc5 821(Or whatever will replace pseudohashes in 5.10.)
04c70446 822
722d2a37 823=head2 byteperl
04c70446 824
722d2a37 825C<ByteLoader> covers this.
04c70446 826
722d2a37 827=head2 Lazy evaluation / tail recursion removal
04c70446 828
f86a8bc5 829C<List::Util> gives first() (a short-circuiting grep); tail recursion
830removal is done manually, with C<goto &whoami;>. (However, MJD has
831found that C<goto &whoami> introduces a performance penalty, so maybe
832there should be a way to do this after all: C<sub foo {START: ... goto
833START;> is better.)
0562c0e3 834
835=head2 Make "use utf8" the default
836
f86a8bc5 837Because of backward compatibility this is difficult: scripts could not
838contain B<any legacy eight-bit data> (like Latin-1) anymore, even in
839string literals or pod. Also would introduce a measurable slowdown of
840at least few percentages since all regular expression operations would
841be done in full UTF-8. But if you want to try this, add
842-DUSE_UTF8_SCRIPTS to your compilation flags.
843
3298bd4d 844=head2 Unicode collation and normalization
845
846The Unicode::Collate and Unicode::Normalize modules
847by SADAHIRO Tomoyuki have been included since 5.8.0.
848
849 Collation? http://www.unicode.org/unicode/reports/tr10/
850 Normalization? http://www.unicode.org/unicode/reports/tr15/
0562c0e3 851
825b3abc 852=head2 Create debugging macros
853
854Debugging macros (like printsv, dump) can make debugging perl inside a
855C debugger much easier. A good set for gdb comes with mod_perl.
856Something similar should be distributed with perl.
857
858The proper way to do this is to use and extend Devel::DebugInit.
859Devel::DebugInit also needs to be extended to support threads.
860
861See p5p archives for late May/early June 2001 for a recent discussion
862on this topic.
863
3298bd4d 864=cut