Fix fd leak on Via(bogus).
[p5sagit/p5-mst-13.2.git] / pod / perltodo.pod
CommitLineData
7711098a 1=head1 NAME
2
3perltodo - Perl TO-DO List
4
5=head1 DESCRIPTION
e50bb9a1 6
722d2a37 7This is a list of wishes for Perl. Send updates to
e50bb9a1 8I<perl5-porters@perl.org>. If you want to work on any of these
9projects, be sure to check the perl5-porters archives for past ideas,
10flames, and propaganda. This will save you time and also prevent you
11from implementing something that Larry has already vetoed. One set
12of archives may be found at:
13
14 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/
15
722d2a37 16=head1 To do during 5.6.x
e50bb9a1 17
722d2a37 18=head2 Support for I/O disciplines
e50bb9a1 19
722d2a37 20C<perlio> provides this, but the interface could be a lot more
21straightforward.
e50bb9a1 22
4b3b956a 23=head2 Autoload bytes.pm
e50bb9a1 24
4b3b956a 25When the lexer sees, for instance, C<bytes::length>, it should
26automatically load the C<bytes> pragma.
27
28=head2 Make "\u{XXXX}" et al work
29
30Danger, Will Robinson! Discussing the semantics of C<"\x{F00}">,
31C<"\xF00"> and C<"\U{F00}"> on P5P I<will> lead to a long and boring
32flamewar.
e50bb9a1 33
c6287c21 34=head2 Create a char *sv_pvprintify(sv, STRLEN *lenp, UV flags)
0562c0e3 35
36For displaying PVs with control characters, embedded nulls, and Unicode.
37This would be useful for printing warnings, or data and regex dumping,
38not_a_number(), and so on.
39
f35392ae 40Requirements: should handle both byte and UTF8 strings. isPRINT()
41characters printed as-is, character less than 256 as \xHH, Unicode
0661e9a4 42characters as \x{HHH}. Don't assume ASCII-like, either, get somebody
43on EBCDIC to test the output.
f35392ae 44
45Possible options, controlled by the flags:
0661e9a4 46- whitespace (other than ' ' of isPRINT()) printed as-is
f35392ae 47- use isPRINT_LC() instead of isPRINT()
48- print control characters like this: "\cA"
49- print control characters like this: "^A"
0661e9a4 50- non-PRINTables printed as '.' instead of \xHH
51- use \OOO instead of \xHH
52- use the C/Perl-metacharacters like \n, \t
f35392ae 53- have a maximum length for the produced string (read it from *lenp)
54- append a "..." to the produced string if the maximum length is exceeded
0661e9a4 55- really fancy: print unicode characters as \N{...}
f35392ae 56
1626a787 57NOTE: pv_display(), pv_uni_display(), sv_uni_display() are already
58doing something like the above.
c5fc23ff 59
722d2a37 60=head2 Overloadable regex assertions
e50bb9a1 61
722d2a37 62This may or may not be possible with the current regular expression
63engine. The idea is that, for instance, C<\b> needs to be
64algorithmically computed if you're dealing with Thai text. Hence, the
65B<\b> assertion wants to be overloaded by a function.
e50bb9a1 66
776f8809 67=head2 Unicode
68
69=over 4
70
71=item *
e50bb9a1 72
f34dec15 73Allow for long form of the General Category Properties, e.g
74C<\p{IsOpenPunctuation}>, not just the abbreviated form, e.g.
75C<\p{IsPs}>.
76
77=item *
78
1ac13f9a 79Allow for the metaproperties: C<XID Start>, C<XID Continue>,
80C<NF*_NO>, C<NF*_MAYBE> (require the DerivedCoreProperties and
81DerviceNormalizationProperties files).
f34dec15 82
71d929cb 83There are also multiple value properties still unimplemented:
84C<Numeric Type>, C<East Asian Width>.
f34dec15 85
86=item *
87
722d2a37 88 Case Mappings? http://www.unicode.org/unicode/reports/tr21/
e50bb9a1 89
6f16a292 90lc(), uc(), lcfirst(), and ucfirst() work only for some of the
91simplest cases, where the mapping goes from a single Unicode character
92to another single Unicode character. See lib/unicore/SpecCase.txt
93(and CaseFold.txt).
ac1256e8 94
776f8809 95=item *
e50bb9a1 96
8d3e8850 97UTF-8 identifier names should probably be canonicalized: NFC?
e50bb9a1 98
20eafb1c 99=item *
100
101UTF-8 in package names and sub names? The first is problematic
8d3e8850 102because of the mapping to pathnames, ditto for the second one if
20eafb1c 103one does autosplitting, for example.
e50bb9a1 104
776f8809 105=back
106
107See L<perlunicode/UNICODE REGULAR EXPRESSION SUPPORT LEVEL> for what's
f34dec15 108there and what's missing. Almost all of Levels 2 and 3 is missing,
109and as of 5.8.0 not even all of Level 1 is there.
8d3e8850 110They have some tricks Perl doesn't yet implement, such as character
20eafb1c 111class subtraction.
112
113 http://www.unicode.org/unicode/reports/tr18/
776f8809 114
722d2a37 115=head2 use Thread for iThreads
e50bb9a1 116
722d2a37 117Artur Bergman's C<iThreads> module is a start on this, but needs to
118be more mature.
e50bb9a1 119
dd0afe54 120=head2 make perl_clone optionally clone ops
121
122So that pseudoforking, mod_perl, iThreads and nvi will work properly
123(but not as efficiently) until the regex engine is fixed to be threadsafe.
124
722d2a37 125=head2 Work out exit/die semantics for threads
e50bb9a1 126
722d2a37 127=head2 Typed lexicals for compiler
e50bb9a1 128
722d2a37 129=head2 Compiler workarounds for Win32
e50bb9a1 130
722d2a37 131=head2 AUTOLOADing in the compiler
e50bb9a1 132
722d2a37 133=head2 Fixing comppadlist when compiling
e50bb9a1 134
722d2a37 135=head2 Cleaning up exported namespace
e50bb9a1 136
722d2a37 137=head2 Complete signal handling
e50bb9a1 138
722d2a37 139Add C<PERL_ASYNC_CHECK> to opcodes which loop; replace C<sigsetjmp> with
140C<sigjmp>; check C<wait> for signal safety.
e50bb9a1 141
722d2a37 142=head2 Out-of-source builds
e50bb9a1 143
722d2a37 144This was done for 5.6.0, but needs reworking for 5.7.x
e50bb9a1 145
722d2a37 146=head2 POSIX realtime support
e50bb9a1 147
722d2a37 148POSIX 1003.1 1996 Edition support--realtime stuff: POSIX semaphores,
149message queues, shared memory, realtime clocks, timers, signals (the
150metaconfig units mostly already exist for these)
e50bb9a1 151
722d2a37 152=head2 UNIX98 support
e50bb9a1 153
722d2a37 154Reader-writer locks, realtime/asynchronous IO
e50bb9a1 155
722d2a37 156=head2 IPv6 Support
e50bb9a1 157
fe854a6f 158There are non-core modules, such as C<Socket6>, but these will need
722d2a37 159integrating when IPv6 actually starts to really happen. See RFC 2292
160and RFC 2553.
e50bb9a1 161
722d2a37 162=head2 Long double conversion
e50bb9a1 163
722d2a37 164Floating point formatting is still causing some weird test failures.
e50bb9a1 165
722d2a37 166=head2 Locales
e50bb9a1 167
722d2a37 168Locales and Unicode interact with each other in unpleasant ways.
169One possible solution would be to adopt/support ICU:
e50bb9a1 170
722d2a37 171 http://oss.software.ibm.com/developerworks/opensource/icu/project/
e50bb9a1 172
722d2a37 173=head2 Thread-safe regexes
e50bb9a1 174
722d2a37 175The regular expression engine is currently non-threadsafe.
e50bb9a1 176
722d2a37 177=head2 Arithmetic on non-Arabic numerals
e50bb9a1 178
722d2a37 179C<[1234567890]> aren't the only numerals any more.
e50bb9a1 180
722d2a37 181=head2 POSIX Unicode character classes
e50bb9a1 182
210b36aa 183(C<[=a=]> for equivalance classes, C<[.ch.]> for collation.)
722d2a37 184These are dependent on Unicode normalization and collation.
e50bb9a1 185
722d2a37 186=head2 Factoring out common suffices/prefices in regexps (trie optimization)
c47ff5f1 187
722d2a37 188Currently, the user has to optimize C<foo|far> and C<foo|goo> into
189C<f(?:oo|ar)> and C<[fg]oo> by hand; this could be done automatically.
e50bb9a1 190
722d2a37 191=head2 Security audit shipped utilities
e50bb9a1 192
722d2a37 193All the code we ship with Perl needs to be sensible about temporary file
194handling, locking, input validation, and so on.
e50bb9a1 195
c8d2171d 196=head2 Sort out the uid-setting mess
197
198Currently there are several problems with the setting of uids ($<, $>
199for the real and effective uids). Firstly, what exactly setuid() call
200gets invoked in which platform is simply a big mess that needs to be
201untangled. Secondly, the effects are apparently not standard across
202platforms, (if you first set $< and then $>, or vice versa, being
666f95b9 203uid == euid == zero, or just euid == zero, or as a normal user, what are
c8d2171d 204the results?). The test suite not (usually) being run as root means
205that these things do not get much testing. Thirdly, there's quite
206often a third uid called saved uid, and Perl has no knowledge of that
207feature in any way. (If one has the saved uid of zero, one can get
208back any real and effective uids.) As an example, to change also the
209saved uid, one needs to set the real and effective uids B<twice>-- in
210most systems, that is: in HP-UX that doesn't seem to work.
666f95b9 211
722d2a37 212=head2 Custom opcodes
e50bb9a1 213
722d2a37 214Have a way to introduce user-defined opcodes without the subroutine call
215overhead of an XSUB; the user should be able to create PP code. Simon
216Cozens has some ideas on this.
e50bb9a1 217
722d2a37 218=head2 DLL Versioning
e50bb9a1 219
d1be9408 220Windows needs a way to know what version of an XS or C<libperl> DLL it's
722d2a37 221loading.
e50bb9a1 222
722d2a37 223=head2 Introduce @( and @)
e50bb9a1 224
722d2a37 225C<$(> may return "foo bar baz". Unfortunately, since groups can
226theoretically have spaces in their names, this could be one, two or
227three groups.
e50bb9a1 228
722d2a37 229=head2 Floating point handling
e50bb9a1 230
722d2a37 231C<NaN> and C<inf> support is particularly troublesome.
232(fp_classify(), fp_class(), fp_class_d(), class(), isinf(),
233isfinite(), finite(), isnormal(), unordered(), <ieeefp.h>,
234<fp_class.h> (there are metaconfig units for all these) (I think),
235fp_setmask(), fp_getmask(), fp_setround(), fp_getround()
236(no metaconfig units yet for these). Don't forget finitel(), fp_classl(),
237fp_class_l(), (yes, both do, unfortunately, exist), and unorderedl().)
e50bb9a1 238
210b36aa 239As of Perl 5.6.1, there is a Perl macro, Perl_isnan().
e50bb9a1 240
722d2a37 241=head2 IV/UV preservation
e50bb9a1 242
722d2a37 243Nicholas Clark has done a lot of work on this, but work is continuing.
244C<+>, C<-> and C<*> work, but guards need to be in place for C<%>, C</>,
245C<&>, C<oct>, C<hex> and C<pack>.
e50bb9a1 246
722d2a37 247=head2 Replace pod2html with something using Pod::Parser
83df6a1d 248
fe854a6f 249The CPAN module C<Marek::Pod::Html> may be a more suitable basis for a
722d2a37 250C<pod2html> convertor; the current one duplicates the functionality
251abstracted in C<Pod::Parser>, which makes updating the POD language
252difficult.
e50bb9a1 253
722d2a37 254=head2 Automate module testing on CPAN
e50bb9a1 255
722d2a37 256When a new Perl is being beta tested, porters have to manually grab
257their favourite CPAN modules and test them - this should be done
258automatically.
e50bb9a1 259
722d2a37 260=head2 sendmsg and recvmsg
83df6a1d 261
722d2a37 262We have all the other BSD socket functions but these. There are
263metaconfig units for these functions which can be added. To avoid these
264being new opcodes, a solution similar to the way C<sockatmark> was added
265would be preferable. (Autoload the C<IO::whatever> module.)
e50bb9a1 266
722d2a37 267=head2 Rewrite perlre documentation
e50bb9a1 268
722d2a37 269The new-style patterns need full documentation, and the whole document
270needs to be a lot clearer.
e50bb9a1 271
722d2a37 272=head2 Convert example code to IO::Handle filehandles
e50bb9a1 273
722d2a37 274=head2 Document Win32 choices
e50bb9a1 275
722d2a37 276=head2 Check new modules
e50bb9a1 277
722d2a37 278=head2 Make roffitall find pods and libs itself
e50bb9a1 279
722d2a37 280Simon Cozens has done some work on this but it needs a rethink.
e50bb9a1 281
722d2a37 282=head1 To do at some point
e50bb9a1 283
722d2a37 284These are ideas that have been regularly tossed around, that most
285people believe should be done maybe during 5.8.x
e50bb9a1 286
722d2a37 287=head2 Remove regular expression recursion
e50bb9a1 288
722d2a37 289Because the regular expression engine is recursive, badly designed
290expressions can lead to lots of recursion filling up the stack. Ilya
291claims that it is easy to convert the engine to being iterative, but
292this has still not yet been done. There may be a regular expression
293engine hit squad meeting at TPC5.
e50bb9a1 294
722d2a37 295=head2 Memory leaks after failed eval
e50bb9a1 296
722d2a37 297Perl will leak memory if you C<eval "hlagh hlagh hlagh hlagh">. This is
298partially because it attempts to build up an op tree for that code and
299doesn't properly free it. The same goes for non-syntactically-correct
300regular expressions. Hugo looked into this, but decided it needed a
301mark-and-sweep GC implementation.
e50bb9a1 302
722d2a37 303Alan notes that: The basic idea was to extend the parser token stack
304(C<YYSTYPE>) to include a type field so we knew what sort of thing each
210b36aa 305element of the stack was. The F<perly.c> code would then have to be
722d2a37 306postprocessed to record the type of each entry on the stack as it was
307created, and the parser patched so that it could unroll the stack
308properly on error.
e50bb9a1 309
722d2a37 310This is possible to do, but would be pretty messy to implement, as it
311would rely on even more sed hackery in F<perly.fixer>.
e50bb9a1 312
722d2a37 313=head2 bitfields in pack
e50bb9a1 314
722d2a37 315=head2 Cross compilation
e50bb9a1 316
722d2a37 317Make Perl buildable with a cross-compiler. This will play havoc with
da75cd15 318Configure, which needs to know how the target system will respond to
722d2a37 319its tests; maybe C<microperl> will be a good starting point here.
320(Indeed, Bart Schuller reports that he compiled up C<microperl> for
321the Agenda PDA and it works fine.) A really big spanner in the works
322is the bootstrapping build process of Perl: if the filesystem the
323target systems sees is not the same what the build host sees, various
324input, output, and (Perl) library files need to be copied back and forth.
e50bb9a1 325
f86a8bc5 326As of 5.8.0 Configure mostly works for cross-compilation
327(used successfully for iPAQ Linux), miniperl gets built,
328but then building DynaLoader (and other extensions) fails
329since MakeMaker knows nothing of cross-compilation.
330(See INSTALL/Cross-compilation for the state of things.)
331
722d2a37 332=head2 Perl preprocessor / macros
e50bb9a1 333
722d2a37 334Source filters help with this, but do not get us all the way. For
335instance, it should be possible to implement the C<??> operator somehow;
336source filters don't (quite) cut it.
e50bb9a1 337
722d2a37 338=head2 Perl lexer in Perl
a45bd81d 339
722d2a37 340Damian Conway is planning to work on this, but it hasn't happened yet.
e50bb9a1 341
722d2a37 342=head2 Using POSIX calls internally
e50bb9a1 343
210b36aa 344When faced with a BSD vs. SysV -style interface to some library or
722d2a37 345system function, perl's roots show in that it typically prefers the BSD
346interface (but falls back to the SysV one). One example is getpgrp().
347Other examples include C<memcpy> vs. C<bcopy>. There are others, mostly in
210b36aa 348F<pp_sys.c>.
e50bb9a1 349
722d2a37 350Mostly, this item is a suggestion for which way to start a journey into
351an C<#ifdef> forest. It is not primarily a suggestion to eliminate any of
352the C<#ifdef> forests.
e50bb9a1 353
722d2a37 354POSIX calls are perhaps more likely to be portable to unexpected
355architectures. They are also perhaps more likely to be actively
356maintained by a current vendor. They are also perhaps more likely to be
357available in thread-safe versions, if appropriate.
e50bb9a1 358
722d2a37 359=head2 -i rename file when changed
e50bb9a1 360
722d2a37 361It's only necessary to rename a file when inplace editing when the file
362has changed. Detecting a change is perhaps the difficult bit.
e50bb9a1 363
722d2a37 364=head2 All ARGV input should act like E<lt>E<gt>
e50bb9a1 365
2d84a16a 366eg C<read(ARGV, ...)> doesn't currently read across multiple files.
367
722d2a37 368=head2 Support for rerunning debugger
e50bb9a1 369
722d2a37 370There should be a way of restarting the debugger on demand.
e50bb9a1 371
c6287c21 372=head2 Test Suite for the Debugger
373
374The debugger is a complex piece of software and fixing something
375here may inadvertently break something else over there. To tame
376this chaotic behaviour, a test suite is necessary.
377
722d2a37 378=head2 my sub foo { }
c47ff5f1 379
722d2a37 380The basic principle is sound, but there are problems with the semantics
381of self-referential and mutually referential lexical subs: how to
382declare the subs?
c47ff5f1 383
722d2a37 384=head2 One-pass global destruction
c47ff5f1 385
722d2a37 386Sweeping away all the allocated memory in one go is a laudable goal, but
387it's difficult and in most cases, it's easier to let the memory get
388freed by exiting.
e50bb9a1 389
722d2a37 390=head2 Rewrite regexp parser
e50bb9a1 391
722d2a37 392There has been talk recently of rewriting the regular expression parser
393to produce an optree instead of a chain of opcodes; it's unclear whether
394or not this would be a win.
e50bb9a1 395
722d2a37 396=head2 Cache recently used regexps
e50bb9a1 397
722d2a37 398This is to speed up
e50bb9a1 399
722d2a37 400 for my $re (@regexps) {
401 $matched++ if /$re/
402 }
e50bb9a1 403
722d2a37 404C<qr//> already gives us a way of saving compiled regexps, but it should
405be done automatically.
e50bb9a1 406
722d2a37 407=head2 Re-entrant functions
e50bb9a1 408
722d2a37 409Add configure probes for C<_r> forms of system calls and fit them to the
410core. Unfortunately, calling conventions for these functions and not
411standardised.
04c70446 412
722d2a37 413=head2 Cross-compilation support
04c70446 414
722d2a37 415Bart Schuller reports that using C<microperl> and a cross-compiler, he
416got Perl working on the Agenda PDA. However, one cannot build a full
417Perl because Configure needs to get the results for the target platform,
418for the host.
e50bb9a1 419
722d2a37 420=head2 Bit-shifting bitvectors
e50bb9a1 421
722d2a37 422Given:
e50bb9a1 423
722d2a37 424 vec($v, 1000, 1) = 1;
e50bb9a1 425
722d2a37 426One should be able to do
e50bb9a1 427
722d2a37 428 $v <<= 1;
e50bb9a1 429
722d2a37 430and have the 999'th bit set.
e50bb9a1 431
722d2a37 432Currently if you try with shift bitvectors you shift the NV/UV, instead
433of the bits in the PV. Not very logical.
e50bb9a1 434
722d2a37 435=head2 debugger pragma
e50bb9a1 436
722d2a37 437The debugger is implemented in Perl in F<perl5db.pl>; turning it into a
438pragma should be easy, but making it work lexically might be more
439difficult. Fiddling with C<$^P> would be necessary.
e50bb9a1 440
722d2a37 441=head2 use less pragma
e50bb9a1 442
722d2a37 443Identify areas where speed/memory tradeoffs can be made and have a hint
444to switch between them.
e50bb9a1 445
722d2a37 446=head2 switch structures
e50bb9a1 447
722d2a37 448Although we have C<Switch.pm> in core, Larry points to the dormant
449C<nswitch> and C<cswitch> ops in F<pp.c>; using these opcodes would be
450much faster.
e50bb9a1 451
722d2a37 452=head2 Cache eval tree
e50bb9a1 453
722d2a37 454=head2 rcatmaybe
e50bb9a1 455
722d2a37 456=head2 Shrink opcode tables
e50bb9a1 457
722d2a37 458=head2 Optimize away @_
e50bb9a1 459
722d2a37 460Look at the "reification" code in C<av.c>
e50bb9a1 461
722d2a37 462=head2 Prototypes versus indirect objects
e50bb9a1 463
722d2a37 464Currently, indirect object syntax bypasses prototype checks.
e50bb9a1 465
210b36aa 466=head2 Install HTML
e50bb9a1 467
722d2a37 468HTML versions of the documentation need to be installed by default; a
469call to C<installhtml> from C<installperl> may be all that's necessary.
e50bb9a1 470
722d2a37 471=head2 Prototype method calls
e50bb9a1 472
722d2a37 473=head2 Return context prototype declarations
e50bb9a1 474
722d2a37 475=head2 magic_setisa
e50bb9a1 476
722d2a37 477=head2 Garbage collection
e50bb9a1 478
722d2a37 479There have been persistent mumblings about putting a mark-and-sweep
480garbage detector into Perl; Alan Burlison has some ideas about this.
e50bb9a1 481
722d2a37 482=head2 IO tutorial
e50bb9a1 483
722d2a37 484Mark-Jason Dominus has the beginnings of one of these.
e50bb9a1 485
722d2a37 486=head2 Rewrite perldoc
e50bb9a1 487
722d2a37 488There are a few suggestions for what to do with C<perldoc>: maybe a
489full-text search, an index function, locating pages on a particular
490high-level subject, and so on.
e50bb9a1 491
3958b146 492=head2 Install .3p manpages
e50bb9a1 493
3958b146 494This is a bone of contention; we can create C<.3p> manpages for each
722d2a37 495built-in function, but should we install them by default? Tcl does this,
496and it clutters up C<apropos>.
e50bb9a1 497
722d2a37 498=head2 Unicode tutorial
e50bb9a1 499
722d2a37 500Simon Cozens promises to do this before he gets old.
e50bb9a1 501
722d2a37 502=head2 Update POSIX.pm for 1003.1-2
3958b146 503
722d2a37 504=head2 Retargetable installation
e50bb9a1 505
722d2a37 506Allow C<@INC> to be changed after Perl is built.
e50bb9a1 507
722d2a37 508=head2 POSIX emulation on non-POSIX systems
e50bb9a1 509
722d2a37 510Make C<POSIX.pm> behave as POSIXly as possible everywhere, meaning we
511have to implement POSIX equivalents for some functions if necessary.
e50bb9a1 512
722d2a37 513=head2 Rename Win32 headers
e50bb9a1 514
722d2a37 515=head2 Finish off lvalue functions
516
517They don't work in the debugger, and they don't work for list or hash
518slices.
e50bb9a1 519
722d2a37 520=head2 Update sprintf documentation
e50bb9a1 521
722d2a37 522Hugo van der Sanden plans to look at this.
e50bb9a1 523
722d2a37 524=head2 Use fchown/fchmod internally
e50bb9a1 525
722d2a37 526This has been done in places, but needs a thorough code review.
527Also fchdir is available in some platforms.
e50bb9a1 528
d45541b3 529=head2 Make v-strings overloaded objects
c5fc23ff 530
d45541b3 531Instead of having to guess whether a string is a v-string and thus
532needs to be displayed with %vd, make v-strings (readonly) objects
533(class "vstring"?) with a stringify overload.
c5fc23ff 534
49293501 535=head2 Allow restricted hash assignment
536
537Currently you're not allowed to assign to a restricted hash at all,
538even with the same keys.
539
540 %restricted = (foo => 42); # error
541
542This should be allowed if the new keyset is a subset of the old
543keyset. May require more extra code than we'd like in pp_aassign.
544
5387ccf1 545=head2 Should overload be inheritable?
546
547Should overload be 'contagious' through @ISA so that derived classes
548would inherit their base classes' overload definitions? What to do
549in case of overload conflicts?
550
722d2a37 551=head1 Vague ideas
e50bb9a1 552
722d2a37 553Ideas which have been discussed, and which may or may not happen.
e50bb9a1 554
722d2a37 555=head2 ref() in list context
e50bb9a1 556
722d2a37 557It's unclear what this should do or how to do it without breaking old
558code.
e50bb9a1 559
f86a8bc5 560=head2 Make tr/// return histogram of characters in list context
e50bb9a1 561
722d2a37 562There is a patch for this, but it may require Unicodification.
e50bb9a1 563
722d2a37 564=head2 Compile to real threaded code
3958b146 565
722d2a37 566=head2 Structured types
3958b146 567
722d2a37 568=head2 Modifiable $1 et al.
e50bb9a1 569
722d2a37 570 ($x = "elephant") =~ /e(ph)/;
571 $1 = "g"; # $x = "elegant"
e50bb9a1 572
722d2a37 573What happens if there are multiple (nested?) brackets? What if the
574string changes between the match and the assignment?
e50bb9a1 575
722d2a37 576=head2 Procedural interfaces for IO::*, etc.
e50bb9a1 577
722d2a37 578Some core modules have been accused of being overly-OO. Adding
579procedural interfaces could demystify them.
e50bb9a1 580
722d2a37 581=head2 RPC modules
e50bb9a1 582
722d2a37 583=head2 Attach/detach debugger from running program
e50bb9a1 584
722d2a37 585With C<gdb>, you can attach the debugger to a running program if you
586pass the process ID. It would be good to do this with the Perl debugger
587on a running Perl program, although I'm not sure how it would be done.
e50bb9a1 588
722d2a37 589=head2 GUI::Native
e50bb9a1 590
722d2a37 591A non-core module that would use "native" GUI to create graphical
592applications.
e50bb9a1 593
722d2a37 594=head2 foreach(reverse ...)
e50bb9a1 595
722d2a37 596Currently
e50bb9a1 597
722d2a37 598 foreach (reverse @_) { ... }
e50bb9a1 599
722d2a37 600puts C<@_> on the stack, reverses it putting the reversed version on the
601stack, then iterates forwards. Instead, it could be special-cased to put
602C<@_> on the stack then iterate backwards.
e50bb9a1 603
722d2a37 604=head2 Constant function cache
e50bb9a1 605
722d2a37 606=head2 Approximate regular expression matching
e50bb9a1 607
722d2a37 608=head1 Ongoing
e50bb9a1 609
722d2a37 610These items B<always> need doing:
e50bb9a1 611
722d2a37 612=head2 Update guts documentation
e50bb9a1 613
722d2a37 614Simon Cozens tries to do this when possible, and contributions to the
615C<perlapi> documentation is welcome.
e50bb9a1 616
722d2a37 617=head2 Add more tests
e50bb9a1 618
722d2a37 619Michael Schwern will donate $500 to Yet Another Society when all core
620modules have tests.
e50bb9a1 621
722d2a37 622=head2 Update auxiliary tools
e50bb9a1 623
722d2a37 624The code we ship with Perl should look like good Perl 5.
e50bb9a1 625
1e278fd9 626=head2 Create debugging macros
627
628Debugging macros (like printsv, dump) can make debugging perl inside a
629C debugger much easier. A good set for gdb comes with mod_perl.
630Something similar should be distributed with perl.
631
632The proper way to do this is to use and extend Devel::DebugInit.
633Devel::DebugInit also needs to be extended to support threads.
634
635See p5p archives for late May/early June 2001 for a recent discussion
636on this topic.
637
638=head2 truncate to the people
639
640One can emulate ftruncate() using F_FREESP and F_CHSIZ fcntls
641(see the UNIX FAQ for details). This needs to go somewhere near
642pp_sys.c:pp_truncate().
643
644One can emulate truncate() easily if one has ftruncate().
645This emulation should also go near pp_sys.pp_truncate().
646
647=head2 Unicode in Filenames
648
649chdir, chmod, chown, chroot, exec, glob, link, lstat, mkdir, open, qx,
650readdir, readlink, rename, rmdir, stat, symlink, sysopen, system,
651truncate, unlink, utime. All these could potentially accept Unicode
652filenames either as input or output (and in the case of system and qx
653Unicode in general, as input or output to/from the shell). Whether a
654filesystem - an operating system pair understands Unicode in filenames
655varies.
656
657Known combinations that have some level of understanding include
658Microsoft NTFS, Apple HFS+ (In Mac OS 9 and X) and Apple UFS (in Mac
659OS X), NFS v4 is rumored to be Unicode, and of course Plan 9. How to
660create Unicode filenames, what forms of Unicode are accepted and used
661(UCS-2, UTF-16, UTF-8), what (if any) is the normalization form used,
662and so on, varies. Finding the right level of interfacing to Perl
663requires some thought. Remember that an OS does not implicate a
664filesystem.
665
eb450546 666Note that in Windows the -C command line flag already does quite
667a bit of the above (but even there the support is not complete:
668for example the exec/spawn are not Unicode-aware) by turning on
669the so-called "wide API support".
670
722d2a37 671=head1 Recently done things
e50bb9a1 672
722d2a37 673These are things which have been on the todo lists in previous releases
674but have recently been completed.
e50bb9a1 675
b0b7f283 676=head2 Alternative RE syntax module
677
678The C<Regexp::English> module, available from the CPAN, provides this:
679
680 my $re = Regexp::English
681 -> start_of_line
682 -> literal('Flippers')
683 -> literal(':')
684 -> optional
685 -> whitespace_char
686 -> end
687 -> remember
688 -> multiple
689 -> digit;
690
691 /$re/;
692
722d2a37 693=head2 Safe signal handling
e50bb9a1 694
722d2a37 695A new signal model went into 5.7.1 without much fanfare. Operations and
696C<malloc>s are no longer interrupted by signals, which are handled
697between opcodes. This means that C<PERL_ASYNC_CHECK> now actually does
698something. However, there are still a few things that need to be done.
e50bb9a1 699
722d2a37 700=head2 Tie Modules
e50bb9a1 701
722d2a37 702Modules which implement arrays in terms of strings, substrings or files
703can be found on the CPAN.
e50bb9a1 704
722d2a37 705=head2 gettimeofday
e50bb9a1 706
210b36aa 707C<Time::HiRes> has been integrated into the core.
e50bb9a1 708
722d2a37 709=head2 setitimer and getimiter
e50bb9a1 710
210b36aa 711Adding C<Time::HiRes> got us this too.
e50bb9a1 712
722d2a37 713=head2 Testing __DIE__ hook
714
715Tests have been added.
716
717=head2 CPP equivalent in Perl
e50bb9a1 718
722d2a37 719A C Yardley will probably have done this by the time you can read this.
720This allows for a generalization of the C constant detection used in
721building C<Errno.pm>.
e50bb9a1 722
722d2a37 723=head2 Explicit switch statements
e50bb9a1 724
722d2a37 725C<Switch.pm> has been integrated into the core to give you all manner of
726C<switch...case> semantics.
e50bb9a1 727
722d2a37 728=head2 autocroak
e50bb9a1 729
722d2a37 730This is C<Fatal.pm>.
e50bb9a1 731
722d2a37 732=head2 UTF/EBCDIC
e50bb9a1 733
722d2a37 734Nick Ing-Simmons has made UTF-EBCDIC (UTR13) work with Perl.
e50bb9a1 735
722d2a37 736 EBCDIC? http://www.unicode.org/unicode/reports/tr16/
e50bb9a1 737
722d2a37 738=head2 UTF Regexes
e50bb9a1 739
722d2a37 740Although there are probably some small bugs to be rooted out, Jarkko
741Hietaniemi has made regular expressions polymorphic between bytes and
742characters.
e50bb9a1 743
722d2a37 744=head2 perlcc to produce executable
e50bb9a1 745
722d2a37 746C<perlcc> was recently rewritten, and can now produce standalone
747executables.
e50bb9a1 748
722d2a37 749=head2 END blocks saved in compiled output
e50bb9a1 750
722d2a37 751=head2 Secure temporary file module
e50bb9a1 752
722d2a37 753Tim Jenness' C<File::Temp> is now in core.
e50bb9a1 754
722d2a37 755=head2 Integrate Time::HiRes
e50bb9a1 756
722d2a37 757This module is now part of core.
e50bb9a1 758
722d2a37 759=head2 Turn Cwd into XS
e50bb9a1 760
722d2a37 761Benjamin Sugars has done this.
e50bb9a1 762
722d2a37 763=head2 Mmap for input
e50bb9a1 764
722d2a37 765Nick Ing-Simmons' C<perlio> supports an C<mmap> IO method.
e50bb9a1 766
722d2a37 767=head2 Byte to/from UTF8 and UTF8 to/from local conversion
e50bb9a1 768
722d2a37 769C<Encode> provides this.
e50bb9a1 770
722d2a37 771=head2 Add sockatmark support
e50bb9a1 772
722d2a37 773Added in 5.7.1
e50bb9a1 774
722d2a37 775=head2 Mailing list archives
776
f224927c 777http://lists.perl.org/ , http://archive.develooper.com/
722d2a37 778
779=head2 Bug tracking
780
781Richard Foley has written the bug tracking system at http://bugs.perl.org/
e50bb9a1 782
722d2a37 783=head2 Integrate MacPerl
e50bb9a1 784
722d2a37 785Chris Nandor and Matthias Neeracher have integrated the MacPerl changes
786into 5.6.0.
e50bb9a1 787
722d2a37 788=head2 Web "nerve center" for Perl
e50bb9a1 789
722d2a37 790http://use.perl.org/ is what you're looking for.
e50bb9a1 791
722d2a37 792=head2 Regular expression tutorial
e50bb9a1 793
722d2a37 794C<perlretut>, provided by Mark Kvale.
e50bb9a1 795
722d2a37 796=head2 Debugging Tutorial
e50bb9a1 797
722d2a37 798C<perldebtut>, written by Richard Foley.
e50bb9a1 799
722d2a37 800=head2 Integrate new modules
e50bb9a1 801
722d2a37 802Jarkko has been integrating madly into 5.7.x
e50bb9a1 803
722d2a37 804=head2 Integrate profiler
e50bb9a1 805
722d2a37 806C<Devel::DProf> is now a core module.
e50bb9a1 807
722d2a37 808=head2 Y2K error detection
e50bb9a1 809
722d2a37 810There's a configure option to detect unsafe concatenation with "19", and
811a CPAN module. (C<D'oh::Year>)
e50bb9a1 812
722d2a37 813=head2 Regular expression debugger
e50bb9a1 814
722d2a37 815While not part of core, Mark-Jason Dominus has written C<Rx> and has
816also come up with a generalised strategy for regular expression
817debugging.
e50bb9a1 818
722d2a37 819=head2 POD checker
e50bb9a1 820
722d2a37 821That's, uh, F<podchecker>
e50bb9a1 822
722d2a37 823=head2 "Dynamic" lexicals
e50bb9a1 824
722d2a37 825=head2 Cache precompiled modules
e50bb9a1 826
722d2a37 827=head1 Deprecated Wishes
e50bb9a1 828
722d2a37 829These are items which used to be in the todo file, but have been
830deprecated for some reason.
e50bb9a1 831
722d2a37 832=head2 Loop control on do{}
e50bb9a1 833
722d2a37 834This would break old code; use C<do{{ }}> instead.
e50bb9a1 835
722d2a37 836=head2 Lexically scoped typeglobs
e50bb9a1 837
722d2a37 838Not needed now we have lexical IO handles.
e50bb9a1 839
722d2a37 840=head2 format BOTTOM
3958b146 841
722d2a37 842=head2 report HANDLE
e50bb9a1 843
722d2a37 844Damian Conway's text formatting modules seem to be the Way To Go.
e50bb9a1 845
722d2a37 846=head2 Generalised want()/caller())
3958b146 847
638ae6a9 848Robin Houston's C<Want> module does this.
849
722d2a37 850=head2 Named prototypes
e50bb9a1 851
638ae6a9 852This seems to be delayed until Perl 6.
e50bb9a1 853
722d2a37 854=head2 Built-in globbing
e50bb9a1 855
722d2a37 856The C<File::Glob> module has been used to replace the C<glob> function.
e50bb9a1 857
722d2a37 858=head2 Regression tests for suidperl
e50bb9a1 859
722d2a37 860C<suidperl> is deprecated in favour of common sense.
e50bb9a1 861
722d2a37 862=head2 Cached hash values
e50bb9a1 863
722d2a37 864We have shared hash keys, which perform the same job.
e50bb9a1 865
722d2a37 866=head2 Add compression modules
e50bb9a1 867
722d2a37 868The compression modules are a little heavy; meanwhile, Nick Clark is
869working on experimental pragmata to do transparent decompression on
870input.
e50bb9a1 871
722d2a37 872=head2 Reorganise documentation into tutorials/references
e50bb9a1 873
722d2a37 874Could not get consensus on P5P about this.
e50bb9a1 875
722d2a37 876=head2 Remove distinction between functions and operators
877
878Caution: highly flammable.
879
880=head2 Make XS easier to use
e50bb9a1 881
722d2a37 882Use C<Inline> instead, or SWIG.
e50bb9a1 883
722d2a37 884=head2 Make embedding easier to use
e50bb9a1 885
722d2a37 886Use C<Inline::CPR>.
e50bb9a1 887
722d2a37 888=head2 man for perl
04c70446 889
1577cd80 890See the Perl Power Tools. ( http://language.perl.com/ppt/ )
04c70446 891
722d2a37 892=head2 my $Package::variable
04c70446 893
722d2a37 894Use C<our> instead.
04c70446 895
722d2a37 896=head2 "or" tests defined, not truth
04c70446 897
722d2a37 898Suggesting this on P5P B<will> cause a boring and interminable flamewar.
04c70446 899
722d2a37 900=head2 "class"-based lexicals
04c70446 901
cbb3fa72 902Use flyweight objects, secure hashes or, dare I say it, pseudo-hashes instead.
f86a8bc5 903(Or whatever will replace pseudohashes in 5.10.)
04c70446 904
722d2a37 905=head2 byteperl
04c70446 906
722d2a37 907C<ByteLoader> covers this.
04c70446 908
722d2a37 909=head2 Lazy evaluation / tail recursion removal
04c70446 910
f86a8bc5 911C<List::Util> gives first() (a short-circuiting grep); tail recursion
912removal is done manually, with C<goto &whoami;>. (However, MJD has
913found that C<goto &whoami> introduces a performance penalty, so maybe
914there should be a way to do this after all: C<sub foo {START: ... goto
915START;> is better.)
0562c0e3 916
917=head2 Make "use utf8" the default
918
f86a8bc5 919Because of backward compatibility this is difficult: scripts could not
920contain B<any legacy eight-bit data> (like Latin-1) anymore, even in
921string literals or pod. Also would introduce a measurable slowdown of
922at least few percentages since all regular expression operations would
923be done in full UTF-8. But if you want to try this, add
924-DUSE_UTF8_SCRIPTS to your compilation flags.
925
3298bd4d 926=head2 Unicode collation and normalization
927
928The Unicode::Collate and Unicode::Normalize modules
929by SADAHIRO Tomoyuki have been included since 5.8.0.
930
931 Collation? http://www.unicode.org/unicode/reports/tr10/
932 Normalization? http://www.unicode.org/unicode/reports/tr15/
0562c0e3 933
1626a787 934=head2 pack/unpack tutorial
935
936Wolfgang Laun finished what Simon Cozens started.
937
3298bd4d 938=cut