Don't assume sh and true.
[p5sagit/p5-mst-13.2.git] / pod / perltodo.pod
CommitLineData
7711098a 1=head1 NAME
2
3perltodo - Perl TO-DO List
4
5=head1 DESCRIPTION
e50bb9a1 6
722d2a37 7This is a list of wishes for Perl. Send updates to
e50bb9a1 8I<perl5-porters@perl.org>. If you want to work on any of these
9projects, be sure to check the perl5-porters archives for past ideas,
10flames, and propaganda. This will save you time and also prevent you
11from implementing something that Larry has already vetoed. One set
12of archives may be found at:
13
14 http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/
15
722d2a37 16=head1 To do during 5.6.x
e50bb9a1 17
722d2a37 18=head2 Support for I/O disciplines
e50bb9a1 19
722d2a37 20C<perlio> provides this, but the interface could be a lot more
21straightforward.
e50bb9a1 22
4b3b956a 23=head2 Autoload bytes.pm
e50bb9a1 24
4b3b956a 25When the lexer sees, for instance, C<bytes::length>, it should
26automatically load the C<bytes> pragma.
27
28=head2 Make "\u{XXXX}" et al work
29
30Danger, Will Robinson! Discussing the semantics of C<"\x{F00}">,
31C<"\xF00"> and C<"\U{F00}"> on P5P I<will> lead to a long and boring
32flamewar.
e50bb9a1 33
c6287c21 34=head2 Create a char *sv_pvprintify(sv, STRLEN *lenp, UV flags)
0562c0e3 35
36For displaying PVs with control characters, embedded nulls, and Unicode.
37This would be useful for printing warnings, or data and regex dumping,
38not_a_number(), and so on.
39
f35392ae 40Requirements: should handle both byte and UTF8 strings. isPRINT()
41characters printed as-is, character less than 256 as \xHH, Unicode
0661e9a4 42characters as \x{HHH}. Don't assume ASCII-like, either, get somebody
43on EBCDIC to test the output.
f35392ae 44
45Possible options, controlled by the flags:
0661e9a4 46- whitespace (other than ' ' of isPRINT()) printed as-is
f35392ae 47- use isPRINT_LC() instead of isPRINT()
48- print control characters like this: "\cA"
49- print control characters like this: "^A"
0661e9a4 50- non-PRINTables printed as '.' instead of \xHH
51- use \OOO instead of \xHH
52- use the C/Perl-metacharacters like \n, \t
f35392ae 53- have a maximum length for the produced string (read it from *lenp)
54- append a "..." to the produced string if the maximum length is exceeded
0661e9a4 55- really fancy: print unicode characters as \N{...}
f35392ae 56
1626a787 57NOTE: pv_display(), pv_uni_display(), sv_uni_display() are already
58doing something like the above.
c5fc23ff 59
722d2a37 60=head2 Overloadable regex assertions
e50bb9a1 61
722d2a37 62This may or may not be possible with the current regular expression
63engine. The idea is that, for instance, C<\b> needs to be
64algorithmically computed if you're dealing with Thai text. Hence, the
65B<\b> assertion wants to be overloaded by a function.
e50bb9a1 66
776f8809 67=head2 Unicode
68
69=over 4
70
71=item *
e50bb9a1 72
f34dec15 73Allow for long form of the General Category Properties, e.g
74C<\p{IsOpenPunctuation}>, not just the abbreviated form, e.g.
75C<\p{IsPs}>.
76
77=item *
78
1ac13f9a 79Allow for the metaproperties: C<XID Start>, C<XID Continue>,
80C<NF*_NO>, C<NF*_MAYBE> (require the DerivedCoreProperties and
81DerviceNormalizationProperties files).
f34dec15 82
71d929cb 83There are also multiple value properties still unimplemented:
84C<Numeric Type>, C<East Asian Width>.
f34dec15 85
86=item *
87
722d2a37 88 Case Mappings? http://www.unicode.org/unicode/reports/tr21/
e50bb9a1 89
6f16a292 90lc(), uc(), lcfirst(), and ucfirst() work only for some of the
91simplest cases, where the mapping goes from a single Unicode character
92to another single Unicode character. See lib/unicore/SpecCase.txt
93(and CaseFold.txt).
ac1256e8 94
776f8809 95=item *
e50bb9a1 96
8d3e8850 97UTF-8 identifier names should probably be canonicalized: NFC?
e50bb9a1 98
20eafb1c 99=item *
100
101UTF-8 in package names and sub names? The first is problematic
8d3e8850 102because of the mapping to pathnames, ditto for the second one if
20eafb1c 103one does autosplitting, for example.
e50bb9a1 104
776f8809 105=back
106
107See L<perlunicode/UNICODE REGULAR EXPRESSION SUPPORT LEVEL> for what's
f34dec15 108there and what's missing. Almost all of Levels 2 and 3 is missing,
109and as of 5.8.0 not even all of Level 1 is there.
8d3e8850 110They have some tricks Perl doesn't yet implement, such as character
20eafb1c 111class subtraction.
112
113 http://www.unicode.org/unicode/reports/tr18/
776f8809 114
56490ca2 115=head2 Work out exit/die semantics for threads
e50bb9a1 116
97b33923 117There are some suggestions to use for example something like this:
118default to "(thread exiting first will) wait for the other threads
119until up to 60 seconds". Other possibilities:
120
121 use threads wait => 0;
122
123Do not wait.
124
125 use threads wait_for => 10;
126
127Wait up to 10 seconds.
128
129 use threads wait_for => -1;
130
131Wait for ever.
e50bb9a1 132
56490ca2 133http://archive.develooper.com/perl5-porters@perl.org/msg79618.html
dd0afe54 134
b2f9d798 135=head2 Better support for nonpreemptive threading systems like GNU pth
dd0afe54 136
b2f9d798 137To better support nonpreemptive threading systems, perhaps some of the
138blocking functions internally in Perl should do a yield() before a
139blocking call. (Now certain threads tests ({basic,list,thread.t})
140simply do a yield() before they sleep() to give nonpreemptive thread
141implementations a chance).
cfde3649 142
b2f9d798 143In some cases, like the GNU pth, which has replacement functions that
144are nonblocking (pth_select instead of select), maybe Perl should be
145using them instead when built for threading.
e50bb9a1 146
722d2a37 147=head2 Typed lexicals for compiler
e50bb9a1 148
722d2a37 149=head2 Compiler workarounds for Win32
e50bb9a1 150
722d2a37 151=head2 AUTOLOADing in the compiler
e50bb9a1 152
722d2a37 153=head2 Fixing comppadlist when compiling
e50bb9a1 154
722d2a37 155=head2 Cleaning up exported namespace
e50bb9a1 156
722d2a37 157=head2 Complete signal handling
e50bb9a1 158
722d2a37 159Add C<PERL_ASYNC_CHECK> to opcodes which loop; replace C<sigsetjmp> with
160C<sigjmp>; check C<wait> for signal safety.
e50bb9a1 161
722d2a37 162=head2 Out-of-source builds
e50bb9a1 163
722d2a37 164This was done for 5.6.0, but needs reworking for 5.7.x
e50bb9a1 165
722d2a37 166=head2 POSIX realtime support
e50bb9a1 167
722d2a37 168POSIX 1003.1 1996 Edition support--realtime stuff: POSIX semaphores,
169message queues, shared memory, realtime clocks, timers, signals (the
170metaconfig units mostly already exist for these)
e50bb9a1 171
722d2a37 172=head2 UNIX98 support
e50bb9a1 173
722d2a37 174Reader-writer locks, realtime/asynchronous IO
e50bb9a1 175
722d2a37 176=head2 IPv6 Support
e50bb9a1 177
fe854a6f 178There are non-core modules, such as C<Socket6>, but these will need
722d2a37 179integrating when IPv6 actually starts to really happen. See RFC 2292
180and RFC 2553.
e50bb9a1 181
722d2a37 182=head2 Long double conversion
e50bb9a1 183
722d2a37 184Floating point formatting is still causing some weird test failures.
e50bb9a1 185
722d2a37 186=head2 Locales
e50bb9a1 187
722d2a37 188Locales and Unicode interact with each other in unpleasant ways.
189One possible solution would be to adopt/support ICU:
e50bb9a1 190
722d2a37 191 http://oss.software.ibm.com/developerworks/opensource/icu/project/
e50bb9a1 192
722d2a37 193=head2 Arithmetic on non-Arabic numerals
e50bb9a1 194
722d2a37 195C<[1234567890]> aren't the only numerals any more.
e50bb9a1 196
722d2a37 197=head2 POSIX Unicode character classes
e50bb9a1 198
97b33923 199(C<[=a=]> for equivalence classes, C<[.ch.]> for collation.)
722d2a37 200These are dependent on Unicode normalization and collation.
e50bb9a1 201
722d2a37 202=head2 Factoring out common suffices/prefices in regexps (trie optimization)
c47ff5f1 203
722d2a37 204Currently, the user has to optimize C<foo|far> and C<foo|goo> into
205C<f(?:oo|ar)> and C<[fg]oo> by hand; this could be done automatically.
e50bb9a1 206
722d2a37 207=head2 Security audit shipped utilities
e50bb9a1 208
722d2a37 209All the code we ship with Perl needs to be sensible about temporary file
210handling, locking, input validation, and so on.
e50bb9a1 211
c8d2171d 212=head2 Sort out the uid-setting mess
213
214Currently there are several problems with the setting of uids ($<, $>
215for the real and effective uids). Firstly, what exactly setuid() call
216gets invoked in which platform is simply a big mess that needs to be
217untangled. Secondly, the effects are apparently not standard across
218platforms, (if you first set $< and then $>, or vice versa, being
666f95b9 219uid == euid == zero, or just euid == zero, or as a normal user, what are
c8d2171d 220the results?). The test suite not (usually) being run as root means
221that these things do not get much testing. Thirdly, there's quite
222often a third uid called saved uid, and Perl has no knowledge of that
223feature in any way. (If one has the saved uid of zero, one can get
224back any real and effective uids.) As an example, to change also the
225saved uid, one needs to set the real and effective uids B<twice>-- in
226most systems, that is: in HP-UX that doesn't seem to work.
666f95b9 227
722d2a37 228=head2 Custom opcodes
e50bb9a1 229
722d2a37 230Have a way to introduce user-defined opcodes without the subroutine call
231overhead of an XSUB; the user should be able to create PP code. Simon
232Cozens has some ideas on this.
e50bb9a1 233
722d2a37 234=head2 DLL Versioning
e50bb9a1 235
d1be9408 236Windows needs a way to know what version of an XS or C<libperl> DLL it's
722d2a37 237loading.
e50bb9a1 238
722d2a37 239=head2 Introduce @( and @)
e50bb9a1 240
722d2a37 241C<$(> may return "foo bar baz". Unfortunately, since groups can
242theoretically have spaces in their names, this could be one, two or
243three groups.
e50bb9a1 244
722d2a37 245=head2 Floating point handling
e50bb9a1 246
722d2a37 247C<NaN> and C<inf> support is particularly troublesome.
248(fp_classify(), fp_class(), fp_class_d(), class(), isinf(),
249isfinite(), finite(), isnormal(), unordered(), <ieeefp.h>,
250<fp_class.h> (there are metaconfig units for all these) (I think),
251fp_setmask(), fp_getmask(), fp_setround(), fp_getround()
252(no metaconfig units yet for these). Don't forget finitel(), fp_classl(),
253fp_class_l(), (yes, both do, unfortunately, exist), and unorderedl().)
e50bb9a1 254
210b36aa 255As of Perl 5.6.1, there is a Perl macro, Perl_isnan().
e50bb9a1 256
722d2a37 257=head2 IV/UV preservation
e50bb9a1 258
722d2a37 259Nicholas Clark has done a lot of work on this, but work is continuing.
260C<+>, C<-> and C<*> work, but guards need to be in place for C<%>, C</>,
261C<&>, C<oct>, C<hex> and C<pack>.
e50bb9a1 262
722d2a37 263=head2 Replace pod2html with something using Pod::Parser
83df6a1d 264
fe854a6f 265The CPAN module C<Marek::Pod::Html> may be a more suitable basis for a
97b33923 266C<pod2html> converter; the current one duplicates the functionality
722d2a37 267abstracted in C<Pod::Parser>, which makes updating the POD language
268difficult.
e50bb9a1 269
722d2a37 270=head2 Automate module testing on CPAN
e50bb9a1 271
722d2a37 272When a new Perl is being beta tested, porters have to manually grab
273their favourite CPAN modules and test them - this should be done
274automatically.
e50bb9a1 275
722d2a37 276=head2 sendmsg and recvmsg
83df6a1d 277
722d2a37 278We have all the other BSD socket functions but these. There are
279metaconfig units for these functions which can be added. To avoid these
280being new opcodes, a solution similar to the way C<sockatmark> was added
281would be preferable. (Autoload the C<IO::whatever> module.)
e50bb9a1 282
722d2a37 283=head2 Rewrite perlre documentation
e50bb9a1 284
722d2a37 285The new-style patterns need full documentation, and the whole document
286needs to be a lot clearer.
e50bb9a1 287
722d2a37 288=head2 Convert example code to IO::Handle filehandles
e50bb9a1 289
722d2a37 290=head2 Document Win32 choices
e50bb9a1 291
722d2a37 292=head2 Check new modules
e50bb9a1 293
722d2a37 294=head2 Make roffitall find pods and libs itself
e50bb9a1 295
722d2a37 296Simon Cozens has done some work on this but it needs a rethink.
e50bb9a1 297
722d2a37 298=head1 To do at some point
e50bb9a1 299
722d2a37 300These are ideas that have been regularly tossed around, that most
301people believe should be done maybe during 5.8.x
e50bb9a1 302
722d2a37 303=head2 Remove regular expression recursion
e50bb9a1 304
722d2a37 305Because the regular expression engine is recursive, badly designed
306expressions can lead to lots of recursion filling up the stack. Ilya
307claims that it is easy to convert the engine to being iterative, but
308this has still not yet been done. There may be a regular expression
309engine hit squad meeting at TPC5.
e50bb9a1 310
722d2a37 311=head2 Memory leaks after failed eval
e50bb9a1 312
722d2a37 313Perl will leak memory if you C<eval "hlagh hlagh hlagh hlagh">. This is
314partially because it attempts to build up an op tree for that code and
315doesn't properly free it. The same goes for non-syntactically-correct
316regular expressions. Hugo looked into this, but decided it needed a
317mark-and-sweep GC implementation.
e50bb9a1 318
722d2a37 319Alan notes that: The basic idea was to extend the parser token stack
320(C<YYSTYPE>) to include a type field so we knew what sort of thing each
210b36aa 321element of the stack was. The F<perly.c> code would then have to be
722d2a37 322postprocessed to record the type of each entry on the stack as it was
323created, and the parser patched so that it could unroll the stack
324properly on error.
e50bb9a1 325
722d2a37 326This is possible to do, but would be pretty messy to implement, as it
327would rely on even more sed hackery in F<perly.fixer>.
e50bb9a1 328
722d2a37 329=head2 bitfields in pack
e50bb9a1 330
722d2a37 331=head2 Cross compilation
e50bb9a1 332
722d2a37 333Make Perl buildable with a cross-compiler. This will play havoc with
da75cd15 334Configure, which needs to know how the target system will respond to
722d2a37 335its tests; maybe C<microperl> will be a good starting point here.
336(Indeed, Bart Schuller reports that he compiled up C<microperl> for
337the Agenda PDA and it works fine.) A really big spanner in the works
338is the bootstrapping build process of Perl: if the filesystem the
339target systems sees is not the same what the build host sees, various
340input, output, and (Perl) library files need to be copied back and forth.
e50bb9a1 341
f86a8bc5 342As of 5.8.0 Configure mostly works for cross-compilation
343(used successfully for iPAQ Linux), miniperl gets built,
344but then building DynaLoader (and other extensions) fails
345since MakeMaker knows nothing of cross-compilation.
346(See INSTALL/Cross-compilation for the state of things.)
347
722d2a37 348=head2 Perl preprocessor / macros
e50bb9a1 349
722d2a37 350Source filters help with this, but do not get us all the way. For
351instance, it should be possible to implement the C<??> operator somehow;
352source filters don't (quite) cut it.
e50bb9a1 353
722d2a37 354=head2 Perl lexer in Perl
a45bd81d 355
722d2a37 356Damian Conway is planning to work on this, but it hasn't happened yet.
e50bb9a1 357
722d2a37 358=head2 Using POSIX calls internally
e50bb9a1 359
210b36aa 360When faced with a BSD vs. SysV -style interface to some library or
722d2a37 361system function, perl's roots show in that it typically prefers the BSD
362interface (but falls back to the SysV one). One example is getpgrp().
363Other examples include C<memcpy> vs. C<bcopy>. There are others, mostly in
210b36aa 364F<pp_sys.c>.
e50bb9a1 365
722d2a37 366Mostly, this item is a suggestion for which way to start a journey into
367an C<#ifdef> forest. It is not primarily a suggestion to eliminate any of
368the C<#ifdef> forests.
e50bb9a1 369
722d2a37 370POSIX calls are perhaps more likely to be portable to unexpected
371architectures. They are also perhaps more likely to be actively
372maintained by a current vendor. They are also perhaps more likely to be
373available in thread-safe versions, if appropriate.
e50bb9a1 374
722d2a37 375=head2 -i rename file when changed
e50bb9a1 376
722d2a37 377It's only necessary to rename a file when inplace editing when the file
378has changed. Detecting a change is perhaps the difficult bit.
e50bb9a1 379
722d2a37 380=head2 All ARGV input should act like E<lt>E<gt>
e50bb9a1 381
2d84a16a 382eg C<read(ARGV, ...)> doesn't currently read across multiple files.
383
722d2a37 384=head2 Support for rerunning debugger
e50bb9a1 385
722d2a37 386There should be a way of restarting the debugger on demand.
e50bb9a1 387
c6287c21 388=head2 Test Suite for the Debugger
389
390The debugger is a complex piece of software and fixing something
391here may inadvertently break something else over there. To tame
392this chaotic behaviour, a test suite is necessary.
393
722d2a37 394=head2 my sub foo { }
c47ff5f1 395
722d2a37 396The basic principle is sound, but there are problems with the semantics
397of self-referential and mutually referential lexical subs: how to
398declare the subs?
c47ff5f1 399
722d2a37 400=head2 One-pass global destruction
c47ff5f1 401
722d2a37 402Sweeping away all the allocated memory in one go is a laudable goal, but
403it's difficult and in most cases, it's easier to let the memory get
404freed by exiting.
e50bb9a1 405
722d2a37 406=head2 Rewrite regexp parser
e50bb9a1 407
722d2a37 408There has been talk recently of rewriting the regular expression parser
409to produce an optree instead of a chain of opcodes; it's unclear whether
410or not this would be a win.
e50bb9a1 411
722d2a37 412=head2 Cache recently used regexps
e50bb9a1 413
722d2a37 414This is to speed up
e50bb9a1 415
722d2a37 416 for my $re (@regexps) {
417 $matched++ if /$re/
418 }
e50bb9a1 419
722d2a37 420C<qr//> already gives us a way of saving compiled regexps, but it should
421be done automatically.
e50bb9a1 422
722d2a37 423=head2 Cross-compilation support
04c70446 424
722d2a37 425Bart Schuller reports that using C<microperl> and a cross-compiler, he
426got Perl working on the Agenda PDA. However, one cannot build a full
427Perl because Configure needs to get the results for the target platform,
428for the host.
e50bb9a1 429
722d2a37 430=head2 Bit-shifting bitvectors
e50bb9a1 431
722d2a37 432Given:
e50bb9a1 433
722d2a37 434 vec($v, 1000, 1) = 1;
e50bb9a1 435
722d2a37 436One should be able to do
e50bb9a1 437
722d2a37 438 $v <<= 1;
e50bb9a1 439
722d2a37 440and have the 999'th bit set.
e50bb9a1 441
722d2a37 442Currently if you try with shift bitvectors you shift the NV/UV, instead
443of the bits in the PV. Not very logical.
e50bb9a1 444
722d2a37 445=head2 debugger pragma
e50bb9a1 446
722d2a37 447The debugger is implemented in Perl in F<perl5db.pl>; turning it into a
448pragma should be easy, but making it work lexically might be more
449difficult. Fiddling with C<$^P> would be necessary.
e50bb9a1 450
722d2a37 451=head2 use less pragma
e50bb9a1 452
722d2a37 453Identify areas where speed/memory tradeoffs can be made and have a hint
454to switch between them.
e50bb9a1 455
722d2a37 456=head2 switch structures
e50bb9a1 457
722d2a37 458Although we have C<Switch.pm> in core, Larry points to the dormant
459C<nswitch> and C<cswitch> ops in F<pp.c>; using these opcodes would be
460much faster.
e50bb9a1 461
722d2a37 462=head2 Cache eval tree
e50bb9a1 463
722d2a37 464=head2 rcatmaybe
e50bb9a1 465
722d2a37 466=head2 Shrink opcode tables
e50bb9a1 467
722d2a37 468=head2 Optimize away @_
e50bb9a1 469
722d2a37 470Look at the "reification" code in C<av.c>
e50bb9a1 471
722d2a37 472=head2 Prototypes versus indirect objects
e50bb9a1 473
722d2a37 474Currently, indirect object syntax bypasses prototype checks.
e50bb9a1 475
210b36aa 476=head2 Install HTML
e50bb9a1 477
722d2a37 478HTML versions of the documentation need to be installed by default; a
479call to C<installhtml> from C<installperl> may be all that's necessary.
e50bb9a1 480
722d2a37 481=head2 Prototype method calls
e50bb9a1 482
722d2a37 483=head2 Return context prototype declarations
e50bb9a1 484
722d2a37 485=head2 magic_setisa
e50bb9a1 486
722d2a37 487=head2 Garbage collection
e50bb9a1 488
722d2a37 489There have been persistent mumblings about putting a mark-and-sweep
490garbage detector into Perl; Alan Burlison has some ideas about this.
e50bb9a1 491
722d2a37 492=head2 IO tutorial
e50bb9a1 493
722d2a37 494Mark-Jason Dominus has the beginnings of one of these.
e50bb9a1 495
722d2a37 496=head2 Rewrite perldoc
e50bb9a1 497
722d2a37 498There are a few suggestions for what to do with C<perldoc>: maybe a
499full-text search, an index function, locating pages on a particular
500high-level subject, and so on.
e50bb9a1 501
3958b146 502=head2 Install .3p manpages
e50bb9a1 503
3958b146 504This is a bone of contention; we can create C<.3p> manpages for each
722d2a37 505built-in function, but should we install them by default? Tcl does this,
506and it clutters up C<apropos>.
e50bb9a1 507
722d2a37 508=head2 Unicode tutorial
e50bb9a1 509
722d2a37 510Simon Cozens promises to do this before he gets old.
e50bb9a1 511
722d2a37 512=head2 Update POSIX.pm for 1003.1-2
3958b146 513
722d2a37 514=head2 Retargetable installation
e50bb9a1 515
722d2a37 516Allow C<@INC> to be changed after Perl is built.
e50bb9a1 517
722d2a37 518=head2 POSIX emulation on non-POSIX systems
e50bb9a1 519
722d2a37 520Make C<POSIX.pm> behave as POSIXly as possible everywhere, meaning we
521have to implement POSIX equivalents for some functions if necessary.
e50bb9a1 522
722d2a37 523=head2 Rename Win32 headers
e50bb9a1 524
722d2a37 525=head2 Finish off lvalue functions
526
527They don't work in the debugger, and they don't work for list or hash
528slices.
e50bb9a1 529
722d2a37 530=head2 Update sprintf documentation
e50bb9a1 531
722d2a37 532Hugo van der Sanden plans to look at this.
e50bb9a1 533
722d2a37 534=head2 Use fchown/fchmod internally
e50bb9a1 535
722d2a37 536This has been done in places, but needs a thorough code review.
537Also fchdir is available in some platforms.
e50bb9a1 538
d45541b3 539=head2 Make v-strings overloaded objects
c5fc23ff 540
d45541b3 541Instead of having to guess whether a string is a v-string and thus
542needs to be displayed with %vd, make v-strings (readonly) objects
543(class "vstring"?) with a stringify overload.
c5fc23ff 544
49293501 545=head2 Allow restricted hash assignment
546
547Currently you're not allowed to assign to a restricted hash at all,
548even with the same keys.
549
550 %restricted = (foo => 42); # error
551
552This should be allowed if the new keyset is a subset of the old
553keyset. May require more extra code than we'd like in pp_aassign.
554
5387ccf1 555=head2 Should overload be inheritable?
556
557Should overload be 'contagious' through @ISA so that derived classes
558would inherit their base classes' overload definitions? What to do
559in case of overload conflicts?
560
cbda53d5 561=head2 Taint rethink
562
563Should taint be stopped from affecting control flow, if ($tainted)?
564Should tainted symbolic method calls and subref calls be stopped?
565(Look at Ruby's $SAFE levels for inspiration?)
566
722d2a37 567=head1 Vague ideas
e50bb9a1 568
722d2a37 569Ideas which have been discussed, and which may or may not happen.
e50bb9a1 570
722d2a37 571=head2 ref() in list context
e50bb9a1 572
722d2a37 573It's unclear what this should do or how to do it without breaking old
574code.
e50bb9a1 575
f86a8bc5 576=head2 Make tr/// return histogram of characters in list context
e50bb9a1 577
722d2a37 578There is a patch for this, but it may require Unicodification.
e50bb9a1 579
722d2a37 580=head2 Compile to real threaded code
3958b146 581
722d2a37 582=head2 Structured types
3958b146 583
722d2a37 584=head2 Modifiable $1 et al.
e50bb9a1 585
722d2a37 586 ($x = "elephant") =~ /e(ph)/;
587 $1 = "g"; # $x = "elegant"
e50bb9a1 588
722d2a37 589What happens if there are multiple (nested?) brackets? What if the
590string changes between the match and the assignment?
e50bb9a1 591
722d2a37 592=head2 Procedural interfaces for IO::*, etc.
e50bb9a1 593
722d2a37 594Some core modules have been accused of being overly-OO. Adding
595procedural interfaces could demystify them.
e50bb9a1 596
722d2a37 597=head2 RPC modules
e50bb9a1 598
722d2a37 599=head2 Attach/detach debugger from running program
e50bb9a1 600
722d2a37 601With C<gdb>, you can attach the debugger to a running program if you
602pass the process ID. It would be good to do this with the Perl debugger
603on a running Perl program, although I'm not sure how it would be done.
e50bb9a1 604
722d2a37 605=head2 GUI::Native
e50bb9a1 606
722d2a37 607A non-core module that would use "native" GUI to create graphical
608applications.
e50bb9a1 609
722d2a37 610=head2 foreach(reverse ...)
e50bb9a1 611
722d2a37 612Currently
e50bb9a1 613
722d2a37 614 foreach (reverse @_) { ... }
e50bb9a1 615
722d2a37 616puts C<@_> on the stack, reverses it putting the reversed version on the
617stack, then iterates forwards. Instead, it could be special-cased to put
618C<@_> on the stack then iterate backwards.
e50bb9a1 619
722d2a37 620=head2 Constant function cache
e50bb9a1 621
722d2a37 622=head2 Approximate regular expression matching
e50bb9a1 623
722d2a37 624=head1 Ongoing
e50bb9a1 625
722d2a37 626These items B<always> need doing:
e50bb9a1 627
722d2a37 628=head2 Update guts documentation
e50bb9a1 629
722d2a37 630Simon Cozens tries to do this when possible, and contributions to the
631C<perlapi> documentation is welcome.
e50bb9a1 632
722d2a37 633=head2 Add more tests
e50bb9a1 634
722d2a37 635Michael Schwern will donate $500 to Yet Another Society when all core
636modules have tests.
e50bb9a1 637
722d2a37 638=head2 Update auxiliary tools
e50bb9a1 639
722d2a37 640The code we ship with Perl should look like good Perl 5.
e50bb9a1 641
1e278fd9 642=head2 Create debugging macros
643
644Debugging macros (like printsv, dump) can make debugging perl inside a
645C debugger much easier. A good set for gdb comes with mod_perl.
646Something similar should be distributed with perl.
647
648The proper way to do this is to use and extend Devel::DebugInit.
649Devel::DebugInit also needs to be extended to support threads.
650
651See p5p archives for late May/early June 2001 for a recent discussion
652on this topic.
653
654=head2 truncate to the people
655
656One can emulate ftruncate() using F_FREESP and F_CHSIZ fcntls
657(see the UNIX FAQ for details). This needs to go somewhere near
658pp_sys.c:pp_truncate().
659
660One can emulate truncate() easily if one has ftruncate().
661This emulation should also go near pp_sys.pp_truncate().
662
663=head2 Unicode in Filenames
664
665chdir, chmod, chown, chroot, exec, glob, link, lstat, mkdir, open, qx,
666readdir, readlink, rename, rmdir, stat, symlink, sysopen, system,
667truncate, unlink, utime. All these could potentially accept Unicode
668filenames either as input or output (and in the case of system and qx
669Unicode in general, as input or output to/from the shell). Whether a
670filesystem - an operating system pair understands Unicode in filenames
671varies.
672
673Known combinations that have some level of understanding include
674Microsoft NTFS, Apple HFS+ (In Mac OS 9 and X) and Apple UFS (in Mac
675OS X), NFS v4 is rumored to be Unicode, and of course Plan 9. How to
676create Unicode filenames, what forms of Unicode are accepted and used
677(UCS-2, UTF-16, UTF-8), what (if any) is the normalization form used,
678and so on, varies. Finding the right level of interfacing to Perl
679requires some thought. Remember that an OS does not implicate a
680filesystem.
681
eb450546 682Note that in Windows the -C command line flag already does quite
683a bit of the above (but even there the support is not complete:
684for example the exec/spawn are not Unicode-aware) by turning on
685the so-called "wide API support".
686
722d2a37 687=head1 Recently done things
e50bb9a1 688
722d2a37 689These are things which have been on the todo lists in previous releases
690but have recently been completed.
e50bb9a1 691
b0b7f283 692=head2 Alternative RE syntax module
693
694The C<Regexp::English> module, available from the CPAN, provides this:
695
696 my $re = Regexp::English
697 -> start_of_line
698 -> literal('Flippers')
699 -> literal(':')
700 -> optional
701 -> whitespace_char
702 -> end
703 -> remember
704 -> multiple
705 -> digit;
706
707 /$re/;
708
722d2a37 709=head2 Safe signal handling
e50bb9a1 710
722d2a37 711A new signal model went into 5.7.1 without much fanfare. Operations and
712C<malloc>s are no longer interrupted by signals, which are handled
713between opcodes. This means that C<PERL_ASYNC_CHECK> now actually does
714something. However, there are still a few things that need to be done.
e50bb9a1 715
722d2a37 716=head2 Tie Modules
e50bb9a1 717
722d2a37 718Modules which implement arrays in terms of strings, substrings or files
719can be found on the CPAN.
e50bb9a1 720
722d2a37 721=head2 gettimeofday
e50bb9a1 722
210b36aa 723C<Time::HiRes> has been integrated into the core.
e50bb9a1 724
722d2a37 725=head2 setitimer and getimiter
e50bb9a1 726
210b36aa 727Adding C<Time::HiRes> got us this too.
e50bb9a1 728
722d2a37 729=head2 Testing __DIE__ hook
730
731Tests have been added.
732
733=head2 CPP equivalent in Perl
e50bb9a1 734
722d2a37 735A C Yardley will probably have done this by the time you can read this.
736This allows for a generalization of the C constant detection used in
737building C<Errno.pm>.
e50bb9a1 738
722d2a37 739=head2 Explicit switch statements
e50bb9a1 740
722d2a37 741C<Switch.pm> has been integrated into the core to give you all manner of
742C<switch...case> semantics.
e50bb9a1 743
722d2a37 744=head2 autocroak
e50bb9a1 745
722d2a37 746This is C<Fatal.pm>.
e50bb9a1 747
722d2a37 748=head2 UTF/EBCDIC
e50bb9a1 749
722d2a37 750Nick Ing-Simmons has made UTF-EBCDIC (UTR13) work with Perl.
e50bb9a1 751
722d2a37 752 EBCDIC? http://www.unicode.org/unicode/reports/tr16/
e50bb9a1 753
722d2a37 754=head2 UTF Regexes
e50bb9a1 755
722d2a37 756Although there are probably some small bugs to be rooted out, Jarkko
757Hietaniemi has made regular expressions polymorphic between bytes and
758characters.
e50bb9a1 759
722d2a37 760=head2 perlcc to produce executable
e50bb9a1 761
722d2a37 762C<perlcc> was recently rewritten, and can now produce standalone
763executables.
e50bb9a1 764
722d2a37 765=head2 END blocks saved in compiled output
e50bb9a1 766
722d2a37 767=head2 Secure temporary file module
e50bb9a1 768
722d2a37 769Tim Jenness' C<File::Temp> is now in core.
e50bb9a1 770
722d2a37 771=head2 Integrate Time::HiRes
e50bb9a1 772
722d2a37 773This module is now part of core.
e50bb9a1 774
722d2a37 775=head2 Turn Cwd into XS
e50bb9a1 776
722d2a37 777Benjamin Sugars has done this.
e50bb9a1 778
722d2a37 779=head2 Mmap for input
e50bb9a1 780
722d2a37 781Nick Ing-Simmons' C<perlio> supports an C<mmap> IO method.
e50bb9a1 782
722d2a37 783=head2 Byte to/from UTF8 and UTF8 to/from local conversion
e50bb9a1 784
722d2a37 785C<Encode> provides this.
e50bb9a1 786
722d2a37 787=head2 Add sockatmark support
e50bb9a1 788
722d2a37 789Added in 5.7.1
e50bb9a1 790
722d2a37 791=head2 Mailing list archives
792
f224927c 793http://lists.perl.org/ , http://archive.develooper.com/
722d2a37 794
795=head2 Bug tracking
796
797Richard Foley has written the bug tracking system at http://bugs.perl.org/
e50bb9a1 798
722d2a37 799=head2 Integrate MacPerl
e50bb9a1 800
722d2a37 801Chris Nandor and Matthias Neeracher have integrated the MacPerl changes
802into 5.6.0.
e50bb9a1 803
722d2a37 804=head2 Web "nerve center" for Perl
e50bb9a1 805
722d2a37 806http://use.perl.org/ is what you're looking for.
e50bb9a1 807
722d2a37 808=head2 Regular expression tutorial
e50bb9a1 809
722d2a37 810C<perlretut>, provided by Mark Kvale.
e50bb9a1 811
722d2a37 812=head2 Debugging Tutorial
e50bb9a1 813
722d2a37 814C<perldebtut>, written by Richard Foley.
e50bb9a1 815
722d2a37 816=head2 Integrate new modules
e50bb9a1 817
722d2a37 818Jarkko has been integrating madly into 5.7.x
e50bb9a1 819
722d2a37 820=head2 Integrate profiler
e50bb9a1 821
722d2a37 822C<Devel::DProf> is now a core module.
e50bb9a1 823
722d2a37 824=head2 Y2K error detection
e50bb9a1 825
722d2a37 826There's a configure option to detect unsafe concatenation with "19", and
827a CPAN module. (C<D'oh::Year>)
e50bb9a1 828
722d2a37 829=head2 Regular expression debugger
e50bb9a1 830
722d2a37 831While not part of core, Mark-Jason Dominus has written C<Rx> and has
832also come up with a generalised strategy for regular expression
833debugging.
e50bb9a1 834
722d2a37 835=head2 POD checker
e50bb9a1 836
722d2a37 837That's, uh, F<podchecker>
e50bb9a1 838
722d2a37 839=head2 "Dynamic" lexicals
e50bb9a1 840
722d2a37 841=head2 Cache precompiled modules
e50bb9a1 842
722d2a37 843=head1 Deprecated Wishes
e50bb9a1 844
722d2a37 845These are items which used to be in the todo file, but have been
846deprecated for some reason.
e50bb9a1 847
722d2a37 848=head2 Loop control on do{}
e50bb9a1 849
722d2a37 850This would break old code; use C<do{{ }}> instead.
e50bb9a1 851
722d2a37 852=head2 Lexically scoped typeglobs
e50bb9a1 853
722d2a37 854Not needed now we have lexical IO handles.
e50bb9a1 855
722d2a37 856=head2 format BOTTOM
3958b146 857
722d2a37 858=head2 report HANDLE
e50bb9a1 859
722d2a37 860Damian Conway's text formatting modules seem to be the Way To Go.
e50bb9a1 861
722d2a37 862=head2 Generalised want()/caller())
3958b146 863
638ae6a9 864Robin Houston's C<Want> module does this.
865
722d2a37 866=head2 Named prototypes
e50bb9a1 867
638ae6a9 868This seems to be delayed until Perl 6.
e50bb9a1 869
722d2a37 870=head2 Built-in globbing
e50bb9a1 871
722d2a37 872The C<File::Glob> module has been used to replace the C<glob> function.
e50bb9a1 873
722d2a37 874=head2 Regression tests for suidperl
e50bb9a1 875
722d2a37 876C<suidperl> is deprecated in favour of common sense.
e50bb9a1 877
722d2a37 878=head2 Cached hash values
e50bb9a1 879
722d2a37 880We have shared hash keys, which perform the same job.
e50bb9a1 881
722d2a37 882=head2 Add compression modules
e50bb9a1 883
722d2a37 884The compression modules are a little heavy; meanwhile, Nick Clark is
885working on experimental pragmata to do transparent decompression on
886input.
e50bb9a1 887
722d2a37 888=head2 Reorganise documentation into tutorials/references
e50bb9a1 889
722d2a37 890Could not get consensus on P5P about this.
e50bb9a1 891
722d2a37 892=head2 Remove distinction between functions and operators
893
894Caution: highly flammable.
895
896=head2 Make XS easier to use
e50bb9a1 897
722d2a37 898Use C<Inline> instead, or SWIG.
e50bb9a1 899
722d2a37 900=head2 Make embedding easier to use
e50bb9a1 901
722d2a37 902Use C<Inline::CPR>.
e50bb9a1 903
722d2a37 904=head2 man for perl
04c70446 905
1577cd80 906See the Perl Power Tools. ( http://language.perl.com/ppt/ )
04c70446 907
722d2a37 908=head2 my $Package::variable
04c70446 909
722d2a37 910Use C<our> instead.
04c70446 911
722d2a37 912=head2 "or" tests defined, not truth
04c70446 913
722d2a37 914Suggesting this on P5P B<will> cause a boring and interminable flamewar.
04c70446 915
722d2a37 916=head2 "class"-based lexicals
04c70446 917
cbb3fa72 918Use flyweight objects, secure hashes or, dare I say it, pseudo-hashes instead.
f86a8bc5 919(Or whatever will replace pseudohashes in 5.10.)
04c70446 920
722d2a37 921=head2 byteperl
04c70446 922
722d2a37 923C<ByteLoader> covers this.
04c70446 924
722d2a37 925=head2 Lazy evaluation / tail recursion removal
04c70446 926
f86a8bc5 927C<List::Util> gives first() (a short-circuiting grep); tail recursion
928removal is done manually, with C<goto &whoami;>. (However, MJD has
929found that C<goto &whoami> introduces a performance penalty, so maybe
930there should be a way to do this after all: C<sub foo {START: ... goto
931START;> is better.)
0562c0e3 932
933=head2 Make "use utf8" the default
934
f86a8bc5 935Because of backward compatibility this is difficult: scripts could not
936contain B<any legacy eight-bit data> (like Latin-1) anymore, even in
937string literals or pod. Also would introduce a measurable slowdown of
938at least few percentages since all regular expression operations would
939be done in full UTF-8. But if you want to try this, add
940-DUSE_UTF8_SCRIPTS to your compilation flags.
941
3298bd4d 942=head2 Unicode collation and normalization
943
944The Unicode::Collate and Unicode::Normalize modules
945by SADAHIRO Tomoyuki have been included since 5.8.0.
946
947 Collation? http://www.unicode.org/unicode/reports/tr10/
948 Normalization? http://www.unicode.org/unicode/reports/tr15/
0562c0e3 949
1626a787 950=head2 pack/unpack tutorial
951
952Wolfgang Laun finished what Simon Cozens started.
953
3298bd4d 954=cut