Commit | Line | Data |
7711098a |
1 | =head1 NAME |
2 | |
3 | perltodo - Perl TO-DO List |
4 | |
5 | =head1 DESCRIPTION |
e50bb9a1 |
6 | |
722d2a37 |
7 | This is a list of wishes for Perl. Send updates to |
e50bb9a1 |
8 | I<perl5-porters@perl.org>. If you want to work on any of these |
9 | projects, be sure to check the perl5-porters archives for past ideas, |
10 | flames, and propaganda. This will save you time and also prevent you |
11 | from implementing something that Larry has already vetoed. One set |
12 | of archives may be found at: |
13 | |
14 | http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/ |
15 | |
722d2a37 |
16 | =head1 To do during 5.6.x |
e50bb9a1 |
17 | |
722d2a37 |
18 | =head2 Support for I/O disciplines |
e50bb9a1 |
19 | |
722d2a37 |
20 | C<perlio> provides this, but the interface could be a lot more |
21 | straightforward. |
e50bb9a1 |
22 | |
4b3b956a |
23 | =head2 Autoload bytes.pm |
e50bb9a1 |
24 | |
4b3b956a |
25 | When the lexer sees, for instance, C<bytes::length>, it should |
26 | automatically load the C<bytes> pragma. |
27 | |
28 | =head2 Make "\u{XXXX}" et al work |
29 | |
30 | Danger, Will Robinson! Discussing the semantics of C<"\x{F00}">, |
31 | C<"\xF00"> and C<"\U{F00}"> on P5P I<will> lead to a long and boring |
32 | flamewar. |
e50bb9a1 |
33 | |
c6287c21 |
34 | =head2 Create a char *sv_pvprintify(sv, STRLEN *lenp, UV flags) |
0562c0e3 |
35 | |
36 | For displaying PVs with control characters, embedded nulls, and Unicode. |
37 | This would be useful for printing warnings, or data and regex dumping, |
38 | not_a_number(), and so on. |
39 | |
f35392ae |
40 | Requirements: should handle both byte and UTF8 strings. isPRINT() |
41 | characters printed as-is, characters less than 256 as \xHH, Unicode |
0661e9a4 |
42 | characters as \x{HHH}. Don't assume ASCII-like, either, get somebody |
43 | on EBCDIC to test the output. |
f35392ae |
44 | |
45 | Possible options, controlled by the flags: |
0661e9a4 |
46 | - whitespace (other than ' ' of isPRINT()) printed as-is |
f35392ae |
47 | - use isPRINT_LC() instead of isPRINT() |
48 | - print control characters like this: "\cA" |
49 | - print control characters like this: "^A" |
0661e9a4 |
50 | - non-PRINTables printed as '.' instead of \xHH |
51 | - use \OOO instead of \xHH |
52 | - use the C/Perl-metacharacters like \n, \t |
f35392ae |
53 | - have a maximum length for the produced string (read it from *lenp) |
54 | - append a "..." to the produced string if the maximum length is exceeded |
0661e9a4 |
55 | - really fancy: print unicode characters as \N{...} |
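The byte-string half of these requirements can be sketched in plain C. This is only an illustration, not the proposed Perl API: the function name C<printify_sketch>, its signature, and the two flag names are all hypothetical, UTF8 input is not handled, and isprint() is used in place of isPRINT().

```c
#include <ctype.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical flag bits; these names are not part of any Perl API. */
#define PRINTIFY_DOT_NONPRINT 0x01  /* print non-printables as '.' instead of \xHH */
#define PRINTIFY_ELLIPSIS     0x02  /* append "..." if the output had to be cut */

/*
 * Escape `len` bytes from `src` into `dst`, writing at most `max_out`
 * characters of escaped text. `dst` must have room for max_out + 4 bytes
 * (escaped text, an optional "...", and the terminating NUL).
 * Returns the number of characters written, not counting the NUL.
 */
size_t printify_sketch(const unsigned char *src, size_t len, unsigned flags,
                       char *dst, size_t max_out)
{
    size_t out = 0;
    size_t i;

    for (i = 0; i < len; i++) {
        char piece[5];               /* longest piece is "\xHH" plus NUL */
        size_t piece_len;

        if (isprint(src[i])) {
            piece[0] = (char)src[i]; /* printable: copied as-is */
            piece[1] = '\0';
        } else if (flags & PRINTIFY_DOT_NONPRINT) {
            piece[0] = '.';
            piece[1] = '\0';
        } else {
            snprintf(piece, sizeof piece, "\\x%02X", (unsigned)src[i]);
        }
        piece_len = strlen(piece);

        if (out + piece_len > max_out)
            break;                   /* the next piece would not fit */
        memcpy(dst + out, piece, piece_len);
        out += piece_len;
    }

    /* i < len means the loop stopped early because of max_out. */
    if (i < len && (flags & PRINTIFY_ELLIPSIS)) {
        memcpy(dst + out, "...", 3);
        out += 3;
    }
    dst[out] = '\0';
    return out;
}
```

Note that a literal backslash is printable and so is copied unchanged, which makes the output ambiguous; a real implementation would probably want to escape it as well.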
f35392ae |
56 | |
722d2a37 |
57 | =head2 Overloadable regex assertions |
e50bb9a1 |
58 | |
722d2a37 |
59 | This may or may not be possible with the current regular expression |
60 | engine. The idea is that, for instance, C<\b> needs to be |
61 | algorithmically computed if you're dealing with Thai text. Hence, the |
62 | B<\b> assertion wants to be overloaded by a function. |
e50bb9a1 |
63 | |
776f8809 |
64 | =head2 Unicode |
65 | |
66 | =over 4 |
67 | |
68 | =item * |
e50bb9a1 |
69 | |
f34dec15 |
70 | Allow for the long form of the General Category Properties, e.g. |
71 | C<\p{IsOpenPunctuation}>, not just the abbreviated form, e.g. |
72 | C<\p{IsPs}>. |
73 | |
74 | =item * |
75 | |
1ac13f9a |
76 | Allow for the metaproperties: C<XID Start>, C<XID Continue>, |
77 | C<NF*_NO>, C<NF*_MAYBE> (require the DerivedCoreProperties and |
78 | DerivedNormalizationProperties files). |
f34dec15 |
79 | |
71d929cb |
80 | There are also multiple value properties still unimplemented: |
81 | C<Numeric Type>, C<East Asian Width>. |
f34dec15 |
82 | |
83 | =item * |
84 | |
722d2a37 |
85 | Case Mappings? http://www.unicode.org/unicode/reports/tr21/ |
e50bb9a1 |
86 | |
6f16a292 |
87 | lc(), uc(), lcfirst(), and ucfirst() work only for some of the |
88 | simplest cases, where the mapping goes from a single Unicode character |
89 | to another single Unicode character. See lib/unicore/SpecCase.txt |
90 | (and CaseFold.txt). |
ac1256e8 |
91 | |
776f8809 |
92 | =item * |
e50bb9a1 |
93 | |
c6287c21 |
94 | The Unicode regular expression guidelines have some tricks Perl doesn't yet implement, like character |
95 | class subtraction. |
e50bb9a1 |
96 | |
722d2a37 |
97 | http://www.unicode.org/unicode/reports/tr18/ |
e50bb9a1 |
98 | |
776f8809 |
99 | =back |
100 | |
101 | See L<perlunicode/UNICODE REGULAR EXPRESSION SUPPORT LEVEL> for what's |
f34dec15 |
102 | there and what's missing. Almost all of Levels 2 and 3 is missing, |
103 | and as of 5.8.0 not even all of Level 1 is there. |
776f8809 |
104 | |
722d2a37 |
105 | =head2 use Thread for iThreads |
e50bb9a1 |
106 | |
722d2a37 |
107 | Artur Bergman's C<iThreads> module is a start on this, but needs to |
108 | be more mature. |
e50bb9a1 |
109 | |
dd0afe54 |
110 | =head2 make perl_clone optionally clone ops |
111 | |
112 | So that pseudoforking, mod_perl, iThreads and nvi will work properly |
113 | (but not as efficiently) until the regex engine is fixed to be threadsafe. |
114 | |
722d2a37 |
115 | =head2 Work out exit/die semantics for threads |
e50bb9a1 |
116 | |
722d2a37 |
117 | =head2 Typed lexicals for compiler |
e50bb9a1 |
118 | |
722d2a37 |
119 | =head2 Compiler workarounds for Win32 |
e50bb9a1 |
120 | |
722d2a37 |
121 | =head2 AUTOLOADing in the compiler |
e50bb9a1 |
122 | |
722d2a37 |
123 | =head2 Fixing comppadlist when compiling |
e50bb9a1 |
124 | |
722d2a37 |
125 | =head2 Cleaning up exported namespace |
e50bb9a1 |
126 | |
722d2a37 |
127 | =head2 Complete signal handling |
e50bb9a1 |
128 | |
722d2a37 |
129 | Add C<PERL_ASYNC_CHECK> to opcodes which loop; replace C<sigsetjmp> with |
130 | C<sigjmp>; check C<wait> for signal safety. |
e50bb9a1 |
131 | |
722d2a37 |
132 | =head2 Out-of-source builds |
e50bb9a1 |
133 | |
722d2a37 |
134 | This was done for 5.6.0, but needs reworking for 5.7.x. |
e50bb9a1 |
135 | |
722d2a37 |
136 | =head2 POSIX realtime support |
e50bb9a1 |
137 | |
722d2a37 |
138 | POSIX 1003.1 1996 Edition support--realtime stuff: POSIX semaphores, |
139 | message queues, shared memory, realtime clocks, timers, signals (the |
140 | metaconfig units mostly already exist for these) |
e50bb9a1 |
141 | |
722d2a37 |
142 | =head2 UNIX98 support |
e50bb9a1 |
143 | |
722d2a37 |
144 | Reader-writer locks, realtime/asynchronous IO |
e50bb9a1 |
145 | |
722d2a37 |
146 | =head2 IPv6 Support |
e50bb9a1 |
147 | |
722d2a37 |
148 | There are non-core modules, such as C<Net::IPv6>, but these will need |
149 | integrating when IPv6 actually starts to really happen. See RFC 2292 |
150 | and RFC 2553. |
e50bb9a1 |
151 | |
722d2a37 |
152 | =head2 Long double conversion |
e50bb9a1 |
153 | |
722d2a37 |
154 | Floating point formatting is still causing some weird test failures. |
e50bb9a1 |
155 | |
722d2a37 |
156 | =head2 Locales |
e50bb9a1 |
157 | |
722d2a37 |
158 | Locales and Unicode interact with each other in unpleasant ways. |
159 | One possible solution would be to adopt/support ICU: |
e50bb9a1 |
160 | |
722d2a37 |
161 | http://oss.software.ibm.com/developerworks/opensource/icu/project/ |
e50bb9a1 |
162 | |
722d2a37 |
163 | =head2 Thread-safe regexes |
e50bb9a1 |
164 | |
722d2a37 |
165 | The regular expression engine is currently non-threadsafe. |
e50bb9a1 |
166 | |
722d2a37 |
167 | =head2 Arithmetic on non-Arabic numerals |
e50bb9a1 |
168 | |
722d2a37 |
169 | C<[1234567890]> aren't the only numerals any more. |
e50bb9a1 |
170 | |
722d2a37 |
171 | =head2 POSIX Unicode character classes |
e50bb9a1 |
172 | |
722d2a37 |
173 | ([=a=] for equivalence classes, [.ch.] for collation.) |
174 | These are dependent on Unicode normalization and collation. |
e50bb9a1 |
175 | |
722d2a37 |
176 | =head2 Factoring out common suffixes/prefixes in regexps (trie optimization) |
c47ff5f1 |
177 | |
722d2a37 |
178 | Currently, the user has to optimize C<foo|far> and C<foo|goo> into |
179 | C<f(?:oo|ar)> and C<[fg]oo> by hand; this could be done automatically. |
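The prefix case can be sketched in C for two literal alternatives. This is a toy under stated assumptions, not the regex engine's optimizer: C<factor_prefix> is a hypothetical name, the alternatives must contain no regex metacharacters, and common suffixes (the C<[fg]oo> case) are left alone.

```c
#include <stdio.h>
#include <string.h>

/*
 * Rewrite the alternation a|b as prefix(?:rest_a|rest_b), where prefix is
 * the longest common leading substring of the two literal alternatives.
 * If they share no prefix, the plain alternation a|b is written instead.
 * Returns 0 on success, -1 if `out` is too small.
 */
int factor_prefix(const char *a, const char *b, char *out, size_t outsize)
{
    size_t p = 0;
    int n;

    /* Count the leading characters that both alternatives share. */
    while (a[p] != '\0' && a[p] == b[p])
        p++;

    if (p == 0)
        n = snprintf(out, outsize, "%s|%s", a, b);
    else
        n = snprintf(out, outsize, "%.*s(?:%s|%s)", (int)p, a, a + p, b + p);

    return (n < 0 || (size_t)n >= outsize) ? -1 : 0;
}
```

Because the order of the alternatives is preserved inside the group, the factored pattern tries them in the same order as the original.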
e50bb9a1 |
180 | |
722d2a37 |
181 | =head2 Security audit shipped utilities |
e50bb9a1 |
182 | |
722d2a37 |
183 | All the code we ship with Perl needs to be sensible about temporary file |
184 | handling, locking, input validation, and so on. |
e50bb9a1 |
185 | |
722d2a37 |
186 | =head2 Custom opcodes |
e50bb9a1 |
187 | |
722d2a37 |
188 | Have a way to introduce user-defined opcodes without the subroutine call |
189 | overhead of an XSUB; the user should be able to create PP code. Simon |
190 | Cozens has some ideas on this. |
e50bb9a1 |
191 | |
722d2a37 |
192 | =head2 DLL Versioning |
e50bb9a1 |
193 | |
722d2a37 |
194 | Windows needs a way to know what version of an XS or C<libperl> DLL it's |
195 | loading. |
e50bb9a1 |
196 | |
722d2a37 |
197 | =head2 Introduce @( and @) |
e50bb9a1 |
198 | |
722d2a37 |
199 | C<$(> may return "foo bar baz". Unfortunately, since groups can |
200 | theoretically have spaces in their names, this could be one, two or |
201 | three groups. |
e50bb9a1 |
202 | |
722d2a37 |
203 | =head2 Floating point handling |
e50bb9a1 |
204 | |
722d2a37 |
205 | C<NaN> and C<inf> support is particularly troublesome. |
206 | (fp_classify(), fp_class(), fp_class_d(), class(), isinf(), |
207 | isfinite(), finite(), isnormal(), unordered(), <ieeefp.h>, |
208 | <fp_class.h> (there are metaconfig units for all these) (I think), |
209 | fp_setmask(), fp_getmask(), fp_setround(), fp_getround() |
210 | (no metaconfig units yet for these). Don't forget finitel(), fp_classl(), |
211 | fp_class_l(), (yes, both do, unfortunately, exist), and unorderedl().) |
e50bb9a1 |
212 | |
722d2a37 |
213 | As of Perl 5.6.1, there is a Perl macro, Perl_isnan(). |
e50bb9a1 |
214 | |
722d2a37 |
215 | =head2 IV/UV preservation |
e50bb9a1 |
216 | |
722d2a37 |
217 | Nicholas Clark has done a lot of work on this, but work is continuing. |
218 | C<+>, C<-> and C<*> work, but guards need to be in place for C<%>, C</>, |
219 | C<&>, C<oct>, C<hex> and C<pack>. |
e50bb9a1 |
220 | |
722d2a37 |
221 | =head2 Replace pod2html with something using Pod::Parser |
83df6a1d |
222 | |
722d2a37 |
223 | The CPAN module C<Malik::Pod::Html> may be a more suitable basis for a |
224 | C<pod2html> convertor; the current one duplicates the functionality |
225 | abstracted in C<Pod::Parser>, which makes updating the POD language |
226 | difficult. |
e50bb9a1 |
227 | |
722d2a37 |
228 | =head2 Automate module testing on CPAN |
e50bb9a1 |
229 | |
722d2a37 |
230 | When a new Perl is being beta tested, porters have to manually grab |
231 | their favourite CPAN modules and test them - this should be done |
232 | automatically. |
e50bb9a1 |
233 | |
722d2a37 |
234 | =head2 sendmsg and recvmsg |
83df6a1d |
235 | |
722d2a37 |
236 | We have all the other BSD socket functions but these. There are |
237 | metaconfig units for these functions which can be added. To avoid these |
238 | being new opcodes, a solution similar to the way C<sockatmark> was added |
239 | would be preferable. (Autoload the C<IO::whatever> module.) |
e50bb9a1 |
240 | |
722d2a37 |
241 | =head2 Rewrite perlre documentation |
e50bb9a1 |
242 | |
722d2a37 |
243 | The new-style patterns need full documentation, and the whole document |
244 | needs to be a lot clearer. |
e50bb9a1 |
245 | |
722d2a37 |
246 | =head2 Convert example code to IO::Handle filehandles |
e50bb9a1 |
247 | |
722d2a37 |
248 | =head2 Document Win32 choices |
e50bb9a1 |
249 | |
722d2a37 |
250 | =head2 Check new modules |
e50bb9a1 |
251 | |
722d2a37 |
252 | =head2 Make roffitall find pods and libs itself |
e50bb9a1 |
253 | |
722d2a37 |
254 | Simon Cozens has done some work on this but it needs a rethink. |
e50bb9a1 |
255 | |
722d2a37 |
256 | =head1 To do at some point |
e50bb9a1 |
257 | |
722d2a37 |
258 | These are ideas that have been regularly tossed around, that most |
259 | people believe should be done maybe during 5.8.x |
e50bb9a1 |
260 | |
722d2a37 |
261 | =head2 Remove regular expression recursion |
e50bb9a1 |
262 | |
722d2a37 |
263 | Because the regular expression engine is recursive, badly designed |
264 | expressions can lead to lots of recursion filling up the stack. Ilya |
265 | claims that it is easy to convert the engine to being iterative, but |
266 | this has still not yet been done. There may be a regular expression |
267 | engine hit squad meeting at TPC5. |
e50bb9a1 |
268 | |
722d2a37 |
269 | =head2 Memory leaks after failed eval |
e50bb9a1 |
270 | |
722d2a37 |
271 | Perl will leak memory if you C<eval "hlagh hlagh hlagh hlagh">. This is |
272 | partially because it attempts to build up an op tree for that code and |
273 | doesn't properly free it. The same goes for non-syntactically-correct |
274 | regular expressions. Hugo looked into this, but decided it needed a |
275 | mark-and-sweep GC implementation. |
e50bb9a1 |
276 | |
722d2a37 |
277 | Alan notes that: The basic idea was to extend the parser token stack |
278 | (C<YYSTYPE>) to include a type field so we knew what sort of thing each |
279 | element of the stack was. The F<perly.c> code would then have to be |
280 | postprocessed to record the type of each entry on the stack as it was |
281 | created, and the parser patched so that it could unroll the stack |
282 | properly on error. |
e50bb9a1 |
283 | |
722d2a37 |
284 | This is possible to do, but would be pretty messy to implement, as it |
285 | would rely on even more sed hackery in F<perly.fixer>. |
e50bb9a1 |
286 | |
722d2a37 |
287 | =head2 pack "(stuff)*" |
e50bb9a1 |
288 | |
722d2a37 |
289 | That's to say, C<pack "(sI)40"> would be the same as C<pack "sI"x40> |
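One way to picture the idea is as a template pre-pass that expands each group before the usual template handling runs. The C sketch below is an assumption about how that might look, not how pack is implemented: C<expand_groups> is a hypothetical name, groups may not nest, a group with no count is repeated once, and a C<*> count is not handled.

```c
#include <ctype.h>
#include <stddef.h>

/*
 * Expand one level of "(group)count" in a pack-style template, so that
 * "(sI)3" becomes "sIsIsI". Text outside groups is copied unchanged.
 * Returns 0 on success, -1 on unbalanced or nested parentheses or if
 * `out` is too small.
 */
int expand_groups(const char *tmpl, char *out, size_t outsize)
{
    size_t o = 0;

    if (outsize == 0)
        return -1;

    while (*tmpl != '\0') {
        if (*tmpl == '(') {
            const char *start = tmpl + 1;
            const char *end = start;
            size_t count = 0, r, k;

            /* Find the closing parenthesis of this group. */
            while (*end != '\0' && *end != ')' && *end != '(')
                end++;
            if (*end != ')')
                return -1;           /* unbalanced or nested group */

            /* Read the optional decimal repeat count after ')'. */
            tmpl = end + 1;
            if (!isdigit((unsigned char)*tmpl)) {
                count = 1;
            } else {
                while (isdigit((unsigned char)*tmpl)) {
                    count = count * 10 + (size_t)(*tmpl - '0');
                    tmpl++;
                }
            }

            /* Emit the group body `count` times. */
            for (r = 0; r < count; r++) {
                for (k = 0; k < (size_t)(end - start); k++) {
                    if (o + 1 >= outsize)
                        return -1;
                    out[o++] = start[k];
                }
            }
        } else if (*tmpl == ')') {
            return -1;               /* ')' without a matching '(' */
        } else {
            if (o + 1 >= outsize)
                return -1;
            out[o++] = *tmpl++;
        }
    }
    out[o] = '\0';
    return 0;
}
```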
e50bb9a1 |
290 | |
722d2a37 |
291 | =head2 bitfields in pack |
e50bb9a1 |
292 | |
722d2a37 |
293 | =head2 Cross compilation |
e50bb9a1 |
294 | |
722d2a37 |
295 | Make Perl buildable with a cross-compiler. This will play havoc with |
296 | Configure, which needs to know how the target system will respond to |
297 | its tests; maybe C<microperl> will be a good starting point here. |
298 | (Indeed, Bart Schuller reports that he compiled up C<microperl> for |
299 | the Agenda PDA and it works fine.) A really big spanner in the works |
300 | is the bootstrapping build process of Perl: if the filesystem the |
301 | target system sees is not the same as what the build host sees, various |
302 | input, output, and (Perl) library files need to be copied back and forth. |
e50bb9a1 |
303 | |
f86a8bc5 |
304 | As of 5.8.0 Configure mostly works for cross-compilation |
305 | (used successfully for iPAQ Linux), miniperl gets built, |
306 | but then building DynaLoader (and other extensions) fails |
307 | since MakeMaker knows nothing of cross-compilation. |
308 | (See INSTALL/Cross-compilation for the state of things.) |
309 | |
722d2a37 |
310 | =head2 Perl preprocessor / macros |
e50bb9a1 |
311 | |
722d2a37 |
312 | Source filters help with this, but do not get us all the way. For |
313 | instance, it should be possible to implement the C<??> operator somehow; |
314 | source filters don't (quite) cut it. |
e50bb9a1 |
315 | |
722d2a37 |
316 | =head2 Perl lexer in Perl |
a45bd81d |
317 | |
722d2a37 |
318 | Damian Conway is planning to work on this, but it hasn't happened yet. |
e50bb9a1 |
319 | |
722d2a37 |
320 | =head2 Using POSIX calls internally |
e50bb9a1 |
321 | |
722d2a37 |
322 | When faced with a BSD vs. SysV-style interface to some library or |
323 | system function, perl's roots show in that it typically prefers the BSD |
324 | interface (but falls back to the SysV one). One example is getpgrp(). |
325 | Other examples include C<memcpy> vs. C<bcopy>. There are others, mostly in |
326 | F<pp_sys.c>. |
e50bb9a1 |
327 | |
722d2a37 |
328 | Mostly, this item is a suggestion for which way to start a journey into |
329 | an C<#ifdef> forest. It is not primarily a suggestion to eliminate any of |
330 | the C<#ifdef> forests. |
e50bb9a1 |
331 | |
722d2a37 |
332 | POSIX calls are perhaps more likely to be portable to unexpected |
333 | architectures. They are also perhaps more likely to be actively |
334 | maintained by a current vendor. They are also perhaps more likely to be |
335 | available in thread-safe versions, if appropriate. |
e50bb9a1 |
336 | |
722d2a37 |
337 | =head2 -i rename file when changed |
e50bb9a1 |
338 | |
722d2a37 |
339 | It's only necessary to rename a file when inplace editing when the file |
340 | has changed. Detecting a change is perhaps the difficult bit. |
e50bb9a1 |
341 | |
722d2a37 |
342 | =head2 All ARGV input should act like E<lt>E<gt> |
e50bb9a1 |
343 | |
2d84a16a |
344 | E.g., C<read(ARGV, ...)> doesn't currently read across multiple files. |
345 | |
722d2a37 |
346 | =head2 Support for rerunning debugger |
e50bb9a1 |
347 | |
722d2a37 |
348 | There should be a way of restarting the debugger on demand. |
e50bb9a1 |
349 | |
c6287c21 |
350 | =head2 Test Suite for the Debugger |
351 | |
352 | The debugger is a complex piece of software and fixing something |
353 | here may inadvertently break something else over there. To tame |
354 | this chaotic behaviour, a test suite is necessary. |
355 | |
722d2a37 |
356 | =head2 my sub foo { } |
c47ff5f1 |
357 | |
722d2a37 |
358 | The basic principle is sound, but there are problems with the semantics |
359 | of self-referential and mutually referential lexical subs: how to |
360 | declare the subs? |
c47ff5f1 |
361 | |
722d2a37 |
362 | =head2 One-pass global destruction |
c47ff5f1 |
363 | |
722d2a37 |
364 | Sweeping away all the allocated memory in one go is a laudable goal, but |
365 | it's difficult and in most cases, it's easier to let the memory get |
366 | freed by exiting. |
e50bb9a1 |
367 | |
722d2a37 |
368 | =head2 Rewrite regexp parser |
e50bb9a1 |
369 | |
722d2a37 |
370 | There has been talk recently of rewriting the regular expression parser |
371 | to produce an optree instead of a chain of opcodes; it's unclear whether |
372 | or not this would be a win. |
e50bb9a1 |
373 | |
722d2a37 |
374 | =head2 Cache recently used regexps |
e50bb9a1 |
375 | |
722d2a37 |
376 | This is to speed up |
e50bb9a1 |
377 | |
722d2a37 |
378 | for my $re (@regexps) { |
379 | $matched++ if /$re/ |
380 | } |
e50bb9a1 |
381 | |
722d2a37 |
382 | C<qr//> already gives us a way of saving compiled regexps, but it should |
383 | be done automatically. |
e50bb9a1 |
384 | |
722d2a37 |
385 | =head2 Re-entrant functions |
e50bb9a1 |
386 | |
722d2a37 |
387 | Add configure probes for C<_r> forms of system calls and fit them to the |
388 | core. Unfortunately, calling conventions for these functions are not |
389 | standardised. |
04c70446 |
390 | |
722d2a37 |
391 | =head2 Cross-compilation support |
04c70446 |
392 | |
722d2a37 |
393 | Bart Schuller reports that using C<microperl> and a cross-compiler, he |
394 | got Perl working on the Agenda PDA. However, one cannot build a full |
395 | Perl because Configure needs test results for the target platform, |
396 | but it can only run its tests on the build host. |
e50bb9a1 |
397 | |
722d2a37 |
398 | =head2 Bit-shifting bitvectors |
e50bb9a1 |
399 | |
722d2a37 |
400 | Given: |
e50bb9a1 |
401 | |
722d2a37 |
402 | vec($v, 1000, 1) = 1; |
e50bb9a1 |
403 | |
722d2a37 |
404 | One should be able to do |
e50bb9a1 |
405 | |
722d2a37 |
406 | $v <<= 1; |
e50bb9a1 |
407 | |
722d2a37 |
408 | and have the 999'th bit set. |
e50bb9a1 |
409 | |
722d2a37 |
410 | Currently, if you try to shift a bitvector, you shift the NV/UV instead |
411 | of the bits in the PV. Not very logical. |
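What shifting the bits in the PV would mean can be sketched on a raw byte buffer in C. This is an illustration only: the helper names are hypothetical, the direction follows the example above (the bit at index 1000 ends up at index 999), and the layout assumes vec()-style numbering for 1-bit elements, with bit i stored in byte i/8 under the mask C<<< 1 << (i % 8) >>>.

```c
#include <stddef.h>

/* Test bit `i`, using vec()-style numbering for 1-bit elements:
 * bit i lives in byte i/8 under the mask 1 << (i % 8). */
int bit_is_set(const unsigned char *buf, size_t i)
{
    return (buf[i / 8] >> (i % 8)) & 1;
}

/* Move every bit from index i+1 to index i across the whole buffer.
 * The old bit 0 is dropped and the highest bit becomes 0. */
void bitvec_shift_down(unsigned char *buf, size_t len)
{
    size_t j;

    for (j = 0; j < len; j++) {
        /* The lowest bit of the next byte becomes the highest bit here. */
        unsigned char carry =
            (j + 1 < len) ? (unsigned char)(buf[j + 1] & 1u) : 0;
        buf[j] = (unsigned char)((buf[j] >> 1) | (carry << 7));
    }
}
```

The loop walks upward through the buffer so that each byte reads its carry from a neighbour that has not been shifted yet.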
e50bb9a1 |
412 | |
722d2a37 |
413 | =head2 debugger pragma |
e50bb9a1 |
414 | |
722d2a37 |
415 | The debugger is implemented in Perl in F<perl5db.pl>; turning it into a |
416 | pragma should be easy, but making it work lexically might be more |
417 | difficult. Fiddling with C<$^P> would be necessary. |
e50bb9a1 |
418 | |
722d2a37 |
419 | =head2 use less pragma |
e50bb9a1 |
420 | |
722d2a37 |
421 | Identify areas where speed/memory tradeoffs can be made and have a hint |
422 | to switch between them. |
e50bb9a1 |
423 | |
722d2a37 |
424 | =head2 switch structures |
e50bb9a1 |
425 | |
722d2a37 |
426 | Although we have C<Switch.pm> in core, Larry points to the dormant |
427 | C<nswitch> and C<cswitch> ops in F<pp.c>; using these opcodes would be |
428 | much faster. |
e50bb9a1 |
429 | |
722d2a37 |
430 | =head2 Cache eval tree |
e50bb9a1 |
431 | |
722d2a37 |
432 | =head2 rcatmaybe |
e50bb9a1 |
433 | |
722d2a37 |
434 | =head2 Shrink opcode tables |
e50bb9a1 |
435 | |
722d2a37 |
436 | =head2 Optimize away @_ |
e50bb9a1 |
437 | |
722d2a37 |
438 | Look at the "reification" code in C<av.c> |
e50bb9a1 |
439 | |
722d2a37 |
440 | =head2 Prototypes versus indirect objects |
e50bb9a1 |
441 | |
722d2a37 |
442 | Currently, indirect object syntax bypasses prototype checks. |
e50bb9a1 |
443 | |
722d2a37 |
444 | =head2 Install HTML |
e50bb9a1 |
445 | |
722d2a37 |
446 | HTML versions of the documentation need to be installed by default; a |
447 | call to C<installhtml> from C<installperl> may be all that's necessary. |
e50bb9a1 |
448 | |
722d2a37 |
449 | =head2 Prototype method calls |
e50bb9a1 |
450 | |
722d2a37 |
451 | =head2 Return context prototype declarations |
e50bb9a1 |
452 | |
722d2a37 |
453 | =head2 magic_setisa |
e50bb9a1 |
454 | |
722d2a37 |
455 | =head2 Garbage collection |
e50bb9a1 |
456 | |
722d2a37 |
457 | There have been persistent mumblings about putting a mark-and-sweep |
458 | garbage detector into Perl; Alan Burlison has some ideas about this. |
e50bb9a1 |
459 | |
722d2a37 |
460 | =head2 IO tutorial |
e50bb9a1 |
461 | |
722d2a37 |
462 | Mark-Jason Dominus has the beginnings of one of these. |
e50bb9a1 |
463 | |
722d2a37 |
464 | =head2 pack/unpack tutorial |
e50bb9a1 |
465 | |
722d2a37 |
466 | Simon Cozens has the beginnings of one of these. |
e50bb9a1 |
467 | |
722d2a37 |
468 | =head2 Rewrite perldoc |
e50bb9a1 |
469 | |
722d2a37 |
470 | There are a few suggestions for what to do with C<perldoc>: maybe a |
471 | full-text search, an index function, locating pages on a particular |
472 | high-level subject, and so on. |
e50bb9a1 |
473 | |
3958b146 |
474 | =head2 Install .3p manpages |
e50bb9a1 |
475 | |
3958b146 |
476 | This is a bone of contention; we can create C<.3p> manpages for each |
722d2a37 |
477 | built-in function, but should we install them by default? Tcl does this, |
478 | and it clutters up C<apropos>. |
e50bb9a1 |
479 | |
722d2a37 |
480 | =head2 Unicode tutorial |
e50bb9a1 |
481 | |
722d2a37 |
482 | Simon Cozens promises to do this before he gets old. |
e50bb9a1 |
483 | |
722d2a37 |
484 | =head2 Update POSIX.pm for 1003.1-2 |
3958b146 |
485 | |
722d2a37 |
486 | =head2 Retargetable installation |
e50bb9a1 |
487 | |
722d2a37 |
488 | Allow C<@INC> to be changed after Perl is built. |
e50bb9a1 |
489 | |
722d2a37 |
490 | =head2 POSIX emulation on non-POSIX systems |
e50bb9a1 |
491 | |
722d2a37 |
492 | Make C<POSIX.pm> behave as POSIXly as possible everywhere, meaning we |
493 | have to implement POSIX equivalents for some functions if necessary. |
e50bb9a1 |
494 | |
722d2a37 |
495 | =head2 Rename Win32 headers |
e50bb9a1 |
496 | |
722d2a37 |
497 | =head2 Finish off lvalue functions |
498 | |
499 | They don't work in the debugger, and they don't work for list or hash |
500 | slices. |
e50bb9a1 |
501 | |
722d2a37 |
502 | =head2 Update sprintf documentation |
e50bb9a1 |
503 | |
722d2a37 |
504 | Hugo van der Sanden plans to look at this. |
e50bb9a1 |
505 | |
722d2a37 |
506 | =head2 Use fchown/fchmod internally |
e50bb9a1 |
507 | |
722d2a37 |
508 | This has been done in places, but needs a thorough code review. |
509 | Also fchdir is available in some platforms. |
e50bb9a1 |
510 | |
722d2a37 |
511 | =head1 Vague ideas |
e50bb9a1 |
512 | |
722d2a37 |
513 | Ideas which have been discussed, and which may or may not happen. |
e50bb9a1 |
514 | |
722d2a37 |
515 | =head2 ref() in list context |
e50bb9a1 |
516 | |
722d2a37 |
517 | It's unclear what this should do or how to do it without breaking old |
518 | code. |
e50bb9a1 |
519 | |
f86a8bc5 |
520 | =head2 Make tr/// return histogram of characters in list context |
e50bb9a1 |
521 | |
722d2a37 |
522 | There is a patch for this, but it may require Unicodification. |
e50bb9a1 |
523 | |
722d2a37 |
524 | =head2 Compile to real threaded code |
3958b146 |
525 | |
722d2a37 |
526 | =head2 Structured types |
3958b146 |
527 | |
722d2a37 |
528 | =head2 Modifiable $1 et al. |
e50bb9a1 |
529 | |
722d2a37 |
530 | ($x = "elephant") =~ /e(ph)/; |
531 | $1 = "g"; # $x = "elegant" |
e50bb9a1 |
532 | |
722d2a37 |
533 | What happens if there are multiple (nested?) brackets? What if the |
534 | string changes between the match and the assignment? |
e50bb9a1 |
535 | |
722d2a37 |
536 | =head2 Procedural interfaces for IO::*, etc. |
e50bb9a1 |
537 | |
722d2a37 |
538 | Some core modules have been accused of being overly-OO. Adding |
539 | procedural interfaces could demystify them. |
e50bb9a1 |
540 | |
722d2a37 |
541 | =head2 RPC modules |
e50bb9a1 |
542 | |
722d2a37 |
543 | =head2 Attach/detach debugger from running program |
e50bb9a1 |
544 | |
722d2a37 |
545 | With C<gdb>, you can attach the debugger to a running program if you |
546 | pass the process ID. It would be good to do this with the Perl debugger |
547 | on a running Perl program, although I'm not sure how it would be done. |
e50bb9a1 |
548 | |
722d2a37 |
549 | =head2 Alternative RE syntax module |
e50bb9a1 |
550 | |
722d2a37 |
551 | use Regex::Newbie; |
552 | $re = Regex::Newbie->new |
553 | ->start |
554 | ->match("foo") |
555 | ->repeat(Regex::Newbie->class("char"),3) |
556 | ->end; |
557 | /$re/; |
e50bb9a1 |
558 | |
722d2a37 |
559 | =head2 GUI::Native |
e50bb9a1 |
560 | |
722d2a37 |
561 | A non-core module that would use "native" GUI to create graphical |
562 | applications. |
e50bb9a1 |
563 | |
722d2a37 |
564 | =head2 foreach(reverse ...) |
e50bb9a1 |
565 | |
722d2a37 |
566 | Currently |
e50bb9a1 |
567 | |
722d2a37 |
568 | foreach (reverse @_) { ... } |
e50bb9a1 |
569 | |
722d2a37 |
570 | puts C<@_> on the stack, reverses it putting the reversed version on the |
571 | stack, then iterates forwards. Instead, it could be special-cased to put |
572 | C<@_> on the stack then iterate backwards. |
e50bb9a1 |
573 | |
722d2a37 |
574 | =head2 Constant function cache |
e50bb9a1 |
575 | |
722d2a37 |
576 | =head2 Approximate regular expression matching |
e50bb9a1 |
577 | |
722d2a37 |
578 | =head1 Ongoing |
e50bb9a1 |
579 | |
722d2a37 |
580 | These items B<always> need doing: |
e50bb9a1 |
581 | |
722d2a37 |
582 | =head2 Update guts documentation |
e50bb9a1 |
583 | |
722d2a37 |
584 | Simon Cozens tries to do this when possible, and contributions to the |
585 | C<perlapi> documentation are welcome. |
e50bb9a1 |
586 | |
722d2a37 |
587 | =head2 Add more tests |
e50bb9a1 |
588 | |
722d2a37 |
589 | Michael Schwern will donate $500 to Yet Another Society when all core |
590 | modules have tests. |
e50bb9a1 |
591 | |
722d2a37 |
592 | =head2 Update auxiliary tools |
e50bb9a1 |
593 | |
722d2a37 |
594 | The code we ship with Perl should look like good Perl 5. |
e50bb9a1 |
595 | |
722d2a37 |
596 | =head1 Recently done things |
e50bb9a1 |
597 | |
722d2a37 |
598 | These are things which have been on the todo lists in previous releases |
599 | but have recently been completed. |
e50bb9a1 |
600 | |
722d2a37 |
601 | =head2 Safe signal handling |
e50bb9a1 |
602 | |
722d2a37 |
603 | A new signal model went into 5.7.1 without much fanfare. Operations and |
604 | C<malloc>s are no longer interrupted by signals, which are handled |
605 | between opcodes. This means that C<PERL_ASYNC_CHECK> now actually does |
606 | something. However, there are still a few things that need to be done. |
e50bb9a1 |
607 | |
722d2a37 |
608 | =head2 Tie Modules |
e50bb9a1 |
609 | |
722d2a37 |
610 | Modules which implement arrays in terms of strings, substrings or files |
611 | can be found on the CPAN. |
e50bb9a1 |
612 | |
722d2a37 |
613 | =head2 gettimeofday |
e50bb9a1 |
614 | |
722d2a37 |
615 | C<Time::HiRes> has been integrated into the core. |
e50bb9a1 |
616 | |
722d2a37 |
617 | =head2 setitimer and getitimer |
e50bb9a1 |
618 | |
722d2a37 |
619 | Adding C<Time::HiRes> got us this too. |
e50bb9a1 |
620 | |
722d2a37 |
621 | =head2 Testing __DIE__ hook |
622 | |
623 | Tests have been added. |
624 | |
625 | =head2 CPP equivalent in Perl |
e50bb9a1 |
626 | |
722d2a37 |
627 | A C Yardley will probably have done this by the time you can read this. |
628 | This allows for a generalization of the C constant detection used in |
629 | building C<Errno.pm>. |
e50bb9a1 |
630 | |
722d2a37 |
631 | =head2 Explicit switch statements |
e50bb9a1 |
632 | |
722d2a37 |
633 | C<Switch.pm> has been integrated into the core to give you all manner of |
634 | C<switch...case> semantics. |
e50bb9a1 |
635 | |
722d2a37 |
636 | =head2 autocroak |
e50bb9a1 |
637 | |
722d2a37 |
638 | This is C<Fatal.pm>. |
e50bb9a1 |
639 | |
722d2a37 |
640 | =head2 UTF/EBCDIC |
e50bb9a1 |
641 | |
722d2a37 |
642 | Nick Ing-Simmons has made UTF-EBCDIC (UTR16) work with Perl. |
e50bb9a1 |
643 | |
722d2a37 |
644 | EBCDIC? http://www.unicode.org/unicode/reports/tr16/ |
e50bb9a1 |
645 | |
722d2a37 |
646 | =head2 UTF Regexes |
e50bb9a1 |
647 | |
722d2a37 |
648 | Although there are probably some small bugs to be rooted out, Jarkko |
649 | Hietaniemi has made regular expressions polymorphic between bytes and |
650 | characters. |
e50bb9a1 |
651 | |
722d2a37 |
652 | =head2 perlcc to produce executable |
e50bb9a1 |
653 | |
722d2a37 |
654 | C<perlcc> was recently rewritten, and can now produce standalone |
655 | executables. |
e50bb9a1 |
656 | |
722d2a37 |
657 | =head2 END blocks saved in compiled output |
e50bb9a1 |
658 | |
722d2a37 |
659 | =head2 Secure temporary file module |
e50bb9a1 |
660 | |
722d2a37 |
661 | Tim Jenness' C<File::Temp> is now in core. |
e50bb9a1 |
662 | |
722d2a37 |
663 | =head2 Integrate Time::HiRes |
e50bb9a1 |
664 | |
722d2a37 |
665 | This module is now part of core. |
e50bb9a1 |
666 | |
722d2a37 |
667 | =head2 Turn Cwd into XS |
e50bb9a1 |
668 | |
722d2a37 |
669 | Benjamin Sugars has done this. |
e50bb9a1 |
670 | |
722d2a37 |
671 | =head2 Mmap for input |
e50bb9a1 |
672 | |
722d2a37 |
673 | Nick Ing-Simmons' C<perlio> supports an C<mmap> IO method. |
e50bb9a1 |
674 | |
722d2a37 |
675 | =head2 Byte to/from UTF8 and UTF8 to/from local conversion |
e50bb9a1 |
676 | |
722d2a37 |
677 | C<Encode> provides this. |
e50bb9a1 |
678 | |
722d2a37 |
679 | =head2 Add sockatmark support |
e50bb9a1 |
680 | |
722d2a37 |
681 | Added in 5.7.1 |
e50bb9a1 |
682 | |
722d2a37 |
683 | =head2 Mailing list archives |
684 | |
685 | http://lists.perl.org/, http://archive.develooper.com/ |
686 | |
687 | =head2 Bug tracking |
688 | |
689 | Richard Foley has written the bug tracking system at http://bugs.perl.org/ |
e50bb9a1 |
690 | |
722d2a37 |
691 | =head2 Integrate MacPerl |
e50bb9a1 |
692 | |
722d2a37 |
693 | Chris Nandor and Matthias Neeracher have integrated the MacPerl changes |
694 | into 5.6.0. |
e50bb9a1 |
695 | |
722d2a37 |
696 | =head2 Web "nerve center" for Perl |
e50bb9a1 |
697 | |
722d2a37 |
698 | http://use.perl.org/ is what you're looking for. |
e50bb9a1 |
699 | |
722d2a37 |
700 | =head2 Regular expression tutorial |
e50bb9a1 |
701 | |
722d2a37 |
702 | C<perlretut>, provided by Mark Kvale. |
e50bb9a1 |
703 | |
722d2a37 |
704 | =head2 Debugging Tutorial |
e50bb9a1 |
705 | |
722d2a37 |
706 | C<perldebtut>, written by Richard Foley. |
e50bb9a1 |
707 | |
722d2a37 |
708 | =head2 Integrate new modules |
e50bb9a1 |
709 | |
722d2a37 |
710 | Jarkko has been integrating madly into 5.7.x |
e50bb9a1 |
711 | |
722d2a37 |
712 | =head2 Integrate profiler |
e50bb9a1 |
713 | |
722d2a37 |
714 | C<Devel::DProf> is now a core module. |
e50bb9a1 |
715 | |
722d2a37 |
716 | =head2 Y2K error detection |
e50bb9a1 |
717 | |
722d2a37 |
718 | There's a configure option to detect unsafe concatenation with "19", and |
719 | a CPAN module. (C<D'oh::Year>) |
e50bb9a1 |
720 | |
722d2a37 |
721 | =head2 Regular expression debugger |
e50bb9a1 |
722 | |
722d2a37 |
723 | While not part of core, Mark-Jason Dominus has written C<Rx> and has |
724 | also come up with a generalised strategy for regular expression |
725 | debugging. |
e50bb9a1 |
726 | |
722d2a37 |
727 | =head2 POD checker |
e50bb9a1 |
728 | |
722d2a37 |
729 | That's, uh, F<podchecker> |
e50bb9a1 |
730 | |
722d2a37 |
731 | =head2 "Dynamic" lexicals |
e50bb9a1 |
732 | |
722d2a37 |
733 | =head2 Cache precompiled modules |
e50bb9a1 |
734 | |
722d2a37 |
735 | =head1 Deprecated Wishes |
e50bb9a1 |
736 | |
722d2a37 |
737 | These are items which used to be in the todo file, but have been |
738 | deprecated for some reason. |
e50bb9a1 |
739 | |
722d2a37 |
740 | =head2 Loop control on do{} |
e50bb9a1 |
741 | |
722d2a37 |
742 | This would break old code; use C<do{{ }}> instead. |
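A minimal sketch of the double-brace idiom, assuming you only need C<next>: the inner braces form a bare block, which counts as a loop that runs once, so C<next> leaves it and control falls through to the C<while> condition. The variable names are made up for the example.

```perl
use strict;
use warnings;

my @seen;
my $i = 0;

do {{
    $i++;
    next if $i % 2;     # exits the inner bare block, then the condition is tested
    push @seen, $i;     # only reached for even values
}} while ($i < 6);

print "@seen\n";
```

Making C<last> work needs a further enclosing block around the whole statement, which is not shown here.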
e50bb9a1 |
743 | |
722d2a37 |
744 | =head2 Lexically scoped typeglobs |
e50bb9a1 |
745 | |
722d2a37 |
746 | Not needed now we have lexical IO handles. |
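For illustration, a lexical IO handle is simply a C<my> variable passed to C<open>. The sketch below reads from an in-memory string so that it is self-contained; that part assumes a perl built with C<perlio>, and the variable names are hypothetical.

```perl
use strict;
use warnings;

my $text = "first\nsecond\n";
my @lines;

{
    # $fh is an ordinary lexical; no package-level typeglob is involved.
    open my $fh, '<', \$text or die "cannot open in-memory file: $!";
    @lines = <$fh>;
    # The handle is closed automatically when $fh goes out of scope here.
}

print scalar(@lines), " lines read\n";
```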
e50bb9a1 |
747 | |
722d2a37 |
748 | =head2 format BOTTOM |
3958b146 |
749 | |
722d2a37 |
750 | =head2 report HANDLE |
e50bb9a1 |
751 | |
722d2a37 |
752 | Damian Conway's text formatting modules seem to be the Way To Go. |
e50bb9a1 |
753 | |
722d2a37 |
754 | =head2 Generalised want()/caller() |
3958b146 |
755 | |
722d2a37 |
756 | =head2 Named prototypes |
e50bb9a1 |
757 | |
722d2a37 |
758 | These both seem to be delayed until Perl 6. |
e50bb9a1 |
759 | |
722d2a37 |
760 | =head2 Built-in globbing |
e50bb9a1 |
761 | |
722d2a37 |
762 | The C<File::Glob> module has been used to replace the C<glob> function. |
e50bb9a1 |
763 | |
722d2a37 |
764 | =head2 Regression tests for suidperl |
e50bb9a1 |
765 | |
722d2a37 |
766 | C<suidperl> is deprecated in favour of common sense. |
e50bb9a1 |
767 | |
722d2a37 |
768 | =head2 Cached hash values |
e50bb9a1 |
769 | |
722d2a37 |
770 | We have shared hash keys, which perform the same job. |
e50bb9a1 |
771 | |
722d2a37 |
772 | =head2 Add compression modules |
e50bb9a1 |
773 | |
722d2a37 |
774 | The compression modules are a little heavy; meanwhile, Nick Clark is |
775 | working on experimental pragmata to do transparent decompression on |
776 | input. |
e50bb9a1 |
777 | |
722d2a37 |
778 | =head2 Reorganise documentation into tutorials/references |
e50bb9a1 |
779 | |
722d2a37 |
780 | Could not get consensus on P5P about this. |
e50bb9a1 |
781 | |
722d2a37 |
782 | =head2 Remove distinction between functions and operators |
783 | |
784 | Caution: highly flammable. |
785 | |
786 | =head2 Make XS easier to use |
e50bb9a1 |
787 | |
722d2a37 |
788 | Use C<Inline> instead, or SWIG. |
e50bb9a1 |
789 | |
722d2a37 |
790 | =head2 Make embedding easier to use |
e50bb9a1 |
791 | |
722d2a37 |
792 | Use C<Inline::CPR>. |
e50bb9a1 |
793 | |
722d2a37 |
794 | =head2 man for perl |
04c70446 |
795 | |
722d2a37 |
796 | See the Perl Power Tools. (http://language.perl.com/ppt/) |
04c70446 |
797 | |
722d2a37 |
798 | =head2 my $Package::variable |
04c70446 |
799 | |
722d2a37 |
800 | Use C<our> instead. |
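A short sketch of what C<our> gives you: a lexically scoped name for a package variable, so the short name can be used under C<use strict> while the variable stays reachable by its qualified name. The package and sub names are invented for the example.

```perl
use strict;
use warnings;

package Counter;

our $count = 0;             # short name for $Counter::count in this scope

sub bump { return ++$count }

package main;

Counter::bump() for 1 .. 3;

# The same variable is visible under its fully qualified name.
print "count is $Counter::count\n";
```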
04c70446 |
801 | |
722d2a37 |
802 | =head2 "or" tests defined, not truth |
04c70446 |
803 | |
722d2a37 |
804 | Suggesting this on P5P B<will> cause a boring and interminable flamewar. |
04c70446 |
805 | |
722d2a37 |
806 | =head2 "class"-based lexicals |
04c70446 |
807 | |
cbb3fa72 |
808 | Use flyweight objects, secure hashes or, dare I say it, pseudo-hashes instead. |
f86a8bc5 |
809 | (Or whatever will replace pseudo-hashes in 5.10.) |
04c70446 |
810 | |
722d2a37 |
811 | =head2 byteperl |
04c70446 |
812 | |
722d2a37 |
813 | C<ByteLoader> covers this. |
04c70446 |
814 | |
722d2a37 |
815 | =head2 Lazy evaluation / tail recursion removal |
04c70446 |
816 | |
f86a8bc5 |
817 | C<List::Util> gives first() (a short-circuiting grep); tail recursion |
818 | removal is done manually, with C<goto &whoami;>. (However, MJD has |
819 | found that C<goto &whoami> introduces a performance penalty, so maybe |
820 | there should be a way to do this after all: C<sub foo {START: ... goto |
821 | START;}> is better.) |
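The three techniques mentioned above can be sketched as follows. This is only an illustration with made-up sub names: C<first> from C<List::Util> stops at the first match, C<goto &sub> reuses the current call frame with a fresh C<@_>, and the labelled version turns the recursion into a plain loop.

```perl
use strict;
use warnings;
use List::Util qw(first);

# Short-circuiting search: stops as soon as the block returns true.
my $hit = first { $_ > 10 } 3, 8, 12, 40;

# Manual tail call: replace @_ and jump to the same sub without
# growing the call stack.
sub sum_to_goto_sub {
    my ($n, $acc) = @_;
    return $acc if $n == 0;
    @_ = ($n - 1, $acc + $n);
    goto &sum_to_goto_sub;
}

# Same computation with a label and goto LABEL inside one call.
sub sum_to_label {
    my ($n, $acc) = @_;
    START:
    return $acc if $n == 0;
    ($n, $acc) = ($n - 1, $acc + $n);
    goto START;
}

print "$hit ", sum_to_goto_sub(100, 0), " ", sum_to_label(100, 0), "\n";
```

No timing is shown here; whether the label form is actually faster on a given perl is something to measure rather than assume.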
0562c0e3 |
822 | |
823 | =head2 Make "use utf8" the default |
824 | |
f86a8bc5 |
825 | Because of backward compatibility this is difficult: scripts could not |
826 | contain B<any legacy eight-bit data> (like Latin-1) anymore, even in |
827 | string literals or pod. It would also introduce a measurable slowdown of |
828 | at least a few percent, since all regular expression operations would |
829 | be done in full UTF-8. But if you want to try this, add |
830 | -DUSE_UTF8_SCRIPTS to your compilation flags. |
831 | |
3298bd4d |
832 | =head2 Unicode collation and normalization |
833 | |
834 | The Unicode::Collate and Unicode::Normalize modules |
835 | by SADAHIRO Tomoyuki have been included since 5.8.0. |
836 | |
837 | Collation? http://www.unicode.org/unicode/reports/tr10/ |
838 | Normalization? http://www.unicode.org/unicode/reports/tr15/ |
0562c0e3 |
839 | |
825b3abc |
840 | =head2 Create debugging macros |
841 | |
842 | Debugging macros (like printsv, dump) can make debugging perl inside a |
843 | C debugger much easier. A good set for gdb comes with mod_perl. |
844 | Something similar should be distributed with perl. |
845 | |
846 | The proper way to do this is to use and extend Devel::DebugInit. |
847 | Devel::DebugInit also needs to be extended to support threads. |
848 | |
849 | See p5p archives for late May/early June 2001 for a recent discussion |
850 | on this topic. |
851 | |
3298bd4d |
852 | =cut |