Integrate with Sarathy.
[p5sagit/p5-mst-13.2.git] / pod / perldebug.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perldebug - Perl debugging
4
5=head1 DESCRIPTION
6
7First of all, have you tried using the B<-w> switch?
8
4e1d3b43 9=head1 The Perl Debugger
10
26f28346 11"As soon as we started programming, we found to our
12surprise that it wasn't as easy to get programs right
13as we had thought. Debugging had to be discovered.
14I can remember the exact instant when I realized that
15a large part of my life from then on was going to be
16spent in finding mistakes in my own programs."
84902520 17
18I< --Maurice Wilkes, 1949>
26f28346 19
4e1d3b43 20If you invoke Perl with the B<-d> switch, your script runs under the
21Perl source debugger. This works like an interactive Perl
22environment, prompting for debugger commands that let you examine
68dc0745 23source code, set breakpoints, get stack backtraces, change the values of
4e1d3b43 24variables, etc. This is so convenient that you often fire up
54310121 25the debugger all by itself just to test out Perl constructs
4e1d3b43 26interactively to see what they do. For example:
27
28 perl -d -e 42
29
30In Perl, the debugger is not a separate program as it usually is in the
31typical compiled environment. Instead, the B<-d> flag tells the compiler
32to insert source information into the parse trees it's about to hand off
33to the interpreter. That means your code must first compile correctly
34for the debugger to work on it. Then when the interpreter starts up, it
54310121 35preloads a Perl library file containing the debugger itself.
4e1d3b43 36
37The program will halt I<right before> the first run-time executable
38statement (but see below regarding compile-time statements) and ask you
39to enter a debugger command. Contrary to popular expectations, whenever
40the debugger halts and shows you a line of code, it always displays the
41line it's I<about> to execute, rather than the one it has just executed.
42
43Any command not recognized by the debugger is directly executed
44(C<eval>'d) as Perl code in the current package. (The debugger uses the
45DB package for its own state information.)
46
47Leading white space before a command would cause the debugger to think
48it's I<NOT> a debugger command but for Perl, so be careful not to do
49that.
50
51=head2 Debugger Commands
52
53The debugger understands the following commands:
a0d0e21e 54
55=over 12
56
4e1d3b43 57=item h [command]
58
54310121 59Prints out a help message.
4e1d3b43 60
61If you supply another debugger command as an argument to the C<h> command,
62it prints out the description for just that command. The special
63argument of C<h h> produces a more compact help listing, designed to fit
64together on one screen.
65
7b8d334a 66If the output of the C<h> command (or any command, for that matter) scrolls
4e1d3b43 67past your screen, either precede the command with a leading pipe symbol so
68it's run through your pager, as in
69
70 DB> |h
71
e7ea3e70 72You may change the pager which is used via C<O pager=...> command.
73
4e1d3b43 74=item p expr
75
36477c24 76Same as C<print {$DB::OUT} expr> in the current package. In particular,
5f05dabc 77because this is just Perl's own B<print> function, this means that nested
4e1d3b43 78data structures and objects are not dumped, unlike with the C<x> command.
79
e7ea3e70 80The C<DB::OUT> filehandle is opened to F</dev/tty>, regardless of
81where STDOUT may be redirected to.
82
4e1d3b43 83=item x expr
84
54310121 85Evaluates its expression in list context and dumps out the result
4e1d3b43 86in a pretty-printed fashion. Nested data structures are printed out
87recursively, unlike the C<print> function.
88
36477c24 89The details of printout are governed by multiple C<O>ptions.
90
4e1d3b43 91=item V [pkg [vars]]
92
93Display all (or some) variables in package (defaulting to the C<main>
94package) using a data pretty-printer (hashes show their keys and values so
95you see what's what, control characters are made printable, etc.). Make
96sure you don't put the type specifier (like C<$>) there, just the symbol
97names, like this:
98
99 V DB filename line
100
101Use C<~pattern> and C<!pattern> for positive and negative regexps.
a0d0e21e 102
4e1d3b43 103Nested data structures are printed out in a legible fashion, unlike
104the C<print> function.
105
36477c24 106The details of printout are governed by multiple C<O>ptions.
107
4e1d3b43 108=item X [vars]
109
110Same as C<V currentpackage [vars]>.
a0d0e21e 111
112=item T
113
68dc0745 114Produce a stack backtrace. See below for details on its output.
a0d0e21e 115
4e1d3b43 116=item s [expr]
a0d0e21e 117
118Single step. Executes until it reaches the beginning of another
4e1d3b43 119statement, descending into subroutine calls. If an expression is
120supplied that includes function calls, it too will be single-stepped.
a0d0e21e 121
e7ea3e70 122=item n [expr]
a0d0e21e 123
124Next. Executes over subroutine calls, until it reaches the beginning
774d564b 125of the next statement. If an expression is supplied that includes
126function calls, those functions will be executed with stops before
127each statement.
a0d0e21e 128
184e9718 129=item E<lt>CRE<gt>
a0d0e21e 130
4e1d3b43 131Repeat last C<n> or C<s> command.
a0d0e21e 132
36477c24 133=item c [line|sub]
a0d0e21e 134
4e1d3b43 135Continue, optionally inserting a one-time-only breakpoint
36477c24 136at the specified line or subroutine.
a0d0e21e 137
4e1d3b43 138=item l
a0d0e21e 139
4e1d3b43 140List next window of lines.
a0d0e21e 141
142=item l min+incr
143
4e1d3b43 144List C<incr+1> lines starting at C<min>.
a0d0e21e 145
146=item l min-max
147
1761cee5 148List lines C<min> through C<max>. C<l E<45>> is synonymous to C<E<45>>.
a0d0e21e 149
150=item l line
151
4e1d3b43 152List a single line.
a0d0e21e 153
4e1d3b43 154=item l subname
a0d0e21e 155
83ee9e09 156List first window of lines from subroutine. I<subname> may
157be a variable which contains a code reference.
a0d0e21e 158
159=item -
160
4e1d3b43 161List previous window of lines.
a0d0e21e 162
4e1d3b43 163=item w [line]
a0d0e21e 164
4e1d3b43 165List window (a few lines) around the current line.
a0d0e21e 166
4e1d3b43 167=item .
a0d0e21e 168
4e1d3b43 169Return debugger pointer to the last-executed line and
170print it out.
171
172=item f filename
173
774d564b 174Switch to viewing a different file or eval statement. If C<filename>
e7ea3e70 175is not a full filename as found in values of %INC, it is considered as
176a regexp.
a0d0e21e 177
bee32ff8 178C<eval>ed strings (when accessible) are considered to be filenames:
179C<f (eval 7)> and C<f eval 7\b> access the body of the 7th C<eval>ed string
180(in the order of execution). The bodies of currently executed C<eval>
181and of C<eval>ed strings which define subroutines are saved, thus are
182accessible by this mechanism.
183
a0d0e21e 184=item /pattern/
185
4e1d3b43 186Search forwards for pattern; final / is optional.
a0d0e21e 187
188=item ?pattern?
189
4e1d3b43 190Search backwards for pattern; final ? is optional.
a0d0e21e 191
192=item L
193
36477c24 194List all breakpoints and actions.
a0d0e21e 195
4e1d3b43 196=item S [[!]pattern]
a0d0e21e 197
4e1d3b43 198List subroutine names [not] matching pattern.
a0d0e21e 199
200=item t
201
36477c24 202Toggle trace mode (see also C<AutoTrace> C<O>ption).
4e1d3b43 203
204=item t expr
205
206Trace through execution of expr. For example:
207
208 $ perl -de 42
209 Stack dump during die enabled outside of evals.
a0d0e21e 210
4e1d3b43 211 Loading DB routines from perl5db.pl patch level 0.94
212 Emacs support available.
213
214 Enter h or `h h' for help.
215
216 main::(-e:1): 0
217 DB<1> sub foo { 14 }
218
219 DB<2> sub bar { 3 }
220
221 DB<3> t print foo() * bar()
222 main::((eval 172):3): print foo() + bar();
223 main::foo((eval 168):2):
224 main::bar((eval 170):2):
225 42
36477c24 226
227or, with the C<O>ption C<frame=2> set,
228
229 DB<4> O f=2
230 frame = '2'
231 DB<5> t print foo() * bar()
232 3: foo() * bar()
233 entering main::foo
234 2: sub foo { 14 };
235 exited main::foo
236 entering main::bar
237 2: sub bar { 3 };
238 exited main::bar
239 42
4e1d3b43 240
241=item b [line] [condition]
a0d0e21e 242
243Set a breakpoint. If line is omitted, sets a breakpoint on the line
4e1d3b43 244that is about to be executed. If a condition is specified, it's
a0d0e21e 245evaluated each time the statement is reached and a breakpoint is taken
5f05dabc 246only if the condition is true. Breakpoints may be set on only lines
4e1d3b43 247that begin an executable statement. Conditions don't use B<if>:
a0d0e21e 248
249 b 237 $x > 30
36477c24 250 b 237 ++$count237 < 11
a0d0e21e 251 b 33 /pattern/i
252
4e1d3b43 253=item b subname [condition]
a0d0e21e 254
83ee9e09 255Set a breakpoint at the first line of the named subroutine. I<subname> may
256be a variable which contains a code reference (in this case I<condition>
257is not supported).
a0d0e21e 258
36477c24 259=item b postpone subname [condition]
260
261Set breakpoint at first line of subroutine after it is compiled.
262
263=item b load filename
264
774d564b 265Set breakpoint at the first executed line of the file. Filename should
e7ea3e70 266be a full name as found in values of %INC.
267
268=item b compile subname
269
270Sets breakpoint at the first statement executed after the subroutine
271is compiled.
36477c24 272
4e1d3b43 273=item d [line]
a0d0e21e 274
4e1d3b43 275Delete a breakpoint at the specified line. If line is omitted, deletes
276the breakpoint on the line that is about to be executed.
a0d0e21e 277
278=item D
279
4e1d3b43 280Delete all installed breakpoints.
281
282=item a [line] command
283
284Set an action to be done before the line is executed.
285The sequence of steps taken by the debugger is
286
8ebc5c01 287 1. check for a breakpoint at this line
288 2. print the line if necessary (tracing)
289 3. do any actions associated with that line
290 4. prompt user if at a breakpoint or in single-step
291 5. evaluate line
a0d0e21e 292
7b8d334a 293For example, this will print out $foo every time line
4e1d3b43 29453 is passed:
a0d0e21e 295
4e1d3b43 296 a 53 print "DB FOUND $foo\n"
a0d0e21e 297
298=item A
299
4e1d3b43 300Delete all installed actions.
301
6ee623d5 302=item W [expr]
303
304Add a global watch-expression.
305
306=item W
307
308Delete all watch-expressions.
309
c739111c 310=item r
311
312Continue until return from the current subroutine, and dump the return value.
313
4e1d3b43 314=item O [opt[=val]] [opt"val"] [opt?]...
315
316Set or query values of options. val defaults to 1. opt can
317be abbreviated. Several options can be listed.
318
319=over 12
320
e7ea3e70 321=item C<recallCommand>, C<ShellBang>
4e1d3b43 322
323The characters used to recall command or spawn shell. By
324default, these are both set to C<!>.
325
e7ea3e70 326=item C<pager>
4e1d3b43 327
328Program to use for output of pager-piped commands (those
329beginning with a C<|> character.) By default,
330C<$ENV{PAGER}> will be used.
331
e7ea3e70 332=item C<tkRunning>
36477c24 333
334Run Tk while prompting (with ReadLine).
335
e7ea3e70 336=item C<signalLevel>, C<warnLevel>, C<dieLevel>
337
774d564b 338Level of verbosity. By default the debugger is in a sane verbose mode,
e7ea3e70 339thus it will print backtraces on all the warnings and die-messages
340which are going to be printed out, and will print a message when
54310121 341interesting uncaught signals arrive.
36477c24 342
774d564b 343To disable this behaviour, set these values to 0. If C<dieLevel> is 2,
e7ea3e70 344then the messages which will be caught by surrounding C<eval> are also
345printed.
36477c24 346
e7ea3e70 347=item C<AutoTrace>
36477c24 348
e7ea3e70 349Trace mode (similar to C<t> command, but can be put into
350C<PERLDB_OPTS>).
36477c24 351
e7ea3e70 352=item C<LineInfo>
36477c24 353
e7ea3e70 354File or pipe to print line number info to. If it is a pipe (say,
355C<|visual_perl_db>), then a short, "emacs like" message is used.
36477c24 356
357=item C<inhibit_exit>
358
359If 0, allows I<stepping off> the end of the script.
360
54310121 361=item C<PrintRet>
36477c24 362
363affects printing of return value after C<r> command.
364
28d1fb14 365=item C<ornaments>
366
3e3baf6d 367affects screen appearance of the command line (see L<Term::ReadLine>).
28d1fb14 368
54310121 369=item C<frame>
36477c24 370
371affects printing messages on entry and exit from subroutines. If
372C<frame & 2> is false, messages are printed on entry only. (Printing
5f05dabc 373on exit may be useful if inter(di)spersed with other messages.)
36477c24 374
375If C<frame & 4>, arguments to functions are printed as well as the
774d564b 376context and caller info. If C<frame & 8>, overloaded C<stringify> and
28d1fb14 377C<tie>d C<FETCH> are enabled on the printed arguments. If C<frame &
37816>, the return value from the subroutine is printed as well.
379
380The length at which the argument list is truncated is governed by the
381next option:
e7ea3e70 382
383=item C<maxTraceLen>
384
385length at which the argument list is truncated when C<frame> option's
386bit 4 is set.
36477c24 387
4e1d3b43 388=back
389
390The following options affect what happens with C<V>, C<X>, and C<x>
391commands:
392
393=over 12
394
e7ea3e70 395=item C<arrayDepth>, C<hashDepth>
4e1d3b43 396
397Print only first N elements ('' for all).
398
e7ea3e70 399=item C<compactDump>, C<veryCompact>
4e1d3b43 400
774d564b 401Change style of array and hash dump. If C<compactDump>, short array
e7ea3e70 402may be printed on one line.
4e1d3b43 403
e7ea3e70 404=item C<globPrint>
4e1d3b43 405
406Whether to print contents of globs.
407
e7ea3e70 408=item C<DumpDBFiles>
4e1d3b43 409
410Dump arrays holding debugged files.
411
e7ea3e70 412=item C<DumpPackages>
4e1d3b43 413
414Dump symbol tables of packages.
415
6ee623d5 416=item C<DumpReused>
417
418Dump contents of "reused" addresses.
419
e7ea3e70 420=item C<quote>, C<HighBit>, C<undefPrint>
421
774d564b 422Change style of string dump. Default value of C<quote> is C<auto>, one
e7ea3e70 423can enable either double-quotish dump, or single-quotish by setting it
774d564b 424to C<"> or C<'>. By default, characters with high bit set are printed
e7ea3e70 425I<as is>.
426
54310121 427=item C<UsageOnly>
4e1d3b43 428
774d564b 429I<very> rudimentally per-package memory usage dump. Calculates total
e7ea3e70 430size of strings in variables in the package.
4e1d3b43 431
36477c24 432=back
4e1d3b43 433
36477c24 434During startup options are initialized from C<$ENV{PERLDB_OPTS}>.
435You can put additional initialization options C<TTY>, C<noTTY>,
436C<ReadLine>, and C<NonStop> there.
437
438Example rc file:
4e1d3b43 439
e7ea3e70 440 &parse_options("NonStop=1 LineInfo=db.out AutoTrace");
4e1d3b43 441
36477c24 442The script will run without human intervention, putting trace information
443into the file I<db.out>. (If you interrupt it, you would better reset
444C<LineInfo> to something "interactive"!)
4e1d3b43 445
36477c24 446=over 12
4e1d3b43 447
36477c24 448=item C<TTY>
4e1d3b43 449
36477c24 450The TTY to use for debugging I/O.
451
36477c24 452=item C<noTTY>
453
774d564b 454If set, goes in C<NonStop> mode, and would not connect to a TTY. If
36477c24 455interrupt (or if control goes to debugger via explicit setting of
456$DB::signal or $DB::single from the Perl script), connects to a TTY
457specified by the C<TTY> option at startup, or to a TTY found at
458runtime using C<Term::Rendezvous> module of your choice.
459
460This module should implement a method C<new> which returns an object
461with two methods: C<IN> and C<OUT>, returning two filehandles to use
774d564b 462for debugging input and output correspondingly. Method C<new> may
36477c24 463inspect an argument which is a value of C<$ENV{PERLDB_NOTTY}> at
464startup, or is C<"/tmp/perldbtty$$"> otherwise.
465
466=item C<ReadLine>
467
468If false, readline support in debugger is disabled, so you can debug
469ReadLine applications.
470
471=item C<NonStop>
472
54310121 473If set, debugger goes into noninteractive mode until interrupted, or
36477c24 474programmatically by setting $DB::signal or $DB::single.
475
476=back
477
478Here's an example of using the C<$ENV{PERLDB_OPTS}> variable:
4e1d3b43 479
e7ea3e70 480 $ PERLDB_OPTS="N f=2" perl -d myprogram
4e1d3b43 481
482will run the script C<myprogram> without human intervention, printing
483out the call tree with entry and exit points. Note that C<N f=2> is
774d564b 484equivalent to C<NonStop=1 frame=2>. Note also that at the moment when
4e1d3b43 485this documentation was written all the options to the debugger could
36477c24 486be uniquely abbreviated by the first letter (with exception of
487C<Dump*> options).
4e1d3b43 488
36477c24 489Other examples may include
a0d0e21e 490
e7ea3e70 491 $ PERLDB_OPTS="N f A L=listing" perl -d myprogram
a0d0e21e 492
54310121 493- runs script noninteractively, printing info on each entry into a
36477c24 494subroutine and each executed line into the file F<listing>. (If you
495interrupt it, you would better reset C<LineInfo> to something
496"interactive"!)
497
498
e7ea3e70 499 $ env "PERLDB_OPTS=R=0 TTY=/dev/ttyc" perl -d myprogram
36477c24 500
501may be useful for debugging a program which uses C<Term::ReadLine>
774d564b 502itself. Do not forget detach shell from the TTY in the window which
36477c24 503corresponds to F</dev/ttyc>, say, by issuing a command like
504
e7ea3e70 505 $ sleep 1000000
36477c24 506
507See L<"Debugger Internals"> below for more details.
508
509=item E<lt> [ command ]
510
511Set an action (Perl command) to happen before every debugger prompt.
4a6725af 512A multi-line command may be entered by backslashing the newlines. If
36477c24 513C<command> is missing, resets the list of actions.
514
515=item E<lt>E<lt> command
516
517Add an action (Perl command) to happen before every debugger prompt.
4a6725af 518A multi-line command may be entered by backslashing the newlines.
a0d0e21e 519
184e9718 520=item E<gt> command
a0d0e21e 521
36477c24 522Set an action (Perl command) to happen after the prompt when you've
4a6725af 523just given a command to return to executing the script. A multi-line
36477c24 524command may be entered by backslashing the newlines. If C<command> is
525missing, resets the list of actions.
526
527=item E<gt>E<gt> command
528
529Adds an action (Perl command) to happen after the prompt when you've
4a6725af 530just given a command to return to executing the script. A multi-line
36477c24 531command may be entered by backslashing the newlines.
532
533=item { [ command ]
534
535Set an action (debugger command) to happen before every debugger prompt.
4a6725af 536A multi-line command may be entered by backslashing the newlines. If
36477c24 537C<command> is missing, resets the list of actions.
538
539=item {{ command
540
541Add an action (debugger command) to happen before every debugger prompt.
4a6725af 542A multi-line command may be entered by backslashing the newlines.
a0d0e21e 543
4e1d3b43 544=item ! number
a0d0e21e 545
4e1d3b43 546Redo a previous command (default previous command).
a0d0e21e 547
4e1d3b43 548=item ! -number
a0d0e21e 549
4e1d3b43 550Redo number'th-to-last command.
a0d0e21e 551
4e1d3b43 552=item ! pattern
a0d0e21e 553
4e1d3b43 554Redo last command that started with pattern.
555See C<O recallCommand>, too.
a0d0e21e 556
4e1d3b43 557=item !! cmd
a0d0e21e 558
4e1d3b43 559Run cmd in a subprocess (reads from DB::IN, writes to DB::OUT)
560See C<O shellBang> too.
a0d0e21e 561
562=item H -number
563
564Display last n commands. Only commands longer than one character are
565listed. If number is omitted, lists them all.
566
567=item q or ^D
568
36477c24 569Quit. ("quit" doesn't work for this.) This is the only supported way
570to exit the debugger, though typing C<exit> twice may do it too.
571
572Set an C<O>ption C<inhibit_exit> to 0 if you want to be able to I<step
19799a22 573off> the end the script. You may also need to set $finished to 0 at
36477c24 574some moment if you want to step through global destruction.
a0d0e21e 575
4e1d3b43 576=item R
577
578Restart the debugger by B<exec>ing a new session. It tries to maintain
579your history across this, but internal settings and command line options
580may be lost.
581
5f05dabc 582Currently the following setting are preserved: history, breakpoints,
54310121 583actions, debugger C<O>ptions, and the following command line
5f05dabc 584options: B<-w>, B<-I>, and B<-e>.
36477c24 585
4e1d3b43 586=item |dbcmd
587
588Run debugger command, piping DB::OUT to current pager.
589
590=item ||dbcmd
591
592Same as C<|dbcmd> but DB::OUT is temporarily B<select>ed as well.
593Often used with commands that would otherwise produce long
594output, such as
595
596 |V main
597
598=item = [alias value]
599
e7ea3e70 600Define a command alias, like
601
602 = quit q
603
604or list current aliases.
4e1d3b43 605
a0d0e21e 606=item command
607
608Execute command as a Perl statement. A missing semicolon will be
609supplied.
610
e7ea3e70 611=item m expr
a0d0e21e 612
e7ea3e70 613The expression is evaluated, and the methods which may be applied to
614the result are listed.
615
616=item m package
617
618The methods which may be applied to objects in the C<package> are listed.
a0d0e21e 619
620=back
621
e7ea3e70 622=head2 Debugger input/output
623
624=over 8
625
626=item Prompt
627
4e1d3b43 628The debugger prompt is something like
629
630 DB<8>
631
632or even
633
634 DB<<17>>
635
636where that number is the command number, which you'd use to access with
54310121 637the builtin B<csh>-like history mechanism, e.g., C<!17> would repeat
4e1d3b43 638command number 17. The number of angle brackets indicates the depth of
639the debugger. You could get more than one set of brackets, for example, if
640you'd already at a breakpoint and then printed out the result of a
36477c24 641function call that itself also has a breakpoint, or you step into an
642expression via C<s/n/t expression> command.
4e1d3b43 643
54310121 644=item Multiline commands
e7ea3e70 645
4a6725af 646If you want to enter a multi-line command, such as a subroutine
e7ea3e70 647definition with several statements, or a format, you may escape the
648newline that would normally end the debugger command with a backslash.
649Here's an example:
a0d0e21e 650
4e1d3b43 651 DB<1> for (1..4) { \
652 cont: print "ok\n"; \
653 cont: }
654 ok
655 ok
656 ok
657 ok
658
659Note that this business of escaping a newline is specific to interactive
660commands typed into the debugger.
661
e7ea3e70 662=item Stack backtrace
663
68dc0745 664Here's an example of what a stack backtrace via C<T> command might
e7ea3e70 665look like:
4e1d3b43 666
667 $ = main::infested called from file `Ambulation.pm' line 10
668 @ = Ambulation::legs(1, 2, 3, 4) called from file `camel_flea' line 7
669 $ = main::pests('bactrian', 4) called from file `camel_flea' line 4
670
671The left-hand character up there tells whether the function was called
672in a scalar or list context (we bet you can tell which is which). What
673that says is that you were in the function C<main::infested> when you ran
674the stack dump, and that it was called in a scalar context from line 10
675of the file I<Ambulation.pm>, but without any arguments at all, meaning
676it was called as C<&infested>. The next stack frame shows that the
677function C<Ambulation::legs> was called in a list context from the
678I<camel_flea> file with four arguments. The last stack frame shows that
679C<main::pests> was called in a scalar context, also from I<camel_flea>,
680but from line 4.
681
e7ea3e70 682Note that if you execute C<T> command from inside an active C<use>
7b8d334a 683statement, the backtrace will contain both C<require>
684frame and an C<eval>) frame.
e7ea3e70 685
686=item Listing
687
688Listing given via different flavors of C<l> command looks like this:
689
690 DB<<13>> l
691 101: @i{@i} = ();
692 102:b @isa{@i,$pack} = ()
693 103 if(exists $i{$prevpack} || exists $isa{$pack});
694 104 }
695 105
696 106 next
697 107==> if(exists $isa{$pack});
698 108
699 109:a if ($extra-- > 0) {
700 110: %isa = ($pack,1);
701
702Note that the breakable lines are marked with C<:>, lines with
703breakpoints are marked by C<b>, with actions by C<a>, and the
704next executed line is marked by C<==E<gt>>.
705
706=item Frame listing
707
708When C<frame> option is set, debugger would print entered (and
709optionally exited) subroutines in different styles.
710
54310121 711What follows is the start of the listing of
e7ea3e70 712
28d1fb14 713 env "PERLDB_OPTS=f=n N" perl -d -V
714
715for different values of C<n>:
e7ea3e70 716
717=over 4
718
719=item 1
720
721 entering main::BEGIN
722 entering Config::BEGIN
723 Package lib/Exporter.pm.
724 Package lib/Carp.pm.
725 Package lib/Config.pm.
726 entering Config::TIEHASH
727 entering Exporter::import
728 entering Exporter::export
729 entering Config::myconfig
730 entering Config::FETCH
731 entering Config::FETCH
732 entering Config::FETCH
733 entering Config::FETCH
734
735=item 2
736
737 entering main::BEGIN
738 entering Config::BEGIN
739 Package lib/Exporter.pm.
740 Package lib/Carp.pm.
741 exited Config::BEGIN
742 Package lib/Config.pm.
743 entering Config::TIEHASH
744 exited Config::TIEHASH
745 entering Exporter::import
746 entering Exporter::export
747 exited Exporter::export
748 exited Exporter::import
749 exited main::BEGIN
750 entering Config::myconfig
751 entering Config::FETCH
752 exited Config::FETCH
753 entering Config::FETCH
754 exited Config::FETCH
755 entering Config::FETCH
756
757=item 4
758
759 in $=main::BEGIN() from /dev/nul:0
760 in $=Config::BEGIN() from lib/Config.pm:2
761 Package lib/Exporter.pm.
762 Package lib/Carp.pm.
763 Package lib/Config.pm.
764 in $=Config::TIEHASH('Config') from lib/Config.pm:644
765 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
766 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from li
767 in @=Config::myconfig() from /dev/nul:0
768 in $=Config::FETCH(ref(Config), 'package') from lib/Config.pm:574
769 in $=Config::FETCH(ref(Config), 'baserev') from lib/Config.pm:574
cceca5ed 770 in $=Config::FETCH(ref(Config), 'PERL_VERSION') from lib/Config.pm:574
771 in $=Config::FETCH(ref(Config), 'PERL_SUBVERSION') from lib/Config.pm:574
e7ea3e70 772 in $=Config::FETCH(ref(Config), 'osname') from lib/Config.pm:574
773 in $=Config::FETCH(ref(Config), 'osvers') from lib/Config.pm:574
774
775=item 6
776
777 in $=main::BEGIN() from /dev/nul:0
778 in $=Config::BEGIN() from lib/Config.pm:2
779 Package lib/Exporter.pm.
780 Package lib/Carp.pm.
781 out $=Config::BEGIN() from lib/Config.pm:0
782 Package lib/Config.pm.
783 in $=Config::TIEHASH('Config') from lib/Config.pm:644
784 out $=Config::TIEHASH('Config') from lib/Config.pm:644
785 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
786 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/
787 out $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/
788 out $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
789 out $=main::BEGIN() from /dev/nul:0
790 in @=Config::myconfig() from /dev/nul:0
791 in $=Config::FETCH(ref(Config), 'package') from lib/Config.pm:574
792 out $=Config::FETCH(ref(Config), 'package') from lib/Config.pm:574
793 in $=Config::FETCH(ref(Config), 'baserev') from lib/Config.pm:574
794 out $=Config::FETCH(ref(Config), 'baserev') from lib/Config.pm:574
cceca5ed 795 in $=Config::FETCH(ref(Config), 'PERL_VERSION') from lib/Config.pm:574
796 out $=Config::FETCH(ref(Config), 'PERL_VERSION') from lib/Config.pm:574
797 in $=Config::FETCH(ref(Config), 'PERL_SUBVERSION') from lib/Config.pm:574
e7ea3e70 798
799=item 14
800
801 in $=main::BEGIN() from /dev/nul:0
802 in $=Config::BEGIN() from lib/Config.pm:2
803 Package lib/Exporter.pm.
804 Package lib/Carp.pm.
805 out $=Config::BEGIN() from lib/Config.pm:0
806 Package lib/Config.pm.
807 in $=Config::TIEHASH('Config') from lib/Config.pm:644
808 out $=Config::TIEHASH('Config') from lib/Config.pm:644
809 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
810 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/E
811 out $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/E
812 out $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
813 out $=main::BEGIN() from /dev/nul:0
814 in @=Config::myconfig() from /dev/nul:0
815 in $=Config::FETCH('Config=HASH(0x1aa444)', 'package') from lib/Config.pm:574
816 out $=Config::FETCH('Config=HASH(0x1aa444)', 'package') from lib/Config.pm:574
817 in $=Config::FETCH('Config=HASH(0x1aa444)', 'baserev') from lib/Config.pm:574
818 out $=Config::FETCH('Config=HASH(0x1aa444)', 'baserev') from lib/Config.pm:574
819
28d1fb14 820=item 30
821
822 in $=CODE(0x15eca4)() from /dev/null:0
823 in $=CODE(0x182528)() from lib/Config.pm:2
824 Package lib/Exporter.pm.
825 out $=CODE(0x182528)() from lib/Config.pm:0
826 scalar context return from CODE(0x182528): undef
827 Package lib/Config.pm.
828 in $=Config::TIEHASH('Config') from lib/Config.pm:628
829 out $=Config::TIEHASH('Config') from lib/Config.pm:628
830 scalar context return from Config::TIEHASH: empty hash
831 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/null:0
832 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/Exporter.pm:171
833 out $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/Exporter.pm:171
834 scalar context return from Exporter::export: ''
835 out $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/null:0
836 scalar context return from Exporter::import: ''
837
838
e7ea3e70 839=back
840
841In all the cases indentation of lines shows the call tree, if bit 2 of
842C<frame> is set, then a line is printed on exit from a subroutine as
843well, if bit 4 is set, then the arguments are printed as well as the
844caller info, if bit 8 is set, the arguments are printed even if they
28d1fb14 845are tied or references, if bit 16 is set, the return value is printed
846as well.
e7ea3e70 847
848When a package is compiled, a line like this
849
850 Package lib/Carp.pm.
851
852is printed with proper indentation.
853
854=back
855
856=head2 Debugging compile-time statements
857
4e1d3b43 858If you have any compile-time executable statements (code within a BEGIN
859block or a C<use> statement), these will C<NOT> be stopped by debugger,
36477c24 860although C<require>s will (and compile-time statements can be traced
54310121 861with C<AutoTrace> option set in C<PERLDB_OPTS>). From your own Perl
36477c24 862code, however, you can
4e1d3b43 863transfer control back to the debugger using the following statement,
864which is harmless if the debugger is not running:
a0d0e21e 865
866 $DB::single = 1;
867
4e1d3b43 868If you set C<$DB::single> to the value 2, it's equivalent to having
869just typed the C<n> command, whereas a value of 1 means the C<s>
870command. The C<$DB::trace> variable should be set to 1 to simulate
871having typed the C<t> command.
872
e7ea3e70 873Another way to debug compile-time code is to start debugger, set a
874breakpoint on I<load> of some module thusly
875
876 DB<7> b load f:/perllib/lib/Carp.pm
877 Will stop on load of `f:/perllib/lib/Carp.pm'.
878
774d564b 879and restart debugger by C<R> command (if possible). One can use C<b
e7ea3e70 880compile subname> for the same purpose.
881
4e1d3b43 882=head2 Debugger Customization
a0d0e21e 883
7b8d334a 884Most probably you do not want to modify the debugger, it contains enough
774d564b 885hooks to satisfy most needs. You may change the behaviour of debugger
36477c24 886from the debugger itself, using C<O>ptions, from the command line via
887C<PERLDB_OPTS> environment variable, and from I<customization files>.
a0d0e21e 888
889You can do some customization by setting up a F<.perldb> file which
890contains initialization code. For instance, you could make aliases
4e1d3b43 891like these (the last one is one people expect to be there):
a0d0e21e 892
4e1d3b43 893 $DB::alias{'len'} = 's/^len(.*)/p length($1)/';
a0d0e21e 894 $DB::alias{'stop'} = 's/^stop (at|in)/b/';
4e1d3b43 895 $DB::alias{'ps'} = 's/^ps\b/p scalar /';
896 $DB::alias{'quit'} = 's/^quit(\s*)/exit\$/';
897
36477c24 898One changes options from F<.perldb> file via calls like this one;
899
900 parse_options("NonStop=1 LineInfo=db.out AutoTrace=1 frame=2");
901
774d564b 902(the code is executed in the package C<DB>). Note that F<.perldb> is
903processed before processing C<PERLDB_OPTS>. If F<.perldb> defines the
36477c24 904subroutine C<afterinit>, it is called after all the debugger
774d564b 905initialization ends. F<.perldb> may be contained in the current
36477c24 906directory, or in the C<LOGDIR>/C<HOME> directory.
907
908If you want to modify the debugger, copy F<perl5db.pl> from the Perl
909library to another name and modify it as necessary. You'll also want
910to set your C<PERL5DB> environment variable to say something like this:
911
912 BEGIN { require "myperl5db.pl" }
913
914As the last resort, one can use C<PERL5DB> to customize debugger by
915directly setting internal variables or calling debugger functions.
916
4e1d3b43 917=head2 Readline Support
918
919As shipped, the only command line history supplied is a simplistic one
920that checks for leading exclamation points. However, if you install
921the Term::ReadKey and Term::ReadLine modules from CPAN, you will
922have full editing capabilities much like GNU I<readline>(3) provides.
923Look for these in the F<modules/by-module/Term> directory on CPAN.
924
54310121 925A rudimentary command line completion is also available.
e7ea3e70 926Unfortunately, the names of lexical variables are not available for
927completion.
928
4e1d3b43 929=head2 Editor Support for Debugging
930
931If you have GNU B<emacs> installed on your system, it can interact with
932the Perl debugger to provide an integrated software development
933environment reminiscent of its interactions with C debuggers.
934
935Perl is also delivered with a start file for making B<emacs> act like a
936syntax-directed editor that understands (some of) Perl's syntax. Look in
937the I<emacs> directory of the Perl source distribution.
938
939(Historically, a similar setup for interacting with B<vi> and the
940X11 window system had also been available, but at the time of this
941writing, no debugger support for B<vi> currently exists.)
942
943=head2 The Perl Profiler
944
945If you wish to supply an alternative debugger for Perl to run, just
946invoke your script with a colon and a package argument given to the B<-d>
947flag. One of the most popular alternative debuggers for Perl is
948B<DProf>, the Perl profiler. As of this writing, B<DProf> is not
949included with the standard Perl distribution, but it is expected to
950be included soon, for certain values of "soon".
951
952Meanwhile, you can fetch the Devel::Dprof module from CPAN. Assuming
953it's properly installed on your system, to profile your Perl program in
954the file F<mycode.pl>, just type:
955
956 perl -d:DProf mycode.pl
957
958When the script terminates the profiler will dump the profile information
959to a file called F<tmon.out>. A tool like B<dprofpp> (also supplied with
960the Devel::DProf package) can be used to interpret the information which is
961in that profile.
962
36477c24 963=head2 Debugger support in perl
4e1d3b43 964
e7ea3e70 965When you call the B<caller> function (see L<perlfunc/caller>) from the
966package DB, Perl sets the array @DB::args to contain the arguments the
54310121 967corresponding stack frame was called with.
4e1d3b43 968
36477c24 969If perl is run with B<-d> option, the following additional features
84902520 970are enabled (cf. L<perlvar/$^P>):
a0d0e21e 971
36477c24 972=over
4e1d3b43 973
36477c24 974=item *
4e1d3b43 975
36477c24 976Perl inserts the contents of C<$ENV{PERL5DB}> (or C<BEGIN {require
977'perl5db.pl'}> if not present) before the first line of the
978application.
4e1d3b43 979
36477c24 980=item *
4e1d3b43 981
7b8d334a 982The array C<@{"_E<lt>$filename"}> is the line-by-line contents of
774d564b 983$filename for all the compiled files. Same for C<eval>ed strings which
19799a22 984contain subroutines, or which are currently executed. The $filename
36477c24 985for C<eval>ed strings looks like C<(eval 34)>.
4e1d3b43 986
36477c24 987=item *
4e1d3b43 988
7b8d334a 989The hash C<%{"_E<lt>$filename"}> contains breakpoints and action (it is
36477c24 990keyed by line number), and individual entries are settable (as opposed
774d564b 991to the whole hash). Only true/false is important to Perl, though the
36477c24 992values used by F<perl5db.pl> have the form
774d564b 993C<"$break_condition\0$action">. Values are magical in numeric context:
36477c24 994they are zeros if the line is not breakable.
4e1d3b43 995
36477c24 996Same for evaluated strings which contain subroutines, or which are
7b8d334a 997currently executed. The $filename for C<eval>ed strings looks like
36477c24 998C<(eval 34)>.
4e1d3b43 999
36477c24 1000=item *
4e1d3b43 1001
7b8d334a 1002The scalar C<${"_E<lt>$filename"}> contains C<"_E<lt>$filename">. Same for
36477c24 1003evaluated strings which contain subroutines, or which are currently
7b8d334a 1004executed. The $filename for C<eval>ed strings looks like C<(eval
36477c24 100534)>.
4e1d3b43 1006
36477c24 1007=item *
4e1d3b43 1008
36477c24 1009After each C<require>d file is compiled, but before it is executed,
7b8d334a 1010C<DB::postponed(*{"_E<lt>$filename"})> is called (if subroutine
774d564b 1011C<DB::postponed> exists). Here the $filename is the expanded name of
7b8d334a 1012the C<require>d file (as found in values of %INC).
4e1d3b43 1013
36477c24 1014=item *
4e1d3b43 1015
36477c24 1016After each subroutine C<subname> is compiled existence of
774d564b 1017C<$DB::postponed{subname}> is checked. If this key exists,
36477c24 1018C<DB::postponed(subname)> is called (if subroutine C<DB::postponed>
1019exists).
4e1d3b43 1020
36477c24 1021=item *
4e1d3b43 1022
36477c24 1023A hash C<%DB::sub> is maintained, with keys being subroutine names,
774d564b 1024values having the form C<filename:startline-endline>. C<filename> has
36477c24 1025the form C<(eval 31)> for subroutines defined inside C<eval>s.
4e1d3b43 1026
36477c24 1027=item *
1028
5f05dabc 1029When execution of the application reaches a place that can have
1030a breakpoint, a call to C<DB::DB()> is performed if any one of
1031variables $DB::trace, $DB::single, or $DB::signal is true. (Note that
36477c24 1032these variables are not C<local>izable.) This feature is disabled when
1033the control is inside C<DB::DB()> or functions called from it (unless
e7ea3e70 1034C<$^D & (1E<lt>E<lt>30)>).
36477c24 1035
1036=item *
1037
5f05dabc 1038When execution of the application reaches a subroutine call, a call
36477c24 1039to C<&DB::sub>(I<args>) is performed instead, with C<$DB::sub> being
1040the name of the called subroutine. (Unless the subroutine is compiled
1041in the package C<DB>.)
4e1d3b43 1042
1043=back
a0d0e21e 1044
84902520 1045Note that if C<&DB::sub> needs some external data to be setup for it
1046to work, no subroutine call is possible until this is done. For the
1047standard debugger C<$DB::deep> (how many levels of recursion deep into
1048the debugger you can go before a mandatory break) gives an example of
1049such a dependency.
e7ea3e70 1050
84902520 1051The minimal working debugger consists of one line
e7ea3e70 1052
1053 sub DB::DB {}
1054
1055which is quite handy as contents of C<PERL5DB> environment
1056variable:
1057
1058 env "PERL5DB=sub DB::DB {}" perl -d your-script
1059
1060Another (a little bit more useful) minimal debugger can be created
1061with the only line being
1062
1063 sub DB::DB {print ++$i; scalar <STDIN>}
1064
1065This debugger would print the sequential number of encountered
1066statement, and would wait for your C<CR> to continue.
1067
1068The following debugger is quite functional:
1069
54310121 1070 {
1071 package DB;
1072 sub DB {}
e7ea3e70 1073 sub sub {print ++$i, " $sub\n"; &$sub}
1074 }
1075
1076It prints the sequential number of subroutine call and the name of the
774d564b 1077called subroutine. Note that C<&DB::sub> should be compiled into the
e7ea3e70 1078package C<DB>.
36477c24 1079
1080=head2 Debugger Internals
1081
1082At the start, the debugger reads your rc file (F<./.perldb> or
54310121 1083F<~/.perldb> under Unix), which can set important options. This file may
36477c24 1084define a subroutine C<&afterinit> to be executed after the debugger is
1085initialized.
1086
5f05dabc 1087After the rc file is read, the debugger reads environment variable
36477c24 1088PERLDB_OPTS and parses it as a rest of C<O ...> line in debugger prompt.
1089
1090It also maintains magical internal variables, such as C<@DB::dbline>,
1091C<%DB::dbline>, which are aliases for C<@{"::_<current_file"}>
774d564b 1092C<%{"::_<current_file"}>. Here C<current_file> is the currently
36477c24 1093selected (with the debugger's C<f> command, or by flow of execution)
1094file.
1095
774d564b 1096Some functions are provided to simplify customization. See L<"Debugger
1097Customization"> for description of C<DB::parse_options(string)>. The
36477c24 1098function C<DB::dump_trace(skip[, count])> skips the specified number
1d2dff63 1099of frames, and returns a list containing info about the caller
774d564b 1100frames (all if C<count> is missing). Each entry is a hash with keys
36477c24 1101C<context> (C<$> or C<@>), C<sub> (subroutine name, or info about
5f05dabc 1102eval), C<args> (C<undef> or a reference to an array), C<file>, and
36477c24 1103C<line>.
1104
54310121 1105The function C<DB::print_trace(FH, skip[, count[, short]])> prints
774d564b 1106formatted info about caller frames. The last two functions may be
36477c24 1107convenient as arguments to C<E<lt>>, C<E<lt>E<lt>> commands.
1108
a0d0e21e 1109=head2 Other resources
1110
1111You did try the B<-w> switch, didn't you?
1112
a77df738 1113=head2 BUGS
a0d0e21e 1114
4e1d3b43 1115You cannot get the stack frame information or otherwise debug functions
1116that were not compiled by Perl, such as C or C++ extensions.
a0d0e21e 1117
4e1d3b43 1118If you alter your @_ arguments in a subroutine (such as with B<shift>
68dc0745 1119or B<pop>, the stack backtrace will not show the original values.
a77df738 1120
1121=head1 Debugging Perl memory usage
1122
1123Perl is I<very> frivolous with memory. There is a saying that to
1124estimate memory usage of Perl, assume a reasonable algorithm of
c2611fb3 1125allocation, and multiply your estimates by 10. This is not absolutely
a77df738 1126true, but may give you a good grasp of what happens.
1127
1128Say, an integer cannot take less than 20 bytes of memory, a float
1129cannot take less than 24 bytes, a string cannot take less than 32
1130bytes (all these examples assume 32-bit architectures, the result are
1131much worse on 64-bit architectures). If a variable is accessed in two
1132of three different ways (which require an integer, a float, or a
1133string), the memory footprint may increase by another 20 bytes. A
1134sloppy malloc() implementation will make these numbers yet more.
1135
1136On the opposite end of the scale, a declaration like
1137
1138 sub foo;
1139
1140may take (on some versions of perl) up to 500 bytes of memory.
1141
1142Off-the-cuff anecdotal estimates of a code bloat give a factor around
11438. This means that the compiled form of reasonable (commented
1144indented etc.) code will take approximately 8 times more than the
1145disk space the code takes.
1146
1147There are two Perl-specific ways to analyze the memory usage:
1148$ENV{PERL_DEBUG_MSTATS} and B<-DL> switch. First one is available
1149only if perl is compiled with Perl's malloc(), the second one only if
1150Perl compiled with C<-DDEBUGGING> (as with giving C<-D optimise=-g>
1151option to F<Configure>).
1152
1153=head2 Using C<$ENV{PERL_DEBUG_MSTATS}>
1154
1155If your perl is using Perl's malloc(), and compiled with correct
1156switches (this is the default), then it will print memory usage
1157statistics after compiling your code (if C<$ENV{PERL_DEBUG_MSTATS}> >
11581), and before termination of the script (if
1159C<$ENV{PERL_DEBUG_MSTATS}> >= 1). The report format is similar to one
1160in the following example:
1161
1162 env PERL_DEBUG_MSTATS=2 perl -e "require Carp"
1163 Memory allocation statistics after compilation: (buckets 4(4)..8188(8192)
1164 14216 free: 130 117 28 7 9 0 2 2 1 0 0
1165 437 61 36 0 5
1166 60924 used: 125 137 161 55 7 8 6 16 2 0 1
1167 74 109 304 84 20
1168 Total sbrk(): 77824/21:119. Odd ends: pad+heads+chain+tail: 0+636+0+2048.
1169 Memory allocation statistics after execution: (buckets 4(4)..8188(8192)
1170 30888 free: 245 78 85 13 6 2 1 3 2 0 1
1171 315 162 39 42 11
1172 175816 used: 265 176 1112 111 26 22 11 27 2 1 1
1173 196 178 1066 798 39
1174 Total sbrk(): 215040/47:145. Odd ends: pad+heads+chain+tail: 0+2192+0+6144.
1175
1176It is possible to ask for such a statistic at arbitrary moment by
c2611fb3 1177using Devel::Peek::mstats() (module Devel::Peek is available on CPAN).
a77df738 1178
1179Here is the explanation of different parts of the format:
1180
1181=over
1182
1183=item C<buckets SMALLEST(APPROX)..GREATEST(APPROX)>
1184
1185Perl's malloc() uses bucketed allocations. Every request is rounded
1186up to the closest bucket size available, and a bucket of these size is
1187taken from the pool of the buckets of this size.
1188
1189The above line describes limits of buckets currently in use. Each
1190bucket has two sizes: memory footprint, and the maximal size of user
1191data which may be put into this bucket. Say, in the above example the
1192smallest bucket is both sizes 4. The biggest bucket has usable size
11938188, and the memory footprint 8192.
1194
1195With debugging Perl some buckets may have negative usable size. This
1196means that these buckets cannot (and will not) be used. For greater
1197buckets the memory footprint may be one page greater than a power of
11982. In such a case the corresponding power of two is printed instead
1199in the C<APPROX> field above.
1200
1201=item Free/Used
1202
1203The following 1 or 2 rows of numbers correspond to the number of
1204buckets of each size between C<SMALLEST> and C<GREATEST>. In the
1205first row the sizes (memory footprints) of buckets are powers of two
1206(or possibly one page greater). In the second row (if present) the
1207memory footprints of the buckets are between memory footprints of two
1208buckets "above".
1209
1210Say, with the above example the memory footprints are (with current
c2611fb3 1211algorithm)
a77df738 1212
1213 free: 8 16 32 64 128 256 512 1024 2048 4096 8192
1214 4 12 24 48 80
1215
1216With non-C<DEBUGGING> perl the buckets starting from C<128>-long ones
1217have 4-byte overhead, thus 8192-long bucket may take up to
12188188-byte-long allocations.
1219
1220=item C<Total sbrk(): SBRKed/SBRKs:CONTINUOUS>
1221
1222The first two fields give the total amount of memory perl sbrk()ed,
1223and number of sbrk()s used. The third number is what perl thinks
1224about continuity of returned chunks. As far as this number is
1225positive, malloc() will assume that it is probable that sbrk() will
1226provide continuous memory.
1227
1228The amounts sbrk()ed by external libraries is not counted.
1229
1230=item C<pad: 0>
1231
1232The amount of sbrk()ed memory needed to keep buckets aligned.
1233
1234=item C<heads: 2192>
1235
1236While memory overhead of bigger buckets is kept inside the bucket, for
1237smaller buckets it is kept in separate areas. This field gives the
1238total size of these areas.
1239
1240=item C<chain: 0>
1241
1242malloc() may want to subdivide a bigger bucket into smaller buckets.
1243If only a part of the deceased-bucket is left non-subdivided, the rest
1244is kept as an element of a linked list. This field gives the total
1245size of these chunks.
1246
1247=item C<tail: 6144>
1248
1249To minimize amount of sbrk()s malloc() asks for more memory. This
1250field gives the size of the yet-unused part, which is sbrk()ed, but
1251never touched.
1252
1253=back
1254
1255=head2 Example of using B<-DL> switch
1256
1257Below we show how to analyse memory usage by
1258
1259 do 'lib/auto/POSIX/autosplit.ix';
1260
1261The file in question contains a header and 146 lines similar to
1262
1263 sub getcwd ;
1264
1265B<Note:> I<the discussion below supposes 32-bit architecture. In the
1266newer versions of perl the memory usage of the constructs discussed
1267here is much improved, but the story discussed below is a real-life
1268story. This story is very terse, and assumes more than cursory
1269knowledge of Perl internals.>
1270
1271Here is the itemized list of Perl allocations performed during parsing
1272of this file:
1273
1274 !!! "after" at test.pl line 3.
1275 Id subtot 4 8 12 16 20 24 28 32 36 40 48 56 64 72 80 80+
1276 0 02 13752 . . . . 294 . . . . . . . . . . 4
1277 0 54 5545 . . 8 124 16 . . . 1 1 . . . . . 3
1278 5 05 32 . . . . . . . 1 . . . . . . . .
1279 6 02 7152 . . . . . . . . . . 149 . . . . .
1280 7 02 3600 . . . . . 150 . . . . . . . . . .
1281 7 03 64 . -1 . 1 . . 2 . . . . . . . . .
1282 7 04 7056 . . . . . . . . . . . . . . . 7
1283 7 17 38404 . . . . . . . 1 . . 442 149 . . 147 .
1284 9 03 2078 17 249 32 . . . . 2 . . . . . . . .
1285
1286
1287To see this list insert two C<warn('!...')> statements around the call:
1288
1289 warn('!');
1290 do 'lib/auto/POSIX/autosplit.ix';
1291 warn('!!! "after"');
1292
1293and run it with B<-DL> option. The first warn() will print memory
1294allocation info before the parsing of the file, and will memorize the
1295statistics at this point (we ignore what it prints). The second warn()
1296will print increments w.r.t. this memorized statistics. This is the
1297above printout.
1298
1299Different I<Id>s on the left correspond to different subsystems of
1300perl interpreter, they are just first argument given to perl memory
1301allocation API New(). To find what C<9 03> means C<grep> the perl
1302source for C<903>. You will see that it is F<util.c>, function
1303savepvn(). This function is used to store a copy of existing chunk of
1304memory. Using C debugger, one can see that it is called either
1305directly from gv_init(), or via sv_magic(), and gv_init() is called
1306from gv_fetchpv() - which is called from newSUB().
1307
1308B<Note:> to reach this place in debugger and skip all the calls to
1309savepvn during the compilation of the main script, set a C breakpoint
1310in Perl_warn(), C<continue> this point is reached, I<then> set
1311breakpoint in Perl_savepvn(). Note that you may need to skip a
1312handful of Perl_savepvn() which do not correspond to mass production
1313of CVs (there are more C<903> allocations than 146 similar lines of
1314F<lib/auto/POSIX/autosplit.ix>). Note also that C<Perl_> prefixes are
1315added by macroization code in perl header files to avoid conflicts
1316with external libraries.
1317
1318Anyway, we see that C<903> ids correspond to creation of globs, twice
1319per glob - for glob name, and glob stringification magic.
1320
1321Here are explanations for other I<Id>s above:
1322
1323=over
1324
1325=item C<717>
1326
1327is for creation of bigger C<XPV*> structures. In the above case it
1328creates 3 C<AV> per subroutine, one for a list of lexical variable
1329names, one for a scratchpad (which contains lexical variables and
1330C<targets>), and one for the array of scratchpads needed for
1331recursion.
1332
1333It also creates a C<GV> and a C<CV> per subroutine (all called from
1334start_subparse()).
1335
1336=item C<002>
1337
1338Creates C array corresponding to the C<AV> of scratchpads, and the
1339scratchpad itself (the first fake entry of this scratchpad is created
1340though the subroutine itself is not defined yet).
1341
1342It also creates C arrays to keep data for the stash (this is one HV,
1343but it grows, thus there are 4 big allocations: the big chunks are not
c2611fb3 1344freed, but are kept as additional arenas for C<SV> allocations).
a77df738 1345
1346=item C<054>
1347
1348creates a C<HEK> for the name of the glob for the subroutine (this
1349name is a key in a I<stash>).
1350
1351Big allocations with this I<Id> correspond to allocations of new
1352arenas to keep C<HE>.
1353
1354=item C<602>
1355
1356creates a C<GP> for the glob for the subroutine.
1357
1358=item C<702>
1359
1360creates the C<MAGIC> for the glob for the subroutine.
1361
1362=item C<704>
1363
1364creates I<arenas> which keep SVs.
1365
1366=back
1367
1368=head2 B<-DL> details
1369
1370If Perl is run with B<-DL> option, then warn()s which start with `!'
1371behave specially. They print a list of I<categories> of memory
1372allocations, and statistics of allocations of different sizes for
1373these categories.
1374
1375If warn() string starts with
1376
1377=over
1378
1379=item C<!!!>
1380
1381print changed categories only, print the differences in counts of allocations;
1382
1383=item C<!!>
1384
1385print grown categories only; print the absolute values of counts, and totals;
1386
1387=item C<!>
1388
1389print nonempty categories, print the absolute values of counts and totals.
1390
1391=back
1392
1393=head2 Limitations of B<-DL> statistic
1394
1395If an extension or an external library does not use Perl API to
1396allocate memory, these allocations are not counted.
1397
54dc92de 1398=head1 Debugging regular expressions
1399
1400There are two ways to enable debugging output for regular expressions.
1401
1402If your perl is compiled with C<-DDEBUGGING>, you may use the
1403B<-Dr> flag on the command line.
1404
1405Otherwise, one can C<use re 'debug'>, which has effects both at
1406compile time, and at run time (and is I<not> lexically scoped).
1407
1408=head2 Compile-time output
1409
1410The debugging output for the compile time looks like this:
1411
1412 compiling RE `[bc]d(ef*g)+h[ij]k$'
1413 size 43 first at 1
1414 1: ANYOF(11)
1415 11: EXACT <d>(13)
1416 13: CURLYX {1,32767}(27)
1417 15: OPEN1(17)
1418 17: EXACT <e>(19)
1419 19: STAR(22)
1420 20: EXACT <f>(0)
1421 22: EXACT <g>(24)
1422 24: CLOSE1(26)
1423 26: WHILEM(0)
1424 27: NOTHING(28)
1425 28: EXACT <h>(30)
1426 30: ANYOF(40)
1427 40: EXACT <k>(42)
1428 42: EOL(43)
1429 43: END(0)
1430 anchored `de' at 1 floating `gh' at 3..2147483647 (checking floating)
1431 stclass `ANYOF' minlen 7
1432
1433The first line shows the pre-compiled form of the regexp, and the
1434second shows the size of the compiled form (in arbitrary units,
1435usually 4-byte words) and the label I<id> of the first node which
1436does a match.
1437
1438The last line (split into two lines in the above) contains the optimizer
1439info. In the example shown, the optimizer found that the match
1440should contain a substring C<de> at the offset 1, and substring C<gh>
1441at some offset between 3 and infinity. Moreover, when checking for
1442these substrings (to abandon impossible matches quickly) it will check
1443for the substring C<gh> before checking for the substring C<de>. The
1444optimizer may also use the knowledge that the match starts (at the
1445C<first> I<id>) with a character class, and the match cannot be
1446shorter than 7 chars.
1447
1448The fields of interest which may appear in the last line are
1449
1450=over
1451
1452=item C<anchored> I<STRING> C<at> I<POS>
1453
1454=item C<floating> I<STRING> C<at> I<POS1..POS2>
1455
1456see above;
1457
1458=item C<matching floating/anchored>
1459
1460which substring to check first;
1461
1462=item C<minlen>
1463
1464the minimal length of the match;
1465
1466=item C<stclass> I<TYPE>
1467
1468The type of the first matching node.
1469
1470=item C<noscan>
1471
1472which advises to not scan for the found substrings;
1473
1474=item C<isall>
1475
1476which says that the optimizer info is in fact all that the regular
1477expression contains (thus one does not need to enter the RE engine at
1478all);
1479
1480=item C<GPOS>
1481
1482if the pattern contains C<\G>;
1483
1484=item C<plus>
1485
1486if the pattern starts with a repeated char (as in C<x+y>);
1487
1488=item C<implicit>
1489
1490if the pattern starts with C<.*>;
1491
1492=item C<with eval>
1493
1494if the pattern contain eval-groups (see L<perlre/(?{ code })>);
1495
1496=item C<anchored(TYPE)>
1497
1498if the pattern may
1499match only at a handful of places (with C<TYPE> being
1500C<BOL>, C<MBOL>, or C<GPOS>, see the table below).
1501
1502=back
1503
1504If a substring is known to match at end-of-line only, it may be
1505followed by C<$>, as in C<floating `k'$>.
1506
1507The optimizer-specific info is used to avoid entering (a slow) RE
1508engine on strings which will definitely not match. If C<isall> flag
1509is set, a call to the RE engine may be avoided even when optimizer
1510found an appropriate place for the match.
1511
1512The rest of the output contains the list of I<nodes> of the compiled
1513form of the RE. Each line has format
1514
1515C< >I<id>: I<TYPE> I<OPTIONAL-INFO> (I<next-id>)
1516
1517=head2 Types of nodes
1518
1519Here is the list of possible types with short descriptions:
1520
1521 # TYPE arg-description [num-args] [longjump-len] DESCRIPTION
1522
1523 # Exit points
1524 END no End of program.
1525 SUCCEED no Return from a subroutine, basically.
1526
1527 # Anchors:
1528 BOL no Match "" at beginning of line.
1529 MBOL no Same, assuming multiline.
1530 SBOL no Same, assuming singleline.
1531 EOS no Match "" at end of string.
1532 EOL no Match "" at end of line.
1533 MEOL no Same, assuming multiline.
1534 SEOL no Same, assuming singleline.
1535 BOUND no Match "" at any word boundary
1536 BOUNDL no Match "" at any word boundary
1537 NBOUND no Match "" at any word non-boundary
1538 NBOUNDL no Match "" at any word non-boundary
1539 GPOS no Matches where last m//g left off.
1540
1541 # [Special] alternatives
1542 ANY no Match any one character (except newline).
1543 SANY no Match any one character.
1544 ANYOF sv Match character in (or not in) this class.
1545 ALNUM no Match any alphanumeric character
1546 ALNUML no Match any alphanumeric char in locale
1547 NALNUM no Match any non-alphanumeric character
1548 NALNUML no Match any non-alphanumeric char in locale
1549 SPACE no Match any whitespace character
1550 SPACEL no Match any whitespace char in locale
1551 NSPACE no Match any non-whitespace character
1552 NSPACEL no Match any non-whitespace char in locale
1553 DIGIT no Match any numeric character
1554 NDIGIT no Match any non-numeric character
1555
1556 # BRANCH The set of branches constituting a single choice are hooked
1557 # together with their "next" pointers, since precedence prevents
1558 # anything being concatenated to any individual branch. The
1559 # "next" pointer of the last BRANCH in a choice points to the
1560 # thing following the whole choice. This is also where the
1561 # final "next" pointer of each individual branch points; each
1562 # branch starts with the operand node of a BRANCH node.
1563 #
1564 BRANCH node Match this alternative, or the next...
1565
1566 # BACK Normal "next" pointers all implicitly point forward; BACK
1567 # exists to make loop structures possible.
1568 # not used
1569 BACK no Match "", "next" ptr points backward.
1570
1571 # Literals
1572 EXACT sv Match this string (preceded by length).
1573 EXACTF sv Match this string, folded (prec. by length).
1574 EXACTFL sv Match this string, folded in locale (w/len).
1575
1576 # Do nothing
1577 NOTHING no Match empty string.
1578 # A variant of above which delimits a group, thus stops optimizations
1579 TAIL no Match empty string. Can jump here from outside.
1580
1581 # STAR,PLUS '?', and complex '*' and '+', are implemented as circular
1582 # BRANCH structures using BACK. Simple cases (one character
1583 # per match) are implemented with STAR and PLUS for speed
1584 # and to minimize recursive plunges.
1585 #
1586 STAR node Match this (simple) thing 0 or more times.
1587 PLUS node Match this (simple) thing 1 or more times.
1588
1589 CURLY sv 2 Match this simple thing {n,m} times.
1590 CURLYN no 2 Match next-after-this simple thing
1591 # {n,m} times, set parenths.
1592 CURLYM no 2 Match this medium-complex thing {n,m} times.
1593 CURLYX sv 2 Match this complex thing {n,m} times.
1594
1595 # This terminator creates a loop structure for CURLYX
1596 WHILEM no Do curly processing and see if rest matches.
1597
1598 # OPEN,CLOSE,GROUPP ...are numbered at compile time.
1599 OPEN num 1 Mark this point in input as start of #n.
1600 CLOSE num 1 Analogous to OPEN.
1601
1602 REF num 1 Match some already matched string
1603 REFF num 1 Match already matched string, folded
1604 REFFL num 1 Match already matched string, folded in loc.
1605
1606 # grouping assertions
1607 IFMATCH off 1 2 Succeeds if the following matches.
1608 UNLESSM off 1 2 Fails if the following matches.
1609 SUSPEND off 1 1 "Independent" sub-RE.
1610 IFTHEN off 1 1 Switch, should be preceeded by switcher .
1611 GROUPP num 1 Whether the group matched.
1612
1613 # Support for long RE
1614 LONGJMP off 1 1 Jump far away.
1615 BRANCHJ off 1 1 BRANCH with long offset.
1616
1617 # The heavy worker
1618 EVAL evl 1 Execute some Perl code.
1619
1620 # Modifiers
1621 MINMOD no Next operator is not greedy.
1622 LOGICAL no Next opcode should set the flag only.
1623
1624 # This is not used yet
1625 RENUM off 1 1 Group with independently numbered parens.
1626
1627 # This is not really a node, but an optimized away piece of a "long" node.
1628 # To simplify debugging output, we mark it as if it were a node
1629 OPTIMIZED off Placeholder for dump.
1630
1631=head2 Run-time output
1632
1633First of all, when doing a match, one may get no run-time output even
1634if debugging is enabled. this means that the RE engine was never
1635entered, all of the job was done by the optimizer.
1636
1637If RE engine was entered, the output may look like this:
1638
1639 Matching `[bc]d(ef*g)+h[ij]k$' against `abcdefg__gh__'
1640 Setting an EVAL scope, savestack=3
1641 2 <ab> <cdefg__gh_> | 1: ANYOF
1642 3 <abc> <defg__gh_> | 11: EXACT <d>
1643 4 <abcd> <efg__gh_> | 13: CURLYX {1,32767}
1644 4 <abcd> <efg__gh_> | 26: WHILEM
1645 0 out of 1..32767 cc=effff31c
1646 4 <abcd> <efg__gh_> | 15: OPEN1
1647 4 <abcd> <efg__gh_> | 17: EXACT <e>
1648 5 <abcde> <fg__gh_> | 19: STAR
1649 EXACT <f> can match 1 times out of 32767...
1650 Setting an EVAL scope, savestack=3
1651 6 <bcdef> <g__gh__> | 22: EXACT <g>
1652 7 <bcdefg> <__gh__> | 24: CLOSE1
1653 7 <bcdefg> <__gh__> | 26: WHILEM
1654 1 out of 1..32767 cc=effff31c
1655 Setting an EVAL scope, savestack=12
1656 7 <bcdefg> <__gh__> | 15: OPEN1
1657 7 <bcdefg> <__gh__> | 17: EXACT <e>
1658 restoring \1 to 4(4)..7
1659 failed, try continuation...
1660 7 <bcdefg> <__gh__> | 27: NOTHING
1661 7 <bcdefg> <__gh__> | 28: EXACT <h>
1662 failed...
1663 failed...
1664
1665The most significant information in the output is about the particular I<node>
1666of the compiled RE which is currently being tested against the target string.
1667The format of these lines is
1668
1669C< >I<STRING-OFFSET> <I<PRE-STRING>> <I<POST-STRING>> |I<ID>: I<TYPE>
1670
1671The I<TYPE> info is indented with respect to the backtracking level.
1672Other incidental information appears interspersed within.
1673
a77df738 1674=cut