clarify docs on return value from binding operators
[p5sagit/p5-mst-13.2.git] / pod / perldebug.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
3perldebug - Perl debugging
4
5=head1 DESCRIPTION
6
7First of all, have you tried using the B<-w> switch?
8
4e1d3b43 9=head1 The Perl Debugger
10
26f28346 11"As soon as we started programming, we found to our
12surprise that it wasn't as easy to get programs right
13as we had thought. Debugging had to be discovered.
14I can remember the exact instant when I realized that
15a large part of my life from then on was going to be
16spent in finding mistakes in my own programs."
84902520 17
18I< --Maurice Wilkes, 1949>
26f28346 19
4e1d3b43 20If you invoke Perl with the B<-d> switch, your script runs under the
21Perl source debugger. This works like an interactive Perl
22environment, prompting for debugger commands that let you examine
68dc0745 23source code, set breakpoints, get stack backtraces, change the values of
4e1d3b43 24variables, etc. This is so convenient that you often fire up
54310121 25the debugger all by itself just to test out Perl constructs
4e1d3b43 26interactively to see what they do. For example:
27
28 perl -d -e 42
29
30In Perl, the debugger is not a separate program as it usually is in the
31typical compiled environment. Instead, the B<-d> flag tells the compiler
32to insert source information into the parse trees it's about to hand off
33to the interpreter. That means your code must first compile correctly
34for the debugger to work on it. Then when the interpreter starts up, it
54310121 35preloads a Perl library file containing the debugger itself.
4e1d3b43 36
37The program will halt I<right before> the first run-time executable
38statement (but see below regarding compile-time statements) and ask you
39to enter a debugger command. Contrary to popular expectations, whenever
40the debugger halts and shows you a line of code, it always displays the
41line it's I<about> to execute, rather than the one it has just executed.
42
43Any command not recognized by the debugger is directly executed
44(C<eval>'d) as Perl code in the current package. (The debugger uses the
45DB package for its own state information.)
46
47Leading white space before a command would cause the debugger to think
48it's I<NOT> a debugger command but for Perl, so be careful not to do
49that.
50
51=head2 Debugger Commands
52
53The debugger understands the following commands:
a0d0e21e 54
55=over 12
56
4e1d3b43 57=item h [command]
58
54310121 59Prints out a help message.
4e1d3b43 60
61If you supply another debugger command as an argument to the C<h> command,
62it prints out the description for just that command. The special
63argument of C<h h> produces a more compact help listing, designed to fit
64together on one screen.
65
7b8d334a 66If the output of the C<h> command (or any command, for that matter) scrolls
4e1d3b43 67past your screen, either precede the command with a leading pipe symbol so
68it's run through your pager, as in
69
70 DB> |h
71
e7ea3e70 72You may change the pager which is used via C<O pager=...> command.
73
4e1d3b43 74=item p expr
75
36477c24 76Same as C<print {$DB::OUT} expr> in the current package. In particular,
5f05dabc 77because this is just Perl's own B<print> function, this means that nested
4e1d3b43 78data structures and objects are not dumped, unlike with the C<x> command.
79
e7ea3e70 80The C<DB::OUT> filehandle is opened to F</dev/tty>, regardless of
81where STDOUT may be redirected to.
82
4e1d3b43 83=item x expr
84
54310121 85Evaluates its expression in list context and dumps out the result
4e1d3b43 86in a pretty-printed fashion. Nested data structures are printed out
87recursively, unlike the C<print> function.
88
36477c24 89The details of printout are governed by multiple C<O>ptions.
90
4e1d3b43 91=item V [pkg [vars]]
92
93Display all (or some) variables in package (defaulting to the C<main>
94package) using a data pretty-printer (hashes show their keys and values so
95you see what's what, control characters are made printable, etc.). Make
96sure you don't put the type specifier (like C<$>) there, just the symbol
97names, like this:
98
99 V DB filename line
100
101Use C<~pattern> and C<!pattern> for positive and negative regexps.
a0d0e21e 102
4e1d3b43 103Nested data structures are printed out in a legible fashion, unlike
104the C<print> function.
105
36477c24 106The details of printout are governed by multiple C<O>ptions.
107
4e1d3b43 108=item X [vars]
109
110Same as C<V currentpackage [vars]>.
a0d0e21e 111
112=item T
113
68dc0745 114Produce a stack backtrace. See below for details on its output.
a0d0e21e 115
4e1d3b43 116=item s [expr]
a0d0e21e 117
118Single step. Executes until it reaches the beginning of another
4e1d3b43 119statement, descending into subroutine calls. If an expression is
120supplied that includes function calls, it too will be single-stepped.
a0d0e21e 121
e7ea3e70 122=item n [expr]
a0d0e21e 123
124Next. Executes over subroutine calls, until it reaches the beginning
774d564b 125of the next statement. If an expression is supplied that includes
126function calls, those functions will be executed with stops before
127each statement.
a0d0e21e 128
dce0c882 129=item r
130
131Continue until return from the current subroutine. Dump the return
04cf9722 132value, if the PrintRet option is set (default).
dce0c882 133
c47ff5f1 134=item <CR>
a0d0e21e 135
4e1d3b43 136Repeat last C<n> or C<s> command.
a0d0e21e 137
36477c24 138=item c [line|sub]
a0d0e21e 139
4e1d3b43 140Continue, optionally inserting a one-time-only breakpoint
36477c24 141at the specified line or subroutine.
a0d0e21e 142
4e1d3b43 143=item l
a0d0e21e 144
4e1d3b43 145List next window of lines.
a0d0e21e 146
147=item l min+incr
148
4e1d3b43 149List C<incr+1> lines starting at C<min>.
a0d0e21e 150
151=item l min-max
152
c47ff5f1 153List lines C<min> through C<max>. C<l -> is synonymous to C<->.
a0d0e21e 154
155=item l line
156
4e1d3b43 157List a single line.
a0d0e21e 158
4e1d3b43 159=item l subname
a0d0e21e 160
83ee9e09 161List first window of lines from subroutine. I<subname> may
162be a variable which contains a code reference.
a0d0e21e 163
164=item -
165
4e1d3b43 166List previous window of lines.
a0d0e21e 167
4e1d3b43 168=item w [line]
a0d0e21e 169
4e1d3b43 170List window (a few lines) around the current line.
a0d0e21e 171
4e1d3b43 172=item .
a0d0e21e 173
4e1d3b43 174Return debugger pointer to the last-executed line and
175print it out.
176
177=item f filename
178
774d564b 179Switch to viewing a different file or eval statement. If C<filename>
e7ea3e70 180is not a full filename as found in values of %INC, it is considered as
181a regexp.
a0d0e21e 182
bee32ff8 183C<eval>ed strings (when accessible) are considered to be filenames:
184C<f (eval 7)> and C<f eval 7\b> access the body of the 7th C<eval>ed string
185(in the order of execution). The bodies of currently executed C<eval>
186and of C<eval>ed strings which define subroutines are saved, thus are
187accessible by this mechanism.
188
a0d0e21e 189=item /pattern/
190
4e1d3b43 191Search forwards for pattern; final / is optional.
a0d0e21e 192
193=item ?pattern?
194
4e1d3b43 195Search backwards for pattern; final ? is optional.
a0d0e21e 196
197=item L
198
36477c24 199List all breakpoints and actions.
a0d0e21e 200
4e1d3b43 201=item S [[!]pattern]
a0d0e21e 202
4e1d3b43 203List subroutine names [not] matching pattern.
a0d0e21e 204
205=item t
206
36477c24 207Toggle trace mode (see also C<AutoTrace> C<O>ption).
4e1d3b43 208
209=item t expr
210
211Trace through execution of expr. For example:
212
213 $ perl -de 42
214 Stack dump during die enabled outside of evals.
a0d0e21e 215
4e1d3b43 216 Loading DB routines from perl5db.pl patch level 0.94
217 Emacs support available.
218
219 Enter h or `h h' for help.
220
221 main::(-e:1): 0
222 DB<1> sub foo { 14 }
223
224 DB<2> sub bar { 3 }
225
226 DB<3> t print foo() * bar()
227 main::((eval 172):3): print foo() + bar();
228 main::foo((eval 168):2):
229 main::bar((eval 170):2):
230 42
36477c24 231
232or, with the C<O>ption C<frame=2> set,
233
234 DB<4> O f=2
235 frame = '2'
236 DB<5> t print foo() * bar()
237 3: foo() * bar()
238 entering main::foo
239 2: sub foo { 14 };
240 exited main::foo
241 entering main::bar
242 2: sub bar { 3 };
243 exited main::bar
244 42
4e1d3b43 245
246=item b [line] [condition]
a0d0e21e 247
248Set a breakpoint. If line is omitted, sets a breakpoint on the line
4e1d3b43 249that is about to be executed. If a condition is specified, it's
a0d0e21e 250evaluated each time the statement is reached and a breakpoint is taken
5f05dabc 251only if the condition is true. Breakpoints may be set on only lines
4e1d3b43 252that begin an executable statement. Conditions don't use B<if>:
a0d0e21e 253
254 b 237 $x > 30
36477c24 255 b 237 ++$count237 < 11
a0d0e21e 256 b 33 /pattern/i
257
4e1d3b43 258=item b subname [condition]
a0d0e21e 259
83ee9e09 260Set a breakpoint at the first line of the named subroutine. I<subname> may
261be a variable which contains a code reference (in this case I<condition>
262is not supported).
a0d0e21e 263
36477c24 264=item b postpone subname [condition]
265
266Set breakpoint at first line of subroutine after it is compiled.
267
268=item b load filename
269
774d564b 270Set breakpoint at the first executed line of the file. Filename should
e7ea3e70 271be a full name as found in values of %INC.
272
273=item b compile subname
274
275Sets breakpoint at the first statement executed after the subroutine
276is compiled.
36477c24 277
4e1d3b43 278=item d [line]
a0d0e21e 279
4e1d3b43 280Delete a breakpoint at the specified line. If line is omitted, deletes
281the breakpoint on the line that is about to be executed.
a0d0e21e 282
283=item D
284
4e1d3b43 285Delete all installed breakpoints.
286
287=item a [line] command
288
289Set an action to be done before the line is executed.
290The sequence of steps taken by the debugger is
291
8ebc5c01 292 1. check for a breakpoint at this line
293 2. print the line if necessary (tracing)
294 3. do any actions associated with that line
295 4. prompt user if at a breakpoint or in single-step
296 5. evaluate line
a0d0e21e 297
7b8d334a 298For example, this will print out $foo every time line
4e1d3b43 29953 is passed:
a0d0e21e 300
4e1d3b43 301 a 53 print "DB FOUND $foo\n"
a0d0e21e 302
303=item A
304
4e1d3b43 305Delete all installed actions.
306
6ee623d5 307=item W [expr]
308
309Add a global watch-expression.
310
311=item W
312
313Delete all watch-expressions.
314
4e1d3b43 315=item O [opt[=val]] [opt"val"] [opt?]...
316
317Set or query values of options. val defaults to 1. opt can
318be abbreviated. Several options can be listed.
319
320=over 12
321
e7ea3e70 322=item C<recallCommand>, C<ShellBang>
4e1d3b43 323
324The characters used to recall command or spawn shell. By
325default, these are both set to C<!>.
326
e7ea3e70 327=item C<pager>
4e1d3b43 328
329Program to use for output of pager-piped commands (those
330beginning with a C<|> character.) By default,
331C<$ENV{PAGER}> will be used.
332
e7ea3e70 333=item C<tkRunning>
36477c24 334
335Run Tk while prompting (with ReadLine).
336
e7ea3e70 337=item C<signalLevel>, C<warnLevel>, C<dieLevel>
338
774d564b 339Level of verbosity. By default the debugger is in a sane verbose mode,
e7ea3e70 340thus it will print backtraces on all the warnings and die-messages
341which are going to be printed out, and will print a message when
54310121 342interesting uncaught signals arrive.
36477c24 343
774d564b 344To disable this behaviour, set these values to 0. If C<dieLevel> is 2,
e7ea3e70 345then the messages which will be caught by surrounding C<eval> are also
346printed.
36477c24 347
e7ea3e70 348=item C<AutoTrace>
36477c24 349
e7ea3e70 350Trace mode (similar to C<t> command, but can be put into
351C<PERLDB_OPTS>).
36477c24 352
e7ea3e70 353=item C<LineInfo>
36477c24 354
e7ea3e70 355File or pipe to print line number info to. If it is a pipe (say,
356C<|visual_perl_db>), then a short, "emacs like" message is used.
36477c24 357
358=item C<inhibit_exit>
359
360If 0, allows I<stepping off> the end of the script.
361
54310121 362=item C<PrintRet>
36477c24 363
04cf9722 364Print return value after C<r> command if set (default).
36477c24 365
28d1fb14 366=item C<ornaments>
367
3e3baf6d 368affects screen appearance of the command line (see L<Term::ReadLine>).
28d1fb14 369
54310121 370=item C<frame>
36477c24 371
372affects printing messages on entry and exit from subroutines. If
373C<frame & 2> is false, messages are printed on entry only. (Printing
5f05dabc 374on exit may be useful if inter(di)spersed with other messages.)
36477c24 375
376If C<frame & 4>, arguments to functions are printed as well as the
774d564b 377context and caller info. If C<frame & 8>, overloaded C<stringify> and
28d1fb14 378C<tie>d C<FETCH> are enabled on the printed arguments. If C<frame &
37916>, the return value from the subroutine is printed as well.
380
381The length at which the argument list is truncated is governed by the
382next option:
e7ea3e70 383
384=item C<maxTraceLen>
385
386length at which the argument list is truncated when C<frame> option's
387bit 4 is set.
36477c24 388
4e1d3b43 389=back
390
391The following options affect what happens with C<V>, C<X>, and C<x>
392commands:
393
394=over 12
395
e7ea3e70 396=item C<arrayDepth>, C<hashDepth>
4e1d3b43 397
398Print only first N elements ('' for all).
399
e7ea3e70 400=item C<compactDump>, C<veryCompact>
4e1d3b43 401
774d564b 402Change style of array and hash dump. If C<compactDump>, short array
e7ea3e70 403may be printed on one line.
4e1d3b43 404
e7ea3e70 405=item C<globPrint>
4e1d3b43 406
407Whether to print contents of globs.
408
e7ea3e70 409=item C<DumpDBFiles>
4e1d3b43 410
411Dump arrays holding debugged files.
412
e7ea3e70 413=item C<DumpPackages>
4e1d3b43 414
415Dump symbol tables of packages.
416
6ee623d5 417=item C<DumpReused>
418
419Dump contents of "reused" addresses.
420
e7ea3e70 421=item C<quote>, C<HighBit>, C<undefPrint>
422
774d564b 423Change style of string dump. Default value of C<quote> is C<auto>, one
e7ea3e70 424can enable either double-quotish dump, or single-quotish by setting it
774d564b 425to C<"> or C<'>. By default, characters with high bit set are printed
e7ea3e70 426I<as is>.
427
54310121 428=item C<UsageOnly>
4e1d3b43 429
774d564b 430I<very> rudimentally per-package memory usage dump. Calculates total
e7ea3e70 431size of strings in variables in the package.
4e1d3b43 432
36477c24 433=back
4e1d3b43 434
36477c24 435During startup options are initialized from C<$ENV{PERLDB_OPTS}>.
436You can put additional initialization options C<TTY>, C<noTTY>,
437C<ReadLine>, and C<NonStop> there.
438
439Example rc file:
4e1d3b43 440
e7ea3e70 441 &parse_options("NonStop=1 LineInfo=db.out AutoTrace");
4e1d3b43 442
36477c24 443The script will run without human intervention, putting trace information
444into the file I<db.out>. (If you interrupt it, you would better reset
445C<LineInfo> to something "interactive"!)
4e1d3b43 446
36477c24 447=over 12
4e1d3b43 448
36477c24 449=item C<TTY>
4e1d3b43 450
36477c24 451The TTY to use for debugging I/O.
452
36477c24 453=item C<noTTY>
454
774d564b 455If set, goes in C<NonStop> mode, and would not connect to a TTY. If
36477c24 456interrupt (or if control goes to debugger via explicit setting of
457$DB::signal or $DB::single from the Perl script), connects to a TTY
458specified by the C<TTY> option at startup, or to a TTY found at
459runtime using C<Term::Rendezvous> module of your choice.
460
461This module should implement a method C<new> which returns an object
462with two methods: C<IN> and C<OUT>, returning two filehandles to use
774d564b 463for debugging input and output correspondingly. Method C<new> may
36477c24 464inspect an argument which is a value of C<$ENV{PERLDB_NOTTY}> at
465startup, or is C<"/tmp/perldbtty$$"> otherwise.
466
467=item C<ReadLine>
468
469If false, readline support in debugger is disabled, so you can debug
470ReadLine applications.
471
472=item C<NonStop>
473
54310121 474If set, debugger goes into noninteractive mode until interrupted, or
36477c24 475programmatically by setting $DB::signal or $DB::single.
476
477=back
478
479Here's an example of using the C<$ENV{PERLDB_OPTS}> variable:
4e1d3b43 480
e7ea3e70 481 $ PERLDB_OPTS="N f=2" perl -d myprogram
4e1d3b43 482
483will run the script C<myprogram> without human intervention, printing
484out the call tree with entry and exit points. Note that C<N f=2> is
774d564b 485equivalent to C<NonStop=1 frame=2>. Note also that at the moment when
4e1d3b43 486this documentation was written all the options to the debugger could
36477c24 487be uniquely abbreviated by the first letter (with exception of
488C<Dump*> options).
4e1d3b43 489
36477c24 490Other examples may include
a0d0e21e 491
e7ea3e70 492 $ PERLDB_OPTS="N f A L=listing" perl -d myprogram
a0d0e21e 493
54310121 494- runs script noninteractively, printing info on each entry into a
36477c24 495subroutine and each executed line into the file F<listing>. (If you
496interrupt it, you would better reset C<LineInfo> to something
497"interactive"!)
498
499
e7ea3e70 500 $ env "PERLDB_OPTS=R=0 TTY=/dev/ttyc" perl -d myprogram
36477c24 501
502may be useful for debugging a program which uses C<Term::ReadLine>
774d564b 503itself. Do not forget detach shell from the TTY in the window which
36477c24 504corresponds to F</dev/ttyc>, say, by issuing a command like
505
e7ea3e70 506 $ sleep 1000000
36477c24 507
508See L<"Debugger Internals"> below for more details.
509
c47ff5f1 510=item < [ command ]
36477c24 511
512Set an action (Perl command) to happen before every debugger prompt.
4a6725af 513A multi-line command may be entered by backslashing the newlines. If
36477c24 514C<command> is missing, resets the list of actions.
515
c47ff5f1 516=item << command
36477c24 517
518Add an action (Perl command) to happen before every debugger prompt.
4a6725af 519A multi-line command may be entered by backslashing the newlines.
a0d0e21e 520
c47ff5f1 521=item > command
a0d0e21e 522
36477c24 523Set an action (Perl command) to happen after the prompt when you've
4a6725af 524just given a command to return to executing the script. A multi-line
36477c24 525command may be entered by backslashing the newlines. If C<command> is
526missing, resets the list of actions.
527
c47ff5f1 528=item >> command
36477c24 529
530Adds an action (Perl command) to happen after the prompt when you've
4a6725af 531just given a command to return to executing the script. A multi-line
36477c24 532command may be entered by backslashing the newlines.
533
534=item { [ command ]
535
536Set an action (debugger command) to happen before every debugger prompt.
4a6725af 537A multi-line command may be entered by backslashing the newlines. If
36477c24 538C<command> is missing, resets the list of actions.
539
540=item {{ command
541
542Add an action (debugger command) to happen before every debugger prompt.
4a6725af 543A multi-line command may be entered by backslashing the newlines.
a0d0e21e 544
4e1d3b43 545=item ! number
a0d0e21e 546
4e1d3b43 547Redo a previous command (default previous command).
a0d0e21e 548
4e1d3b43 549=item ! -number
a0d0e21e 550
4e1d3b43 551Redo number'th-to-last command.
a0d0e21e 552
4e1d3b43 553=item ! pattern
a0d0e21e 554
4e1d3b43 555Redo last command that started with pattern.
556See C<O recallCommand>, too.
a0d0e21e 557
4e1d3b43 558=item !! cmd
a0d0e21e 559
4e1d3b43 560Run cmd in a subprocess (reads from DB::IN, writes to DB::OUT)
561See C<O shellBang> too.
a0d0e21e 562
563=item H -number
564
565Display last n commands. Only commands longer than one character are
566listed. If number is omitted, lists them all.
567
568=item q or ^D
569
36477c24 570Quit. ("quit" doesn't work for this.) This is the only supported way
571to exit the debugger, though typing C<exit> twice may do it too.
572
573Set an C<O>ption C<inhibit_exit> to 0 if you want to be able to I<step
19799a22 574off> the end the script. You may also need to set $finished to 0 at
36477c24 575some moment if you want to step through global destruction.
a0d0e21e 576
4e1d3b43 577=item R
578
579Restart the debugger by B<exec>ing a new session. It tries to maintain
580your history across this, but internal settings and command line options
581may be lost.
582
5f05dabc 583Currently the following setting are preserved: history, breakpoints,
54310121 584actions, debugger C<O>ptions, and the following command line
5f05dabc 585options: B<-w>, B<-I>, and B<-e>.
36477c24 586
4e1d3b43 587=item |dbcmd
588
589Run debugger command, piping DB::OUT to current pager.
590
591=item ||dbcmd
592
593Same as C<|dbcmd> but DB::OUT is temporarily B<select>ed as well.
594Often used with commands that would otherwise produce long
595output, such as
596
597 |V main
598
599=item = [alias value]
600
e7ea3e70 601Define a command alias, like
602
603 = quit q
604
605or list current aliases.
4e1d3b43 606
a0d0e21e 607=item command
608
609Execute command as a Perl statement. A missing semicolon will be
610supplied.
611
e7ea3e70 612=item m expr
a0d0e21e 613
e7ea3e70 614The expression is evaluated, and the methods which may be applied to
615the result are listed.
616
617=item m package
618
619The methods which may be applied to objects in the C<package> are listed.
a0d0e21e 620
621=back
622
e7ea3e70 623=head2 Debugger input/output
624
625=over 8
626
627=item Prompt
628
4e1d3b43 629The debugger prompt is something like
630
631 DB<8>
632
633or even
634
635 DB<<17>>
636
637where that number is the command number, which you'd use to access with
54310121 638the builtin B<csh>-like history mechanism, e.g., C<!17> would repeat
4e1d3b43 639command number 17. The number of angle brackets indicates the depth of
640the debugger. You could get more than one set of brackets, for example, if
641you'd already at a breakpoint and then printed out the result of a
36477c24 642function call that itself also has a breakpoint, or you step into an
643expression via C<s/n/t expression> command.
4e1d3b43 644
54310121 645=item Multiline commands
e7ea3e70 646
4a6725af 647If you want to enter a multi-line command, such as a subroutine
e7ea3e70 648definition with several statements, or a format, you may escape the
649newline that would normally end the debugger command with a backslash.
650Here's an example:
a0d0e21e 651
4e1d3b43 652 DB<1> for (1..4) { \
653 cont: print "ok\n"; \
654 cont: }
655 ok
656 ok
657 ok
658 ok
659
660Note that this business of escaping a newline is specific to interactive
661commands typed into the debugger.
662
e7ea3e70 663=item Stack backtrace
664
68dc0745 665Here's an example of what a stack backtrace via C<T> command might
e7ea3e70 666look like:
4e1d3b43 667
668 $ = main::infested called from file `Ambulation.pm' line 10
669 @ = Ambulation::legs(1, 2, 3, 4) called from file `camel_flea' line 7
670 $ = main::pests('bactrian', 4) called from file `camel_flea' line 4
671
672The left-hand character up there tells whether the function was called
673in a scalar or list context (we bet you can tell which is which). What
674that says is that you were in the function C<main::infested> when you ran
675the stack dump, and that it was called in a scalar context from line 10
676of the file I<Ambulation.pm>, but without any arguments at all, meaning
677it was called as C<&infested>. The next stack frame shows that the
678function C<Ambulation::legs> was called in a list context from the
679I<camel_flea> file with four arguments. The last stack frame shows that
680C<main::pests> was called in a scalar context, also from I<camel_flea>,
681but from line 4.
682
e7ea3e70 683Note that if you execute C<T> command from inside an active C<use>
7b8d334a 684statement, the backtrace will contain both C<require>
685frame and an C<eval>) frame.
e7ea3e70 686
687=item Listing
688
689Listing given via different flavors of C<l> command looks like this:
690
691 DB<<13>> l
692 101: @i{@i} = ();
693 102:b @isa{@i,$pack} = ()
694 103 if(exists $i{$prevpack} || exists $isa{$pack});
695 104 }
696 105
697 106 next
698 107==> if(exists $isa{$pack});
699 108
700 109:a if ($extra-- > 0) {
701 110: %isa = ($pack,1);
702
703Note that the breakable lines are marked with C<:>, lines with
704breakpoints are marked by C<b>, with actions by C<a>, and the
c47ff5f1 705next executed line is marked by C<< ==> >>.
e7ea3e70 706
707=item Frame listing
708
709When C<frame> option is set, debugger would print entered (and
710optionally exited) subroutines in different styles.
711
54310121 712What follows is the start of the listing of
e7ea3e70 713
28d1fb14 714 env "PERLDB_OPTS=f=n N" perl -d -V
715
716for different values of C<n>:
e7ea3e70 717
718=over 4
719
720=item 1
721
722 entering main::BEGIN
723 entering Config::BEGIN
724 Package lib/Exporter.pm.
725 Package lib/Carp.pm.
726 Package lib/Config.pm.
727 entering Config::TIEHASH
728 entering Exporter::import
729 entering Exporter::export
730 entering Config::myconfig
731 entering Config::FETCH
732 entering Config::FETCH
733 entering Config::FETCH
734 entering Config::FETCH
735
736=item 2
737
738 entering main::BEGIN
739 entering Config::BEGIN
740 Package lib/Exporter.pm.
741 Package lib/Carp.pm.
742 exited Config::BEGIN
743 Package lib/Config.pm.
744 entering Config::TIEHASH
745 exited Config::TIEHASH
746 entering Exporter::import
747 entering Exporter::export
748 exited Exporter::export
749 exited Exporter::import
750 exited main::BEGIN
751 entering Config::myconfig
752 entering Config::FETCH
753 exited Config::FETCH
754 entering Config::FETCH
755 exited Config::FETCH
756 entering Config::FETCH
757
758=item 4
759
760 in $=main::BEGIN() from /dev/nul:0
761 in $=Config::BEGIN() from lib/Config.pm:2
762 Package lib/Exporter.pm.
763 Package lib/Carp.pm.
764 Package lib/Config.pm.
765 in $=Config::TIEHASH('Config') from lib/Config.pm:644
766 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
767 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from li
768 in @=Config::myconfig() from /dev/nul:0
769 in $=Config::FETCH(ref(Config), 'package') from lib/Config.pm:574
770 in $=Config::FETCH(ref(Config), 'baserev') from lib/Config.pm:574
cceca5ed 771 in $=Config::FETCH(ref(Config), 'PERL_VERSION') from lib/Config.pm:574
772 in $=Config::FETCH(ref(Config), 'PERL_SUBVERSION') from lib/Config.pm:574
e7ea3e70 773 in $=Config::FETCH(ref(Config), 'osname') from lib/Config.pm:574
774 in $=Config::FETCH(ref(Config), 'osvers') from lib/Config.pm:574
775
776=item 6
777
778 in $=main::BEGIN() from /dev/nul:0
779 in $=Config::BEGIN() from lib/Config.pm:2
780 Package lib/Exporter.pm.
781 Package lib/Carp.pm.
782 out $=Config::BEGIN() from lib/Config.pm:0
783 Package lib/Config.pm.
784 in $=Config::TIEHASH('Config') from lib/Config.pm:644
785 out $=Config::TIEHASH('Config') from lib/Config.pm:644
786 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
787 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/
788 out $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/
789 out $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
790 out $=main::BEGIN() from /dev/nul:0
791 in @=Config::myconfig() from /dev/nul:0
792 in $=Config::FETCH(ref(Config), 'package') from lib/Config.pm:574
793 out $=Config::FETCH(ref(Config), 'package') from lib/Config.pm:574
794 in $=Config::FETCH(ref(Config), 'baserev') from lib/Config.pm:574
795 out $=Config::FETCH(ref(Config), 'baserev') from lib/Config.pm:574
cceca5ed 796 in $=Config::FETCH(ref(Config), 'PERL_VERSION') from lib/Config.pm:574
797 out $=Config::FETCH(ref(Config), 'PERL_VERSION') from lib/Config.pm:574
798 in $=Config::FETCH(ref(Config), 'PERL_SUBVERSION') from lib/Config.pm:574
e7ea3e70 799
800=item 14
801
802 in $=main::BEGIN() from /dev/nul:0
803 in $=Config::BEGIN() from lib/Config.pm:2
804 Package lib/Exporter.pm.
805 Package lib/Carp.pm.
806 out $=Config::BEGIN() from lib/Config.pm:0
807 Package lib/Config.pm.
808 in $=Config::TIEHASH('Config') from lib/Config.pm:644
809 out $=Config::TIEHASH('Config') from lib/Config.pm:644
810 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
811 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/E
812 out $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/E
813 out $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/nul:0
814 out $=main::BEGIN() from /dev/nul:0
815 in @=Config::myconfig() from /dev/nul:0
816 in $=Config::FETCH('Config=HASH(0x1aa444)', 'package') from lib/Config.pm:574
817 out $=Config::FETCH('Config=HASH(0x1aa444)', 'package') from lib/Config.pm:574
818 in $=Config::FETCH('Config=HASH(0x1aa444)', 'baserev') from lib/Config.pm:574
819 out $=Config::FETCH('Config=HASH(0x1aa444)', 'baserev') from lib/Config.pm:574
820
28d1fb14 821=item 30
822
823 in $=CODE(0x15eca4)() from /dev/null:0
824 in $=CODE(0x182528)() from lib/Config.pm:2
825 Package lib/Exporter.pm.
826 out $=CODE(0x182528)() from lib/Config.pm:0
827 scalar context return from CODE(0x182528): undef
828 Package lib/Config.pm.
829 in $=Config::TIEHASH('Config') from lib/Config.pm:628
830 out $=Config::TIEHASH('Config') from lib/Config.pm:628
831 scalar context return from Config::TIEHASH: empty hash
832 in $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/null:0
833 in $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/Exporter.pm:171
834 out $=Exporter::export('Config', 'main', 'myconfig', 'config_vars') from lib/Exporter.pm:171
835 scalar context return from Exporter::export: ''
836 out $=Exporter::import('Config', 'myconfig', 'config_vars') from /dev/null:0
837 scalar context return from Exporter::import: ''
838
839
e7ea3e70 840=back
841
842In all the cases indentation of lines shows the call tree, if bit 2 of
843C<frame> is set, then a line is printed on exit from a subroutine as
844well, if bit 4 is set, then the arguments are printed as well as the
845caller info, if bit 8 is set, the arguments are printed even if they
28d1fb14 846are tied or references, if bit 16 is set, the return value is printed
847as well.
e7ea3e70 848
849When a package is compiled, a line like this
850
851 Package lib/Carp.pm.
852
853is printed with proper indentation.
854
855=back
856
857=head2 Debugging compile-time statements
858
4e1d3b43 859If you have any compile-time executable statements (code within a BEGIN
860block or a C<use> statement), these will C<NOT> be stopped by debugger,
36477c24 861although C<require>s will (and compile-time statements can be traced
54310121 862with C<AutoTrace> option set in C<PERLDB_OPTS>). From your own Perl
36477c24 863code, however, you can
4e1d3b43 864transfer control back to the debugger using the following statement,
865which is harmless if the debugger is not running:
a0d0e21e 866
867 $DB::single = 1;
868
4e1d3b43 869If you set C<$DB::single> to the value 2, it's equivalent to having
870just typed the C<n> command, whereas a value of 1 means the C<s>
871command. The C<$DB::trace> variable should be set to 1 to simulate
872having typed the C<t> command.
873
e7ea3e70 874Another way to debug compile-time code is to start debugger, set a
875breakpoint on I<load> of some module thusly
876
877 DB<7> b load f:/perllib/lib/Carp.pm
878 Will stop on load of `f:/perllib/lib/Carp.pm'.
879
774d564b 880and restart debugger by C<R> command (if possible). One can use C<b
e7ea3e70 881compile subname> for the same purpose.
882
4e1d3b43 883=head2 Debugger Customization
a0d0e21e 884
7b8d334a 885Most probably you do not want to modify the debugger, it contains enough
774d564b 886hooks to satisfy most needs. You may change the behaviour of debugger
36477c24 887from the debugger itself, using C<O>ptions, from the command line via
888C<PERLDB_OPTS> environment variable, and from I<customization files>.
a0d0e21e 889
890You can do some customization by setting up a F<.perldb> file which
891contains initialization code. For instance, you could make aliases
4e1d3b43 892like these (the last one is one people expect to be there):
a0d0e21e 893
4e1d3b43 894 $DB::alias{'len'} = 's/^len(.*)/p length($1)/';
a0d0e21e 895 $DB::alias{'stop'} = 's/^stop (at|in)/b/';
4e1d3b43 896 $DB::alias{'ps'} = 's/^ps\b/p scalar /';
897 $DB::alias{'quit'} = 's/^quit(\s*)/exit\$/';
898
36477c24 899One changes options from F<.perldb> file via calls like this one;
900
901 parse_options("NonStop=1 LineInfo=db.out AutoTrace=1 frame=2");
902
774d564b 903(the code is executed in the package C<DB>). Note that F<.perldb> is
904processed before processing C<PERLDB_OPTS>. If F<.perldb> defines the
36477c24 905subroutine C<afterinit>, it is called after all the debugger
774d564b 906initialization ends. F<.perldb> may be contained in the current
36477c24 907directory, or in the C<LOGDIR>/C<HOME> directory.
908
909If you want to modify the debugger, copy F<perl5db.pl> from the Perl
910library to another name and modify it as necessary. You'll also want
911to set your C<PERL5DB> environment variable to say something like this:
912
913 BEGIN { require "myperl5db.pl" }
914
915As the last resort, one can use C<PERL5DB> to customize debugger by
916directly setting internal variables or calling debugger functions.
917
4e1d3b43 918=head2 Readline Support
919
920As shipped, the only command line history supplied is a simplistic one
921that checks for leading exclamation points. However, if you install
922the Term::ReadKey and Term::ReadLine modules from CPAN, you will
923have full editing capabilities much like GNU I<readline>(3) provides.
924Look for these in the F<modules/by-module/Term> directory on CPAN.
925
54310121 926A rudimentary command line completion is also available.
e7ea3e70 927Unfortunately, the names of lexical variables are not available for
928completion.
929
4e1d3b43 930=head2 Editor Support for Debugging
931
932If you have GNU B<emacs> installed on your system, it can interact with
933the Perl debugger to provide an integrated software development
934environment reminiscent of its interactions with C debuggers.
935
936Perl is also delivered with a start file for making B<emacs> act like a
937syntax-directed editor that understands (some of) Perl's syntax. Look in
938the I<emacs> directory of the Perl source distribution.
939
940(Historically, a similar setup for interacting with B<vi> and the
941X11 window system had also been available, but at the time of this
942writing, no debugger support for B<vi> currently exists.)
943
944=head2 The Perl Profiler
945
946If you wish to supply an alternative debugger for Perl to run, just
947invoke your script with a colon and a package argument given to the B<-d>
948flag. One of the most popular alternative debuggers for Perl is
949B<DProf>, the Perl profiler. As of this writing, B<DProf> is not
950included with the standard Perl distribution, but it is expected to
951be included soon, for certain values of "soon".
952
953Meanwhile, you can fetch the Devel::Dprof module from CPAN. Assuming
954it's properly installed on your system, to profile your Perl program in
955the file F<mycode.pl>, just type:
956
957 perl -d:DProf mycode.pl
958
959When the script terminates the profiler will dump the profile information
960to a file called F<tmon.out>. A tool like B<dprofpp> (also supplied with
961the Devel::DProf package) can be used to interpret the information which is
962in that profile.
963
36477c24 964=head2 Debugger support in perl
4e1d3b43 965
e7ea3e70 966When you call the B<caller> function (see L<perlfunc/caller>) from the
967package DB, Perl sets the array @DB::args to contain the arguments the
54310121 968corresponding stack frame was called with.
4e1d3b43 969
36477c24 970If perl is run with B<-d> option, the following additional features
84902520 971are enabled (cf. L<perlvar/$^P>):
a0d0e21e 972
36477c24 973=over
4e1d3b43 974
36477c24 975=item *
4e1d3b43 976
36477c24 977Perl inserts the contents of C<$ENV{PERL5DB}> (or C<BEGIN {require
978'perl5db.pl'}> if not present) before the first line of the
979application.
4e1d3b43 980
36477c24 981=item *
4e1d3b43 982
c47ff5f1 983The array C<@{"_<$filename"}> is the line-by-line contents of
774d564b 984$filename for all the compiled files. Same for C<eval>ed strings which
19799a22 985contain subroutines, or which are currently executed. The $filename
36477c24 986for C<eval>ed strings looks like C<(eval 34)>.
4e1d3b43 987
36477c24 988=item *
4e1d3b43 989
c47ff5f1 990The hash C<%{"_<$filename"}> contains breakpoints and action (it is
36477c24 991keyed by line number), and individual entries are settable (as opposed
774d564b 992to the whole hash). Only true/false is important to Perl, though the
36477c24 993values used by F<perl5db.pl> have the form
774d564b 994C<"$break_condition\0$action">. Values are magical in numeric context:
36477c24 995they are zeros if the line is not breakable.
4e1d3b43 996
36477c24 997Same for evaluated strings which contain subroutines, or which are
7b8d334a 998currently executed. The $filename for C<eval>ed strings looks like
36477c24 999C<(eval 34)>.
4e1d3b43 1000
36477c24 1001=item *
4e1d3b43 1002
c47ff5f1 1003The scalar C<${"_<$filename"}> contains C<"_<$filename">. Same for
36477c24 1004evaluated strings which contain subroutines, or which are currently
7b8d334a 1005executed. The $filename for C<eval>ed strings looks like C<(eval
36477c24 100634)>.
4e1d3b43 1007
36477c24 1008=item *
4e1d3b43 1009
36477c24 1010After each C<require>d file is compiled, but before it is executed,
c47ff5f1 1011C<DB::postponed(*{"_<$filename"})> is called (if subroutine
774d564b 1012C<DB::postponed> exists). Here the $filename is the expanded name of
7b8d334a 1013the C<require>d file (as found in values of %INC).
4e1d3b43 1014
36477c24 1015=item *
4e1d3b43 1016
36477c24 1017After each subroutine C<subname> is compiled existence of
774d564b 1018C<$DB::postponed{subname}> is checked. If this key exists,
36477c24 1019C<DB::postponed(subname)> is called (if subroutine C<DB::postponed>
1020exists).
4e1d3b43 1021
36477c24 1022=item *
4e1d3b43 1023
36477c24 1024A hash C<%DB::sub> is maintained, with keys being subroutine names,
774d564b 1025values having the form C<filename:startline-endline>. C<filename> has
36477c24 1026the form C<(eval 31)> for subroutines defined inside C<eval>s.
4e1d3b43 1027
36477c24 1028=item *
1029
5f05dabc 1030When execution of the application reaches a place that can have
1031a breakpoint, a call to C<DB::DB()> is performed if any one of
1032variables $DB::trace, $DB::single, or $DB::signal is true. (Note that
36477c24 1033these variables are not C<local>izable.) This feature is disabled when
1034the control is inside C<DB::DB()> or functions called from it (unless
c47ff5f1 1035C<$^D & (1<<30)>).
36477c24 1036
1037=item *
1038
5f05dabc 1039When execution of the application reaches a subroutine call, a call
36477c24 1040to C<&DB::sub>(I<args>) is performed instead, with C<$DB::sub> being
1041the name of the called subroutine. (Unless the subroutine is compiled
1042in the package C<DB>.)
4e1d3b43 1043
1044=back
a0d0e21e 1045
84902520 1046Note that if C<&DB::sub> needs some external data to be setup for it
1047to work, no subroutine call is possible until this is done. For the
1048standard debugger C<$DB::deep> (how many levels of recursion deep into
1049the debugger you can go before a mandatory break) gives an example of
1050such a dependency.
e7ea3e70 1051
84902520 1052The minimal working debugger consists of one line
e7ea3e70 1053
1054 sub DB::DB {}
1055
1056which is quite handy as contents of C<PERL5DB> environment
1057variable:
1058
1059 env "PERL5DB=sub DB::DB {}" perl -d your-script
1060
1061Another (a little bit more useful) minimal debugger can be created
1062with the only line being
1063
1064 sub DB::DB {print ++$i; scalar <STDIN>}
1065
1066This debugger would print the sequential number of encountered
1067statement, and would wait for your C<CR> to continue.
1068
1069The following debugger is quite functional:
1070
54310121 1071 {
1072 package DB;
1073 sub DB {}
e7ea3e70 1074 sub sub {print ++$i, " $sub\n"; &$sub}
1075 }
1076
1077It prints the sequential number of subroutine call and the name of the
774d564b 1078called subroutine. Note that C<&DB::sub> should be compiled into the
e7ea3e70 1079package C<DB>.
36477c24 1080
1081=head2 Debugger Internals
1082
1083At the start, the debugger reads your rc file (F<./.perldb> or
54310121 1084F<~/.perldb> under Unix), which can set important options. This file may
36477c24 1085define a subroutine C<&afterinit> to be executed after the debugger is
1086initialized.
1087
5f05dabc 1088After the rc file is read, the debugger reads environment variable
36477c24 1089PERLDB_OPTS and parses it as a rest of C<O ...> line in debugger prompt.
1090
1091It also maintains magical internal variables, such as C<@DB::dbline>,
1092C<%DB::dbline>, which are aliases for C<@{"::_<current_file"}>
774d564b 1093C<%{"::_<current_file"}>. Here C<current_file> is the currently
36477c24 1094selected (with the debugger's C<f> command, or by flow of execution)
1095file.
1096
774d564b 1097Some functions are provided to simplify customization. See L<"Debugger
1098Customization"> for description of C<DB::parse_options(string)>. The
36477c24 1099function C<DB::dump_trace(skip[, count])> skips the specified number
1d2dff63 1100of frames, and returns a list containing info about the caller
774d564b 1101frames (all if C<count> is missing). Each entry is a hash with keys
36477c24 1102C<context> (C<$> or C<@>), C<sub> (subroutine name, or info about
5f05dabc 1103eval), C<args> (C<undef> or a reference to an array), C<file>, and
36477c24 1104C<line>.
1105
54310121 1106The function C<DB::print_trace(FH, skip[, count[, short]])> prints
774d564b 1107formatted info about caller frames. The last two functions may be
c47ff5f1 1108convenient as arguments to C<< < >>, C<< << >> commands.
36477c24 1109
a0d0e21e 1110=head2 Other resources
1111
1112You did try the B<-w> switch, didn't you?
1113
a77df738 1114=head2 BUGS
a0d0e21e 1115
4e1d3b43 1116You cannot get the stack frame information or otherwise debug functions
1117that were not compiled by Perl, such as C or C++ extensions.
a0d0e21e 1118
4e1d3b43 1119If you alter your @_ arguments in a subroutine (such as with B<shift>
68dc0745 1120or B<pop>, the stack backtrace will not show the original values.
a77df738 1121
1122=head1 Debugging Perl memory usage
1123
1124Perl is I<very> frivolous with memory. There is a saying that to
1125estimate memory usage of Perl, assume a reasonable algorithm of
c2611fb3 1126allocation, and multiply your estimates by 10. This is not absolutely
a77df738 1127true, but may give you a good grasp of what happens.
1128
1129Say, an integer cannot take less than 20 bytes of memory, a float
1130cannot take less than 24 bytes, a string cannot take less than 32
1131bytes (all these examples assume 32-bit architectures, the result are
1132much worse on 64-bit architectures). If a variable is accessed in two
1133of three different ways (which require an integer, a float, or a
1134string), the memory footprint may increase by another 20 bytes. A
1135sloppy malloc() implementation will make these numbers yet more.
1136
1137On the opposite end of the scale, a declaration like
1138
1139 sub foo;
1140
1141may take (on some versions of perl) up to 500 bytes of memory.
1142
1143Off-the-cuff anecdotal estimates of a code bloat give a factor around
11448. This means that the compiled form of reasonable (commented
1145indented etc.) code will take approximately 8 times more than the
1146disk space the code takes.
1147
1148There are two Perl-specific ways to analyze the memory usage:
1149$ENV{PERL_DEBUG_MSTATS} and B<-DL> switch. First one is available
1150only if perl is compiled with Perl's malloc(), the second one only if
1151Perl compiled with C<-DDEBUGGING> (as with giving C<-D optimise=-g>
1152option to F<Configure>).
1153
1154=head2 Using C<$ENV{PERL_DEBUG_MSTATS}>
1155
1156If your perl is using Perl's malloc(), and compiled with correct
1157switches (this is the default), then it will print memory usage
1158statistics after compiling your code (if C<$ENV{PERL_DEBUG_MSTATS}> >
11591), and before termination of the script (if
1160C<$ENV{PERL_DEBUG_MSTATS}> >= 1). The report format is similar to one
1161in the following example:
1162
1163 env PERL_DEBUG_MSTATS=2 perl -e "require Carp"
1164 Memory allocation statistics after compilation: (buckets 4(4)..8188(8192)
1165 14216 free: 130 117 28 7 9 0 2 2 1 0 0
1166 437 61 36 0 5
1167 60924 used: 125 137 161 55 7 8 6 16 2 0 1
1168 74 109 304 84 20
1169 Total sbrk(): 77824/21:119. Odd ends: pad+heads+chain+tail: 0+636+0+2048.
1170 Memory allocation statistics after execution: (buckets 4(4)..8188(8192)
1171 30888 free: 245 78 85 13 6 2 1 3 2 0 1
1172 315 162 39 42 11
1173 175816 used: 265 176 1112 111 26 22 11 27 2 1 1
1174 196 178 1066 798 39
1175 Total sbrk(): 215040/47:145. Odd ends: pad+heads+chain+tail: 0+2192+0+6144.
1176
1177It is possible to ask for such a statistic at arbitrary moment by
c2611fb3 1178using Devel::Peek::mstats() (module Devel::Peek is available on CPAN).
a77df738 1179
1180Here is the explanation of different parts of the format:
1181
1182=over
1183
1184=item C<buckets SMALLEST(APPROX)..GREATEST(APPROX)>
1185
1186Perl's malloc() uses bucketed allocations. Every request is rounded
1187up to the closest bucket size available, and a bucket of these size is
1188taken from the pool of the buckets of this size.
1189
1190The above line describes limits of buckets currently in use. Each
1191bucket has two sizes: memory footprint, and the maximal size of user
1192data which may be put into this bucket. Say, in the above example the
1193smallest bucket is both sizes 4. The biggest bucket has usable size
11948188, and the memory footprint 8192.
1195
1196With debugging Perl some buckets may have negative usable size. This
1197means that these buckets cannot (and will not) be used. For greater
1198buckets the memory footprint may be one page greater than a power of
11992. In such a case the corresponding power of two is printed instead
1200in the C<APPROX> field above.
1201
1202=item Free/Used
1203
1204The following 1 or 2 rows of numbers correspond to the number of
1205buckets of each size between C<SMALLEST> and C<GREATEST>. In the
1206first row the sizes (memory footprints) of buckets are powers of two
1207(or possibly one page greater). In the second row (if present) the
1208memory footprints of the buckets are between memory footprints of two
1209buckets "above".
1210
1211Say, with the above example the memory footprints are (with current
c2611fb3 1212algorithm)
a77df738 1213
1214 free: 8 16 32 64 128 256 512 1024 2048 4096 8192
1215 4 12 24 48 80
1216
1217With non-C<DEBUGGING> perl the buckets starting from C<128>-long ones
1218have 4-byte overhead, thus 8192-long bucket may take up to
12198188-byte-long allocations.
1220
1221=item C<Total sbrk(): SBRKed/SBRKs:CONTINUOUS>
1222
1223The first two fields give the total amount of memory perl sbrk()ed,
1224and number of sbrk()s used. The third number is what perl thinks
1225about continuity of returned chunks. As far as this number is
1226positive, malloc() will assume that it is probable that sbrk() will
1227provide continuous memory.
1228
1229The amounts sbrk()ed by external libraries is not counted.
1230
1231=item C<pad: 0>
1232
1233The amount of sbrk()ed memory needed to keep buckets aligned.
1234
1235=item C<heads: 2192>
1236
1237While memory overhead of bigger buckets is kept inside the bucket, for
1238smaller buckets it is kept in separate areas. This field gives the
1239total size of these areas.
1240
1241=item C<chain: 0>
1242
1243malloc() may want to subdivide a bigger bucket into smaller buckets.
1244If only a part of the deceased-bucket is left non-subdivided, the rest
1245is kept as an element of a linked list. This field gives the total
1246size of these chunks.
1247
1248=item C<tail: 6144>
1249
1250To minimize amount of sbrk()s malloc() asks for more memory. This
1251field gives the size of the yet-unused part, which is sbrk()ed, but
1252never touched.
1253
1254=back
1255
1256=head2 Example of using B<-DL> switch
1257
1258Below we show how to analyse memory usage by
1259
1260 do 'lib/auto/POSIX/autosplit.ix';
1261
1262The file in question contains a header and 146 lines similar to
1263
1264 sub getcwd ;
1265
1266B<Note:> I<the discussion below supposes 32-bit architecture. In the
1267newer versions of perl the memory usage of the constructs discussed
1268here is much improved, but the story discussed below is a real-life
1269story. This story is very terse, and assumes more than cursory
1270knowledge of Perl internals.>
1271
1272Here is the itemized list of Perl allocations performed during parsing
1273of this file:
1274
1275 !!! "after" at test.pl line 3.
1276 Id subtot 4 8 12 16 20 24 28 32 36 40 48 56 64 72 80 80+
1277 0 02 13752 . . . . 294 . . . . . . . . . . 4
1278 0 54 5545 . . 8 124 16 . . . 1 1 . . . . . 3
1279 5 05 32 . . . . . . . 1 . . . . . . . .
1280 6 02 7152 . . . . . . . . . . 149 . . . . .
1281 7 02 3600 . . . . . 150 . . . . . . . . . .
1282 7 03 64 . -1 . 1 . . 2 . . . . . . . . .
1283 7 04 7056 . . . . . . . . . . . . . . . 7
1284 7 17 38404 . . . . . . . 1 . . 442 149 . . 147 .
1285 9 03 2078 17 249 32 . . . . 2 . . . . . . . .
1286
1287
1288To see this list insert two C<warn('!...')> statements around the call:
1289
1290 warn('!');
1291 do 'lib/auto/POSIX/autosplit.ix';
1292 warn('!!! "after"');
1293
1294and run it with B<-DL> option. The first warn() will print memory
1295allocation info before the parsing of the file, and will memorize the
1296statistics at this point (we ignore what it prints). The second warn()
1297will print increments w.r.t. this memorized statistics. This is the
1298above printout.
1299
1300Different I<Id>s on the left correspond to different subsystems of
1301perl interpreter, they are just first argument given to perl memory
1302allocation API New(). To find what C<9 03> means C<grep> the perl
1303source for C<903>. You will see that it is F<util.c>, function
1304savepvn(). This function is used to store a copy of existing chunk of
1305memory. Using C debugger, one can see that it is called either
1306directly from gv_init(), or via sv_magic(), and gv_init() is called
1307from gv_fetchpv() - which is called from newSUB().
1308
1309B<Note:> to reach this place in debugger and skip all the calls to
1310savepvn during the compilation of the main script, set a C breakpoint
1311in Perl_warn(), C<continue> this point is reached, I<then> set
1312breakpoint in Perl_savepvn(). Note that you may need to skip a
1313handful of Perl_savepvn() which do not correspond to mass production
1314of CVs (there are more C<903> allocations than 146 similar lines of
1315F<lib/auto/POSIX/autosplit.ix>). Note also that C<Perl_> prefixes are
1316added by macroization code in perl header files to avoid conflicts
1317with external libraries.
1318
1319Anyway, we see that C<903> ids correspond to creation of globs, twice
1320per glob - for glob name, and glob stringification magic.
1321
1322Here are explanations for other I<Id>s above:
1323
1324=over
1325
1326=item C<717>
1327
1328is for creation of bigger C<XPV*> structures. In the above case it
1329creates 3 C<AV> per subroutine, one for a list of lexical variable
1330names, one for a scratchpad (which contains lexical variables and
1331C<targets>), and one for the array of scratchpads needed for
1332recursion.
1333
1334It also creates a C<GV> and a C<CV> per subroutine (all called from
1335start_subparse()).
1336
1337=item C<002>
1338
1339Creates C array corresponding to the C<AV> of scratchpads, and the
1340scratchpad itself (the first fake entry of this scratchpad is created
1341though the subroutine itself is not defined yet).
1342
1343It also creates C arrays to keep data for the stash (this is one HV,
1344but it grows, thus there are 4 big allocations: the big chunks are not
c2611fb3 1345freed, but are kept as additional arenas for C<SV> allocations).
a77df738 1346
1347=item C<054>
1348
1349creates a C<HEK> for the name of the glob for the subroutine (this
1350name is a key in a I<stash>).
1351
1352Big allocations with this I<Id> correspond to allocations of new
1353arenas to keep C<HE>.
1354
1355=item C<602>
1356
1357creates a C<GP> for the glob for the subroutine.
1358
1359=item C<702>
1360
1361creates the C<MAGIC> for the glob for the subroutine.
1362
1363=item C<704>
1364
1365creates I<arenas> which keep SVs.
1366
1367=back
1368
1369=head2 B<-DL> details
1370
1371If Perl is run with B<-DL> option, then warn()s which start with `!'
1372behave specially. They print a list of I<categories> of memory
1373allocations, and statistics of allocations of different sizes for
1374these categories.
1375
1376If warn() string starts with
1377
1378=over
1379
1380=item C<!!!>
1381
1382print changed categories only, print the differences in counts of allocations;
1383
1384=item C<!!>
1385
1386print grown categories only; print the absolute values of counts, and totals;
1387
1388=item C<!>
1389
1390print nonempty categories, print the absolute values of counts and totals.
1391
1392=back
1393
1394=head2 Limitations of B<-DL> statistic
1395
1396If an extension or an external library does not use Perl API to
1397allocate memory, these allocations are not counted.
1398
54dc92de 1399=head1 Debugging regular expressions
1400
1401There are two ways to enable debugging output for regular expressions.
1402
1403If your perl is compiled with C<-DDEBUGGING>, you may use the
1404B<-Dr> flag on the command line.
1405
1406Otherwise, one can C<use re 'debug'>, which has effects both at
1407compile time, and at run time (and is I<not> lexically scoped).
1408
1409=head2 Compile-time output
1410
1411The debugging output for the compile time looks like this:
1412
1413 compiling RE `[bc]d(ef*g)+h[ij]k$'
1414 size 43 first at 1
1415 1: ANYOF(11)
1416 11: EXACT <d>(13)
1417 13: CURLYX {1,32767}(27)
1418 15: OPEN1(17)
1419 17: EXACT <e>(19)
1420 19: STAR(22)
1421 20: EXACT <f>(0)
1422 22: EXACT <g>(24)
1423 24: CLOSE1(26)
1424 26: WHILEM(0)
1425 27: NOTHING(28)
1426 28: EXACT <h>(30)
1427 30: ANYOF(40)
1428 40: EXACT <k>(42)
1429 42: EOL(43)
1430 43: END(0)
1431 anchored `de' at 1 floating `gh' at 3..2147483647 (checking floating)
1432 stclass `ANYOF' minlen 7
1433
1434The first line shows the pre-compiled form of the regexp, and the
1435second shows the size of the compiled form (in arbitrary units,
1436usually 4-byte words) and the label I<id> of the first node which
1437does a match.
1438
1439The last line (split into two lines in the above) contains the optimizer
1440info. In the example shown, the optimizer found that the match
1441should contain a substring C<de> at the offset 1, and substring C<gh>
1442at some offset between 3 and infinity. Moreover, when checking for
1443these substrings (to abandon impossible matches quickly) it will check
1444for the substring C<gh> before checking for the substring C<de>. The
1445optimizer may also use the knowledge that the match starts (at the
1446C<first> I<id>) with a character class, and the match cannot be
1447shorter than 7 chars.
1448
1449The fields of interest which may appear in the last line are
1450
1451=over
1452
1453=item C<anchored> I<STRING> C<at> I<POS>
1454
1455=item C<floating> I<STRING> C<at> I<POS1..POS2>
1456
1457see above;
1458
1459=item C<matching floating/anchored>
1460
1461which substring to check first;
1462
1463=item C<minlen>
1464
1465the minimal length of the match;
1466
1467=item C<stclass> I<TYPE>
1468
1469The type of the first matching node.
1470
1471=item C<noscan>
1472
1473which advises to not scan for the found substrings;
1474
1475=item C<isall>
1476
1477which says that the optimizer info is in fact all that the regular
1478expression contains (thus one does not need to enter the RE engine at
1479all);
1480
1481=item C<GPOS>
1482
1483if the pattern contains C<\G>;
1484
1485=item C<plus>
1486
1487if the pattern starts with a repeated char (as in C<x+y>);
1488
1489=item C<implicit>
1490
1491if the pattern starts with C<.*>;
1492
1493=item C<with eval>
1494
1495if the pattern contain eval-groups (see L<perlre/(?{ code })>);
1496
1497=item C<anchored(TYPE)>
1498
1499if the pattern may
1500match only at a handful of places (with C<TYPE> being
1501C<BOL>, C<MBOL>, or C<GPOS>, see the table below).
1502
1503=back
1504
1505If a substring is known to match at end-of-line only, it may be
1506followed by C<$>, as in C<floating `k'$>.
1507
1508The optimizer-specific info is used to avoid entering (a slow) RE
1509engine on strings which will definitely not match. If C<isall> flag
1510is set, a call to the RE engine may be avoided even when optimizer
1511found an appropriate place for the match.
1512
1513The rest of the output contains the list of I<nodes> of the compiled
1514form of the RE. Each line has format
1515
1516C< >I<id>: I<TYPE> I<OPTIONAL-INFO> (I<next-id>)
1517
1518=head2 Types of nodes
1519
1520Here is the list of possible types with short descriptions:
1521
1522 # TYPE arg-description [num-args] [longjump-len] DESCRIPTION
1523
1524 # Exit points
1525 END no End of program.
1526 SUCCEED no Return from a subroutine, basically.
1527
1528 # Anchors:
1529 BOL no Match "" at beginning of line.
1530 MBOL no Same, assuming multiline.
1531 SBOL no Same, assuming singleline.
1532 EOS no Match "" at end of string.
1533 EOL no Match "" at end of line.
1534 MEOL no Same, assuming multiline.
1535 SEOL no Same, assuming singleline.
1536 BOUND no Match "" at any word boundary
1537 BOUNDL no Match "" at any word boundary
1538 NBOUND no Match "" at any word non-boundary
1539 NBOUNDL no Match "" at any word non-boundary
1540 GPOS no Matches where last m//g left off.
1541
1542 # [Special] alternatives
1543 ANY no Match any one character (except newline).
1544 SANY no Match any one character.
1545 ANYOF sv Match character in (or not in) this class.
1546 ALNUM no Match any alphanumeric character
1547 ALNUML no Match any alphanumeric char in locale
1548 NALNUM no Match any non-alphanumeric character
1549 NALNUML no Match any non-alphanumeric char in locale
1550 SPACE no Match any whitespace character
1551 SPACEL no Match any whitespace char in locale
1552 NSPACE no Match any non-whitespace character
1553 NSPACEL no Match any non-whitespace char in locale
1554 DIGIT no Match any numeric character
1555 NDIGIT no Match any non-numeric character
1556
1557 # BRANCH The set of branches constituting a single choice are hooked
1558 # together with their "next" pointers, since precedence prevents
1559 # anything being concatenated to any individual branch. The
1560 # "next" pointer of the last BRANCH in a choice points to the
1561 # thing following the whole choice. This is also where the
1562 # final "next" pointer of each individual branch points; each
1563 # branch starts with the operand node of a BRANCH node.
1564 #
1565 BRANCH node Match this alternative, or the next...
1566
1567 # BACK Normal "next" pointers all implicitly point forward; BACK
1568 # exists to make loop structures possible.
1569 # not used
1570 BACK no Match "", "next" ptr points backward.
1571
1572 # Literals
1573 EXACT sv Match this string (preceded by length).
1574 EXACTF sv Match this string, folded (prec. by length).
1575 EXACTFL sv Match this string, folded in locale (w/len).
1576
1577 # Do nothing
1578 NOTHING no Match empty string.
1579 # A variant of above which delimits a group, thus stops optimizations
1580 TAIL no Match empty string. Can jump here from outside.
1581
1582 # STAR,PLUS '?', and complex '*' and '+', are implemented as circular
1583 # BRANCH structures using BACK. Simple cases (one character
1584 # per match) are implemented with STAR and PLUS for speed
1585 # and to minimize recursive plunges.
1586 #
1587 STAR node Match this (simple) thing 0 or more times.
1588 PLUS node Match this (simple) thing 1 or more times.
1589
1590 CURLY sv 2 Match this simple thing {n,m} times.
1591 CURLYN no 2 Match next-after-this simple thing
1592 # {n,m} times, set parenths.
1593 CURLYM no 2 Match this medium-complex thing {n,m} times.
1594 CURLYX sv 2 Match this complex thing {n,m} times.
1595
1596 # This terminator creates a loop structure for CURLYX
1597 WHILEM no Do curly processing and see if rest matches.
1598
1599 # OPEN,CLOSE,GROUPP ...are numbered at compile time.
1600 OPEN num 1 Mark this point in input as start of #n.
1601 CLOSE num 1 Analogous to OPEN.
1602
1603 REF num 1 Match some already matched string
1604 REFF num 1 Match already matched string, folded
1605 REFFL num 1 Match already matched string, folded in loc.
1606
1607 # grouping assertions
1608 IFMATCH off 1 2 Succeeds if the following matches.
1609 UNLESSM off 1 2 Fails if the following matches.
1610 SUSPEND off 1 1 "Independent" sub-RE.
1611 IFTHEN off 1 1 Switch, should be preceeded by switcher .
1612 GROUPP num 1 Whether the group matched.
1613
1614 # Support for long RE
1615 LONGJMP off 1 1 Jump far away.
1616 BRANCHJ off 1 1 BRANCH with long offset.
1617
1618 # The heavy worker
1619 EVAL evl 1 Execute some Perl code.
1620
1621 # Modifiers
1622 MINMOD no Next operator is not greedy.
1623 LOGICAL no Next opcode should set the flag only.
1624
1625 # This is not used yet
1626 RENUM off 1 1 Group with independently numbered parens.
1627
1628 # This is not really a node, but an optimized away piece of a "long" node.
1629 # To simplify debugging output, we mark it as if it were a node
1630 OPTIMIZED off Placeholder for dump.
1631
1632=head2 Run-time output
1633
1634First of all, when doing a match, one may get no run-time output even
1635if debugging is enabled. this means that the RE engine was never
1636entered, all of the job was done by the optimizer.
1637
1638If RE engine was entered, the output may look like this:
1639
1640 Matching `[bc]d(ef*g)+h[ij]k$' against `abcdefg__gh__'
1641 Setting an EVAL scope, savestack=3
1642 2 <ab> <cdefg__gh_> | 1: ANYOF
1643 3 <abc> <defg__gh_> | 11: EXACT <d>
1644 4 <abcd> <efg__gh_> | 13: CURLYX {1,32767}
1645 4 <abcd> <efg__gh_> | 26: WHILEM
1646 0 out of 1..32767 cc=effff31c
1647 4 <abcd> <efg__gh_> | 15: OPEN1
1648 4 <abcd> <efg__gh_> | 17: EXACT <e>
1649 5 <abcde> <fg__gh_> | 19: STAR
1650 EXACT <f> can match 1 times out of 32767...
1651 Setting an EVAL scope, savestack=3
1652 6 <bcdef> <g__gh__> | 22: EXACT <g>
1653 7 <bcdefg> <__gh__> | 24: CLOSE1
1654 7 <bcdefg> <__gh__> | 26: WHILEM
1655 1 out of 1..32767 cc=effff31c
1656 Setting an EVAL scope, savestack=12
1657 7 <bcdefg> <__gh__> | 15: OPEN1
1658 7 <bcdefg> <__gh__> | 17: EXACT <e>
1659 restoring \1 to 4(4)..7
1660 failed, try continuation...
1661 7 <bcdefg> <__gh__> | 27: NOTHING
1662 7 <bcdefg> <__gh__> | 28: EXACT <h>
1663 failed...
1664 failed...
1665
1666The most significant information in the output is about the particular I<node>
1667of the compiled RE which is currently being tested against the target string.
1668The format of these lines is
1669
1670C< >I<STRING-OFFSET> <I<PRE-STRING>> <I<POST-STRING>> |I<ID>: I<TYPE>
1671
1672The I<TYPE> info is indented with respect to the backtracking level.
1673Other incidental information appears interspersed within.
1674
a77df738 1675=cut