5 Benchmark - benchmark running times of code
7 timethis - run a chunk of code several times
9 timethese - run several chunks of code several times
11 timeit - run a chunk of code and see how long it goes
15 timethis ($count, "code");
17 # Use Perl code in strings...
19 'Name1' => '...code1...',
20 'Name2' => '...code2...',
23 # ... or use subroutine references.
25 'Name1' => sub { ...code1... },
26 'Name2' => sub { ...code2... },
29 $t = timeit($count, '...other code...')
30 print "$count loops of other code took:",timestr($t),"\n";
34 The Benchmark module encapsulates a number of routines to help you
35 figure out how long it takes to execute some code.
43 Returns the current time. Example:
47 # ... your code here ...
49 $td = timediff($t1, $t0);
50 print "the code took:",timestr($td),"\n";
54 Enables or disable debugging by setting the C<$Benchmark::Debug> flag:
57 $t = timeit(10, ' 5 ** $Global ');
62 =head2 Standard Exports
64 The following routines will be exported into your namespace
65 if you use the Benchmark module:
69 =item timeit(COUNT, CODE)
71 Arguments: COUNT is the number of times to run the loop, and CODE is
72 the code to run. CODE may be either a code reference or a string to
73 be eval'd; either way it will be run in the caller's package.
75 Returns: a Benchmark object.
77 =item timethis ( COUNT, CODE, [ TITLE, [ STYLE ]] )
79 Time COUNT iterations of CODE. CODE may be a string to eval or a
80 code reference; either way the CODE will run in the caller's package.
81 Results will be printed to STDOUT as TITLE followed by the times.
82 TITLE defaults to "timethis COUNT" if none is provided. STYLE
83 determines the format of the output, as described for timestr() below.
85 The COUNT can be zero or negative: this means the I<minimum number of
86 CPU seconds> to run. A zero signifies the default of 3 seconds. For
87 example to run at least for 10 seconds:
91 or to run two pieces of code tests for at least 3 seconds:
93 timethese(0, { test1 => '...', test2 => '...'})
95 CPU seconds is, in UNIX terms, the user time plus the system time of
96 the process itself, as opposed to the real (wallclock) time and the
97 time spent by the child processes. Less than 0.1 seconds is not
98 accepted (-0.01 as the count, for example, will cause a fatal runtime
101 Note that the CPU seconds is the B<minimum> time: CPU scheduling and
102 other operating system factors may complicate the attempt so that a
103 little bit more time is spent. The benchmark output will, however,
104 also tell the number of C<$code> runs/second, which should be a more
105 interesting number than the actually spent seconds.
107 Returns a Benchmark object.
109 =item timethese ( COUNT, CODEHASHREF, [ STYLE ] )
111 The CODEHASHREF is a reference to a hash containing names as keys
112 and either a string to eval or a code reference for each value.
113 For each (KEY, VALUE) pair in the CODEHASHREF, this routine will
116 timethis(COUNT, VALUE, KEY, STYLE)
118 The routines are called in string comparison order of KEY.
120 The COUNT can be zero or negative, see timethis().
122 =item timediff ( T1, T2 )
124 Returns the difference between two Benchmark times as a Benchmark
125 object suitable for passing to timestr().
127 =item timesum ( T1, T2 )
129 Returns the sum of two Benchmark times as a Benchmark object suitable
130 for passing to timestr().
132 =item timestr ( TIMEDIFF, [ STYLE, [ FORMAT ] ] )
134 Returns a string that formats the times in the TIMEDIFF object in
135 the requested STYLE. TIMEDIFF is expected to be a Benchmark object
136 similar to that returned by timediff().
138 STYLE can be any of 'all', 'noc', 'nop' or 'auto'. 'all' shows each
139 of the 5 times available ('wallclock' time, user time, system time,
140 user time of children, and system time of children). 'noc' shows all
141 except the two children times. 'nop' shows only wallclock and the
142 two children times. 'auto' (the default) will act as 'all' unless
143 the children times are both zero, in which case it acts as 'noc'.
145 FORMAT is the L<printf(3)>-style format specifier (without the
146 leading '%') to use to print the times. It defaults to '5.2f'.
150 =head2 Optional Exports
152 The following routines will be exported into your namespace
153 if you specifically ask that they be imported:
157 =item clearcache ( COUNT )
159 Clear the cached time for COUNT rounds of the null loop.
161 =item clearallcache ( )
163 Clear all cached times.
165 =item disablecache ( )
167 Disable caching of timings for the null loop. This will force Benchmark
168 to recalculate these timings for each new piece of code timed.
170 =item enablecache ( )
172 Enable caching of timings for the null loop. The time taken for COUNT
173 rounds of the null loop will be calculated only once for each
174 different COUNT used.
180 The data is stored as a list of values from the time and times
183 ($real, $user, $system, $children_user, $children_system)
185 in seconds for the whole loop (not divided by the number of rounds).
187 The timing is done using time(3) and times(3).
189 Code is executed in the caller's package.
191 The time of the null loop (a loop with the same
192 number of rounds but empty loop body) is subtracted
193 from the time of the real loop.
195 The null loop times are cached, the key being the
196 number of rounds. The caching can be controlled using
207 Benchmark inherits from no other class, except of course
212 Comparing eval'd strings with code references will give you
213 inaccurate results: a code reference will show a slower
214 execution time than the equivalent eval'd string.
216 The real time timing is done using time(2) and
217 the granularity is therefore only one second.
219 Short tests may produce negative figures because perl
220 can appear to take longer to execute the empty loop
221 than a short test; try:
225 The system time of the null loop might be slightly
226 more than the system time of the loop with the actual
227 code and therefore the difference might end up being E<lt> 0.
231 Jarkko Hietaniemi <F<jhi@iki.fi>>, Tim Bunce <F<Tim.Bunce@ig.co.uk>>
233 =head1 MODIFICATION HISTORY
235 September 8th, 1994; by Tim Bunce.
237 March 28th, 1997; by Hugo van der Sanden: added support for code
238 references and the already documented 'debug' method; revamped
241 April 04-07th, 1997: by Jarkko Hietaniemi, added the run-for-some-time
246 # evaluate something in a clean lexical environment
247 sub _doeval { eval shift }
250 # put any lexicals at file scope AFTER here
256 @EXPORT=qw(timeit timethis timethese timediff timesum timestr);
257 @EXPORT_OK=qw(clearcache clearallcache disablecache enablecache);
265 $defaultfmt = '5.2f';
266 $defaultstyle = 'auto';
267 # The cache can cause a slight loss of sys time accuracy. If a
268 # user does many tests (>10) with *very* large counts (>10000)
269 # or works on a very slow machine the cache may be useful.
274 sub debug { $debug = ($_[1] != 0); }
276 sub clearcache { delete $cache{$_[0]}; }
277 sub clearallcache { %cache = (); }
278 sub enablecache { $cache = 1; }
279 sub disablecache { $cache = 0; }
281 # --- Functions to process the 'time' data type
283 sub new { my @t = (time, times, @_ == 2 ? $_[1] : 0);
284 print "new=@t\n" if $debug;
287 sub cpu_p { my($r,$pu,$ps,$cu,$cs) = @{$_[0]}; $pu+$ps ; }
288 sub cpu_c { my($r,$pu,$ps,$cu,$cs) = @{$_[0]}; $cu+$cs ; }
289 sub cpu_a { my($r,$pu,$ps,$cu,$cs) = @{$_[0]}; $pu+$ps+$cu+$cs ; }
290 sub real { my($r,$pu,$ps,$cu,$cs) = @{$_[0]}; $r ; }
295 for (my $i=0; $i < @$a; ++$i) {
296 push(@r, $a->[$i] - $b->[$i]);
304 for (my $i=0; $i < @$a; ++$i) {
305 push(@r, $a->[$i] + $b->[$i]);
311 my($tr, $style, $f) = @_;
313 warn "bad time value (@t)" unless @t==6;
314 my($r, $pu, $ps, $cu, $cs, $n) = @t;
315 my($pt, $ct, $t) = ($tr->cpu_p, $tr->cpu_c, $tr->cpu_a);
316 $f = $defaultfmt unless defined $f;
317 # format a time in the required style, other formats may be added here
318 $style ||= $defaultstyle;
319 $style = ($ct>0) ? 'all' : 'noc' if $style eq 'auto';
320 my $s = "@t $style"; # default for unknown style
321 $s=sprintf("%2d wallclock secs (%$f usr %$f sys + %$f cusr %$f csys = %$f CPU)",
322 @t,$t) if $style eq 'all';
323 $s=sprintf("%2d wallclock secs (%$f usr + %$f sys = %$f CPU)",
324 $r,$pu,$ps,$pt) if $style eq 'noc';
325 $s=sprintf("%2d wallclock secs (%$f cusr + %$f csys = %$f CPU)",
326 $r,$cu,$cs,$ct) if $style eq 'nop';
327 $s .= sprintf(" @ %$f/s (n=$n)", $n / ( $pu + $ps )) if $n;
333 print STDERR "$msg",timestr($t),"\n" if $debug;
336 # --- Functions implementing low-level support for timing loops
341 $n+=0; # force numeric now, so garbage won't creep into the eval
342 croak "negative loopcount $n" if $n<0;
343 confess "Usage: runloop(number, [string | coderef])" unless defined $c;
344 my($t0, $t1, $td); # before, after, difference
346 # find package of caller so we can execute code there
347 my($curpack) = caller(0);
349 while (($pack) = caller(++$i)) {
350 last if $pack ne $curpack;
353 my ($subcode, $subref);
354 if (ref $c eq 'CODE') {
355 $subcode = "sub { for (1 .. $n) { local \$_; package $pack; &\$c; } }";
356 $subref = eval $subcode;
359 $subcode = "sub { for (1 .. $n) { local \$_; package $pack; $c;} }";
360 $subref = _doeval($subcode);
362 croak "runloop unable to compile '$c': $@\ncode: $subcode\n" if $@;
363 print STDERR "runloop $n '$subcode'\n" if $debug;
365 $t0 = Benchmark->new(0);
367 $t1 = Benchmark->new($n);
368 $td = &timediff($t1, $t0);
370 timedebug("runloop:",$td);
379 printf STDERR "timeit $n $code\n" if $debug;
381 if ($cache && exists $cache{$n}) {
384 $wn = &runloop($n, '');
388 $wc = &runloop($n, $code);
390 $wd = timediff($wc, $wn);
392 timedebug("timeit: ",$wc);
393 timedebug(" - ",$wn);
394 timedebug(" = ",$wd);
404 my ($code, $tmax) = @_;
406 if ( not defined $tmax or $tmax == 0 ) {
407 $tmax = $default_for;
408 } elsif ( $tmax < 0 ) {
412 die "runfor(..., $tmax): timelimit cannot be less than $min_for.\n"
415 my ($n, $td, $tc, $ntot, $rtot, $utot, $stot, $cutot, $cstot );
417 # First find the minimum $n that gives a non-zero timing.
421 for ($n = 1, $tc = 0; $tc <= 0; $n *= 2 ) {
422 $td = timeit($n, $code);
423 $tc = $td->[1] + $td->[2];
429 my $tpra = 0.05 * $tmax; # Target/time practice.
431 # Double $n until we have think we have practiced enough.
432 for ( $n = 1; $ttot < $tpra; $n *= 2 ) {
433 $td = timeit($n, $code);
439 $ttot = $utot + $stot;
446 # Then iterate towards the $tmax.
447 while ( $ttot < $tmax ) {
448 $r = $tmax / $ttot - 1; # Linear approximation.
450 $n = $nmin if $n < $nmin;
451 $td = timeit($n, $code);
456 $ttot = $utot + $stot;
461 return bless [ $rtot, $utot, $stot, $cutot, $cstot, $ntot ];
464 # --- Functions implementing high-level time-then-print utilities
468 return $n == 0 ? $default_for : $n < 0 ? -$n : undef;
472 my($n, $code, $title, $style) = @_;
476 croak "non-integer loopcount $n, stopped" if int($n)<$n;
477 $t = timeit($n, $code);
478 $title = "timethis $n" unless defined $title;
480 $fort = n_to_for( $n );
481 $t = runfor($code, $fort);
482 $title = "timethis for $fort" unless defined $title;
486 $style = "" unless defined $style;
487 printf("%10s: ", $title);
488 print timestr($t, $style, $defaultfmt),"\n";
490 $n = $forn if defined $forn;
492 # A conservative warning to spot very silly tests.
493 # Don't assume that your benchmark is ok simply because
494 # you don't get this warning!
495 print " (warning: too few iterations for a reliable count)\n"
497 || ($t->real < 1 && $n < 1000)
498 || $t->cpu_a < $min_cpu;
503 my($n, $alt, $style) = @_;
504 die "usage: timethese(count, { 'Name1'=>'code1', ... }\n"
505 unless ref $alt eq HASH;
506 my @names = sort keys %$alt;
507 $style = "" unless defined $style;
510 croak "non-integer loopcount $n, stopped" if int($n)<$n;
511 print "timing $n iterations of";
515 print " ", join(', ',@names);
517 my $for = n_to_for( $n );
518 print ", each for at least $for CPU seconds";
522 # we could save the results in an array and produce a summary here
523 # sum, min, max, avg etc etc
524 foreach my $name (@names) {
525 timethis ($n, $alt -> {$name}, $name, $style);