Integrate perlio:
[p5sagit/p5-mst-13.2.git] / pod / perlembed.pod
CommitLineData
a0d0e21e 1=head1 NAME
2
cb1a09d0 3perlembed - how to embed perl in your C program
a0d0e21e 4
5=head1 DESCRIPTION
6
cb1a09d0 7=head2 PREAMBLE
8
9Do you want to:
10
11=over 5
12
96dbc785 13=item B<Use C from Perl?>
cb1a09d0 14
4929bf7b 15Read L<perlxstut>, L<perlxs>, L<h2xs>, L<perlguts>, and L<perlapi>.
cb1a09d0 16
54310121 17=item B<Use a Unix program from Perl?>
cb1a09d0 18
5f05dabc 19Read about back-quotes and about C<system> and C<exec> in L<perlfunc>.
cb1a09d0 20
96dbc785 21=item B<Use Perl from Perl?>
cb1a09d0 22
e010571f 23Read about L<perlfunc/do> and L<perlfunc/eval> and L<perlfunc/require>
24and L<perlfunc/use>.
cb1a09d0 25
96dbc785 26=item B<Use C from C?>
cb1a09d0 27
28Rethink your design.
29
96dbc785 30=item B<Use Perl from C?>
cb1a09d0 31
32Read on...
33
34=back
35
36=head2 ROADMAP
37
707102d0 38=over 5
39
551e1d92 40=item *
cb1a09d0 41
551e1d92 42Compiling your C program
cb1a09d0 43
551e1d92 44=item *
cb1a09d0 45
551e1d92 46Adding a Perl interpreter to your C program
cb1a09d0 47
551e1d92 48=item *
cb1a09d0 49
551e1d92 50Calling a Perl subroutine from your C program
cb1a09d0 51
551e1d92 52=item *
cb1a09d0 53
551e1d92 54Evaluating a Perl statement from your C program
a6006777 55
551e1d92 56=item *
8ebc5c01 57
551e1d92 58Performing Perl pattern matches and substitutions from your C program
59
60=item *
61
62Fiddling with the Perl stack from your C program
63
64=item *
65
66Maintaining a persistent interpreter
67
68=item *
69
70Maintaining multiple interpreter instances
71
72=item *
73
74Using Perl modules, which themselves use C libraries, from your C program
75
76=item *
77
78Embedding Perl under Win32
96dbc785 79
e010571f 80=back
cb1a09d0 81
82=head2 Compiling your C program
83
8a7dc658 84If you have trouble compiling the scripts in this documentation,
85you're not alone. The cardinal rule: COMPILE THE PROGRAMS IN EXACTLY
86THE SAME WAY THAT YOUR PERL WAS COMPILED. (Sorry for yelling.)
cb1a09d0 87
8a7dc658 88Also, every C program that uses Perl must link in the I<perl library>.
cb1a09d0 89What's that, you ask? Perl is itself written in C; the perl library
90is the collection of compiled C programs that were used to create your
91perl executable (I</usr/bin/perl> or equivalent). (Corollary: you
92can't use Perl from your C program unless Perl has been compiled on
93your machine, or installed properly--that's why you shouldn't blithely
94copy Perl executables from machine to machine without also copying the
95I<lib> directory.)
96
8a7dc658 97When you use Perl from C, your C program will--usually--allocate,
98"run", and deallocate a I<PerlInterpreter> object, which is defined by
99the perl library.
cb1a09d0 100
101If your copy of Perl is recent enough to contain this documentation
a6006777 102(version 5.002 or later), then the perl library (and I<EXTERN.h> and
8a7dc658 103I<perl.h>, which you'll also need) will reside in a directory
104that looks like this:
cb1a09d0 105
106 /usr/local/lib/perl5/your_architecture_here/CORE
107
108or perhaps just
109
110 /usr/local/lib/perl5/CORE
111
112or maybe something like
113
114 /usr/opt/perl5/CORE
115
116Execute this statement for a hint about where to find CORE:
117
96dbc785 118 perl -MConfig -e 'print $Config{archlib}'
cb1a09d0 119
54310121 120Here's how you'd compile the example in the next section,
e010571f 121L<Adding a Perl interpreter to your C program>, on my Linux box:
cb1a09d0 122
54310121 123 % gcc -O2 -Dbool=char -DHAS_BOOL -I/usr/local/include
8a7dc658 124 -I/usr/local/lib/perl5/i586-linux/5.003/CORE
54310121 125 -L/usr/local/lib/perl5/i586-linux/5.003/CORE
8a7dc658 126 -o interp interp.c -lperl -lm
cb1a09d0 127
e010571f 128(That's all one line.) On my DEC Alpha running old 5.003_05, the
129incantation is a bit different:
8a7dc658 130
54310121 131 % cc -O2 -Olimit 2900 -DSTANDARD_C -I/usr/local/include
132 -I/usr/local/lib/perl5/alpha-dec_osf/5.00305/CORE
133 -L/usr/local/lib/perl5/alpha-dec_osf/5.00305/CORE -L/usr/local/lib
8a7dc658 134 -D__LANGUAGE_C__ -D_NO_PROTO -o interp interp.c -lperl -lm
135
136How can you figure out what to add? Assuming your Perl is post-5.001,
137execute a C<perl -V> command and pay special attention to the "cc" and
54310121 138"ccflags" information.
8a7dc658 139
54310121 140You'll have to choose the appropriate compiler (I<cc>, I<gcc>, et al.) for
8a7dc658 141your machine: C<perl -MConfig -e 'print $Config{cc}'> will tell you what
54310121 142to use.
8a7dc658 143
144You'll also have to choose the appropriate library directory
145(I</usr/local/lib/...>) for your machine. If your compiler complains
146that certain functions are undefined, or that it can't locate
147I<-lperl>, then you need to change the path following the C<-L>. If it
148complains that it can't find I<EXTERN.h> and I<perl.h>, you need to
149change the path following the C<-I>.
cb1a09d0 150
151You may have to add extra libraries as well. Which ones?
96dbc785 152Perhaps those printed by
153
154 perl -MConfig -e 'print $Config{libs}'
155
54310121 156Provided your perl binary was properly configured and installed the
8a7dc658 157B<ExtUtils::Embed> module will determine all of this information for
158you:
96dbc785 159
160 % cc -o interp interp.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
161
8a7dc658 162If the B<ExtUtils::Embed> module isn't part of your Perl distribution,
163you can retrieve it from
be94a901 164http://www.perl.com/perl/CPAN/modules/by-module/ExtUtils/. (If
8a7dc658 165this documentation came from your Perl distribution, then you're
166running 5.004 or better and you already have it.)
96dbc785 167
8a7dc658 168The B<ExtUtils::Embed> kit on CPAN also contains all source code for
54310121 169the examples in this document, tests, additional examples and other
8a7dc658 170information you may find useful.
cb1a09d0 171
172=head2 Adding a Perl interpreter to your C program
173
174In a sense, perl (the C program) is a good example of embedding Perl
175(the language), so I'll demonstrate embedding with I<miniperlmain.c>,
e010571f 176included in the source distribution. Here's a bastardized, nonportable
8a7dc658 177version of I<miniperlmain.c> containing the essentials of embedding:
cb1a09d0 178
cb1a09d0 179 #include <EXTERN.h> /* from the Perl distribution */
180 #include <perl.h> /* from the Perl distribution */
96dbc785 181
cb1a09d0 182 static PerlInterpreter *my_perl; /*** The Perl interpreter ***/
96dbc785 183
c07a80fd 184 int main(int argc, char **argv, char **env)
cb1a09d0 185 {
186 my_perl = perl_alloc();
187 perl_construct(my_perl);
d95b23b2 188 PL_exit_flags |= PERL_EXIT_DESTRUCT_END;
96dbc785 189 perl_parse(my_perl, NULL, argc, argv, (char **)NULL);
cb1a09d0 190 perl_run(my_perl);
191 perl_destruct(my_perl);
192 perl_free(my_perl);
193 }
194
8a7dc658 195Notice that we don't use the C<env> pointer. Normally handed to
196C<perl_parse> as its final argument, C<env> here is replaced by
197C<NULL>, which means that the current environment will be used.
96dbc785 198
cb1a09d0 199Now compile this program (I'll call it I<interp.c>) into an executable:
200
96dbc785 201 % cc -o interp interp.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
cb1a09d0 202
203After a successful compilation, you'll be able to use I<interp> just
204like perl itself:
205
206 % interp
207 print "Pretty Good Perl \n";
208 print "10890 - 9801 is ", 10890 - 9801;
209 <CTRL-D>
210 Pretty Good Perl
211 10890 - 9801 is 1089
212
213or
214
215 % interp -e 'printf("%x", 3735928559)'
216 deadbeef
217
218You can also read and execute Perl statements from a file while in the
219midst of your C program, by placing the filename in I<argv[1]> before
e010571f 220calling I<perl_run>.
cb1a09d0 221
222=head2 Calling a Perl subroutine from your C program
223
4929bf7b 224To call individual Perl subroutines, you can use any of the B<call_*>
7b8d334a 225functions documented in L<perlcall>.
4929bf7b 226In this example we'll use C<call_argv>.
cb1a09d0 227
228That's shown below, in a program I'll call I<showtime.c>.
229
cb1a09d0 230 #include <EXTERN.h>
96dbc785 231 #include <perl.h>
232
233 static PerlInterpreter *my_perl;
234
c07a80fd 235 int main(int argc, char **argv, char **env)
cb1a09d0 236 {
8ebc5c01 237 char *args[] = { NULL };
cb1a09d0 238 my_perl = perl_alloc();
239 perl_construct(my_perl);
96dbc785 240
241 perl_parse(my_perl, NULL, argc, argv, NULL);
d95b23b2 242 PL_exit_flags |= PERL_EXIT_DESTRUCT_END;
96dbc785 243
8ebc5c01 244 /*** skipping perl_run() ***/
245
4929bf7b 246 call_argv("showtime", G_DISCARD | G_NOARGS, args);
8ebc5c01 247
cb1a09d0 248 perl_destruct(my_perl);
249 perl_free(my_perl);
250 }
251
252where I<showtime> is a Perl subroutine that takes no arguments (that's the
96dbc785 253I<G_NOARGS>) and for which I'll ignore the return value (that's the
cb1a09d0 254I<G_DISCARD>). Those flags, and others, are discussed in L<perlcall>.
255
256I'll define the I<showtime> subroutine in a file called I<showtime.pl>:
257
258 print "I shan't be printed.";
96dbc785 259
cb1a09d0 260 sub showtime {
261 print time;
262 }
263
264Simple enough. Now compile and run:
265
96dbc785 266 % cc -o showtime showtime.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
267
cb1a09d0 268 % showtime showtime.pl
269 818284590
270
271yielding the number of seconds that elapsed between January 1, 1970
8a7dc658 272(the beginning of the Unix epoch), and the moment I began writing this
cb1a09d0 273sentence.
274
d95b23b2 275In this particular case we don't have to call I<perl_run>, as we set
276the PL_exit_flag PERL_EXIT_DESTRUCT_END which executes END blocks in
277perl_destruct.
8ebc5c01 278
8a7dc658 279If you want to pass arguments to the Perl subroutine, you can add
280strings to the C<NULL>-terminated C<args> list passed to
4929bf7b 281I<call_argv>. For other data types, or to examine return values,
13a2d996 282you'll need to manipulate the Perl stack. That's demonstrated in
283L<Fiddling with the Perl stack from your C program>.
cb1a09d0 284
285=head2 Evaluating a Perl statement from your C program
286
137443ea 287Perl provides two API functions to evaluate pieces of Perl code.
4929bf7b 288These are L<perlapi/eval_sv> and L<perlapi/eval_pv>.
137443ea 289
290Arguably, these are the only routines you'll ever need to execute
e010571f 291snippets of Perl code from within your C program. Your code can be as
292long as you wish; it can contain multiple statements; it can employ
293L<perlfunc/use>, L<perlfunc/require>, and L<perlfunc/do> to
294include external Perl files.
cb1a09d0 295
4929bf7b 296I<eval_pv> lets us evaluate individual Perl strings, and then
96dbc785 297extract variables for coercion into C types. The following program,
cb1a09d0 298I<string.c>, executes three Perl strings, extracting an C<int> from
299the first, a C<float> from the second, and a C<char *> from the third.
300
cb1a09d0 301 #include <EXTERN.h>
302 #include <perl.h>
c47ff5f1 303
cb1a09d0 304 static PerlInterpreter *my_perl;
c47ff5f1 305
c07a80fd 306 main (int argc, char **argv, char **env)
cb1a09d0 307 {
2d8e6c8d 308 STRLEN n_a;
137443ea 309 char *embedding[] = { "", "-e", "0" };
c47ff5f1 310
137443ea 311 my_perl = perl_alloc();
312 perl_construct( my_perl );
c47ff5f1 313
137443ea 314 perl_parse(my_perl, NULL, 3, embedding, NULL);
d95b23b2 315 PL_exit_flags |= PERL_EXIT_DESTRUCT_END;
137443ea 316 perl_run(my_perl);
c47ff5f1 317
137443ea 318 /** Treat $a as an integer **/
4929bf7b 319 eval_pv("$a = 3; $a **= 2", TRUE);
320 printf("a = %d\n", SvIV(get_sv("a", FALSE)));
c47ff5f1 321
137443ea 322 /** Treat $a as a float **/
4929bf7b 323 eval_pv("$a = 3.14; $a **= 2", TRUE);
324 printf("a = %f\n", SvNV(get_sv("a", FALSE)));
c47ff5f1 325
137443ea 326 /** Treat $a as a string **/
4929bf7b 327 eval_pv("$a = 'rekcaH lreP rehtonA tsuJ'; $a = reverse($a);", TRUE);
328 printf("a = %s\n", SvPV(get_sv("a", FALSE), n_a));
c47ff5f1 329
137443ea 330 perl_destruct(my_perl);
331 perl_free(my_perl);
cb1a09d0 332 }
333
4929bf7b 334All of those strange functions with I<sv> in their names help convert Perl scalars to C types. They're described in L<perlguts> and L<perlapi>.
cb1a09d0 335
336If you compile and run I<string.c>, you'll see the results of using
337I<SvIV()> to create an C<int>, I<SvNV()> to create a C<float>, and
338I<SvPV()> to create a string:
339
340 a = 9
341 a = 9.859600
342 a = Just Another Perl Hacker
343
8f183262 344In the example above, we've created a global variable to temporarily
345store the computed value of our eval'd expression. It is also
346possible and in most cases a better strategy to fetch the return value
4929bf7b 347from I<eval_pv()> instead. Example:
8f183262 348
8f183262 349 ...
2d8e6c8d 350 STRLEN n_a;
4929bf7b 351 SV *val = eval_pv("reverse 'rekcaH lreP rehtonA tsuJ'", TRUE);
2d8e6c8d 352 printf("%s\n", SvPV(val,n_a));
8f183262 353 ...
354
355This way, we avoid namespace pollution by not creating global
356variables and we've simplified our code as well.
cb1a09d0 357
358=head2 Performing Perl pattern matches and substitutions from your C program
359
4929bf7b 360The I<eval_sv()> function lets us evaluate strings of Perl code, so we can
cb1a09d0 361define some functions that use it to "specialize" in matches and
362substitutions: I<match()>, I<substitute()>, and I<matches()>.
363
e010571f 364 I32 match(SV *string, char *pattern);
cb1a09d0 365
8a7dc658 366Given a string and a pattern (e.g., C<m/clasp/> or C</\b\w*\b/>, which
367in your C program might appear as "/\\b\\w*\\b/"), match()
cb1a09d0 368returns 1 if the string matches the pattern and 0 otherwise.
369
1f05cdcd 370 int substitute(SV **string, char *pattern);
cb1a09d0 371
1f05cdcd 372Given a pointer to an C<SV> and an C<=~> operation (e.g.,
8a7dc658 373C<s/bob/robert/g> or C<tr[A-Z][a-z]>), substitute() modifies the string
1f05cdcd 374within the C<AV> at according to the operation, returning the number of substitutions
8a7dc658 375made.
cb1a09d0 376
1f05cdcd 377 int matches(SV *string, char *pattern, AV **matches);
cb1a09d0 378
1f05cdcd 379Given an C<SV>, a pattern, and a pointer to an empty C<AV>,
90fdbbb7 380matches() evaluates C<$string =~ $pattern> in a list context, and
1f05cdcd 381fills in I<matches> with the array elements, returning the number of matches found.
cb1a09d0 382
96dbc785 383Here's a sample program, I<match.c>, that uses all three (long lines have
384been wrapped here):
cb1a09d0 385
1f05cdcd 386 #include <EXTERN.h>
387 #include <perl.h>
c47ff5f1 388
4929bf7b 389 /** my_eval_sv(code, error_check)
390 ** kinda like eval_sv(),
1f05cdcd 391 ** but we pop the return value off the stack
392 **/
4929bf7b 393 SV* my_eval_sv(SV *sv, I32 croak_on_error)
1f05cdcd 394 {
395 dSP;
396 SV* retval;
2d8e6c8d 397 STRLEN n_a;
c47ff5f1 398
924508f0 399 PUSHMARK(SP);
4929bf7b 400 eval_sv(sv, G_SCALAR);
c47ff5f1 401
1f05cdcd 402 SPAGAIN;
403 retval = POPs;
404 PUTBACK;
c47ff5f1 405
9cde0e7f 406 if (croak_on_error && SvTRUE(ERRSV))
2d8e6c8d 407 croak(SvPVx(ERRSV, n_a));
c47ff5f1 408
1f05cdcd 409 return retval;
410 }
c47ff5f1 411
1f05cdcd 412 /** match(string, pattern)
413 **
414 ** Used for matches in a scalar context.
415 **
416 ** Returns 1 if the match was successful; 0 otherwise.
417 **/
c47ff5f1 418
1f05cdcd 419 I32 match(SV *string, char *pattern)
420 {
8c52afec 421 SV *command = NEWSV(1099, 0), *retval;
2d8e6c8d 422 STRLEN n_a;
c47ff5f1 423
1f05cdcd 424 sv_setpvf(command, "my $string = '%s'; $string =~ %s",
2d8e6c8d 425 SvPV(string,n_a), pattern);
c47ff5f1 426
4929bf7b 427 retval = my_eval_sv(command, TRUE);
1f05cdcd 428 SvREFCNT_dec(command);
c47ff5f1 429
1f05cdcd 430 return SvIV(retval);
431 }
c47ff5f1 432
1f05cdcd 433 /** substitute(string, pattern)
434 **
435 ** Used for =~ operations that modify their left-hand side (s/// and tr///)
436 **
437 ** Returns the number of successful matches, and
438 ** modifies the input string if there were any.
439 **/
c47ff5f1 440
1f05cdcd 441 I32 substitute(SV **string, char *pattern)
442 {
8c52afec 443 SV *command = NEWSV(1099, 0), *retval;
2d8e6c8d 444 STRLEN n_a;
c47ff5f1 445
1f05cdcd 446 sv_setpvf(command, "$string = '%s'; ($string =~ %s)",
2d8e6c8d 447 SvPV(*string,n_a), pattern);
c47ff5f1 448
4929bf7b 449 retval = my_eval_sv(command, TRUE);
1f05cdcd 450 SvREFCNT_dec(command);
c47ff5f1 451
4929bf7b 452 *string = get_sv("string", FALSE);
1f05cdcd 453 return SvIV(retval);
454 }
c47ff5f1 455
1f05cdcd 456 /** matches(string, pattern, matches)
457 **
90fdbbb7 458 ** Used for matches in a list context.
1f05cdcd 459 **
460 ** Returns the number of matches,
461 ** and fills in **matches with the matching substrings
462 **/
c47ff5f1 463
1f05cdcd 464 I32 matches(SV *string, char *pattern, AV **match_list)
465 {
8c52afec 466 SV *command = NEWSV(1099, 0);
cb1a09d0 467 I32 num_matches;
2d8e6c8d 468 STRLEN n_a;
c47ff5f1 469
1f05cdcd 470 sv_setpvf(command, "my $string = '%s'; @array = ($string =~ %s)",
2d8e6c8d 471 SvPV(string,n_a), pattern);
c47ff5f1 472
4929bf7b 473 my_eval_sv(command, TRUE);
1f05cdcd 474 SvREFCNT_dec(command);
c47ff5f1 475
4929bf7b 476 *match_list = get_av("array", FALSE);
1f05cdcd 477 num_matches = av_len(*match_list) + 1; /** assume $[ is 0 **/
c47ff5f1 478
cb1a09d0 479 return num_matches;
1f05cdcd 480 }
c47ff5f1 481
1f05cdcd 482 main (int argc, char **argv, char **env)
483 {
484 PerlInterpreter *my_perl = perl_alloc();
a6006777 485 char *embedding[] = { "", "-e", "0" };
1f05cdcd 486 AV *match_list;
487 I32 num_matches, i;
8c52afec 488 SV *text = NEWSV(1099,0);
2d8e6c8d 489 STRLEN n_a;
c47ff5f1 490
1f05cdcd 491 perl_construct(my_perl);
96dbc785 492 perl_parse(my_perl, NULL, 3, embedding, NULL);
d95b23b2 493 PL_exit_flags |= PERL_EXIT_DESTRUCT_END;
c47ff5f1 494
1f05cdcd 495 sv_setpv(text, "When he is at a convenience store and the bill comes to some amount like 76 cents, Maynard is aware that there is something he *should* do, something that will enable him to get back a quarter, but he has no idea *what*. He fumbles through his red squeezey changepurse and gives the boy three extra pennies with his dollar, hoping that he might luck into the correct amount. The boy gives him back two of his own pennies and then the big shiny quarter that is his prize. -RICHH");
c47ff5f1 496
96dbc785 497 if (match(text, "m/quarter/")) /** Does text contain 'quarter'? **/
1f05cdcd 498 printf("match: Text contains the word 'quarter'.\n\n");
96dbc785 499 else
1f05cdcd 500 printf("match: Text doesn't contain the word 'quarter'.\n\n");
c47ff5f1 501
96dbc785 502 if (match(text, "m/eighth/")) /** Does text contain 'eighth'? **/
1f05cdcd 503 printf("match: Text contains the word 'eighth'.\n\n");
96dbc785 504 else
1f05cdcd 505 printf("match: Text doesn't contain the word 'eighth'.\n\n");
c47ff5f1 506
96dbc785 507 /** Match all occurrences of /wi../ **/
508 num_matches = matches(text, "m/(wi..)/g", &match_list);
509 printf("matches: m/(wi..)/g found %d matches...\n", num_matches);
c47ff5f1 510
96dbc785 511 for (i = 0; i < num_matches; i++)
2d8e6c8d 512 printf("match: %s\n", SvPV(*av_fetch(match_list, i, FALSE),n_a));
cb1a09d0 513 printf("\n");
c47ff5f1 514
96dbc785 515 /** Remove all vowels from text **/
516 num_matches = substitute(&text, "s/[aeiou]//gi");
cb1a09d0 517 if (num_matches) {
1f05cdcd 518 printf("substitute: s/[aeiou]//gi...%d substitutions made.\n",
519 num_matches);
2d8e6c8d 520 printf("Now text is: %s\n\n", SvPV(text,n_a));
cb1a09d0 521 }
c47ff5f1 522
96dbc785 523 /** Attempt a substitution **/
524 if (!substitute(&text, "s/Perl/C/")) {
1f05cdcd 525 printf("substitute: s/Perl/C...No substitution made.\n\n");
cb1a09d0 526 }
c47ff5f1 527
1f05cdcd 528 SvREFCNT_dec(text);
9cde0e7f 529 PL_perl_destruct_level = 1;
cb1a09d0 530 perl_destruct(my_perl);
531 perl_free(my_perl);
1f05cdcd 532 }
cb1a09d0 533
96dbc785 534which produces the output (again, long lines have been wrapped here)
cb1a09d0 535
8a7dc658 536 match: Text contains the word 'quarter'.
96dbc785 537
8a7dc658 538 match: Text doesn't contain the word 'eighth'.
96dbc785 539
8a7dc658 540 matches: m/(wi..)/g found 2 matches...
cb1a09d0 541 match: will
542 match: with
96dbc785 543
8a7dc658 544 substitute: s/[aeiou]//gi...139 substitutions made.
54310121 545 Now text is: Whn h s t cnvnnc str nd th bll cms t sm mnt lk 76 cnts,
96dbc785 546 Mynrd s wr tht thr s smthng h *shld* d, smthng tht wll nbl hm t gt bck
547 qrtr, bt h hs n d *wht*. H fmbls thrgh hs rd sqzy chngprs nd gvs th by
548 thr xtr pnns wth hs dllr, hpng tht h mght lck nt th crrct mnt. Th by gvs
549 hm bck tw f hs wn pnns nd thn th bg shny qrtr tht s hs prz. -RCHH
550
8a7dc658 551 substitute: s/Perl/C...No substitution made.
96dbc785 552
cb1a09d0 553=head2 Fiddling with the Perl stack from your C program
554
555When trying to explain stacks, most computer science textbooks mumble
556something about spring-loaded columns of cafeteria plates: the last
557thing you pushed on the stack is the first thing you pop off. That'll
558do for our purposes: your C program will push some arguments onto "the Perl
559stack", shut its eyes while some magic happens, and then pop the
560results--the return value of your Perl subroutine--off the stack.
96dbc785 561
cb1a09d0 562First you'll need to know how to convert between C types and Perl
563types, with newSViv() and sv_setnv() and newAV() and all their
4929bf7b 564friends. They're described in L<perlguts> and L<perlapi>.
cb1a09d0 565
566Then you'll need to know how to manipulate the Perl stack. That's
567described in L<perlcall>.
568
96dbc785 569Once you've understood those, embedding Perl in C is easy.
cb1a09d0 570
54310121 571Because C has no builtin function for integer exponentiation, let's
cb1a09d0 572make Perl's ** operator available to it (this is less useful than it
5f05dabc 573sounds, because Perl implements ** with C's I<pow()> function). First
cb1a09d0 574I'll create a stub exponentiation function in I<power.pl>:
575
576 sub expo {
577 my ($a, $b) = @_;
578 return $a ** $b;
579 }
580
581Now I'll create a C program, I<power.c>, with a function
582I<PerlPower()> that contains all the perlguts necessary to push the
583two arguments into I<expo()> and to pop the return value out. Take a
584deep breath...
585
cb1a09d0 586 #include <EXTERN.h>
587 #include <perl.h>
96dbc785 588
cb1a09d0 589 static PerlInterpreter *my_perl;
96dbc785 590
cb1a09d0 591 static void
592 PerlPower(int a, int b)
593 {
594 dSP; /* initialize stack pointer */
595 ENTER; /* everything created after here */
596 SAVETMPS; /* ...is a temporary variable. */
924508f0 597 PUSHMARK(SP); /* remember the stack pointer */
cb1a09d0 598 XPUSHs(sv_2mortal(newSViv(a))); /* push the base onto the stack */
599 XPUSHs(sv_2mortal(newSViv(b))); /* push the exponent onto stack */
600 PUTBACK; /* make local stack pointer global */
4929bf7b 601 call_pv("expo", G_SCALAR); /* call the function */
cb1a09d0 602 SPAGAIN; /* refresh stack pointer */
603 /* pop the return value from stack */
604 printf ("%d to the %dth power is %d.\n", a, b, POPi);
96dbc785 605 PUTBACK;
cb1a09d0 606 FREETMPS; /* free that return value */
607 LEAVE; /* ...and the XPUSHed "mortal" args.*/
608 }
96dbc785 609
610 int main (int argc, char **argv, char **env)
cb1a09d0 611 {
95b76e31 612 char *my_argv[] = { "", "power.pl" };
96dbc785 613
cb1a09d0 614 my_perl = perl_alloc();
615 perl_construct( my_perl );
96dbc785 616
95b76e31 617 perl_parse(my_perl, NULL, 2, my_argv, (char **)NULL);
d95b23b2 618 PL_exit_flags |= PERL_EXIT_DESTRUCT_END;
8ebc5c01 619 perl_run(my_perl);
96dbc785 620
cb1a09d0 621 PerlPower(3, 4); /*** Compute 3 ** 4 ***/
96dbc785 622
cb1a09d0 623 perl_destruct(my_perl);
624 perl_free(my_perl);
625 }
96dbc785 626
cb1a09d0 627
628
629Compile and run:
630
96dbc785 631 % cc -o power power.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
632
633 % power
cb1a09d0 634 3 to the 4th power is 81.
635
a6006777 636=head2 Maintaining a persistent interpreter
637
8a7dc658 638When developing interactive and/or potentially long-running
639applications, it's a good idea to maintain a persistent interpreter
640rather than allocating and constructing a new interpreter multiple
641times. The major reason is speed: since Perl will only be loaded into
54310121 642memory once.
8a7dc658 643
644However, you have to be more cautious with namespace and variable
645scoping when using a persistent interpreter. In previous examples
646we've been using global variables in the default package C<main>. We
647knew exactly what code would be run, and assumed we could avoid
648variable collisions and outrageous symbol table growth.
649
650Let's say your application is a server that will occasionally run Perl
651code from some arbitrary file. Your server has no way of knowing what
652code it's going to run. Very dangerous.
653
654If the file is pulled in by C<perl_parse()>, compiled into a newly
655constructed interpreter, and subsequently cleaned out with
656C<perl_destruct()> afterwards, you're shielded from most namespace
657troubles.
658
659One way to avoid namespace collisions in this scenario is to translate
660the filename into a guaranteed-unique package name, and then compile
e010571f 661the code into that package using L<perlfunc/eval>. In the example
8a7dc658 662below, each file will only be compiled once. Or, the application
663might choose to clean out the symbol table associated with the file
4929bf7b 664after it's no longer needed. Using L<perlapi/call_argv>, We'll
8a7dc658 665call the subroutine C<Embed::Persistent::eval_file> which lives in the
666file C<persistent.pl> and pass the filename and boolean cleanup/cache
a6006777 667flag as arguments.
668
8a7dc658 669Note that the process will continue to grow for each file that it
670uses. In addition, there might be C<AUTOLOAD>ed subroutines and other
671conditions that cause Perl's symbol table to grow. You might want to
672add some logic that keeps track of the process size, or restarts
673itself after a certain number of requests, to ensure that memory
674consumption is minimized. You'll also want to scope your variables
e010571f 675with L<perlfunc/my> whenever possible.
a6006777 676
54310121 677
a6006777 678 package Embed::Persistent;
679 #persistent.pl
54310121 680
a6006777 681 use strict;
77ca0c92 682 our %Cache;
1ee082b7 683 use Symbol qw(delete_package);
54310121 684
a6006777 685 sub valid_package_name {
686 my($string) = @_;
687 $string =~ s/([^A-Za-z0-9\/])/sprintf("_%2x",unpack("C",$1))/eg;
688 # second pass only for words starting with a digit
689 $string =~ s|/(\d)|sprintf("/_%2x",unpack("C",$1))|eg;
54310121 690
a6006777 691 # Dress it up as a real package name
692 $string =~ s|/|::|g;
693 return "Embed" . $string;
694 }
54310121 695
a6006777 696 sub eval_file {
697 my($filename, $delete) = @_;
698 my $package = valid_package_name($filename);
699 my $mtime = -M $filename;
700 if(defined $Cache{$package}{mtime}
701 &&
54310121 702 $Cache{$package}{mtime} <= $mtime)
a6006777 703 {
54310121 704 # we have compiled this subroutine already,
8ebc5c01 705 # it has not been updated on disk, nothing left to do
706 print STDERR "already compiled $package->handler\n";
a6006777 707 }
708 else {
8ebc5c01 709 local *FH;
710 open FH, $filename or die "open '$filename' $!";
711 local($/) = undef;
712 my $sub = <FH>;
713 close FH;
54310121 714
8ebc5c01 715 #wrap the code into a subroutine inside our unique package
716 my $eval = qq{package $package; sub handler { $sub; }};
717 {
718 # hide our variables within this block
719 my($filename,$mtime,$package,$sub);
720 eval $eval;
721 }
722 die $@ if $@;
54310121 723
8ebc5c01 724 #cache it unless we're cleaning out each time
725 $Cache{$package}{mtime} = $mtime unless $delete;
a6006777 726 }
54310121 727
a6006777 728 eval {$package->handler;};
729 die $@ if $@;
54310121 730
a6006777 731 delete_package($package) if $delete;
54310121 732
a6006777 733 #take a look if you want
734 #print Devel::Symdump->rnew($package)->as_string, $/;
735 }
54310121 736
a6006777 737 1;
54310121 738
a6006777 739 __END__
740
741 /* persistent.c */
54310121 742 #include <EXTERN.h>
743 #include <perl.h>
744
a6006777 745 /* 1 = clean out filename's symbol table after each request, 0 = don't */
746 #ifndef DO_CLEAN
747 #define DO_CLEAN 0
748 #endif
54310121 749
a6006777 750 static PerlInterpreter *perl = NULL;
54310121 751
a6006777 752 int
753 main(int argc, char **argv, char **env)
754 {
755 char *embedding[] = { "", "persistent.pl" };
756 char *args[] = { "", DO_CLEAN, NULL };
757 char filename [1024];
758 int exitstatus = 0;
2d8e6c8d 759 STRLEN n_a;
54310121 760
a6006777 761 if((perl = perl_alloc()) == NULL) {
8ebc5c01 762 fprintf(stderr, "no memory!");
763 exit(1);
a6006777 764 }
54310121 765 perl_construct(perl);
766
a6006777 767 exitstatus = perl_parse(perl, NULL, 2, embedding, NULL);
d95b23b2 768 PL_exit_flags |= PERL_EXIT_DESTRUCT_END;
54310121 769 if(!exitstatus) {
8ebc5c01 770 exitstatus = perl_run(perl);
54310121 771
8ebc5c01 772 while(printf("Enter file name: ") && gets(filename)) {
54310121 773
8ebc5c01 774 /* call the subroutine, passing it the filename as an argument */
775 args[0] = filename;
4929bf7b 776 call_argv("Embed::Persistent::eval_file",
8ebc5c01 777 G_DISCARD | G_EVAL, args);
54310121 778
8ebc5c01 779 /* check $@ */
9cde0e7f 780 if(SvTRUE(ERRSV))
2d8e6c8d 781 fprintf(stderr, "eval error: %s\n", SvPV(ERRSV,n_a));
8ebc5c01 782 }
a6006777 783 }
54310121 784
9cde0e7f 785 PL_perl_destruct_level = 0;
54310121 786 perl_destruct(perl);
787 perl_free(perl);
a6006777 788 exit(exitstatus);
789 }
790
a6006777 791Now compile:
792
54310121 793 % cc -o persistent persistent.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
a6006777 794
d1be9408 795Here's an example script file:
a6006777 796
797 #test.pl
798 my $string = "hello";
799 foo($string);
800
801 sub foo {
802 print "foo says: @_\n";
803 }
804
805Now run:
806
807 % persistent
808 Enter file name: test.pl
809 foo says: hello
810 Enter file name: test.pl
811 already compiled Embed::test_2epl->handler
812 foo says: hello
813 Enter file name: ^C
814
d95b23b2 815=head2 Execution of END blocks
816
817Traditionally END blocks have been executed at the end of the perl_run.
818This causes problems for applications that never call perl_run. Since
819perl 5.7.2 you can specify C<PL_exit_flags |= PERL_EXIT_DESTRUCT_END>
820to get the new behaviour. This also enables the running of END blocks if
821the perl_prase fails and C<perl_destruct> will return the exit value.
822
8ebc5c01 823=head2 Maintaining multiple interpreter instances
824
8a7dc658 825Some rare applications will need to create more than one interpreter
826during a session. Such an application might sporadically decide to
54310121 827release any resources associated with the interpreter.
8a7dc658 828
829The program must take care to ensure that this takes place I<before>
9bbedd82 830the next interpreter is constructed. By default, when perl is not
831built with any special options, the global variable
9cde0e7f 832C<PL_perl_destruct_level> is set to C<0>, since extra cleaning isn't
9bbedd82 833usually needed when a program only ever creates a single interpreter
834in its entire lifetime.
8a7dc658 835
9cde0e7f 836Setting C<PL_perl_destruct_level> to C<1> makes everything squeaky clean:
8a7dc658 837
9cde0e7f 838 PL_perl_destruct_level = 1;
8ebc5c01 839
8ebc5c01 840 while(1) {
841 ...
9cde0e7f 842 /* reset global variables here with PL_perl_destruct_level = 1 */
54310121 843 perl_construct(my_perl);
8ebc5c01 844 ...
845 /* clean and reset _everything_ during perl_destruct */
54310121 846 perl_destruct(my_perl);
847 perl_free(my_perl);
8ebc5c01 848 ...
849 /* let's go do it again! */
850 }
851
54310121 852When I<perl_destruct()> is called, the interpreter's syntax parse tree
853and symbol tables are cleaned up, and global variables are reset.
8ebc5c01 854
8a7dc658 855Now suppose we have more than one interpreter instance running at the
9bbedd82 856same time. This is feasible, but only if you used the Configure option
857C<-Dusemultiplicity> or the options C<-Dusethreads -Duseithreads> when
858building Perl. By default, enabling one of these Configure options
859sets the per-interpreter global variable C<PL_perl_destruct_level> to
860C<1>, so that thorough cleaning is automatic.
861
862Using C<-Dusethreads -Duseithreads> rather than C<-Dusemultiplicity>
863is more appropriate if you intend to run multiple interpreters
864concurrently in different threads, because it enables support for
865linking in the thread libraries of your system with the interpreter.
8ebc5c01 866
867Let's give it a try:
868
869
870 #include <EXTERN.h>
8a7dc658 871 #include <perl.h>
8ebc5c01 872
873 /* we're going to embed two interpreters */
874 /* we're going to embed two interpreters */
875
8ebc5c01 876 #define SAY_HELLO "-e", "print qq(Hi, I'm $^X\n)"
877
8ebc5c01 878 int main(int argc, char **argv, char **env)
879 {
54310121 880 PerlInterpreter
8ebc5c01 881 *one_perl = perl_alloc(),
54310121 882 *two_perl = perl_alloc();
8ebc5c01 883 char *one_args[] = { "one_perl", SAY_HELLO };
884 char *two_args[] = { "two_perl", SAY_HELLO };
885
9bbedd82 886 PERL_SET_CONTEXT(one_perl);
8ebc5c01 887 perl_construct(one_perl);
9bbedd82 888 PERL_SET_CONTEXT(two_perl);
8ebc5c01 889 perl_construct(two_perl);
890
9bbedd82 891 PERL_SET_CONTEXT(one_perl);
8ebc5c01 892 perl_parse(one_perl, NULL, 3, one_args, (char **)NULL);
9bbedd82 893 PERL_SET_CONTEXT(two_perl);
8ebc5c01 894 perl_parse(two_perl, NULL, 3, two_args, (char **)NULL);
895
9bbedd82 896 PERL_SET_CONTEXT(one_perl);
8ebc5c01 897 perl_run(one_perl);
9bbedd82 898 PERL_SET_CONTEXT(two_perl);
8ebc5c01 899 perl_run(two_perl);
900
9bbedd82 901 PERL_SET_CONTEXT(one_perl);
8ebc5c01 902 perl_destruct(one_perl);
9bbedd82 903 PERL_SET_CONTEXT(two_perl);
8ebc5c01 904 perl_destruct(two_perl);
905
9bbedd82 906 PERL_SET_CONTEXT(one_perl);
8ebc5c01 907 perl_free(one_perl);
9bbedd82 908 PERL_SET_CONTEXT(two_perl);
8ebc5c01 909 perl_free(two_perl);
910 }
911
9bbedd82 912Note the calls to PERL_SET_CONTEXT(). These are necessary to initialize
913the global state that tracks which interpreter is the "current" one on
914the particular process or thread that may be running it. It should
915always be used if you have more than one interpreter and are making
916perl API calls on both interpreters in an interleaved fashion.
917
918PERL_SET_CONTEXT(interp) should also be called whenever C<interp> is
919used by a thread that did not create it (using either perl_alloc(), or
920the more esoteric perl_clone()).
8ebc5c01 921
922Compile as usual:
923
924 % cc -o multiplicity multiplicity.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
925
926Run it, Run it:
927
928 % multiplicity
929 Hi, I'm one_perl
930 Hi, I'm two_perl
931
96dbc785 932=head2 Using Perl modules, which themselves use C libraries, from your C program
933
934If you've played with the examples above and tried to embed a script
935that I<use()>s a Perl module (such as I<Socket>) which itself uses a C or C++ library,
936this probably happened:
937
938
939 Can't load module Socket, dynamic loading not available in this perl.
940 (You may need to build a new perl executable which either supports
941 dynamic loading or has the Socket module statically linked into it.)
942
943
944What's wrong?
945
946Your interpreter doesn't know how to communicate with these extensions
947on its own. A little glue will help. Up until now you've been
948calling I<perl_parse()>, handing it NULL for the second argument:
949
950 perl_parse(my_perl, NULL, argc, my_argv, NULL);
951
952That's where the glue code can be inserted to create the initial contact between
953Perl and linked C/C++ routines. Let's take a look some pieces of I<perlmain.c>
954to see how Perl does this:
955
cc7dda15 956 static void xs_init (pTHX);
96dbc785 957
cc7dda15 958 EXTERN_C void boot_DynaLoader (pTHX_ CV* cv);
959 EXTERN_C void boot_Socket (pTHX_ CV* cv);
96dbc785 960
961
962 EXTERN_C void
cc7dda15 963 xs_init(pTHX)
96dbc785 964 {
965 char *file = __FILE__;
966 /* DynaLoader is a special case */
967 newXS("DynaLoader::boot_DynaLoader", boot_DynaLoader, file);
968 newXS("Socket::bootstrap", boot_Socket, file);
969 }
970
971Simply put: for each extension linked with your Perl executable
972(determined during its initial configuration on your
973computer or when adding a new extension),
974a Perl subroutine is created to incorporate the extension's
975routines. Normally, that subroutine is named
976I<Module::bootstrap()> and is invoked when you say I<use Module>. In
977turn, this hooks into an XSUB, I<boot_Module>, which creates a Perl
978counterpart for each of the extension's XSUBs. Don't worry about this
979part; leave that to the I<xsubpp> and extension authors. If your
980extension is dynamically loaded, DynaLoader creates I<Module::bootstrap()>
981for you on the fly. In fact, if you have a working DynaLoader then there
5f05dabc 982is rarely any need to link in any other extensions statically.
96dbc785 983
984
985Once you have this code, slap it into the second argument of I<perl_parse()>:
986
987
988 perl_parse(my_perl, xs_init, argc, my_argv, NULL);
989
990
991Then compile:
992
8a7dc658 993 % cc -o interp interp.c `perl -MExtUtils::Embed -e ccopts -e ldopts`
96dbc785 994
995 % interp
996 use Socket;
997 use SomeDynamicallyLoadedModule;
998
999 print "Now I can use extensions!\n"'
1000
1001B<ExtUtils::Embed> can also automate writing the I<xs_init> glue code.
1002
8a7dc658 1003 % perl -MExtUtils::Embed -e xsinit -- -o perlxsi.c
96dbc785 1004 % cc -c perlxsi.c `perl -MExtUtils::Embed -e ccopts`
1005 % cc -c interp.c `perl -MExtUtils::Embed -e ccopts`
8a7dc658 1006 % cc -o interp perlxsi.o interp.o `perl -MExtUtils::Embed -e ldopts`
96dbc785 1007
4929bf7b 1008Consult L<perlxs>, L<perlguts>, and L<perlapi> for more details.
96dbc785 1009
13a2d996 1010=head1 Embedding Perl under Win32
53f52f58 1011
cc7dda15 1012In general, all of the source code shown here should work unmodified under
1013Windows.
53f52f58 1014
cc7dda15 1015However, there are some caveats about the command-line examples shown.
1016For starters, backticks won't work under the Win32 native command shell.
53f52f58 1017The ExtUtils::Embed kit on CPAN ships with a script called
1018B<genmake>, which generates a simple makefile to build a program from
e010571f 1019a single C source file. It can be used like this:
53f52f58 1020
1021 C:\ExtUtils-Embed\eg> perl genmake interp.c
1022 C:\ExtUtils-Embed\eg> nmake
1023 C:\ExtUtils-Embed\eg> interp -e "print qq{I'm embedded in Win32!\n}"
1024
e010571f 1025You may wish to use a more robust environment such as the Microsoft
1026Developer Studio. In this case, run this to generate perlxsi.c:
53f52f58 1027
1028 perl -MExtUtils::Embed -e xsinit
1029
e010571f 1030Create a new project and Insert -> Files into Project: perlxsi.c,
1031perl.lib, and your own source files, e.g. interp.c. Typically you'll
1032find perl.lib in B<C:\perl\lib\CORE>, if not, you should see the
1033B<CORE> directory relative to C<perl -V:archlib>. The studio will
1034also need this path so it knows where to find Perl include files.
1035This path can be added via the Tools -> Options -> Directories menu.
1036Finally, select Build -> Build interp.exe and you're ready to go.
96dbc785 1037
cb1a09d0 1038=head1 MORAL
1039
1040You can sometimes I<write faster code> in C, but
5f05dabc 1041you can always I<write code faster> in Perl. Because you can use
cb1a09d0 1042each from the other, combine them as you wish.
1043
1044
1045=head1 AUTHOR
1046
e010571f 1047Jon Orwant <F<orwant@tpj.com>> and Doug MacEachern
1048<F<dougm@osf.org>>, with small contributions from Tim Bunce, Tom
1049Christiansen, Guy Decoux, Hallvard Furuseth, Dov Grobgeld, and Ilya
1050Zakharevich.
cb1a09d0 1051
e010571f 1052Doug MacEachern has an article on embedding in Volume 1, Issue 4 of
1053The Perl Journal (http://tpj.com). Doug is also the developer of the
1054most widely-used Perl embedding: the mod_perl system
1055(perl.apache.org), which embeds Perl in the Apache web server.
1056Oracle, Binary Evolution, ActiveState, and Ben Sugars's nsapi_perl
1057have used this model for Oracle, Netscape and Internet Information
1058Server Perl plugins.
cb1a09d0 1059
e010571f 1060July 22, 1998
8a7dc658 1061
1062=head1 COPYRIGHT
1063
e010571f 1064Copyright (C) 1995, 1996, 1997, 1998 Doug MacEachern and Jon Orwant. All
8a7dc658 1065Rights Reserved.
1066
e010571f 1067Permission is granted to make and distribute verbatim copies of this
1068documentation provided the copyright notice and this permission notice are
1069preserved on all copies.
1070
1071Permission is granted to copy and distribute modified versions of this
1072documentation under the conditions for verbatim copying, provided also
1073that they are marked clearly as modified versions, that the authors'
1074names and title are unchanged (though subtitles and additional
1075authors' names may be added), and that the entire resulting derived
1076work is distributed under the terms of a permission notice identical
1077to this one.
1078
1079Permission is granted to copy and distribute translations of this
1080documentation into another language, under the above conditions for
1081modified versions.