Commit | Line | Data |
a0d0e21e |
1 | package File::Basename; |
2 | |
f06db76b |
3 | =head1 NAME |
4 | |
f06db76b |
5 | fileparse - split a pathname into pieces |
6 | |
7 | basename - extract just the filename from a path |
8 | |
9 | dirname - extract just the directory from a path |
10 | |
11 | =head1 SYNOPSIS |
12 | |
13 | use File::Basename; |
14 | |
15 | ($name,$path,$suffix) = fileparse($fullname,@suffixlist) |
16 | fileparse_set_fstype($os_string); |
17 | $basename = basename($fullname,@suffixlist); |
18 | $dirname = dirname($fullname); |
19 | |
3b421ef0 |
20 | ($name,$path,$suffix) = fileparse("lib/File/Basename.pm",qr{\.pm}); |
f06db76b |
21 | fileparse_set_fstype("VMS"); |
3b421ef0 |
22 | $basename = basename("lib/File/Basename.pm",qr{\.pm}); |
f06db76b |
23 | $dirname = dirname("lib/File/Basename.pm"); |
24 | |
25 | =head1 DESCRIPTION |
26 | |
27 | These routines allow you to parse file specifications into useful |
28 | pieces using the syntax of different operating systems. |
29 | |
30 | =over 4 |
31 | |
32 | =item fileparse_set_fstype |
33 | |
34 | You select the syntax via the routine fileparse_set_fstype(). |
ee2ff9ea |
35 | |
f06db76b |
36 | If the argument passed to it contains one of the substrings |
68dc0745 |
37 | "VMS", "MSDOS", "MacOS", "AmigaOS" or "MSWin32", the file specification |
55497cff |
38 | syntax of that operating system is used in future calls to |
39 | fileparse(), basename(), and dirname(). If it contains none of |
c7b9dd21 |
40 | these substrings, Unix syntax is used. This pattern matching is |
f06db76b |
41 | case-insensitive. If you've selected VMS syntax, and the file |
42 | specification you pass to one of these routines contains a "/", |
c7b9dd21 |
43 | they assume you are using Unix emulation and apply the Unix syntax |
f06db76b |
44 | rules instead, for that function call only. |
45 | |
ee2ff9ea |
46 | If the argument passed to it contains one of the substrings "VMS", |
68dc0745 |
47 | "MSDOS", "MacOS", "AmigaOS", "os2", "MSWin32" or "RISCOS", then the pattern |
ee2ff9ea |
48 | matching for suffix removal is performed without regard for case, |
49 | since those systems are not case-sensitive when opening existing files |
50 | (though some of them preserve case on file creation). |
51 | |
f06db76b |
52 | If you haven't called fileparse_set_fstype(), the syntax is chosen |
f0c6ccdf |
53 | by examining the builtin variable C<$^O> according to these rules. |
f06db76b |
54 | |
55 | =item fileparse |
56 | |
57 | The fileparse() routine divides a file specification into three |
58 | parts: a leading B<path>, a file B<name>, and a B<suffix>. The |
59 | B<path> contains everything up to and including the last directory |
60 | separator in the input file specification. The remainder of the input |
61 | file specification is then divided into B<name> and B<suffix> based on |
62 | the optional patterns you specify in C<@suffixlist>. Each element of |
3b421ef0 |
63 | this list can be a qr-quoted pattern (or a string which is interpreted |
64 | as a regular expression), and is matched |
f06db76b |
65 | against the end of B<name>. If this succeeds, the matching portion of |
66 | B<name> is removed and prepended to B<suffix>. By proper use of |
67 | C<@suffixlist>, you can remove file types or versions for examination. |
68 | |
69 | You are guaranteed that if you concatenate B<path>, B<name>, and |
7e2183d3 |
70 | B<suffix> together in that order, the result will denote the same |
71 | file as the input file specification. |
f06db76b |
72 | |
73 | =back |
74 | |
75 | =head1 EXAMPLES |
76 | |
c7b9dd21 |
77 | Using Unix file syntax: |
f06db76b |
78 | |
7e2183d3 |
79 | ($base,$path,$type) = fileparse('/virgil/aeneid/draft.book7', |
3b421ef0 |
80 | qr{\.book\d+}); |
f06db76b |
81 | |
82 | would yield |
83 | |
84 | $base eq 'draft' |
7e2183d3 |
85 | $path eq '/virgil/aeneid/', |
f0542300 |
86 | $type eq '.book7' |
f06db76b |
87 | |
88 | Similarly, using VMS syntax: |
89 | |
90 | ($name,$dir,$type) = fileparse('Doc_Root:[Help]Rhetoric.Rnh', |
3b421ef0 |
91 | qr{\..*}); |
f06db76b |
92 | |
93 | would yield |
94 | |
95 | $name eq 'Rhetoric' |
96 | $dir eq 'Doc_Root:[Help]' |
97 | $type eq '.Rnh' |
98 | |
95e8664e |
99 | =over |
2ae324a7 |
100 | |
f06db76b |
101 | =item C<basename> |
102 | |
103 | The basename() routine returns the first element of the list produced |
44a8e56a |
104 | by calling fileparse() with the same arguments, except that it always |
105 | quotes metacharacters in the given suffixes. It is provided for |
c7b9dd21 |
106 | programmer compatibility with the Unix shell command basename(1). |
f06db76b |
107 | |
108 | =item C<dirname> |
109 | |
110 | The dirname() routine returns the directory portion of the input file |
111 | specification. When using VMS or MacOS syntax, this is identical to the |
112 | second element of the list produced by calling fileparse() with the same |
7e2183d3 |
113 | input file specification. (Under VMS, if there is no directory information |
114 | in the input file specification, then the current default device and |
c7b9dd21 |
115 | directory are returned.) When using Unix or MSDOS syntax, the return |
116 | value conforms to the behavior of the Unix shell command dirname(1). This |
f06db76b |
117 | is usually the same as the behavior of fileparse(), but differs in some |
118 | cases. For example, for the input file specification F<lib/>, fileparse() |
119 | considers the directory name to be F<lib/>, while dirname() considers the |
120 | directory name to be F<.>). |
121 | |
2ae324a7 |
122 | =back |
123 | |
f06db76b |
124 | =cut |
125 | |
b3eb6a9b |
126 | |
127 | ## use strict; |
1f47e8e2 |
128 | # A bit of juggling to insure that C<use re 'taint';> always works, since |
918c0b2d |
129 | # File::Basename is used during the Perl build, when the re extension may |
130 | # not be available. |
131 | BEGIN { |
132 | unless (eval { require re; }) |
133 | { eval ' sub re::import { $^H |= 0x00100000; } ' } |
134 | import re 'taint'; |
135 | } |
136 | |
137 | |
138 | |
3b825e41 |
139 | use 5.006; |
b395063c |
140 | use warnings; |
17f410f9 |
141 | our(@ISA, @EXPORT, $VERSION, $Fileparse_fstype, $Fileparse_igncase); |
a0d0e21e |
142 | require Exporter; |
143 | @ISA = qw(Exporter); |
748a9306 |
144 | @EXPORT = qw(fileparse fileparse_set_fstype basename dirname); |
88d01e8d |
145 | $VERSION = "2.71"; |
7e2183d3 |
146 | |
a0d0e21e |
147 | |
148 | # fileparse_set_fstype() - specify OS-based rules used in future |
149 | # calls to routines in this package |
150 | # |
ee2ff9ea |
151 | # Currently recognized values: VMS, MSDOS, MacOS, AmigaOS, os2, RISCOS |
152 | # Any other name uses Unix-style rules and is case-sensitive |
a0d0e21e |
153 | |
154 | sub fileparse_set_fstype { |
ee2ff9ea |
155 | my @old = ($Fileparse_fstype, $Fileparse_igncase); |
44a8e56a |
156 | if (@_) { |
157 | $Fileparse_fstype = $_[0]; |
39e571d4 |
158 | $Fileparse_igncase = ($_[0] =~ /^(?:MacOS|VMS|AmigaOS|os2|RISCOS|MSWin32|MSDOS)/i); |
44a8e56a |
159 | } |
160 | wantarray ? @old : $old[0]; |
a0d0e21e |
161 | } |
162 | |
163 | # fileparse() - parse file specification |
164 | # |
f0542300 |
165 | # Version 2.4 27-Sep-1996 Charles Bailey bailey@genetics.upenn.edu |
a0d0e21e |
166 | |
167 | |
168 | sub fileparse { |
169 | my($fullname,@suffices) = @_; |
ee2ff9ea |
170 | my($fstype,$igncase) = ($Fileparse_fstype, $Fileparse_igncase); |
7e2183d3 |
171 | my($dirpath,$tail,$suffix,$basename); |
12cbd720 |
172 | my($taint) = substr($fullname,0,0); # Is $fullname tainted? |
a0d0e21e |
173 | |
174 | if ($fstype =~ /^VMS/i) { |
175 | if ($fullname =~ m#/#) { $fstype = '' } # We're doing Unix emulation |
176 | else { |
c7b9dd21 |
177 | ($dirpath,$basename) = ($fullname =~ /^(.*[:>\]])?(.*)/s); |
12cbd720 |
178 | $dirpath ||= ''; # should always be defined |
a0d0e21e |
179 | } |
180 | } |
fa6a1c44 |
181 | if ($fstype =~ /^MS(DOS|Win32)|epoc/i) { |
c7b9dd21 |
182 | ($dirpath,$basename) = ($fullname =~ /^((?:.*[:\\\/])?)(.*)/s); |
183 | $dirpath .= '.\\' unless $dirpath =~ /[\\\/]\z/; |
a0d0e21e |
184 | } |
f1e20921 |
185 | elsif ($fstype =~ /^os2/i) { |
186 | ($dirpath,$basename) = ($fullname =~ m#^((?:.*[:\\/])?)(.*)#s); |
187 | $dirpath = './' unless $dirpath; # Can't be 0 |
188 | $dirpath .= '/' unless $dirpath =~ m#[\\/]\z#; |
189 | } |
c7b9dd21 |
190 | elsif ($fstype =~ /^MacOS/si) { |
191 | ($dirpath,$basename) = ($fullname =~ /^(.*:)?(.*)/s); |
95e8664e |
192 | $dirpath = ':' unless $dirpath; |
a0d0e21e |
193 | } |
55497cff |
194 | elsif ($fstype =~ /^AmigaOS/i) { |
c7b9dd21 |
195 | ($dirpath,$basename) = ($fullname =~ /(.*[:\/])?(.*)/s); |
a3156fc3 |
196 | $dirpath = './' unless $dirpath; |
55497cff |
197 | } |
748a9306 |
198 | elsif ($fstype !~ /^VMS/i) { # default to Unix |
c7b9dd21 |
199 | ($dirpath,$basename) = ($fullname =~ m#^(.*/)?(.*)#s); |
e3830a4e |
200 | if ($^O eq 'VMS' and $fullname =~ m:^(/[^/]+/000000(/|$))(.*):) { |
491527d0 |
201 | # dev:[000000] is top of VMS tree, similar to Unix '/' |
e3830a4e |
202 | # so strip it off and treat the rest as "normal" |
203 | my $devspec = $1; |
204 | my $remainder = $3; |
205 | ($dirpath,$basename) = ($remainder =~ m#^(.*/)?(.*)#s); |
5fa137f1 |
206 | $dirpath ||= ''; # should always be defined |
e3830a4e |
207 | $dirpath = $devspec.$dirpath; |
491527d0 |
208 | } |
f0c6ccdf |
209 | $dirpath = './' unless $dirpath; |
a0d0e21e |
210 | } |
211 | |
212 | if (@suffices) { |
f06db76b |
213 | $tail = ''; |
a0d0e21e |
214 | foreach $suffix (@suffices) { |
ee2ff9ea |
215 | my $pat = ($igncase ? '(?i)' : '') . "($suffix)\$"; |
c7b9dd21 |
216 | if ($basename =~ s/$pat//s) { |
12cbd720 |
217 | $taint .= substr($suffix,0,0); |
44a8e56a |
218 | $tail = $1 . $tail; |
a0d0e21e |
219 | } |
220 | } |
221 | } |
222 | |
12cbd720 |
223 | $tail .= $taint if defined $tail; # avoid warning if $tail == undef |
8d6d96c1 |
224 | wantarray ? ($basename .= $taint, $dirpath .= $taint, $tail) |
225 | : $basename .= $taint; |
a0d0e21e |
226 | } |
227 | |
228 | |
229 | # basename() - returns first element of list returned by fileparse() |
230 | |
231 | sub basename { |
748a9306 |
232 | my($name) = shift; |
233 | (fileparse($name, map("\Q$_\E",@_)))[0]; |
a0d0e21e |
234 | } |
7e2183d3 |
235 | |
a0d0e21e |
236 | |
237 | # dirname() - returns device and directory portion of file specification |
238 | # Behavior matches that of Unix dirname(1) exactly for Unix and MSDOS |
748a9306 |
239 | # filespecs except for names ending with a separator, e.g., "/xx/yy/". |
240 | # This differs from the second element of the list returned |
a0d0e21e |
241 | # by fileparse() in that the trailing '/' (Unix) or '\' (MSDOS) (and |
242 | # the last directory name if the filespec ends in a '/' or '\'), is lost. |
243 | |
244 | sub dirname { |
245 | my($basename,$dirname) = fileparse($_[0]); |
246 | my($fstype) = $Fileparse_fstype; |
247 | |
248 | if ($fstype =~ /VMS/i) { |
748a9306 |
249 | if ($_[0] =~ m#/#) { $fstype = '' } |
7e2183d3 |
250 | else { return $dirname || $ENV{DEFAULT} } |
a0d0e21e |
251 | } |
084592ab |
252 | if ($fstype =~ /MacOS/i) { |
253 | if( !length($basename) && $dirname !~ /^[^:]+:\z/) { |
254 | $dirname =~ s/([^:]):\z/$1/s; |
255 | ($basename,$dirname) = fileparse $dirname; |
256 | } |
257 | $dirname .= ":" unless $dirname =~ /:\z/; |
258 | } |
f1e20921 |
259 | elsif ($fstype =~ /MS(DOS|Win32)|os2/i) { |
c7b9dd21 |
260 | $dirname =~ s/([^:])[\\\/]*\z/$1/; |
68dc0745 |
261 | unless( length($basename) ) { |
262 | ($basename,$dirname) = fileparse $dirname; |
c7b9dd21 |
263 | $dirname =~ s/([^:])[\\\/]*\z/$1/; |
68dc0745 |
264 | } |
265 | } |
55497cff |
266 | elsif ($fstype =~ /AmigaOS/i) { |
c7b9dd21 |
267 | if ( $dirname =~ /:\z/) { return $dirname } |
55497cff |
268 | chop $dirname; |
c7b9dd21 |
269 | $dirname =~ s#[^:/]+\z## unless length($basename); |
55497cff |
270 | } |
084592ab |
271 | else { |
c7b9dd21 |
272 | $dirname =~ s:(.)/*\z:$1:s; |
42568e28 |
273 | unless( length($basename) ) { |
274 | local($File::Basename::Fileparse_fstype) = $fstype; |
275 | ($basename,$dirname) = fileparse $dirname; |
c7b9dd21 |
276 | $dirname =~ s:(.)/*\z:$1:s; |
42568e28 |
277 | } |
a0d0e21e |
278 | } |
279 | |
280 | $dirname; |
281 | } |
282 | |
44a8e56a |
283 | fileparse_set_fstype $^O; |
a0d0e21e |
284 | |
285 | 1; |