Had to turn off caching, but I've merged everything from SPROUT's fixes
[dbsrgits/DBM-Deep.git] / lib / DBM / Deep / Engine.pm
CommitLineData
bf941eae 1package DBM::Deep::Engine;
2
3use 5.006_000;
4
5use strict;
6use warnings FATAL => 'all';
8385c429 7no warnings 'recursion';
29460253 8
bf941eae 9use DBM::Deep::Iterator ();
10
11# File-wide notes:
12# * Every method in here assumes that the storage has been appropriately
13# safeguarded. This can be anything from flock() to some sort of manual
14# mutex. But, it's the caller's responsability to make sure that this has
15# been done.
16
a4d36ff6 17sub SIG_HASH () { 'H' }
18sub SIG_ARRAY () { 'A' }
19
64a531e5 20=head1 NAME
21
22DBM::Deep::Engine
23
24=head1 PURPOSE
25
1c62d370 26This is an internal-use-only object for L<DBM::Deep>. It mediates the low-level
27mapping between the L<DBM::Deep> objects and the storage medium.
64a531e5 28
29The purpose of this documentation is to provide low-level documentation for
30developers. It is B<not> intended to be used by the general public. This
31documentation and what it documents can and will change without notice.
32
33=head1 OVERVIEW
34
35The engine exposes an API to the DBM::Deep objects (DBM::Deep, DBM::Deep::Array,
36and DBM::Deep::Hash) for their use to access the actual stored values. This API
37is the following:
38
39=over 4
40
41=item * new
42
43=item * read_value
44
45=item * get_classname
46
47=item * make_reference
48
49=item * key_exists
50
51=item * delete_key
52
53=item * write_value
54
55=item * get_next_key
56
f4d0ac97 57=item * setup
64a531e5 58
59=item * begin_work
60
61=item * commit
62
63=item * rollback
64
65=item * lock_exclusive
66
67=item * lock_shared
68
69=item * unlock
70
71=back
72
73They are explained in their own sections below. These methods, in turn, may
74provide some bounds-checking, but primarily act to instantiate objects in the
75Engine::Sector::* hierarchy and dispatch to them.
76
77=head1 TRANSACTIONS
78
79Transactions in DBM::Deep are implemented using a variant of MVCC. This attempts
80to keep the amount of actual work done against the file low while stil providing
81Atomicity, Consistency, and Isolation. Durability, unfortunately, cannot be done
82with only one file.
83
84=head2 STALENESS
85
86If another process uses a transaction slot and writes stuff to it, then
87terminates, the data that process wrote it still within the file. In order to
88address this, there is also a transaction staleness counter associated within
89every write. Each time a transaction is started, that process increments that
90transaction's staleness counter. If, when it reads a value, the staleness
91counters aren't identical, DBM::Deep will consider the value on disk to be stale
92and discard it.
93
94=head2 DURABILITY
95
96The fourth leg of ACID is Durability, the guarantee that when a commit returns,
97the data will be there the next time you read from it. This should be regardless
98of any crashes or powerdowns in between the commit and subsequent read.
99DBM::Deep does provide that guarantee; once the commit returns, all of the data
100has been transferred from the transaction shadow to the HEAD. The issue arises
101with partial commits - a commit that is interrupted in some fashion. In keeping
102with DBM::Deep's "tradition" of very light error-checking and non-existent
103error-handling, there is no way to recover from a partial commit. (This is
104probably a failure in Consistency as well as Durability.)
105
106Other DBMSes use transaction logs (a separate file, generally) to achieve
107Durability. As DBM::Deep is a single-file, we would have to do something
108similar to what SQLite and BDB do in terms of committing using synchonized
109writes. To do this, we would have to use a much higher RAM footprint and some
110serious programming that make my head hurts just to think about it.
111
112=cut
113
f4d0ac97 114=head2 read_value( $obj, $key )
64a531e5 115
f4d0ac97 116This takes an object that provides _base_offset() and a string. It returns the
117value stored in the corresponding Sector::Value's data section.
118
119=cut
120
121sub read_value { die "read_value must be implemented in a child class" }
122
123=head2 get_classname( $obj )
124
125This takes an object that provides _base_offset() and returns the classname (if
126any) associated with it.
127
128It delegates to Sector::Reference::get_classname() for the heavy lifting.
129
130It performs a staleness check.
131
132=cut
133
134sub get_classname { die "get_classname must be implemented in a child class" }
135
136=head2 make_reference( $obj, $old_key, $new_key )
137
138This takes an object that provides _base_offset() and two strings. The
139strings correspond to the old key and new key, respectively. This operation
140is equivalent to (given C<< $db->{foo} = []; >>) C<< $db->{bar} = $db->{foo} >>.
141
142This returns nothing.
143
144=cut
145
146sub make_reference { die "make_reference must be implemented in a child class" }
147
148=head2 key_exists( $obj, $key )
149
150This takes an object that provides _base_offset() and a string for
151the key to be checked. This returns 1 for true and "" for false.
152
153=cut
154
155sub key_exists { die "key_exists must be implemented in a child class" }
156
157=head2 delete_key( $obj, $key )
158
159This takes an object that provides _base_offset() and a string for
160the key to be deleted. This returns the result of the Sector::Reference
161delete_key() method.
162
163=cut
164
165sub delete_key { die "delete_key must be implemented in a child class" }
166
167=head2 write_value( $obj, $key, $value )
168
169This takes an object that provides _base_offset(), a string for the
1c62d370 170key, and a value. This value can be anything storable within L<DBM::Deep>.
f4d0ac97 171
172This returns 1 upon success.
173
174=cut
175
176sub write_value { die "write_value must be implemented in a child class" }
177
178=head2 setup( $obj )
179
180This takes an object that provides _base_offset(). It will do everything needed
181in order to properly initialize all values for necessary functioning. If this is
182called upon an already initialized object, this will also reset the inode.
183
184This returns 1.
185
186=cut
187
188sub setup { die "setup must be implemented in a child class" }
189
190=head2 begin_work( $obj )
191
192This takes an object that provides _base_offset(). It will set up all necessary
193bookkeeping in order to run all work within a transaction.
194
195If $obj is already within a transaction, an error wiill be thrown. If there are
196no more available transactions, an error will be thrown.
197
198This returns undef.
199
200=cut
201
202sub begin_work { die "begin_work must be implemented in a child class" }
203
204=head2 rollback( $obj )
205
206This takes an object that provides _base_offset(). It will revert all
207actions taken within the running transaction.
208
209If $obj is not within a transaction, an error will be thrown.
210
211This returns 1.
212
213=cut
214
215sub rollback { die "rollback must be implemented in a child class" }
216
217=head2 commit( $obj )
218
219This takes an object that provides _base_offset(). It will apply all
220actions taken within the transaction to the HEAD.
221
222If $obj is not within a transaction, an error will be thrown.
223
224This returns 1.
225
226=cut
227
228sub commit { die "commit must be implemented in a child class" }
64a531e5 229
bf941eae 230=head2 get_next_key( $obj, $prev_key )
231
232This takes an object that provides _base_offset() and an optional string
233representing the prior key returned via a prior invocation of this method.
234
235This method delegates to C<< DBM::Deep::Iterator->get_next_key() >>.
236
237=cut
238
239# XXX Add staleness here
240sub get_next_key {
241 my $self = shift;
242 my ($obj, $prev_key) = @_;
243
f4d0ac97 244 # XXX Need to add logic about resetting the iterator if any key in the
245 # reference has changed
0b3cba50 246 unless ( defined $prev_key ) {
19b913ce 247 $obj->{iterator} = $self->iterator_class->new({
bf941eae 248 base_offset => $obj->_base_offset,
249 engine => $self,
250 });
251 }
252
253 return $obj->{iterator}->get_next_key( $obj );
254}
255
f4d0ac97 256=head2 lock_exclusive()
257
258This takes an object that provides _base_offset(). It will guarantee that
259the storage has taken precautions to be safe for a write.
260
261This returns nothing.
262
263=cut
264
265sub lock_exclusive {
266 my $self = shift;
267 my ($obj) = @_;
268 return $self->storage->lock_exclusive( $obj );
269}
270
271=head2 lock_shared()
272
273This takes an object that provides _base_offset(). It will guarantee that
274the storage has taken precautions to be safe for a read.
275
276This returns nothing.
277
278=cut
279
280sub lock_shared {
281 my $self = shift;
282 my ($obj) = @_;
283 return $self->storage->lock_shared( $obj );
284}
285
286=head2 unlock()
287
288This takes an object that provides _base_offset(). It will guarantee that
289the storage has released the most recently-taken lock.
290
291This returns nothing.
292
293=cut
294
295sub unlock {
296 my $self = shift;
297 my ($obj) = @_;
298
299 my $rv = $self->storage->unlock( $obj );
300
301 $self->flush if $rv;
302
303 return $rv;
304}
305
306=head1 INTERNAL METHODS
307
308The following methods are internal-use-only to DBM::Deep::Engine and its
309child classes.
310
311=cut
312
313=head2 flush()
314
315This takes no arguments. It will do everything necessary to flush all things to
316disk. This is usually called during unlock() and setup().
317
318This returns nothing.
319
320=cut
321
322sub flush {
323 my $self = shift;
324
325 # Why do we need to have the storage flush? Shouldn't autoflush take care of
326 # things? -RobK, 2008-06-26
327 $self->storage->flush;
328
329 return;
330}
331
d6ecf579 332=head2 load_sector( $loc )
333
334This takes an id/location/offset and loads the sector based on the engine's
335defined sector type.
336
337=cut
338
339sub load_sector { $_[0]->sector_type->load( @_ ) }
340
2ba14e04 341=head2 clear
342
343=cut
344
345=head2 clear( $obj )
346
347This takes an object that provides _base_offset() and deletes all its
348elements, returning nothing.
349
350=cut
351
4f034d8f 352=head2 cache / clear_cache
353
354This is the cache of loaded Reference sectors.
355
356=cut
357
358sub cache { $_[0]{cache} ||= {} }
359sub clear_cache { %{$_[0]->cache} = () }
360
580e5ee2 361=head2 supports( $option )
362
363This returns a boolean depending on if this instance of DBM::Dep supports
364that feature. C<$option> can be one of:
365
366=over 4
367
368=item * transactions
369
370=back
371
372=cut
373
374sub supports { die "supports must be implemented in a child class" }
375
d6ecf579 376=head2 ACCESSORS
377
378The following are readonly attributes.
379
380=over 4
381
382=item * storage
383
4f034d8f 384=item * sector_type
385
d6ecf579 386=back
387
388=cut
389
390sub storage { $_[0]{storage} }
391
392sub sector_type { die "sector_type must be implemented in a child class" }
393
c2472ede 394# This code is to make sure we write all the values in the $value to the
395# disk and to make sure all changes to $value after the assignment are
396# reflected on disk. This may be counter-intuitive at first, but it is
397# correct dwimmery.
398# NOTE - simply tying $value won't perform a STORE on each value. Hence,
399# the copy to a temp value.
400sub _descend {
401 my $self = shift;
402 my ($value, $value_sector) = @_;
403 my $r = Scalar::Util::reftype( $value ) || '';
404
405 if ( $r eq 'ARRAY' ) {
406 my @temp = @$value;
407 tie @$value, 'DBM::Deep', {
408 base_offset => $value_sector->offset,
409 staleness => $value_sector->staleness,
410 storage => $self->storage,
411 engine => $self,
412 };
413 @$value = @temp;
414 bless $value, 'DBM::Deep::Array' unless Scalar::Util::blessed( $value );
415 }
416 elsif ( $r eq 'HASH' ) {
417 my %temp = %$value;
418 tie %$value, 'DBM::Deep', {
419 base_offset => $value_sector->offset,
420 staleness => $value_sector->staleness,
421 storage => $self->storage,
422 engine => $self,
423 };
424 %$value = %temp;
425 bless $value, 'DBM::Deep::Hash' unless Scalar::Util::blessed( $value );
426 }
bb1daf85 427
428 return;
c2472ede 429}
430
bf941eae 4311;
432__END__