To make the most efficient possible use of iterators:
Close an iterator's cursor as soon as possible.
Each iterator has an open cursor associated with it, so when you are finished using the iterator it is a good habit to explicitly close its cursor. This can potentially improve performance by avoiding locking issues, which will enhanced concurrency. Dbstl will close the cursor when the iterator is destroyed, but you can close the cursor before that time. If the cursor is closed, the associated iterator cannot any longer be used.
In some functions of container classes, an
iterator is used to access the database, and its
cursor is internally created by dbstl. So if you
want to specify a non-zero flag for the
Db::cursor()
call,
you need to call the container's
set_cursor_open_flag()
function to do so.
Use const iterators where applicable.
If your data access is read only, you are strongly recommended to use a const iterator. In order to create a const iterator, you must use a const reference to the container object. For example, supposed we have:
db_vector<int> intv(10);
then we must use a:
const db_vector<int>& intv_ref = intv;
reference to invoke the const begin/end
functions.
intv_ref.begin()
will
give you a const iterator. You can use a const
iterator only to read its referenced data
elements, not update them. However, you should
have better performance with this iterator using,
for example, either
iterator::operator*
or
iterator::operator->member
.
Also, using array indices like
intv_ref[i]
will also
perform better.
All functions in dbstl's containers which return an iterator or data element reference have two versions — one returns a const iterator/reference, the other returns an iterator/reference. If your access is read only, choose the version returning const iterators/references.
Remember that you can only use a const
reference to a container object to call the const
versions of operator*
and
operator[]
.
You can also use the non-const container object
or its non-const reference to create a read only
iterator by passing true
to the
readonly
parameter in the container's
begin()
method.
Use pre-increment/pre-decrement rather than post-increment/post-decrement where possible
Pre-increment operations are more efficient
because the ++iterator
avoids
two iterator copy constructions. This is true when
you are using C++ standard STL iterators as well.
Use bulk retrieval in iterators
If your access pattern is to go through the entire database read only, or if you are reading a continuous range of the database, bulk retrieval can be very useful because it returns multiple key/data pairs in one database call. But be aware that you can only read the returned data, you can not update it. Also, if you do a bulk retrieval and read the data, and simultaneously some other thread of control updates that same data, then unless you are using a serializable transaction, you will now be working with old data.
To make the most efficient possible use of containers:
Avoid using container methods that return references. These because they are a little more expensive.
To implement reference semantics, dbstl has to wrap the data element with the current key/data pair, and must invoke two iterator copy constructions and two Berkeley DB cursor duplications for each such a call. This is true of non-const versions of these functions:
db_vector<T>::operator[]()
|
db_vector<T>::front()
|
db_vector<T>::back()
|
db_vector<T>::at()
|
db_map<>::operator[]()
|
There are alternatives to these functions, mainly through explicit use of iterators.
Use const containers where possible.
The const versions of the functions listed above have less overhead than their non-const counterparts. Using const containers and iterators can bring more performance when you call the const version of the overloaded container/iterator methods. To do so, you define a const container reference to an existing container, and then use this reference to call the methods. For example, if you have:
db_vector<int> container int_vec
then you can define a const reference to
int_vec
:
const db_vector<int>& int_vec_ref;
Then you use
int_vec_ref.begin()
to create a const iterator,
citr
. You can now can use
int_vec_ref
to call the
const versions of the container's member
functions, and then use citr
to
access the data read only. By using
int_vec_ref
and
citr
, we can gain better
performance.
It is acceptable to call the non-const versions
of container functions that return non-const
iterators, and then assign these return values to
const iterator objects. But if you are using
Berkeley DB concurrent data store (CDS), be sure
to set the readonly
parameter for each container method that returns an iterator to
true
. This is because each
iterator corresponds to a Berkeley DB cursor, and
so for best performance you should specify that
the returned iterator be read-only so that the
underlying cursor is also read-only. Otherwise,
the cursor will be a writable cursor, and
performance might be somewhat degraded. If you are
not using CDS, but instead TDS or DS or HA, there
is no distinction between read-only cursors and
read-write cursors. Consequently, you do not need
to specify the readonly parameter at all.