sdbm(3)


NAME
     sdbm, dbm_open, dbm_prep, dbm_close,  dbm_fetch,  dbm_store,  dbm_delete,
     dbm_firstkey, dbm_nextkey, dbm_hash, dbm_rdonly, dbm_error, dbm_clearerr,
     dbm_dirfno, dbm_pagfno - data base subroutines

SYNOPSIS
     #include <sdbm.h>

     typedef struct {
             char *dptr;
             int dsize;
     } datum;

     datum nullitem = { NULL, 0 };

     DBM *dbm_open(char *file, int flags, int mode)

     DBM *dbm_prep(char *dirname, char *pagname, int flags, int mode)

     void dbm_close(DBM *db)

     datum dbm_fetch(DBM *db, key)

     int dbm_store(DBM *db, datum key, datum val, int flags)

     int dbm_delete(DBM *db, datum key)

     datum dbm_firstkey(DBM *db)

     datum dbm_nextkey(DBM *db)

     long dbm_hash(char *string, int len)

     int dbm_rdonly(DBM *db)
     int dbm_error(DBM *db)
     dbm_clearerr(DBM *db)
     int dbm_dirfno(DBM *db)
     int dbm_pagfno(DBM *db)

DESCRIPTION

     This package allows an application to maintain a mapping  of  <key,value>
     pairs  in  disk  files.   This  is  not  to be considered a real database
     system, but is still useful in many simple applications built around fast
     retrieval  of  a  data  value  from  a  key.  This implementation uses an
     external hashing scheme, called Dynamic Hashing, as described by Per-Aake
     Larson  in  BIT  18  (1978)  pp.  184-201.  Retrieval of any item usually
     requires a single disk access.  The application interface  is  compatible
     with the ndbm(3) library.

     An sdbm database is kept in two files usually given the  extensions  .dir
     and  .pag.   The  .dir  file  contains  a bitmap representing a forest of
     binary hash trees, the leaves of which indicate data pages  in  the  .pag
     file.

     The application interface uses the datum structure to describe both  keys
     and  values.   A datum specifies a byte sequence of dsize size pointed to
     by dptr.  If you use ASCII strings as  keys  or  values,  then  you  must
     decide whether or not to include the terminating NUL byte which sometimes
     defines strings.  Including it will require larger database files, but it
     will be possible to get sensible output from a strings(1) command applied
     to the data file.

     In order to allow a process using this  package  to  manipulate  multiple
     databases,  the applications interface always requires a handle, a DBM *,
     to identify the database  to  be  manipulated.   Such  a  handle  can  be
     obtained from the only routines that do not require it, namely dbm_open()
     or dbm_prep().  Either of these will open or  create  the  two  necessary
     files.   The  difference  is that the latter allows explicitly naming the
     bitmap and data files whereas dbm_open() will take a base file  name  and
     call  dbm_prep()  with  the  default  extensions.   The  flags  and  mode
     parameters are the same as for open(2).

     To free the resources occupied while a database handle  is  active,  call
     dbm_close().

     Given a handle, one can retrieve data associated with a key by using  the
     dbm_fetch()  routine,  and  associate  data  with  a  key  by  using  the
     dbm_store() routine.

     The  values  of  the  flags  parameter  for  dbm_store()  can  be  either
     DBM_INSERT, which will not change an existing entry with the same key, or
     DBM_REPLACE, which will replace an existing  entry  with  the  same  key.
     Keys are unique within the database.

     To delete a key and its associated value use the dbm_delete() routine.

     To retrieve every key in the database, use a loop like:

     for (key = dbm_firstkey(db); key.dptr != NULL; key = dbm_nextkey(db))
             ;

     The order of retrieval is unspecified.

     If you determine that the performance of the database  is  inadequate  or
     you  notice  clustering  or  other effects that may be due to the hashing
     algorithm used by this package, you can override it by supplying your own
     dbm_hash()  routine.   Doing  so will make the database unintelligable to
     any other applications that do not use your specialized hash function.

     The following macros are defined in the header file:

          dbm_rdonly() returns true if the database has been opened read-only.

          dbm_error() returns true if an I/O error has occurred.

          dbm_clearerr() allows you to clear the error flag if you  think  you
          know what the error was and insist on ignoring it.

          dbm_dirfno() returns the file descriptor associated with the  bitmap
          file.

          dbm_pagfno() returns the file descriptor associated  with  the  data
          file.

SEE ALSO
     open(2).

DIAGNOSTICS
     Functions that return a DBM * handle will use NULL to indicate an  error.
     Functions  that  return  an  int  will  use -1 to indicate an error.  The
     normal return value in that case is 0.  Functions  that  return  a  datum
     will return nullitem to indicate an error.

     As a special case of dbm_store(), if it is  called  with  the  DBM_INSERT
     flag and the key already exists in the database, the return value will be
     1.

     In general, if a function parameter is invalid,  errno  will  be  set  to
     EINVAL.  If a write operation is requested on a read-only database, errno
     will be set to ENOPERM.  If a memory allocation (using malloc(3)) failed,
     errno  will  be  set  to  ENOMEM.   For I/O operation failures errno will
     contain the value set by the relevant failed system call, either read(2),
     write(2), or lseek(2).

AUTHOR
     Ozan S. Yigit (oz@nexus.yorku.ca)

BUGS
     The sum of key and value data sizes must not exceed PAIRMAX (1008 bytes).

     The sum of the key and value data sizes where several keys  hash  to  the
     same value must fit within one bitmap page.

     The .pag file will contain holes, so its apparent size is larger than its
     contents.  When copied through the filesystem the holes will be filled.

     The contents of datum values returned are in volatile  storage.   If  you
     want  to  retain  the  values  pointed to, you must copy them immediately
     before another call to this package.
     The only safe way for multiple processes to (read and) update a  database
     at  the  same time, is to implement a private locking scheme outside this
     package and open and close the database between lock acquisitions.  It is
     safe for multiple processes to concurrently access a database read-only.

APPLICATIONS PORTABILITY
     For complete source code compatibility with  the  Berkeley  Unix  ndbm(3)
     library,    the    sdbm.h    header   file   should   be   installed   in
     /usr/include/ndbm.h.

     The nullitem data item, and  the  dbm_prep(),  dbm_hash(),  dbm_rdonly(),
     dbm_dirfno(), and dbm_pagfno() functions are unique to this package.