Hashtable or array of structs?

Alfonso Morra · Aug 5, 2005

I am implementing an application which requires the storage of a large
number of items in a cache.

I have a 3-tuple key represented by a struct, as well as an n-tuple (n is
fixed) dataset, also represented by a struct. At run time, I will know
exactly the number of items in the hashtable (I will have populated it
myself by loading the data from a database). During the course of the
application's lifetime, I will retrieve data in the hashtable and update it.

My question about the suitability of using a hashtable (as opposed to a
simple array of structs) is this:

The table will be 100% full - and I know that hashtables begin to suffer
a performance hit once they get to about 70% filled. (or maybe I should
create the hashtable to be able to hold a larger number of items than I
know I will need? - The number is fixed and does not vary after initial
population).

If my key was a single item, then it would be relatively trivial to
implement the cache as an array of structs. However, there are three items
that uniquely identify a record (this 3-tuple actually forms the
composite primary key loaded from the db schema).

I would much prefer to implement this as a hashtable, as I can easily
use the composite lookup key. I have also chosen the keys to be
integers, to further speed the computation of the hash key. I would
appreciate any feedback on this choice.
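
For illustration, hashing a key made of three integers could look roughly like the sketch below. The struct name `key3`, its field names, the 32-bit field width, and the multiplier 31 are all assumptions made for the example; nothing here comes from the actual schema, and the mixing step is not a tuned hash function.

```c
#include <stdint.h>

/* Hypothetical composite key: the three integer parts of the primary key. */
struct key3 {
    int32_t a, b, c;
};

/* Fold the three fields into one 32-bit hash value.
   The multiplier and the final xor-shift are illustrative choices only. */
uint32_t key3_hash(const struct key3 *k)
{
    uint32_t h = (uint32_t)k->a;
    h = h * 31u + (uint32_t)k->b;
    h = h * 31u + (uint32_t)k->c;
    h ^= h >> 16;            /* let the high bits influence the low bits */
    return h;
}

/* Compare field by field; memcmp on the whole struct could also
   compare padding bytes, which is not wanted. */
int key3_equal(const struct key3 *x, const struct key3 *y)
{
    return x->a == y->a && x->b == y->b && x->c == y->c;
}
```

The hash value would then be reduced to a slot number, for example by masking with the table size minus one when the size is a power of two.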

Thanks

CBFalconer · Aug 6, 2005

Alfonso said:
.... snip ...

I would much prefer to implement this as a hashtable, as I can
easily use the composite lookup key. I have also chosen the keys
to be integers, to further speed the computation of the hash key.
I would appreciate any feedback on this choice.

Take a look at hashlib and the example programs with it. It will
allow entering one item in multiple databases and easy
experimentation with hash functions. Another usage example is
id2id-20. Both can be found at:

<http://cbfalconer.home.att.net/download/>

Malcolm · Aug 6, 2005

Alfonso Morra said:
My question about the suitability of using a hashtable (as opposed to a
simple array of structs) is this:

The table will be 100% full - and I know that hashtables begin to suffer a
performance hit once they get to about 70% filled. (or maybe I should
create the hashtable to be able to hold a larger number of items than I
know I will need? - The number is fixed and does not vary after initial
population).

You need free slots in a hash table, or the algorithm begins to break down.
So you will need memory for about twice the number of entries.
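
As a rough sketch of that sizing rule, the table below uses open addressing with linear probing and rounds its capacity up to a power of two that is at least twice the expected item count. All names (`struct table`, `table_init`, `table_put`, `table_get`), the `double` payload, and the hash mixing are assumptions for the example, not anything from the original application or from hashlib.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

struct key3 { int32_t a, b, c; };                 /* hypothetical 3-part key */
struct item { struct key3 key; double value; int used; };

struct table {
    struct item *slots;
    size_t cap;     /* power of two, at least 2 * expected item count */
    size_t count;   /* number of occupied slots */
};

static uint32_t hash3(const struct key3 *k)
{
    uint32_t h = ((uint32_t)k->a * 31u + (uint32_t)k->b) * 31u + (uint32_t)k->c;
    return h ^ (h >> 16);
}

static int same_key(const struct key3 *x, const struct key3 *y)
{
    return x->a == y->a && x->b == y->b && x->c == y->c;
}

/* Allocate zeroed slots for roughly twice n items; returns 0 on failure.
   Overflow of 2 * n is not checked in this sketch. */
int table_init(struct table *t, size_t n)
{
    size_t cap = 2;
    while (cap < 2 * n)
        cap *= 2;
    t->slots = calloc(cap, sizeof *t->slots);
    t->cap = cap;
    t->count = 0;
    return t->slots != NULL;
}

/* Walk the probe sequence until the key or a free slot is found.
   Terminates because at least one slot is always left free. */
static struct item *probe(const struct table *t, const struct key3 *k)
{
    size_t i = hash3(k) & (t->cap - 1);
    while (t->slots[i].used && !same_key(&t->slots[i].key, k))
        i = (i + 1) & (t->cap - 1);
    return &t->slots[i];
}

/* Insert a new key or overwrite the value of an existing one. */
void table_put(struct table *t, const struct key3 *k, double value)
{
    struct item *s = probe(t, k);
    if (!s->used) {
        assert(t->count + 1 < t->cap);   /* never fill the last free slot */
        s->key = *k;
        s->used = 1;
        t->count++;
    }
    s->value = value;
}

/* Return a pointer to the stored value, or NULL if the key is absent. */
double *table_get(const struct table *t, const struct key3 *k)
{
    struct item *s = probe(t, k);
    return s->used ? &s->value : NULL;
}
```

Because the item count is fixed after loading, the table never needs to grow; the free slots exist only to keep probe sequences short.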

If my key was a single item, then it would be relatively trivial to
implement the cache as an array of structs. However, there are three items
that uniquely identify a record (this 3-tuple actually forms the composite
primary key loaded from the db schema).

If I understand this correctly you will need three arrays / hashtables for
each entry. If the keys are mutable then array indexing is a poor choice -
you will need to constantly rebuild the arrays.
Make the hashtables store pointers to reduce memory overhead and avoid
synchronisation problems.
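
One way to read the pointer suggestion, sketched under assumptions: the records live once in a plain array, and the hash table holds only pointers into it, so an update through a looked-up pointer changes the single stored copy. The names `struct record`, `struct rec_index`, `index_add`, `index_find`, the `payload` field, and the fixed capacity of 16 are all hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

struct record {               /* hypothetical record layout */
    int32_t k1, k2, k3;       /* composite key */
    double  payload;          /* stands in for the n-tuple of data */
};

#define INDEX_CAP 16          /* power of two; illustrative size only */

struct rec_index {
    struct record *slot[INDEX_CAP];   /* NULL marks a free slot */
};

static size_t home_slot(int32_t k1, int32_t k2, int32_t k3)
{
    uint32_t h = ((uint32_t)k1 * 31u + (uint32_t)k2) * 31u + (uint32_t)k3;
    return h & (INDEX_CAP - 1);
}

/* Store a pointer to r in the first free slot on its probe path.
   The caller must keep fewer than INDEX_CAP records indexed,
   otherwise the probing loops here and in index_find never end. */
void index_add(struct rec_index *ix, struct record *r)
{
    size_t i = home_slot(r->k1, r->k2, r->k3);
    while (ix->slot[i] != NULL)
        i = (i + 1) & (INDEX_CAP - 1);
    ix->slot[i] = r;
}

/* Return the record with the given key, or NULL if it is not indexed. */
struct record *index_find(const struct rec_index *ix,
                          int32_t k1, int32_t k2, int32_t k3)
{
    size_t i = home_slot(k1, k2, k3);
    while (ix->slot[i] != NULL) {
        struct record *r = ix->slot[i];
        if (r->k1 == k1 && r->k2 == k2 && r->k3 == k3)
            return r;
        i = (i + 1) & (INDEX_CAP - 1);
    }
    return NULL;
}
```

Each slot costs one pointer rather than a whole record, so keeping the table half empty wastes far less memory than it would with records stored inline.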

