hash two keys to one index

Chris Uppal · Nov 27, 2006

Mark said:
Newly allocated objects are usually adjacent in memory regardless of
their type. Depending on the garbage collector, objects with links to
each other may well be copied to nearby locations in the main heap.

[I'm mostly repeating myself, but just to bring some closure to this]

Sure, but hash-tables don't tend to place sequentially-allocated objects in
sequential slots. The GC will probably group the chain-links together, but
probably not the objects refered to by the links. In the end open chaining adds
another layer of indirection, double-hashing adds an extra "bounce" to the
memory access patterns. It's up to the developer to choose (with or without
measurement) a design which appeals; my point is only that considerations of
the /number/ of probes doesn't exhaust the influences on performance.

-- chris

Mark Thornton · Nov 27, 2006

Chris said:
Mark Thornton wrote:

Newly allocated objects are usually adjacent in memory regardless of
their type. Depending on the garbage collector, objects with links to
each other may well be copied to nearby locations in the main heap.

Click to expand...

[I'm mostly repeating myself, but just to bring some closure to this]

Sure, but hash-tables don't tend to place sequentially-allocated objects in
sequential slots. The GC will probably group the chain-links together, but
probably not the objects refered to by the links.

Allegedly the train collector is likely to do exactly that depending on
what other links exist and whether the objects are of similar 'age' to
the hash table.

measurement) a design which appeals; my point is only that considerations of
the /number/ of probes doesn't exhaust the influences on performance.

Indeed not. Which is why it is good to have many different
implementations of Map.

Mark Thornton

Mark · Dec 5, 2006

Chris said:
When considering the efficiency of data-structures in the modern world, it's
always worth thinking a bit about cache effects. Both open chaining, and any
form of double hashing, have poor effects on locality-of-reference, with memory
accesses jumping all over the shop rather than sequential.

May not make much difference in Java (which tends to be cache-unfriendly anyway
for objects), but worth thinking about. Especially if you end up considering
on-disk structures (I realise that isn't relevant to your immediate task).

-- chris

What about with LRU associative mapping? Then it doesn't really matter
where it's stored in memory -- although you might only have one useful
value in each cache block rather than the multiple you might get by not
using a sparse array.

Accessing array index addresses with custom datatype in a function	0	Jun 2, 2022
How to have two html audio players on one page?	0	May 3, 2022
Minimising chi square to fit two parameters	1	Dec 11, 2022
JS querySelector addEventListenerer index getElementsByClassName parent div only	1	Jan 25, 2023
<Button ...> display is fine, except for two things	1	Oct 23, 2023
Add a list of videos each one in a different button in a web page	1	Dec 10, 2022
%hash + @keys -> @value_refs; existing associations only	3	Mar 15, 2012
Hash key types and equality of hash keys	2	Mar 1, 2012

hash two keys to one index

Chris Uppal

Mark Thornton

Mark

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads