question about hash ordering

John · Oct 2, 2003

HI,

I have this.

@arr = qw(ab a b ab ba ba ab ba);

and I would like to classify the index with the values:

@output = ([0, 3, 6],
[1],
[2],
[4, 5, 7]
);
or in hash

%output = (
0 => [0, 3, 6],
1 => [1],
2 => [2],
4 => [4, 5, 7]
);

on this situation I don't know which approach is the best the array
one or the hash, in order to classify then and also in performance.

What will be you approach?

Thanks for your help

J

Sam Holden · Oct 2, 2003

HI,

I have this.

@arr = qw(ab a b ab ba ba ab ba);

and I would like to classify the index with the values:

@output = ([0, 3, 6],
[1],
[2],
[4, 5, 7]
);
or in hash

%output = (
0 => [0, 3, 6],
1 => [1],
2 => [2],
4 => [4, 5, 7]
);

on this situation I don't know which approach is the best the array
one or the hash, in order to classify then and also in performance.

It depends on what you want to do with it, and how big the data
will be, and numerous other factors.

For example, if you want to know what the nth unique element is
(where, for example, the 4th unique element in that example is 'ba'),
then the array will allow you to do so in constant time:

$arr[$output[$i][0]]

While the hash will require O(NlogN) time (you'd have to sort the keys).

Then again, if you know the index of the first occurance of an item you
wish to find the list of occurances for (or want to know if that index
is the first occurance of the item), then the hash approach will
allow you to do so in constant time, while the array approach will
require O(logN).

So performance depends on the operations you need (as always).

Tad McClellan · Oct 2, 2003

John said:
and I would like to classify the index with the values:

What will be you approach?

My approach would depend on how I plan to _use_ the data structures,
which you have not shared with us...

If performance really matters (it probably doesn't), then try it
both ways and benchmark them.

------------------------
#!/usr/bin/perl
use strict;
use warnings;
use Data:

umper;

my @arr = qw(ab a b ab ba ba ab ba);

my @array = by_array(@arr);
print Dumper \@array;

my %hash = by_hash(@arr);
print Dumper \%hash;

sub by_array {
my @array;

my %seen;
foreach my $i ( 0 .. $#_ ) {
$seen{$_[$i]} = $i unless exists $seen{$_[$i]};
push @{ $array[$seen{$_[$i]}] }, $i;
}

return @array;
}

sub by_hash {
my %hash;

my %seen;
foreach my $i ( 0 .. $#_ ) {
$seen{$_[$i]} = $i unless exists $seen{$_[$i]};
push @{ $hash{$seen{$_[$i]}} }, $i;
}

return %hash;
}

Need help for javascript code	3	Sep 28, 2022
Minimum Total Difficulty	0	Nov 15, 2023
C program: memory leak/ segmentation fault/ memory limit exceeded	0	Nov 12, 2022
Hash key iteration order	2	Dec 3, 2010
Machine Learning.. Endless Struggle	3	Feb 16, 2023
Z-Ordering (Morton ordering) question	2	Nov 5, 2009
Hash	4	Dec 23, 2011
Code help please	4	May 19, 2023

question about hash ordering

John

Sam Holden

Tad McClellan

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads