Array dereferencing

Jan Fure · Jun 25, 2003

Hi;

In the code below, I am creating a data structure of array references.
I want to loop through the keys, and for each key, which corresponds
to multiple values, I want to dereference the data to get a plain
array containing the values. I have read 'perlref', but it is still
not clear to me.

I made the code based on the 'Hashes with Multiple Values Per Key'
from the Perl Cookbook, and also inspired by some posts to this group.

I used Data::Dumper to see that I have the right data.

my %data;
for $row ( @DATA2 ){ #@DATA2 is an array of arrays
    push @{ $data{$$row[0]} }, $$row[1];
}
my @points;
foreach ( sort { $a <=> $b } keys %data ) {
    @{$data{$_}} = sort { $a <=> $b } @{$data{$_}};
    push @{$points[0]}, $_;
    push @{$points[1]}, $data{$_};
}

Jay Tilton · Jun 25, 2003

Jan Fure wrote:

: In the code below, I am creating a data structure of array references.
: I want to loop through the keys, and for each key, which corresponds
: to multiple values, I want to dereference the data to get a plain
: array containing the values. I have read 'perlref', but it is still
: not clear to me.

Could you phrase that in the form of a question? It's not clear what
you are misunderstanding.

: my %data;
: for $row ( @DATA2 ){ #@DATA2 is an array of arrays
:     push @{ $data{$$row[0]} }, $$row[1];
: }
: my @points;
: foreach ( sort { $a <=> $b } keys %data ) {
:     @{$data{$_}} = sort { $a <=> $b } @{$data{$_}};
      ^^^^^^^^^^^^                      ^^^^^^^^^^^^
$_ is a hash key.

$data{$_} is the hash value for that key. That value is an array
reference.

@{ $data{$_} } is the referenced array.

Looks like you figured it out pretty well on your own.

:     push @{$points[0]}, $_;
:     push @{$points[1]}, $data{$_};
: }
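
The three levels described above can be seen side by side in a minimal sketch. The sample rows in @DATA2 are made up for illustration, and the names $ref and @values are not from the original code:

```perl
use strict;
use warnings;

# Hypothetical sample rows standing in for @DATA2: [ key, value ] pairs.
my @DATA2 = ( [ 2, 7 ], [ 1, 5 ], [ 1, 4 ], [ 2, 6 ] );

my %data;
for my $row (@DATA2) {
    push @{ $data{ $row->[0] } }, $row->[1];
}

for my $key ( sort { $a <=> $b } keys %data ) {
    my $ref    = $data{$key};                 # the hash value: an array reference
    my @values = sort { $a <=> $b } @$ref;    # the referenced array, copied and sorted
    my $count  = @values;                     # an array in scalar context gives its length
    print "key $key has $count values: @values\n";
}
```

Copying into @values leaves the array stored in the hash untouched; assigning back through @{ $data{$key} }, as the original code does, changes it in place.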

James E Keenan · Jun 25, 2003

Jan Fure said:

: Hi;
:
: In the code below, I am creating a data structure of array references.
: I want to loop through the keys, and for each key, which corresponds
: to multiple values, I want to dereference the data to get a plain
: array containing the values. I have read 'perlref', but it is still
: not clear to me.

I think you're confusing two different aspects of dealing with data
structures. If you are "creating a data structure of array references," you
are concerned with putting data *into* either an array of arrays or a hash
of arrays. But if you "want to dereference the data to get a plain array"
you are concerned with getting data *out* of a pre-existing hash of arrays.
(Your use of the term "key" effectively narrows the choice of data
structures to a hash of arrays.)

How you get data *into* the hash of arrays depends on the data source.
Suppose that the data source is a file which you read line-by-line in a
while() loop and that the raw data looks like this:

alpha gomez halter icicle
beta halter jocular kingdom lambda
gamma beta gomez lambda zebra

...where you can for arbitrary reasons assume that the first word in each
line can serve as a unique identifier for that line. Then you can load up
your data structure like this:

my (%data);
while (<FILE>) {
    my @line = split; # by default, split on whitespace
    $data{$line[0]} = [ @line ];
}

To get data out of this structure, code like this:

foreach (keys %data) {
    print "Key: $_\tValues: @{$data{$_}}\n";
}

See if you can reduce your code to this level of simplicity. My hunch is
that the example you took from the Cookbook is more elaborate than you
really need and that that is confusing you. Also, get the code to do what
you want *without sorting the keys first*. If you aren't clear on what
you're doing, throwing in the sort function will only confuse you more.
HTH.
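
For anyone who wants to run the load-then-read pattern above without a data file, here is a minimal self-contained sketch. It makes two assumptions that are not in the post: an in-memory list (@lines) replaces the FILE handle, and the key is kept out of the stored values rather than stored along with them.

```perl
use strict;
use warnings;

# Sample lines copied from the post; an in-memory list replaces the file.
my @lines = (
    'alpha gomez halter icicle',
    'beta halter jocular kingdom lambda',
    'gamma beta gomez lambda zebra',
);

my %data;
for my $line (@lines) {
    my ( $key, @rest ) = split ' ', $line;    # first word is the key
    $data{$key} = [@rest];                    # store a reference to a fresh copy
}

for my $key ( keys %data ) {
    my @values = @{ $data{$key} };            # dereference into a plain array
    my $count  = @values;
    print "Key: $key\tCount: $count\tValues: @values\n";
}
```

The keys come back in no particular order, which is expected for a hash.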

Jan Fure · Jun 26, 2003

: Could you phrase that in the form of a question? It's not clear what
: you are misunderstanding.

Given data of the form:

1 2 4
1 2 5
1 3 4.5
2 2 6
2 3 7

How can I order it like:
([1,(2 2 3), (4 5 4.5)], [2, (2 3), (6 7)])

In verbal form, I want to group the values from columns 2 to n by
those corresponding to the same value in column 1, for further
processing for the purpose of finding mean, median, standard deviation
etc.

The ultimate goal is to get an output file like:

1 2.333 4.5
2 2.5 6.5

for the data above in the case of mean/average. To get this, I need to
have some array to loop through, and calculate the statistics for each
row.

In my first post, I showed the code which generated a data structure
with multiple data pairs with the same key, but I am stuck at that
point, as my understanding of referencing is too poor to properly
access the data.
Data::Dumper showed me that the data appears to be properly ordered.

Jan Fure

Michael Budash · Jun 26, 2003

Jan Fure wrote:

: Jay Tilton wrote:
:
: : Could you phrase that in the form of a question? It's not clear what
: : you are misunderstanding.
:
: Given data of the form:
:
: 1 2 4
: 1 2 5
: 1 3 4.5
: 2 2 6
: 2 3 7
:
: How can I order it like:
: ([1,(2 2 3), (4 5 4.5)], [2, (2 3), (6 7)])
:
: In verbal form, I want to group the values from columns 2 to n by
: those corresponding to the same value in column 1, for further
: processing for the purpose of finding mean, median, standard deviation
: etc.
:
: The ultimate goal is to get an output file like:
:
: 1 2.333 4.5
: 2 2.5 6.5
:
: for the data above in the case of mean/average. To get this, I need to
: have some array to loop through, and calculate the statistics for each
: row.
:
: In my first post, I showed the code which generated a data structure
: with multiple data pairs with the same key, but I am stuck at that
: point, as my understanding of referencing is too poor to properly
: access the data.
: Data::Dumper showed me that the data appears to be properly ordered.
:
: Jan Fure

one way:

#---------------------------------------------------
use strict;
use Data::Dumper;

my @data = ("1 2 4",
            "1 2 5",
            "1 3 4.5",
            "2 2 6",
            "2 3 7",
           );

my %hash;
foreach (@data) {
    my @values = split /\s+/;
    foreach my $i (1..$#values) {
        push @{$hash{$values[0]}->[$i-1]}, $values[$i];
    }
}

my @final;
foreach (sort keys %hash) {
    push @final, [ $_, @{$hash{$_}} ];
}

print Dumper(@final);
#---------------------------------------------------

yields:

$VAR1 = [
          '1',
          [
            '2',
            '2',
            '3'
          ],
          [
            '4',
            '5',
            '4.5'
          ]
        ];
$VAR2 = [
          '2',
          [
            '2',
            '3'
          ],
          [
            '6',
            '7'
          ]
        ];
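
Getting values back out of a nested structure like this is the step the question was stuck on, so here is a minimal sketch of it. It assumes @final has exactly the shape dumped above (written out literally here so the snippet runs on its own); the names $row, @columns and @values are made up for illustration.

```perl
use strict;
use warnings;

# Same shape as the @final structure dumped above, written out literally.
my @final = (
    [ '1', [ 2, 2, 3 ], [ 4, 5, 4.5 ] ],
    [ '2', [ 2, 3 ],    [ 6, 7 ] ],
);

for my $row (@final) {
    my ( $key, @columns ) = @$row;           # the key, then one array ref per column
    for my $i ( 0 .. $#columns ) {
        my @values = @{ $columns[$i] };      # plain array holding that column's values
        my $colnum = $i + 2;                 # column number in the original input
        print "key $key, column $colnum: @values\n";
    }
}

# A single element can be reached directly with chained subscripts.
my $third = $final[0][1][2];                 # key 1, first value column, third value
print "third value: $third\n";
```

Between adjacent subscripts the arrow is optional, so $final[0][1][2] and $final[0]->[1]->[2] mean the same thing.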

John W. Krahn · Jun 26, 2003

Jan said:

: Given data of the form:
:
: 1 2 4
: 1 2 5
: 1 3 4.5
: 2 2 6
: 2 3 7
:
: How can I order it like:
: ([1,(2 2 3), (4 5 4.5)], [2, (2 3), (6 7)])
:
: In verbal form, I want to group the values from columns 2 to n by
: those corresponding to the same value in column 1, for further
: processing for the purpose of finding mean, median, standard deviation
: etc.
:
: The ultimate goal is to get an output file like:
:
: 1 2.333 4.5
: 2 2.5 6.5

Here is one way to do it:

#!/usr/bin/perl
use warnings;
use strict;

my %data;
my @keys;
while ( <DATA> ) {
    my ( $key, @data ) = split;
    push @keys, $key unless exists $data{ $key };
    push @{ $data{ $key }[ $_ ] }, $data[ $_ ] for 0 .. $#data;
}

for my $key ( @keys ) {
    print $key;
    for my $array ( @{ $data{ $key } } ) {
        my $sum;
        $sum += $_ for @$array;
        ( my $avg = $sum / @$array ) =~ s/(\.\d{3})\d+/$1/;
        print " $avg";
    }
    print "\n";
}

__DATA__
1 2 4
1 2 5
1 3 4.5
2 2 6
2 3 7

John
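
The question also mentions median and standard deviation, which the code above does not compute. A minimal sketch of those follows. The helper names mean, median and std_dev are hypothetical, and std_dev uses the sample form (n - 1 in the denominator), which is an assumption since the thread does not say which form is wanted.

```perl
use strict;
use warnings;

# Hypothetical helpers; each takes a reference to a non-empty array of numbers.
sub mean {
    my ($values) = @_;
    my $sum = 0;
    $sum += $_ for @$values;
    return $sum / @$values;
}

sub median {
    my ($values) = @_;
    my @sorted = sort { $a <=> $b } @$values;
    my $mid    = int( @sorted / 2 );
    return @sorted % 2
        ? $sorted[$mid]                                  # odd count: middle value
        : ( $sorted[ $mid - 1 ] + $sorted[$mid] ) / 2;   # even count: average the middle pair
}

# Sample standard deviation (assumed form); returns 0 for a single value.
sub std_dev {
    my ($values) = @_;
    return 0 if @$values < 2;
    my $mean   = mean($values);
    my $sum_sq = 0;
    $sum_sq += ( $_ - $mean )**2 for @$values;
    return sqrt( $sum_sq / ( @$values - 1 ) );
}

# Grouped columns for key 1 from the sample data.
my @columns = ( [ 2, 2, 3 ], [ 4, 5, 4.5 ] );
for my $col (@columns) {
    printf "mean %.3f  median %.3f  sd %.3f\n",
        mean($col), median($col), std_dev($col);
}
```

Each helper takes an array reference, so it can be called directly on the inner arrays of a hash of arrays, for example on each $array in the loop above.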
