Load file into a hash

Bill H · Oct 6, 2007

Is there a "perl" way of loading a file directly into a hash instead
of using something like this quick example:

open(FILE,"test.txt");
while(<FILE>)
{
$line = $_;
chop $line;
@dbf = split(/\t/,$line);
$MYHASH{$dbf[0]} = $dbf[1];
}
close(FILE);

where the text file contains entries like this:

NAME0\tsome value
NAME1\tanother value

etc... ?

Bill H

Mark Clements · Oct 6, 2007

Bill said:
Is there a "perl" way of loading a file directly into a hash instead
of using something like this quick example:

open(FILE,"test.txt");
while(<FILE>)
{
$line = $_;
chop $line;
@dbf = split(/\t/,$line);
$MYHASH{$dbf[0]} = $dbf[1];
}
close(FILE);

where the text file contains entries like this:

NAME0\tsome value
NAME1\tanother value

C:\TEMP>cat loadarray.pl
#!perl

use strict;
use warnings;

use Data:

umper;

my $filename = shift;
open my $fh,"<",$filename or die $!;

my %hash = map { chomp; split /\t/ } <$fh>;

print Dumper(\%hash);

close $fh or die $!;

C:\TEMP>cat data.txt
UK London
France Paris
Italy Rome
USA Washington
Germany Berlin

C:\TEMP>loadarray.pl data.txt
$VAR1 = {
'France' => 'Paris',
'UK' => 'London',
'Italy' => 'Rome',
'Germany' => 'Berlin',
'USA' => 'Washington'
};

C:\TEMP>

Mark

Bill H · Oct 6, 2007

Bill said:
Bill said:

Is there a "perl" way of loading a file directly into a hash instead
of using something like this quick example:

Click to expand...

open(FILE,"test.txt");
while(<FILE>)
{
$line = $_;
chop $line;
@dbf = split(/\t/,$line);
$MYHASH{$dbf[0]} = $dbf[1];
}
close(FILE);

Click to expand...

where the text file contains entries like this:

Click to expand...

NAME0\tsome value
NAME1\tanother value

Click to expand...

C:\TEMP>cat loadarray.pl
#!perl

use strict;
use warnings;

use Data:umper;

my $filename = shift;
open my $fh,"<",$filename or die $!;

my %hash = map { chomp; split /\t/ } <$fh>;

print Dumper(\%hash);

close $fh or die $!;

C:\TEMP>cat data.txt
UK London
France Paris
Italy Rome
USA Washington
Germany Berlin

C:\TEMP>loadarray.pl data.txt
$VAR1 = {
'France' => 'Paris',
'UK' => 'London',
'Italy' => 'Rome',
'Germany' => 'Berlin',
'USA' => 'Washington'
};

C:\TEMP>

Mark- Hide quoted text -

- Show quoted text -

I like that Mark. You basically took everything I had in the while
loop and put it on one line. Nice and neat.

Bill H

Brian McCauley · Oct 6, 2007

my %hash = map { chomp; split /\t/ } <$fh>;

That works but is very fragile - one bad line can screw all your data
from then on.

I prefer (the canonical idiom)

my %hash = map { /(.*?)\t(.*)/ } <$fh>;

This will ignore lines with no "\t" in them. Do something vaguely
reasonable with lines containing more than one "\t". Oh, and it's
shorter too.

Brian McCauley · Oct 6, 2007

my %hash = map { /(.*?)\t(.*)/ } <$fh>;

Oh, and since we're slurping the file anyhow we can save a few lines
by using File::Slurp

use File::Slurp;
my %hash = map { /(.*?)\t(.*)/ } read_file('test.txt');

Martijn Lievaart · Oct 6, 2007

That works but is very fragile - one bad line can screw all your data
from then on.

I prefer (the canonical idiom)

my %hash = map { /(.*?)\t(.*)/ } <$fh>;

This will ignore lines with no "\t" in them. Do something vaguely
reasonable with lines containing more than one "\t". Oh, and it's
shorter too.

Doesn't that include the newline on any line in the second field?

M4

Tad McClellan · Oct 6, 2007

Martijn Lievaart said:
On Sat, 06 Oct 2007 14:06:18 +0000, Brian McCauley wrote:

Doesn't that include the newline on any line in the second field?

Which part of the regex can match those newlines?

Tad McClellan · Oct 6, 2007

Bill H said:
Is there a "perl" way of loading a file directly into a hash instead
of using something like this quick example:

open(FILE,"test.txt");

You should always, yes *always*, check the return value from open().

while(<FILE>)
{
$line = $_;

If you want it in $line, then put it there, rather than put it
somewhere else and then move it there:

chop $line;

You should use chomp() to remove newlines.

my %myhash = split /[\t\n]/, do{ local $/; <FILE>};

Martijn Lievaart · Oct 7, 2007

Which part of the regex can match those newlines?

You're right. '.' does not match newlines. Stupid me.

M4

Uri Guttman · Oct 9, 2007

BM> Oh, and since we're slurping the file anyhow we can save a few lines
BM> by using File::Slurp

BM> use File::Slurp;
BM> my %hash = map { /(.*?)\t(.*)/ } read_file('test.txt');

i was going to chime in with slurp as well.

i have a better and faster idiom for slurping files into hashes
(untested for typos):

my %hash = read_file('test.txt') =~ /^([^\t]+)\t(.*)$/gm ;

and for most config type files slurping is fast since they are likely
smaller than even the OS's I/O block size which can be 64k or even
256k. there is no savings using perl's i/o system and reading many files
line by line. it is a great teaching technique but it is slower than
slurping in a whole file and parsing it in one regex call (as is
possible in many cases).

uri

Push regex search result into hash with multiple values	14	May 19, 2014
File Contents into Hash Table?	6	Aug 27, 2010
hash of arrays	1	Sep 13, 2012
Read a hash	5	Apr 28, 2011
How to parse text file into hash table	3	Apr 24, 2007
hash table usage questions	41	Dec 30, 2008
passing a reference to a hash to another page	2	May 4, 2012
suitable key for a hash	8	Oct 12, 2010

Load file into a hash

Bill H

Mark Clements

Bill H

Brian McCauley

Brian McCauley

Martijn Lievaart

Tad McClellan

Tad McClellan

Martijn Lievaart

Uri Guttman

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads