references to filehandle?

Stefan H. · Sep 27, 2004

the first field of my data is the _name_ of the measure.

I need to create from the big file one file per measure containing only
data from that measure. The name of each file must be the same of
measure: ie

bigfile.csv
123 rms 12 132
2312 qrt 12 231
2342 sse 12 231

rms.csv
123 rms 12 132

qrt.csv
2342 sse 12 231

the measure names are changing in name and number, then I cannot code
it. I'd like to do

sub split_measures {

my (%splits);

for (<MYFILE>) {
$splits{[split /;/]->[1]} = '';
}

for (keys %splits) {
open $_, ">$_.csv";
}

for (<MYFILE>) {
print [split /;/;]->1 $_;
}

the error I get is that "strict refs" doesn't permit that. Why? It's
safe to remove that clause? Is there a better way to do that?

Thanks
Stefan

A. Sinan Unur · Sep 27, 2004

the first field of my data is the _name_ of the measure.

Actually, looking at the lines below, it looks like the second field is the
name of the measure, whatever that might mean.

I need to create from the big file one file per measure containing only
data from that measure. The name of each file must be the same of
measure: ie

bigfile.csv
123 rms 12 132
2312 qrt 12 231
2342 sse 12 231

rms.csv
123 rms 12 132

qrt.csv
2342 sse 12 231

the measure names are changing in name and number, then I cannot code
it. I'd like to do

sub split_measures {

my (%splits);

for (<MYFILE>) {
$splits{[split /;/]->[1]} = '';
}

Ahem ... I do not see anything that is separated using ;

for (keys %splits) {
open $_, ">$_.csv";
}

for (<MYFILE>) {
print [split /;/;]->1 $_;
}

Whoa!

D:\Home\test> perl -c s.pl
Number found where operator expected at s.pl line 14, near "->1"
(Missing operator before 1?)
Scalar found where operator expected at s.pl line 14, near "1 $_"
(Missing operator before $_?)
syntax error at s.pl line 14, near "/;/;"
Missing right curly or square bracket at s.pl line 17, at end of line
s.pl had compilation errors.

Did you actually run this thing? You should always post a short, self-
contained script others can run to see what is going on.

the error I get is that "strict refs" doesn't permit that. Why? It's
safe to remove that clause? Is there a better way to do that?

This is how I might do it:

use strict;
use warnings;

while(<DATA>) {
chomp;
if(my @fields = /^\s*(\d+)\s+(\w+)\s+(\d+)\s+(\d+)\s*$/) {
# Using '>>' so as to account for multiple
# lines for a given measure
if(open my $out, '>>', "$fields[1].csv") {
print $out "@{[ join ';', @fields ]}\n";
} else {
warn "Cannot open $fields[1].csv: $!";
}
}
}

__DATA__
123 rms 12 132
2312 qrt 12 231
2342 sse 12 231

__END__

Sinan.

Stefan H. · Sep 27, 2004

Ahem ... I do not see anything that is separated using ;

sorry, I forgot to mention that the data files are semicolon separated
file

print $out "@{[ join ';', @fields ]}\n";
} else {
warn "Cannot open $fields[1].csv: $!";

you don't close the open filehandles. Is this ok?

Thank you very much.

A curiosity:

my code

for (<MYFILE>) {
$splits{[split /;/]->[1]} = '';
}

for (keys %splits) {
open $_, ">$_.csv";
}

was totally wrong? I mean: is it not possible to open filehandles using
scalar variables like that?

And:
$splits{[split /;/]->[1]} = '';

is it correct?

Thank you again
Stefan

Joe Smith · Sep 27, 2004

Stefan said:
for (keys %splits) {
open $_, ">$_.csv";
}

was totally wrong? I mean: is it not possible to open filehandles using
scalar variables like that?

Use a hash to store lexical file handles. Also use 3-argument open().

my %fh; # Hash of file handles
for (keys %splits) {
open {$fh{$_}},'>>',"$_.csv";
}
...
$name = $fields[1];
print {$fh{$name}} $_;

-Joe

Michele Dondi · Sep 27, 2004

I need to create from the big file one file per measure containing only
data from that measure. The name of each file must be the same of
measure: ie

bigfile.csv
123 rms 12 132
2312 qrt 12 231
2342 sse 12 231

rms.csv
123 rms 12 132

I see *basically* two possible approaches:

1. One-pass, repeatedly open()ing and close()ing FHs in '>>' mode,
2. Two-pass, collecting the data and printing it out later.

If bigfile is not really *too big* I'd favour the second solution.

for (<MYFILE>) {
$splits{[split /;/]->[1]} = '';

^^^
^^^

I assume that fields are really semicolon separated rather than
whitespace separated, so (2) above *could* be something like this:

#!/usr/bin/perl

use strict;
use warnings;

my %data;
while (<>) {
my $m=(split /;/)[1] or
warn("Possibly wrong format!"), next;
$m .= '.csv';
push @{ $data{$m} }, $_;
}

for (keys %data) {
open my $fh, '>', $_ or
die "Can't write to `$_': $!\n";
print $fh @{ $data{$_} };
}

__END__

Of course you could/should add finer checks according to how your real
data looks like, e.g.

(my $n=(split /;/)[1]) =~ /^[a-z]{3}$/ or # ...

for (keys %splits) {
open $_, ">$_.csv";
}

Actually you can't do this, you may at most open a lexical FH and
store it in a hash as a value corresponding to $_.

for (<MYFILE>) {
print [split /;/;]->1 $_;

^^^
^^^

Are you sure? ;-)

As a side note it doesn't really do any harm but it is not necessary
to create an anonymous array to dereference it soon after:

(split /;/)[1]

will do!

the error I get is that "strict refs" doesn't permit that. Why? It's
safe to remove that clause? Is there a better way to do that?

There are many better ways to do that. However now I understand what
you *wanted* to do. Indeed it suggests a viable "mixed" solution in
one pass by means of an orkish manouvre:

#!/usr/bin/perl

use strict;
use warnings;

my %fh;
while (<>) {
my $m=(split /;/)[1] or
warn("Possibly wrong format!"), next;
$m .= '.csv';
select $fh{$m} ||= do {
open my $fh, '>', $m or
die "Can't write to `$_': $!\n";
$fh;
};
print;
}

__END__

Here the possible problem is that depending on how many measures you
really have, you could hit the maximum number of open files your OS
permits...

HTH,
Michele

Michele Dondi · Sep 27, 2004

Sorry, I hadn't read Sinan Unur's reply yet when posting my own...

print $out "@{[ join ';', @fields ]}\n";

Click to expand...

^^^^
^^^^

you don't close the open filehandles. Is this ok?

Yes it is. In fact he's using a lexical FH: it will be automatically
closed when going out of scope (or more generally whene there will
remain no references to it).

Thank you very much.

A curiosity:

my code [snip]
was totally wrong? I mean: is it not possible to open filehandles using
scalar variables like that?

It is possible to open FHs using scalar variables. But not *like
that*. For another cmt on your approach, slightly revised, see my
other post in this thread.

And:
$splits{[split /;/]->[1]} = '';

is it correct?

This is syntactically correct. But then again see my other post for a
cmt on this line...

Michele

A. Sinan Unur · Sep 27, 2004

if(my @fields = /^\s*(\d+)\s+(\w+)\s+(\d+)\s+(\d+)\s*$/) {

Actually, that should be

if((my @fields = /^\s*(\d+)\s+(\w+)\s+(\d+)\s+(\d+)\s*$/) == 4) {

Sorry, late night post.

Sinan.

A. Sinan Unur · Sep 27, 2004

sorry, I forgot to mention that the data files are semicolon separated
file

Well, it looks like you should also read the posting guidelines posted here
frequently. The code you posted was not runnable and the data you posted
was not real. That is not nice.

print $out "@{[ join ';', @fields ]}\n";
} else {
warn "Cannot open $fields[1].csv: $!";

Click to expand...

you don't close the open filehandles. Is this ok?

THe original code was:

if(open my $out, '>>', "$fields[1].csv") {
print $out "@{[ join ';', @fields ]}\n";

Click to expand...

I know Michele has already responded to this but here's my two cents.

The crucial part is that the file is opened using open my $out, i.e. the
scope of $out is limited to the if-block. $out is a lexical filehandle.
Lexical filehandles are automatically closed upon going out of scope. On
the other hand, you can also explicitly close them. In fact, that would be
the only way to catch a failure on close.

Sinan.

Michele Dondi · Sep 27, 2004

open my $fh, '>', $m or
die "Can't write to `$_': $!\n";

Sorry, this should be

open my $fh, '>', $m or
die "Can't write to `$m': $!\n";

of course.

Hope there are no more typos left,
Michele
--
#!/usr/bin/perl -lp
BEGIN{*ARGV=do{open $_,q,<,,\$/;$_}}s z^z seek DATA,11,$[;($,
=ucfirst<DATA>)=~s x .*x q^~ZEX69l^^q,^2$;][@,xe.$, zex,s e1e
q 1~BEER XX1^q~4761rA67thb ~eex ,s aba m,P..,,substr$&,$.,age
__END__

A. Sinan Unur · Sep 27, 2004

open my $fh, '>', $m or
die "Can't write to `$m': $!\n";

of course.

Hope there are no more typos left,

Strictly speaking, not a typo, but I am going to suggest dropping the \n
from the error message.

Sinan.

Michele Dondi · Sep 27, 2004

open my $fh, '>', $m or
die "Can't write to `$m': $!\n"; [snip]
Hope there are no more typos left,

Click to expand...

Strictly speaking, not a typo, but I am going to suggest dropping the \n
from the error message.

Hehe! Opinions tend to vary here... IMHO the user should not be
interested in the additional info that omitting \n supplies, and I
find that generally this is the case. Well in this case, he/she may be
interested in the line of input that triggered it, but hopefully
knowing "which $m" is the guilty one should be enough. OTOH you should
have noticed that my code contained also a \n-less warn() because in
that case it seemed to mo more meaningful to do so...

Michele

Abhinav · Sep 28, 2004

Michele said:
open my $fh, '>', $m or
die "Can't write to `$m': $!\n";

Click to expand...

[snip]

Hope there are no more typos left,

Click to expand...

Strictly speaking, not a typo, but I am going to suggest dropping the \n

Click to expand...

from the error message.

Click to expand...

Hehe! Opinions tend to vary here... IMHO the user should not be
interested in the additional info that omitting \n supplies, and I
find that generally this is the case. Well in this case, he/she may be
interested in the line of input that triggered it, but hopefully
knowing "which $m" is the guilty one should be enough. OTOH you should
have noticed that my code contained also a \n-less warn() because in
that case it seemed to mo more meaningful to do so...

That was interesting behaviour(for me..). Where can I find more about this ?

Regards
Abhinav

Tad McClellan · Sep 28, 2004

Abhinav said:
That was interesting behaviour(for me..). Where can I find more about this ?

If you are interested in the die() function, an offbeat place to
look might be the documentation for the die() function...

perldoc -f die

Please stop asking hundreds of people around the world to read
the docs to you, simply read them yourself and post only if you
still have questions after that.

Abhinav · Sep 29, 2004

Tad McClellan wrote:
[SNIP]

Please stop asking hundreds of people around the world to read
the docs to you, simply read them yourself and post only if you
still have questions after that.

I am sorry I asked an obviously inane question like that. Won't happen again.

Thanks
Abhinav

--

Michele Dondi · Sep 29, 2004

That was interesting behaviour(for me..). Where can I find more about this ?

perldoc -f die

Michele

Perl Strings vs FileHandle	9	Sep 6, 2008
Assigning another filehandle to STDOUT, using binmode.	29	Jun 19, 2007
CGI table tidy layout possible?	1	Dec 9, 2011
Trouble with prediction code, for the life of me I can't figure out why it isnt running properly. Help would be appreciated.	0	Jul 8, 2023
I am trying to make an auto-play thing. How do I make it work?	5	Apr 5, 2022
Using References to Formats?	2	Jul 8, 2006
MIME::Entity attaching from an open filehandle?	1	Feb 11, 2004
File handling with subroutines and references	16	Jan 17, 2006

references to filehandle?

Stefan H.

A. Sinan Unur

Stefan H.

Joe Smith

Michele Dondi

Michele Dondi

A. Sinan Unur

A. Sinan Unur

Michele Dondi

A. Sinan Unur

Michele Dondi

Abhinav

Tad McClellan

Abhinav

Michele Dondi

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads