Removing CR/LF from Unix, Windows, and mixed files

nntpman68 · Sep 11, 2008

Hi,

I have files that I'd like to slurp into arrays
(one file per array).
However, I'd like to get rid of the end-of-line characters.
As the files were created by Windows or Linux users, the lines end
with either \n or \r\n.
Additionally, Linux files might have been modified by Windows users
(or vice versa), and not all editors are smart enough to adapt to the
file's mode, so some files might have mixed line endings.

I came up with

@a = <$filehandle>;
foreach $l (@a) { $l =~ s/(\r\n|\r)$//; }

or with

@a = grep { s/(\r\n|\r)$// } <$filehandle>;

or with

@a = ();
while (<$fh>) { s/(\r\n|\r)$//; push(@a, $_); }

In the above examples I could also replace the substitution with tr/\r\n//d.

Is there already something like a strip_eof() function, or should I
stick with one of the above?

nntpman68 · Sep 11, 2008

Oops, minor typo.

The last sentence should have been:
"Is there already something like a strip_eol() function"
and not "strip_eof()"

Tad J McClellan · Sep 12, 2008

nntpman68 said:
> I have files that I'd like to slurp into arrays
> (one file per array).
> However, I'd like to get rid of the end-of-line characters.
> As the files were created by Windows or Linux users, the lines end
> with either \n or \r\n.
> Additionally, Linux files might have been modified by Windows users
> (or vice versa), and not all editors are smart enough to adapt to the
> file's mode, so some files might have mixed line endings.
>
> I came up with
>
> @a = <$filehandle>;
> foreach $l (@a) { $l =~ s/(\r\n|\r)$//; }

foreach $l (@a) { $l =~ s/\r?\n//; }
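
For illustration, here is a minimal sketch of that substitution applied to three kinds of line. The sample strings are made up for this example and are not from the original files.

```perl
use strict;
use warnings;

# Hypothetical sample lines: Unix, Windows, and a final line with no newline.
my @lines = ("unix\n", "windows\r\n", "no newline");

# Remove an optional CR followed by LF from each element, in place.
foreach my $l (@lines) { $l =~ s/\r?\n//; }

print join('|', @lines), "\n";   # prints: unix|windows|no newline
```

Because the \r is optional, the same pattern handles both line-ending styles, and a line without any terminator is left unchanged.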

Jim Gibson · Sep 12, 2008

nntpman68 said:
> Hi,
>
> I have files that I'd like to slurp into arrays
> (one file per array).
> However, I'd like to get rid of the end-of-line characters.
> As the files were created by Windows or Linux users, the lines end
> with either \n or \r\n.
> Additionally, Linux files might have been modified by Windows users
> (or vice versa), and not all editors are smart enough to adapt to the
> file's mode, so some files might have mixed line endings.
>
> I came up with
>
> @a = <$filehandle>;
> foreach $l (@a) { $l =~ s/(\r\n|\r)$//; }

What about lines with just "\n" in them? This is a little shorter and
uses a character class instead of grouping and alternation:

s/[\r\n]+// for @a;
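
A small sketch comparing the original pattern with the character class, assuming two made-up sample lines. The show() helper is hypothetical and only makes the control characters visible.

```perl
use strict;
use warnings;

# Hypothetical helper: render CR and LF as visible escapes for printing.
sub show { my $s = shift; $s =~ s/\r/\\r/g; $s =~ s/\n/\\n/g; return $s }

my @orig  = ("unix\n", "windows\r\n");
my @fixed = @orig;

s/(\r\n|\r)$// for @orig;    # original pattern: needs a "\r" to match
s/[\r\n]+//    for @fixed;   # character class: strips "\n", "\r\n", or "\r"

print "original pattern: ", join(' ', map { show($_) } @orig),  "\n";
print "character class:  ", join(' ', map { show($_) } @fixed), "\n";
```

The first line of output still shows a trailing \n on the Unix line, which is the case the question above points at.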

> or with
>
> @a = grep { s/(\r\n|\r)$// } <$filehandle>;

You will drop lines that don't have "\r" in them. You should probably
use map instead of grep here.
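
To make the difference concrete, here is a sketch with two made-up input lines. grep keeps only the elements for which the block returns true, and s/// returns the number of substitutions made, so a line where nothing was replaced is filtered out.

```perl
use strict;
use warnings;

my @input = ("unix\n", "windows\r\n");   # hypothetical sample lines

# grep: the Unix line has no "\r", s/// returns false, and the line is dropped.
my @copy      = @input;
my @with_grep = grep { s/(\r\n|\r)$// } @copy;

# map: every line is kept; the block returns the cleaned copy.
my @with_map = map { my $l = $_; $l =~ s/\r?\n$//; $l } @input;

print scalar(@with_grep), " line(s) from grep, ",
      scalar(@with_map),  " line(s) from map\n";
# prints: 1 line(s) from grep, 2 line(s) from map
```

The map block copies $_ before substituting so that the source list is not modified through the alias.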

> or with
>
> @a = ();
> while (<$fh>) { s/(\r\n|\r)$//; push(@a, $_); }
>
> In the above examples I could also replace the substitution with tr/\r\n//d.

That would be a good idea.
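
One thing worth keeping in mind: tr/\r\n//d deletes every CR and LF in the string, not only those at the end. A short sketch with made-up sample lines, one of which has a stray CR in the middle:

```perl
use strict;
use warnings;

# Hypothetical sample lines; the third has a CR embedded mid-line.
my @a = ("unix\n", "windows\r\n", "embedded\rcr\r\n");

# Delete all CR and LF characters from each element, in place.
tr/\r\n//d for @a;

print join('|', @a), "\n";   # prints: unix|windows|embeddedcr
```

For lines read with <$fh> this usually makes no difference, since a LF can only appear at the end of each line, but an embedded CR would be removed as well.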

> Is there already something like a strip_eof() function, or should I
> stick with one of the above?

No. Perl's built-in functions are described in 'perldoc perlfunc' and
by 'perldoc -f xxx'. If you don't find something there, then you can
start looking at CPAN (<http://search.cpan.org>).
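
Since there is no built-in for this, a small helper is easy to write. The sketch below is only one possible shape: the name strip_eol is hypothetical (it is not a Perl built-in or a known CPAN function), and the in-memory string stands in for a real file with mixed endings.

```perl
use strict;
use warnings;

# Hypothetical helper: read all lines from a filehandle and strip
# a trailing "\n" or "\r\n" from each one.
sub strip_eol {
    my ($fh) = @_;
    my @lines = <$fh>;
    s/\r?\n\z// for @lines;
    return @lines;
}

# In-memory "file" with mixed line endings and no final newline.
my $data = "one\r\ntwo\nthree\r\nfour";
open my $fh, '<', \$data or die "open: $!";
my @a = strip_eol($fh);
close $fh;

print join('|', @a), "\n";   # prints: one|two|three|four
```

The \z anchor pins the match to the very end of each string, so CR or LF characters elsewhere in a line are left alone.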
