How to determine end-of-line sequence?

J Krugman · Dec 16, 2003

I'm writing a CGI script that uploads and processes files. These
files can come from Windows, Linux, or Macintosh machines, so the
script doesn't know ahead of time the end-of-line conventions used
by the uploaded files. What is the best way for a script to
determine the end-of-line sequence used by a given text file?

TIA,

jill

David Efflandt · Dec 17, 2003

It is probably best to convert the file to the line ending you want,
regardless of what it currently uses. Following is a script I wrote a
while ago that converts a text file of any type to the type you want,
based on the script name. It could easily be modified (shortened) to
convert to a single type:

#!/usr/bin/perl -w
# txtconv - convert text to or from other OS
# Symlink or rename any of: tounix, 2unix, todos, 2dos, tomac, 2mac
# follow with list of files on commandline
# "tomac" may conflict with a program name
use strict;

# Pick the target line ending from the script name before touching any
# file, so that a wrong script name cannot leave a file truncated.
my $end;
if    (lc($0) =~ /2unix|tounix/) { $end = "\012" }
elsif (lc($0) =~ /2dos|todos/)   { $end = "\015\012" }
elsif (lc($0) =~ /2mac|tomac/)   { $end = "\015" }
else  { die "file not converted, read $0\n" }

while (@ARGV) {
    my $file = shift @ARGV;
    unless (open(FILE, "+< $file")) { warn "Can't open $file: $!\n"; next; }
    # exclusive lock, rewind, and turn off any CRLF translation
    flock(FILE,2); seek(FILE,0,0); binmode FILE;
    print "$file before ", -s $file;
    my @lines = <FILE>; seek(FILE,0,0); truncate(FILE,0);
    # match CRLF first, then a lone CR or LF
    foreach (@lines) { s/(\015\012|[\015\012])/$end/g; print FILE $_; }
    close FILE;
    print " after ", -s $file, "\n";
}
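
For the CGI case, where the upload may already be in memory, the same idea can be sketched as two small functions: one that guesses which line ending a piece of text uses, and one that converts it to a single type. This is only a rough sketch and not part of the script above. The names detect_eol and normalize_eol are made up for illustration, and the "most frequent ending wins" rule (with CRLF winning ties) is an assumption about what "the" line ending of a mixed file should mean.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): guess the dominant line ending
# in a string by counting each kind. Returns "\015\012", "\012", "\015",
# or undef when the text contains no line ending at all.
sub detect_eol {
    my ($text) = @_;
    my $crlf = () = $text =~ /\015\012/g;
    my $lf   = () = $text =~ /(?<!\015)\012/g;   # LF not preceded by CR
    my $cr   = () = $text =~ /\015(?!\012)/g;    # CR not followed by LF
    return undef unless $crlf + $lf + $cr;
    return "\015\012" if $crlf >= $lf && $crlf >= $cr;
    return $lf >= $cr ? "\012" : "\015";
}

# Hypothetical helper: rewrite every line ending to a single type
# (LF unless told otherwise), using the same pattern as the script above.
sub normalize_eol {
    my ($text, $end) = @_;
    $end = "\012" unless defined $end;
    $text =~ s/\015\012|[\015\012]/$end/g;
    return $text;
}

# Sample DOS-style data standing in for an uploaded file.
my %name   = ("\015\012" => 'CRLF', "\012" => 'LF', "\015" => 'CR');
my $upload = "one\015\012two\015\012three\015\012";
my $eol    = detect_eol($upload);
print "detected: $name{$eol}\n";
print normalize_eol($upload);
```

In practice the detection step is often unnecessary: if the script only needs consistent input, calling normalize_eol on everything gives the same result whatever the source platform was.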

J Krugman · Dec 17, 2003

Cool. Thanks!

jill

Anno Siegel · Dec 17, 2003

[...]

flock(FILE,2); seek(FILE,0,0); binmode FILE;
^^^^^^^^^^^^^

Just out of interest -- why are you locking the file? I mean, EOL
conversion is not something that is normally done concurrently.

Anno

David Efflandt · Dec 18, 2003

Just force of habit from working with CGI: when converting a list of
files, the lock should keep anything else from modifying a file in the
middle of its conversion. I once had a situation where two scripts
merely reading the same file line by line at the same time would confuse
the file pointer or something, and both would loop endlessly (maybe
something peculiar to SunOS a while ago).
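
For anyone who wants to keep the lock, here is a rough sketch of the same in-place conversion with the flock(FILE,2) call written using the named constant from Fcntl and with its return value checked. The helper name convert_locked and the scratch-file demo are invented for illustration. Note that flock is advisory on Unix-like systems, so it only helps if every other program touching the file also calls flock.

```perl
use strict;
use warnings;
use Fcntl qw(:flock SEEK_SET);
use File::Temp qw(tempfile);

# Hypothetical helper (not from the thread): convert one file in place to
# the given line ending while holding an exclusive advisory lock.
sub convert_locked {
    my ($file, $end) = @_;
    open(my $fh, '+<', $file) or die "Can't open $file: $!\n";
    binmode $fh;
    flock($fh, LOCK_EX) or die "Can't lock $file: $!\n";   # blocks until granted
    my $text = do { local $/; <$fh> };                     # slurp the whole file
    $text = '' unless defined $text;
    $text =~ s/\015\012|[\015\012]/$end/g;
    seek($fh, 0, SEEK_SET) or die "Can't seek $file: $!\n";
    truncate($fh, 0)       or die "Can't truncate $file: $!\n";
    print {$fh} $text;
    close($fh) or die "Can't close $file: $!\n";           # closing releases the lock
    return length $text;
}

# Demo on a scratch file so that nothing real is modified.
my ($tmp_fh, $tmp_name) = tempfile(UNLINK => 1);
binmode $tmp_fh;
print {$tmp_fh} "one\015\012two\015three\012";
close $tmp_fh;
convert_locked($tmp_name, "\012");
```

LOCK_EX is the same value the original script passes as 2 on most systems; the named constant just makes the intent easier to read.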
