Making a copy (not reference) of a file handle,or starting stdin over at line 0

Shawn Milochik · Aug 17, 2007

I wrote a script which will convert a tab-delimited file to a
fixed-width file, or a fixed-width file into a tab-delimited. It reads
a config file which defines the field lengths, and uses it to convert
either way.

Here's an example of the config file:

1:6,7:1,8:9,17:15,32:10

This converts a fixed-width file to a tab-delimited where the first
field is the first six characters of the file, the second is the
seventh, etc. Conversely, it converts a tab-delimited file to a file
where the first six characters are the first tab field, right-padded
with spaces, and so on.

What I want to do is look at the file and decide whether to run the
function to convert the file to tab or FW. Here is what works
(mostly):

x = inputFile.readline().split("\t")
inputFile.seek(0)

if len(x) > 1:
toFW(inputFile)
else:
toTab(inputFile)

The problem is that my file accepts the input file via stdin (pipe) or
as an argument to the script. If I send the filename as an argument,
everything works perfectly.

If I pipe the input file into the script, it is unable to seek() it. I
tried making a copy of inputFile and doing a readline() from it, but
being a reference, it makes no difference.

How can I check a line (or two) from my input file (or stdin stream)
and still be able to process all the records with my function?

Thanks,
Shawn

Peter Otten · Aug 17, 2007

Shawn said:
How can I check a line (or two) from my input file (or stdin stream)
and still be able to process all the records with my function?

One way:

from itertools import chain
firstline = instream.next()
head = [firstline]

# loop over entire file
for line in chain(head, instream):
process(line)

You can of course read more than one line as long as you append it to the
head list. Here's an alternative:

from itertools import tee
a, b = tee(instream)

for line in a:
# determine file format,
# break when done

# this is crucial for memory efficiency
# but may have no effect in implementations
# other than CPython
del a

# loop over entire file
for line in b:
# process line

Peter

XML parsing ExpatError with xml.dom.minidom at line 1, column 0	2	Feb 13, 2014
How to use PDF-lib and how to center each line of texts on the page?	1	Aug 16, 2023
How to run a python script with a configuration file at command line?	2	Jun 3, 2010
Python 3.2 bug? Reading the last line of a file	10	May 25, 2011
ANN: Version 0.1.2 of sarge (a subprocess wrapper library) has beenreleased.	0	Dec 17, 2013
Tkinter polling example: file copy with progress bar	7	Dec 12, 2010
Reading in cooked mode (was Re: Python MSI not installing, log fileshowing name of a Viatnemese comm	8	Mar 23, 2014
FAQ 5.2 How do I change, delete, or insert a line in a file, or append to the beginning of a file?	0	Feb 24, 2011

Making a copy (not reference) of a file handle,or starting stdin over at line 0

Shawn Milochik

Peter Otten

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads