Reading just a few lines from a text file

tkpmep

I have a text file with many hundreds of lines of data. The data of
interest to me, however, resides at the bottom of the file, in the last
20 lines. Right now, I read the entire file and discard the stuff I
don't need. I'd like to speed up my program by reading only the last 20
lines. How do I do this?

Thomas Philips
 
rafi

If you are on a Unix system:

tail -20 file.txt
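
In recent Python (3.7+) you can also shell out to tail with the subprocess
module; a minimal sketch, assuming a Unix-like system with tail on the PATH:

import subprocess

def tail_lines(path, n=20):
    # Ask the external tail utility for the last n lines of the file.
    result = subprocess.run(
        ["tail", "-n", str(n), path],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.splitlines()

tail does the seeking itself, so only the end of the file is actually read.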
 
draghuram

I just did strace on "tail -20 <filename>". Apparently, it seeks to the
end and reads enough data to cover 20 lines. I guess it calculates this
"size" by counting 20 newlines. You may try to do the same thing.

Thanks,
Raghu.
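
A rough sketch of that idea in Python: seek to near the end, read one
block, and keep the last 20 lines. The 4096-byte block size is a guess,
so there is a fallback for the case where the last 20 lines don't fit in it:

import os

def last_lines(path, n=20, guess=4096):
    with open(path, "rb") as f:
        f.seek(0, os.SEEK_END)
        size = f.tell()
        # Jump back (at most) `guess` bytes from the end and read from there.
        f.seek(max(0, size - guess))
        lines = f.read().splitlines()
    if len(lines) <= n and size > guess:
        # The guess was too small; give up and read the whole file.
        with open(path, "rb") as f:
            lines = f.read().splitlines()
    return [line.decode() for line in lines[-n:]]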
 
Paul McGuire

Are you sure this is really slowing down your program? "Many hundreds
of lines" is not nearly enough to start Python breathing hard. I have
been really impressed with just how quickly Python is able to do file
input and processing, zipping through whole megs of data in just
seconds.

How are you currently reading the file in? A character at a time?
That *will* be slow. Try file.readlines(), or xreadlines(), and spin
off the last 20 in the list.

-- Paul
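
A related idiom, not necessarily what Paul means by "spin off the last
20", is to iterate the file once and let a bounded deque keep only the
most recent 20 lines, so memory stays small even for large files:

from collections import deque

def last_20_lines(path):
    with open(path) as f:
        # A deque with maxlen=20 discards older lines as new ones arrive.
        return list(deque(f, maxlen=20))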
 
tkpmep

Right now my code reads as follows:

infile = file(FileName)
for line in reversed(infile.readlines()):   # search from the bottom up
    if int(line.split()[0]) == MyDate:
        Data = float(line.split()[-1])
        break
infile.close()

I have to read about 10,000 files like this one, so I'm looking to speed
up each individual open/read/close cycle.

Thomas Philips
 
Diez B. Roggisch

> Right now my code reads as follows:
>
> infile=file(FileName)
> for line in reversed(infile.readlines()): #Search from the bottom up

Not sure if Python does some tricks here, but to me that seems to be
unnecessary shuffling around of data. Better do

for line in reversed(infile.readlines()[-20:]):
    ...


Diez
 
John Machin

What discernible speed increase are you talking about? How long does it
take to read the "many hundreds" of lines?

For "many hundreds", IMHO it's not worth the bother, the complexity, the
documentation, ...

Just do this, it's about as fast as you'll get in pure Python for a
smallish file:

def last_n_lines(filename, n):
    return open(filename, 'r').readlines()[-n:]

For many hundreds of thousands of lines, one approach might be to open
the file in binary mode, seek to the end of the file, then loop reading
chunks backwards and unpacking the chunks until you've found 21 line
terminators. Or perhaps 20 line separators :)
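
A sketch of that backwards-reading approach, assuming Unix newline
terminators and an arbitrary 1024-byte chunk size:

import os

def tail_by_seeking(path, n=20, chunk=1024):
    with open(path, "rb") as f:
        f.seek(0, os.SEEK_END)
        pos = f.tell()
        data = b""
        # Read fixed-size chunks backwards until n+1 newlines have been
        # seen (or the whole file has been consumed).
        while pos > 0 and data.count(b"\n") <= n:
            step = min(chunk, pos)
            pos -= step
            f.seek(pos)
            data = f.read(step) + data
    # The earliest line read may be partial; keeping only the last n drops it.
    return [line.decode() for line in data.splitlines()[-n:]]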
 
