FTP Offset larger than file.

Bakes · Jul 28, 2009

I am writing a python script that performs an identical function to
the 'tail' unix utility, except that it connects to its files over
FTP, rather than the local hard disk.

I am currently using this python script to generate an increasing
'logfile' of garbage.

import time
for i in range(1, 20000):
time.sleep(0.2)
print i
f = open("data1.log","a")
f.write('%s: This logfile is being automatically generated to help Bakes test his python ftptail. \n' % i)
f.close()

and use this script to actually download it.

import time
import os.path
from ftplib import FTP

#Empty the file
filename = 'data1.log'
file = open(filename, 'w')
file.write('')
file.close()

def handleDownload(block):
file.write(block)
print ".",

# Create an instance of the FTP object
# Optionally, you could specify username and password:
ftp=FTP(host, user, pass)

directory = '/temp'
ftp.cwd(directory)

file = open(filename, 'a')

for i in range(1,20000):
size=os.path.getsize('data1.log')
ftp.retrbinary('RETR ' + filename, handleDownload, rest=size)

file.close()

print ftp.close()

Now, my problem is that I get a very strange error. What should be
happening is the script gets the size of the local file before
downloading all of the external file after that offset.

The error I get is:
ftplib.error_temp: 451-Restart offset 24576 is too large for file size
22852.
451 Restart offset reset to 0
which tells me that the local file is larger than the external file,
by about a kilobyte. Certainly, the local file is indeed that size, so
my local script is doing the right things. I do wonder what is going
wrong, can anyone enlighten me?

Hrvoje Niksic · Jul 28, 2009

Bakes said:
The error I get is:
ftplib.error_temp: 451-Restart offset 24576 is too large for file size
22852.
451 Restart offset reset to 0
which tells me that the local file is larger than the external file,
by about a kilobyte. Certainly, the local file is indeed that size, so
my local script is doing the right things. I do wonder what is going
wrong, can anyone enlighten me?

I'd say you failed to take buffering into account. You write into a
buffered file, yet you use os.path.getsize() to find out the current
file size. If the data is not yet flushed, you keep re-reading the same
stuff from the remote file, and writing it out. Once the buffer is
flushed, your file will contain more data than was retrieved from the
remote side, and eventually this will result in the error you see.

As a quick fix, you can add a file.flush() line after the
file.write(...) line, and the problem should go away.

Bakes · Jul 28, 2009

I'd say you failed to take buffering into account. You write into a
buffered file, yet you use os.path.getsize() to find out the current
file size. If the data is not yet flushed, you keep re-reading the same
stuff from the remote file, and writing it out. Once the buffer is
flushed, your file will contain more data than was retrieved from the
remote side, and eventually this will result in the error you see.

As a quick fix, you can add a file.flush() line after the
file.write(...) line, and the problem should go away.

Thank you very much, that worked perfectly.

Bakes · Jul 28, 2009

Thank you very much, that worked perfectly.

Actually, no it didn't. That fix works seamlessly in Linux, but gave
the same error in a Windows environment. Is that expected?

Hrvoje Niksic · Jul 28, 2009

Bakes said:
Actually, no it didn't. That fix works seamlessly in Linux, but gave
the same error in a Windows environment. Is that expected?

Consider opening the file in binary mode, by passing the 'wb' and 'ab'
modes to open instead of 'w' and 'a' respectively. On Windows, python
(and other languages) will convert '\n' to '\r\n' on write.

Dave Angel · Jul 28, 2009

Bakes said:
Actually, no it didn't. That fix works seamlessly in Linux, but gave
the same error in a Windows environment. Is that expected?

This is a text file you're transferring. And you didn't specify "wb".
So the Windows size will be larger than the Unix size, since you're
expanding the newline characters.

getsize() is looking at the size after newlines are expanded to 0d0a,
while The remote file, presumably a Unix system likely has just has 0a.

I think you'd do best just keeping track of the bytes you've written.

DaveA

Bakes · Jul 28, 2009

This is a text file you're transferring. And you didn't specify "wb".
So the Windows size will be larger than the Unix size, since you're
expanding the newline characters.

getsize() is looking at the size after newlines are expanded to 0d0a,
while The remote file, presumably a Unix system likely has just has 0a.

I think you'd do best just keeping track of the bytes you've written.

DaveA

Thank you very much, that worked perfectly.

GIS Shape file upload to FTP server	0	Mar 1, 2007
Logwatch python	1	Feb 9, 2013
FTP Windows AS/400	1	Sep 13, 2005
FTP Error: Windows AS/400	1	Sep 13, 2005
ftp.storlines error	3	Jan 31, 2010
Delete files from FTP Server older then 7 days. Using ftputil andftplib.	2	May 19, 2010
i have error then use ftplib	1	Mar 30, 2006
ftp	7	Dec 15, 2004

FTP Offset larger than file.

Bakes

Hrvoje Niksic

Bakes

Bakes

Hrvoje Niksic

Dave Angel

Bakes

Ask a Question

Similar Threads

Staff online

Members online

Forum statistics

Latest Threads