File compare

PyPK · Oct 12, 2005

I have two files
file1 in format
<id> <val1> <test1> <test2>
'AA' 1 T T
'AB' 1 T F

file2 same as file1
<id> <val1> <test1> <test2>
'AA' 1 T T
'AB' 1 T T

Also the compare should be based on id. So it should look for line
starting with id 'AA' (for example) and then match the line so if in
second case.

so this is what I am looking for:
1. read both files.
2. read id of first line in file1 check if it matches with the same id
in file2.
3. repeat step 2 for all lines in file1.
4. return a percent of success to failure. ie if one line matches and
one lines does'nt then return 0.5 or 50%

I wrote a boolean version ..as a start

def getdata(f):
try:
f1 = open(f,'r')
data=[]
for eachline in f1.readlines():
data.append(re.split("",
re.sub('\n','',strip(re.split('\s\s+',eachline)[0]))))
return data
except IOError:
raise("Invalid File Input")

if __name__=='__main__':

data1 = getdata('file1')
data2 = getdata('file2')

if data1 == data2:
print "True"
else:
print "False"

hope I am clear...

PyPK · Oct 12, 2005

Note that the code i wrote wont do the compare based on id which i am
looking for..it just does a direct file to file compare..

Larry Bates · Oct 13, 2005

Sounds a little like "homework", but I'll help you out.
There are lots of ways, but this works.

import sys
class fobject:
def __init__(self, inputfilename):
try:
fp=open(inputfilename, 'r')
self.lines=fp.readlines()
except IOError:
print "Unable to open and read inputfilename=%s" % inputfilename
sys.exit(3)

self.datadict={}
for line in self.lines:
line=line.strip()
line=line.strip("'")
key, values=line.split(' ',1)
self.datadict[key]=values

return

def keys(self):
return self.datadict.keys()

def compare(self, otherobject):
keys=otherobject.keys()
match=0
for key in keys:
if self.datadict[key] == otherobject.datadict[key]: match+=1

return float(match)/float(len(keys))

if __name__=="__main__":
f1=fobject(r'f:\syscon\python\zbkup\f1.txt')
f2=fobject(r'f:\syscon\python\zbkup\f2.txt')
print f1.compare(f2)

Larry Bates

PyPK · Oct 13, 2005

Not for homework. But anyway thanks much...

Magnus Lycka · Oct 14, 2005

PyPK said:
I have two files
file1 in format
<id> <val1> <test1> <test2>
'AA' 1 T T
'AB' 1 T F

file2 same as file1
<id> <val1> <test1> <test2>
'AA' 1 T T
'AB' 1 T T

Also the compare should be based on id. So it should look for line
starting with id 'AA' (for example) and then match the line so if in
second case.

See the recent thread with subject line "List performance and CSV".

PyPK · Oct 14, 2005

but what if
case 1:
no.of keys in f1 > f2 and
case2:
no.of keys in f1 < f2.
Should'nt we get 1.1 if case 1 and 0.9 if case 2?? it errors of with a
keyerror.?

To compare the content in two files..	4	Nov 17, 2010
Compare Files and Cat File Difference Question	0	Oct 21, 2008
Possibly useful perl script to filter lines in one file out of another.	23	Aug 23, 2009
trim the last blank-line and compare files	6	Mar 2, 2010
Python point location of intersect between two lines	0	Feb 28, 2018
problems using fgets() and sscanf() while modifying file contents	24	Jul 15, 2007
to pass self or not to pass self	19	Mar 15, 2010
itertools, functools, file enhancement ideas	8	Apr 7, 2007

File compare

PyPK

PyPK

Larry Bates

PyPK

Magnus Lycka

PyPK

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads