Python-2.3b1 bugs on Windows2000 with: the new csv module, string replace, and the re module

Daniel Ortmann · Jul 1, 2003

These problems only happen on Windows. On Linux everything works fine.
Has anyone else run into these bugs? Any suggestions?

Where do I find out the proper bug reporting process?

Problem #1:

While using the csv module's DictWriter on MSDOS (a.k.a. Windows2000),
the output files get newlines like \x0d\x0d\x0a instead of \x0d\x0a.

csvwriter = csv.DictWriter( file( out1filename, 'w' ), infieldnames, extrasaction='ignore' )
csvwriter.writerow( dict( zip( infieldnames, infieldnames ) ) )

Problem #2:

While trying to fix up the first problem I ran into another problem.
The following string replace code works until right around the boundary
at 2^7 * 1024, i.e. near 131072 (around line 1224), and then inserts a
bunch of \x00's in the string!

Before the \x00's, all of the \x0d's were correctly replaced. After the
\x00's, NONE of them were replaced.

content = file( fname, 'rb' ).read().replace( '\x0d', '' )
file( fname, 'wb' ).write( content )

Problem #3:

The same problem also happens with the re module.

content = re.sub( '\x0d', '', file( fname, 'rb' ).read() )
file( fname, 'wb' ).write( content )
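
One way to narrow down Problems #2 and #3 is to check whether the \x00's ever exist in the in-memory string, or only in the file after it is written back. The following is a rough sketch, not a diagnosis: it assumes a current Python 3 interpreter (bytes objects and open() instead of file()), the names strip_cr, report and rewrite are made up for the illustration, and the sample data is synthetic and simply sized to cross the 131072-byte boundary.

```python
import re

def strip_cr(data):
    """Remove every \\x0d from a bytes object, once per method."""
    by_replace = data.replace(b"\x0d", b"")
    by_regex = re.sub(b"\x0d", b"", data)
    return by_replace, by_regex

def report(data):
    """Summarize what the two methods did to the data in memory."""
    by_replace, by_regex = strip_cr(data)
    return {
        "same_result": by_replace == by_regex,
        "cr_left": by_replace.count(b"\x0d"),
        "nul_in": data.count(b"\x00"),
        "nul_out": by_replace.count(b"\x00"),
    }

def rewrite(fname):
    """Rewrite fname without \\x0d and confirm the file matches memory."""
    # Close the read handle explicitly before reopening the same path.
    with open(fname, "rb") as src:
        data = src.read()
    cleaned, _ = strip_cr(data)
    with open(fname, "wb") as dst:
        dst.write(cleaned)
    with open(fname, "rb") as check:
        return check.read() == cleaned

# Synthetic input, larger than 2^7 * 1024 bytes.
sample = b"field1,field2\x0d\x0d\x0a" * 20000
summary = report(sample)
print(summary)
```

If "nul_out" is larger than "nul_in", the replacement itself is at fault. If the in-memory result is clean but rewrite() returns False, the problem is in writing the file back.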

Steve Holden · Jul 1, 2003

Daniel> These problems only happen on Windows. On Linux everything works fine.
Daniel> Has anyone else run into these bugs? Any suggestions?

Daniel> Where do I find out the proper bug reporting process?

http://sourceforge.net/tracker/?atid=105470&group_id=5470&func=browse

regards

Skip Montanaro · Jul 2, 2003

Daniel> Problem #1:

Daniel> While using the csv module's DictWriter on MSDOS
Daniel> (a.k.a. Windows2000), the output files get newlines like
Daniel> \x0d\x0d\x0a instead of \x0d\x0a.

Daniel> csvwriter = csv.DictWriter( file( out1filename, 'w' ), infieldnames, extrasaction='ignore' )
Daniel> csvwriter.writerow( dict( zip( infieldnames, infieldnames ) ) )

CSV files are not really plain text files. The line terminator string is an
explicit property of the file. For example, you might want to write a CSV
file on a Windows 2000 machine which you intend to read on a Mac OS9 system
(where the line terminator is just \r). You need to open CSV files with the
'b' flag. This should work for you:

csvwriter = csv.DictWriter( file( out1filename, 'wb' ), infieldnames,
                            extrasaction='ignore' )
csvwriter.writerow( dict( zip( infieldnames, infieldnames ) ) )
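
To see where the extra \x0d comes from, here is a small sketch. It assumes a current Python 3 interpreter rather than the 2.3b1 under discussion, and it uses io.TextIOWrapper with newline='\r\n' to stand in for Windows text mode, so it behaves the same on any platform. The name write_one_row is made up for the illustration. In Python 3 the counterpart of the 'b' flag for csv files is newline=''.

```python
import csv
import io

def write_one_row(newline):
    """Write one CSV row through a text layer and return the raw bytes."""
    raw = io.BytesIO()
    # newline controls how the text layer translates '\n' on output.
    text = io.TextIOWrapper(raw, encoding="ascii", newline=newline)
    writer = csv.DictWriter(text, ["a", "b"], extrasaction="ignore")
    writer.writerow({"a": 1, "b": 2, "c": 3})  # 'c' is ignored
    text.flush()
    return raw.getvalue()

# Text layer that turns '\n' into '\r\n', as Windows text mode does.
translated = write_one_row("\r\n")
# Text layer that leaves line endings alone.
untranslated = write_one_row("")

print(repr(translated))
print(repr(untranslated))
```

The csv writer already ends each row with \r\n, so a layer that expands \n to \r\n again yields \r\r\n.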

Skip

Daniel Ortmann · Jul 2, 2003

Daniel> While using the csv module's DictWriter on MSDOS
Daniel> (a.k.a. Windows2000), the output files get newlines like
Daniel> \x0d\x0d\x0a instead of \x0d\x0a.

Daniel> csvwriter = csv.DictWriter( file( out1filename, 'w' ), infieldnames, extrasaction='ignore' )
Daniel> csvwriter.writerow( dict( zip( infieldnames, infieldnames ) ) )

Skip> CSV files are not really plain text files. The line terminator
Skip> string is an explicit property of the file. For example, you
Skip> might want to write a CSV file on a Windows 2000 machine which you
Skip> intend to read on a Mac OS9 system (where the line terminator is
Skip> just \r). You need to open CSV files with the 'b' flag. This
Skip> should work for you:

Skip> csvwriter = csv.DictWriter( file( out1filename, 'wb' ), infieldnames, extrasaction='ignore' )
Skip> csvwriter.writerow( dict( zip( infieldnames, infieldnames ) ) )

OK, that is the same workaround that I used. Perhaps the documentation
should say something about using binary mode?

Or perhaps the DictWriter constructor should open the file in binary
mode if given a string rather than a file object?

How do we avoid people stumbling as I did?

Skip Montanaro · Jul 2, 2003

Daniel> Perhaps the documentation should say something about using
Daniel> binary mode?

Good point. I'll fix the docs.

Daniel> Or perhaps the DictWriter constructor should open the file in
Daniel> binary mode if given a string rather than a file object?

Nah, too much overloading going on.
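
Code that wants the convenience anyway can get it from a small wrapper of its own, without changing the csv module. This is only a sketch: dict_writer_for_path is a hypothetical helper, not part of csv, and it assumes a current Python 3 interpreter, where newline='' plays the role of the 'b' flag.

```python
import csv
from contextlib import contextmanager

@contextmanager
def dict_writer_for_path(path, fieldnames, **kwargs):
    """Open path with line-ending translation off and yield a DictWriter."""
    # newline="" stops the text layer from rewriting the writer's '\r\n'.
    with open(path, "w", newline="") as handle:
        yield csv.DictWriter(handle, fieldnames, **kwargs)
```

It would be used as "with dict_writer_for_path(out1filename, infieldnames, extrasaction='ignore') as csvwriter:", and the file is closed when the block ends.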

Skip
