csv module

Laurent Laporte · Dec 28, 2005

hello,

I'm using cvs standard module under Python 2.3 / 2.4 to write a file
delimited with tabs. I use the "excel-tab" dialect to do that.

To read my CSV file, I choose to 'sniff' with a sample data in order to
get the dialect.
The problem I meet is that I get a wrong dialect: the sniffer return an
empty string delimiter. It is probably a bug in _guess_delimiter()
method.

The message I obtain is:
TypeError: bad argument type for built-in operation

Do you know a way to sniff tab-delimited data ?
Is it a known bug ?

Bye.

Fredrik Lundh · Dec 28, 2005

Laurent said:
I'm using cvs standard module under Python 2.3 / 2.4 to write a file
delimited with tabs. I use the "excel-tab" dialect to do that.

To read my CSV file, I choose to 'sniff' with a sample data in order to
get the dialect.
The problem I meet is that I get a wrong dialect: the sniffer return an
empty string delimiter. It is probably a bug in _guess_delimiter()
method.

The message I obtain is:
TypeError: bad argument type for built-in operation

Do you know a way to sniff tab-delimited data ?
Is it a known bug ?

http://www.python.org/sf/1157169

</F>

skip · Dec 28, 2005

Laurent> To read my CSV file, I choose to 'sniff' with a sample data in
Laurent> order to get the dialect. The problem I meet is that I get a
Laurent> wrong dialect: the sniffer return an empty string delimiter. It
Laurent> is probably a bug in _guess_delimiter() method.

Laurent> The message I obtain is:
Laurent> TypeError: bad argument type for built-in operation

Laurent> Do you know a way to sniff tab-delimited data ?
Laurent> Is it a known bug ?

Using a file with the following contents:
'1\t2\tabc\n3\t4\tdef\n'

I get:
'\t'

Can you provide a concrete example (preferably in a bug report on SF)?

Skip

skip · Dec 28, 2005

me> Using a file with the following contents:

me> >>> open("tabber.csv", "rb").read()
me> '1\t2\tabc\n3\t4\tdef\n'

me> I get:

me> >>> sniffer = csv.Sniffer()
me> >>> d = sniffer.sniff(open("tabber.csv", "rb").read())
me> >>> d.delimiter
me> '\t'

BTW, this also seems to work with a Mac-style EOL:
'\t'

Perhaps this has been fixed in CVS.

Skip

Laurent Laporte · Dec 28, 2005

Sorry,

Here is my example:

Python 2.3.1 (#1, Sep 29 2003, 15:42:58)
[GCC 2.96 20000731 (Red Hat Linux 7.1 2.96-98)] on linux2
Type "help", "copyright", "credits" or "license" for more information.''

In fact, I found the pb (thanks to you): I add a newline '\r\n' to
separate the header from the records...

Laurent Laporte · Dec 28, 2005

In fact, there is another bug:

In my CVS file, all the records ends with a trailing tab '\t'
except the header because the last field is always empty.

For example, I get :''

It is done in the _guess_delimiter() method during the building of
frequency tables. A striping is done for each line (why??)
If I change:
freq = line.strip().count(char)
by:
freq = line.count(char)
It works fine.

Do you have a workaround for that?

------- Laurent.

skip · Dec 28, 2005

Laurent> If I change:
Laurent> freq = line.strip().count(char)
Laurent> by:
Laurent> freq = line.count(char)
Laurent> It works fine.

Laurent> Do you have a workaround for that?

Nope. I just checked in precisely your fix to the Python repository.

Skip

problems with CSV module	4	Jun 3, 2010
Translater + module + tkinter	1	Feb 16, 2023
Programming challenge?	4	Jul 23, 2021
comparing dialects of csv-module	3	Dec 19, 2009
csv module and None values	7	Aug 24, 2009
Number of cells, using CSV module	8	May 16, 2013
csv module strangeness.	16	Aug 30, 2006
escape character / csv module	3	Jul 1, 2010

csv module

Laurent Laporte

Fredrik Lundh

skip

skip

Laurent Laporte

Laurent Laporte

skip

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads