CSV readers and UTF-8 files

mk · Feb 19, 2009

Hello everyone,

Is it just me, or do csv.reader/DictReader not work correctly with
UTF-8 files in Python 2.6.1 (Windows)?

That is, when I pass a plain file object for a UTF-8 file to a csv
reader, I get the fields back as plain strings ('str'). Since the text
has been mangled, I can't get the non-ASCII characters back.

When I do:

csvfo = codecs.open(csvfname, 'rb', 'utf-8')
dl = csv.excel
dl.delimiter=';'
#rd = csv.DictReader(csvfo, dialect=dl)
rd = csv.reader(csvfo, dialect=dl)

...I get plain strings as well (I get <type 'str'> when calling
type(field)), on top of this error:

Traceback (most recent call last):
  File "C:/Python26/converter3.py", line 99, in <module>
    fill_sqla(session,columnlist,rd)
  File "C:/Python26/converter3.py", line 73, in fill_sqla
    for row in rd:
UnicodeEncodeError: 'ascii' codec can't encode character u'\u0144' in position 74: ordinal not in range(128)

...when doing:

for row in rd:
....

Regards,
mk

Falcolas · Feb 19, 2009

> Hello everyone,
>
> Is it just me, or do csv.reader/DictReader not work correctly with
> UTF-8 files in Python 2.6.1 (Windows)?

I would point out that the CSV module documentation
(http://docs.python.org/library/csv.html) explicitly mentions that it
can't handle unicode.
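
A minimal sketch of what appears to be going on in the traceback above, assuming the 2.x csv module converts each unicode line it receives to a byte string with the default ASCII codec. The sample value in `line` is made up for illustration; only the character u'\u0144' comes from the traceback.

```python
# Hypothetical sample line; u'\u0144' is the character named in the traceback.
line = u'Gda\u0144sk;PL'

try:
    # Assumption: this mirrors the implicit conversion applied to each line
    # when a unicode-producing file object is handed to the 2.x csv reader.
    line.encode('ascii')
    message = None
except UnicodeEncodeError as exc:
    message = str(exc)

print(message)
```

If that assumption holds, wrapping the file with codecs.open() cannot help by itself, because the reader still wants byte strings.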

You can use their workaround in the examples section for UTF-8, or
with another form of encoding (I used MIME) for UTF-16.
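
A rough sketch of the general idea for UTF-8, not a copy of the code in the docs: let the csv module parse the raw bytes, then decode each field afterwards. The helper name `read_utf8_csv` and its parameters are hypothetical. The version check is only there so the same sketch also runs on 3.x, where the file is opened in text mode instead.

```python
import csv
import io
import sys


def read_utf8_csv(path, delimiter=';', encoding='utf-8'):
    """Yield rows of unicode strings from an encoded CSV file (hypothetical helper)."""
    if sys.version_info[0] >= 3:
        # 3.x: open in text mode and let the file object do the decoding.
        with io.open(path, 'r', encoding=encoding, newline='') as f:
            for row in csv.reader(f, delimiter=delimiter):
                yield row
    else:
        # 2.x: feed raw bytes to the parser, then decode every field.
        with open(path, 'rb') as f:
            for row in csv.reader(f, delimiter=delimiter):
                yield [field.decode(encoding) for field in row]


# Usage (the file name is a placeholder):
# for row in read_utf8_csv('data.csv'):
#     print(row)
```

Passing delimiter=';' to the reader also avoids assigning to csv.excel.delimiter, which changes the shared dialect class for every other user of it. Decoding after parsing is only safe for encodings like UTF-8, where the delimiter and quote bytes cannot appear inside a multi-byte character.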

~G

Chris Rebert · Feb 20, 2009

> I would point out that the CSV module documentation
> (http://docs.python.org/library/csv.html) explicitly mentions that it
> can't handle unicode.
>
> You can use their workaround in the examples section for UTF-8, or
> with another form of encoding (I used MIME) for UTF-16.
>
> ~G

This really ought to be fixed for 3.0+ (seems to still be ASCII-only
according to the 3.0 docs...)

Cheers,
Chris
