Is str/unicode.encode supposed to work with replace/ignore?

BerlinBrown · Jan 16, 2008

With this code, ignore/replace still generate an error:

# Encode to simple ascii format.
field.full_content = field.full_content.encode('ascii', 'replace')

Error:

[0/1] 'ascii' codec can't decode byte 0xe2 in position 14317: ordinal not in range(128)

The document in question is a Wikipedia document. I believe they use
Latin-1, Unicode, or something similar. I thought replace and ignore
were supposed to replace and ignore?

Matt Nordhoff · Jan 16, 2008

Is field.full_content a str or a unicode? You probably haven't decoded
it from a byte string yet.

Why do you want to use ASCII? UTF-8 is great.
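
To spell out what is probably happening: the error says "can't decode", not "can't encode". In Python 2, calling .encode() on a str (a byte string) first decodes it implicitly with the default ASCII codec in strict mode, and the 'replace'/'ignore' argument only applies to the encode step that follows. The sketch below decodes explicitly first. It is written so that it runs on Python 3, the sample bytes are made up, and UTF-8 as the source encoding is only an assumption (0xe2 is a typical lead byte of a three-byte UTF-8 sequence, but the real page may differ).

```python
# Hypothetical sample: raw bytes as they might arrive from a web page.
# The sequence \xe2\x80\x94 is the UTF-8 encoding of an em dash (U+2014).
raw = b"caf\xc3\xa9 \xe2\x80\x94 done"

# Step 1: decode the bytes to text. UTF-8 is an assumption about the source.
text = raw.decode("utf-8")

# Step 2: encode the text to ASCII. The error handler applies at this step.
ascii_replace = text.encode("ascii", "replace")  # non-ASCII chars become '?'
ascii_ignore = text.encode("ascii", "ignore")    # non-ASCII chars are dropped

print(ascii_replace)  # Python 3 prints: b'caf? ? done'
print(ascii_ignore)   # Python 3 prints: b'caf  done'
```

If the goal is just to store or send the text, text.encode("utf-8") keeps every character and gives back the original bytes here, so nothing needs to be replaced or ignored.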
