string conversion latin2 to ascii

Martin Landa · Nov 27, 2007

Hi all,

sorry for a newbie question. I have unicode string (or better say
latin2 encoding) containing non-ascii characters, e.g.

s = "Ukázka_možnosti_využití_programu_OpenJUMP_v_SOA"

I would like to convert this string to plain ascii (using some lookup
table for latin2)

to get

-> Ukazka_moznosti_vyuziti_programu_OpenJUMP_v_SOA

Thanks for any hits! Regards, Martin Landa

kyosohma · Nov 27, 2007

Hi all,

sorry for a newbie question. I have unicode string (or better say
latin2 encoding) containing non-ascii characters, e.g.

s = "Ukázka_možnosti_využití_programu_OpenJUMP_v_SOA"

I would like to convert this string to plain ascii (using some lookup
table for latin2)

to get

-> Ukazka_moznosti_vyuziti_programu_OpenJUMP_v_SOA

Thanks for any hits! Regards, Martin Landa

With a little googling, I found this:

http://www.peterbe.com/plog/unicode-to-ascii

You might also find this article useful:

http://www.reportlab.com/i18n/python_unicode_tutorial.html

Mike

Martin v. LÃ¶wis · Nov 27, 2007

sorry for a newbie question. I have unicode string (or better say

latin2 encoding) containing non-ascii characters, e.g.

s = "UkÃ¡zka_moÅ¾nosti_vyuÅ¾itÃ_programu_OpenJUMP_v_SOA"

That's not a Unicode string (at least in Python 2); it is
a latin-2 encoded byte string; it has nothing to do with Unicode.

I would like to convert this string to plain ascii (using some lookup
table for latin2)

to get

-> Ukazka_moznosti_vyuziti_programu_OpenJUMP_v_SOA

I recommend to use string.translate. You need a translation
table there, which is best generated with string.maketrans.

table=string.maketrans("Ã¡Å¾Ã","azi")
print s.translate(table)

HTH,
Martin

John Machin · Nov 27, 2007

With a little googling, I found this:

http://www.peterbe.com/plog/unicode-to-ascii

and if the OP has the patience to read *ALL* the comments on that blog
entry, he will find that comment[-2] points to

http://effbot.python-hosting.com/file/stuff/sandbox/text/unaccent.py

and comment[-1] (from the blog owner) is "Brilliant! Thank you."

The bottom line is that there is no universal easy solution; you need
to handcraft a translation table suited to your particular purpose
(e.g. do you want u-with-umlaut to become u or ue?). The
unicodedata.normalize function is useful for off-line preparation of a
set of candidate mappings for that table; it should not be applied
either on-line or blindly.

Cheers,
John

Jakub Wilk · Nov 28, 2007

I have unicode string (or better say latin2 encoding) containing
non-ascii characters, e.g.

s = "Ukázka_možnosti_využití_programu_OpenJUMP_v_SOA"

I would like to convert this string to plain ascii (using some lookup
table for latin2)

to get

-> Ukazka_moznosti_vyuziti_programu_OpenJUMP_v_SOA

You may try python-elinks
Ukazka_moznosti_vyuziti_programu_OpenJUMP_v_SOA

kyosohma · Nov 28, 2007

With a little googling, I found this:

Click to expand...

http://www.peterbe.com/plog/unicode-to-ascii

Click to expand...

and if the OP has the patience to read *ALL* the comments on that blog
entry, he will find that comment[-2] points to

http://effbot.python-hosting.com/file/stuff/sandbox/text/unaccent.py

and comment[-1] (from the blog owner) is "Brilliant! Thank you."

The bottom line is that there is no universal easy solution; you need
to handcraft a translation table suited to your particular purpose
(e.g. do you want u-with-umlaut to become u or ue?). The
unicodedata.normalize function is useful for off-line preparation of a
set of candidate mappings for that table; it should not be applied
either on-line or blindly.

Cheers,
John

Sorry...I didn't know about translation tables or I would have
mentioned that instead. My bad.

Mike

Looking for UNICODE to ASCII Conversioni Example Code	15	Oct 18, 2013
EBCDIC <--> ASCII	4	Dec 4, 2008
convert Unicode filenames to good-looking ASCII	3	May 6, 2010
Ascii to Unicode.	4	Jul 28, 2010
Good cross-version ASCII serialisation protocol for simple types	4	Feb 23, 2013
Automatic Type Conversion to String	6	Feb 13, 2012
Converting an Array to a String in JavaScript	7	Sep 22, 2023
extended ASCII Conversion in Java	0	Jan 2, 2013

string conversion latin2 to ascii

Martin Landa

kyosohma

Martin v. LÃ¶wis

John Machin

Jakub Wilk

kyosohma

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads