Charset (hopefully for the last time I ask)

Gandalf · Jun 12, 2008

now I understand my problem better so their is a good chance you
manage to help me.

I have a SQlite database full with ANSI Hebrew text , and program that
uses WXpython
Now, I use a- 'wx.TextCtrl' item to receive input from the user, and
when I try to search the database he don't understand this chars.

it's quite reasonable consider the fact the program set to work on
UTF-8 charset, except for:

1. it doesn't work when I delete the charset too

2. when I try to use function like decode and encode it output error
like this:
ascii' codec can't encode characters in position 0-4: ordinal not in
range(128)
ascii' codec can't encode characters in position 0-2: ordinal not in
range(128)

3. I don't know how to translate my DB from ANSI to UTF-8

4. when I don't use the user WX items input I can change my editor
charset to ansi and it works fine

Thank you all

MRAB · Jun 13, 2008

now I understand my problem better so their is a good chance you
manage to help me.

I have a SQlite database full with ANSI Hebrew text , and program that
uses WXpython
Now, I use a- 'wx.TextCtrl' item to receive input from the user, and
when I try to search the database he don't understand this chars.

it's quite reasonable consider the fact the program set to work on
UTF-8 charset, except for:

1. it doesn't work when I delete the charset too

2. when I try to use function like decode and encode it output error
like this:
ascii' codec can't encode characters in position 0-4: ordinal not in
range(128)
ascii' codec can't encode characters in position 0-2: ordinal not in
range(128)

3. I don't know how to translate my DB from ANSI to UTF-8

4. when I don't use the user WX items input I can change my editor
charset to ansi and it works fine

Thank you all

Have you tried something like:

unicode_text = text_from_db.decode("cp1255")
print unicode_text
utf8_text = unicode_text.encode("utf8")
print utf8_text

(I believe the codepage 1255 is Hebrew.)

Gandalf · Jun 13, 2008

Yes, it is 1255 it's surprising you know that.

any way this is the code I tried

search=cnrl.GetValue()
search= search.decode("cp1255")
search=search.encode("utf8")
word=''
category=1
cur.execute('select * from hebrew_words where word like ?',
[''+search+''])

this is the error it send me :

'ascii' codec can't encode characters in position 0-1: ordinal not in
range(128)

have any idea?

Thank you for trying any way. it worms my Jewish art

Gandalf · Jun 13, 2008

OK it did worked!

I just should have been encoding to cp1255

search=cnrl.GetValue()
search= search.encode("cp1255")
cur.execute('select * from hebrew_words where word like ?',
['%'+search+'%'])

Thank you!

you are the best

Help for my project in the last minute	0	Apr 23, 2022
email with a non-ascii charset in Python3 ?	3	Aug 15, 2012
How do I encode and decode this data to write to a file?	11	Apr 29, 2013
How can I calculate the last payment for Reprofiled Amount column with 2 decimal places to make the sum of all payments to be the same as RC amount?	2	Jul 13, 2023
[email protected]	0	Jan 14, 2014
CSS: How can I stop overflow on the y-axis?	2	Dec 24, 2022
the stupid encoding problem to stdout	16	Jun 9, 2011
UnicodeEncodeError when piping stdout, but not when printingdirectly to the console	4	Jan 4, 2012

Charset (hopefully for the last time I ask)

Gandalf

MRAB

Gandalf

Gandalf

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads