Trouble with the encoding of os.getcwd() in Korean Windows

Erik Bethke · Feb 9, 2005

Hello All,

I have found much help in the google archives but I am still stuck...

here is my code snippet:
path = os.getcwd()
path = path.decode('UTF8')

Now the trouble is I am getting that darn UnicodeDecodeError, where it
is tripping up on the Korean hangul for My Desktop. Now I have tried
utf8 and utf16 and neither of these works.

So is this my question?: What encoding does windows use for Korean
Windows? I thought it might be and so I surfed around
(http://foundationstone.com.au/HtmlSupport/OnlineHelp/Localisation/SupportedEncodings.html)
and there appears to be an encoding called: windows-949 labeled to be
Korean Windows, which of couse is *not* one of the encodings to be
found in the encodings package... which would suck.

But then I thought about it some more, how could you make software to
do things like read the current directory work on different language
machines??? It would be madness to have a try statement for each and
every encoding under the sun...

Why isn't there a get system encoding method?

Or am I on the entirely wrong track?

Thanks,
-Erik

Erik Bethke · Feb 9, 2005

Hello All,

Well as usual, after I post I keep on digging and I found the answer...

http://cjkpython.i18n.org/

Has the encodings for Chinese, Korean and Japanese... and I took the
hint that I found from the foundationstore and tried cp949 and wa-la!
it works...

Now, the question remains, how do I write windows python code that will
work on all flavors of windows languages? The wxPython demo works,
because I have installed it on a path on my machine that does not have
Hangul in the path. But if I distribute something to end users, they
most certainly have Hangul in their path, or Japanese or Chinese, or
some other encoding... so how do you get the correct encoding from the
system?

Thanks,
-Erik

Vincent Wehren · Feb 9, 2005

Erik said:
Hello All,

I have found much help in the google archives but I am still stuck...

here is my code snippet:
path = os.getcwd()
path = path.decode('UTF8')

Now the trouble is I am getting that darn UnicodeDecodeError, where it
is tripping up on the Korean hangul for My Desktop. Now I have tried
utf8 and utf16 and neither of these works.

So is this my question?: What encoding does windows use for Korean
Windows?

Try "mbcs". This is a built-in encoding avalaible only on Windows and
that equals the system's default ANSI codepage. Using "mbcs", which is
short for "multi-byte character set", the conversions to and from
Unicode (decode/encode) are internally handled by the corresponding
win32 api functions.

Erik Bethke · Feb 9, 2005

Thank you Vincent, I will try this...

I did get over my troubles with this new code snippet:

encoding = locale.getpreferredencoding()
htmlpath = os.getcwd()
htmlpath = htmlpath.decode( encoding )

That seems to be working well too. I can write to these files and I
can open them with the file dialog, but this is now failing with the
famous aschii error:

webbrowser.open( htmlpath, True, True )

Erik Bethke · Feb 9, 2005

Hello All,

sorry for all the posts... I am *almost* there now...

okay I have this code:

import sys, os

encoding = locale.getpreferredencoding()
htmlpath = os.getcwd()
htmlpath = htmlpath.decode( encoding )

..... write to the file .....
...... file is written fine, and can be opened by both FireFox and IE
and displays fine ...

webbrowser.open( htmlpath.encode ( encoding ), True, True )

the line above now works fine (fixed the ascii error)

but *NOW* my problem is that FirefOX pops up a message box
complaining that the file does not exist, but it certainly does, it
just doesn't like what it is called...

Any ideas now?

Thanks,
-Erik

Erik Bethke · Feb 9, 2005

Ah and PS, again this is only for paths that are non-aschii or at least
have Korean in them...

The broswer bit launches successfully in other locations.

-Erik

Erik Bethke · Feb 9, 2005

Wow, even more information. When I set my default browser to IE, it
launches fine... so it is something about FireFox being more picky than
IE...

Where would I hunt down this sort of problem? Sounds rare, should I
contact Mozilla, or can you guys spot something silly I am doing?

Thank you,
-Erik

=?ISO-8859-1?Q?Walter_D=F6rwald?= · Feb 9, 2005

Erik said:
Hello All,

sorry for all the posts... I am *almost* there now...

okay I have this code:

import sys, os

encoding = locale.getpreferredencoding()
htmlpath = os.getcwd()
htmlpath = htmlpath.decode( encoding )

You might want to try os.getcwdu() instead of this. According to
http://www.python.org/doc/2.4/lib/os-file-dir.html
this has been added in Python 2.3 and should work on Windows.

Bye,
Walter Dörwald

I'm about to get in trouble with the HTML <body></body> tags	10	Aug 12, 2023
Python Windows release and encoding	1	May 22, 2013
Opening multiple Files in Different Encoding	5	Jul 10, 2012
trouble controlling vim with subprocess on windows machine	11	Jul 9, 2007
Determining the encoding of a text file	4	Mar 1, 2004
encoding problems with pymssql / win	1	Feb 11, 2006
Reading the access attributes of directories in Windows	33	Aug 19, 2010
japanese encoding iso-2022-jp in python vs. perl	4	Oct 23, 2007

Trouble with the encoding of os.getcwd() in Korean Windows

Erik Bethke

Erik Bethke

Vincent Wehren

Erik Bethke

Erik Bethke

Erik Bethke

Erik Bethke

=?ISO-8859-1?Q?Walter_D=F6rwald?=

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads