Anything like 'inspect.getsourceencoding()'?

Jeff Epler · Oct 17, 2004

Is there a convenient way to find the encoding of a source file? I
thought maybe this would be in the inspect module, but I didn't see it
there. Just as nice would be a way to get the file as a unicode string,
I suppose.

(This is related to another thread I've recently posted to, where
another user was having trouble with pydoc's links to source files using
the file: protocol. I suggested having pydoc serve the source files,
and provided a patch, but it's crossed my mind that it would be nice to
tell the browser the encoding of that file.

Jeff

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)

iD8DBQFBcdWJJd01MZaTXX0RAnyUAJ9dN2Shq85BpM/LH6+QinvL3OD40gCfW7Hb
1zvj6WpqwJbtTk1bA1+4+nM=
=K3uY
-----END PGP SIGNATURE-----

Maciej Dziardziel · Oct 17, 2004

Jeff said:
Is there a convenient way to find the encoding of a source file? I
thought maybe this would be in the inspect module, but I didn't see it
there. Just as nice would be a way to get the file as a unicode string,
I suppose.

(This is related to another thread I've recently posted to, where
another user was having trouble with pydoc's links to source files using
the file: protocol. I suggested having pydoc serve the source files,
and provided a patch, but it's crossed my mind that it would be nice to
tell the browser the encoding of that file.

Jeff

According to Python documentation:

It is possible to use encodings different than ASCII in Python source files.
The best way to do it is to put one more special comment line right after
the #! line to define the source file encoding:

# -*- coding: iso-8859-1 -*-

=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?= · Oct 17, 2004

Jeff said:
Is there a convenient way to find the encoding of a source file? I
thought maybe this would be in the inspect module, but I didn't see it
there. Just as nice would be a way to get the file as a unicode string,
I suppose.

No. inspect operates on the byte-code/internal representation level, and
at that level, there is no notion of source encoding. The source
encoding information gets lost during compilation (as it is no longer
needed).

Somebody proposed preserving it in __[en]coding__, but that hasn't been
implemented.

Regards,
Martin

Parsing a graph image	13	May 13, 2011
Resetting state of http.client/httplib HTTPSConnection objects	0	Aug 26, 2013
[RELEASED] Python 3.2.5 and Python 3.3.2	0	May 16, 2013
Flatten an email Message with a non-ASCII body using 8bit CTE	0	Jan 24, 2013
Diagnose a segfault in ipython/readline	0	Mar 6, 2014
Fix and improve a UDF File System Driver	0	Aug 20, 2023
queues? inotify? anything else?	5	Jun 15, 2011
Help with threading.local use in python-memcache module.	0	Dec 17, 2010

Anything like 'inspect.getsourceencoding()'?

Jeff Epler

Maciej Dziardziel

=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads