a simple unicode question

Gabriel Genellina · Oct 28, 2009

En Wed said:
RFC 3629:
"ISO/IEC 10646 and Unicode define several encoding forms of their
common repertoire: UTF-8, UCS-2, UTF-16, UCS-4 and UTF-32."

In other words, Unicode is "not related to any encoding" .. and yet the
UTF-8, UTF-16.. "encoding forms" are clearly "related" to Unicode.

How is that possible?

Start reading "The Absolute Minimum Every Software Developer Absolutely,
Positively Must Know About Unicode and Character Sets (No Excuses!)", by
Joel Spolsky.
http://www.joelonsoftware.com/articles/Unicode.html

Tim Arnold · Oct 28, 2009

Chris Jones said:
Chris Jones wrote:
[..]

Best part of Unicode is that there are multiple encodings, right? ;-)

Click to expand...

No, the best part about Unicode is there is no encoding!

Click to expand...

Unicode does not define any encoding;

Click to expand...

RFC 3629:

"ISO/IEC 10646 and Unicode define several encoding forms of their
common repertoire: UTF-8, UCS-2, UTF-16, UCS-4 and UTF-32."

what it defines is code-points for characters which is not related to
how characters are encoded in files or network transmission.

Click to expand...

In other words, Unicode is "not related to any encoding" .. and yet the
UTF-8, UTF-16.. "encoding forms" are clearly "related" to Unicode.

How is that possible?

CJ

When I first saw it, my first thought was that the subjectline was an
oxymoron.

--Tim Arnold

How do I display unicode value stored in a string variable using ord()	133	Aug 16, 2012
Unicode Question	4	Jan 10, 2006
Benchmarking stripping of Unicode characters which are invalid XML	0	Mar 18, 2012
API for custom Unicode error handlers	5	Oct 4, 2013
Convert unicode escape sequences to unicode in a file	1	Jan 11, 2011
MySQLdb not playing nice with unicode	1	Mar 30, 2013
python3 Unicode is slow	1	Oct 25, 2009
Right solution to unicode error?	21	Nov 7, 2012

a simple unicode question

Gabriel Genellina

Tim Arnold

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads