double-byte character

tony wong · Oct 2, 2005

is it possible to detect any double-byte character in the text? thanks.

tony

Martin Honnen · Oct 2, 2005

tony said:
is it possible to detect any double-byte character in the text? thanks.

Since JavaScript 1.3 (in Netscape 4.06) and JScript 4 (in IE 4) the
strings in JavaScript are sequences of Unicode characters, you can
access any character in a string with
string.charAt(index)
and the Unicode character code of any character in a string with
string.charCodeAt(index)
There is no byte type in JavaScript 1.x and there is no access to the
internal byte representation of an Unicode character or a complete string.
The internal string representation choosen is usually UTF-16 so in that
sense all characters are double byte characters. But as said, as a
scripter you deal with sequences of Unicode characters and the internal
encoding in bytes does not matter for scripting.

Stephen Chalmers · Oct 2, 2005

tony wong said:
is it possible to detect any double-byte character in the text? thanks.

If you mean to detect the presence of any character whose hi-byte is non-zero:

if( /[\u0100-\uffff]/.test( text ) )
...

Thomas 'PointedEars' Lahn · Oct 16, 2005

Martin said:
The internal string representation choosen is usually UTF-16 so
in that sense all characters are double byte characters.

No, they are not. I thought a similar thing before (about UTF-8),
but this is not how UTF works. Additional code units (surrogate
pairs) are used if needed for a character, i.e. all Unicode
characters beyond code point 0xFFFF are represented in UTF-16/UCS2
by two 16-bit words or four bytes each.

<http://www.unicode.org/faq/basic_q.html#19>
<http://en.wikipedia.org/wiki/UTF-16/UCS-2>

PointedEars

Cannot convert (double) to (double*)	1	Sep 5, 2022
[C language] Issue in the Lotka-Volterra model.	0	Jun 28, 2023
change variable in javascript	2	Oct 22, 2006
C basic query	1	Aug 1, 2022
Double $.each statement	2	Jul 9, 2023
Ordenate and remove duplicate cases of an array passed as a c function argument	0	Sep 27, 2022
Why do double and single double quotes work in this line of code?	6	Sep 4, 2022
Struggling with the Automatic movement for my mobile game character	0	May 14, 2022

double-byte character

tony wong

Martin Honnen

Stephen Chalmers

Thomas 'PointedEars' Lahn

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads