why does . match non-ascii chars?

7stud -- · Feb 23, 2009

str = "abcdÃ©f "

result = str.gsub(/./n) do |match|
puts "%%%02X" % match[0]
end
puts

--output:--
%61
%62
%63
%64
%C3
%A9
%66

Doesn't the 'n' option say to match ascii? For what it's worth, I get
the same result without the 'n' option.

Michael Fellinger · Feb 24, 2009

str =3D "abcd=C3=A9f "

result =3D str.gsub(/./n) do |match|
puts "%%%02X" % match[0]
end
puts

--output:--
%61
%62
%63
%64
%C3
%A9
%66

Doesn't the 'n' option say to match ascii? For what it's worth, I get
the same result without the 'n' option.

The default switch of a regex is actually 'n' already, that only
changes if you set $KCODE before.
It has little influence on what is matched when it comes to '.', but
it influences how the matched bytes will be grouped to resemble
characters.

sigma ~ % ruby -e 'p "abcd=C3=A9f ".scan(/./)'
["a", "b", "c", "d", "\303", "\251", "f", " "]

sigma ~ % ruby -e 'p "abcd=C3=A9f ".scan(/./u)'
["a", "b", "c", "d", "\303\251", "f", " "]

sigma ~ % ruby -Kue 'p "abcd=C3=A9f ".scan(/./u)'
["a", "b", "c", "d", "=C3=A9", "f", " "]

sigma ~ % ruby19 -e 'p "abcd=C3=A9f ".scan(/./)'
["a", "b", "c", "d", "=C3=A9", "f", " "]

Please see some excellent articles about this topic from James Edward Gray =
II:

http://blog.grayproductions.net/articles/bytes_and_characters_in_ruby_18
http://blog.grayproductions.net/categories/character_encodings

^ manveru

Command Line Arguments	0	Mar 7, 2023
Ruby 1.9 - ArgumentError: incompatible encoding regexp match(US-ASCII regexp with ISO-2022-JP string	0	Mar 31, 2008
Regex with ASCII and non-ASCII chars	5	Jan 31, 2007
String#split regex \W on non-ASCII text	1	Nov 9, 2010
hex dump w/ or w/out utf-8 chars	40	Jul 8, 2013
geting error as unxpected symbol read: ". in array initialization	0	Mar 27, 2016
VHDL Type Mismatch error indexed name returns a value whose type does not match	0	May 6, 2012
retriving escape unicode sequences from files ...	1	Aug 4, 2012

why does . match non-ascii chars?

7stud --

Michael Fellinger

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads