Regexp - start and end of line or string

Colin Bartlett · Jan 16, 2011

How often do people use \A and \z (match start and end of a string)
instead of ^ and $ (match start and end of a line within a string)?

This question is prompted by:

(1) part of a post in the "What are your ruby rough cuts ?" thread:
* Regexp ^ and $ work match more than just start and end of string.
For example, /^abc$/ does not match only "abc" but also "rm -rf /*\nabc"

Comment: using \A and \z seems to avoid the unwanted(?) matches.
Am I missing something?

(2) a Regexp used in the Find module
if File::ALT_SEPARATOR and file =~ /^(?:[\/\\]|[A-Za-z]:[\/\\]?)$/ then
which matches "start of a line" + X + "end of a line",
where X is one of / \\ C: C:/ C:\\

Comment: using ^ and $ in Find will match in the (admittedly rather
unlikely) situation of a file path string containing "\n/\n", and I'm
wondering why ^ and $ are used in this Find module regexp instead of
using \A and \z.

Justin Collins · Jan 16, 2011

How often do people use \A and \z (match start and end of a string)
instead of ^ and $ (match start and end of a line within a string)?

This question is prompted by:

(1) part of a post in the "What are your ruby rough cuts ?" thread:
* Regexp ^ and $ work match more than just start and end of string.
For example, /^abc$/ does not match only "abc" but also "rm -rf /*\nabc"

Comment: using \A and \z seems to avoid the unwanted(?) matches.
Am I missing something?

Most of the time I am going through data a line at a time, so this is
not a concern.

However, it seems to me that many people are not aware of this
distinction, thus we have things like this:
http://guides.rubyonrails.org/security.html#regular-expressions

(2) a Regexp used in the Find module
if File::ALT_SEPARATOR and file =~ /^(?:[\/\\]|[A-Za-z]:[\/\\]?)$/ then
which matches "start of a line" + X + "end of a line",
where X is one of / \\ C: C:/ C:\\

Comment: using ^ and $ in Find will match in the (admittedly rather
unlikely) situation of a file path string containing "\n/\n", and I'm
wondering why ^ and $ are used in this Find module regexp instead of
using \A and \z.

I suppose this is a bug.

-Justin

Re[rough cuts]: regexp	1	Jan 15, 2011
regexp matching end of line or comma	1	Nov 25, 2010
end-of-line conventions	16	Aug 13, 2009
Multi-line replace with string, not regexp	11	Aug 8, 2010
Bug with end of string characters in regex?	5	Jan 13, 2010
String extraction using RegExp	2	Jun 9, 2008
Search String and Delete Line Containing String	3	Feb 1, 2011
regexp problem	4	Nov 26, 2008

Regexp - start and end of line or string

Colin Bartlett

Justin Collins

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads