boost::regex_search matching extra characters?

A · Aug 27, 2010

I probably missed something stupid but why is this matching extra
characters?

if (boost::regex_search("bla bla (message #12345)",
                        matches,
                        boost::regex("\\(message #([0-9]+)\\)$")))
{
    // matches[1].first contains "12345)" but regex is supposed to match only "12345"
    // matches[1].second contains ")"
}

I've tested the regex in PHP, so as far as I know it should not match those
extra characters.

I worked around this by trimming the length of matches[1].second off the
first string, but is there something wrong with the code above, or is this
simply how boost::regex works?

Thomas J. Gritzan · Aug 27, 2010

On 27.08.2010 22:21, A wrote:

I probably missed something stupid but why is this matching extra
characters?

if (boost::regex_search("bla bla (message #12345)",
                        matches,
                        boost::regex("\\(message #([0-9]+)\\)$")))
{
    // matches[1].first contains "12345)" but regex is supposed to match only "12345"
    // matches[1].second contains ")"
}

Do you know what first and second are supposed to represent here?

first points to the start of the matched sequence, and second points one
past its end.
This means that first and second form a half-open range, just like begin()
and end() of the C++ containers. Neither of them is a string on its own.

To extract the matching string, you can do:

std::string match(matches[1].first, matches[1].second);
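
To make the difference concrete, here is a minimal sketch. It assumes std::regex from the C++11 standard library rather than Boost, because its sub_match exposes the same first/second pair; with Boost.Regex the same code should work with boost:: in place of std:: and <boost/regex.hpp> included. The function names extract_message_id and tail_from_first are made up for illustration.

```cpp
#include <regex>
#include <string>

// Returns the digits captured by group 1, or an empty string if nothing matches.
std::string extract_message_id(const char* text)
{
    static const std::regex pattern("\\(message #([0-9]+)\\)$");
    std::cmatch matches;
    if (!std::regex_search(text, matches, pattern))
        return std::string();
    // [first, second) is a half-open iterator range into the original buffer.
    return std::string(matches[1].first, matches[1].second);
}

// Shows what happens when `first` alone is read as a C string:
// it runs from the start of the capture to the terminating '\0'.
std::string tail_from_first(const char* text)
{
    static const std::regex pattern("\\(message #([0-9]+)\\)$");
    std::cmatch matches;
    if (!std::regex_search(text, matches, pattern))
        return std::string();
    return std::string(matches[1].first);
}
```

For the input from the question, extract_message_id yields only the digits, while tail_from_first also picks up the closing parenthesis, which is the "extra character" seen when first is treated as a string by itself.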

A · Aug 28, 2010

first is the start of the matching sequence, second is one-past the
matching sequence.
This means that first and second are the same as begin() and end() of
all the C++ containers.

Well, I also work in PHP, where this is handled a little differently and
where matches are usually what they are supposed to be: the matched text.
So mixing two languages can sometimes get a bit confusing.

Thank you for the explanation.

Francesco S. Carta · Aug 28, 2010

Well, I also work in PHP, where this is handled a little differently and
where matches are usually what they are supposed to be: the matched text

Well, are they anything different here?

The difference resides in how they are reported, and this information
usually is where it is supposed to be - in the documentation ;-)

http://www.boost.org/doc/libs/1_44_0/libs/regex/doc/html/boost_regex/ref/regex_search.html
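
For anyone who wants the PHP-like behaviour of getting the matched text directly, a sub_match can also hand it over as a string through its str() member. The sketch below again assumes std::regex, whose sub_match offers str() just as Boost.Regex's does; capture_as_string is a hypothetical helper name.

```cpp
#include <regex>
#include <string>

// Returns capture group 1 as a std::string, or an empty string if nothing matches.
std::string capture_as_string(const std::string& text)
{
    static const std::regex pattern("\\(message #([0-9]+)\\)$");
    std::smatch matches;
    if (!std::regex_search(text, matches, pattern))
        return std::string();
    // str() copies exactly the characters in [first, second).
    return matches[1].str();
}
```

This avoids handling the iterators by hand, at the cost of one string copy per capture.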
