simple question about Ruby Regext

P

Peter Bailey

Hi,
I've been learning RUBY the past 7 months or so, and, now, my assistant
is doing the same. In her perusal of the "Programming RUBY" book, first
edition, she's come across a simple, simple regex truism that throws
her, and throws me, too!

Why is this true?

"banana" =~ /an*/
=>1

This is driving me nuts. Why isn't the RUBY response "=>2?" There are
two "an" stubs in "banana."

I thought that RUBY, like PERL, is inherently greedy and it would find
all instances of said regex expression.

Thanks a lot!
Peter
 
A

Alex Young

Peter said:
Hi,
I've been learning RUBY the past 7 months or so, and, now, my assistant
is doing the same. In her perusal of the "Programming RUBY" book, first
edition, she's come across a simple, simple regex truism that throws
her, and throws me, too!

Why is this true?

"banana" =~ /an*/
=>1

This is driving me nuts. Why isn't the RUBY response "=>2?" There are
two "an" stubs in "banana."
The number returned is the position of the start of match, not the
number of them.
I thought that RUBY, like PERL, is inherently greedy and it would find
all instances of said regex expression.
It is... there's only one match, and it matches everything from the
first 'a' to the end of the string.

Hope this makes sense,
 
J

James Edward Gray II

I thought that RUBY, like PERL, is inherently greedy and it would find
all instances of said regex expression.

I see you already have your answer, so I will just add that those
languages are spelled Ruby and Perl. Welcome to Ruby!

James Edward Gray II
 
P

Peter Bailey

Alex said:
The number returned is the position of the start of match, not the
number of them.

It is... there's only one match, and it matches everything from the
first 'a' to the end of the string.

Hope this makes sense,

Thanks, Gentlemen. I'll watch my spelling of Ruby and Perl from now on.

I understand now that my match is only looking for the position. Cool.
Thanks. But, . . ., here's a similar regex where I don't want the
position, but I want to change all instances of the stub, and, it's only
changing the first one.

"banana".sub(/an/, "ze")
=> "bzeana"

What's with that?

-Peter
 
R

Robert Klemme

The number returned is the position of the start of match, not the
number of them.

It is... there's only one match, and it matches everything from the
first 'a' to the end of the string.

No. It's just matching "an":

irb(main):001:0> "banana"[/an*/]
=> "an"

You were right if the regexp had a dot:

irb(main):002:0> "banana"[/an.*/]
=> "anana"

robert
 
R

Robert Klemme

Thanks, Gentlemen. I'll watch my spelling of Ruby and Perl from now on.

I understand now that my match is only looking for the position. Cool.
Thanks. But, . . ., here's a similar regex where I don't want the
position, but I want to change all instances of the stub, and, it's only
changing the first one.

"banana".sub(/an/, "ze")
=> "bzeana"

What's with that?

ri String#sub
ri String#gsub

And, for completeness reasons

ri String#sub!
ri String#gsub!

Kind regards

robert
 
A

Alex Young

Peter said:
Thanks, Gentlemen. I'll watch my spelling of Ruby and Perl from now on.

I understand now that my match is only looking for the position. Cool.
Thanks. But, . . ., here's a similar regex where I don't want the
position, but I want to change all instances of the stub, and, it's only
changing the first one.

"banana".sub(/an/, "ze")
=> "bzeana"
That's explicitly what sub is for - only changing the first one. You
want gsub.
 
A

Alex Young

Robert said:
The number returned is the position of the start of match, not the
number of them.

It is... there's only one match, and it matches everything from the
first 'a' to the end of the string.

No. It's just matching "an":

irb(main):001:0> "banana"[/an*/]
=> "an"

You were right if the regexp had a dot:

irb(main):002:0> "banana"[/an.*/]
=> "anana"
Oops :) Sorry for any confusion. Not enough blood in my caffeine
system, obviously :)
 
W

Wolfgang Nádasi-Donner

Peter said:
Why is this true?

"banana" =~ /an*/
=>1
As already said, it is the position of the match.
I thought that RUBY, like PERL, is inherently greedy and it would find
all instances of said regex expression.

If you want to find all matches, you should use "scan" in Ruby. There are two
possibilities. the first one...

str = 'banana'
str.scan(/an/) # => ["an", "an"]

....will produce an Array object for each match (see "ri String#scan" for
details). The second one will invoke a block for each match.

str = 'banana'
str.scan(/an/) do
puts "#$`<#$&>#$'"
end

produces:

b<an>ana
ban<an>a

Wolfgang Nádasi-Donner
 
P

Peter Bailey

Alex said:
That's explicitly what sub is for - only changing the first one. You
want gsub.

Got it. Thanks, guys. Yes, I should've used gsub, of course. So, I'm not
going nuts.
 
P

Peter Bailey

Thanks, Wolfgang. So, you suggest the use of "scan" instead of "gsub?"
That would imply the need for a block, which, seems kind of wordy, but,
it does have power. . . . Thanks again.
 
A

Andy Lester

This is indeed the most generous of forums. I would've been flamed
long
ago in the Perl world.

I suggest that it's not a feature of Perl per se, but of the larger
number of people in Perl. Given a large enough group, you'll always
have a percentage of people who are jerks. I've already run into
people who've treated me poorly because I was new and unfamiliar to
them.

xoxo,
Andy
 
R

Robert Klemme

This is indeed the most generous of forums. I would've been flamed long
ago in the Perl world.

Actually we are so kind that we will even arrange for flaming if you
miss it.[1] Just place a hint in your postings. :)))

Kind regards

robert


[1] Though I cannot guarantee that it will be a really bad flame -
people around here might be a bit out of practice. ;-)
 
A

ara.t.howard

I suggest that it's not a feature of Perl per se, but of the larger number
of people in Perl. Given a large enough group, you'll always have a
percentage of people who are jerks. I've already run into people who've
treated me poorly because I was new and unfamiliar to them.

in fairness to perl i'd say this is probably true. on the otherhand i expect
that the ruby community will continue to be kind as a result of the current
community sense of politeness and our desire for a comfortable place to get
ruby help fast and with no bitter after taste. in summary i think it's up to
us to make sure this community doesn't follow the path of many large
communities: we don't have to accept that large means rude and anonymous.

kind regards.

-a
 
P

Peter Bailey

Yeh, I didn't mean to denegrate Perl in any way. I worked with it just
fine for a few years. It probably does just have to do with the size of
the community, but, most of all, with my own greenness. So, kudos to all
jeweled programming languages. ( -:

-Peter
 
J

James Edward Gray II

Thanks, Wolfgang. So, you suggest the use of "scan" instead of "gsub?"
That would imply the need for a block, which, seems kind of wordy,
but,
it does have power. . . . Thanks again.

gsub/gsub! also take a block.

Which one you use depends mainly on your purpose. Here's how I view
the methods, in terms of a "Find and Replace" feature set common to
so many programs:

sub/sub! == Find
gsub/gsub! == Find and Replace
scan == Find All

Hope that helps.

James Edward Gray II
 
P

Peter Bailey

James said:
gsub/gsub! also take a block.

Which one you use depends mainly on your purpose. Here's how I view
the methods, in terms of a "Find and Replace" feature set common to
so many programs:

sub/sub! == Find
gsub/gsub! == Find and Replace
scan == Find All

Hope that helps.

James Edward Gray II

That does help, James. Thanks. But, don't "sub" and "sub!" do a
replacement; so, they're doing more than just finding? At least, they're
finding and replacing the first instance of whatever, just not globally?
 
J

James Edward Gray II

That does help, James. Thanks. But, don't "sub" and "sub!" do a
replacement; so, they're doing more than just finding? At least,
they're
finding and replacing the first instance of whatever, just not
globally?

Ick. Sorry. In my defense I only got four hours of sleep last night.

Let me try my chart one more time:

sub/sub! == Find and Replace
gsub/gsub! == Replace All
scan == Find All

James Edward Gray II
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,431
Messages
2,571,679
Members
48,796
Latest member
Greg L.

Latest Threads

Top