RegEx for punctuation?

J

John Joyce

Hey all,
I have a quick question about RegEx, which I don't use often and I
don't have my reference book handy.
How do I gsub all non alpha-numeric characters to nothing,
I know it's something like this:
some_string.gsub!('something here', '')

but I can't for the life of me remember the RegEx for this.

thanks ahead of time

John Joyce
 
A

Alex Young

John said:
Hey all,
I have a quick question about RegEx, which I don't use often and I don't
have my reference book handy.
How do I gsub all non alpha-numeric characters to nothing,
I know it's something like this:
some_string.gsub!('something here', '')
irb(main):002:0> ",.'abc123".gsub(/[^[:alnum:]]/, '')
=> "abc123"
 
J

John Joyce

John said:
Hey all,
I have a quick question about RegEx, which I don't use often and I
don't have my reference book handy.
How do I gsub all non alpha-numeric characters to nothing,
I know it's something like this:
some_string.gsub!('something here', '')
irb(main):002:0> ",.'abc123".gsub(/[^[:alnum:]]/, '')
=> "abc123"
Thanks a million!
I'm going to have to do more RegEx stuff to get it buried into my brain!

One more question though, I previously had this to turn spaces into
underscores
some_string.gsub(' ','')

How can do this while removing all non-space characters?

John Joyce
 
J

John Joyce

John said:
Hey all,
I have a quick question about RegEx, which I don't use often and
I don't have my reference book handy.
How do I gsub all non alpha-numeric characters to nothing,
I know it's something like this:
some_string.gsub!('something here', '')
irb(main):002:0> ",.'abc123".gsub(/[^[:alnum:]]/, '')
=> "abc123"
Thanks a million!
I'm going to have to do more RegEx stuff to get it buried into my
brain!

One more question though, I previously had this to turn spaces into
underscores
some_string.gsub(' ','')

How can do this while removing all non-space characters?

John Joyce
Found it. In Peter Cooper's book! (I knew I bought that ebook for a
reason)
this is my final version:

@asset.permalink = @asset.name.downcase.gsub(' ', '_').gsub(/
\W/,'')

downcase everything, turn spaces into underscores, then take all non-
alpha/non-numeric/non-underscore characters out.
sweet and simple.
For anyone else looking for it in the archives,
\w matches all alpha/numeric/underscore characters
\W matches everything else.

I could go a little further in the case of something like

title = 'Untitled #234'

and do this:

title = title.downcase.gsub(' ', '_').gsub('#', 'number').gsub(/\W/, '')

resulting in:
untitled_number234

Nice and tidy. Could be a bit further refined to make sure a space is
prepended to 'number' if no space precedes '#' and to make sure a
number actually follows '#' in before going to all the trouble.
If I wanted hard to read, I could even try to squeeze it all into one
big RegEx, but the thing I've got above will suffice for now.

cheers,
John Joyce
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,774
Messages
2,569,599
Members
45,175
Latest member
Vinay Kumar_ Nevatia
Top