string stripping issues

orangeDinosaur · Mar 2, 2006

Hello,

I am encountering a behavior I can think of reason for. Sometimes,
when I use the .strip module for strings, it takes away more than what
I've specified. For example:

returns:

'ughes. John</TD>\r\n'

However, if I take another string, for example:

returns:

'Kim, Dong-Hyun</TD>\r\n'

I don't understand why in one case it eats up the 'H' but in the next
case it leaves the 'K' alone.

Ben Cartwright · Mar 2, 2006

orangeDinosaur said:
I am encountering a behavior I can think of reason for. Sometimes,
when I use the .strip module for strings, it takes away more than what
I've specified. For example:

returns:

'ughes. John</TD>\r\n'

However, if I take another string, for example:

returns:

'Kim, Dong-Hyun</TD>\r\n'

I don't understand why in one case it eats up the 'H' but in the next
case it leaves the 'K' alone.

That method... I do not think it means what you think it means. The
argument to str.strip is a *set* of characters, e.g.:
'XabbaX'

For more info, see the string method docs:
http://docs.python.org/lib/string-methods.html
To do what you're trying to do, try this:

>>> prefix = 'hello '
>>> bar = 'hello world!'
>>> if bar.startswith(prefix): bar = bar[:len(prefix)] ...
>>> bar

Click to expand...

Click to expand...

'world!'

--Ben

=?iso-8859-1?B?aWFuYXLp?= · Mar 2, 2006

from the python manual:

strip( [chars])
The chars argument is not a prefix or suffix; rather, all combinations
of its values are stripped: 'example'

in your case since the letter 'H' is in your [chars] and the name
starts with an H it gets stripped, but with the second one the first
letter is a K so it stops there.
Maybe you can use:

'Hughes. John said:
a[31:]

Click to expand...

'Hughes. John said:

b[31:]

Click to expand...

Click to expand...

'Kim, Dong-Hyun</TD>\r\n'

but maybe what you REALLY want is:

a[31:-14] 'Hughes. John'
b[31:-14]

Click to expand...

Click to expand...

'Kim, Dong-Hyun'

Ben Cartwright · Mar 2, 2006

Ben said:
orangeDinosaur said:

I am encountering a behavior I can think of reason for. Sometimes,
when I use the .strip module for strings, it takes away more than what
I've specified. For example:

returns:

'ughes. John</TD>\r\n'

However, if I take another string, for example:

returns:

'Kim, Dong-Hyun</TD>\r\n'

I don't understand why in one case it eats up the 'H' but in the next
case it leaves the 'K' alone.

Click to expand...

That method... I do not think it means what you think it means. The
argument to str.strip is a *set* of characters, e.g.:
'XabbaX'

For more info, see the string method docs:
http://docs.python.org/lib/string-methods.html
To do what you're trying to do, try this:

prefix = 'hello '
bar = 'hello world!'
if bar.startswith(prefix): bar = bar[:len(prefix)] ...
bar

Click to expand...

Click to expand...

'world!'

Apologies, that should be:

>>> prefix = 'hello '
>>> bar = 'hello world!'
>>> if bar.startswith(prefix): bar = bar[len(prefix):] ...
>>> bar

Click to expand...

Click to expand...

'world!'

--Ben

orangeDinosaur · Mar 2, 2006

thanks!

P Boy · Mar 3, 2006

This seems like a web page parsing question. Another approach can be as
follows if you know the limiting token strings:

a.split(' <TD WIDTH=175>')[1].split('</TD>\r\n')[0]

Iain King · Mar 3, 2006

Ben said:
Ben said:

orangeDinosaur said:

I am encountering a behavior I can think of reason for. Sometimes,
when I use the .strip module for strings, it takes away more than what
I've specified. For example:

a = ' <TD WIDTH=175>Hughes. John</TD>\r\n'

a.strip(' <TD WIDTH=175>')

returns:

'ughes. John</TD>\r\n'

However, if I take another string, for example:

b = ' <TD WIDTH=175>Kim, Dong-Hyun</TD>\r\n'

b.strip(' <TD WIDTH=175>')

returns:

'Kim, Dong-Hyun</TD>\r\n'

I don't understand why in one case it eats up the 'H' but in the next
case it leaves the 'K' alone.

Click to expand...

That method... I do not think it means what you think it means. The
argument to str.strip is a *set* of characters, e.g.:

foo = 'abababaXabbaXabababbbb'
foo.strip('ab') 'XabbaX'
foo.strip('aabababaab') # no difference!

Click to expand...

'XabbaX'

For more info, see the string method docs:
http://docs.python.org/lib/string-methods.html
To do what you're trying to do, try this:

prefix = 'hello '
bar = 'hello world!'
if bar.startswith(prefix): bar = bar[:len(prefix)] ...
bar

Click to expand...

'world!'

Click to expand...

Apologies, that should be:

prefix = 'hello '
bar = 'hello world!'
if bar.startswith(prefix): bar = bar[len(prefix):] ...
bar

Click to expand...

Click to expand...

'world!'

or instead of:

a.strip(' <TD WIDTH=175>')

use:

a.replace(' <TD WIDTH=175>','')

Iain

Larry Bates · Mar 3, 2006

orangeDinosaur said:
Hello,

I am encountering a behavior I can think of reason for. Sometimes,
when I use the .strip module for strings, it takes away more than what
I've specified. For example:

returns:

'ughes. John</TD>\r\n'

However, if I take another string, for example:

returns:

'Kim, Dong-Hyun</TD>\r\n'

I don't understand why in one case it eats up the 'H' but in the next
case it leaves the 'K' alone.

Others have explained the exact problem, I'll make a suggestion.
Take a few minutes to look at BeautifulSoup. It parses HTML code
and allows for extractions of data from strings like this in a
very easy to use way. If this is a one-off thing, don't bother.
If you do this commonly, BeautifulSoup is worth a little study.

-Larry Bates

stripping unwanted chars from string	7	May 4, 2006
Filter table rows based on multiple checkboxes value	2	Jan 13, 2023
Blue J Ciphertext Program	2	Nov 22, 2023
My Status, Ciphertext	2	Nov 28, 2023
Problem stripping line feeds	3	Jul 24, 2004
Stripping characters...	1	Jan 17, 2006
Dont work, it´s something whit the loops?	1	Jun 30, 2021
Issue with textbox script?	0	Sep 5, 2022

string stripping issues

orangeDinosaur

Ben Cartwright

=?iso-8859-1?B?aWFuYXLp?=

Ben Cartwright

orangeDinosaur

P Boy

Iain King

Larry Bates

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads