passing multiple strings to string.find()

hokiegal99 · Aug 8, 2003

How do I say:

x = string.find(files, 'this', 'that', 'the-other')

currently I have to write it like this to make it work:

x = string.find(files, 'this')
y = string.find(files, 'that')
z = string.find(files, 'the-other')

Raymond Hettinger · Aug 8, 2003

hokiegal99 said:
How do I say:

x = string.find(files, 'this', 'that', 'the-other')

currently I have to write it like this to make it work:

x = string.find(files, 'this')
y = string.find(files, 'that')
z = string.find(files, 'the-other')

Try this:

x, y, z = map(files.find, ['this', 'that', 'the-other'])

or, if you're just trying to find the first match:

re.search('this|that|the-other', files).start()
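
A minimal Python 3 sketch of both suggestions, assuming `files` is a single string (the sample value below is made up). `str.find` returns -1 for a missing substring, while `re.search` returns `None` when nothing matches, so the `.start()` call needs a guard.

```python
import re

# Hypothetical sample data; the original post does not show what `files` holds.
files = "notes about that file and this file"
needles = ['this', 'that', 'the-other']

# One find() per needle; -1 means "not found".
x, y, z = map(files.find, needles)

# First match of any needle; re.escape guards against regex metacharacters.
pattern = '|'.join(map(re.escape, needles))
m = re.search(pattern, files)
first = m.start() if m else -1
```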

OTOH, you've hinted at an application that may not be
appropriate for multiple string searches. Instead, look
at building a dictionary or list of files -- they are most
easily searched and better suited for associating other
data such as file sizes, etc.
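
One way to read that suggestion, as a sketch only: keep the file names in a dictionary keyed by name, with the extra data as values. The names and sizes below are invented for illustration.

```python
# Hypothetical file names mapped to sizes in bytes.
file_sizes = {'this': 1024, 'that': 2048, 'readme.txt': 512}

wanted = ['this', 'that', 'the-other']

# Membership tests on a dict match whole names, unlike substring searches.
found = {name: file_sizes[name] for name in wanted if name in file_sizes}
missing = [name for name in wanted if name not in file_sizes]
```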

Raymond Hettinger

Bengt Richter · Aug 8, 2003

hokiegal99 said:
How do I say:

x = string.find(files, 'this', 'that', 'the-other')

currently I have to write it like this to make it work:

x = string.find(files, 'this')
y = string.find(files, 'that')
z = string.find(files, 'the-other')

You might try the re module, e.g.,
...     m = rxo.search(' Find this or the-other or that and this.', pos)
...     if not m: break
...     print '%4s: %s' % (m.start(), m.group())
...     pos = m.end()
...
   6: this
  14: the-other
  27: that
  36: this

If some search strings have a common prefix, you'll have to put
the longest first in the regex, since re grabs the first match it sees.
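
A self-contained Python 3 sketch of the same scan. The pattern compiled into `rxo` is an assumption built from the search strings in the question, and `finditer` stands in for the manual `pos` loop.

```python
import re

# Assumed pattern: the three search strings from the question, escaped and joined.
rxo = re.compile('|'.join(map(re.escape, ['this', 'that', 'the-other'])))

text = ' Find this or the-other or that and this.'

# finditer yields successive non-overlapping matches, left to right.
hits = [(m.start(), m.group()) for m in rxo.finditer(text)]
for start, word in hits:
    print('%4s: %s' % (start, word))
```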

Regards,
Bengt Richter

François Pinard · Aug 9, 2003

[Fredrik Lundh]

Francois Pinard wrote:

Given the above,

build_regexp(['this', 'that', 'the-other'])

yields the string 'th(?:is|at|e\\-other)', which one may choose to
`re.compile' before use.

the SRE compiler looks for common prefixes, so "th(?:is|at|e\\-other)" is
no different from "this|that|the-other" on the engine level.

Thanks for the note. So the `build_regexp' function is not useful after
all. It was indirectly written around a speed problem in the GNU regexp
engine, but seemingly, the Python regexp engine knows better already. As I
wrote earlier, I first saw Emacs Lisp `regexp-opt' used within `enscript'.

A speed comparison between both methods shows that they are fairly
equivalent. A small difference is that `build_regexp', given that one of
the words is a prefix of another, automatically recognises the longest one,
while a naive regexp of '|'.join(words) recognises whatever happens to be
listed first. Of course, this is easily solved by sorting, then reversing
the word list before producing the naive regexp.
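
The prefix behaviour and the sort-then-reverse fix can be seen in a short Python 3 sketch; the word list here is invented for illustration.

```python
import re

# Hypothetical word list in which 'the' is a prefix of 'then'.
words = ['the', 'then', 'this']

# Naive alternation: the first listed alternative that matches wins.
naive = re.compile('|'.join(map(re.escape, words)))

# Reverse lexicographic order puts a word ahead of any of its prefixes.
ordered = sorted(words, reverse=True)
longest_first = re.compile('|'.join(map(re.escape, ordered)))

naive_hit = naive.search('and then').group()
fixed_hit = longest_first.search('and then').group()
```

With this list, the naive pattern stops at 'the' while the reordered one picks up 'then'.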
