Python equivalent of Perl's $/

John K Masters · Aug 19, 2007

I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?

Regards, John

kyosohma · Aug 20, 2007

I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?

Regards, John

Python has a Regular Expressions module. Check it out here:
http://docs.python.org/lib/module-re.html

There's also a chapter from Dive Into Python that covers this topic
too:
http://www.diveintopython.org/regular_expressions/index.html

Finally, Python "while" statement's docs can be found here:
http://docs.python.org/ref/while.html

Hope that helps!

Mike

Mark T · Aug 20, 2007

John K Masters said:
I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?

Regards, John

['test\ntest2', 'test3\ntest4', 'test5']

-Mark T.

Nick Craig-Wood · Aug 20, 2007

John K Masters said:
I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?

Regards, John

Something like this maybe?

import re

input_data = """I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?
"""

for para in re.split(r"\.\n", input_data):
print "para = %r" % para

John K Masters · Aug 20, 2007

Something like this maybe?

import re

input_data = """I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?
"""

for para in re.split(r"\.\n", input_data):
print "para = %r" % para

Thanks, that looks promising. The Perl examples are really confusing
sometimes and throw me off the track of the obvious Python way. That
said, the Python documentation does not always make it clear, at least
not to me, how to get the result one wants.

Regards, John

attn.steven.kuo · Aug 20, 2007

I am currently working my way through Jeffrey Friedl's book Mastering
Regular Expressions. Great book apart from the fact it uses Perl for the
examples.

One particular expression that interests me is '$/ = ".\n"' which,
rather than splitting a file into lines, splits on a period-newline
boundary. Combined with Perl's 'while (<>)' construct this seems a great
way to process the files I am interested in.

Without wishing to start a flame war, is there a way to do this in Python?

import StringIO

text = """\
To mimic Perl's input record separator in
Python, you can use a generator.
And a substring test.
Perhaps something like the following
is what you wanted.
"""

mockfile = StringIO.StringIO(text)

def genrecords(mockfile, sep=".\n"):
buffer = ""
while True:
while sep in buffer:
idx = buffer.find(sep) + len(sep)
yield buffer[:idx]
buffer = buffer[idx:]
rl = mockfile.readline()
if rl == "":
break
else:
buffer = '%s%s' % (buffer, rl)
yield buffer
raise StopIteration

for record in genrecords(mockfile):
print "READ:", record

John K Masters · Aug 20, 2007

import StringIO

text = """\
To mimic Perl's input record separator in
Python, you can use a generator.
And a substring test.
Perhaps something like the following
is what you wanted.
"""

mockfile = StringIO.StringIO(text)

def genrecords(mockfile, sep=".\n"):
buffer = ""
while True:
while sep in buffer:
idx = buffer.find(sep) + len(sep)
yield buffer[:idx]
buffer = buffer[idx:]
rl = mockfile.readline()
if rl == "":
break
else:
buffer = '%s%s' % (buffer, rl)
yield buffer
raise StopIteration

for record in genrecords(mockfile):
print "READ:", record

Thanks, this also looks like a good way to go but ATM beyond my level of
Python knowledge. I've not reached the generator chapter yet but I'll
flag the message and return later.

Regards, John

attn.steven.kuo · Aug 20, 2007

(snipped)

Thanks, this also looks like a good way to go but ATM beyond my level of
Python knowledge. I've not reached the generator chapter yet but I'll
flag the message and return later.

Regards, John

Some features in Perl can be found in Python, so if you know
the former, then learning the latter ought to go smoothly. In
any case, here's an updated version of the generator that
avoid repeating an unncessary string search:

def genrecords(mockfile, sep=".\n"):
"""
"""
buffer = ""
while True:
idx = buffer.find(sep) + len(sep)
while idx >= len(sep):
yield buffer[:idx]
buffer = buffer[idx:]
idx = buffer.find(sep) + len(sep)
rl = mockfile.readline()
if rl == "":
break
else:
buffer = '%s%s' % (buffer, rl)
yield buffer
raise StopIteration

Does Python have a Template::Extract equivalent from Perl's CPAN	5	May 27, 2005
pythonic equivalent of upvar?	3	Dec 20, 2005
Is it possible to store data in a Python file in a way similar toRuby's __END__ section?	1	Apr 2, 2010
Something about stripping C/C++ comments in perldoc	9	Apr 18, 2006
The devolution of English language and slothful c.l.p behaviors exposed!	50	Jan 24, 2012
Feasibility of using Perl to strip selected html tags and/or attributes (scripting/programming novic	2	May 12, 2005
[perl-python] Python documentation moronicities (continued)	75	Apr 12, 2005
What is the Decisive "Clash" of Our Time?	5	Jan 31, 2007

Python equivalent of Perl's $/

John K Masters

kyosohma

Mark T

Nick Craig-Wood

John K Masters

attn.steven.kuo

John K Masters

attn.steven.kuo

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads