recording data between [ and ]

rbt · Apr 21, 2005

Output from 'netstat -b' on a win2003 server will show what binary is
responsible for the connection. For example, it may list something like
this along with other connection specific data:

[lsass.exe]
[System]
[firefox.exe]
[iexplorer.exe]

How might I process the output so that anything within brackets is
recorded to a log file of my own making? I know how to parse and record
things to a file, but I don't know how to make '[' and ']' act as
special characters so that I can record what's between them.

Basically, I want a script that will read the output, start recording
each time it encounters a '[', and keep recording until it gets to ']',
whereupon it would stop recording and repeat the operation as it
goes through the remaining data.

Thanks,
rbt

Peter Hansen · Apr 21, 2005

rbt said:
Output from 'netstat -b' on a win2003 server will show what binary is
responsible for the connection. For example, it may list something like
this along with other connection specific data:

[lsass.exe]
[System]
[firefox.exe]
[iexplorer.exe]

How might I process the output so that anything within brackets is
recorded to a log file of my own making? I know how to parse and record
things to a file, but I don't know how to make '[' and ']' act as
special characters so that I can record what's between them.

Does this help?

>>> import re
>>>
>>> s = '''stuff [lsass.exe]
... [System] more stuff
... xxxxx [firefox.exe] ......
... '''
>>>
>>> re.findall(r'\[([^]]*)\]', s)
['lsass.exe', 'System', 'firefox.exe']

-Peter
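
For readers who want the pattern spelled out: `\[` matches a literal opening bracket, `([^]]*)` captures any run of characters that are not `]`, and `\]` matches the closing bracket. A minimal sketch of how this might be combined with the logging the original question asks about follows. The names `BRACKETED` and `log_bracketed`, the sample text, and the log file name are assumptions made for illustration, not part of the thread.

```python
import os
import re
import tempfile

# Matches '[', captures everything up to the next ']', then matches ']'.
BRACKETED = re.compile(r'\[([^]]*)\]')

def log_bracketed(text, log_path):
    """Append each bracketed name found in text to log_path, one per line."""
    names = BRACKETED.findall(text)
    with open(log_path, 'a') as log:
        for name in names:
            log.write(name + '\n')
    return names

# Stand-in for captured 'netstat -b' output (made up for illustration).
sample = '''TCP host:1025 remote:80 ESTABLISHED 1234
[firefox.exe]
TCP host:135 remote:0 LISTENING 4
[System]
'''

# Write into a fresh temporary directory so the sketch has no side effects
# on existing files.
log_file = os.path.join(tempfile.mkdtemp(), 'netstat_binaries.log')
found = log_bracketed(sample, log_file)
print(found)
```

In real use the text would come from the command's output rather than a literal string; how that output is captured is left out here.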

rbt · Apr 21, 2005

Peter said:
rbt said:

Output from 'netstat -b' on a win2003 server will show what binary is
responsible for the connection. For example, it may list something
like this along with other connection specific data:

[lsass.exe]
[System]
[firefox.exe]
[iexplorer.exe]

How might I process the output so that anything within brackets is
recorded to a log file of my own making? I know how to parse and
record things to a file, but I don't know how to make '[' and ']'
act as special characters so that I can record what's between them.

Does this help?

>>> import re
>>> s = '''stuff [lsass.exe]
... [System] more stuff
... xxxxx [firefox.exe] ......
... '''
>>> re.findall(r'\[([^]]*)\]', s)
['lsass.exe', 'System', 'firefox.exe']

-Peter

Yes, it does... may take me a few minutes to get my head around it
though. Why do re's have to be so arcane and complicated... especially
in Python?

It's hard to preach 'ease of use' with stuff such as this in the
language. Perhaps one day it can be rolled up into something that
*really* is easy to understand:

import string

fp = file('filename')
data = fp.read()
fp.close()

string.between(data,[,])
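
No `string.between` exists in the standard library, but the "start at '[', record until ']'" loop described in the original question can be sketched with plain string methods and no regular expressions. The function name `between_plain` and the sample text are hypothetical, chosen only for this illustration.

```python
def between_plain(data, start, end):
    """Return every substring of data that sits between start and end."""
    results = []
    pos = 0
    while True:
        begin = data.find(start, pos)
        if begin == -1:
            break                      # no more start markers
        begin += len(start)            # skip past the start marker itself
        stop = data.find(end, begin)
        if stop == -1:
            break                      # unterminated marker: stop scanning
        results.append(data[begin:stop])
        pos = stop + len(end)          # resume after the end marker
    return results

sample = 'stuff [lsass.exe]\n[System] more stuff\nxxxxx [firefox.exe] ......\n'
print(between_plain(sample, '[', ']'))
```

This trades the compactness of the regular expression for a loop that mirrors the wording of the question step by step.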

Diez B. Roggisch · Apr 21, 2005

Yes, it does... may take me a few minutes to get my head around it
though. Why do re's have to be so arcane and complicated... especially
in Python?

It's hard to preach 'ease of use' with stuff such as this in the
language. Perhaps one day it can be rolled up into something that
*really* is easy to understand:

Welcome to the wonderful world of programming. Regular expressions are what
they are because they are modeled after a certain theory: that of finite
state automata and their correspondence to certain classes of grammars. And
they require a bit of understanding. And there is no language that does
them differently; some integrate them syntactically (like Perl), others
don't have them available in the standard lib at all. But if you get them,
they always look like that.

import string

fp = file('filename')
data = fp.read()
fp.close()

string.between(data,[,])

how about

import whatever_is_needed
solve_my_problem()

? Seriously: programming, or better said, the way we tell computers
what to do, might evolve through standardization to a point where lots of tasks
get easier. Your actual problem might be solved more easily one day if
command-line tools agree on a specific output format (viewed from today,
that possibly means XML) and on standard tools to deal with it.

But as the world is complex and people want solutions to their complex
problems, IMHO programming will always be about such nitty gritty details.

Fredrik Lundh · Apr 21, 2005

Diez said:
Welcome to the wonderful world of programming. Regular expressions are what
they are because they are modeled after a certain theory - that of finite
state automata and their correspondence to certain classes of grammars.

(except that Python regexps are not always regular, of course. and that
backtracking engines like the ones used in Perl and Python differ in subtle ways
from "real" DFA-based engines, etc. but as long as you're looking at things
from a proper distance, you're right, of course)

</F>
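
One concrete case of "not always regular" is the backreference. The sketch below is an illustration added for this point, not something posted in the thread: the pattern requires the same number of a's on both sides of a b, a language that no finite automaton can recognise.

```python
import re

# \1 must repeat exactly what group 1 captured, so the whole pattern
# accepts strings of the form a^n b a^n (n >= 1).
pattern = re.compile(r'(a+)b\1')

balanced = bool(pattern.fullmatch('aabaa'))    # two a's on each side
unbalanced = bool(pattern.fullmatch('aaba'))   # two a's, then only one
print(balanced, unbalanced)
```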

Roy Smith · Apr 21, 2005

Diez B. Roggisch said:
Welcome to the wonderful world of programming. Regular expressions are what
they are because they are modeled after a certain theory - that of finite
state automata and their correspondence to certain classes of
grammars.

Another way to look at it is that RE's are a programming language of
their own, and Python just provides an interface to it, just like it
provides interfaces to databases, network protocols, and operating
systems.

RE's predate Python by many years (at least as far back as the early
70's in a form we would recognize today), and have evolved over the
decades to become more powerful. Unfortunately, with power came
arcane syntax. On the good side, most of the time you can use a
smallish subset of the full RE syntax and still have some pretty
powerful pattern matching.

Python's motto is "there's one way to do it". Sometimes that means
"let's do it the way everybody else does it instead of reinventing it
ourselves". The Python RE module is certainly an example of that.

BTW, there's a pretty good Wikipedia article on RE's
(http://en.wikipedia.org/wiki/Regular_expression).

rbt · Apr 21, 2005

Roy said:
Another way to look at it is that RE's are a programming language of
their own, and Python just provides an interface to it, just like it
provides interfaces to databases, network protocols, and operating
systems.

RE's predate Python by many years (at least as far back as the early
70's in a form we would recognize today), and have evolved over the
decades to become more powerful. Unfortunately, with power came
arcane syntax. On the good side, most of the time you can use a
smallish subset of the full RE syntax and still have some pretty
powerful pattern matching.

Python's motto is "there's one way to do it". Sometimes that means
"let's do it the way everybody else does it instead of reinventing it
ourselves". The Python RE module is certainly an example of that.

BTW, there's a pretty good Wikipedia article on RE's
(http://en.wikipedia.org/wiki/Regular_expression).

Thanks guys... nothing against Python... just RE's in general.

Simon Brunning · Apr 21, 2005

string.between(data,[,])

import re

def between(data, start, end):
    # The captured part is any run of characters other than ']'.
    return re.findall(re.escape(start) + r'([^]]*)'+ re.escape(end), data)

foo = '''stuff [lsass.exe]
[System] more stuff
xxxxx [firefox.exe] ......
'''

print(between(foo, '[', ']'))

jay graves · Apr 21, 2005

I haven't used either of these tools but they might help a little.

http://lfw.org/python/rxb15.py
http://pyparsing.sourceforge.net/

If you want help building traditional regex patterns, I find
programs like these to be invaluable.

Tools/scripts/redemo.py in the python standard lib.
http://kodos.sourceforge.net/home.html
http://weitz.de/regex-coach/

HTH,
jay graves

Paul McGuire · Apr 21, 2005

Jay -

Thanks for the pyparsing plug.

Here is how the OP's program would look using pyparsing:

import pyparsing

fp = open('filename')
data = fp.read()
fp.close()

# Sample input; pass data to scanString instead to process the file.
foo = '''stuff [lsass.exe]
[System] more stuff
xxxxx [firefox.exe] ......
'''

LBRACK = pyparsing.Literal("[").suppress()
RBRACK = pyparsing.Literal("]").suppress()
brackettedStuff = LBRACK + pyparsing.SkipTo( RBRACK ) + RBRACK

for tokens,start,end in brackettedStuff.scanString( foo ):
    print(tokens[0])

--- fin ---
Now this is not nearly as terse as the regexp version, nor will it run
as fast. But I think I'd rather come back to this version 6 months
from now and try to figure out "what was this program doing again?".

-- Paul

jay graves · Apr 21, 2005

Paul said:
Jay -
Thanks for the pyparsing plug.

NP. pyparsing is on my list of stuff to play around with. I'm just
waiting for the proper problem to present itself.

Here is how the OP's program would look using pyparsing:

And the exact reason that I could 'plug' pyparsing is that I have read
many of your responses with sample pyparsing code. viral marketing at
its best and another reason to love c.l.py

....

jay

Diez B. Roggisch · Apr 21, 2005

Fredrik said:
(except that Python regexps are not always regular, of course. and that
backtracking engines like the ones used in Perl and Python differ in
subtle ways from "real" DFA-based engines, etc. but as long as you're
looking at things from a proper distance, you're right, of course)

They use backtracking? That's news to me. I always thought that mechanisms
like prefixes and backreferences can work in a strict DFA paradigm - with
possibly very large automata, but nevertheless.

Gotta go google I think....

Jim Sizelove · Apr 21, 2005

Simon said:
string.between(data,[,])

def between(data, start, end):
    return re.findall(re.escape(start) + r'([^]]*)'+ re.escape(end), data)

That's cool!
But it doesn't quite work if the end tag is not ']':

>>> import re
>>> def between(data, start, end):
...     return re.findall(re.escape(start) + r'([^]]*)'+ re.escape(end), data)
...
>>> foo = '''<stuff> [lsass.exe]
... [System] <more> stuff
... xxxxx<qqq> [firefox.exe] ......
... '''
>>> print(between(foo, '[', ']'))
['lsass.exe', 'System', 'firefox.exe']
>>> print(between(foo, '<', '>'))
['stuff', 'more> stuff\nxxxxx<qqq']

Here's a revised version that will work with other tags:

>>> def between2(data, start, end):
...     pattern = re.escape(start) + ' # start tag \n' +\
...               r'([^' + re.escape(end) + r']*)' + " # anything except end tag \n" +\
...               re.escape(end) + ' # end tag \n'
...     return re.findall(pattern, data, re.VERBOSE)
...
>>> print(between2(foo, '[', ']'))
['lsass.exe', 'System', 'firefox.exe']
>>> print(between2(foo, '<', '>'))
['stuff', 'more', 'qqq']

Regards,
Jim Sizelove
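
One caveat with between2: it builds a character class from the end tag, so it only treats the end tag as a unit when that tag is a single character. For a multi-character end tag such as '-->', the class would exclude '-' and '>' individually. A possible variant, sketched here under the hypothetical name `between_lazy` (not from the thread), uses a non-greedy group instead.

```python
import re

def between_lazy(data, start, end):
    """Find the text between start and end markers of any length."""
    # (.*?) expands one character at a time until the end marker matches.
    pattern = re.escape(start) + r'(.*?)' + re.escape(end)
    # re.DOTALL lets '.' cross newlines, as the negated class did.
    return re.findall(pattern, data, re.DOTALL)

foo = '''<!--stuff--> [lsass.exe]
[System] <!--more--> stuff
'''
comments = between_lazy(foo, '<!--', '-->')
binaries = between_lazy(foo, '[', ']')
print(comments, binaries)
```

The trade-off is that the non-greedy form relies on backtracking, whereas the negated character class scans forward in a single pass.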

bearophileHUGS · Apr 21, 2005

Diez B. Roggisch>But as the world is complex and people want solutions
to their complex problems, IMHO programming will always be about such
nitty gritty details.<

REs are like assembly, but high-level languages show us that for a
mammal there are (often) better (higher) ways to program a computer.
The computer can also compile the high-level language, to run it quite
quickly.

Besides rxb15, there is also redict, in the standard lib (Jay Graves
shows the HD path):
http://home.earthlink.net/~jasonrandharper/reverb.py

Maybe a higher level language (this is just a wrapper) like this can
become the standard way to make REs in Python.

Bearophile

jay graves · Apr 21, 2005

bearophileHUGS said:
Beside rxb15, there is also redict, in the standard lib (Jay Graves
shows the HD path):
http://home.earthlink.net/~jasonrandharper/reverb.py

I knew there was a newer one out there but my google skills failed me.
Thanks for the link.
