Advice on long running processes

commander_coder · Oct 11, 2007

Hello,

I write a lot of CGI scripts, in Python of course. Now I need to
convert some to long-running processes. I'm having trouble finding
resources about the best practices to do that.

I've found a lot of email discussions that say something like, "You
need to educate yourself about the differences when you have long-
running processes" but I've not had a lot of luck with finding things
that explain the differences. I've seen some talk about database
timeouts, for instance, but I'm not sure I understand the problems.
Can anyone here suggest some resources? I'd be happy with web sites,
with buying a book, anything.

I ask here because I write in Python and so if those resources used
Python then that would be super.

Thanks,
Jim

Roy Smith · Oct 11, 2007

Hello,

I write a lot of CGI scripts, in Python of course. Now I need to
convert some to long-running processes. I'm having trouble finding
resources about the best practices to do that.

I've found a lot of email discussions that say something like, "You
need to educate yourself about the differences when you have long-
running processes" but I've not had a lot of luck with finding things
that explain the differences.

The biggest differences between run-and-exit vs. long running processes are
resource management and error recovery. Let's take them one at a time.

Resource management. In a short-lived process, you really don't have to
worry about this at all. Snarf as much memory as you need, open as many
files as you want, and when you exit, the operating system cleans it all up
for you. With a long running process, you have to worry about stuff like
that.

In Python, you're isolate from the low-level details of memory management,
but still need to think about it a bit. Imagine you had code that looked
like this in your main loop:

for request in getNextRequest():
requestList.append (request)
processRequest(request)

requestList is going to keep growing without bounds and eventually will eat
up all available memory in the system and your process will crash.
Everything you store, you also need to delete when you're done with it.

Same with files. In a short-lived process, you can generally open as many
files as you want and never worry about closing them. It unlikely you will
ever run out of file descriptors. In a long running process, that's not
the case. If you open a new file each time you get a request and never
close it, after a few hundred requests (depending on the operating system,
maybe even a few thousand), you'll run out of file descriptors.

The other big thing is error recovery. In a short lived process, if
something fails, you print an error message and exit. In a long running
process, you need to somehow recover from the error and keep going as best
you can. This can be tricky.

Bruno Barberi Gnecco · Oct 11, 2007

Roy said:
The biggest differences between run-and-exit vs. long running processes are
resource management and error recovery. Let's take them one at a time.

Resource management. In a short-lived process, you really don't have to
worry about this at all. Snarf as much memory as you need, open as many
files as you want, and when you exit, the operating system cleans it all up
for you. With a long running process, you have to worry about stuff like
that.

In Python, you're isolate from the low-level details of memory management,
but still need to think about it a bit. Imagine you had code that looked
like this in your main loop:

for request in getNextRequest():
requestList.append (request)
processRequest(request)

requestList is going to keep growing without bounds and eventually will eat
up all available memory in the system and your process will crash.
Everything you store, you also need to delete when you're done with it.

In particular, it is a good idea to call gc.collect() every now
and then, specially if you are in such a loop. I don't know what is the
gc policy in python, but an application of mine that seemed to eat as much
memory as it was available was reduced to a constant small amount of
memory after I started to call the gc directly.

The other big thing is error recovery. In a short lived process, if
something fails, you print an error message and exit. In a long running
process, you need to somehow recover from the error and keep going as best
you can. This can be tricky.

You should have your main loop inside a try/except, to catch any
exceptions that were not otherwise caught without exiting the application.
Log the exceptions, of course, but in most long running applications
work in a loop like Roy Smith's code above, so if one of them fails it
won't disrupt the others to come.

Bruno Desthuilliers · Oct 11, 2007

(e-mail address removed) a écrit :

Hello,

I write a lot of CGI scripts, in Python of course. Now I need to
convert some to long-running processes. I'm having trouble finding
resources about the best practices to do that.

I've found a lot of email discussions that say something like, "You
need to educate yourself about the differences when you have long-
running processes" but I've not had a lot of luck with finding things
that explain the differences. I've seen some talk about database
timeouts, for instance, but I'm not sure I understand the problems.
Can anyone here suggest some resources? I'd be happy with web sites,
with buying a book, anything.

I ask here because I write in Python and so if those resources used
Python then that would be super.

As far as I'm concerned, I'd go for one of the available wsgi frameworks.

commander_coder · Oct 12, 2007

Thank you to folks for the replies.

Jim

What's with "long running processes" ?	1	Sep 17, 2007
Python programmer looking at transitioning to Java. Any advice or resources?	3	Oct 18, 2022
[web] Long-running process: FCGI? SCGI? WSGI?	1	Sep 5, 2012
any issues with long running python apps?	24	Jul 9, 2010
Running Python web apps on shared ASO servers?	6	Aug 12, 2012
Python and PEP8 - Recommendations on breaking up long lines?	19	Nov 28, 2013
Looking for general advice on complex program	5	Jul 15, 2011
Processes.	1	Sep 15, 2003

Advice on long running processes

commander_coder

Roy Smith

Bruno Barberi Gnecco

Bruno Desthuilliers

commander_coder

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads