Control flow design question

JannaB

I have a sockets server, listening on a single port for connections
from 1 to 400 different terminals on the internet.

These terminals are grouped into what I call "channels." That is,
terminals 1, 5 and 119 may all be from channel "A."

I only want to process 1 channel at a time. In other words, if I get a
signal from terminal 5, and one from 119, the latter must wait until
the former is processed. (Incidentally, I don't write out from the
sockets server; rather, it writes some JSON data that is ultimately
transmitted back to the appropriate channel terminals.)

Each connection to my sockets server will be making some JDBC inserts
and updates.

I am wondering, structurally, how best to handle this, realising that
architectural questions such as this are best handled properly from
the outset. Thank you, Janna B.
 
Eric Sosman

JannaB said:
> [original question snipped]

My first thought would be to use a different incoming
socket for connections on each channel, then write what
amounts to an ordinary single-threaded one-request-at-a-time
server for each channel's socket. But if you've got to use
the same IP/port for all the clients, that won't work.

Another approach would be to maintain N queues of requests,
one for each channel. A single thread accepts incoming
requests on the single port, figures out which channel each
belongs to, and appends each request to its channel's queue.
Each queue's requests are processed by one dedicated worker
thread.
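A minimal sketch of this queue-per-channel scheme in Java (the class and method names here are illustrative, not from the thread):

```java
import java.util.Map;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.LinkedBlockingQueue;

class ChannelDispatcher {
    private final Map<String, BlockingQueue<Runnable>> queues = new ConcurrentHashMap<>();

    // Called from the single accept/read thread once the channel is known.
    void dispatch(String channel, Runnable request) {
        queues.computeIfAbsent(channel, ch -> {
            BlockingQueue<Runnable> q = new LinkedBlockingQueue<>();
            Thread worker = new Thread(() -> {
                try {
                    while (true) {
                        q.take().run();  // strictly one request at a time per channel
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }, "worker-" + ch);
            worker.setDaemon(true);
            worker.start();
            return q;
        }).add(request);
    }
}
```

Production code would also catch per-request exceptions inside the worker loop so one bad request cannot kill a channel's thread.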

A disadvantage of the second approach is that it forces
you to have the same number of worker threads as channels,
which might not be convenient (the one-request-per-channel
requirement means you can't have *more* workers than channels,
but if the channel count is large compared to the "CPU count"
you might well want to have fewer). Instead, you could have
N queues as above but just M worker threads: A worker finds
the queue with the oldest (or highest-priority) request at
the front and processes that request, but during the processing
it marks the queue "ineligible" so no other worker will look
at it. When the queue's first request is completed, the queue
becomes "eligible" again and the worker repeats its cycle.
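A rough sketch of that M-worker variant, using an atomic "eligible" flag per queue; a simple round-robin sweep stands in for the oldest-request-first selection, and all names are assumptions:

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.List;
import java.util.concurrent.atomic.AtomicBoolean;

class ChannelQueue {
    final Deque<Runnable> requests = new ArrayDeque<>();
    final AtomicBoolean eligible = new AtomicBoolean(true);
}

class WorkerPool {
    private final List<ChannelQueue> queues;

    WorkerPool(List<ChannelQueue> queues, int m) {
        this.queues = queues;
        for (int i = 0; i < m; i++) {
            Thread t = new Thread(this::workLoop, "worker-" + i);
            t.setDaemon(true);
            t.start();
        }
    }

    static void enqueue(ChannelQueue q, Runnable r) {
        synchronized (q.requests) { q.requests.addLast(r); }
    }

    private void workLoop() {
        while (!Thread.currentThread().isInterrupted()) {
            for (ChannelQueue q : queues) {
                // Claim the queue: the CAS succeeds for at most one worker,
                // so two workers can never process the same channel at once.
                if (q.eligible.compareAndSet(true, false)) {
                    Runnable r;
                    synchronized (q.requests) { r = q.requests.pollFirst(); }
                    if (r != null) r.run();
                    q.eligible.set(true);  // release: queue is eligible again
                }
            }
            try { Thread.sleep(1); } catch (InterruptedException e) { return; }
        }
    }
}
```

The polling sweep is the simplest thing that could work; a real implementation might instead keep a shared priority queue of eligible channels so idle workers block rather than spin.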
 
JannaB

Thanks Eric,

As it turns out, I CAN use a different IP/Port for each socket, so I
CAN go with your former idea. Seems like the most efficient AND
safest. However, what to do about db connections? I must keep
persistent connections, I cannot keep re-establishing connections. -
Janna
 
Robert Klemme

Eric Sosman said:
> My first thought would be to use a different incoming
> socket for connections on each channel, then write what
> amounts to an ordinary single-threaded one-request-at-a-time
> server for each channel's socket. But if you've got to use
> the same IP/port for all the clients, that won't work.

That does not exactly meet the requirement as stated above: OP said that
he wanted to process only one channel at a time. It may be though that
he meant that he wants to process only one event at a time _per channel_.
> Another approach would be to maintain N queues of requests,
> one for each channel. A single thread accepts incoming
> requests on the single port, figures out which channel each
> belongs to, and appends each request to its channel's queue.
> Each queue's requests are processed by one dedicated worker
> thread.

Yep. That would be my favorite although I have no idea how the channel
is obtained from the connection.
> A disadvantage of the second approach is that it forces
> you to have the same number of worker threads as channels,
> which might not be convenient (the one-request-per-channel
> requirement means you can't have *more* workers than channels,
> but if the channel count is large compared to the "CPU count"
> you might well want to have fewer). Instead, you could have
> N queues as above but just M worker threads: A worker finds
> the queue with the oldest (or highest-priority) request at
> the front and processes that request, but during the processing
> it marks the queue "ineligible" so no other worker will look
> at it. When the queue's first request is completed, the queue
> becomes "eligible" again and the worker repeats its cycle.

In that case I'd rather have M queues and M workers and put something
into the event that is enqueued which allows detection of the channel.
Then place events for multiple channels in one queue.
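One simple way to get that fixed assignment of channels to queues is to hash the channel identifier; a sketch (class name and queue count are assumed):

```java
class ChannelRouter {
    private final int m;   // number of queues / worker threads

    ChannelRouter(int m) { this.m = m; }

    // Fixed assignment: every event of a given channel always maps to the
    // same queue index, so per-channel ordering and one-at-a-time handling
    // follow from each queue having a single worker.
    int queueFor(String channel) {
        return Math.floorMod(channel.hashCode(), m);
    }
}
```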

Kind regards

robert
 
John B. Matthews

JannaB said:
> As it turns out, I CAN use a different IP/Port for each socket, so I
> CAN go with your former idea. Seems like the most efficient AND
> safest. However, what to do about db connections? I must keep
> persistent connections, I cannot keep re-establishing connections.

If you're going with Eric's first plan, a single socket for each
channel that handles one request at a time, let each handler maintain
its own persistent connection. Each channel handler can use the same
or a different login for its individual connection. That user should
generally have the fewest privileges needed to effect a transaction for
a given channel.
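A sketch of such a handler keeping one persistent connection and re-opening it only when it has gone stale; the JDBC URL and credentials are placeholders, and the `open()` hook is an assumption that makes the opening strategy (or a pool) swappable:

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.SQLException;

class ChannelHandler {
    private final String url, user, password;
    private Connection conn;

    ChannelHandler(String url, String user, String password) {
        this.url = url;
        this.user = user;
        this.password = password;
    }

    // Hook: how a fresh connection is obtained (could delegate to a pool).
    protected Connection open() throws SQLException {
        return DriverManager.getConnection(url, user, password);
    }

    // Reuses the handler's single connection; re-opens only if it is stale.
    synchronized Connection connection() throws SQLException {
        if (conn == null || !conn.isValid(2)) {
            conn = open();
        }
        return conn;
    }
}
```

Because each channel's requests are handled single-threaded, this connection is never used by two threads at once, which is what makes the one-connection-per-handler design safe.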

Alternatively, you might look at connection pooling:

<http://java.sun.com/developer/onlineTraining/Programming/JDCBook/conpool.html>

[Please don't top post.]
 
Robert Klemme

> If I understand you, requests would be sprayed across all M
> queues and thus be eligible for processing by any of the M workers.

Yes, but not requests of a single channel. Those all go to a single
queue! That's what I meant although I see that I did not state it
explicitly.
> But if worker W1 is processing a request from channel A, worker W2
> must not start work on another channel A request until W1 finishes.

That would be easily achieved by having fixed assignments of channels to
queues. Of course, this could waste threads, but since we are talking
about a "huge amount of channels" scenario this is probably negligible.

That could be solved with a more complex scheme but, as you said, I
believe we have provided quite a bit of food for thought already.
> Also, if it is important to process A's requests in the order they
> arrived or in order by their priorities (the O.P. didn't address
> the matter, but an ordering discipline of some kind is often wanted),
> spraying A's events across multiple queues will make it harder
> to keep their relative order intact.

This is also solved by the fixed assignment of channels to queues.
> At any rate, it seems we've given the O.P. sufficient food
> for thought.

Definitively. :) I'd also throw Doug Lea's book into the mix, which is
an excellent source:
http://gee.cs.oswego.edu/dl/cpj/

Kind regards

robert
 
Robert Klemme

> I still don't get it. If channels A and B send all their
> requests to queue Q1 for service by thread T1, and if only T1
> processes requests on Q1, then in effect you've combined A and B
> into one super-channel and applied the "No simultaneous service"
> constraint to the combined channel instead of to A and B separately.
> In particular, while T1 is busy with an A request, idle thread T2
> cannot work on a B request -- a constraint not present in the
> original problem.
>
> And if the threads *can* pluck work from any queue at all,
> then ensuring that T1 and T2 don't both work on A requests at
> the same time becomes difficult again.

As I said, there may be some thread wastage, and different schemes can
be realized. Since we are talking about the high-load situation, your
concern is valid but the effect negligible - at least if there is
traffic on all channels most of the time. All threads will be busy most
of the time anyway.

Cheers

robert
 
