Ruby & Threads

Michael Boutros · Jul 14, 2008

Hello all,

I'm building an application that has to branch out and call about 10
other Ruby scripts. Since each script runs for a few seconds, waiting
for each one to finish in turn takes too long. So I've been looking
into threads and I have a system that's working (in tests), but I have
a few questions. First of all, the system:

require 'enumerator'

holder = []

array = (1..10).to_a
puts array.inspect

array.each_slice(3) do |group|
  group.each do |number|
    @thread = Thread.new do
      puts "Starting #{number}...\n"
      sleep(5)
      holder << number
    end
  end
end

@thread.join
puts holder.inspect

In theory, the output should look something like this:

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
Starting 1...
Starting 2...
Starting 3...
Starting 4...
Starting 5...
Starting 6...
Starting 7...
Starting 8...
Starting 9...
Starting 10...
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

However, sometimes the second thread may finish before the first, etc.,
but the order doesn't matter. What matters is that the script's
execution time just went from over 20 seconds to under two! However, as
you can see, I have to call '@thread.join'. I do this because if I
don't, the script will exit before all of the threads are done
executing, so holder is always an empty array. Am I right? Or is there
some other way to keep the main script from exiting until all the
threads are done? Is there anything else I'm doing wrong?

Thanks,
Michael Boutros

Joel VanderWerf · Jul 14, 2008

Michael said:
require 'enumerator'

holder = []

array = (1..10).to_a
puts array.inspect

array.each_slice(3) do |group|
  group.each do |number|
    @thread = Thread.new do
      puts "Starting #{number}...\n"
      sleep(5)
      holder << number
    end
  end
end

@thread.join

#join is definitely a good idea, because otherwise (as you observed) the
main thread will exit before the others have finished, but you are
overwriting the @thread variable on each iteration through the loop.

The usual idiom for this is something like:

threads = array.map { Thread.new {...} }
threads.each {|th| th.join}
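
A filled-in version of that idiom might look like the sketch below. The method name run_all, the shortened sleep, and the Mutex around the shared array are assumptions added for illustration; none of them come from the posts above.

```ruby
# Sketch: start one thread per item, keep every Thread object,
# then join all of them before reading the shared result.
# run_all and its delay parameter are hypothetical names.
def run_all(items, delay = 0.1)
  holder = []
  lock = Mutex.new # assumption: guard the shared array between threads

  threads = items.map do |number|
    Thread.new do
      sleep(delay) # stands in for the real work
      lock.synchronize { holder << number }
    end
  end

  # Wait for every thread, not just the last one created.
  threads.each { |th| th.join }
  holder
end

result = run_all((1..10).to_a)
puts result.sort.inspect # sorted, because completion order is not guaranteed
```

Because every Thread object is kept in the threads array, the main thread waits for all of them regardless of which one finishes first.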

Michael Boutros · Jul 14, 2008

Joel said:
#join is definitely a good idea, because otherwise (as you observed) the
main thread will exit before the others have finished, but you are
overwriting the @thread variable on each iteration through the loop.

The usual idiom for this is something like:

threads = array.map { Thread.new {...} }
threads.each {|th| th.join}

Joel,

Initially I did that on purpose, because I thought joining one thread
would be enough to keep them all running. Then I realized that some
might finish before others, so I changed the code to the approach
that you described.

Erik Veenstra · Jul 14, 2008

In plain Ruby, you might want to rewrite this in a more
functional style:

holder =
  array.collect do |number|
    Thread.new do
      puts "Starting #{number}...\n"
      sleep(5)
      number
    end
  end.collect do |thread|
    thread.value
  end

And, using ThreadLimiter [1,2], you can reduce it to:

require "threadlimiter"

holder =
  array.threaded_collect do |number|
    puts "Starting #{number}...\n"
    sleep(5)
    number
  end
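
Note that the each_slice(3) in the original post does not by itself cap the number of running threads, since nothing waits between groups. If the goal is to run at most three at a time, a rough plain-Ruby sketch could look like this. The helper name batched_collect is hypothetical, and this is not a description of how ThreadLimiter works internally.

```ruby
# Hypothetical helper: run the block for each item in its own thread,
# at most `limit` threads at a time, returning results in input order.
def batched_collect(items, limit, &block)
  items.each_slice(limit).flat_map do |group|
    threads = group.map do |item|
      Thread.new { block.call(item) }
    end
    # Thread#value joins the thread and returns the block's result,
    # so the next group starts only after this one has finished.
    threads.map { |thread| thread.value }
  end
end

holder = batched_collect((1..10).to_a, 3) do |number|
  sleep(0.1) # stands in for the real work
  number
end
puts holder.inspect
```

The trade-off of batching this way is that each group waits for its slowest member before the next group starts.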

gegroet,
Erik V. - http://www.erikveen.dds.nl/

[1] http://www.erikveen.dds.nl/threadlimiter/doc/index.html
[2] http://rubyforge.org/projects/threadlimiter/
