Array.sort problem

L

Larme

Dear all, I'm a newbie to ruby and today when I'm writing a simple
script to process some data, I found something I can't understand.

The data is stored in several column seperated by tab or space. I use
the following code to get the data (assuming the data comes from
standard input and all numbers are integer)

data=[]
counter = 0
while line = STDIN.gets
data[counter] = line.split
data[counter].map! {|str| str.to_i}
counter += 1
end

hence data is an array hold all the numbers in the ith line, data
[j] is the number on ith line and jth column. Then what I want the
script to do is sorting lines according to a specified column. I
thought the following code should work:

result = data.sort {|x, y| x[col] <=> y[col] }

where the col determine which column the script will sort according
to. However ruby raise a error saying:
"undefined method `<=>' for nil:NilClass (NoMethodError)
from ana.rb:16:in `sort'
from ana.rb:16
"

I have to write the code as

result = data.sort {|x, y| x[col].to_i <=> y[col].to_i }

to let the script run properly. I'm quite confused here. I think the
elements of array data are converted to integer when the code

data[counter].map! {|str| str.to_i}

finished. However why ruby still requires a explicit conversion when I
use the data.sort?
 
H

hemant

Dear all, I'm a newbie to ruby and today when I'm writing a simple
script to process some data, I found something I can't understand.

The data is stored in several column seperated by tab or space. I use
the following code to get the data (assuming the data comes from
standard input and all numbers are integer)

data=[]
counter = 0
while line = STDIN.gets
data[counter] = line.split
data[counter].map! {|str| str.to_i}
counter += 1
end

hence data is an array hold all the numbers in the ith line, data
[j] is the number on ith line and jth column. Then what I want the
script to do is sorting lines according to a specified column. I
thought the following code should work:

result = data.sort {|x, y| x[col] <=> y[col] }

where the col determine which column the script will sort according
to. However ruby raise a error saying:
"undefined method `<=>' for nil:NilClass (NoMethodError)
from ana.rb:16:in `sort'
from ana.rb:16
"

I have to write the code as

result = data.sort {|x, y| x[col].to_i <=> y[col].to_i }

to let the script run properly. I'm quite confused here. I think the
elements of array data are converted to integer when the code

data[counter].map! {|str| str.to_i}

finished. However why ruby still requires a explicit conversion when I
use the data.sort?


Yes, they are. You might be doing some typo or other crap. Following
program runs verbatim:

data=[]
counter = 0
while line = STDIN.gets
data[counter] = line.split
data[counter].map! {|str| str.to_i}
counter += 1
end

#p data.sort_by { |x| x[2] } #=> you can use sort_by as an alternative

p data.sort {|x,y| x[2] <=> y[2] } #=> This also work.
 
S

Sebastian Hungerecker

Larme said:
I have to write the code as

result = data.sort {|x, y| x[col].to_i <=> y[col].to_i }

to let the script run properly. I'm quite confused here. I think the
elements of array data are converted to integer when the code

data[counter].map! {|str| str.to_i}

finished. However why ruby still requires a explicit conversion when I
use the data.sort?

If you have an array arr with n elements which you all turn into integers,
arr with i>=n will still be nil. So if in your case not all rows have more
than col columns, that would explain the problem.


HTH,
Sebastian Hungerecker
 
R

Robert Klemme

Dear all, I'm a newbie to ruby and today when I'm writing a simple
script to process some data, I found something I can't understand.

The data is stored in several column seperated by tab or space. I use
the following code to get the data (assuming the data comes from
standard input and all numbers are integer)

data=[]
counter = 0
while line = STDIN.gets
data[counter] = line.split
data[counter].map! {|str| str.to_i}
counter += 1
end

hence data is an array hold all the numbers in the ith line, data
[j] is the number on ith line and jth column. Then what I want the
script to do is sorting lines according to a specified column. I
thought the following code should work:

result = data.sort {|x, y| x[col] <=> y[col] }

where the col determine which column the script will sort according
to. However ruby raise a error saying:
"undefined method `<=>' for nil:NilClass (NoMethodError)
from ana.rb:16:in `sort'
from ana.rb:16
"

I have to write the code as

result = data.sort {|x, y| x[col].to_i <=> y[col].to_i }

to let the script run properly. I'm quite confused here. I think the
elements of array data are converted to integer when the code

data[counter].map! {|str| str.to_i}

finished. However why ruby still requires a explicit conversion when I
use the data.sort?


You probably have lines with differing number of entries and thus
sometimes x[col] just returns nil.

You could do something like:

data.sort_by {|x| x[col].to_i}
data.sort_by {|x| x[col] || 0}

If you want to be sure that you extract only integers you could do

l.scan(/\d+/).map {|x| x.to_i}

instead of the split.

Lots of options...

Kind regards

robert
 
L

Larme

Dear all, I'm a newbie to ruby and today when I'm writing a simple
script to process some data, I found something I can't understand.
The data is stored in several column seperated by tab or space. I use
the following code to get the data (assuming the data comes from
standard input and all numbers are integer)
data=[]
counter = 0
while line = STDIN.gets
data[counter] = line.split
data[counter].map! {|str| str.to_i}
counter += 1
end
hence data is an array hold all the numbers in the ith line, data
[j] is the number on ith line and jth column. Then what I want the
script to do is sorting lines according to a specified column. I
thought the following code should work:

result = data.sort {|x, y| x[col] <=> y[col] }
where the col determine which column the script will sort according
to. However ruby raise a error saying:
"undefined method `<=>' for nil:NilClass (NoMethodError)
from ana.rb:16:in `sort'
from ana.rb:16
"
I have to write the code as
result = data.sort {|x, y| x[col].to_i <=> y[col].to_i }
to let the script run properly. I'm quite confused here. I think the
elements of array data are converted to integer when the code
data[counter].map! {|str| str.to_i}
finished. However why ruby still requires a explicit conversion when I
use the data.sort?

You probably have lines with differing number of entries and thus
sometimes x[col] just returns nil.

You could do something like:

data.sort_by {|x| x[col].to_i}
data.sort_by {|x| x[col] || 0}

If you want to be sure that you extract only integers you could do

l.scan(/\d+/).map {|x| x.to_i}

instead of the split.

Lots of options...

Kind regards

robert



Thank all of you. Yes, the problem is that the column number is not
fixed -- it should be, but my friend who prepare the data made some
thing wrong.
 
A

Ari Brown

On Apr 21, 2007, at 2:20 PM, Larme wrote:
1
2 data=[]
3 counter = 0
4 while line = STDIN.gets
5 data[counter] = line.split
6 data[counter].map! {|str| str.to_i}
7 counter += 1
8 end

Ok, so I'm a noob trying to learn something here...

Shouldn't line 4 be presented with a double = (==) since you're
trying do something while line == STDIN.gets?

Also, does STDIN.gets need to be stated in a previous line? Or will
it be prompted from within the 'while' statement.

<snip>
--------------------------------------------|
If you're not living on the edge,
then you're just wasting space.
 
P

Philip M. Gollucci

4 while line = STDIN.gets
Ok, so I'm a noob trying to learn something here...

Shouldn't line 4 be presented with a double = (==) since you're trying
do something while line == STDIN.gets?
No, the assignment of line = STDIN.gets happens first which reads up to
a '\n' including it. While then loops while this value is true.

When there is no more inmput, line will be assigned nothing which is
false and the loop ends.

This works identically to other languages like perl/python/php


--
------------------------------------------------------------------------
Philip M. Gollucci ([email protected]) 323.219.4708
Consultant / http://p6m7g8.net/Resume/resume.shtml
Senior Software Engineer - TicketMaster - http://ticketmaster.com
1024D/EC88A0BF 0DE5 C55C 6BF3 B235 2DAB B89E 1324 9B4F EC88 A0BF

Work like you don't need the money,
love like you'll never get hurt,
and dance like nobody's watching.
 
A

Ari Brown

No, the assignment of line = STDIN.gets happens first which reads
up to a '\n' including it. While then loops while this value is true.

Oh, so it's not checking whether line == STDIN.gets (immediately),
but rather assigning STDIN.gets to line, and then checking whether
that's true?

Nice, I learned something!

---------------------------------------------------------------|
~Ari
"I don't suffer from insanity. I enjoy every minute of it" --1337est
man alive
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,781
Messages
2,569,615
Members
45,296
Latest member
HeikeHolli

Latest Threads

Top