Returning the positions of a list that are non-zero

Benjamin Goudey · Jul 9, 2008

I have a very large list of integers representing data needed for a
histogram that I'm going to plot using pylab. However, most of these
values (85%-95%) are zero and I would like to remove them to reduce
the amount of memory I'm using and save time when it comes to plotting
the data. To do this, I'm trying to find the best way to remove all of
the zero values and produce a list of indices of where the non-zero
values used to be.

For example, if my original list is [0,0,1,2,1,0,0] I would like to
produce the lists [1,2,1] (the non zero values) and [2,3,4] (indices
of where the non-zero values used to be). Removing non-zero values is
very easy but determining the indicies is where I'm having difficulty.

Thanks in advance for any help

Rajanikanth Jammalamadaka · Jul 9, 2008

Try this:

li=[0,0,1,2,1,0,0]
li [0, 0, 1, 2, 1, 0, 0]
[i for i in range(len(li)) if li != 0]

Click to expand...

Click to expand...

[2, 3, 4]

Cheers,

Raj

I have a very large list of integers representing data needed for a
histogram that I'm going to plot using pylab. However, most of these
values (85%-95%) are zero and I would like to remove them to reduce
the amount of memory I'm using and save time when it comes to plotting
the data. To do this, I'm trying to find the best way to remove all of
the zero values and produce a list of indices of where the non-zero
values used to be.

For example, if my original list is [0,0,1,2,1,0,0] I would like to
produce the lists [1,2,1] (the non zero values) and [2,3,4] (indices
of where the non-zero values used to be). Removing non-zero values is
very easy but determining the indicies is where I'm having difficulty.

Thanks in advance for any help

Click to expand...

--
"For him who has conquered the mind, the mind is the best of friends;
but for one who has failed to do so, his very mind will be the
greatest enemy."

Rajanikanth

Luis Zarrabeitia · Jul 9, 2008

This could work:

l = [0,0,1,2,1,0,0]
indexes, values = zip(*((index,value) for index,value in enumerate(l) if value
!= 0))

But I guess it would be a little less cryptic (and maybe a lot more efficient)
if there were an unzip function instead of using the zip(*sequence) trick..

I think a more readable way would be:

indexes = [index for index,value in enumerate(l) if value != 0]
values = [value for value in l if value != 0]

Cheers.

Andrii V. Mishkovskyi · Jul 9, 2008

2008/7/9 Benjamin Goudey said:
I have a very large list of integers representing data needed for a
histogram that I'm going to plot using pylab. However, most of these
values (85%-95%) are zero and I would like to remove them to reduce
the amount of memory I'm using and save time when it comes to plotting
the data. To do this, I'm trying to find the best way to remove all of
the zero values and produce a list of indices of where the non-zero
values used to be.

For example, if my original list is [0,0,1,2,1,0,0] I would like to
produce the lists [1,2,1] (the non zero values) and [2,3,4] (indices
of where the non-zero values used to be). Removing non-zero values is
very easy but determining the indicies is where I'm having difficulty.

Thanks in advance for any help

l = [0, 0, 1, 2, 1, 0, 0]
zip(*[(item, index) for (index, item) in enumerate(l) if item != 0])

Click to expand...

Click to expand...

[(1, 2, 1), (2, 3, 4)]

Chris · Jul 9, 2008

Try this:

li=[0,0,1,2,1,0,0]
li

Click to expand...

Click to expand...

[0, 0, 1, 2, 1, 0, 0]>>> [i for i in range(len(li)) if li != 0]

[2, 3, 4]

Cheers,

Raj

I have a very large list of integers representing data needed for a
histogram that I'm going to plot using pylab. However, most of these
values (85%-95%) are zero and I would like to remove them to reduce
the amount of memory I'm using and save time when it comes to plotting
the data. To do this, I'm trying to find the best way to remove all of
the zero values and produce a list of indices of where the non-zero
values used to be.

Click to expand...

For example, if my original list is [0,0,1,2,1,0,0] I would like to
produce the lists [1,2,1] (the non zero values) and [2,3,4] (indices
of where the non-zero values used to be). Removing non-zero values is
very easy but determining the indicies is where I'm having difficulty.

Click to expand...

Thanks in advance for any help

Click to expand...

--
"For him who has conquered the mind, the mind is the best of friends;
but for one who has failed to do so, his very mind will be the
greatest enemy."

Rajanikanth

That's a waste

li=[0,0,1,2,1,0,0]
[i for i in li if i]

Click to expand...

Click to expand...

Click to expand...

That's all you need.

Chris · Jul 9, 2008

Try this:

li=[0,0,1,2,1,0,0]
li

Click to expand...

Click to expand...

[0, 0, 1, 2, 1, 0, 0]>>> [i for i in range(len(li)) if li != 0]

[2, 3, 4]

Cheers,

Raj

I have a very large list of integers representing data needed for a
histogram that I'm going to plot using pylab. However, most of these
values (85%-95%) are zero and I would like to remove them to reduce
the amount of memory I'm using and save time when it comes to plotting
the data. To do this, I'm trying to find the best way to remove all of
the zero values and produce a list of indices of where the non-zero
values used to be.

Click to expand...

For example, if my original list is [0,0,1,2,1,0,0] I would like to
produce the lists [1,2,1] (the non zero values) and [2,3,4] (indices
of where the non-zero values used to be). Removing non-zero values is
very easy but determining the indicies is where I'm having difficulty.

Click to expand...

Thanks in advance for any help

Click to expand...

--
"For him who has conquered the mind, the mind is the best of friends;
but for one who has failed to do so, his very mind will be the
greatest enemy."

Rajanikanth

Whoops, misread the question

li =[0,0,1,2,1,0,0]
[(index,data) for index,data in enumerate(li) if data]

Paul McGuire · Jul 9, 2008

I have a very large list of integers representing data needed for a
histogram that I'm going to plot using pylab. However, most of these
values (85%-95%) are zero and I would like to remove them to reduce
the amount of memory I'm using and save time when it comes to plotting
the data. To do this, I'm trying to find the best way to remove all of
the zero values and produce a list of indices of where the non-zero
values used to be.

For example, if my original list is [0,0,1,2,1,0,0] I would like to
produce the lists [1,2,1] (the non zero values) and [2,3,4] (indices
of where the non-zero values used to be). Removing non-zero values is
very easy but determining the indicies is where I'm having difficulty.

sparse_data = [0, 0, 1, 2, 1, 0, 0]
values,locns = zip(*[ (x,i) for i,x in enumerate(sparse_data) if x ])
print values (1, 2, 1)
print locns (2, 3, 4)

Click to expand...

Click to expand...

-- Paul

How to only get a list of the names of the non-directory files incurrent directory ('.')?	8	Nov 6, 2012
Trying to build a SARIMAX model to forecast the S&P500 trend	0	Nov 5, 2023
Fibonacci: returning a selection of the series	6	Aug 29, 2010
make sublists of a list broken at nth certain list items	2	Jul 8, 2013
Plotting the integer-and-fraction remainder of a function valuemodulo 360	2	Apr 10, 2014
Iterate through a list of tuples for processing	0	Sep 20, 2013
help on Implementing a list of dicts with no data pattern	12	May 9, 2013
Putting the loop inside of loop properly	1	Mar 1, 2013

Returning the positions of a list that are non-zero

Benjamin Goudey

Rajanikanth Jammalamadaka

Luis Zarrabeitia

Andrii V. Mishkovskyi

Chris

Chris

Paul McGuire

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads