How do I iterate over items in a dict grouped by N number of elements?

Noah

What is the fastest way to select N items at a time from a dictionary?
I'm iterating over a dictionary of many thousands of items.
I want to operate on only 100 items at a time.
I want to avoid copying items using any sort of slicing.
Does itertools copy items?

This works, but is ugly:
from itertools import *
D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9, 'j':10}
N = 3
for G in izip(*[chain(D.items(), repeat(None, N-1))]*N):
    print G
(('a', 1), ('c', 3), ('b', 2))
(('e', 5), ('d', 4), ('g', 7))
(('f', 6), ('i', 9), ('h', 8))
(('j', 10), None, None)

I'd prefer the last group didn't include the None padding and instead
just returned (('j', 10),), but this isn't a huge deal.
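One way to get that behaviour (my own sketch, not from the thread) is to filter the padding out of each group before using it; this assumes None never occurs as a real item, which holds here since every item is a (key, value) tuple:

from itertools import izip, chain, repeat

D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9, 'j':10}
N = 3

for G in izip(*[chain(D.items(), repeat(None, N-1))]*N):
    # drop the None padding so the final group contains only real items
    G = tuple(item for item in G if item is not None)
    print G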

This works and is clear, but it makes copies of items:
ii = D.items()
for i in range(0, len(ii), N):
    print ii[i:i+N]
[('a', 1), ('c', 3), ('b', 2)]
[('e', 5), ('d', 4), ('g', 7)]
[('f', 6), ('i', 9), ('h', 8)]
[('j', 10)]
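One point worth separating out (my own note, not from the thread): slicing ii copies list references, not the (key, value) tuples themselves, which is easy to verify:

ii = D.items()
chunk = ii[0:N]
print chunk is ii        # False: the slice is a new list object
print chunk[0] is ii[0]  # True: it holds the very same tuple, no per-item copy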
 
Paul Rubin

Noah said:
What is the fastest way to select N items at a time from a dictionary?
I'm iterating over a dictionary of many thousands of items.
I want to operate on only 100 items at a time.
I want to avoid copying items using any sort of slicing.

I'd do something like (untested):

import itertools

def groups(seq, n):
    # lazily pull successive batches of up to n items from the iterator seq
    while True:
        s = list(itertools.islice(seq, n))
        if not s:
            return
        yield s

items = d.iteritems()   # d is the large dictionary being processed
for g in groups(items, 100):
    operate_on(g)       # operate_on stands for whatever work is done per batch
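Run against the small example dictionary from the question (my own quick check, not part of the original reply), the last batch simply comes out shorter instead of being padded with None:

D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9, 'j':10}
for g in groups(D.iteritems(), 3):
    print g   # three lists of three pairs, then a final one-pair list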
Does itertools copy items?

I don't understand this question.
 
attn.steven.kuo

What is the fastest way to select N items at a time from a dictionary?
I'm iterating over a dictionary of many thousands of items.
I want to operate on only 100 items at a time.
I want to avoid copying items using any sort of slicing.
Does itertools copy items?

This works, but is ugly:
from itertools import *
D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9, 'j':10}
N = 3
for G in izip(*[chain(D.items(), repeat(None, N-1))]*N):
    print G
(('a', 1), ('c', 3), ('b', 2))
(('e', 5), ('d', 4), ('g', 7))
(('f', 6), ('i', 9), ('h', 8))
(('j', 10), None, None)

I'd prefer the last group didn't include the None padding and instead
just returned (('j', 10),), but this isn't a huge deal.

This works and is clear, but it makes copies of items:

ii = D.items()
for i in range(0, len(ii), N):
    print ii[i:i+N]
[('a', 1), ('c', 3), ('b', 2)]
[('e', 5), ('d', 4), ('g', 7)]
[('f', 6), ('i', 9), ('h', 8)]
[('j', 10)]


groupby?

import itertools

D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9,
'j':10}
N = 3

it = itertools.groupby(enumerate(D.items()), lambda t: int(t[0]/N))

for each in it:
    print tuple(t[1] for t in each[1])
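A slightly expanded version of the same idea (my own sketch, not part of the original post), with the key and group named explicitly:

import itertools

D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9, 'j':10}
N = 3

# enumerate() numbers the items; index // N is constant within each batch of N
# consecutive items, so groupby() yields one group per batch
for batch_no, group in itertools.groupby(enumerate(D.items()), lambda t: t[0] // N):
    print batch_no, tuple(pair for i, pair in group)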
 
Arnaud Delobelle

What is the fastest way to select N items at a time from a dictionary?
I'm iterating over a dictionary of many thousands of items.
I want to operate on only 100 items at a time.
I want to avoid copying items using any sort of slicing.
Does itertools copy items?

This works, but is ugly:
from itertools import *
D = {'a':1, 'b':2, 'c':3, 'd':4, 'e':5, 'f':6, 'g':7, 'h':8, 'i':9, 'j':10}
N = 3
for G in izip(*[chain(D.items(), repeat(None, N-1))]*N):

This solution matches exactly the idiom proposed in the itertools
documentation. The following is an extract from
http://docs.python.org/lib/itertools-functions.html.

Note, the left-to-right evaluation order of the iterables is
guaranteed. This makes possible an idiom for clustering a data series
into n-length groups using "izip(*[iter(s)]*n)". For data that doesn't
fit n-length groups exactly, the last tuple can be pre-padded with
fill values using "izip(*[chain(s, [None]*(n-1))]*n)".
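A quick illustration of the two documented forms (my own example, on a plain list of numbers rather than the dictionary): the first silently drops items that don't fill a complete group, the second pads the final group with None:

from itertools import izip, chain

s = range(10)
N = 3

print list(izip(*[iter(s)] * N))                     # leftover item 9 is dropped
print list(izip(*[chain(s, [None] * (N - 1))] * N))  # last tuple is (9, None, None)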
 
