Efficient Running Median

Raymond Hettinger · Jan 15, 2010

I've updated the running median recipe to use a new algorithm with O
(log n) updates for a large sliding window traversing a data stream.
See http://code.activestate.com/recipes/576930/

The engine is a new collection class called IndexableSkiplist. It is
like a regular skiplist as detailed at http://en.wikipedia.org/wiki/Skip_list,
but it also records the width of each link field. That allows values
to be retrieved by their position index in O(log n) time.

The key operations are:
O(log n) -- sl.insert(value) # add a value to the skiplist,
maintaining sorted order
O(log n) -- s.remove(value) # remove a value from the skiplist,
maintaining sorted order
O(log n) -- s # retrieve the i-th item
O(n) -- list(sl) # iterate over the skiplist in sorted
order
O(1) -- len(sl) # number of items in the skiplist

The performance of an IndexableSkiplist is similar to a B+tree but the
implementation in pure python is much simpler.

Raymond

Aahz · Jan 22, 2010

The performance of an IndexableSkiplist is similar to a B+tree but the
implementation in pure python is much simpler.

Nice! Can you summarize why IndexableSkipList is simpler?

Bearophile · Jan 23, 2010

Very nice. I have added a comment at the bottom, using a circular
queue instead of a deque may be faster.

Bye,
bearophile

efficient running median	26	Oct 13, 2009
Best way to insert sorted in a list	10	Jun 17, 2011
program not working properly. book example. program included.	6	Sep 27, 2013
Median of values in a std::map	5	Aug 30, 2006
Builtin classes list, set, dict reimplemented via B-trees	1	Sep 14, 2005
Custom Minecraft launcher client error; I think regarding java	0	Sep 7, 2022
Proposed implementation for an Ordered Dictionary	22	Feb 26, 2009
order in map	3	Jan 3, 2007

Efficient Running Median

Raymond Hettinger

Aahz

Bearophile

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads