Standard Deviation One-liner

Billy Mays · Jun 3, 2011

I'm trying to shorten a one-liner I have for calculating the standard
deviation of a list of numbers. I have something so far, but I was
wondering if it could be made any shorter (without imports).

Here's my function:

a=lambda d

sum((x-1.*sum(d)/len(d))**2 for x in d)/(1.*(len(d)-1)))**.5

The functions is invoked as follows:

a([1,2,3,4])

Click to expand...

Click to expand...

1.2909944487358056

Alain Ketterlin · Jun 3, 2011

Billy Mays said:
I'm trying to shorten a one-liner I have for calculating the standard
deviation of a list of numbers. I have something so far, but I was
wondering if it could be made any shorter (without imports).

a=lambda dsum((x-1.*sum(d)/len(d))**2 for x in d)/(1.*(len(d)-1)))**.5

You should make it two half-liners, because this one repeatedly computes
sum(d). I would suggest:

aux = lambda s1,s2,n: (s2 - s1*s1/n)/(n-1)
sv = lambda d: aux(sum(d),sum(x*x for x in d),len(d))

(after some algebra). Completely untested, assumes data come in as
floats. You get the idea.

-- Alain.

Alain Ketterlin · Jun 3, 2011

Alain Ketterlin said:
aux = lambda s1,s2,n: (s2 - s1*s1/n)/(n-1)
sv = lambda d: aux(sum(d),sum(x*x for x in d),len(d))

Err, sorry, the final square root is missing.

-- Alain.

Raymond Hettinger · Jun 3, 2011

I'm trying to shorten a one-liner I have for calculating the standard
deviation of a list of numbers. I have something so far, but I was
wondering if it could be made any shorter (without imports).

Here's my function:

a=lambda dsum((x-1.*sum(d)/len(d))**2 for x in d)/(1.*(len(d)-1)))**.5

The functions is invoked as follows:

>>> a([1,2,3,4])
1.2909944487358056

Besides trying to do it one line, it is also interesting to write an
one-pass version with incremental results:

http://mathcentral.uregina.ca/QQ/database/QQ.09.06/h/murtaza2.html

Another interesting avenue to is aim for highest possible accuracy.
Consider using math.fsum() to avoid rounding errors in the summation
of large numbers of nearly equal values.

Raymond

Steven D'Aprano · Jun 5, 2011

I'm trying to shorten a one-liner I have for calculating the standard
deviation of a list of numbers. Â I have something so far, but I was
wondering if it could be made any shorter (without imports).

Here's my function:

a=lambda dsum((x-1.*sum(d)/len(d))**2 for x in
d)/(1.*(len(d)-1)))**.5

The functions is invoked as follows:

Â >>> a([1,2,3,4])
1.2909944487358056

Click to expand...

Besides trying to do it one line, it is also interesting to write an
one-pass version with incremental results:

http://mathcentral.uregina.ca/QQ/database/QQ.09.06/h/murtaza2.html

I'm not convinced that's a good approach, although I haven't tried it. In
general, the so-called "computational formula" for variance is optimized
for pencil and paper calculations of small amounts of data, but is
numerically unstable.

See

http://www.johndcook.com/blog/2008/09/26/comparing-three-methods-of-
computing-standard-deviation/

http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance

I'll also take this opportunity to plug my experimental stats package,
which includes coroutine-based running statistics, including standard
deviation:
1.4999999999999998

The non-running calculation of stdev gives this:

stats.stdev([3, 2, 5, 5])

Click to expand...

Click to expand...

1.5

http://pypi.python.org/pypi/stats/
http://code.google.com/p/pycalcstats/

Be warned that the version on Google Code is unstable, and currently
broken.

Feedback is welcome!

Ethan Furman · Jun 5, 2011

Steven said:
I'm trying to shorten a one-liner I have for calculating the standard
deviation of a list of numbers. I have something so far, but I was
wondering if it could be made any shorter (without imports).

Here's my function:

a=lambda dsum((x-1.*sum(d)/len(d))**2 for x in
d)/(1.*(len(d)-1)))**.5

The functions is invoked as follows:

a([1,2,3,4])
1.2909944487358056

Click to expand...

Besides trying to do it one line, it is also interesting to write an
one-pass version with incremental results:

http://mathcentral.uregina.ca/QQ/database/QQ.09.06/h/murtaza2.html

Click to expand...

I'm not convinced that's a good approach, although I haven't tried it. In
general, the so-called "computational formula" for variance is optimized
for pencil and paper calculations of small amounts of data, but is
numerically unstable.

See

http://www.johndcook.com/blog/2008/09/26/comparing-three-methods-of-
computing-standard-deviation/

http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance

I'll also take this opportunity to plug my experimental stats package,
which includes coroutine-based running statistics, including standard
deviation:

--> s = stats.co.stdev()
--> s.send(3)
nan

Look! A NaN in the wild!

~Ethan~

Trouble with prediction code, for the life of me I can't figure out why it isnt running properly. Help would be appreciated.	0	Jul 8, 2023
C program: memory leak/ segmentation fault/ memory limit exceeded	0	Nov 12, 2022
I Need Fix In Code	1	Apr 12, 2023
? get negative from prod(x) when x is positive integers	0	Jun 28, 2013
standard deviation	19	Jun 5, 2011
Flatten a two-level list --> one liner?	3	Mar 8, 2007
Perl-Python-a-Day: one-liner loop Functional Style	0	Oct 20, 2005
Decreasing the "standard deviation" of Java	3	May 25, 2006

Standard Deviation One-liner

Billy Mays

Alain Ketterlin

Alain Ketterlin

Raymond Hettinger

Steven D'Aprano

Ethan Furman

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads