String manipulations

Lorn · May 28, 2005

I'm trying to work on a dataset that has it's primary numbers saved as
floats in string format. I'd like to work with them as integers with an
implied decimal to the hundredth. The problem is that the current
precision is variable. For instance, some numbers have 4 decimal places
while others have 2, etc. (10.7435 vs 1074.35)... all numbers are of
fixed length.

I have some ideas of how to do this, but I'm wondering if there's a
better way. My current way is to brute force search where the decimal
is by slicing and then cutoff the extraneous numbers, however, it would
be nice to stay away from a bunch of if then's.

Does anyone have any ideas on how to do this more efficiently?

Many Thanks,
Lorn

Philippe C. Martin · May 28, 2005

Multiply them by 10000 ?

Lorn · May 28, 2005

Yes, that would get rid of the decimals... but it wouldn't get rid of
the extraneous precision. Unfortunately, the precision out to the ten
thousandth is noise... I don't need to round it either as the numbers
are artifacts of an integer to float conversion. Basically, I need to
know how many decimal places there are and then make the necessary
deletions before I can normalize by adding zeros, multiplying, etc.

Thanks for your suggestion, though.

John Roth · May 28, 2005

Lorn said:
I'm trying to work on a dataset that has its primary numbers saved as
floats in string format. I'd like to work with them as integers with an
implied decimal to the hundredth. The problem is that the current
precision is variable. For instance, some numbers have 4 decimal places
while others have 2, etc. (10.7435 vs 1074.35)... all numbers are of
fixed length.

I have some ideas of how to do this, but I'm wondering if there's a
better way. My current way is to brute force search where the decimal
is by slicing and then cutoff the extraneous numbers, however, it would
be nice to stay away from a bunch of if then's.

Does anyone have any ideas on how to do this more efficiently?

If you can live with a small possibility of error, then:

int(float(numIn) * 100.0)

should do the trick.

If you can't, and the numbers are guaranteed to have a decimal point,
this (untested) could do what you want:

aList = numIn.split(".")
result int(aList[0]) * 100 + int(aList[1][:2])

HTH

John Roth

Elliot Temple · May 28, 2005

Yes, that would get rid of the decimals... but it wouldn't get rid of
the extraneous precision. Unfortunately, the precision out to the ten
thousandth is noise... I don't need to round it either as the numbers
are artifacts of an integer to float conversion. Basically, I need to
know how many decimal places there are and then make the necessary
deletions before I can normalize by adding zeros, multiplying, etc.

Thanks for your suggestion, though.

for s in numbers:
decimal_index = s.find('.')
decimal_places = len(s) - decimal_index - 1

Anything wrong with this? (it will mess up if there isn't a decimal
but you can fix that if you want)

-- Elliot Temple
http://www.curi.us/

Lorn · May 29, 2005

Thank you Elliot, this solution is the one I was trying to come up
with. Thank you for your help and thank you to everyone for their
suggestions.

Best regards,
Lorn

Significant digits in a float?	65	Apr 28, 2014
New python module to simulate arbitrary fixed and infinite precisionbinary floating point	2	Aug 10, 2008
Very simple parser... not for me	36	Jan 27, 2014
string differences; max irritation	6	Jan 24, 2008
Rough draft: Proposed format specifier for a thousands separator	39	Mar 12, 2009
A Revised Rational Proposal	21	Dec 26, 2004
attribute access and indirection	0	Oct 22, 2009
Decimal to Binary	22	Jun 30, 2010

String manipulations

Lorn

Philippe C. Martin

Lorn

John Roth

Elliot Temple

Lorn

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads