Hexadecimal list conversion

Neil Webster · Dec 20, 2007

Hi All.

I have a list which is a line from a file:
['\x003\x008\x001\x004\x007\x005\x00.\x005\x000\x002\x005\x009\x009\x00',
'\x002\x001\x003\x006\x002\x002\x00.\x001\x007\x004\x002\x008\x002\x00']

This should be in the format:
['381475.502599', '213622.174282']

I've tried a few options using replace (replacing "\x00" with "") and
trying to convert from hexadecimal to decimal.

But nothing has worked. Can anybody give any tips to help?

Thanks.

Paul Hankin · Dec 20, 2007

Is your file utf-16 (that would explain why your file has \x00 in
between every character)? If so, use codecs.open to read it, and you
won't get the \x00's (you'll get a unicode string).

Or you can remove them using replace:

a = a.replace('\x00', '')

HTH

Andreas Tawn · Dec 20, 2007

Something like:

line = ['\x003\x008\x001\x004\x007\x005\x00.\x005\x000\x002\x005\x009\x009\x00',
        '\x002\x001\x003\x006\x002\x002\x00.\x001\x007\x004\x002\x008\x002\x00']

result = [''.join(x.split('\x00')) for x in line]

Cheers,

Drea

Gabriel Genellina · Dec 20, 2007

The replace works:

py> for item in L:
...     print item.replace('\x00','')
...
381475.502599
213622.174282

If you got that from a file, I bet you read it using the wrong encoding.
Try opening the file using codecs.open("filename", "rb",
encoding="utf-16-be") instead of plain open. When you read it, you'll get
unicode objects instead of strings, but with the right contents. If you
wish, you can convert to strings using
line_read.encode(your_system_encoding); if all your data is numeric, the
encoding used is irrelevant and can be omitted.
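
The suggestion above can be tried on a small test file. This is only a sketch in Python 3, where the built-in open accepts an encoding argument and plays the role that codecs.open plays in the advice above. The sample line, the comma separator, and the helper name read_fields are assumptions made for illustration; the original file was never shown in the thread.

```python
import os
import tempfile


def read_fields(path, encoding="utf-16-be", sep=","):
    """Decode the file while reading, then split each line into fields.

    Hypothetical helper: the encoding and separator are guesses about the
    original file, not facts taken from it.
    """
    with open(path, encoding=encoding) as f:
        return [line.rstrip("\n").split(sep) for line in f]


# Build a throwaway UTF-16 big-endian file (no byte order mark).
sample = "381475.502599,213622.174282\n"
fd, path = tempfile.mkstemp()
with os.fdopen(fd, "wb") as f:
    f.write(sample.encode("utf-16-be"))

rows = read_fields(path)
os.remove(path)
print(rows)
```

Because the decoding happens before any splitting, no "\x00" bytes are left over to strip afterwards.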

Mark T · Dec 20, 2007

There is an odd number of bytes in each string. Each begins and ends with
\x00, so it doesn't look like utf-16-be. But replace works:

L=['\x003\x008\x001\x004\x007\x005\x00.\x005\x000\x002\x005\x009\x009\x00','\x002\x001\x003\x006\x002\x002\x00.\x001\x007\x004\x002\x008\x002\x00']
[s.replace('\x00','') for s in L]

['381475.502599', '213622.174282']

-Mark Tolonen
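
The byte count can be checked directly. This is a sketch in Python 3, where a bytes literal stands in for the Python 2 str shown in the thread. It only inspects the first item as posted and assumes nothing about the original file.

```python
# First item from the original post, written as a Python 3 bytes literal.
item = b'\x003\x008\x001\x004\x007\x005\x00.\x005\x000\x002\x005\x009\x009\x00'

# 13 visible characters, each preceded by \x00, plus one trailing \x00.
length = len(item)

# UTF-16 uses two bytes per code unit, so an odd-length string
# cannot be decoded as it stands.
try:
    item.decode("utf-16-be")
    decodes_as_is = True
except UnicodeDecodeError:
    decodes_as_is = False

# Dropping the final \x00 leaves an even number of bytes that does decode.
trimmed = item[:-1].decode("utf-16-be")

print(length, decodes_as_is, trimmed)
```

So the item as posted is not valid UTF-16-BE, but it is exactly one stray byte away from being so.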

Peter Otten · Dec 20, 2007

Mark T said:
There is an odd number of bytes in each string. Each begins and ends
with \x00, so it doesn't look like utf-16-be.

I think Gabriel is right. The OP probably butchered the original structure
with

open(filename).read().split("\n")

Peter

John Machin · Dec 20, 2007

Peter Otten said:
I think Gabriel is right. The OP probably butchered the original structure
with

open(filename).read().split("\n")

Or he's read the file "normally" and then done
line = lineZAP
where ZAP is one of [:-1], .rstrip(), .rstrip("\n"), etc

However that accounts only for the rightmost trailing \x00. Looks like
each line has been chainsawed with .split(",") or whatever the
original field separator was.

If Gabriel's instructions don't "work" for the OP, the OP should show
us an unambiguous representation of the first few bytes of the
original file, instead of leaving it to guesswork:

print repr(open("the_file", "rb").read()[:200])
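
The byte-level splitting described in this post can be reproduced. This is a sketch in Python 3 under two assumptions that the thread never confirms: that the original line was UTF-16 big-endian text, and that the field separator was a comma.

```python
# Hypothetical original line; both the content layout and the comma
# separator are assumptions for illustration.
text = "381475.502599,213622.174282\n"
raw = text.encode("utf-16-be")

# Splitting the raw bytes on one-byte separators cuts every two-byte
# character in half: the \x00 that belongs to "\n" or "," stays attached
# to the end of the preceding field.
lines = raw.split(b"\n")
fields = lines[0].split(b",")
print(fields)

# Decoding first and splitting afterwards keeps the characters intact.
good = raw.decode("utf-16-be").rstrip("\n").split(",")
print(good)
```

Under these assumptions, fields matches the two items in the original post byte for byte, including the leading and trailing \x00 and the odd lengths.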

Gabriel Genellina · Dec 20, 2007

Peter Otten said:
I think Gabriel is right. The OP probably butchered the original structure
with

open(filename).read().split("\n")

Sure! I take bets on this too.
