parsing tab separated data efficiently into numpy/pylab arrays

per · Mar 13, 2009

hi all,

what's the most efficient / preferred python way of parsing tab
separated data into arrays? for example, if i have a file containing
two columns, one corresponding to names and the other to numbers:

col1 \t col2
joe \t 12.3
jane \t 155.0

i'd like to parse into an array() such that i can do: mydata[:, 0] and
mydata[:, 1] to easily access all the columns.

right now i can iterate through the file, parse it manually using the
split('\t') command and construct a list out of it, then convert it to
arrays. but there must be a better way?

also, my first column is just a name, and so it is variable in length
-- is there still a way to store it as an array so i can access:
mydata[:, 0] to get all the names (as a list)?

thank you.

Matteo · Mar 13, 2009

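I think you can do it through:

numpy.fromfile()
ndarray.reshape()

but you should look up the reference for those. Note that fromfile()
in text mode only parses numbers, so the name column would need to be
handled separately.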
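
A minimal sketch of the fromfile/reshape idea, assuming the file holds only numbers (no header row and no name column). The helper name load_numeric_table and the demo file contents are made up for illustration; they are not from the original post.

```python
import os
import tempfile

import numpy as np


def load_numeric_table(path, n_cols):
    """Read whitespace/tab separated numbers and reshape into rows of n_cols."""
    # A non-empty sep puts fromfile in text mode. A space in sep matches any
    # run of whitespace, so tabs and newlines both act as separators.
    flat = np.fromfile(path, dtype=float, sep=" ")
    # fromfile returns a flat 1-D array; reshape restores the column layout.
    return flat.reshape(-1, n_cols)


# Hypothetical numeric-only file, written here just for the demo.
with tempfile.NamedTemporaryFile("w", suffix=".tsv", delete=False) as handle:
    handle.write("12.3\t1.0\n155.0\t2.0\n")
    demo_path = handle.name

numeric = load_numeric_table(demo_path, n_cols=2)
os.remove(demo_path)

print(numeric[:, 0])  # first column
print(numeric[:, 1])  # second column
```

This does not cover the mixed name/number file from the question, since text-mode fromfile cannot parse strings.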

mapb81 · Mar 24, 2009

You could take a look at (or just use) the very handy csv2rec function
in matplotlib.mlab, which creates numpy record (structured) arrays.

Marco
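
For readers on a recent matplotlib, where csv2rec is no longer available as far as I know, here is a rough sketch of the same idea using numpy.genfromtxt instead (a swapped-in function, not csv2rec itself). The header names "name" and "value" and the helper load_table are assumptions for illustration.

```python
import io

import numpy as np

# Hypothetical sample mirroring the file in the question, with simple headers.
SAMPLE = "name\tvalue\njoe\t12.3\njane\t155.0\n"


def load_table(source):
    """Parse tab separated text with a header row into a structured array."""
    return np.genfromtxt(
        source,
        delimiter="\t",
        names=True,        # take field names from the first row
        dtype=None,        # infer a type per column (string, float, ...)
        encoding="utf-8",
    )


mydata = load_table(io.StringIO(SAMPLE))

names = mydata["name"].tolist()  # all the names, as a plain list
values = mydata["value"]         # numeric column as a float array

print(names)
print(values)
```

A structured array is indexed by field name, so mydata["name"] takes the place of mydata[:, 0]. Passing a file path instead of the StringIO object should work the same way.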
