parse a csv file into a text file

Dave Angel · Feb 6, 2014

Oops. Forgot the newline.

In Python 2.x, instead of
f.write(a + " " + b + "\n")
you can use
print >> f, a, b

print will add in the space and newline, just as it does to
sys.stdout.
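For comparison, here is a minimal sketch of the same idea written with the print function, which is the Python 3 spelling (Python 2.6+ accepts it after `from __future__ import print_function`). The helper name `write_pair` and the in-memory buffer are only illustrative assumptions, not part of the original program.

```python
import io

def write_pair(f, a, b):
    # print() inserts the separating space and the trailing newline itself,
    # so there is no need to concatenate " " and "\n" by hand.
    print(a, b, file=f)

buf = io.StringIO()  # in-memory stand-in for the real output file
write_pair(buf, "Toronto", "2503281")
result = buf.getvalue()
```

With a real file you would pass the object returned by open() instead of the buffer.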

MRAB · Feb 6, 2014

On 2014-02-06 07:52, Zhen Zhang wrote:

Code:

import csv
file = open('raw.csv')
reader = csv.reader(file)
f = open('NicelyDone.text','w')
for line in reader:
    f.write("%s %s"%line[1],%line[5])

Are you using Python 2 or 3?

Here is my question:
1:What is the data format for line[1],


That's something you can easily figure out by printing out the
intermediate values. Try something like:

for line in reader:
    print type(line[1]), repr(line(1))

See if that prints what you expect.

how come f.write() does not work.


What does "does not work" mean? What does get written to the file?
Or do you get some sort of error?

I'm pretty sure I see your error, but I'm trying to lead you to being
able to diagnose it yourself.

Hi Roy,

Thank you so much for the reply.
I am currently running Python 2.7.

I ran
print type(line[1]), repr(line(1))
and it tells me that 'list' object is not callable.

"line" is a list and within repr you're using (...) (parentheses)
instead of [...] (square brackets).

It might be clearer if you call the variable "row" because the CSV
reader returns rows, and each row is a list of strings.
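A small sketch of that point, assuming a shortened version of the sample record from the original question (csv.reader accepts any iterable of strings, so a list stands in for the file here):

```python
import csv

# Shortened sample record; the real file has more columns.
sample = ['3520005,"Toronto (Ont.)",C,F,2503281,2481494']

rows = list(csv.reader(sample))
row = rows[0]

# Each row is a list, and each field in it is a string.
kinds = (type(row).__name__, type(row[1]).__name__)
second_field = row[1]  # square brackets index the list
```

Note that the quoted field comes back without its surrounding double quotes.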

It seems the entire line is a list rather than some "line" data
type, as I thought.

line[1] is a string element of that list after all.

f.write("%s %s %s" %(output,location,output)) works great;
as MRAB mentioned, I have to write it in terms of tuples.

This is the code I am currently using

for line in reader:
    location ="%s"%(line[1])
    if '(' in location:
        bits = location.split('(')
        # at this point, bits = ['Toronto ', 'Ont.)']
        location = bits[0].strip()
    output = "%s %s\n" %(location,line[5])
    f.write("%s" %(output))

A 1-tuple (a tuple containing one item) is:

(item, )

It's actually the comma that makes it a tuple (except for the 0-tuple
"()"); it's just that it's often necessary to wrap it in (...), and
people then think it's those that are making it a tuple, but it's not!
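A quick sketch to make that concrete (the variable names are just for illustration):

```python
one_tuple = ("Toronto",)    # trailing comma: a 1-tuple
just_a_str = ("Toronto")    # parentheses alone: still a plain string
bare_tuple = "Toronto",     # the comma alone is enough
empty_tuple = ()            # the one case where parentheses do the work

# With more than one %s, the right-hand side of % must be a tuple.
formatted = "%s %s\n" % ("Toronto", "2503281")
```

The same rule is why "%s" % (x) works only by accident: (x) is just x, not a tuple.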

It extracts the desired information into a text file as I wanted.
However, the Python program gives me an error after the execution:
location="%s"%(line[1])
IndexError: list index out of range

I failed to figure out why.

What is the value of "line" at that point?

Rustom Mody · Feb 6, 2014

It's actually the comma that makes it a tuple (except for the 0-tuple
"()"); it's just that it's often necessary to wrap it in (...), and
people then think it's those that are making it a tuple, but it's not!

Interesting viewpoint -- didn't know that!

Neil Cerutti · Feb 6, 2014

Hi, everyone.

I am a second year EE student.
I just started learning Python for my project.

I intend to parse a csv file with a format like

3520005,"Toronto (Ont.)",C ,F,2503281,2481494,F,F,0.9,1040597,979330,630.1763,3972.4,1

[...]
into a text file like the following

Toronto 2503281 [...]
This is what I have so far.

Code:

import csv
file = open('raw.csv')

You must open the file in binary mode, as that is what the csv
module expects in Python 2.7. Newline handling can be enscrewed
if you forget.

file = open('raw.csv', 'rb')

Mark Lawrence · Feb 6, 2014

You must open the file in binary mode, as that is what the csv
module expects in Python 2.7. Newline handling can be enscrewed
if you forget.

file = open('raw.csv', 'rb')

I've never opened a file in binary mode to read with the csv module
using any Python version. Where does it state that you must do this?

Tim Chase · Feb 6, 2014

I've never opened a file in binary mode to read with the csv module
using any Python version. Where does it state that you must do
this?

While the docs don't currently say anything about it, all the
examples at [1] use 'rb' or 'wb' when opening the file. I've long
wondered about that. Especially as I've passed non-file objects like
lists/iterators to the csv.reader/csv.DictReader and had them work
just fine (and would be a little perturbed if they broke).

-tkc

[1] http://docs.python.org/2/library/csv.html

Tim Golden · Feb 6, 2014

I've never opened a file in binary mode to read with the csv module
using any Python version. Where does it state that you must do this?

If you don't, you tend to get interleaved blank lines. (Presumably
unless your .csv is using \n-only linefeeds).

TJG

Neil Cerutti · Feb 6, 2014

I've never opened a file in binary mode to read with the csv module
using any Python version. Where does it state that you must do
this?


While the docs don't currently say anything about it, all the
examples at [1] use 'rb' or 'wb' when opening the file. I've
long wondered about that. Especially as I've passed non-file
objects like lists/iterators to the csv.reader/csv.DictReader
and had them work just fine (and would be a little perturbed if
they broke).

They do actually mention it.

From: http://docs.python.org/2/library/csv.html

csv.reader(csvfile, dialect='excel', **fmtparams)

Return a reader object which will iterate over lines in the
given csvfile. csvfile can be any object which supports the
iterator protocol and returns a string each time its next()
method is called — file objects and list objects are both
suitable. If csvfile is a file object, it must be opened with
the ‘b’ flag on platforms where that makes a difference.

So it's stipulated only for file objects on systems where it
might make a difference.
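As a rough sketch of what that means in practice: in Python 2 the file is opened with 'rb', while the Python 3 csv documentation instead asks for text mode with newline=''. The helper name `open_csv_for_reading` and the throwaway temp file below are assumptions made for the demonstration, not something from the original program.

```python
import csv
import os
import sys
import tempfile

def open_csv_for_reading(path):
    # Python 2: the csv module works on bytes, so use binary mode.
    if sys.version_info[0] == 2:
        return open(path, 'rb')
    # Python 3: the csv module wants text with newline translation disabled.
    return open(path, newline='')

# Throwaway file with Windows-style \r\n line endings.
fd, path = tempfile.mkstemp(suffix='.csv')
with os.fdopen(fd, 'wb') as tmp:
    tmp.write(b'a,b\r\nc,d\r\n')

with open_csv_for_reading(path) as csvfile:
    rows = list(csv.reader(csvfile))
os.remove(path)
```

Either way the csv module, not the file object, gets to decide how line endings are handled.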

Tim Chase · Feb 6, 2014

[first, it looks like you're posting via Google Groups which
annoyingly double-spaces everything in your reply. It's possible to
work around this, but you might want to subscribe via email or an
actual newsgroup client. You can read more at
https://wiki.python.org/moin/GoogleGroupsPython ]

Does the split make a list or tuple?

In this case, it happens to return a list, which you can check with

print type("one two three".split())

However, also in this case, it doesn't matter, since either indexes
just fine.
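For the location field from this thread, a short sketch of what split gives back and how the city name falls out of it:

```python
location = "Toronto (Ont.)"

bits = location.split('(')   # str.split always returns a list
city = bits[0].strip()       # drop the trailing space left before '('
```

Here bits is ['Toronto ', 'Ont.)'], matching the comment in the code posted earlier.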

when I do location=line[1],
it gives me an error even though the program did run correctly and
output the correct file.
location=line[1]
IndexError: list index out of range

Then it looks like you've got a blank line that doesn't actually have
data in it, so when it tries to index into it, there is at most
[0], not [1], as the message suggests.

-tkc
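One possible way to guard against such rows, sketched under the assumption that any row with fewer than six fields should simply be skipped (the sample data is shortened, and a list of strings stands in for the file). For a completely empty line csv.reader yields an empty list, so even [0] would fail there.

```python
import csv

raw_lines = [
    '3520005,"Toronto (Ont.)",C,F,2503281,2481494',
    '',  # blank line, for example at the end of the file
]

out = []
for row in csv.reader(raw_lines):
    if len(row) < 6:
        # Too short to contain row[5]; skip instead of raising IndexError.
        continue
    location = row[1].split('(')[0].strip()
    out.append("%s %s\n" % (location, row[5]))
```

In the real program each string in out would be passed to f.write() instead of being collected in a list.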

Tim Chase · Feb 6, 2014

They do actually mention it.

From: http://docs.python.org/2/library/csv.html

If csvfile is a file object, it must be opened with
the ‘b’ flag on platforms where that makes a difference.

So it's stipulated only for file objects on systems where it
might make a difference.

Ah, I *knew* I'd read that somewhere but my searches in firefox (for
"binary", "rb" and "wb") didn't manage to catch that particular
instance. Thanks for disinterring that.

-tkc
