slicing the end of a string in a list

John Salerno · Mar 2, 2006

Here's the code I wrote:

file = open('C:\switches.txt', 'r')
switches = file.readlines()
i = 0

for line in switches:
line = switches[:-1]
i += 1

print switches

You can probably tell what I'm doing. Read a list of lines from a file,
and then I want to slice off the '\n' character from each line. But
after this code runs, the \n is still there. I thought it might have
something to do with the fact that strings are immutable, but a test
such as:

switches[0][:-1]

does slice off the \n character. So I guess the problem lies in the
assignment or somewhere in there.

Also, is this the best way to index the list?

Ben Cartwright · Mar 2, 2006

John said:
You can probably tell what I'm doing. Read a list of lines from a file,
and then I want to slice off the '\n' character from each line. But
after this code runs, the \n is still there. I thought it might have
something to do with the fact that strings are immutable, but a test
such as:

switches[0][:-1]

does slice off the \n character.

Actually, it creates a new string instance with the \n character
removed, then discards it. The original switches[0] string hasn't
changed.

>>> foo = 'Hello world!'
>>> foo[:-1] 'Hello world'
>>> foo

Click to expand...

Click to expand...

'Hello world!'

So I guess the problem lies in the
assignment or somewhere in there.

Yes. You are repeated assigning a new string instance to "line", which
is then never referenced again. If you want to update the switches
list, then instead of assigning to "line" inside the loop, you need:

switches = switches[:-1]

Also, is this the best way to index the list?

Click to expand...

No, since the line variable is unused. This:

i = 0
for line in switches:
line = switches[:-1]
i += 1

Would be better written as:

for i in range(len(switches)):
switches = switches[:-1]

For most looping scenarios in Python, you shouldn't have to manually
increment a counter variable.

--Ben

PS - actually, you can accomplish all of the above in a single line of
code:
print [line[:-1] for line in open('C:\\switches.txt')]

John Salerno · Mar 2, 2006

Ben said:
Actually, it creates a new string instance with the \n character
removed, then discards it. The original switches[0] string hasn't
changed.
Yes. You are repeated assigning a new string instance to "line", which
is then never referenced again.

Ah, thank you!

PS - actually, you can accomplish all of the above in a single line of
code:
print [line[:-1] for line in open('C:\\switches.txt')]

Wow, that just replaced 7 lines of code! So *this* is why Python is
good.

John Salerno · Mar 2, 2006

Ben said:
print [line[:-1] for line in open('C:\\switches.txt')]

Hmm, I just realized in my original code that I didn't escape the
backslash. Why did it still work properly?

By the way, this whole 'one line' thing has blown me away. I wasn't
thinking about list comprehensions when I started working on this, but
just the fact that it can all be done in one line is amazing. I tried
this in C# and of course I had to create a class first, and open the
file streams, etc.

And do I not need the 'r' parameter in the open function?

Paul Rubin · Mar 2, 2006

John Salerno said:
print [line[:-1] for line in open('C:\\switches.txt')]

Click to expand...

Hmm, I just realized in my original code that I didn't escape the
backslash. Why did it still work properly?

The character the backslash isn't special: \s doesn't get into
a code like \n, so the backslash is passed through. Best not to
rely on that.

The preferred way to remove the newline is more like:
for line in open('C:\\switches.txt'):
print line.rstrip()

the rstrip method removes trailing whitespace, which might be \n
on some systems, \r\n on other systems, etc.

And do I not need the 'r' parameter in the open function?

No you get 'r' by default. If you want to write to the file you need
to pass the parameter.

John Salerno · Mar 2, 2006

Paul said:
The preferred way to remove the newline is more like:
for line in open('C:\\switches.txt'):
print line.rstrip()

Interesting. So I would say:

[line.rstrip() for line in open('C:\\switches.txt')]

John Salerno · Mar 2, 2006

John said:
Paul said:

The preferred way to remove the newline is more like:
for line in open('C:\\switches.txt'):
print line.rstrip()

Click to expand...

Interesting. So I would say:

[line.rstrip() for line in open('C:\\switches.txt')]

That seems to work. And on a related note, it seems to allow me to end
my file on the last line, instead of having to add a newline character
at the end of it so it will get sliced properly too.

Paul Rubin · Mar 3, 2006

John Salerno said:
Interesting. So I would say:

[line.rstrip() for line in open('C:\\switches.txt')]

Yes, you could do that. Note that it builds up an in-memory list of
all those lines, instead of processing the file one line at a time.
If the file is very large, that might be a problem.

If you use parentheses instead:

(line.rstrip() for line in open('C:\\switches.txt'))

you get what's called a generator expression that you can loop
through, but that's a bit complicated to explain, it's probably better
to get used to other parts of Python before worrying about that.

Leif K-Brooks · Mar 3, 2006

Ben said:
No, since the line variable is unused. This:

i = 0
for line in switches:
line = switches[:-1]
i += 1

Would be better written as:

for i in range(len(switches)):
switches = switches[:-1]

This is better, IMHO:

for i, switch in enumerate(switches):
switches = switch[:-1]

Peter Otten · Mar 3, 2006

John said:
You can probably tell what I'm doing. Read a list of lines from a file,
and then I want to slice off the '\n' character from each line.

If you are not concerned about memory consumption there is also

open(filename).read().splitlines()

Peter

P Boy · Mar 3, 2006

One liners are cool. Personally however, I would not promote one liners
in Python. Python code is meant to be read. Cryptic coding is in perl's
world.

Code below is intuitive and almost a three year old would understand.

for line in open('C:\\switches.txt'):
print line.rstrip()

BTW, if the file is huge, one may want to consider using
open('c:\\switches.txt', 'rb') instead.

Peter Otten · Mar 3, 2006

P said:
BTW, if the file is huge, one may want to consider using
open('c:\\switches.txt', 'rb') instead.

Why?

P Boy · Mar 3, 2006

I had some issues while ago trying to open a large binary file.

Anyway, from file() man page:

If mode is omitted, it defaults to 'r'. When opening a binary file, you
should append 'b' to the mode value for improved portability. (It's
useful even on systems which don't treat binary and text files
differently, where it serves as documentation.) The optional bufsize
argument specifies the file's desired buffer size: 0 means unbuffered,
1 means line buffered, any other positive value means use a buffer of
(approximately) that size. A negative bufsize means to use the system
default, which is usually line buffered for tty devices and fully
buffered for other files. If omitted, the system default is used.2.3

Steven D'Aprano · Mar 3, 2006

I had some issues while ago trying to open a large binary file.

The important term there is BINARY, not large. Many problems *reading*
(not opening) binary files will go away if you use 'rb', regardless of
whether they are small, medium or large.

Anyway, from file() man page:

If mode is omitted, it defaults to 'r'. When opening a binary file, you
should append 'b' to the mode value for improved portability. (It's
useful even on systems which don't treat binary and text files
differently, where it serves as documentation.)

Which does not suggest that using 'rb' is better for large files and 'r'
for small. It suggests that using 'rb' is better for binary files and 'r'
for text.

The optional bufsize
argument specifies the file's desired buffer size: 0 means unbuffered,
1 means line buffered, any other positive value means use a buffer of
(approximately) that size. A negative bufsize means to use the system
default, which is usually line buffered for tty devices and fully
buffered for other files. If omitted, the system default is used.2.3

If you are having problems with large files, changing the buffering will
help far more than changing the mode.

John Salerno · Mar 3, 2006

Steven said:
The important term there is BINARY, not large. Many problems *reading*
(not opening) binary files will go away if you use 'rb', regardless of
whether they are small, medium or large.

Is 'b' the proper parameter to use when you want to read/write a binary
file? I was wondering about this, because the book I'm reading doesn't
talk about dealing with binary files.

Steven D'Aprano · Mar 4, 2006

Is 'b' the proper parameter to use when you want to read/write a binary
file? I was wondering about this, because the book I'm reading doesn't
talk about dealing with binary files.

The interactive interpreter is your friend. Call help(file), and you will
get:

class file(object)
| file(name[, mode[, buffering]]) -> file object
|
| Open a file. The mode can be 'r', 'w' or 'a' for reading (default),
| writing or appending. The file will be created if it doesn't exist
| when opened for writing or appending; it will be truncated when
| opened for writing. Add a 'b' to the mode for binary files.

plus extra information.

Take note that the mode is NOT "b". It is "rb".

John Salerno · Mar 4, 2006

Steven said:
Is 'b' the proper parameter to use when you want to read/write a binary
file? I was wondering about this, because the book I'm reading doesn't
talk about dealing with binary files.

Click to expand...

The interactive interpreter is your friend. Call help(file), and you will
get:

class file(object)
| file(name[, mode[, buffering]]) -> file object
|
| Open a file. The mode can be 'r', 'w' or 'a' for reading (default),
| writing or appending. The file will be created if it doesn't exist
| when opened for writing or appending; it will be truncated when
| opened for writing. Add a 'b' to the mode for binary files.

plus extra information.

Take note that the mode is NOT "b". It is "rb".

Awesome! I'm trying to push away thoughts of C#'s binary reader and
writer classes now.

John Salerno · Mar 6, 2006

Paul said:
John Salerno said:

Interesting. So I would say:

[line.rstrip() for line in open('C:\\switches.txt')]

Click to expand...

How would I manually close a file that's been opened this way? Or is it
not possible in this case? Is it necessary?

Steve Holden · Mar 6, 2006

John said:
Paul said:

John Salerno said:

Interesting. So I would say:

[line.rstrip() for line in open('C:\\switches.txt')]

Click to expand...

Click to expand...

How would I manually close a file that's been opened this way? Or is it
not possible in this case? Is it necessary?

It's not possible to perform an explicit close if, as in this case, you
don't have an explicit reference to the file object.

In CPython it's not strictly necessary to close the file, but other
implementations don't guarantee that a file will be closed after the
last reference is deleted.

So for fullest portability it's better explicitly close the file.

regards
Steve

John Salerno · Mar 6, 2006

Steve said:
It's not possible to perform an explicit close if, as in this case, you
don't have an explicit reference to the file object.

In CPython it's not strictly necessary to close the file, but other
implementations don't guarantee that a file will be closed after the
last reference is deleted.

So for fullest portability it's better explicitly close the file.

regards
Steve

Thanks!

Changing string value that is an element of a list	3	Feb 10, 2025
How does a HEAD pointer end up pointing to the first node in a linked list?	3	Jan 24, 2023
List filenames that end in .mp4 and add to a list	10	Dec 25, 2023
Slice lists and extended slicing	0	Jan 26, 2011
Average of MultiMode of a list of a list	1	Oct 28, 2022
The layout of braces	2	Sep 2, 2024
Select files based on text list of filenames(part of the name:date) with condition	0	May 4, 2022
Range / empty list issues??	1	Dec 10, 2023

slicing the end of a string in a list

John Salerno

Ben Cartwright

John Salerno

John Salerno

Paul Rubin

John Salerno

John Salerno

Paul Rubin

Leif K-Brooks

Peter Otten

P Boy

Peter Otten

P Boy

Steven D'Aprano

John Salerno

Steven D'Aprano

John Salerno

John Salerno

Steve Holden

John Salerno

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads