codecs.open() doesn't handle platform-specific line terminator

Thread starter John Machin
Start date May 9, 2011

John Machin

May 9, 2011

According to the 3.2 docs
(http://docs.python.org/py3k/library/codecs.html#codecs.open),

"""Files are always opened in binary mode, even if no binary mode was
specified. This is done to avoid data loss due to encodings using 8-bit
values. This means that no automatic conversion of b'\n' is done on
reading and writing."""

The first point is that one would NOT expect "conversion of b'\n'" anyway.
One expects '\n' -> os.sep.encode(the_encoding) on writing and vice versa
on reading.

The second point is that there is no such restriction with the built-in
open(), which appears to work as expected, doing (e.g. Windows, UTF-16LE)
'\n' -> b'\r\x00\n\x00' when writing and vice versa on reading, and not
striking out when thrown curve balls like '\u0a0a'.

Why is codecs.open() different? What does "encodings using 8-bit values"
mean? What data loss?

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads

codecs.open on Win32 -- converting my newlines to CR+LF	4	Aug 26, 2009
Implementing a Q-Learning Algorithm with Logistic Regression Normalization in C++	0	Jun 4, 2025
PEP8, line continuations and string formatting operations	1	Jan 21, 2011
ANN: ConfigObj 4.6.0 and Validate 1.0.0 released	0	Apr 17, 2009
ANN: eGenix mxODBC Connect 2.1.0 - Python ODBC Database Interface	0	May 28, 2014
How to bypass Windows 'cooking' the I/O? (One more time, please) II	2	Jul 7, 2008
UTF - SEEK_SET workaround for BOM encoding(utf-16/32) layer Bug	2	Aug 5, 2009
printing bits ... the right way	2	Apr 1, 2010

Facebook Twitter Reddit Pinterest Tumblr WhatsApp Email Link

Members online

No members online now.

Total: 213 (members: 0, guests: 213)
Robots: 360

Forum statistics

Threads: 474,432

Messages: 2,571,680

Members: 48,796

Latest member: Greg L.

Latest Threads

Will programmers be doomed since AI can write code in seconds?
- Started by John Joe
- Yesterday at 12:39 PM
Files Uploaded to Google Drive but Not Visible Anywhere
- Started by henrywalker
- Thursday at 5:54 AM
Why cant I print my secured PDF file and how can I fix it?
- Started by vorix28193
- Wednesday at 8:14 AM
Can PST files be converted to EML without Outlook?
- Started by samikshasen34
- Wednesday at 6:35 AM
How Can I Convert Outlook PST Files to MBOX Without Losing Attachments?
- Started by annawelson
- Monday at 2:24 PM
Lost in Multiple Mail Folders? Merge PST Files Easily
- Started by juliewhite
- May 27, 2026
Colspan probs
- Started by jakey
- May 21, 2026
Dicy dice
- Started by WhiteCube
- May 13, 2026
Need a reliable PST Converter Software for Outlook mailbox conversion
- Started by Damian01
- May 9, 2026
Need a PST Converter Free Download to Check Emails Before Export
- Started by vorix28193
- May 5, 2026

Top