Built-in open() with buffering > 1

Marco · Aug 24, 2012

Please, can anyone explain me the meaning of the
"buffering > 1" in the built-in open()?
The doc says: "...and an integer > 1 to indicate the size
of a fixed-size chunk buffer."
So I thought this size was the number of bytes or chars, but
it is not:
'abcdefghi\n'

Regards,
Marco

Marco · Aug 24, 2012

Please, can anyone explain me the meaning of the
"buffering > 1" in the built-in open()?
The doc says: "...and an integer > 1 to indicate the size
of a fixed-size chunk buffer."

Sorry, I get it:
.... n = f.write(str(i))
.... print(i, open('myfile').read(), sep=':')
....
0:
1:
2:
3:
4:
5:012345

Ramchandra Apte · Aug 24, 2012

`f._CHUNK_SIZE = 5` is modifying Python's internal variables - don't do that
google buffering to find out what it is
buffering is how much Python will keep in memory
f.read(1) will actually read `buffering` bytes of memory so that when you read later, the reading can be done from memory

Hans Mulder · Aug 26, 2012

Please, can anyone explain me the meaning of the
"buffering > 1" in the built-in open()?
The doc says: "...and an integer > 1 to indicate the size
of a fixed-size chunk buffer."
So I thought this size was the number of bytes or chars, but
it is not

The algorithm is explained at
http://docs.python.org/library/io.html#io.DEFAULT_BUFFER_SIZE

In other words: open() tries to find a suitable size by
calling os.stat(your_file).st_blksize and if that fails,
it uses io.DEFAULT_BUFFER_SIZE, which is 8192 on my box.

Whether you call open with buffering=2 or any larger
number, does not matter: the buffer size will be the
outcome of this algorithm.

Hope this helps,

-- HansM

Marco · Aug 30, 2012

The algorithm is explained at
http://docs.python.org/library/io.html#io.DEFAULT_BUFFER_SIZE

Thanks

In other words: open() tries to find a suitable size by
calling os.stat(your_file).st_blksize and if that fails,
it uses io.DEFAULT_BUFFER_SIZE, which is 8192 on my box.

Yes, when the parameter `buffering` is a negative integer
that is right

Whether you call open with buffering=2 or any larger
number, does not matter: the buffer size will be the
outcome of this algorithm.

Mmm, I think it is not right, because in this case
the buffer size is not computed but it is
the value you assign to the buffering parameter.
In fact:

Now two bytes are in the buffer and the buffer is full.
If you write another byte, it will not be written in the
buffer, because the bytes in the queue will be transferred
into the buffer only when they are more than f._CHUNK_SIZE:

Now, if you write another byte 'd', the chunk 'cd' will
be transferred to the buffer, but because it is full,
its content 'ab' will be transferred to the disk, and
after 'cd' written to the buffer, that still full:
'ab'

So, the buffer is really of size 2

Buffering of sys.stdout and sys.stderr in python3 (and documentation)	4	Dec 9, 2011
Weird Behavior with Rays in C and OpenGL	4	Feb 13, 2024
py_compile vs. built-in compile, with __future__	7	Jun 10, 2013
How can I view / open / render / display a pdf file with c code?	0	Sep 23, 2023
Buffering object	8	Jan 27, 2011
Question about how to get line buffering from paramiko	0	Jul 5, 2011
multiprocessing.sharedctypes and built-in locks	4	Mar 14, 2009
"Don't rebind built-in names*" - it confuses readers	20	Jun 11, 2013

Built-in open() with buffering > 1

Marco

Marco

Ramchandra Apte

Hans Mulder

Marco

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads