pointer past end of buffer

John Goche · Oct 30, 2006

A lot of C++ code allocates a buffer and initializes
start and end pointers as follows:

+-------------------------------+
+ +
+-------------------------------+
^ ^
| |
pStart pEnd

setting pEnd = pStart + bufLen

But what if the buffer is allocated at the very end of memory
and just fits. Then pEnd == MEM_MAX + 1 == 0 and so
library users could tamper with code by creating a buffer
of suitable size. Can this happen in practice?

JG

Alf P. Steinbach · Oct 30, 2006

* John Goche:

A lot of C++ code allocates a buffer and initializes
start and end pointers as follows:

+-------------------------------+
+ +
+-------------------------------+
^ ^
| |
pStart pEnd

setting pEnd = pStart + bufLen

But what if the buffer is allocated at the very end of memory
and just fits. Then pEnd == MEM_MAX + 1 == 0 and so
library users could tamper with code by creating a buffer
of suitable size. Can this happen in practice?

The wrapping can not be a /problem/ with a conforming compiler.

And in practice such wrapping will not (be allowed to) happen.

But theoretically a compiler could allow that and make you unaware that
it happens unless you do low-level machine-specific things to inspect
the bit patterns of pointers.

Phlip · Oct 30, 2006

John said:
A lot of C++ code allocates a buffer and initializes
start and end pointers as follows:

+-------------------------------+
+ +
+-------------------------------+
^ ^
| |
pStart pEnd

setting pEnd = pStart + bufLen

But what if the buffer is allocated at the very end of memory
and just fits. Then pEnd == MEM_MAX + 1 == 0 and so
library users could tamper with code by creating a buffer
of suitable size. Can this happen in practice?

The C++ Standard reputedly declares that pointing and indexing
one-off-the-end of an array is well-defined. (Copying out the value of that
bogus element is undefined, except if the element is a char, where it's
simply garbage.)

That means a C++ implementation may not, for example, place any array right
at the end of memory, such that its one-off-the-end location occupies an
overflowed pointer value, or a storage location protected by hardware.

This rule permits all the idioms you have noted, including all of STL's
"asymetric extents". The "start" of anything must be a valid element, and
the "end" must use -- to get to a valid element.

After you become familiar with this effect, it becomes vaguely elegant. But
also extremely useful!

John Goche · Oct 30, 2006

Alf said:
* John Goche:

The wrapping can not be a /problem/ with a conforming compiler.

Is there something in the C++ standard that states this?

Thanks,

JG

Frederick Gotham · Oct 30, 2006

JG Posted:

A lot of C++ code allocates a buffer and initializes
start and end pointers as follows:

+-------------------------------+
+ +
+-------------------------------+
^ ^
| |
pStart pEnd

setting pEnd = pStart + bufLen

Indeed.

size_t const buf_size = 512;

char unsigned *const p = (char unsigned*)malloc(buf_size);
char unsigned const *const pover = p + buf_size;

But what if the buffer is allocated at the very end of memory
and just fits. Then pEnd == MEM_MAX + 1 == 0

That's a possible way of doing it, yes.

and so library users could tamper with code by creating a buffer of
suitable size. Can this happen in practice?

I don't understand what you're saying. . . how could they tamper with code?

Phlip:

The C++ Standard reputedly declares that pointing and indexing
one-off-the-end of an array is well-defined. (Copying out the value of
that bogus element is undefined, except if the element is a char, where
it's simply garbage.)

That's incorrect; the behaviour of the following is undefined:

int main()
{
char buf[12];

buf[12];
}

That means a C++ implementation may not, for example, place any array
right at the end of memory, such that its one-off-the-end location
occupies an overflowed pointer value, or a storage location protected by
hardware.

The C++ Standard imposes no such restriction.

The whole "pointer to one past last" concept has been discussed in depth
many times. Things to note are:

(1) The null pointer value need not be represented by all bits zero.
(2) Pointer arithmetic need not be calculated internally in the same
fashion that unsigned arithmetic is (i.e. wrap-around overflow).
(3) The "pointer to one past last" may compare equal to null.

This leaves the door wide open for implementors, just so long as the code
behaves as it should.

Jim Langston · Oct 30, 2006

John Goche said:
Is there something in the C++ standard that states this?

Yes.

John Goche · Oct 30, 2006

Jim said:
Yes.

So I understand that for a buffer of length buflen > 0 we can
assume that p < q so long as q is set to a value q <= p + buflen.
In the case where we set q > p + buflen then it is not guaranteed
that p < q holds due to possible pointer overflow. Is this correct?

Thanks,

JG

Ron Natalie · Oct 31, 2006

Phlip said:
That means a C++ implementation may not, for example, place any array right
at the end of memory, such that its one-off-the-end location occupies an
overflowed pointer value, or a storage location protected by hardware.

It can be a protected location if the protection is limited to accessing
the memory at that location. If you get a trap for just having that
address in a pointer (very uncommon) well then it's not allowed.

Andrew Koenig · Nov 2, 2006

So I understand that for a buffer of length buflen > 0 we can

buflen >= 0 (which could happen if the buffer is dynamically allocated)

assume that p < q so long as q is set to a value q <= p + buflen.

we can assume that p <= q (because buflen might be 0) so long as
p <= q <= p + buflen (i.e. you can't have q < p and still expect p < q

)

In the case where we set q > p + buflen then it is not guaranteed
that p < q holds due to possible pointer overflow. Is this correct?

Correct. In that case you're not even assured that you can evaluate p<q.

Andrew Koenig · Nov 2, 2006

The C++ Standard reputedly declares that pointing and indexing
one-off-the-end of an array is well-defined. (Copying out the value of
that bogus element is undefined, except if the element is a char, where
it's simply garbage.)

Unsigned char, I think.

This rule permits all the idioms you have noted, including all of STL's
"asymetric extents". The "start" of anything must be a valid element, and
the "end" must use -- to get to a valid element.

The "start" of anything must be a valid element as long as it's not equal to
the "end", which is how you would indicate an empty sequence.

In other words, if c is an empty container, c.begin() == c.end() will be
true, but you are not assured of being able to evaluate *c.begin().

Old Wolf · Nov 2, 2006

Andrew said:
Correct. In that case you're not even assured that you can evaluate p<q.

5.9#2 says that p<q is unspecified in this case (ie. you can
evaluate it but it could evaluate to either true or false).

In the C language, the behaviour is undefined.

Old Wolf · Nov 2, 2006

Old said:
5.9#2 says that p<q is unspecified in this case (ie. you can
evaluate it but it could evaluate to either true or false).

Of course it is also undefined if q no longer points to a valid
object, which I believe we are currently debating in c.l.c

Ron Natalie · Nov 3, 2006

Andrew said:
Unsigned char, I think.

It's still undefined.

You're mistaking the rule that says you can use a char pointer to
access all the ALLOCATED bytes comprising any object. Once you
get outside the bounds of an object, your in undefined land.

How does a HEAD pointer end up pointing to the first node in a linked list?	3	Jan 24, 2023
Always safe to free() a pointer one byte past the end of an allocatedblock?	15	Aug 3, 2013
vector, but without the overhead of dynamic memory allocation	5	Mar 9, 2011
Incrementing a pointer to a one-past-the-end value?	8	Jul 27, 2005
One-past-end-of-object pointers	3	Sep 1, 2004
Strict aliasing and buffer handling	20	Jun 20, 2011
Can't solve problems! please Help	0	Sep 26, 2022
Neatest way to get the end pointer?	94	Feb 5, 2008

pointer past end of buffer

John Goche

Alf P. Steinbach

Phlip

John Goche

Frederick Gotham

Jim Langston

John Goche

Ron Natalie

Andrew Koenig

Andrew Koenig

Old Wolf

Old Wolf

Ron Natalie

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads