Portable pointer arithmetics?

matt · Jul 24, 2010

I (think I have) understood the pitfalls of pointer arithmetics.

For example, the output of this code depends on the platform on which
this program will run:

char * cp = "Hello World";
char * c2p = NULL;
int * ip = (int *) cp;
ip = ip + 1;
c2p = (char *) ip;
printf("%c\n", *c2p);

The output of this program is "o" where the size of integer is 4 and "l"
where the size of integer is 2 bytes. However, this problem is intrinsic
in the application, where we scan an array of chars with a pointer to
integers (through the typecasting).

On the other hand, is the following (pseudo)code

mytype mta[5];
...
mytype * mtp = &mta[i]; // 0 <= i <= 4
mytypeAmazingFunction(mtp, mtp-1, <some_other_args>);

*always* portable to any architecture?

It is equivalent to

mytypeAmazingFunction(&mtp[i], &mtp[i-1], <some_other_args>);

which is always portable, isn't it?

Gene · Jul 24, 2010

When i == 0, mtp - 1 causes undefined behavior.

Barry Schwarz · Jul 24, 2010

I (think I have) understood the pitfalls of pointer arithmetics.

For example, the output of this code depends on the platform on which
this program will run:

char * cp = "Hello World";
char * c2p = NULL;
int * ip = (int *) cp;

This statement will invoke undefined behavior if the string literal
happens to be improperly aligned for an int.

ip = ip + 1;
c2p = (char *) ip;
printf("%c\n", *c2p);

The output of this program is "o" where the size of integer is 4 and "l"
where the size of integer is 2 bytes. However, this problem is intrinsic
in the application, where we scan an array of chars with a pointer to
integers (through the typecasting).

You could eliminate the potentially undefined behavior by eliminating
ip and assigning c2p the value cp+sizeof(int). The result is
implementation dependent but at least well defined.
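
A minimal sketch of that suggestion, assuming a hypothetical helper named char_after_one_int (not from the thread). It steps through the string with char arithmetic only, so no int * is ever formed and alignment is not an issue; which character comes back still depends on sizeof(int).

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: returns the character located sizeof(int) bytes
   into the string s. Only char pointers are used, so no alignment
   requirement applies. */
char char_after_one_int(const char *s)
{
    assert(s != NULL);
    /* The offset must stay inside the string (the terminator counts). */
    assert(strlen(s) >= sizeof(int));

    const char *c2p = s + sizeof(int);  /* char arithmetic: moves sizeof(int) bytes */
    return *c2p;
}
```

Calling char_after_one_int("Hello World") yields the same character as "Hello World"[sizeof(int)], which is 'o' where sizeof(int) is 4.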

On the other hand, is the following (pseudo)code

mytype mta[5];
...
mytype * mtp = &mta[i]; // 0 <= i <= 4
mytypeAmazingFunction(mtp, mtp-1, <some_other_args>);

If i is 0, then evaluating mtp-1 invokes undefined behavior.

*always* portable to any architecture?

It is equivalent to

mytypeAmazingFunction(&mtp[i], &mtp[i-1], <some_other_args>);

No. The two arguments should be &mta[i] and &mta[i-1]. Evaluating
the second argument still invokes undefined behavior when i is 0.

which is always portable, isn't it?

Since the statement invokes undefined behavior on all systems when i
is 0, that is a perverse form of portability. Most use the term
portability to mean produce equivalent results on multiple systems
(taking into account inevitable differences due to implementation
details such as the number of significant digits in floating point
types). Since undefined behavior is not guaranteed to be consistent
across implementations, I think most would say a program invoking
undefined behavior cannot be portable.

I do not see how your second discussion using mytype relates to your first
discussion about the implementation-dependent size of an int.

matt · Jul 24, 2010

It is equivalent to

mytypeAmazingFunction(&mtp[i], &mtp[i-1], <some_other_args>);

which is always portable, isn't it?

Sorry, of course I meant

mytypeAmazingFunction(&mta[i], &mta[i-1], <some_other_args>);

matt · Jul 24, 2010

You could eliminate the potentially undefined behavior by eliminating
ip and assigning c2p the value cp+sizeof(int). The result is
implementation dependent but at least well defined.

ok thanks.

On the other hand, is the following (pseudo)code

mytype mta[5];
...
mytype * mtp = &mta[i]; // 0 <= i <= 4
mytypeAmazingFunction(mtp, mtp-1, <some_other_args>);

If i is 0, then evaluating mtp-1 invokes undefined behavior.

*always* portable to any architecture?

It is equivalent to

mytypeAmazingFunction(&mtp[i], &mtp[i-1], <some_other_args>);

No. The two arguments should be &mta[i] and &mta[i-1]. Evaluating
the second argument still invokes undefined behavior when i is 0.

Sorry for both. Bad cut&paste:

1 <= i <= 4
and

mytypeAmazingFunction(&mta[i], &mta[i-1], <some_other_args>);

I do not see how your second discussion using mytype relates to your first
discussion about the implementation-dependent size of an int.

I didn't mean to discuss the implementation-dependent size of an int.

I meant: OK, I understand the common pitfalls of working with pointer
arithmetic (e.g. scanning a vector of chars with a pointer to integers);
however, I'm dealing with pointer arithmetic in a different context
(mytypeAmazingFunction() on the mytype data type). Does
mytypeAmazingFunction(mtp, mtp-1, <some_other_args>) (using pointer
arithmetic) suffer from some problem when ported across multiple
platforms, so that I am constrained to use
mytypeAmazingFunction(&mta[i], &mta[i-1], <some_other_args>) (which is
always portable), or not?

Eric Sosman · Jul 24, 2010

I (think I have) understood the pitfalls of pointer arithmetics.

For example, the output of this code depends on the platform on which
this program will run:

char * cp = "Hello World";
char * c2p = NULL;
int * ip = (int *) cp;
ip = ip + 1;
c2p = (char *) ip;
printf("%c\n", *c2p);

The output of this program is "o" where the size of integer is 4 and "l"
where the size of integer is 2 bytes. However, this problem is intrinsic
in the application, where we scan an array of chars with a pointer to
integers (through the typecasting).

The output might also be "Haddocks' Eyes," or there might not
be any output at all, or your hard drive might fly away like a
Frisbee and get chewed by a frolicking spaniel (in theory, anyhow).
The problem is that the nameless char[] array created by the literal
might not begin at an address that is suitably aligned for an int,
so an int* is not necessarily able to point at that address. The
initialization of `ip' is therefore problematic, and there's really
no telling what might happen.

On "mainstream" machines nowadays the potential misalignment will
cause no trouble (in this case). But if the rapid and turbulent
cascade of change in computerland keeps flowing as swiftly as it has
for the last half-century, today's mainstream is tomorrow's backwater.
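
A minimal sketch of how one might check the alignment concern before converting, assuming a hypothetical helper named is_aligned_for_int (not from the thread). It relies on uintptr_t, which is an optional type, and on the implementation-defined pointer-to-integer mapping behaving like a byte address, so treat it as a diagnostic aid on common platforms and not as a portability guarantee.

```c
#include <assert.h>
#include <stdalign.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: returns nonzero if p looks suitably aligned for an
   int. Assumes the integer value of a pointer is its byte address, which
   holds on mainstream implementations but is not required by the standard. */
int is_aligned_for_int(const void *p)
{
    assert(p != NULL);
    return (uintptr_t)p % alignof(int) == 0;
}
```

Only when this check passes is a conversion such as (int *)cp known to be safe on such an implementation; the string literal in the example carries no such promise.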

On the other hand, is the following (pseudo)code

mytype mta[5];
...
mytype * mtp = &mta[i]; // 0 <= i <= 4
mytypeAmazingFunction(mtp, mtp-1, <some_other_args>);

*always* portable to any architecture?

Not if `i' is zero, which causes you to try to form a pointer
to the nonexistent element `mta[-1]'. There's a special rule that
lets you form a pointer to the nonexistent element just *after* an
array (`mta[5]', in this case), provided you don't try to access
that fictitious element, but there's no similar special case for
pointing at an imaginary element preceding an array.
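
The rule can be sketched as follows. The struct layout of mytype and the helper names sum_forward and previous_or_null are assumptions made for illustration; the thread never defines mytype. The first function forms a one-past-the-end pointer and only compares against it; the second refuses to form a pointer before the first element.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the thread's mytype. */
typedef struct { int value; } mytype;

/* Sums n elements, using the one-past-the-end pointer as the loop bound.
   `end` is formed and compared, but never dereferenced. */
int sum_forward(const mytype *a, size_t n)
{
    const mytype *end = a + n;  /* allowed: points just past the last element */
    int total = 0;
    for (const mytype *p = a; p != end; ++p)
        total += p->value;
    return total;
}

/* Returns a pointer to the element before a[i], or NULL when i == 0,
   so a pointer preceding the array is never computed. */
const mytype *previous_or_null(const mytype *a, size_t n, size_t i)
{
    assert(i < n);
    return i > 0 ? a + i - 1 : NULL;
}
```

With mytype mta[5], previous_or_null(mta, 5, 0) returns NULL instead of evaluating mta - 1, and previous_or_null(mta, 5, 3) returns &mta[2].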

It is equivalent to

mytypeAmazingFunction(&mtp[i], &mtp[i-1], <some_other_args>);

which is always portable, isn't it?

Not if `i' is zero.

Ben Bacarisse · Jul 24, 2010

matt said:
mytype mta[5];
...
mytype * mtp = &mta[i]; // 0 <= i <= 4

I meant: OK, I understand the common pitfalls of working with pointer
arithmetic (e.g. scanning a vector of chars with a pointer to
integers); however, I'm dealing with pointer arithmetic in a different
context (mytypeAmazingFunction() on the mytype data type). Does
mytypeAmazingFunction(mtp, mtp-1, <some_other_args>) (using pointer
arithmetic) suffer from some problem when ported across multiple
platforms, so that I am constrained to use
mytypeAmazingFunction(&mta[i], &mta[i-1], <some_other_args>) (which is
always portable), or not?

E1[E2] means *((E1) + (E2)). &*(E) means E. Thus &mta[i] means mta + i
and &mta[i-1] means mta + (i-1). Your two function calls are
equivalent -- and they are both undefined when i == 0.

The brackets round (i-1) are significant in some cases. Were i to get a
value of 6, &mta[i-1] == mta + 5 is valid (provided you don't
dereference it) but mtp will already have been set to an invalid pointer
and all bets are off.
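
The equivalence can be checked with a small sketch. The struct layout of mytype and the helper name spellings_agree are assumptions for illustration only. The function restricts i to 1 <= i < n, the range in which every pointer involved is valid.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the thread's mytype. */
typedef struct { int value; } mytype;

/* Returns 1 if the pointer-arithmetic spelling and the indexing spelling
   produce the same addresses for this i. Requires 1 <= i < n so that
   neither mta + i nor mta + (i - 1) leaves the array. */
int spellings_agree(mytype *mta, size_t n, size_t i)
{
    assert(i >= 1 && i < n);

    mytype *mtp = &mta[i];                 /* same as mta + i */
    return mtp == mta + i
        && mtp - 1 == &mta[i - 1]          /* the two calls pass equal pointers */
        && &mta[i - 1] == mta + (i - 1);
}
```

For an array of five elements, spellings_agree returns 1 for every i from 1 to 4; i == 0 is rejected by the assertion because both spellings would be undefined there.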
