HP/UX Itanium C comiler & std C

S

Sion Roberts

While porting our application to HP/UX 11.23 Itanium, I came across
this situation where the compiler acts differently to any other UNIX
C/C++ compiler that I have come across in the last 10 years (including
other HP-UX platforms).

Consider the following code:

#include <stdio.h>

int main()
{
char *ptr = "AB";
char *ans;

ans = (*ptr++) ? ptr : "";

printf ("Answer is %s\n", ans);

return 0;
}

When running this, I would usually expect to see:

Answer is B

On Itanium it displays:

Answer is AB

There are a few HPUX compilers to choose from on this platform, but
they all behave in the same way. Compiler switches don't make any
difference. I am now using GCC, which works as I expect.

aCC on HPUX 11 PA-RISC also displays "AB".

Has anyone come across this before or confirm that the results I've
seen are invalid for ANSI C?

Sion.
 
M

Mark A. Odell

(e-mail address removed) (Sion Roberts) wrote in

While porting our application to HP/UX 11.23 Itanium, I came across
this situation where the compiler acts differently to any other UNIX
C/C++ compiler that I have come across in the last 10 years (including
other HP-UX platforms).

Consider the following code:

#include <stdio.h>

int main()
{
char *ptr = "AB";
char *ans;

ans = (*ptr++) ? ptr : "";

printf ("Answer is %s\n", ans);

return 0;
}

When running this, I would usually expect to see:

Answer is B

No, not unless you used %c instead of %s.
On Itanium it displays:

Answer is AB

This is correct. Strings are null terminated. The char 'B' is not '\0'
thus it gets printed along with 'A'.
 
L

Lawrence Kirby

While porting our application to HP/UX 11.23 Itanium, I came across this
situation where the compiler acts differently to any other UNIX C/C++
compiler that I have come across in the last 10 years (including other
HP-UX platforms).

Consider the following code:

#include <stdio.h>

int main()
{
char *ptr = "AB";
char *ans;

ans = (*ptr++) ? ptr : "";

printf ("Answer is %s\n", ans);

return 0;
}

When running this, I would usually expect to see:

Answer is B

Correct, the ?: operator defines a sequence point after its first operand
is evaluated. Therefore the side-effect of incrementing ptr must be
complete before the value of ptr is read for the second operand (which it
is because *ptr++ is 'A' which is not null). So a pointer to the 2nd
character in the string should be assigned to ans.
On Itanium it displays:

Answer is AB

Looks like you've found a bona fide compiler bug.

Lawrence
 
K

Kenny McCormack

(e-mail address removed) (Sion Roberts) wrote in



No, not unless you used %c instead of %s.

Um, %c would print the lowest 8 bits of the pointer value stored in "ans",
as an ASCII character (on most machines - obviously O/T, of course...)
This is correct. Strings are null terminated. The char 'B' is not '\0'
thus it gets printed along with 'A'.

The idea of the above is that in:

ans = (*ptr++) ? ptr : "";

"ptr" is "supposed" to be incremented after it is tested but before its
value is fetched and assigned to "ans". The pedants in this group will
probably point out that this behavior is not required by "the standard".
 
P

pete

Mark said:
(e-mail address removed) (Sion Roberts) wrote in


No, not unless you used %c instead of %s.

Answer is B.
ans points to a string regardless of whether it points
to the first or second element of "AB".
 
?

=?ISO-8859-1?Q?Bj=F8rn_Augestad?=

Lawrence said:
Correct, the ?: operator defines a sequence point after its first operand
is evaluated. Therefore the side-effect of incrementing ptr must be
complete before the value of ptr is read for the second operand (which it
is because *ptr++ is 'A' which is not null). So a pointer to the 2nd
character in the string should be assigned to ans.

Can someone please clarify what is meant by this text, from C99 §6.5.15
Conditional operator, #4.

"If an attempt is made to modify the result of a conditional operator or
to access it after the next sequence point, the behavior is undefined".

Does the "access it" part apply to the OP's code or am I just
misinterpreting the standard?

TIA
Bjørn


[snip]
 
P

pete

Bjørn Augestad said:
Can someone please clarify what is meant by this text, from C99 §6.5.15
Conditional operator, #4.

"If an attempt is made to modify the result of a conditional
operator

I think it means that (a ? b : c) is an rvalue
and that you can't do something like this: ((a ? b : c)++)
or to access it after the next sequence point,
the behavior is undefined".
Does the "access it" part apply to the OP's code or am I just
misinterpreting the standard?

I don't know what that means.
 
R

Richard Tobin

Kenny McCormack said:
The idea of the above is that in:

ans = (*ptr++) ? ptr : "";

"ptr" is "supposed" to be incremented after it is tested but before its
value is fetched and assigned to "ans". The pedants in this group will
probably point out that this behavior is not required by "the standard".

No. The standard says there is a sequence point after the first operand
of the ? operator.

-- Richard
 
L

Lawrence Kirby

(rearranged to bring the standard text together)
I think it means that (a ? b : c) is an rvalue
and that you can't do something like this: ((a ? b : c)++)

That is true but I don't think it is what the text is for, because that is
caught by a constraint violation for ++ i.e. its operand must be a
modifiable lvalue. A constraint violation is the more significant error,
if you have a CV then further undefined behaviour is not an issue.

I'm wondering if this has something to do with "rvalue" structures e.g.

struct foo { int bar; } s1 = { 1 }, s2 = { 2 };

(expr ? s1 : s2).bar

but I haven't come up with an example that triggers undefined behaviour
from 6.5.15p4 and doesn't trip up elsewhere in the standard. That sentence
is an addition from C90, so either they spotted a case that hadn't been
spotted at the time of C90 or this has something to do with new C99
features.
I don't know what that means.

The standard talks about modifying the RESULT of the conditional
operator or accessing after the next sequence point, which the OPs code
does not do.

Lawrence
 
D

Dennis Handly

Richard Tobin ([email protected]) wrote:
: In article <[email protected]>,
: >The idea of the above is that in:
: > ans = (*ptr++) ? ptr : "";
: >
: >"ptr" is "supposed" to be incremented after it is tested but before its
: >value is fetched and assigned to "ans". The pedants in this group will
: >probably point out that this behavior is not required by "the standard".

: No. The standard says there is a sequence point after the first operand
: of the ? operator.
: -- Richard

Right, we finally saw that too. This is CR JAGae79312:
Incorrect order of evaluation for: (x++) ? x ...

This is fixed in A.05.57:
http://h21007.www2.hp.com/dspp/tech/tech_TechSoftwareDetailPage_IDX/1,1703,1743,00.html
 
C

Christian Bau

The idea of the above is that in:

ans = (*ptr++) ? ptr : "";

"ptr" is "supposed" to be incremented after it is tested but before its
value is fetched and assigned to "ans". The pedants in this group will
probably point out that this behavior is not required by "the standard".

Only pedants who don't know C. Those who know C will replace "supposed"
with "required" and remove the quotes:

"ptr" is absolutely, one hundred percent required to be incremented
after it is tested but before its value is fetched and assigned to
"ans".
 
K

Kenny McCormack

Only pedants who don't know C. Those who know C will replace "supposed"
with "required" and remove the quotes:

"ptr" is absolutely, one hundred percent required to be incremented
after it is tested but before its value is fetched and assigned to
"ans".

Well, that is good to know. Always nice to hear of instances where
sensible behavior is required by the standard.
 
P

pete

Lawrence said:
The standard talks about modifying the RESULT of the conditional
operator or accessing after the next sequence point,
which the OPs code does not do.

I still don't understand.
Would you please change the code in the above program to something
which does attempt to access the result of the conditional
operator after the next sequence point,
to better illustrate what you and the standard are talking about?
 
P

pete

"ptr" is absolutely, one hundred percent required to be incremented
after it is tested but before its value is fetched and assigned to
"ans".

I don't see ptr being tested.
*ptr++ is tested.
ptr may be incremented before the test instead of after.
It is the result of the postfix increment operation
which is the operand of the indirection operator. *(ptr++)
 
C

Christian Bau

pete said:
I still don't understand.
Would you please change the code in the above program to something
which does attempt to access the result of the conditional
operator after the next sequence point,
to better illustrate what you and the standard are talking about?

It is a very obscure thing, and difficult to write code that actually
creates the situation. Something like this:

typedef struct { int x; int a [2]; } mystruct;
static mystruct r, s;
static int i;

(i > 0 ? r : s).a [1] = 0;

The result of the ?: is some object, but not one of the objects r, s: It
is an unnamed object which contains either a copy of r or a copy of s.
Modifying that unnamed object invokes undefined behavior. You may have
to do some more work to make this compile.
 
S

Sion Roberts

Richard Tobin ([email protected]) wrote:
: In article <[email protected]>,
: >The idea of the above is that in:
: > ans = (*ptr++) ? ptr : "";
: >
: >"ptr" is "supposed" to be incremented after it is tested but before its
: >value is fetched and assigned to "ans". The pedants in this group will
: >probably point out that this behavior is not required by "the standard".

: No. The standard says there is a sequence point after the first operand
: of the ? operator.
: -- Richard

Right, we finally saw that too. This is CR JAGae79312:
Incorrect order of evaluation for: (x++) ? x ...

This is fixed in A.05.57:
http://h21007.www2.hp.com/dspp/tech/tech_TechSoftwareDetailPage_IDX/1,1703,1743,00.html

Dennis,

Thank you for this information. This is exactly what I wanted to know.

Sion.
 
L

Lawrence Kirby

I don't see ptr being tested.

True, although its value is used in the text expression.
*ptr++ is tested.
True

ptr may be incremented before the test instead of after.
True

It is the result of the postfix increment operation
which is the operand of the indirection operator. *(ptr++)

True, and the result of that operation is the unincremented value of ptr.
But you are correct that this doesn't stop the ptr object having an
incremented value written to it first.

Lawrence
 
C

Charlie Gordon

Christian Bau said:
It is a very obscure thing, and difficult to write code that actually
creates the situation. Something like this:

typedef struct { int x; int a [2]; } mystruct;
static mystruct r, s;
static int i;

(i > 0 ? r : s).a [1] = 0;

The result of the ?: is some object, but not one of the objects r, s: It
is an unnamed object which contains either a copy of r or a copy of s.
Modifying that unnamed object invokes undefined behavior. You may have
to do some more work to make this compile.

I guess the "proper" way to achieve this would be :

(i > 0 ? &r : &s)->a[1] = 0;

Are we dealing with the same issue in the code :

(ignore_case ? strcasecmp : strcmp) (s1, s2);

assuming proper declaration of strcasecmp.

How should it be written then ?

Chqrlie.

PS: I put emphasis around proper to express that standard compliant does not
necessarily mean readable and advisable.
 
P

pete

Christian said:
pete said:
I still don't understand.
Would you please change the code in the above program to something
which does attempt to access the result of the conditional
operator after the next sequence point,
to better illustrate what you and the standard are talking about?

It is a very obscure thing, and difficult to write code that actually
creates the situation. Something like this:

typedef struct { int x; int a [2]; } mystruct;
static mystruct r, s;
static int i;

(i > 0 ? r : s).a [1] = 0;

The result of the ?: is some object,
but not one of the objects r, s: It
is an unnamed object which contains either a copy of r or a copy of s.
Modifying that unnamed object invokes undefined behavior. You may have
to do some more work to make this compile.

Thank you.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,770
Messages
2,569,586
Members
45,097
Latest member
RayE496148

Latest Threads

Top