sequence points and evaluation order

J

John Smith

I've been playing with splint, which returns the following
warning for the code below:

statlib.c: (in function log_norm_pdf)
statlib.c(1054,31): Expression has undefined behavior (left
operand uses errno,
modified by right operand): (log(x) - mu) * (log(x) - mu)
Code has unspecified behavior. Order of evaluation of function
parameters or
subexpressions is not defined, so if a value is used and
modified in
different places not separated by a sequence point
constraining evaluation
order, then the result of the expression is unspecified.

How can the offending expression be written so that the order of
evaluation is properly specified? Can t2num be assigned the value
of any single expression equivalent to ((log(x)-mu) ^ 2) * -1.0?


/* Return log normal probability density at x.
mu=a_mean(ln(X)); sigma=s_stdev(ln(X)) (output OK) */
double log_norm_pdf(double x, double mu, double sigma)
{
double t1, t2num, t2den;

if(x <= 0.0) return 0.0;

t1 = 1.0 / (sqrt(2.0 * PI) * sigma * x);
t2num = -((log(x) - mu) * (log(x) - mu)); /* problem here */
t2den = (2.0 * (sigma * sigma));

return t1 * exp(t2num / t2den);
}

BTW: this function has been rigorously tested.
 
S

Spiros Bousbouras

John said:
I've been playing with splint, which returns the following
warning for the code below:

statlib.c: (in function log_norm_pdf)
statlib.c(1054,31): Expression has undefined behavior (left
operand uses errno,
modified by right operand): (log(x) - mu) * (log(x) - mu)
Code has unspecified behavior. Order of evaluation of function
parameters or
subexpressions is not defined, so if a value is used and
modified in
different places not separated by a sequence point
constraining evaluation
order, then the result of the expression is unspecified.

I'm a bit puzzled by this.
Is it unspecified or undefined ? And why is it either ?
When it says "operand uses errno" does it mean reads
the value of errno or it means reads or writes the value
of errno ?
How can the offending expression be written so that the order of
evaluation is properly specified? Can t2num be assigned the value
of any single expression equivalent to ((log(x)-mu) ^ 2) * -1.0?


/* Return log normal probability density at x.
mu=a_mean(ln(X)); sigma=s_stdev(ln(X)) (output OK) */
double log_norm_pdf(double x, double mu, double sigma)
{
double t1, t2num, t2den;

if(x <= 0.0) return 0.0;

t1 = 1.0 / (sqrt(2.0 * PI) * sigma * x);
t2num = -((log(x) - mu) * (log(x) - mu)); /* problem here */

You can write this as
t2num = -( a = log(x) - mu , a * a) ;
Obviously a needs to be defined.
You can even write
{ double a ;
t2num = -( a = log(x) - mu , a * a) ;
}
to let the compiler know that the value of a
won't be needed anymore. Perhaps it will
affect optimization.
 
R

Robert Gamble

John said:
I've been playing with splint, which returns the following
warning for the code below:

statlib.c: (in function log_norm_pdf)
statlib.c(1054,31): Expression has undefined behavior (left
operand uses errno,
modified by right operand): (log(x) - mu) * (log(x) - mu)
Code has unspecified behavior. Order of evaluation of function
parameters or
subexpressions is not defined, so if a value is used and
modified in
different places not separated by a sequence point
constraining evaluation
order, then the result of the expression is unspecified.

How can the offending expression be written so that the order of
evaluation is properly specified? Can t2num be assigned the value
of any single expression equivalent to ((log(x)-mu) ^ 2) * -1.0?


/* Return log normal probability density at x.
mu=a_mean(ln(X)); sigma=s_stdev(ln(X)) (output OK) */
double log_norm_pdf(double x, double mu, double sigma)
{
double t1, t2num, t2den;

if(x <= 0.0) return 0.0;

t1 = 1.0 / (sqrt(2.0 * PI) * sigma * x);
t2num = -((log(x) - mu) * (log(x) - mu)); /* problem here */

What splint is trying to tell you is that the value of errno is
dependent on the order of evaluation of the 2 function calls. There is
no undefined behavior here (there is a sequence point between any
possible modifications to errno) and since you are not using errno
there isn't anything to worry about. You could use the value of errno,
it's value is not indeterminate, but you just can't be sure which call
to log() is the one that set the current value of errno. If you want
to get rid of the splint message you could store the value of "log(x) -
mu" in a temporary variable and do the multiplication seperately
although you might incur a precision hit. Your best bet is probably to
disable the warning inline for this particular case:

/* @-evalorder@ */
t2num = -((log(x) - mu) * (log(x) - mu)); /* problem here */
/* @+evalorder@ */

Robert Gamble
 
F

Flash Gordon

Spiros said:
I'm a bit puzzled by this.
Is it unspecified or undefined ? And why is it either ?
When it says "operand uses errno" does it mean reads
the value of errno or it means reads or writes the value
of errno ?

The order of the two function calls is unspecified, but IIRC there is a
sequence point on calling a function, and the two calls are not
permitted to be run in parallel, so I can't see any undefined behaviour.
If either call sets errno it is guaranteed that errno will be set to a
non-0 value (standard functions do not set it to 0) so the code can
still test errno to see if an error has occurred and what at least one
error was. So I really don't see a problem.
You can write this as
t2num = -( a = log(x) - mu , a * a) ;
Obviously a needs to be defined.
You can even write
{ double a ;
t2num = -( a = log(x) - mu , a * a) ;
}
to let the compiler know that the value of a
won't be needed anymore. Perhaps it will
affect optimization.

Personally I would do something more like
{
double a = log(x) - mu;
t2num -= a * a;
}

In my opinion this is easier to read. I don't see any benefit in using
the comma operator for this. Alternatively, of course, have it defined
earlier. I would consider calling log with the same value twice to be a
needless introduction of inefficiency and one that is less likely to be
fixed by the compiler than many.

For similar reasons I would probably #define ROOT_2PI

Needless bracketing which, in my opinion, does not make it easier to read.

When assigning t1 I would not have used 1.0/whatever, instead I would
divide by t1 here.

It may test OK but it would not have got through a code review with me
without changes or *very* good technical explanations of why they should
not be made.
 
K

Keith Thompson

Flash Gordon said:
The order of the two function calls is unspecified, but IIRC there is
a sequence point on calling a function, and the two calls are not
permitted to be run in parallel, so I can't see any undefined
behaviour. If either call sets errno it is guaranteed that errno will
be set to a non-0 value (standard functions do not set it to 0) so the
code can still test errno to see if an error has occurred and what at
least one error was. So I really don't see a problem.
[...]

If log() is implemented as a macro, there may not be a sequence point.
 
R

Robert Gamble

Keith said:
Flash Gordon said:
The order of the two function calls is unspecified, but IIRC there is
a sequence point on calling a function, and the two calls are not
permitted to be run in parallel, so I can't see any undefined
behaviour. If either call sets errno it is guaranteed that errno will
be set to a non-0 value (standard functions do not set it to 0) so the
code can still test errno to see if an error has occurred and what at
least one error was. So I really don't see a problem.
[...]

If log() is implemented as a macro, there may not be a sequence point.

This is true, although it would be quite difficult to imagine a macro
version that calculates the log and sets errno all without introducing
a sequence point. In any case, it is definitely a QOI issue although I
find it somewhat disturbing that the Standard allows this. I wonder
why the Standard committee didn't stick in something that prohibits the
modification of an object with static duration from a macro version of
a library function without appropriate sequence points.

Robert Gamble
 
E

ena8t8si

Robert said:
Keith said:
Flash Gordon said:
Spiros Bousbouras wrote:
John Smith wrote:

I've been playing with splint, which returns the following
warning for the code below:

statlib.c: (in function log_norm_pdf)
statlib.c(1054,31): Expression has undefined behavior (left
operand uses errno,
modified by right operand): (log(x) - mu) * (log(x) - mu)
Code has unspecified behavior. Order of evaluation of function
parameters or
subexpressions is not defined, so if a value is used and
modified in
different places not separated by a sequence point
constraining evaluation
order, then the result of the expression is unspecified.
I'm a bit puzzled by this.
Is it unspecified or undefined ? And why is it either ?
When it says "operand uses errno" does it mean reads
the value of errno or it means reads or writes the value
of errno ?

The order of the two function calls is unspecified, but IIRC there is
a sequence point on calling a function, and the two calls are not
permitted to be run in parallel, so I can't see any undefined
behaviour. If either call sets errno it is guaranteed that errno will
be set to a non-0 value (standard functions do not set it to 0) so the
code can still test errno to see if an error has occurred and what at
least one error was. So I really don't see a problem.
[...]

If log() is implemented as a macro, there may not be a sequence point.

This is true, although it would be quite difficult to imagine a macro
version that calculates the log and sets errno all without introducing
a sequence point.

Despite the evidence of the error message above? That
makes it pretty easy to imagine I would say.
In any case, it is definitely a QOI issue although I
find it somewhat disturbing that the Standard allows this. I wonder
why the Standard committee didn't stick in something that prohibits the
modification of an object with static duration from a macro version of
a library function without appropriate sequence points.

Better still, a requirement that a macro version behave
as if the actual function would.
 
S

Spoon

Flash said:
Personally I would do something more like
{
double a = log(x) - mu;
t2num -= a * a;
}

Is -= a typo?

The original code was:

t2num = -((log(x) - mu) * (log(x) - mu));

(with t2num uninitialized.)

Isn't
t2num -= a * a;
equivalent to
t2num = t2num - (a * a);
?
 
R

Robert Gamble

Robert said:
Keith said:
Spiros Bousbouras wrote:
John Smith wrote:

I've been playing with splint, which returns the following
warning for the code below:

statlib.c: (in function log_norm_pdf)
statlib.c(1054,31): Expression has undefined behavior (left
operand uses errno,
modified by right operand): (log(x) - mu) * (log(x) - mu)
Code has unspecified behavior. Order of evaluation of function
parameters or
subexpressions is not defined, so if a value is used and
modified in
different places not separated by a sequence point
constraining evaluation
order, then the result of the expression is unspecified.
I'm a bit puzzled by this.
Is it unspecified or undefined ? And why is it either ?
When it says "operand uses errno" does it mean reads
the value of errno or it means reads or writes the value
of errno ?

The order of the two function calls is unspecified, but IIRC there is
a sequence point on calling a function, and the two calls are not
permitted to be run in parallel, so I can't see any undefined
behaviour. If either call sets errno it is guaranteed that errno will
be set to a non-0 value (standard functions do not set it to 0) so the
code can still test errno to see if an error has occurred and what at
least one error was. So I really don't see a problem.
[...]

If log() is implemented as a macro, there may not be a sequence point.

This is true, although it would be quite difficult to imagine a macro
version that calculates the log and sets errno all without introducing
a sequence point.

Despite the evidence of the error message above? That
makes it pretty easy to imagine I would say.

It's not too difficult to imagine *a* function that would do this as
virtually any library function may set errno, but it would be difficult
to imagine a conforming implementation of the log function that does as
its implementation is non-trivial and it returns a non-int value.
Better still, a requirement that a macro version behave
as if the actual function would.

There is a sequence point before any function in C returns. Requiring
that all macro versions of library functions have this sequence point
would unnecessarily complicate and invalidate many existing macros
such as those covering functions in ctype.h. As long as the macro does
not modify objects with static duration I don't see a need for forcing
the macro version to implement the sequence point.

Robert Gamble
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,764
Messages
2,569,567
Members
45,041
Latest member
RomeoFarnh

Latest Threads

Top