Direct computation of integer limits in K&R2?


Peter Nilsson

Ark Khasin said:
Peter said:
Quoting pete:

  #if !(1 & -1)
      printf("ones complement\n");
  #elif -1 & 2
      printf("twos complement\n");
  #else
      printf("sign magnitude\n");
  #endif

Pete asked if Gray code or other weird representations
could be used for negative integers, but it seems they
cannot be, despite the loose wording of C90.
[cf <http://groups.google.com/group/comp.std.c/msg/5f332b9aa22b92ec>]

Is there any assurance that the representation of integer
constants in the preprocessor is in any way related to the
representation of integer objects
- falls into one of the three models of representation of
integer objects
?

Notionally, yes.

C89 draft 3.8.1

The resulting tokens comprise the controlling constant
expression which is evaluated according to the rules of
$3.4 using arithmetic that has at least the ranges
specified in $2.2.4.2, except that int and unsigned int
act as if they have the same representation as,
respectively, long and unsigned long.

There's similar wording in C99, though intmax_t and uintmax_t
are used.

Unfortunately, many implementations are somewhat inconsistent.
Consider...

#include <limits.h>
#include <stdio.h>

int main(void)
{
    printf("ULONG_MAX = %lu\n", ULONG_MAX);

#if -1 == -1ul
    puts("-1 == -1ul [pre]");
#endif

    if (-1 == -1ul)
        puts("-1 == -1ul");

#if 4294967295 == -1ul
    puts("4294967295 == -1ul [pre]");
#endif

    if (4294967295 == -1ul)
        puts("4294967295 == -1ul");

    return 0;
}

The output for me using delorie gcc 4.2.1 with -ansi
-pedantic is...

ULONG_MAX = 4294967295
-1 == -1ul [pre]
-1 == -1ul
4294967295 == -1ul

As you can see, there is a discrepancy between the way
that preprocessor arithmetic is evaluated. Fact is,
gcc is not the only compiler to show problems.
[Of course the above can be repaired so as to not use the
preprocessor. Just asking...]

Indeed, using the expressions in a normal 'if' would be
the way to go. There's no reason why, say, int couldn't use
two's complement while long uses ones' complement.
 

Micah Cowan

6.2.6.2p2 says ("the first two" below are sign-and-magnitude and
two's complement):

"Which of these applies is implementation-defined, as is whether the
value with sign bit 1 and all value bits zero (for the first two),
or with sign bit and all value bits 1 (for ones' complement), is a
trap representation or a normal value."

Huh. I managed to forget that somehow. My bad, Flash.
 

Ark Khasin

Peter said:
Ark Khasin said:
Peter said:
.... snip ...

Indeed, using the expressions in a normal 'if' would be
the way to go. There's no reason why, say, int couldn't use
two's complement while long uses ones' complement.

Wow. I hadn't thought of this possibility. It would imply malicious
intent on the implementer's part for it to have an overhead in the
simplest conversions. And it won't be for all markets :)
But I don't see why such a horror implementation would be illegal.
Thank you for the example.
-- Ark
 

user923005

Yes. Unlike C99, unsigned to signed integer conversion
is implementation-defined without the possibility of
raising a signal. So...

INT_MIN isn't computed per se; rather, it's derived by
determining the representation for negative ints. [I
know pete posted some very simple constant expressions,
though it was some time ago.]

Would you say that this exercise is overly complex for that point in
K&R2?

I will be pretty amazed to see anyone write a portable solution that
does it all (floating point is also requested).
I guess that signed integer <TYPE>_MIN values will be hard to come up
with.

Will computation of DBL_MAX signal a floating point exception?

I guess that it is the hardest exercise in the whole book, by far.
 

Flash Gordon

Micah Cowan wrote, On 12/03/08 00:30:
It's only allowed to be a trap representation on _non_ two's
complement representations. sign bit = 1 and all value bits = 0 (and
padding bits at non-trap values) would necessarily be the minimum
representable value.

Wrong. The C standard explicitly allows for it to be a trap
representation on two's complement representations. Quoting from N1256...

| ... If the sign bit is one, the value shall be modified in one of
| the following ways:
| — the corresponding value with sign bit 0 is negated (sign and
| magnitude);
| — the sign bit has the value −(2^N) (two's complement);
| — the sign bit has the value −(2^N − 1) (ones' complement).
| Which of these applies is implementation-defined, as is whether the
| value with sign bit 1 and all value bits zero (for the first two), or
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
| with sign bit and all value bits 1 (for ones' complement), is a trap
| representation or a normal value. In the case of sign and magnitude
| and ones' complement, if this representation is a normal value it is
| called a negative zero.

two's complement is one of the first two.

The above is from section 6.2.6.2 para 2.
 

Flash Gordon

Micah Cowan wrote, On 12/03/08 05:55:
Huh. I managed to forget that somehow. My bad, Flash.

It's easy to forget. I'm not actually aware of any implementations which
make use of this freedom.
 

Kaz Kylheku

Hello all,

In K&R2 one exercise asks the reader to compute and print the limits for
the basic integer types. This is trivial for unsigned types. But is it
possible for signed types without invoking undefined behaviour
triggered by overflow? Remember that the constants in limits.h cannot
be used.

You can use shifting to determine how many bits there are in the given
signed integral type. Start with 1 and keep shifting it left until it
drops off. With that information, you can construct the greatest
possible positive integer value: which is all 1's except for the sign
bit, which is zero. The greatest possible negative value is either the
additive inverse of that value, or, in the case of two's complement,
that value less one. You can detect whether two's complement is in
effect by applying a simple test to the value -1:

switch (-1 & 3) {
case 1: /* ...01: sign magnitude */
    break;
case 2: /* ...10: ones' complement */
    break;
case 3: /* ...11: two's complement */
    break;
}

That's the general approach I'd take to the exercise.
 

ymuntyan

You can use shifting to determine how many bits there are in the given
signed integral type. Start with 1 and keep shifting it left until it
drops off.

That's UB, no?
With that information, you can construct the greatest
possible positive integer value: which is all 1's except for the sign
bit, which is zero. The greatest possible negative value is either the
additive inverse of that value, or, in the case of two's complement,
that value less one.

And this may be a trap representation.

Yevgen
 

CBFalconer

Ark said:
Peter Nilsson wrote:
.... snip ...
Unfortunately, many implementations are somewhat inconsistent.
Consider...

#include <limits.h>
#include <stdio.h>

int main(void) {
    printf("ULONG_MAX = %lu\n", ULONG_MAX);

#if -1 == -1ul
    puts("-1 == -1ul [pre]");
#endif

    if (-1 == -1ul)
        puts("-1 == -1ul");

#if 4294967295 == -1ul
    puts("4294967295 == -1ul [pre]");
#endif

    if (4294967295 == -1ul)
        puts("4294967295 == -1ul");
    return 0;
}

The output for me using delorie gcc 4.2.1 with -ansi
-pedantic is...

ULONG_MAX = 4294967295
-1 == -1ul [pre]
-1 == -1ul
4294967295 == -1ul

As you can see, there is a discrepancy between the way
that preprocessor arithmetic is evaluated. Fact is,
gcc is not the only compiler to show problems.

What's wrong with that, remembering that (for gcc, on x86) a
long is defined to be identical to an int?
 

CBFalconer

Kaz said:
You can use shifting to determine how many bits there are in the
given signed integral type. Start with 1 and keep shifting it
left until it drops off. With that information, you can construct
....

No, because the moment it 'drops off' you have run into
implementation (or undefined) behaviour. You can't write portable
code to do this. You can possibly write code that executes on YOUR
machinery.
 

Peter Nilsson

Ark Khasin said:
Wow. I haven't thought of this possibility. It would imply
malicious intent of the implementer

Not necessarily. Non-normalised floating point is one way of
implementing 1c integers. I could imagine that large integers
might theoretically be implemented this way.

[I remember old Macs supporting 80 and 96-bit floating point
types. Such types could be used to implement 64-bit integers
on 16/32 bit machines.]
for it to have an overhead in the simplest conversions.

Nevertheless such design decisions are sometimes made.
And it won't be for all markets :) But I don't see why such
a horror implementation would be illegal.

Horror or not, it's just another virtual C machine to me. ;)
 

Malcolm McLean

santosh said:
Hello all,

In K&R2 one exercise asks the reader to compute and print the limits for
the basic integer types. This is trivial for unsigned types. But is it
possible for signed types without invoking undefined behaviour
triggered by overflow? Remember that the constants in limits.h cannot
be used.
I don't think there's a perfect answer.

However this is the closest I could get.

double x = 0;
int testme;

do {
    x++;
    testme = (int) x;
} while ((double) testme == x);

printf("Biggest integer %g\n", x - 1);

It will fail if not all ints are exactly representable by a double,
which is the case on a machine with 64-bit ints.
(Wail, gnash.)
 

Kaz Kylheku

That's UB, no?

Unfortunately it is. Shifting a bit into the sign bit is UB. Only a
positive value whose double is representable in the type may be
shifted left by one bit.

This means that the sign bit is quite impervious to bit manipulation.
 

Ben Bacarisse

Malcolm McLean said:
I don't think there's a perfect answer.

However this is the closest I could get.

double x = 0;
int testme;

do {
    x++;
    testme = (int) x;
} while ((double) testme == x);

printf("Biggest integer %g\n", x - 1);

I don't think you need to be so cautious -- ints must use binary, so
you could start at 1 and repeatedly double x and try to convert x-1.
Even so, you have not gained anything -- the conversion to int, when
the value is out of range, is still implementation-defined (and in
C99 may even raise a signal).
 

Ben Bacarisse

Kaz Kylheku said:
Unfortunately it is. Shifting a bit into the sign is UB. Only a
positive value whose double is representable may be shifted left by
one bit.

This means that the sign bit is quite impervious to bit
manipulation.

It must participate in other bit operations, though, like ~, &, | and
^. Even so, I can't see any way to avoid UB when trying to calculate
the range of int. Equally, I don't have a persuasive argument that it
*can't* be done, either.
 

user923005

It must participate in other bit operations, though, like ~, &, | and
^. Even so, I can't see any way to avoid UB when trying to calculate
the range of int.  Equally, I don't have a persuasive argument that it
*can't* be done, either.

To compound things, imagine a C implementation where all integral
types were 64 bits (including char).
Even the undefined behavior hacks I posted will fail on those.
In short, I think it is a really difficult problem to solve.
If someone can define a sensible solution, I would be very pleased to
see it.
It might be interesting to see what DMR has to say about it.
 

Ioannis Vranos

santosh said:
Hello all,

In K&R2 one exercise asks the reader to compute and print the limits for
the basic integer types. This is trivial for unsigned types. But is it
possible for signed types without invoking undefined behaviour
triggered by overflow? Remember that the constants in limits.h cannot
be used.


C95:

#include <stdio.h>

int main(void)
{
    unsigned x = -1;

    int INTMAX = x / 2;

    int INTMIN = -INTMAX - 1;

    printf("INTMIN: %d\t", INTMIN);
    printf("INTMAX: %d\n", INTMAX);

    return 0;
}
 

Richard Heathfield

Peter Nilsson said:
What if UINT_MAX == INT_MAX,

I don't think it can. "For each of the signed integer types, there is a
corresponding (but different) unsigned integer type (designated with the
keyword unsigned) that uses the same amount of storage (including sign
information) and has the same alignment requirements. The range of
nonnegative values of a signed integer type is a subrange of the
corresponding unsigned integer type, and the representation of the same
value in each type is the same." So for UINT_MAX to be == INT_MAX, ints
would need to squeeze twice as many values into the same number of bits as
unsigned ints.
or UINT_MAX = 4*INT_MAX+3?

See above.
What if INT_MIN == -INT_MAX?

That, however, is a valid objection. For example, INT_MIN might be -32767
rather than -32768.
 

Ioannis Vranos

Richard said:
Peter Nilsson said:
.... snip ...

That, however, is a valid objection. For example, INT_MIN might be -32767
rather than -32768.
C95:

Since sizeof(N) == sizeof(signed N) == sizeof(unsigned N),

where N can be char, short, int, or long,

and as you mentioned they use the same amount of storage, how can
INT_MIN be equal to -INT_MAX, since the range of values is the same?
 
