portable and efficient signed bitfields

Andrew Oakley · Jan 22, 2009

I'm trying to fix up some code that uses a signed integer to store
various formed of packed data. Signed integer values are sometimes
present in the top 29 bits. The current code does:

signed int data;
data = ... | value << 3; // store value in top bits
value = data >> 3; //get value out of top bits

This is somewhat broken as shifting of negative numbers is
implementation defined. It also gets very hard to read.

I've got some new code that looks more like this:

static unsigned int store_signed(signed int i, unsigned char len) {
return (i < 0) ? (1 << len) - ((unsigned int)-i) : i;
}
static signed int load_signed(unsigned int i, unsigned char len) {
return (i & ((1 << (len - 1)) - 1)) - (i & (1 << (len - 1)));
}
struct {
unsigned int something_else:3;
unsigned int value:29;
} data;
data.something_else = ...
data.value = store_signed(value, 29);
value = load_signed(data.value, 29);

Using bitfields I can get rid of the excessive bitwise manipulations
happening in the program, which should make it much clearer. The load
and store routines turn the signed values into unsigned values so that
they can be stored in the bitfields (behaviour with signed bitfield
values is also implementation defined).

The store_signed routine is fast - GCC removes the lot, and we end up
with the same code as before, but I can't get the load_signed one to
generate the nice compact code that I want - a single sar (shift
arithmetic right) to do the unpacking and sign extension.

Any ideas would be appreciated - the only real constraint is that I
have to keep the same binary format as before.

Ben Bacarisse · Jan 22, 2009

Andrew Oakley said:
I'm trying to fix up some code that uses a signed integer to store
various formed of packed data. Signed integer values are sometimes
present in the top 29 bits.

I've got some new code that looks more like this:

static unsigned int store_signed(signed int i, unsigned char len) {
return (i < 0) ? (1 << len) - ((unsigned int)-i) : i;
}
static signed int load_signed(unsigned int i, unsigned char len) {
return (i & ((1 << (len - 1)) - 1)) - (i & (1 << (len - 1)));
}

The "usual" code is:

static signed int load_signed(int i, unsigned char len) {
int mask = 1 << (len - 1);
return (i ^ mask) - mask;
}

This is taken from [1]. Obviously you can simplify this since you
have only two cases when len==29 and len==3. You could write load_top
and load_bottom that have a constant mask built-in.

struct {
unsigned int something_else:3;
unsigned int value:29;
} data;
data.something_else = ...
data.value = store_signed(value, 29);
value = load_signed(data.value, 29);

Using bitfields I can get rid of the excessive bitwise manipulations
happening in the program, which should make it much clearer. The load
and store routines turn the signed values into unsigned values so that
they can be stored in the bitfields (behaviour with signed bitfield
values is also implementation defined).

How so? In the bad old days, you could not rely on signed int
bit-fields but I think you can now (i.e. since the 89 standard).

The store_signed routine is fast - GCC removes the lot, and we end up
with the same code as before, but I can't get the load_signed one to
generate the nice compact code that I want - a single sar (shift
arithmetic right) to do the unpacking and sign extension.

Any ideas would be appreciated - the only real constraint is that I
have to keep the same binary format as before.

Signed bit-fields win for me, but I have no idea what code you'll get.

[1] http://graphics.stanford.edu/~seander/bithacks.html.

Andrew Oakley · Jan 22, 2009

The "usual" code is:

static signed int load_signed(int i, unsigned char len) {
int mask = 1 << (len - 1);
return (i ^ mask) - mask;
}

I think thats much clearer, thanks.

How so? In the bad old days, you could not rely on signed int
bit-fields but I think you can now (i.e. since the 89 standard).

I think i must have just got confused by the spec:

A bit-field may have type int , unsigned int , or signed int .
Whether the high-order bit position of a ``plain'' int bit-field is
treated as a sign bit is implementation-defined. A bit-field is
interpreted as an integral type consisting of the specified number of
bits.

I guess that just means if you do "int" instead of "signed int" it is
implementation defined, not that the whole thing is implementation
defined.

Signed bit-fields win for me, but I have no idea what code you'll get.

Yes, having read the spec more carefully I would agree, and the
generated code is pretty elegant in GCC.

Thanks for the help.

Phil Carmody · Jan 24, 2009

Ben Bacarisse said:
The "usual" code is:

static signed int load_signed(int i, unsigned char len) {
int mask = 1 << (len - 1);
return (i ^ mask) - mask;
}

The ^ is IDB or UB according to 6.5 (4) and then the - may invoke
UB according to 6.5 (5), surely?

Phil

Harald van DÄ³k · Jan 24, 2009

Ben Bacarisse said:
Ben Bacarisse said:

[...]
The "usual" code is:

static signed int load_signed(int i, unsigned char len) {
int mask = 1 << (len - 1);
return (i ^ mask) - mask;
}

Click to expand...

The ^ is IDB or UB according to 6.5 (4) and then the - may invoke UB
according to 6.5 (5), surely?

Depending on the function arguments, this function may have
implementation-defined or undefined behaviour, but only if the function
arguments are incorrect for how the function is supposed to be used.

_Bool bitfields and cast / assignment	5	Jun 14, 2010
Signed Bitfields	1	Nov 1, 2007
Bitmask vs bitfields	6	Jun 3, 2010
Alternative approach to bitfields	9	Jul 22, 2012
Portability issues (union, bitfields)	7	Nov 4, 2009
Bitmask vs bitfields	10	Apr 17, 2008
enums and signed/unsigned	4	May 12, 2013
Printing the bitfields	18	Mar 17, 2009

portable and efficient signed bitfields

Andrew Oakley

Ben Bacarisse

Andrew Oakley

Phil Carmody

Harald van DÄ³k

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads