How to determine the byte order of machine.

Joe Pfeiffer · Feb 20, 2012

BartC said:
I was probably thinking of something else where the representation was
not straightforward.

DEC floating point, perhaps?

BartC · Feb 20, 2012

Scott Fluhrer said:
That is incorrect. The 68K processor was consistently bigendian, and so
0xDDCCBBAA would be stored as the bytes (DD, CC, BB, AA).

I'd been looking at this diagram (top of page 461):

http://tinyurl.com/7vjemjc

But perhaps I'd misunderstood what they meant by byte 0, 1, 2 and 3. I
assumed byte 0 was least significant, the same way bit 0 is. I could be
wrong.

Keith Thompson · Feb 20, 2012

Joe Pfeiffer said:
If I wanted to check whether a machine were big-endian or little-endian,
my first choice would be to see whether the htonl macro (which converts
a host-order long to a network-order long, which we know in turn is
defined to be big-endian) changes anything. I expect that macro is
outside the scope of this newsgroup, since I'd be surprised if it were
part of the C standard.

It isn't.

Eric Sosman · Feb 20, 2012

[...]
You could also (on a machine with 8-bit bytes) declare an array of 4
chars, cast the array to a long and assign a value to it, then print the
values of the eight chars.

This particular suggestion has cropped up a couple times already
in this thread, so maybe it's time to point out the error: Since an
array of char has no particular alignment requirement, it might not
be aligned strictly enough for a `long' (or `int' or anything else).

unsigned char c[sizeof(unsigned long)] = { 1 }; // others 0
unsigned long *lp = (unsigned long*)&c[0]; // undefined
printf("%lX\n", *lp); // undefined

If you truly want to engage in this sort of thing, do it the
other way around:

unsigned long l = 1;
unsigned char *cp = (unsigned char*)&l;
for (int i = 0; i < sizeof l; ++i)
printf("%X ", cp);
printf("\n");

That is, begin with the multi-byte value properly aligned, then
inspect its bytes with a character pointer that needs no alignment.

Shao Miller · Feb 20, 2012

[...]
You could also (on a machine with 8-bit bytes) declare an array of 4
chars, cast the array to a long and assign a value to it, then print the
values of the eight chars.

Click to expand...

This particular suggestion has cropped up a couple times already
in this thread, so maybe it's time to point out the error: Since an
array of char has no particular alignment requirement, it might not
be aligned strictly enough for a `long' (or `int' or anything else).

unsigned char c[sizeof(unsigned long)] = { 1 }; // others 0
unsigned long *lp = (unsigned long*)&c[0]; // undefined
printf("%lX\n", *lp); // undefined

If you truly want to engage in this sort of thing, do it the
other way around:

unsigned long l = 1;
unsigned char *cp = (unsigned char*)&l;
for (int i = 0; i < sizeof l; ++i)
printf("%X ", cp);
printf("\n");

That is, begin with the multi-byte value properly aligned, then
inspect its bytes with a character pointer that needs no alignment.

Or use a 'union', perhaps:

int big_endian(void) {
const union {
unsigned long val;
unsigned char bytes[sizeof (unsigned long)];
} test = {42};
return test.bytes[0] != 42;
}

Ben Bacarisse · Feb 20, 2012

BartC said:
I'd been looking at this diagram (top of page 461):

http://tinyurl.com/7vjemjc

But perhaps I'd misunderstood what they meant by byte 0, 1, 2 and 3. I
assumed byte 0 was least significant, the same way bit 0 is. I could
be wrong.

In all such diagrams that I've seen, bytes are numbered by address.
Byte 0 will be the byte with the lowest address. What varies from
machine to machine is the significance of byte 0 vs. that of byte 1.
However you read the numbers, I still can't see how you get BB,AA,DD,CC
from that diagram!

Joe Pfeiffer · Feb 20, 2012

Eric Sosman said:
[...]
You could also (on a machine with 8-bit bytes) declare an array of 4
chars, cast the array to a long and assign a value to it, then print the
values of the eight chars.

Click to expand...

This particular suggestion has cropped up a couple times already
in this thread, so maybe it's time to point out the error: Since an
array of char has no particular alignment requirement, it might not
be aligned strictly enough for a `long' (or `int' or anything else).

unsigned char c[sizeof(unsigned long)] = { 1 }; // others 0
unsigned long *lp = (unsigned long*)&c[0]; // undefined
printf("%lX\n", *lp); // undefined

If you truly want to engage in this sort of thing, do it the
other way around:

unsigned long l = 1;
unsigned char *cp = (unsigned char*)&l;
for (int i = 0; i < sizeof l; ++i)
printf("%X ", cp);
printf("\n");

That is, begin with the multi-byte value properly aligned, then
inspect its bytes with a character pointer that needs no alignment.

Ah, of course. I don't think I've ever seen an array that wasn't
aligned appropriately when required, but you are correct.

BartC · Feb 20, 2012

Ben Bacarisse said:
In all such diagrams that I've seen, bytes are numbered by address.
Byte 0 will be the byte with the lowest address. What varies from
machine to machine is the significance of byte 0 vs. that of byte 1.
However you read the numbers, I still can't see how you get BB,AA,DD,CC
from that diagram!

If the byte numbers are simply offsets from the start address, then the
diagram doesn't show whether the high or low half is stored first. So it
could be BB,AA, DD,CC or DD,CC, BB,AA. But it looks like it's the latter,
and it must be some other system which is mixed up.

Ben Bacarisse · Feb 20, 2012

BartC said:
If the byte numbers are simply offsets from the start address, then
the diagram doesn't show whether the high or low half is stored
first.

No it doesn't. I think the diagram just illustrates addressing, not
significance within a word. It's the following text that adds this
detail at the very end of the paragraph. The diagram show the
significance of words within long words (the (H) and (L) in the example)
but not of bytes within words. It would not have hurt to add (H) and
(L) after the bytes as well, but the author chose not to.

So it could be BB,AA, DD,CC or DD,CC, BB,AA. But it looks like
it's the latter, and it must be some other system which is mixed up.

The (H) makes it clear that the first word (byte pair) must be the
high-order one, so you must reject BB,AA (and AA,BB) based on the
diagram alone. You need to read the text to tell that it's DD,CC;BB,AA
rather than CC,DD;AA,BB.

Nick Keighley · Feb 21, 2012

Your best bet is to write code that works with values, regardless
of how those values are represented. Ideally, you would not even
know how the machine represents eight hundred seventeen; you should
concern yourself with finding its factors or comparing it to nine
hundred twenty-six or whatever. It is occasionally necessary to
pierce the veil, but not very often.

usually when you have to deal with external representation. You may
care about byte order and such like when data gets stuffed down comms
links or into files. But well written programs only have a tiny amount
of code that cares about this

Noob · Feb 21, 2012

Kenneth said:
Does it matter if I write it "six", "6", "VI", "seks",

"seks" machine?

How to keep the order of executing tasks? - Help needed.	1	Feb 21, 2023
WIN32 - Update Text in a Window in order to show its size in Pixels and coordinates	0	Oct 4, 2023
Write int as a 4 byte big-endian to file.	43	Mar 10, 2012
Program to find the largest integer element of an array.	1	Mar 2, 2022
Comparison of Integer and Pointer (that's supposed to be an Integer). Where did I go wrong?	0	Nov 19, 2022
Macro for setting MSB - Intended to work on both Little andBig-endian machines	16	Mar 26, 2013
Macro for setting MSB - Intended to work on both Little andBig-endian machines	0	Mar 26, 2013
Macro for setting MSB - Intended to work on both Little andBig-endian machines	0	Mar 26, 2013

How to determine the byte order of machine.

Joe Pfeiffer

BartC

Keith Thompson

Eric Sosman

Shao Miller

Ben Bacarisse

Joe Pfeiffer

BartC

Ben Bacarisse

Nick Keighley

Noob

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads