I have a guess (it would be good if someone on the standards committee
confirmed or denied it):
Let's say thread 1 calculates an object o1.
Then thread 1 calculates an object o2 using object o1 (that is, o1
"carries a dependency" to o2).
Then thread 1 releases o2.
Then (in the sense of "afterwards") thread 2 either acquires or
consumes o2.
The difference is that if thread 2 acquires o2, it is only guaranteed
to see the value of o2 as calculated by thread 1, but not o1.
Otherwise, if thread 2 consumes o2, it is guaranteed to see both o1
and o2 as calculated by thread 1.
In hardware terms this may mean that an implementation is required to
execute a read memory barrier for o2 only if an "acquire" operation is
used, and for o2 and all of its "dependency carriers" if "consume" is
used.
-Pavel
PS. Again, this was just a not-so-educated guess (the lock-free
approach has not worked too well for me so far; "hybrid" primitives
like modern implementations of POSIX mutexes or Linux futexes have
seemed to do better. They relieve me from thinking too hard when I
need to switch from polling atomics to waiting on a lock and back...
atomically (sounds like a catch-22, which it probably is). I am not
sure how or whether such hybrids can be programmed in the C++0x
threading model and would be interested to learn people's thoughts on
this).
I'm not sure I have it quite correct either, but I think that Pavel is
mistaken.
From my understanding, here is a simple example which should prove
enlightening:
// Forgive my pseudo-code-like stuff; I don't have access to an actual
// compiler at the moment.

// initial conditions
int x = 0;
int y = 0;
std::atomic<int*> z(0);

// all threads started concurrently from the initial conditions

// thread 1
x = 1;
y = 2;
z.store(&y, std::memory_order_release);

// thread 2
int* p = z.load(std::memory_order_acquire);
if (p != 0)
    std::cout << x << " " << *p << std::endl;

// thread 3
int* p = z.load(std::memory_order_consume);
if (p != 0)
    std::cout << x << " " << *p << std::endl;

// end example
Now, if I understand this correctly: if thread 2 prints something, it
will print "1 2". The acquire read of z in thread 2 reads a release
write of thread 1, so there exists a happens-before relationship
between those two memory actions; thus the subsequent read of x in
thread 2 must see the earlier write to x in thread 1.
With thread 3, I'm not sure of the exact particulars - it might have a
race condition. Let's suppose that the consume read of z in thread 3
reads the release write of z in thread 1. This creates what the
standard calls a "dependency-ordered before" relationship. Unlike the
acquire-release pairing, which guarantees a full happens-before, the
consume-release pairing orders only those reads that are
data-dependent, directly or indirectly, on the value returned by the
consume load. The read of x in thread 3 does not depend on the consume
read (of p), so it has no ordering guarantee, and it may be a race
condition. (Anyone more knowledgeable help me out?) However, the read
of the object *p is an indirection through the value obtained by the
consume read of p, so given
- the consume read on p in thread 3 read the release write on p in
thread 1,
then
- the subsequent (atomic or non-atomic) read on *p is guaranteed to
see the previous (atomic or non-atomic) write in thread 1.
Or something like that. This is generalizing a bit, but it's intended
to get the point across.
To be clear, std::memory_order_consume provides guarantees which are a
strict subset of the guarantees of std::memory_order_acquire - you can
always replace a correct std::memory_order_consume with
std::memory_order_acquire and the program remains correct, since
acquire only adds ordering guarantees.