Explanation of list reference

Chris Angelico · Feb 15, 2014

You just need to recognise that
objects can contain themselves:

py> L = []
py> L.append(L)
py> L in L
True

Of course, if you are a fan of the Doctor Who television show, you won't
be concerned by this. If the TARDIS can materialise inside itself, then
there isn't any problem with lists containing themselves either.

They certainly can. In my MUD, every object has a location (which may
be the equivalent of None); generally, a person's location is a room,
and a grabbable object's location is either a room or a person, but
it's possible for a room to have a location... a Dungeon Master can
create a workroom, which is in his inventory, and then enter that
workroom.

Incidentally, the destruction of an object simply involves moving it
to nowhere. Since "nowhere" doesn't keep any references to the things
put there, those objects promptly cease to exist. It's just like
Western society - you don't actually destroy anything, you just throw
it away (out of sight, out of mind), and let the garbage collector
dispose of it for you

ChrisA

Chris Angelico · Feb 15, 2014

On Sat, 15 Feb 2014 15:37:20 +1100, Chris Angelico wrote:

[...]

This is why dice exist in a variety of colors [1]. Indistinguishable yet
distinct dice...

Click to expand...

Since they have different colours, they can be distinguished and aren't
indistinguishable.

Sorry, what I meant was that different colors are available for
purchase, as a means of avoiding this problem. If you buy a dozen dice
in mixed colors, you can simply declare "the red one is the hundreds,
the blue one is tens, the green one is units", which is the most
common solution to this exact issue of dice being indistinguishable
yet distinct.

One might also distinguish three dice by position in space ("the die
closest to the wall counts as the HUNDREDS digit, the one closest to me
counts as the TENS digit, and the one closest to you counts as the UNITS
digit")

Yes, but prior to rolling, you can't necessarily know what's going to
be easy to determine. Rolling dice tends to randomize their positions
somewhat, and it's impractical to get out the tape measure to
determine which one is closest to the wall

or by space ("roll this die first for the HUNDREDS, then roll
that one for TENS, then this third one for UNITS").

Distinguishing by time is the other common method that I mentioned. In
that case, there's really no reason to distinguish them at all, so you
can use the same object three times - like using the string "foobar"
three times and perhaps finding that it's the same object.

Or scent, or texture, or the typeface used for the numbers.

Then they're not indistinguishable

Having grown up in a business that sells dice (though, oddly enough,
not for RPGs - I learned that d12s are popular for practicing
multiplication tables, not that they're used for a barbarian's hit
points and great-axe damage), I know how many identical dice you can
get hold of. We had trays and trays of them at one point... then
migrated to bulk bags. Though we usually tried to split them up, 50
dice per bag, so we could keep some track of inventory. And we were
retailing, and dice weren't a huge part of our business. There are
plenty of ways for dice to differ, but that's like having 5, 5.0, 5L
(if Py2), and (5+0j), all of which are five, but all of which are
distinct and must be kept separate.

ChrisA

Ian Kelly · Feb 15, 2014

Whats the notion of object identity is the question.
Ok so you reject the memory addr as an 'implementation detail'
Then you are obliged to provide some other way of understanding object-identity

I thought that I did. Repeating myself from what you quoted above:

If two objects share the same creation, then they're the same object.

Marko Rauhamaa · Feb 15, 2014

Ned Batchelder said:
On 2/14/14 4:43 PM, Marko Rauhamaa wrote:
Yes, sometimes for teaching reasons, you have to over-simplify or even
introduce artificial constructs. I'd recommend acknowledging them as
such.

When you say, "There are two fundamentally different kinds of values
in Python," or "So we have four kinds of (memory) slots," you aren't
letting on that this is a teaching construct. It sounds like you mean
that this is how Python actually works.

I'd use words like, "This is an oversimplification, but might
help...", or "You can think of it like ...".

Strictly speaking, I'm not simplifying, but giving an equivalent,
alternative description. I admit that the word "fundamentally" was a bad
choice. I'm not even sure my description was a good illustration. I
definitely was not referring to CPython and was trying to keep the
discussion separate from the implementation of the day.

BTW, I also wasn't oversimplifying, but complicating by bringing in an
unnecessary dichotomy. The challenge is how to present Python's value
model in the most digestible way. For example, how is a beginnger to
understand what's going on in:

n += 1

Is it easier to think that the number held by the variable n is
incremented by 1, or is it easier to understand it orthodoxly through
instantiations and references?

Marko

Marko Rauhamaa · Feb 15, 2014

Chris Angelico said:
Can you give an example of an ambiguous case?

The "x is y" test may yield different outcomes in different, valid
Python implementations:

4 is 4
(x,) is (x,)
"hello" is "hello"

Marko

Marko Rauhamaa · Feb 15, 2014

Ben Finney said:
You should never need to predict the result of an â€˜isâ€™ operation.
(More precisely, for *some* cases you can predict it, but for other
cases you can't.)

No problem there. You have to understand "is" well to use it.

Referring to "objects in memory" when defininig "is" leads to circular
definitions. It think the best way to define the semantics of "is" is
through constraints:

1. if x is y then y ix x

2. if x is y and y is z then x is z

3. after x = y, x is y

4. if x is y, then x == y

That's why "is" is not introductory material.

The constraints define a relation that coincides with == wherever it is
defined. So why would you ever use "is" instead of "=="? After all, all
well-defined programs would behave identically after replacing "is" with
"==". Really, the only reason would be performance; "is" is often faster
to evaluate than "==".

Marko

Marko Rauhamaa · Feb 15, 2014

Christian Gollwitzer said:
Still they are connected. I can imagin that id() is just a debugging
tool for extensions. What useful applications does it have outside of
this?

Calculating hash keys quickly.

Marko

Marko Rauhamaa · Feb 15, 2014

Marko Rauhamaa said:
1. if x is y then y ix x
2. if x is y and y is z then x is z
3. after x = y, x is y
4. if x is y, then x == y

A new attempt:

0. x is x
1. if x is y then y ix x
2. if x is y and y is z then x is z
3. after x = y, x is y
4. if x is y and x == x, then x == y
5. id(x) == id(y) iff x is y

Does that cover it?

Marko

Steven D'Aprano · Feb 15, 2014

The "x is y" test may yield different outcomes in different, valid
Python implementations:

4 is 4
(x,) is (x,)
"hello" is "hello"

But none of those examples are ambiguous. They're merely unspecified by
the language definition. Any specific implementation of Python will
return either True or False; it may be predictable, or it might be
impossible to predict until runtime, but either way we know that every
non-broken Python virtual machine must either treat the two operands as
the same object or different objects.

These are ambiguous sentences:

I saw the man with the binoculars.
Police help assault victim.
Once there was a blind carpenter who picked up his hammer and saw.
Look at that cat with one eye.

"A Python implementation can choose whether or not to re-use immutable
objects" is not ambiguous. It's just a choice.

Steven D'Aprano · Feb 15, 2014

It think the best way to define the semantics of "is" is
through constraints:

1. if x is y then y ix x

2. if x is y and y is z then x is z

3. after x = y, x is y

4. if x is y, then x == y

Incorrect.

py> x = float('nan')
py> y = x
py> x is y
True
py> x == y
False

That's why "is" is not introductory material.

No. "is" is not introductory material because it is an attractive
nuisance for beginners. Beginners have a tendency to think that "is" is a
synonym for "equals", as it can be in English. Sometimes it appears to
work:

x = 2
y = x*3 - 4
x is y
=> probably returns True

but in other cases it fails, confusing the beginner.

With the exception of "is None", beginners almost never need "is".

The constraints define a relation that coincides with == wherever it is
defined.

It certainly does not.

class Wacky:
def __eq__(self, other):
return random.random() < 0.5

So why would you ever use "is" instead of "=="? After all, all
well-defined programs would behave identically after replacing "is" with
"==". Really, the only reason would be performance; "is" is often
faster to evaluate than "==".

Incorrect. "is" is NOT equivalent to ==, the two are not guaranteed to do
the same thing, you CANNOT safely replace "is" with == or visa versa, and
the meaning of the "is" operator is completely different from the meaning
of the == operator.

The "is" operator tests whether the two operands are the same object,
nothing more, nothing less. The == operator tests whether the two
operands compare equal.

Steven D'Aprano · Feb 15, 2014

Am 15.02.14 01:57, schrieb Chris Angelico:

Can you give an example of an ambiguous case? Fundamentally, the 'is'
operator tells you whether its two operands are exactly the same
object, nothing more and nothing less, so I assume your "ambiguous
cases" are ones where it's possible for two things to be either the
same object or two indistinguishable ones.

Click to expand...

What about the thing I posted down in this thread?

import numpy as np
a=np.array([1, 2, 3, 4])
b=a[:]
id(a) 140267900969344
id(b)

Click to expand...

Click to expand...

140267901045920

So, a and b are different things, right?

Correct. They are different objects. But they may share underlying state.

b[1]=37
b array([ 1, 37, 3, 4])
a

Click to expand...

Click to expand...

array([ 1, 37, 3, 4])

And indeed numpy arrays do share state. Why? No idea. Somebody thought
that it was a good idea. (Not me though...)

Still they are connected. I can imagin that id() is just a debugging
tool for extensions. What useful applications does it have outside of
this?

Very few.

Steven D'Aprano · Feb 15, 2014

A new attempt:

0. x is x
1. if x is y then y ix x
2. if x is y and y is z then x is z
3. after x = y, x is y
4. if x is y and x == x, then x == y
5. id(x) == id(y) iff x is y

Python implementations are free to re-use IDs after the object is
destroyed. CPython does; Jython and IronPython do not. So #5 needs to
point out that the condition id(x) == id(y) only applies if x and y still
exist.

# Counter-example
py> x = 230000
py> idx = id(x)
py> del x
py> y = 420000
py> idy = id(y)
py> idx == idy
True

(This is *implementation dependent* so your mileage my vary.)

Does that cover it?

No. Your definition describes some properties of identity-equivalence,
but doesn't explain what identity actually means.

Gregory Ewing · Feb 15, 2014

Steven said:
(1) General relativity tells us that not all observers will agree on the
space-time coordinates of two objects, since not all observers agree on a
single frame of reference.

But that doesn't mean they won't agree about whether
objects are identical or not! The coordinates they
use to describe spacetime locations may differ, but
they will agree on whether or not they are equal.
A Lorentz transformation can't cause a single point
in spacetime to split into two, or two distinct points
to merge into one.

(2) Quantum mechanics tells us that objects are not located at a single
space-time coordinate. Objects are "smeared out" over space (and time).
We cannot really talk about the location of an object, but only about the
probability of a measurement registering the object at a certain location.

But that doesn't mean you can stuff two objects into
the same space at the same time. What we perceive as
solid objects are composed of fermions, which obey
the Pauli exclusion principle. That means you can't
have more than one of them in a given quantum state.
While you *could* have two of them equally spread out
over all of space, they would then have to be
separated in some other dimension such as momentum
or spin.

So if you replace "space-time coordinates" with
"quantum state", the original statement remains
essentially true.

Chris Angelico · Feb 15, 2014

A new attempt:

Sorry, hadn't seen this when I posted.

0. x is x

This is the definition of identity.

1. if x is y then y ix x

Yes, because if x is y, there is absolutely no difference between
using one of those names or the other, in any context.

2. if x is y and y is z then x is z

Extension of the above. The first statement proves that you can
substitute 'x' for 'y' in the second without changing its truthiness;
therefore, based on the definition of identity, 'x is z' must be
identical to 'y is z'.

3. after x = y, x is y

This is the definition of assignment. (Obviously this axiom depends on
x and y being simple names and nothing tampering with the situation in
any way. But yes, this is exactly what assignment is.)

4. if x is y and x == x, then x == y

Yes. As in case 2, 'x is y' implies that you can substitute 'x' for
'y' or vice versa. Therefore, if x == x, then y == y, and x == y, and
y == x; because in each case, what you're doing is "object #1423443,
are you equal to object #1423443 or not?", regardless of the name you
use to access that object.

5. id(x) == id(y) iff x is y

This is the definition of id(). Note that it does depend on something
holding a reference to each of x and y; if it's possible for the
objects' lifetimes to not overlap, it's possible for them to reuse
ids:

[1,2,3] is [2,3,4] False
id([1,2,3]) == id([2,3,4])

Click to expand...

Click to expand...

True

But if x and y are simple names (and therefore retaining their
referent objects), then your statement is valid.

Does that cover it?

Largely axiomatically, yes.

ChrisA

Gregory Ewing · Feb 15, 2014

Steven said:
IDs are a proxy for distinct objects. If you live in a country with an ID
card of some sort, then the IDs acts as an identifier for each unique
individual. But countries without ID cards don't lack unique individual
people.

"You are Number Six."
"I am not an id()! I am an individual object!"

Chris Angelico · Feb 15, 2014

"You are Number Six."
"I am not an id()! I am an individual object!"

Make it so, Number One.

ChrisA

Gregory Ewing · Feb 15, 2014

Jussi said:
Would it help to say that in case 1 the relevant statement acts on the
variable while in case 2 it acts on the value of the variable?

I would try to avoid using words like "value" and "variable",
because they don't have well-defined meanings in Python.

The way I would put it is that

a = b

changes which object the name 'a' refers to, whereas

a = b

does not.

Marko Rauhamaa · Feb 15, 2014

Steven D'Aprano said:
# Counter-example
py> x = 230000
py> idx = id(x)
py> del x
py> y = 420000
py> idy = id(y)
py> idx == idy
True

I don't accept that as a counterexample. You will have to produce:

(id(x) == id(y)) == (x is y)

False

(This is *implementation dependent* so your mileage my vary.)

No. Your definition describes some properties of identity-equivalence,
but doesn't explain what identity actually means.

That's the point. I don't think id() and "is" have any abstract meaning
on top of the formal axioms.

Marko

Gregory Ewing · Feb 15, 2014

Chris said:
But yes, this is an expression, and it evaluates to a reference.

Well, *all* expressions in Python evaluate to references,
so that's not really saying much.

Chris Angelico · Feb 15, 2014

Well, *all* expressions in Python evaluate to references,
so that's not really saying much.

Because everything in Python is an object, and objects always are
handled by their references. This wouldn't be true in every language
(eg it's not true of Java's unboxed types), but it's intrinsic to
Python's object model.

ChrisA

Changing string value that is an element of a list	3	Feb 10, 2025
What's the detailed explanation for why the 1st function is correct and the 2nd is wrong?	3	Dec 16, 2022
Uncaught Reference Errors	1	Oct 9, 2022
Average of MultiMode of a list of a list	1	Oct 28, 2022
How does a HEAD pointer end up pointing to the first node in a linked list?	3	Jan 24, 2023
Select files based on text list of filenames(part of the name:date) with condition	0	May 4, 2022
Need help solving this problem	1	May 9, 2023
Select Eof extension files based on text list of filenames with if condition	1	May 4, 2022

Explanation of list reference

Chris Angelico

Chris Angelico

Ian Kelly

Marko Rauhamaa

Marko Rauhamaa

Marko Rauhamaa

Marko Rauhamaa

Marko Rauhamaa

Steven D'Aprano

Steven D'Aprano

Steven D'Aprano

Steven D'Aprano

Gregory Ewing

Chris Angelico

Gregory Ewing

Chris Angelico

Gregory Ewing

Marko Rauhamaa

Gregory Ewing

Chris Angelico

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads