Is this a bug in int()?

MartinRinehart · Dec 20, 2007

int('0x', 16)
0

I'm working on a tokenizer and I'm thinking about returning a
MALFORMED_NUMBER token (1.2E, .5E+)

Fredrik Lundh · Dec 20, 2007

0

I'm working on a tokenizer and I'm thinking about returning a
MALFORMED_NUMBER token (1.2E, .5E+)

Somewhat surprisingly, "0x" is a valid integer literal in Python:
0

</F>

Duncan Booth · Dec 20, 2007

(e-mail address removed) wrote under the subject line "Is this a
bug in int()?":

0

I think it is a general problem in the tokenizer, not just the 'int'
constructor. The syntax for integers says:

hexinteger ::= "0" ("x" | "X") hexdigit+

but 0x appears to be accepted in source code as an integer.

If I were you, I'd try reporting it as a bug.

I'm working on a tokenizer and I'm thinking about returning a
MALFORMED_NUMBER token (1.2E, .5E+)

Why would you return a token rather than throwing an exception?

Terry Reedy · Dec 20, 2007

MartinRinehart · Dec 21, 2007

Duncan said:
Why would you return a token rather than throwing an exception?

Tokenizers have lots of uses. Colorizing text in an editor, for
example. We've got a MALFORMED_NUMBER when you type '0x'. We've got an
INTEGER when we get your next keystroke (probably).

MartinRinehart · Dec 21, 2007

Tokenizer bug reported.

[email protected] said:
0

I'm working on a tokenizer and I'm thinking about returning a
MALFORMED_NUMBER token (1.2E, .5E+)

MartinRinehart · Dec 22, 2007

Tokenizer accepts "0x" as zero. Spec says its an error not to have at
least one hex digit after "0x".

This is a more serious bug than I had originally thought. Consider
this:

Joe types "security_code = 0x" and then goes off to the Guardian-of-
the-Codes to get the appropriate hex string. Returning to computer,
Joe's boss grabs him. Tells him that effective immediately he's on the
"rescue us from this crisis" team; his other project can wait.

Some hours, days or weeks later Joe returns to the first project. At
this point Joe has a line of code that says "security_code = 0x". I
think Joe would be well-served by a compiler error on that line. As is
now, Joe's program assigns 0 to security_code and compiles without
complaint. I'm pretty sure any line of the form "name = 0x" was a
product of some form of programmer interruptus.

George Sakkis · Dec 22, 2007

Tokenizer accepts "0x" as zero. Spec says its an error not to have at
least one hex digit after "0x".

This is a more serious bug than I had originally thought. Consider
this:

Joe types "security_code = 0x" and then goes off to the Guardian-of-
the-Codes to get the appropriate hex string. Returning to computer,
Joe's boss grabs him. Tells him that effective immediately he's on the
"rescue us from this crisis" team; his other project can wait.

Some hours, days or weeks later Joe returns to the first project. At
this point Joe has a line of code that says "security_code = 0x". I
think Joe would be well-served by a compiler error on that line. As is
now, Joe's program assigns 0 to security_code and compiles without
complaint. I'm pretty sure any line of the form "name = 0x" was a
product of some form of programmer interruptus.

Are you a fiction writer by any chance ? Nice story but I somehow
doubt that the number of lines of the form "name = 0x" ever written in
Python is greater than a single digit (with zero the most likely one).

George

Steven D'Aprano · Dec 22, 2007

Tokenizer accepts "0x" as zero. Spec says its an error not to have at
least one hex digit after "0x".

This is a more serious bug than I had originally thought. Consider this:

Joe types "security_code = 0x" and then goes off to the Guardian-of-
the-Codes to get the appropriate hex string.

Which is *hard coded* in the source code??? How do you revoke a
compromised code, or add a new one?

Let me guess... the Guardian of the Codes stores them on a postit note
stuck to the side of the fridge in the staff lunchroom? Written
backwards, so nobody can guess what they really are.

Returning to computer,
Joe's boss grabs him. Tells him that effective immediately he's on the
"rescue us from this crisis" team; his other project can wait.

Serves him write for writing in hex, when everybody knows that for *real*
security you should store your security codes as octal.

Some hours, days or weeks later Joe returns to the first project. At
this point Joe has a line of code that says "security_code = 0x". I
think Joe would be well-served by a compiler error on that line.

*shrug*

Maybe so, but how is that worse than if he had written "security_code =
123456" intending to come back and put the actual code in later, and
forgot?

As is
now, Joe's program assigns 0 to security_code and compiles without
complaint.

Which Joe will *instantly* discover, the first time he tries to test the
program and discovers that entering the *actual* security code doesn't
work.

I'm pretty sure any line of the form "name = 0x" was a
product of some form of programmer interruptus.

There's no doubt that 0x is a bug, according to the current specs. It
probably should be fixed (as opposed to changing the specs). But trying
to up-sell a trivial bug into a OMG The Sky Is Falling This Is A Huge
Security Risk Panic Panic Panic just makes you seem silly.

Explore the Power of AI: Build Your Own Console Chatbot Using GPT-2 XL in Python	2	Mar 17, 2026
Help me fix this bug in my program! Displaying chart data	1	Apr 8, 2023
Is this a bug?	1	May 27, 2018
Maybe a bug in JavaScript?	2	Sep 2, 2022
Even basic math is at risk? Why is a simple math and logic solution being ignored?	2	Jul 3, 2025
Rock, Paper, Scissor game. Im getting TypeError, unsupported operand type(s) for -=: 'NoneType' and 'int'	2	Aug 28, 2023
Privacy Shield A Clean, C++ Win32 Tool for Temporarily Masking Windows	4	Mar 25, 2026
Universal BMP Steganography Tool (AES-128-CTR + SP800-90A CSPRNG) Full Encoder/Decoder with 3LSB Payload, PasswordDerived Key & External Key File	4	Mar 26, 2026

Is this a bug in int()?

MartinRinehart

Fredrik Lundh

Duncan Booth

Terry Reedy

MartinRinehart

MartinRinehart

MartinRinehart

George Sakkis

Steven D'Aprano

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads