From D

Ben Finney · Jul 26, 2007

[email protected] said:
On Jul 25, 9:04?pm, Steven D'Aprano
Why does it make no sense? Have you never had to scrape a web page
or read a CSV file?

Again, unrelated to the way the Python compiler syntactically treats
the source code.

So this proposal would only apply to string literals at compile
time, not running programs?

Exactly the same way that it works for string literals in source code:
once the source code is compiled, the literal is indistinguishable
from the same value written a different way.

And I want the same error to occur if my CSV parser tries to convert
'123 456' into a single number. I don't want it to assume the
number is '123456'.

Once again, this is a discussion about Python syntax, not the
behaviour of the csv module.

bearophileHUGS · Jul 26, 2007

Sorry for the slow feedback.

Stargaming>Sounds like a good thing to be but the arbitrary
positioning doesnt make any sense.<

The arbitrary positioning allows you to denote 4-digit groups too in
binary/hex literals, like in my example:
auto x = 0b0100_0011;

Stargaming>fits into the current movement towards generator'ing
everything. But (IIRC) this idea came up earlier and there has been a
patch, too.<

Python is old so most simple ideas aren't new

Steven D'Aprano>Underscores in numerics are UGLY.<

I presume it's a matter of taste too. I use them often in D code, and
the _ symbol is very different from the 0..F/0..f digits so you can
tell them apart with no problems.

Steven D'Aprano>Why not take a leaf out of implicit string
concatenation and allow numeric literals to implicitly concatenate?<

The "_" helps my eyes see that those digit groups are part of the same
number. With spaces I think my eyes may need a bit of extra time to
decide if they are parts of the same number literal.

Eric Dexter>I think there is a language bridge so that you can compile
d for python.. looks realy easy but I have python 2.5 and panda and
it try's to go for the panda instalation. It looks much easier than c
to use with python in fact..<

Are you talking about "Pyd"? It's a good bridge, and I like it. It's
actively updated, soon in version 1.0.

Bye,
bearophile

Paul Rubin · Jul 26, 2007

Steven D'Aprano said:
Propose:
123 456 789 => 123456789
123.456 789 => 123.456789

+1

Leo Petr · Jul 26, 2007

Sounds like a good thing to be but the arbitrary positioning doesnt make
any sense. Additionally, I'd suggest 10**n in such cases (eg. 10**6).

http://blogs.msdn.com/oldnewthing/archive/2006/04/17/577483.aspx

Digits are grouped in 2s in India and in 4s in China and Japan.

Regards,

Leons Petrazickis
http://lpetr.org/blog/

mensanator · Jul 26, 2007

Again, unrelated to the way the Python compiler syntactically treats
the source code.

That's what I was enquiring about.

So, just as
123456

is not an error, the proposal is that
SyntaxError: invalid syntax

will not be an error either.

Yet,
Traceback (most recent call last):
File "<pyshell#7>", line 1, in <module>
a = int('123 456')
ValueError: invalid literal for int() with base 10: '123 456'

will still be an error. Just trying to be clear on this. Wouldn't
want that syntax behavior to carry over into run-time.

Exactly the same way that it works for string literals in source code:
once the source code is compiled, the literal is indistinguishable
from the same value written a different way.

Once again, this is a discussion about Python syntax, not the
behaviour of the csv module.

Who said I was using the csv module?

mensanator · Jul 26, 2007

[email protected] said:
[email protected] said:

IDLE 1.2c1

s = '123 456'
s.split()

Click to expand...

['123', '456']

Click to expand...

The str.split method has no bearing on this discussion,

It most certainly does. To make '123 456' into an integer,
you split it and then join it.123456

Just wanted to be sure that this must still be done explicitly
and that the language won't do it for me behind my back.

which is about
the Python language syntax,

Provided it is confined to the language syntax.

and numeric literal values in particular.

Fine, as long as int('123 456') continues to be an error.

Kay Schluehr · Jul 26, 2007

So, spaces will no longer be delimiters? Won't that cause
much wailing and gnashing of teeth?

Nope. Just replace the current grammar rule

atom: ... NAME | STRING+ | NUMBER

by

atom: ... NAME | STRING+ | NUMBER+

The resulting grammar is still free of ambiguities. The tokenizer
doesn't complain anyway - not even yet.

Ryan Ginstrom · Jul 26, 2007

On Behalf Of Leo Petr

Digits are grouped in 2s in India and in 4s in China and Japan.

This is not entirely true in Japan's case. When written without Japanese
characters, Japan employs the same format as the US, for example:

1,000,000
(However, they would read this as $BI4K|(B (hyaku man), literally 100 ten
thousands.)

Raymond is correct in that Japan traditionally groups in fours (and stills
reads it that way regardless, as shown above), but in an ordinary
programming context, this almost never comes into play.

On the original topic of the thread, I personally like the underscore idea
from D, and I like it better than the "concatenation" idea, even though I
agree that it is more consistent with Python's string-format rules.

Regards,
Ryan Ginstrom

Tim Williams · Jul 27, 2007

It most certainly does. To make '123 456' into an integer,
you split it and then join it.
123456

.....but it doesn't if you use replace !! said:
Propose:
123 456 789 => 123456789
123.456 789 => 123.456789

+1 for me too

Ben Finney · Jul 27, 2007

[email protected] said:
So, just as

123456

is not an error, the proposal is that

SyntaxError: invalid syntax

will not be an error either.

More directly: Just as these three statements create the same literal
value:
'abcdef'

the proposal is that these three statements create the same literal
value:
12345.67890

and not be a syntax error.

Yet,

Traceback (most recent call last):
File "<pyshell#7>", line 1, in <module>
a = int('123 456')
ValueError: invalid literal for int() with base 10: '123 456'

will still be an error.

Since that value, '123 456', is one that is rejected by the 'int'
constructor. Nothing to do with this proposal.

Just trying to be clear on this. Wouldn't want that syntax behavior
to carry over into run-time.

The distinction you need to be clear on is between the Python syntax
for writing literal values in code (which is proposed to change by
this), and the behaviour of operations on arbitrary values at runtime
(which is outside the scope of this proposal).

Ben Finney · Jul 27, 2007

[email protected] said:
It most certainly does. To make '123 456' into an integer,
you split it and then join it.

Indeed. Which has nothing to do with the Python syntax for creating a
numeric literal in code.

fdu.xiaojf · Jul 31, 2007

Gabriel said:
Why not? Because in English major numbers are labeled in thousands?
(thousand, million, billion...)
In India, they're grouped by two after the first thousand; in China,
they're grouped each 4 digits (that is, there is a single word for "ten
thousands" = wan4 = ä¸‡, and the next required word is for 10**8 = yi4 = äº¿)

Yes, in China numbers are grouped each 4 digits while it is different in
other countries, so I think it would be better if we could put arbitrary white
spaces inside number literals.

Alex Martelli · Jul 31, 2007

code files? What's the regular expression for
locating a number with an arbitrary number of digits
seperated into an arbitrary number of blocks of an
arbitray number of digits with an arbitrary number
of whitespace characters between each block?

For a decimal integer (or octal) number, I'd use something similar to:
r'\d[\d\s]+'

This also gets trailing whitespace, but that shouldn't be much of a
problem in most practical cases. Of course, just like today, it becomes
a bit hairier if you also want to find hex, oct (to be 0o777 in the
future), other future notations such as binary, floats, complex numbers,
&c

-- but the simple fact that a [\d\s] is accepted where today only
a \d would be, per se, would not contribute to that hair in any
significant way, it seems to me.

Alex

why printf("%d", arg) works with arg of type int, short, char	21	Mar 1, 2014
D-CM; Software Testers	4	Jun 19, 2010
Copy string from 2D array to a 1D array in C	1	Nov 1, 2023
2-D drawing/map with python	2	Mar 12, 2013
Some success with the "Plot" problem :D	0	Jul 14, 2010
D foreach	1	Nov 13, 2005
Efficient python 2-d arrays?	6	Jan 17, 2011
Hi From Canada	3	Nov 26, 2023

From D

Ben Finney

bearophileHUGS

Paul Rubin

Leo Petr

mensanator

mensanator

Kay Schluehr

Ryan Ginstrom

Tim Williams

Ben Finney

Ben Finney

fdu.xiaojf

Alex Martelli

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads