Clever hack or code abomination?

Roy Smith · Dec 1, 2011

I need to try a bunch of names in sequence until I find one that works
(definition of "works" is unimportant). The algorithm is:

1) Given a base name, "foo", first see if just plain "foo" works.

2) If not, try "foo-1", "foo-2", and so on

3) If you reach "foo-20", give up.

What would you say if you saw this:

for suffix in [''] + [str(i) for i in xrange(-1, -20, -1)]:

It generates the right sequence of strings. But, if you came upon that
code, would it make sense to you, or would you spend half the afternoon
trying to figure out what it did and the other half of the afternoon
ranting about the deranged lunatic who wrote it?

Steven D'Aprano · Dec 1, 2011

I need to try a bunch of names in sequence until I find one that works
(definition of "works" is unimportant). The algorithm is:

1) Given a base name, "foo", first see if just plain "foo" works.

2) If not, try "foo-1", "foo-2", and so on

3) If you reach "foo-20", give up.

What would you say if you saw this:

for suffix in [''] + [str(i) for i in xrange(-1, -20, -1)]:

It generates the right sequence of strings. But, if you came upon that
code, would it make sense to you, or would you spend half the afternoon
trying to figure out what it did and the other half of the afternoon
ranting about the deranged lunatic who wrote it?

Nah, it's fine. Not exactly the clearest piece of code in the world, but
hardly worth a rant.

I'd be more likely to write that as:

for suffix in [''] + ["-%d" % i for i in range(1, 21)]:

or if I needed to do it more than once, as a generator:

def suffixes(max=20):
yield ""
for i in range(1, max+1):
yield "-%d" % i

Matt Joiner · Dec 1, 2011

def possible_names():
yield "foo"
for i in range(20):
yield "foo-" + str(i)

à² _à²

Chris Angelico · Dec 1, 2011

for suffix in [''] + [str(i) for i in xrange(-1, -20, -1)]:

It generates the right sequence of strings. But, if you came upon that
code, would it make sense to you, or would you spend half the afternoon
trying to figure out what it did and the other half of the afternoon
ranting about the deranged lunatic who wrote it?

That's a self-contained piece of code.If I came upon it, I'd probably
copy and paste it to IDLE, see what it comes up with, and proceed from
there. Deranged you may be, but so long as code can be dropped into an
interactive interpreter, it's fine.

ChrisA

Arnaud Delobelle · Dec 1, 2011

I need to try a bunch of names in sequence until I find one that works
(definition of "works" is unimportant). Â The algorithm is:

1) Given a base name, "foo", first see if just plain "foo" works.

2) If not, try "foo-1", "foo-2", and so on

3) If you reach "foo-20", give up.

What would you say if you saw this:

for suffix in [''] + [str(i) for i in xrange(-1, -20, -1)]:

It's a little obfuscated

I would go for the simple:

for i in xrange(21):
suffix = "-%s" % i if i else ""
....

Vito 'ZeD' De Tullio · Dec 1, 2011

Arnaud said:
I need to try a bunch of names in sequence until I find one that works
(definition of "works" is unimportant). The algorithm is:

1) Given a base name, "foo", first see if just plain "foo" works.

2) If not, try "foo-1", "foo-2", and so on

3) If you reach "foo-20", give up.

What would you say if you saw this:

for suffix in [''] + [str(i) for i in xrange(-1, -20, -1)]:

Click to expand...

It's a little obfuscated I would go for the simple:

for i in xrange(21):
suffix = "-%s" % i if i else ""
....

obfuscated for obfuscated, you can merge the two ideas:

for suffix in ('-%s' % i if i else '' for i in xrange(21)):
...

Martin P. Hellwig · Dec 1, 2011

On 01/12/2011 03:15, Roy Smith wrote:
<cut>
Well, I have seen much worse, so the WTFs/minute(*) count won't be too bad.

However, as general rule for readability; If you think you have to ask,
don't bother asking, spend that time rethinking and write a more
readable solution.

*) http://www.osnews.com/story/19266/WTFs_m

Steven D'Aprano · Dec 2, 2011

On 01/12/2011 03:15, Roy Smith wrote: <cut>
Well, I have seen much worse, so the WTFs/minute(*) count won't be too
bad.

However, as general rule for readability; If you think you have to ask,
don't bother asking, spend that time rethinking and write a more
readable solution.

That's generally good advice, but in this case, I don't know why Roy
Smith thought he had to ask. His initial snippet was pretty much standard
vanilla Python: a for loop over a list. The list was made by adding a
list containing a single element '', and a second list made from a list
comprehension that converted a bunch of numbers into strings. It uses no
advanced features like generators. A newbie to Python could work it out.

In my opinion, if anyone thinks that's "obfuscated" (as opposed to merely
"not quite idiomatic"), either their standards for readability are
impossibly high, or they just can't read Python code.

Try this on for size.

def obfuscated_prefixes(q=20):
c = str()
while isinstance(type(c), type(type)):
yield None or c
while isinstance(c, object):
try:
f = type(q)(c[c.index(chr(45))+1:])+type(q)(1)
except ValueError:
c = str.join('\n', list(map(chr, (45, 48))) + [c])[::2]
else:
break
if f <= q: pass
else: break
try:
c = (lambda a,b: a+b)(c[:c.index(chr(45))+1], type(c)(f))
except IndexError:
c = c[1::]

Chris Angelico · Dec 2, 2011

Try this on for size.

f = type(q)(c[c.index(chr(45))+1:])+type(q)(1)
c = str.join('\n', list(map(chr, (45, 48))) + [c])[::2]
c = (lambda a,b: a+b)(c[:c.index(chr(45))+1], type(c)(f))

I would consider integer representations of ASCII to be code smell.
It's not instantly obvious that 45 means '-', even if you happen to
know the ASCII table by heart (which most people won't). This is one
thing that I like about C's quote handling; double quotes for a
string, or single quotes for an integer with that character's value.
It's clearer than the Python (and other) requirement to have an actual
function call:

for (int i=0;i<10;++i) {
digit='0'+i;
letter='A'+i;
}

versus

for i in range(10):
digit=chr(ord('0')+i)
letter=chr(ord('A')+i)

Ignoring the fact that you'd probably use a list comp in Python, this
is imho a win for C.

ChrisA

Terry Reedy · Dec 2, 2011

def possible_names():
yield "foo"
for i in range(20):
yield "foo-" + str(i)

This is my favorite -- crystal clear with only the variable part
variable. And it works in both 2.x and 3.x.

Matt Joiner · Dec 2, 2011

Thank you. à² _à²

This is my favorite -- crystal clear with only the variable part variable..
And it works in both 2.x and 3.x.

Steven D'Aprano · Dec 2, 2011

Try this on for size.

Â Â Â Â Â Â Â Â f = type(q)(c[c.index(chr(45))+1:])+type(q)(1) c
Â Â Â Â Â Â Â Â = str.join('\n', list(map(chr, (45, 48))) +
Â Â Â Â Â Â Â Â [c])[::2]
Â Â Â Â Â Â c = (lambda a,b: a+b)(c[:c.index(chr(45))+1],
Â Â Â Â Â Â type(c)(f))

Click to expand...

I would consider integer representations of ASCII to be code smell. It's
not instantly obvious that 45 means '-', even if you happen to know the
ASCII table by heart (which most people won't). This is one thing that I
like about C's quote handling; double quotes for a string, or single
quotes for an integer with that character's value. It's clearer than the
Python (and other) requirement to have an actual function call:

for (int i=0;i<10;++i) {
digit='0'+i;
letter='A'+i;
}

I would disagree that this is clear at all. You're adding what looks like
a character, but is actually an integer, with an integer. And then just
to add insult to injury, you're storing integers into arrays that are
named as if they were characters. In what mad universe would you describe
65 as a letter?

To say nothing of the fact that C's trick only works (for some definition
of works) for ASCII. Take for example one of the many EBCDIC encodings,
cp500. If you expect 'I' + 1 to equal 'J', you will be sorely
disappointed:

py> u'I'.encode('cp500')
'\xc9'
py> u'J'.encode('cp500')
'\xd1'

Characters are not integers, and C conflates them, to the disservice of
all. If fewer people learned C, fewer people would have such trouble
understanding Unicode.

Anyone unfamiliar with C's model would have trouble guessing what 'A' + 1
should mean. Should it be?

- an error
- 'B'
- 'A1'
- the numeric value of variable A plus 1
- 66 (assuming ascii encoding)
- 194 (assuming cp500 encoding)
- some other number
- something else?

How about 1000 + 'A'?

versus

for i in range(10):
digit=chr(ord('0')+i)
letter=chr(ord('A')+i)

Click to expand...

It's a tad more verbose, but it's explicit about what is being done. Take
the character '0', find out what ordinal value it encodes to, add 1 to
that value, re-encode back to a character. That's exactly what C does,
only it does it explicitly.

Note that this still doesn't work the way we might like in EBCDIC, but
the very fact that you are forced to think about explicit conversion
steps means you are less likely to make unwarranted assumptions about
what characters convert to.

Better than both, I would say, would be for string objects to have
successor and predecessor methods, that skip ahead (or back) the
specified number of code points (defaulting to 1):

'A'.succ() => 'B'
'A'.succ(5) => 'F'

with appropriate exceptions if you try to go below 0 or above the largest
code point.

Steven D'Aprano · Dec 2, 2011

Note, I'm not saying that C's way is perfect; merely that using the
integer 45 to represent a hyphen is worse.

Dude, it was deliberately obfuscated code. I even named the function
"obfuscated_prefixes". I thought that would have been a hint <wink>

It's kinda scary that of all the sins against readability committed in my
function, including isinstance(type(c), type(type)) which I was
particularly proud of, the only criticism you came up with was that
chr(45) is hard to read. I'm impressed <grins like a mad thing>

[...]

I don't know about that. Anyone brought up on ASCII and moving to EBCDIC
will likely have trouble with this, no matter how many function calls it
takes.

Of course you will, because EBCDIC is a pile of festering garbage

But IMAO you're less likely to have trouble with with Unicode if you
haven't been trained to treat characters as synonymous with integers.

And besides, given how rare such byte-manipulations on ASCII characters
are in Python, it would be a shame to lose the ability to use '' and ""
for strings just to avoid calling ord and chr functions.

... and this still has that same issue. Arithmetic on codepoints depends
on that.

We shouldn't be doing arithmetic on code points. Or at least we shouldn't
unless we are writing a Unicode library that *needs* to care about the
implementation. We should only care about the interface, that the
character after 'A' is 'B'. Implementation-wise, we shouldn't care
whether A and B are represented in memory by 0x0041 and 0x0042, or by
0x14AF and 0x9B30. All we really need to know is that B comes immediately
after A. Everything else is implementation.

But I fear that the idea of working with chr and ord is far to ingrained
now to get rid of it.

Chris Angelico · Dec 2, 2011

Dude, it was deliberately obfuscated code. I even named the function
"obfuscated_prefixes". I thought that would have been a hint <wink>

It's kinda scary that of all the sins against readability committed in my
function, including isinstance(type(c), type(type)) which I was
particularly proud of, the only criticism you came up with was that
chr(45) is hard to read. I'm impressed <grins like a mad thing>

Heh! I know it was obfuscated, and that's why I didn't bother
mentioning the obvious.

And besides, given how rare such byte-manipulations on ASCII characters
are in Python, it would be a shame to lose the ability to use '' and ""
for strings just to avoid calling ord and chr functions.

Agreed. That flexibility is a _huge_ advantage. Definitely not worth
changing anything.

ChrisA

Chris Hulan · Dec 2, 2011

I'd just look a the unit tests for clarification/examples

John Ladasky · Dec 6, 2011

That's a self-contained piece of code.If I came upon it, I'd probably
copy and paste it to IDLE, see what it comes up with, and proceed from
there.

+1. That was going to be my comment exactly.

problem with hack using multiple inheritance for plugins	14	Jun 28, 2007
Picture Comparison Code Not Working Properly	1	Jul 24, 2021
Rock, Paper, Scissor game. Im getting TypeError, unsupported operand type(s) for -=: 'NoneType' and 'int'	2	Aug 29, 2023
hex dump w/ or w/out utf-8 chars	40	Jul 8, 2013
What should I do Before I give up programming?	6	Jan 14, 2023
IE innerHeight hack success	0	Feb 2, 2005
Avoiding a GET CGI hack attack	6	Jul 16, 2004
Clever ways to work around the fact that control breaks VS.NET design view? (replacing a literal?)	8	Aug 26, 2007

Clever hack or code abomination?

Roy Smith

Steven D'Aprano

Matt Joiner

Chris Angelico

Arnaud Delobelle

Vito 'ZeD' De Tullio

Martin P. Hellwig

Steven D'Aprano

Chris Angelico

Terry Reedy

Matt Joiner

Steven D'Aprano

Steven D'Aprano

Chris Angelico

Chris Hulan

John Ladasky

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads