The inverse of .join

Neil Cerutti · Jun 17, 2010

What's the best way to do the inverse operation of the .join
function?

nn · Jun 17, 2010

Neil said:
What's the best way to do the inverse operation of the .join
function?

split

Ian Kelly · Jun 17, 2010

What's the best way to do the inverse operation of the .join
function?

Use the str.split method?

MRAB · Jun 17, 2010

Neil said:
What's the best way to do the inverse operation of the .join
function?

.split, possibly, although there will be problems if the string contains
other occurrences of the separator.

Neil Cerutti · Jun 17, 2010

Use the str.split method?

split is perfect except for what happens with an empty string.

MRAB · Jun 17, 2010

Neil said:
split is perfect except for what happens with an empty string.

I see what you mean.

This is consistent:

>>> ','.join([''])
''
>>> ''.split(',')
['']

but this isn't:

>>> ','.join([])
''
>>> ''.split(',')
['']

An empty string could be the result of .join(['']) or .join([]).

Should .split grow an additional keyword argument to specify the desired
behaviour? (Although it's simple enough to define your own function.)
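
A minimal sketch of such a function, assuming the caller wants an empty string to mean "no items". The name unjoin is made up for illustration; it is not part of str or the standard library.

```python
def unjoin(s, sep=","):
    """Hypothetical helper: split s on sep, treating '' as an empty list."""
    if s == "":
        return []
    return s.split(sep)


print(unjoin(""))       # []
print(unjoin("a,b,c"))  # ['a', 'b', 'c']
print(unjoin(","))      # ['', '']
```

The trade-off is that [''] can no longer be recovered: ','.join(['']) also gives '', and this helper maps it back to [].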

Robert Kern · Jun 17, 2010

split is perfect except for what happens with an empty string.

Why don't you try it and find out?

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
-- Umberto Eco

Stephen Hansen · Jun 17, 2010

MRAB said:
Neil said:
split is perfect except for what happens with an empty string.

I see what you mean.

This is consistent:

>>> ','.join([''])
''
>>> ''.split(',')
['']

but this isn't:

>>> ','.join([])
''
>>> ''.split(',')
['']

An empty string could be the result of .join(['']) or .join([]).

Should .split grow an additional keyword argument to specify the desired
behaviour? (Although it's simple enough to define your own function.)

Guido finds keyword-arguments-to-change-behavior to be unPythonic, IIRC.
It generally means 'make a new API'. But, the question is-- is it worth
the mental strain of a new API?

This is such an extreme edge case that having to do:

if blah:
    result = blah.split(',')
else:
    result = []

is really not asking too much, I think.

--

Stephen Hansen
... Also: Ixokai
... Mail: me+list/python (AT) ixokai (DOT) io
... Blog: http://meh.ixokai.io/


Neil Cerutti · Jun 17, 2010

Why don't you try it and find out?

I'm currently using the following without problems, while reading
a data file. One of the fields is a comma separated list, and may
be empty.

f = rec['codes']
if f == "":
    f = []
else:
    f = f.split(",")

I just wondered if something smoother was available.

Robert Kern · Jun 17, 2010

I would like to apologize. I read that sentence as a question for some reason.

That said, it always helps for you to show the results that you are getting (and
the code that gives those results) and state what results you were expecting.


Steven D'Aprano · Jun 18, 2010

What's the best way to do the inverse operation of the .join function?

str.join is a many-to-one function, and so it doesn't have an inverse.
You can't always get the input back unchanged:

>>> L = ["a", "b", "c|d", "e"]
>>> s = '|'.join(L)
>>> s
'a|b|c|d|e'
>>> s.split('|')
['a', 'b', 'c', 'd', 'e']
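
The same point can be checked mechanically. This is only a sketch; round_trips is a hypothetical name chosen for illustration, not an existing function.

```python
def round_trips(items, sep):
    """Return True if joining and then splitting gives back the same list."""
    return sep.join(items).split(sep) == items


print(round_trips(["a", "b", "c", "d"], "|"))    # True
print(round_trips(["a", "b", "c|d", "e"], "|"))  # False: an item contains the separator
print(round_trips([], "|"))                      # False: '' splits to ['']
print(round_trips([""], "|"))                    # True
```

So split undoes join only when no item contains the separator and the list is not empty.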

Steven D'Aprano · Jun 18, 2010

Should .split grow an additional keyword argument to specify the desired
behaviour?

Please no.

(Although it's simple enough to define your own function.)

Exactly.

Steven D'Aprano · Jun 18, 2010

I'm currently using the following without problems, while reading a data
file. One of the fields is a comma separated list, and may be empty.

f = rec['codes']
if f == "":
    f = []
else:
    f = f.split(",")

I just wondered if something smoother was available.

Seems pretty smooth to me. What's wrong with it? I assume you've put it
into a function for ease of use and reduction of code duplication.

You could also use the ternary operator, in which case it's a mere
two-liner and short enough to inline wherever you need it:

f = rec['codes']
f = f.split(",") if f else []

Neil Cerutti · Jun 18, 2010

I'm currently using the following without problems, while
reading a data file. One of the fields is a comma separated
list, and may be empty.

f = rec['codes']
if f == "":
    f = []
else:
    f = f.split(",")

I just wondered if something smoother was available.

Seems pretty smooth to me. What's wrong with it? I assume
you've put it into a function for ease of use and reduction of
code duplication.

The part that's wrong with it, and it's probably my fault, is
that I can never think of it. I had to go dig it out of my code
to remember what the special case was.

You could also use the ternary operator, in which case it's a
mere two-liner and short enough to inline wherever you need
it:

f = rec['codes']
f = f.split(",") if f else []

That's pretty cool.

Thanks to everybody for their thoughts.

Jon Clements · Jun 18, 2010

Why don't you try it and find out?

I'm currently using the following without problems, while reading
a data file. One of the fields is a comma separated list, and may
be empty.

f = rec['codes']
if f == "":
    f = []
else:
    f = f.split(",")

I just wondered if something smoother was available.

In terms of behaviour and 'safety', I'd go for:

>>> import csv
>>> rec = {'code1': '1,2,3', 'code2': ''}
>>> next(csv.reader([rec['code1']]))
['1', '2', '3']
>>> next(csv.reader([rec['code2']]))
[]
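
If the data is also written with the csv module, the pair can act as a join/split that survives separators inside items. A rough sketch under that assumption; join_csv and split_csv are hypothetical names for illustration, not csv functions.

```python
import csv
import io


def join_csv(items):
    """Write items as one CSV record and return it without the line ending."""
    buf = io.StringIO()
    csv.writer(buf).writerow(items)
    return buf.getvalue().rstrip("\r\n")


def split_csv(line):
    """Parse one CSV record back into a list of strings."""
    return next(csv.reader([line]))


items = ["a", "b", "c,d", "e"]
line = join_csv(items)
print(line)                     # a,b,"c,d",e
print(split_csv(line))          # ['a', 'b', 'c,d', 'e']
print(split_csv(join_csv([])))  # []
```

The writer quotes any field containing the delimiter, which is what lets the reader recover 'c,d' as a single item. It also writes a lone empty field as "", so [''] and [] come back as different lists.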

hth
Jon.

Neil Cerutti · Jun 18, 2010

I just wondered if something smoother was available.

In terms of behaviour and 'safety', I'd go for:

>>> import csv
>>> rec = {'code1': '1,2,3', 'code2': ''}
>>> next(csv.reader([rec['code1']]))
['1', '2', '3']
>>> next(csv.reader([rec['code2']]))
[]

Slick!
