How can I split by 1 or more occurrences of a delimiter in Python?

Question:

I have a formatted string from a log file, which looks like:

>>> a="test                            result"

That is, the test and the result are split by some spaces – it was probably created using formatted string which gave test some constant spacing.

Simple splitting won’t do the trick:

>>> a.split(" ")
['test', '', '', '', ... '', '', '', '', '', '', '', '', '', '', '', 'result']

split(DELIMITER, COUNT) cleared some unnecessary values:

>>> a.split(" ",1)
['test', '                           result']

This helped – but of course, I really need:

['test', 'result']

I can use split() followed by map + strip(), but I wondered if there is a more Pythonic way to do it.

Thanks,

Adam

UPDATE: Such a simple solution! Thank you all.

Asked By: Adam Matan

||

Answers:

Just this should work:

a.split()

Example:

>>> 'a      b'.split(' ')
['a', '', '', '', '', '', 'b']
>>> 'a      b'.split()
['a', 'b']

From the documentation:

If sep is not specified or is None, a different splitting algorithm is applied: runs of consecutive whitespace are regarded as a single separator, and the result will contain no empty strings at the start or end if the string has leading or trailing whitespace. Consequently, splitting an empty string or a string consisting of just whitespace with a None separator returns [].

Answered By: Mark Byers
>>> import re
>>> a="test                            result"
>>> re.split(" +",a)
['test', 'result']

>>> a.split()
['test', 'result']
Answered By: ghostdog74

Any problem with simple a.split()?

Answered By: YOU

Just do not give any delimeter?

>>> a="test                            result"
>>> a.split()
['test', 'result']
Answered By: Kimvais

Just adding one more way, more useful in cases where delimiter is different from space, and s.split() will not work.

like str = “Python,is,,more,,,,,flexible”.

In [27]: s = "Python,is,,more,,,,,flexible"

In [28]: str_list = list(filter(lambda x: len(x) > 0, s.split(",")))

In [29]: str_list
Out[29]: ['Python', 'is', 'more', 'flexible']
Answered By: anshu kumar

If you want to split by 1 or more occurrences of a delimiter and don’t want to just count on the default split() with no parameters happening to match your use case, you can use regex to match the delimiter. The following will use one or more occurrences of . as the delimiter:

s = 'a.b....c......d.ef...g'
sp = re.compile('.+').split(s)
print(sp)

which gives:

['a', 'b', 'c', 'd', 'ef', 'g']
Answered By: theferrit32
Categories: questions Tags: , , ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.