Replace all occurrences that match regular expression

Question:

I have a regular expression that searches for a string that contains '.00.' or '.11.' as follows:

.*.(00|11)..*

What I would like to do is replace all occurrences that match the pattern with 'X00X' or 'X11X'. For example, the string '.00..0..11.' would result in 'X00X.0.X11X'.

I was looking into the Python re.sub method and am unsure of how to do this effectively. The returned match object only matches on the first occurrence and therefore doesn’t work well. Any advice? Should I just be using a string replace for this task? Thanks.

Asked By: tots_o_tater

||

Answers:

re.sub() (docs for Python 2 and Python 3) does replace all matches it finds, but your use of .* may have caused the regex to match too much (even other occurences of .00. etc.). Simply do:

In [2]: re.sub(r".(00|11).", r"X1X", ".00..0..11.")
Out[2]: 'X00X.0.X11X'

Note that patterns cannot overlap:

In [3]: re.sub(r".(00|11).", r"X1X", ".00.11.")
Out[3]: 'X00X11.'
Answered By: Tim Pietzcker
Categories: questions Tags: ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.