re.findall 行为怪异

2022-01-29 00:00:00 python regex

问题描述

源字符串是:

# Python 3.4.3
s = r'abc123d, hello 3.1415926, this is my book'

这是我的模式:

pattern = r'-?[0-9]+(\.[0-9]*)?|-?\.[0-9]+'

但是,re.search 可以给我正确的结果:

however, re.search can give me correct result:

m = re.search(pattern, s)
print(m)  # output: <_sre.SRE_Match object; span=(3, 6), match='123'>

re.findall 只是转储一个空列表:

re.findall just dump out an empty list:

L = re.findall(pattern, s)
print(L)  # output: ['', '', '']

为什么不能 re.findall 给我预期的列表:

why can't re.findall give me the expected list:

['123', '3.1415926']


解决方案

s = r'abc123d, hello 3.1415926, this is my book'
print re.findall(r'-?[0-9]+(?:.[0-9]*)?|-?.[0-9]+',s)

您在使用原始模式时不需要转义两次.

You dont need to escape twice when you are using raw mode.

输出:['123', '3.1415926']

返回类型也是字符串的列表.如果您希望返回类型为 integers 和 floats,请使用 map

Also the return type will be a list of strings. If you want return type as integers and floats use map

import re,ast
s = r'abc123d, hello 3.1415926, this is my book'
print map(ast.literal_eval,re.findall(r'-?[0-9]+(?:.[0-9]*)?|-?.[0-9]+',s))

输出:[123, 3.1415926]

相关文章