Python 正则表达式

浏览 103 扫码分享 2023-11-19 19:21:29

1, 单个位置的字符串提取
- 1.1 贪婪和非贪婪
- 1.2 用 unicode 匹配中文标点
2, 连续多个位置的字符串提取
- 2.1 groupdict
- 2.2 贪婪和非贪婪

1, 单个位置的字符串提取

1.1 贪婪和非贪婪

import re
str = "0a没有?是非贪婪b加上?是贪婪模式b匹配最远的b"
print(re.findall(r"a(.+?)b",str))
#['没有?是非贪婪b加上?是贪婪模式b匹配最远的']
#['没有?是非贪婪']

1.2 用 unicode 匹配中文标点

import re
content='【神州租车】'
#这个正则表达式unicode编码
#    3010【
#    3011 】
regex_pattern = r".*\u3010(.+)\u3011.*"
print(re.findall(pattern=regex_pattern, string=content))
#['神州租车']

2, 连续多个位置的字符串提取

2.1 groupdict

s='2020-02-06'
reg=re.compile(r'(?P<year>\d{4})-(?P<month>\d{2})-(?P<day>\d{2})')
regMatch = reg.match(s)
if regMatch:
    print(regMatch.groupdict())
else:
    print('没有匹配上')

2.2 贪婪和非贪婪

if __name__ == '__main__':
    fmt = "%(asctime)s - %(filename)s:%(lineno)s - %(name)s - [%(levelname)s] - %(message)s"
    # 忽略大小写,提取所有括号内的内容
    regex = re.compile(r"\((.+?)\)", re.IGNORECASE)
    # 这里的提取,用到了阻止贪婪?,忽略大小写,.+?没有问号匹配的不对
    x=regex.findall(fmt)
    print(x)
['asctime', 'filename', 'lineno', 'name', 'levelname', 'message']

若有收获，就点个赞吧

上一篇:

下一篇:

让时间为你证明

展开/收起文章目录