python re.match()用法相关示例

脚本专栏 2024/11/17 佚名

2 0 1

神剑山庄资源网 Design By www.hcban.com

学习python爬虫时遇到了一个问题，书上有示例如下：

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*)are(.*"htmlcode">

matchObj=re.match(r'(.*)are(.*"htmlcode">

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*)are(.*"matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))
 print("matchObj.group(3):", matchObj.group(3))
else:
 print('No match!\n')




得到的结果是：

matchObj.group(): Cats are smarter than dogs

matchObj.group(1): Cats 

matchObj.group(2): 

matchObj.group(3):  smarter than dogs



可见第二个括号里的内容被默认为空了，然后删去那个？，可以看到结果变成：

matchObj.group(): Cats are smarter than dogs

matchObj.group(1): Cats 

matchObj.group(2):  smarter than dogs

matchObj.group(3): 



那么这是否就意味着？的默认值很可能是0次，那？这个符号到底有什么用呢
仔细想来这个说法并不是很严谨。尝试使用单独的."htmlcode">

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are(.*)"matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))




也能在组别2中正常提取到are之后的字符内容，但稍微改动一下将？放到第二个括号内，
就什么也提取不到，同时导致group(0)中匹配的字符到Cats are就截止了（也就是第二个括号匹配失败）。
令人感到奇怪的是，如果将上面的代码改成


import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are (.*)+',line)

if matchObj:
 print("matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))




也就是仅仅将？改为+，虽然能成功匹配整个line但group(2)中没有内容，
如果把+放到第二个括号中就会产生报错，匹配失败。
那么是否可以认为.*"htmlcode">

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are (.*r).*',line)

if matchObj:
 print("matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))
 #print("matchObj.group(3):", matchObj.group(3))
else:
 print('No match!\n')




为了泛用性尝试了一下把r改成‘ '但是得到的结果是‘smarter than '。于是尝试把.换成表示任意字母的
[a-zA-Z]，成功提取出了单个smarter，代码如下：


import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are ([a-zA-Z]* ).*',line)

if matchObj:
 print("matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))
 #print("matchObj.group(3):", matchObj.group(3))
else:
 print('No match!\n')

python,re.match(),python,re.match

标签：

python,re.match(),python,re.match

神剑山庄资源网 Design By www.hcban.com

神剑山庄资源网 免责声明：本站文章均来自网站采集或用户投稿，网站不提供任何软件下载或自行开发的软件！如有用户或公司发现本站内容信息存在侵权行为，请邮件告知！ 858582#qq.com

神剑山庄资源网 Design By www.hcban.com

评论“python re.match()用法相关示例”

暂无python re.match()用法相关示例的评论...

《魔兽世界》大逃杀！60人新游玩模式《强袭风暴》3月21日上线

暴雪近日发布了《魔兽世界》10.2.6 更新内容，新游玩模式《强袭风暴》即将于3月21 日在亚服上线，届时玩家将前往阿拉希高地展开一场 60 人大逃杀对战。

艾泽拉斯的冒险者已经征服了艾泽拉斯的大地及遥远的彼岸。他们在对抗世界上最致命的敌人时展现出过人的手腕，并且成功阻止终结宇宙等级的威胁。当他们在为即将于《魔兽世界》资料片《地心之战》中来袭的萨拉塔斯势力做战斗准备时，他们还需要在熟悉的阿拉希高地面对一个全新的敌人──那就是彼此。在《巨龙崛起》10.2.6 更新的《强袭风暴》中，玩家将会进入一个全新的海盗主题大逃杀式限时活动，其中包含极高的风险和史诗级的奖励。
《强袭风暴》不是普通的战场，作为一个独立于主游戏之外的活动，玩家可以用大逃杀的风格来体验《魔兽世界》，不分职业、不分装备（除了你在赛局中捡到的），光是技巧和战略的强弱之分就能决定出谁才是能坚持到最后的赢家。本次活动将会开放单人和双人模式，玩家在加入海盗主题的预赛大厅区域前，可以从强袭风暴角色画面新增好友。游玩游戏将可以累计名望轨迹，《巨龙崛起》和《魔兽世界：巫妖王之怒经典版》的玩家都可以获得奖励。

更新日志

2024年11月17日

python re.match()用法相关示例

python,re.match(),python,re.match

Python爬虫实现selenium处理iframe作用域问题

python利用appium实现手机APP自动化的示例

评论“python re.match()用法相关示例”

《魔兽世界》大逃杀！60人新游玩模式《强袭风暴》3月21日上线

更新日志

友情链接