软件测试|深入理解Python中的re.search()和re.findall()区别

news2026/2/15 13:24:53

前言

在Python中，正则表达式是一种强大的工具，用于在文本中查找、匹配和处理模式。re 模块提供了许多函数来处理正则表达式，其中 re.search()和 re.findall() 是常用的两个函数，用于在字符串中查找匹配的模式。本文将深入介绍这两个函数的用法，以及详细的使用示例。

re.search() 函数

re.search() 函数用于在字符串中查找匹配的第一个子串，并返回一个匹配对象。如果找到了匹配，可以通过匹配对象的方法和属性来获取相关信息。

import re

pattern = r'apple'
text = "I have an apple and a banana."

# 在文本中查找第一个匹配的子串
match = re.search(pattern, text)

if match:
    print("Found:", match.group())  # 获取匹配的子串
    print("Start:", match.start())  # 获取匹配的起始位置
    print("End:", match.end())      # 获取匹配的结束位置
else:
    print("No match found.")

re.findall() 函数

re.findall() 函数用于在字符串中查找所有匹配的子串，并返回一个包含所有匹配结果的列表。

import re

pattern = r'\d+'  # 匹配一个或多个数字
text = "I have 3 apples and 5 bananas. Total 8 fruits."

# 查找所有匹配的子串
matches = re.findall(pattern, text)

if matches:
    print("Matches:", matches)  # 获取所有匹配的子串列表
else:
    print("No matches found.")

使用示例

使用 re.search() 查找日期

import re

pattern = r'\d{2}-\d{2}-\d{4}'  # 匹配日期格式：dd-mm-yyyy
text = "Today's date is 31-08-2023."

match = re.search(pattern, text)

if match:
    print("Date found:", match.group())
else:
    print("No date found.")

使用 re.findall() 查找所有链接

import re

pattern = r'https?://\S+'  # 匹配HTTP或HTTPS链接
text = "Here are some links: https://www.example.com and http://google.com"

links = re.findall(pattern, text)

if links:
    print("Links found:", links)
else:
    print("No links found.")

使用 re.findall() 查找电子邮件地址

import re

pattern = r'\w+@\w+\.\w+'  # 匹配基本电子邮件地址
text = "Contact us at support@example.com or info@company.net"

emails = re.findall(pattern, text)

if emails:
    print("Email addresses found:", emails)
else:
    print("No email addresses found.")