Python图形验证码的识别：一步步详解

news2026/2/15 17:12:21

在Web开发和自动化测试中，图形验证码的识别是一项常见且重要的任务。图形验证码作为防止自动化攻击的一种手段，通过随机生成包含字符或数字的图片来增加用户验证的难度。然而，对于需要自动化处理的场景，如Web自动化测试或爬虫，图形验证码的自动识别显得尤为重要。本文将详细介绍如何使用Python来识别图形验证码。

1. 准备工作

1.1 安装必要的库

使用tesserocr技术进行图形验证码的识别是一个涉及图像处理和OCR（Optical Character Recognition，光学字符识别）技术的过程。tesserocr是Python的一个OCR识别库，它实际上是对Tesseract-OCR引擎的一层Python API封装，因此其核心是Tesseract-OCR。

安装tesserocr库：

pip install tesserocr Pillow

1.2 配置pytesseract

安装完Tesseract-OCR后，你需要在Python中配置pytesseract以使其能够找到Tesseract的执行文件，这可以通过设置环境变量来实现。

2.图形验证码的获取

图形验证码通常由Web服务器随机生成，并以图片的形式呈现给用户。在Python中，你可以使用requests库来下载这些验证码图片。

import requests  
  
def download_captcha(url, save_path):  
    response = requests.get(url)  
    with open(save_path, 'wb') as f:  
        f.write(response.content)    
# 示例URL和保存路径  
captcha_url = 'http://example.com/captcha.jpg'  
save_path = 'captcha.jpg'  
download_captcha(captcha_url, save_path)

3 .验证码图片的预处理

验证码图片往往包含噪点、干扰线等复杂背景，为了提高识别的准确率，需要对图片进行预处理。

from PIL import Image  
  
def preprocess_image(image_path):  
    image = Image.open(image_path)  
    # 转换为灰度图像  
    image = image.convert('L')  
    # 二值化处理  
    threshold = 127  
    table = []  
    for i in range(256):  
        if i < threshold:  
            table.append(0)  
        else:  
            table.append(1)  
    image = image.point(table, '1')  
    return image  
# 预处理验证码图片  
preprocessed_image = preprocess_image('captcha.jpg')

4. 使用OCR识别验证码

经过预处理后的图片，可以使用pytesseract库来进行OCR识别。

def recognize_captcha(image):  
    text = text = tesserocr.image_to_text(image)  
    return text  
  
# 识别验证码  
captcha_text = recognize_captcha(preprocessed_image)  
print(captcha_text)

4实战演练

写代码验证一下两张图片，命名为‘tset01.png’、‘test02.png’，并放入根目录。
在这里插入图片描述

import tesserocr
from PIL import Image


def preprocess_image(image_path):
    image = Image.open(image_path)
    # 转换为灰度图像
    image = image.convert('L')
    # 二值化处理
    threshold = 127
    table = []
    for i in range(256):
        if i < threshold:
            table.append(0)
        else:
            table.append(1)
    image = image.point(table, '1')
    return image


def recognize_captcha(image):
    text = tesserocr.image_to_text(image)

    return text


if __name__ == '__main__':
    imgs = ['test1.png', 'test2.png']
    for i in range(len(imgs)):
        # 预处理验证码图片
        preprocessed_image = preprocess_image(imgs[i])
        # 识别验证码
        captcha_text = recognize_captcha(preprocessed_image)
        print(captcha_text)

打印结果得到：

PFRT

3n3D

5. 注意事项和高级应用

识别精度：OCR识别的精度受到多种因素的影响，包括验证码的复杂度、预处理的效果等。对于复杂的验证码，可能需要更精细的预处理步骤或使用深度学习技术。
第三方打码平台：如果OCR识别无法满足需求，可以考虑使用第三方打码平台。这些平台通常提供更高的识别精度，但可能需要付费。
持续学习：OCR技术不断发展，可以定期更新Tesseract的识别模型，以提高识别效果。