YOLOv5 分类模型 OpenCV和PyTorch两者实现预处理的差异

news2025/4/8 17:56:22

flyfish

PyTorch封装了PIL库
简单对比下两者的使用方法

import cv2
from PIL import Image
import numpy as np

full_path_file_name="/media/a//ILSVRC2012_val_00001244.JPEG"


#OpenCV读取图像默认是BGR顺序
cv_image=cv2.imread(full_path_file_name) #BGR
print(cv_image.shape)
cv_image=cv2.cvtColor(cv_image,cv2.COLOR_BGR2RGB)
#print("cv_image:",cv_image)#(400, 500, 3) HWC

#PIL读取图像默认是RGB顺序
pil_image=Image.open(full_path_file_name)
print("pil_image:",pil_image)
numpy_image=np.array(pil_image)
print(numpy_image.shape)#(400, 500, 3) HWC BGR
#print("numpy_image:",numpy_image)

在这里插入图片描述

这样OpenCV和PIL返回的是相同的数据

如果是height > width的情况下，图像缩放大小是
$\left(\text{size} \times \frac{\text{height}}{\text{width}}, \text{size}\right)$

https://github.com/pytorch/vision/

vision/torchvision/transforms/functional.py

产生的问题
PyTorch中使用transforms.Resize，transforms.Resize使用了双线性插值和抗锯齿antialiasing，与cv2.resize处理不同。所以会造成推理结果有差异

def resize(img: Tensor, size: List[int], interpolation: InterpolationMode = InterpolationMode.BILINEAR,
           max_size: Optional[int] = None) -> Tensor:

The output image might be different depending on its type: when downsampling, the interpolation of PIL images
and tensors is slightly different, because PIL applies antialiasing. This may lead to significant differences
in the performance of a network. Therefore, it is preferable to train and serve a model with the same input
types.

对比下差异

from skimage.metrics import structural_similarity as ssim
from skimage.metrics import peak_signal_noise_ratio as psnr
from skimage.metrics import mean_squared_error as mse


target_size =224

img_w = pil_image.width
img_h = pil_image.height

image_width, image_height =0,0
if(img_h >= img_w):# hw
    image_width, image_height =target_size, int(target_size * img_h / img_w)
else:
    image_width, image_height =int(target_size * img_w  / img_h),target_size
    


print(image_width, image_height)
pil_resize_img = pil_image.resize((image_width, image_height), Image.BILINEAR)

#print("pil_resize_img:",np.array(pil_resize_img))

pil_resize_img=np.array(pil_resize_img)

cv_resize_img0 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_CUBIC)
#print("cv_resize_img:",cv_resize_img0)
cv_resize_img1 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_NEAREST)
cv_resize_img2 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_LINEAR)
cv_resize_img3 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_AREA)
cv_resize_img4 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_LANCZOS4)
cv_resize_img5 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_LINEAR_EXACT)
cv_resize_img6 = cv2.resize(cv_image, (image_width, image_height), interpolation=cv2.INTER_NEAREST_EXACT)


print(mse(pil_resize_img,pil_resize_img))
print(mse(pil_resize_img,cv_resize_img0))
print(mse(pil_resize_img,cv_resize_img1))
print(mse(pil_resize_img,cv_resize_img2))
print(mse(pil_resize_img,cv_resize_img3))
print(mse(pil_resize_img,cv_resize_img4))
print(mse(pil_resize_img,cv_resize_img5))
print(mse(pil_resize_img,cv_resize_img6))

可以使用structural_similarity、peak_signal_noise_ratio 、mean_squared_error对比
这里使用mean_squared_error

0.0
30.721508290816328
103.37267219387755
13.030575042517007
2.272438350340136
36.33767538265306
13.034412202380953
51.2258237670068

PyTorch推荐做法是 Therefore, it is preferable to train and serve a model with the same input types.训练和部署使用相同的输入

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.coloradmin.cn/o/1238521.html

如若内容造成侵权/违法违规/事实不符，请联系多彩编程网进行投诉反馈，一经查实，立即删除！

YOLOv5 分类模型 OpenCV和PyTorch两者实现预处理的差异

相关文章

防止恶意攻击，服务器DDoS防御软件科普

美国云服务器：CN2/纯国际/高防线路介绍

【MATLAB源码-第86期】基于matlab的QC-LDPC码性能仿真，输出误码率曲线。

【计算机方向】通信、算法、自动化、机器人、电子电气、计算机工程、控制工程、计算机视觉~~~~~合集！！！

QT--MP3项目数据库数据表设计与实现_歌曲搜索

AIGC前沿技术与数字创新应用合作交流和论坛发布活动圆满落幕

CNVD-2023-12632：泛微E-cology9 browserjsp SQL注入漏洞复现 [附POC]

AIOps探索 | 应急处置中排障的降本增效方法探索（上）

Power Apps-Timer

Android：Google三方库之Firebase集成详细步骤（一）

vue3中v-for报错 ‘item‘ is of type ‘unknown‘

除夕不放假HR如何做

企业如何选择一款高效的ETL工具

【问题定位】通过看Mybatis源码解决系统问题

二、Gitee使用方法

SQLite3 数据库学习（四）：Qt 数据库基础操作

vite构建项目不能使用require解决方案

微信怎么设置自动回复？

小红书种草干货，秋冬流行趋势速递

Linux常用操作 Vim一般使用 SSH介绍 SSH密钥登录