CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some

news2025/12/21 18:45:12

问题描述：

在修改代码时，出现入下报错。

发生异常: RuntimeError
CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

分析：

本来以为是GPU卡bug了，但百度了解到这类问题很有可能是数据引起的。

例如数据只有五个类别，却要求分六个类别。

我正在做的是图像分割任务，所以很可能是分割类别出了问题。

产生报错之前我对数据进行了resize()操作，然后我尝试把他替换成centercrop()，发现报错消失了！随后便去查阅了官方文档对于resize()的解释：

torchvision.transforms.functional.resize(
                    img: torch.Tensor, 
                    size: List[int], 
                    interpolation: torchvision.transforms.functional.InterpolationMode = <InterpolationMode.BILINEAR: 'bilinear'>, 
                    max_size: Optional[int] = None, 
                    antialias: Optional[bool] = None) → torch.Tensor

其中关于插值的操作引起了我的注意，再细看：

interpolation (InterpolationMode) – Desired interpolation enum defined by
 torchvision.transforms.InterpolationMode. Default is InterpolationMode.BILINEAR. 
If input is Tensor, only InterpolationMode.NEAREST, InterpolationMode.BILINEAR and 
InterpolationMode.BICUBIC are supported. For backward compatibility integer values (e.g. 
PIL.Image[.Resampling].NEAREST) are still accepted, but deprecated since 0.13 and will be
 removed in 0.15. Please use InterpolationMode enum.

默认进行的是双线性插值，这种插值是选择临近的四个像素值，计算出新的插入值，如图：

于是焕然大悟，我的Groudtruth里面本应该只有0，1分别代表前景背景，但进行了双线性插值，会产生除0，1以为的点，网络无法识别才会报错。

解决方法：

将resize()中关于插值的参数改为临近值插值法，即选择最近的一个点的像素值进行插值，这样不会产生新的像素值。

resize(target, (self.size), interpolation = InterpolationMode.NEAREST)

如此便可。

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.coloradmin.cn/o/500533.html

如若内容造成侵权/违法违规/事实不符，请联系多彩编程网进行投诉反馈，一经查实，立即删除！

CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some

问题描述：

分析：

解决方法：

相关文章

Android Switch开关按钮使用和自定义样式（系列教程五）

Vivado安装后添加器件库

云原生: istio+dapr构建多运行时服务网格

刷题练习3

牛顿迭代法解超越方程

~项目启动~

仪表检测与读数（一）：仪表检测

[ 云计算 | Azure ] Chapter 06 | 计算服务之虚拟机、虚拟机规模集、Azure 容器、Azure App 与 Azure Functions

【Qt】插件Plugin入门之Q_PLUGIN_METADATA()宏【2023.05.07】

Linux命令·netstat

详细版简单易学版TypeScript各类型声明

Python中模块和包基础学习

2.1 掌握NumPy数组对象ndarray

Python中异常处理的学习

C语言-学习之路-07

C嘎嘎~~ [类下篇(2)]

ADAS-透视前方：汽车HUD技术原理解析

HZNUCTF2023 web

03- 目标检测数据集和标注工具介绍 (目标检测)

JWT快速入门及日常使用