《使用深度学习三分类模型预测胸部CT中的肺腺癌侵袭性》| 文献速递-基于深度学习的乳房、前列腺疾病诊断系统

news2024/11/26 14:35:24

Title

题目

Predicting Invasiveness of Lung Adenocarcinoma at Chest CT with Deep Learning Ternary Classification Models

《使用深度学习三分类模型预测胸部CT中的肺腺癌侵袭性》

Background

背景

Preoperative discrimination of preinvasive, minimally invasive, and invasive adenocarcinoma at CT informs clinical management decisions but may be challenging for classifying pure ground-glass nodules (pGGNs). Deep learning (DL) may improve ternary classification.

术前在CT上区分非侵袭性、微小侵袭性和侵袭性腺癌对于临床管理决策具有重要意义,但对于纯磨玻璃结节(pGGNs)的分类可能存在挑战。深度学习(DL)可能会改进三分类的准确性。

Method

方法

In this retrospective study, six ternary models for classifying preinvasive, minimally invasive, and invasive adenocarcinoma were developed using a multicenter data set of lung nodules. The DL-based models were progressively modified through framework optimization, joint learning, and an adjudication strategy (simulating a multireader approach to resolving discordant nodule classifications), integrating two binary classification models with a ternary classification model to resolve discordant classifications sequentially. The six ternary models were then tested on an external data set of pGGNs imaged between December 2019 and January 2021. Diagnostic performance including accuracy, specificity, and sensitivity was assessed. The χ2 test was used to compare model performance in different subgroups stratified by clinical confounders.

在这项回顾性研究中,使用多中心肺结节数据集开发了六种用于分类非侵袭性、微小侵袭性和侵袭性腺癌的三分类模型。这些基于深度学习的模型通过框架优化、联合学习和裁定策略(模拟多阅片者方法以解决结节分类不一致)逐步改进,结合了两个二分类模型与一个三分类模型,依次解决分类不一致的问题。随后,在一个外部数据集中对这些三分类模型进行了测试,该数据集包含2019年12月至2021年1月期间拍摄的纯磨玻璃结节(pGGNs)。评估了包括准确率、特异性和敏感性在内的诊断性能,并使用χ²检验比较了按临床混杂因素分层的不同亚组中的模型表现。

Conclusion

结论

Combining framework optimization, joint learning, and an adjudication approach improved DL classification of adenocarcinoma invasiveness at chest CT.

结合框架优化、联合学习和裁定方法,提高了深度学习在胸部CT中对腺癌侵袭性分类的准确性。

Results

结果

A total of 4929 nodules from 4483 patients (mean age, 50.1 years ± 9.5 [SD]; 2806 female) were divided into training (n = 3384), validation (n = 579), and internal (n = 966) test sets. A total of 361 pGGNs from 281 patients (mean age, 55.2 years ± 11.1 [SD]; 186 female) formed the external test set. The proposed strategy improved DL model performance in external testing (P < .001). For classifying minimally invasive adenocarcinoma, the accuracy was 85% and 79%, sensitivity was 75% and 63%, and specificity was 89% and 85% for the model with adjudication (model 6) and the model without (model 3), respectively. Model 6 showed a relatively narrow range (maximum minus minimum) across diagnostic indexes (accuracy, 1.7%; sensitivity, 7.3%; specificity, 0.9%) compared with the other models (accuracy, 0.6%–10.8%; sensitivity, 14%–39.1%; specificity, 5.5%–17.9%).

共计4929个结节来自4483名患者(平均年龄50.1岁±9.5岁[标准差],其中2806名为女性),这些结节被分为训练集(n = 3384)、验证集(n = 579)和内部测试集(n = 966)。外部测试集由281名患者的361个纯磨玻璃结节(pGGNs)组成(平均年龄55.2岁±11.1岁[标准差],其中186名为女性)。结果表明,所提出的策略在外部测试中显著提高了深度学习模型的性能(P < .001)。对于微小侵袭性腺癌的分类,带有裁定策略的模型(模型6)的准确率为85%,灵敏度为75%,特异性为89%;相比之下,没有裁定策略的模型(模型3)的准确率为79%,灵敏度为63%,特异性为85%。与其他模型相比,模型6在诊断指标(准确率、灵敏度、特异性)上的范围相对较窄(准确率范围为1.7%;灵敏度范围为7.3%;特异性范围为0.9%),而其他模型的范围较宽(准确率范围为0.6%–10.8%;灵敏度范围为14%–39.1%;特异性范围为5.5%–17.9%)

Figure

图片

Figure 1: Overview of the framework of the ternary classification models and upgrade processes. Baseline: Flowchart of the radiomics-based model (model 1) (orangedotted box). Upgrade 1: Framework of the deep learning (DL)–based model (model 2) (cyan-dotted box). Upgrade 2: Fusion model (model 3), generated by combining model 1 and model 2 through a joint learning method (purple-dotted box). Upgrade 3: Implementation of an adjudication strategy (simulating a multireader approach) in model 4 (upgraded from model 1), model 5 (upgraded from model 2), and model 6 (upgraded from model 3) (red-dotted box). In the proposed strategy, 3v1 represented binary task 1 (atypical adenomatous hyperplasia [AAH] and adenocarcinoma in situ [AIS] + minimally invasive adenocarcinoma [MIA] vs invasive adenocarcinoma [IAC]), 2v2 represented binary task 2 (AAH and AIS vs MIA + IAC), and 2v1v1 represented the ternary classification (AAH and AIS vs MIA vs IAC). AUC = area under the receiver operating characteristics curve, DFL = discriminative filter learning, 4D = four-dimensional, LASSO = least absolute shrinkage and selection operator, LD = linear discriminant, LR = logistic regression, MLP = multilayer perception, ROI = region of interest, SVM = support vector machines, Xgboost = extreme gradient boosting

图1: 三分类模型的框架及升级过程概览。基线:基于影像组学的模型(模型1)的流程图(橙色虚线框)。升级1:基于深度学习(DL)模型(模型2)的框架(青色虚线框)。升级2:通过联合学习方法将模型1和模型2结合生成的融合模型(模型3)(紫色虚线框)。升级3:在模型4(从模型1升级)、模型5(从模型2升级)和模型6(从模型3升级)中实施裁定策略(模拟多阅片者方法)(红色虚线框)。在所提出的策略中,3v1代表二分类任务1(非典型腺瘤样增生 [AAH] 和原位腺癌 [AIS] + 微小侵袭性腺癌 [MIA] 对比侵袭性腺癌 [IAC]),2v2代表二分类任务2(AAH 和 AIS 对比 MIA + IAC),2v1v1代表三分类(AAH 和 AIS 对比 MIA 对比 IAC)。AUC = 受试者工作特性曲线下面积,DFL = 判别滤波学习,4D = 四维,LASSO = 最小绝对收缩和选择算子,LD = 线性判别分析,LR = 逻辑回归,MLP = 多层感知器,ROI = 感兴趣区,SVM = 支持向量机,Xgboost = 极端梯度提升。

图片

Figure 2: Flowchart of patient inclusion and exclusion criteria for (A) training, validation, and internal test sets and (B) external test set. pGGN = pure groundglass nodule.

图2: 患者纳入和排除标准的流程图,分别用于(A) 训练集、验证集和内部测试集,以及(B) 外部测试集。pGGN = 纯磨玻璃结节。

图片

Figure 3: Receiver operating characteristic curves obtained via the average method of the six ternary classification models of adenocarcinoma invasiveness in (A) internal and (B) external test sets. Model 1 is a radiomics-based model; model 2, a deep learning–based model; model 3, a fusion model generated by combining model 1 and model 2 through a joint learning method; model 4, upgraded model 1 based on the adjudication strategy; model 5, upgraded from model 2 based on the adjudication strategy; model 6, upgraded from model 3 based on the adjudication strategy. Since models 4, 5, and 6 could only generate classification results instead of probabilities, these models generated a single point in the receiver operating characteristic space. AUC = area under the receiver operating characteristic curve

图3: 通过平均方法获得的六种腺癌侵袭性三分类模型的受试者工作特性曲线(ROC),分别用于(A) 内部测试集和(B) 外部测试集。模型1为基于放射组学的模型;模型2为基于深度学习的模型;模型3为通过联合学习方法结合模型1和模型2生成的融合模型;模型4为基于裁定策略升级的模型1;模型5为基于裁定策略升级的模型2;模型6为基于裁定策略升级的模型3。由于模型4、5和6只能生成分类结果而非概率,这些模型在受试者工作特性空间中只生成了一个单点。AUC = 受试者工作特性曲线下面积。

图片

Figure 4: Radar maps of the five diagnostic indexes (accuracy, sensitivity, specificity, precision, and F1 score) in the external test set for (A) all six models and for (B) ternary classification of invasiveness. Each line in the radar map represents the performance of a certain model for a single classification (atypical adenomatous hyperplasia [AAH]/adenocarcinoma in situ [AIS], minimally invasive adenocarcinoma [MIA], or invasive adenocarcinoma [IAC]) according to the five diagnostic indexes, and the area enclosed by the line can be used to visually compare the performance for different classifications of different models. Model 1 is a radiomics-based model; model 2, a deep learning–based model; model 3, a fusion model generated by combining model 1 and model 2 through a joint learning method; model 4, upgraded from model 1 based on the adjudication strategy; model 5, upgraded from model 2 based on the adjudication strategy; model 6, upgraded from model 3 based on the adjudication strategy

图4: 外部测试集中五个诊断指标(准确率、灵敏度、特异性、精确度和F1得分)的雷达图,分别用于(A) 六种模型的整体表现和(B) 侵袭性三分类。雷达图中的每一条线表示某一模型在单一分类(非典型腺瘤样增生 [AAH] / 原位腺癌 [AIS]、微小侵袭性腺癌 [MIA] 或侵袭性腺癌 [IAC])下,根据五个诊断指标的表现,线条所围成的区域可以用来直观地比较不同模型在不同分类上的性能。模型1为基于放射组学的模型;模型2为基于深度学习的模型;模型3为通过联合学习方法结合模型1和模型2生成的融合模型;模型4为基于裁定策略升级的模型1;模型5为基于裁定策略升级的模型2;模型6为基于裁定策略升级的模型3.

图片

Figure 5: Confusion matrices for ternary classification of invasiveness in the external test set for (A) conventional ternary models including model 1 (a radiomics-based model), model 2 (a deep learning–based model), and model 3 (a fusion model generated by combining model 1 and model 2) and for (B) ternary models designed with the adjudication strategy including model 4 (upgraded from model 1), model 5 (upgraded from model 2), and model 6 (upgraded from model 3). The density of each color in confusion matrices displays the number of nodules in certain classification; a darker color indicates a greater number. (B) For the ternary models augmented with the adjudication strategy, the confusion matrices were modified as follows. Top row: The rows of the matrices represent the result of binary classification task 1, and the columns represent the result of binary classification task 2. The four large cells at the intersections of the rows and columns show the classification results (circled in the rounded square) of atypical adenomatous hyperplasia (AAH)/adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), invasive adenocarcinoma (IAC), and paradoxical nodules based on the fusion rule of binary task 1 and binary task 2. Then, each large cell is further divided into three parts according to the actual pathologic classification of the lesion: the upper left number is pathologic AAH/AIS, upper right number is pathologic MIA, and the bottom is pathologic IAC. Middle row: Confusion matrices of the ternary classification model that was used to more accurately identify the paradoxical nodules (ie, nodules simultaneously predicted as an IAC in binary task 1 and as an AAH/AIS in binary task 2). M = classification results generated by model, P = classification results at pathologic examination, T1 = binary classification task 1, T2 = binary classification task 2.

图5: 外部测试集中三分类模型对侵袭性的混淆矩阵,分别用于(A) 常规三分类模型(包括模型1(基于放射组学的模型)、模型2(基于深度学习的模型)和模型3(通过结合模型1和模型2生成的融合模型))和(B) 设计了裁定策略的三分类模型(包括模型4(从模型1升级)、模型5(从模型2升级)和模型6(从模型3升级)。

在混淆矩阵中,每种颜色的密度显示了某一分类中的结节数量;颜色越深,数量越多。(B) 对于采用裁定策略的三分类模型,混淆矩阵的修改如下:

顶部行:矩阵的行表示二分类任务1的结果,列表示二分类任务2的结果。四个大单元格在行列交点处显示了基于二分类任务1和任务2融合规则的非典型腺瘤样增生(AAH)/原位腺癌(AIS)、微小侵袭性腺癌(MIA)、侵袭性腺癌(IAC)和矛盾结节的分类结果(用圆角方框圈出)。每个大单元格进一步根据病理分类进行划分:左上角的数字为病理AAH/AIS,右上角的数字为病理MIA,底部的数字为病理IAC。

中间行:混淆矩阵用于更准确地识别矛盾结节(即在二分类任务1中预测为IAC而在二分类任务2中预测为AAH/AIS的结节)。M = 模型生成的分类结果,P = 病理检查结果,T1 = 二分类任务1,T2 = 二分类任务2。

图片

Figure 6: Heat maps of model 5 (upgraded from the conventional model 2 based on the adjudication strategy) generated by gradient-weighted class activation mapping, or Grad-CAM. Two examples are used to illustrate the mechanism of model classification. (A) The two binary classification models detect preinvasive features (binary classification task 1, model 3v1) and invasive features (binary classification task 2, model 2v2) separately. (B) Complex minimally invasive adenocarcinoma (MIA) nodule correctly classified with model 5 using the proposed strategy but incorrectly classified with the conventional ternary classification model (model 2). AAH = atypical adenomatous hyperplasia, AIS = adenocarcinoma in situ, IAC = invasive adenocarcinoma.

图6: 通过梯度加权类激活映射(Grad-CAM)生成的模型5的热图(模型5是基于裁定策略对常规模型2进行升级的模型)。使用两个示例来说明模型分类的机制。(A) 两个二分类模型分别检测非侵袭性特征(二分类任务1,模型3v1)和侵袭性特征(二分类任务2,模型2v2)。(B) 复杂的微小侵袭性腺癌(MIA)结节在使用所提出的策略的模型5中被正确分类,但在使用常规三分类模型(模型2)中被错误分类。AAH = 非典型腺瘤样增生,AIS = 原位腺癌,IAC = 侵袭性腺癌。

Table

图片

Table 1: Baseline Characteristics of Patients and Pulmonary Nodules in the Data Sets

表1: 数据集中患者和肺部结节的基线特征

图片

Table 2: Diagnostic Indexes of the Six Models in the External Test Set

表2: 六个模型在外部测试集中的诊断指标

图片

Table 3: Range of Diagnostic Indices in Ternary Classification for the Six Models in the External Test Set

表3: 外部测试集中六种模型的三分类诊断指标范围

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.coloradmin.cn/o/2045228.html

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈,一经查实,立即删除!

相关文章

开关电源之结构分析

如有技术问题及技术需求请加作者微信! 开关电源之结构分析 1、开关电源的结构 常用开关电源,主要是为电子设备提供直流电源供电。电子设备所需要的直流电压,范围一般都在几伏到十几伏,而交流市电电源供给的电压为220V(110V),频率为50Hz(60Hz)。开关电源的作用就是把一…

Surface Studio 曾经耀眼的骚气光芒至今依然

Surface Studio 曾经耀眼的骚气光芒至今依然 Surface Studio当时上市价格25988RMB&#xff0c;现在小黄鱼大概在3000的样子可以买到屏幕没有老化的那种。看着那块巨大的5K触摸屏&#xff0c;简直让人垂涎三尺欲罢不能。 一、卓越配置&#xff0c;尽显强大性能 Surface Studio…

freeRTOS之任务调度

本节课的内容是重中之重&#xff0c;对复习操作系统的任务调度也很有帮助。

API接口自定义字段返回,最终解决方案,再也不用写 vo、dto 转换逻辑了

西湖的晚霞真美啊 前言 此套方案太过完美&#xff0c;唯一缺点就是&#xff0c;字段返回基于反射操作&#xff0c;损失一点点性能 效果 API接口指定使用自定义增强注解MoreSerializeField&#xff0c;标注只返回 departName 字段 前端只会接收到 departName 字段 实现原理 …

Python爬虫——Selenium方法爬取LOL页面

文章目录 Selenium介绍用Selenium方法爬取LOL每个英雄的图片及名字 Selenium介绍 Selenium 是一个用于自动化Web应用程序测试的工具&#xff0c;但它同样可以被用来进行网页数据的抓取&#xff08;爬虫&#xff09;。Selenium 通过模拟用户在浏览器中的操作&#xff08;如点击…

进程间通信---管道通信、命名管道、匿名管道详解

什么是通信&#xff1f; 为什么要通信&#xff1f; 如何做到通信&#xff1f; 管道是什么&#xff1f; 管道用来干什么&#xff1f; 管道如何实现通信&#xff1f; 匿名管道是什么&#xff1f; 如何实现&#xff1f; 命名管道是什么&#xff1f; 如何实现&#xff1f; 什么是文…

探究 Element Plus Menu 横向多层级展开组件的 Bug 及解决方案

文章目录 1 ellipsis 是否省略多余的子项&#xff08;仅在横向模式生效&#xff09;问题描述解决方案 2 多个级别的子菜单位置错乱或默认直接展开问题描述解决方案1 index没有设置2 通用策略 1 ellipsis 是否省略多余的子项&#xff08;仅在横向模式生效&#xff09; 问题描述…

Global Structure-from-Motion Revisited golmap论文翻译

Global Structure-from-Motion Revisited Paper&#xff1a;https://demuc.de/papers/pan2024glomap.pdf Code&#xff1a;https://github.com/colmap/glomap 摘要- 从图像中恢复三维结构和摄像机运动一直是计算机视觉研究的热点&#xff0c;被称为运动结构恢复(SfM)。这个问…

【RAG提升技巧】查询改写HyDE

简介 提高 RAG 推理能力的一个好方法是添加查询理解层 ——在实际查询向量存储之前添加查询转换。以下是四种不同的查询转换&#xff1a; 路由&#xff1a;保留初始查询&#xff0c;同时查明其所属的适当工具子集。然后&#xff0c;将这些工具指定为合适的选项。查询重写&…

[论文笔记]ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

引言 今天带来ZeRO: Memory Optimizations Toward Training Trillion Parameter Models的论文笔记。 大型深度模型提供了显著的准确性提升&#xff0c;但训练数十亿到数万亿个参数是具有挑战性的。现有的解决方案&#xff0c;如数据并行和模型并行&#xff0c;存在基本的局限…

汇昌联信做拼多多电商如何提升浏览量?

在拼多多电商平台上&#xff0c;如何提升商品的浏览量是每个卖家都关注的问题。高浏览量不仅意味着更多的潜在客户&#xff0c;还能间接提升销量和店铺的知名度。汇昌联信作为电商运营者&#xff0c;需要采取有效策略来吸引消费者的注意力。 一、优化商品标题和描述 商品标题是…

LDR6328Q:重塑电源管理新境界的取电芯片

在电子设备日益普及的今天&#xff0c;高效、智能的电源管理成为了提升用户体验的关键因素之一。而LDR6328Q作为一款专为设备端设计的取电芯片&#xff0c;凭借其出色的性能和创新的技术&#xff0c;正逐步成为电源管理领域的一颗璀璨新星。本文将深入探讨LDR6328Q取电芯片的特…

大模型面经,不要到处找了,收藏我这一篇就够了

在当前技术快速发展的背景下&#xff0c;大模型领域的职位成为了许多求职者的热门选择。为了帮助大家更好地准备面试&#xff0c;这里整理了一份大模型面试经验分享&#xff0c;涵盖了一些常见的面试流程、可能遇到的技术问题以及面试官可能会问到的行为问题等。 大模型面试经…

什么是实时数据仓库? 优势与最佳实践

在当今数据驱动的世界中&#xff0c;许多企业使用实时数据仓库来满足其分析和商业智能 (BI) 需求。这使他们能够做出更好的决策、推动增长并为客户提供价值。 数据仓库是一种数据存储和管理系统&#xff0c;其设计目标只有一个&#xff1a;管理和分析数据&#xff0c;以实现商…

链表求和问题(面试题)

目录 一题目&#xff1a; 二思路汇总&#xff1a; 三解答代码&#xff1a; 一题目&#xff1a; leetcode题目链接 &#xff1a;面试题 02.05. 链表求和 - 力扣&#xff08;LeetCode&#xff09; 二思路汇总&#xff1a; 想到这道题&#xff0c;可能看起来第一思路就是把它们…

【C++二分查找】2563. 统计公平数对的数目

本文涉及的基础知识点 C二分查找 LeetCode2563. 统计公平数对的数目 给你一个下标从 0 开始、长度为 n 的整数数组 nums &#xff0c;和两个整数 lower 和 upper &#xff0c;返回 公平数对的数目 。 如果 (i, j) 数对满足以下情况&#xff0c;则认为它是一个 公平数对 &…

利用住宅代理优化媒体监控,全面提升品牌管理与市场竞争力

引言 什么是媒体监控&#xff1f;主要用于哪里&#xff1f; 媒体监控面临的主要挑战 住宅代理在媒体监控中的作用 如何利用住宅代理进行媒体监控 总结 引言 在信息化时代&#xff0c;媒体监控&#xff08;media monitoring&#xff09;已成为企业进行品牌声誉管理、市场研…

喵喵蓝牙热敏打印机(下)

目录 前言一、电量、温度、缺纸检测1.电量检测2.针头温度检测3.缺纸检测 二、蓝牙APP通信打印1.蓝牙初始化2.APP通信打印 三、FreeRTOS任务整合 前言 喵喵蓝牙热敏打印机&#xff08;上&#xff09; 内容有点多&#xff0c;就分为了上下两篇。 一、电量、温度、缺纸检测 先启…

Codigger视频会议(Meeting):科技行业软件开发团队的协作革新

在数字化时代&#xff0c;软件开发团队的协作方式正在经历一场革命。远程工作已成为新常态&#xff0c;而Codigger视频会议&#xff08;Meeting&#xff09;作为这场变革的催化剂&#xff0c;正在帮助科技行业的团队突破地理限制&#xff0c;实现无缝协作。 随着科技行业的快速…

Adobe Audition AU 2023-23.6.6.1 解锁版下载和安装教程(专业的音频处理工具)

前言 Audition是Adobe旗下一款非常好用的音频处理工具&#xff0c;软件为用户们提供了功能强大的音频编辑功能和一个相对完善的工作流程&#xff0c;用户们无论是录制音乐、无线电广播还是视频配音&#xff0c;多音频合成&#xff0c;这款软件都能够给你足够的创作动力。audit…