A Roundup of Papers and Projects Across Research Directions Related to the Segment Anything Model (SAM)

news2024/11/17 14:39:30

Contents

  • Introduction
  • "Anything" Projects
    • AnyObject
    • AnyGeneration
    • Any3D
    • AnyModel
    • AnyTask
    • AnyX
  • Paper Summary
    • AnyObject
    • AnyGeneration
    • AnyModel
    • AnyTask

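Introduction

The main task families in the "anything" line of work are 2D detection (AnyObject), 3D detection (Any3D), AI generation (AnyGeneration), AI model optimization (AnyModel), AI tasks (AnyTask), and others (AnyX).

  • AnyObject - segmentation, detection, classification, medical imaging, OCR, pose estimation, and more.
  • AnyGeneration - text-to-image generation, editing, inpainting, style transfer, and more.
  • Any3D - 3D generation, 3D segmentation, and more.
  • AnyModel - pruning any model, quantizing any model, and model reuse.
  • AnyTask - LLM controller + ModelZoo, generalized decoding, multi-task learning.
  • AnyX - other topics, such as captioning.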

"Anything" Projects

AnyObject

Title & Authors | Intro | Useful Links

Segment Anything
Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alex Berg, Wan-Yen Lo, Piotr Dollar, Ross Girshick
> Meta Research
> Preprint’23

[Segment Anything (Project)]
[Github]
[Page]
[Demo]

OVSeg: Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Feng Liang, Bichen Wu, Xiaoliang Dai, Kunpeng Li, Yinan Zhao, Hang Zhang, Peizhao Zhang, Peter Vajda, Diana Marculescu
> Meta Research
> Preprint’23

[OVSeg (Project)]
[Github]
[Page]

Learning to Segment Every Thing
Ronghang Hu, Piotr Dollar, Kaiming He, Trevor Darrell, Ross Girshick
> UC Berkeley, FAIR
> CVPR’18

[seg_every_thing (Project)]
[Github]
[Page]

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu and Zhaoyang Zeng and Tianhe Ren and Feng Li and Hao Zhang and Jie Yang and Chunyuan Li and Jianwei Yang and Hang Su and Jun Zhu and Lei Zhang
> IDEA-Research
> Preprint’23

[Grounded-SAM, GroundingDINO (Project)]
[Github]
[Demo]

SegGPT: Segmenting Everything In Context
Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang
> BAAI-Vision
> Preprint’23

[SegGPT (Project)]
[Github]

V3Det: Vast Vocabulary Visual Detection Dataset
Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin
> Shanghai AI Laboratory, CUHK
> Preprint’23

segment-anything-video (Project)
Kadir Nar
[Github]

Towards Segmenting Anything That Moves
Achal Dave, Pavel Tokmakov, Deva Ramanan
> ICCV’19 Workshop

[segment-any-moving (Project)]
[Github]

Semantic Segment Anything
Jiaqi Chen, Zeyu Yang, Li Zhang

[Semantic-Segment-Anything (Project)]
[Github]

Grounded Segment Anything: From Objects to Parts (Project)
Peize Sun and Shoufa Chen
[Github]

GroundedSAM-zero-shot-anomaly-detection (Project)
Yunkang Cao
[Github]

Segment Anything Labelling Tool (SALT) (Project)
Anurag Ghosh
[Github]

Prompt-Segment-Anything (Project)
Rockey
[Github]

SAM-RBox (Project)
Qingyun Li
[Github]

VISAM (Project)
Feng Yan, Weixin Luo, Yujie Zhong, Yiyang Gan, Lin Ma
[Github]

Segment Anything EO tools: Earth observation tools for Meta AI Segment Anything (Project)
Aliaksandr Hancharenka, Alexander Chichigin
[Github]

napari-segment-anything: Segment Anything Model (SAM) native Qt UI (Project)
Jordão Bragantini, Kyle I S Harrington, Ajinkya Kulkarni
[Github]

SAM-Medical-Imaging: Segment Anything Model (SAM) native Qt UI (Project)
Jordão Bragantini, Kyle I S Harrington, Ajinkya Kulkarni
[Github]

OCR-SAM: Combining MMOCR with Segment Anything & Stable Diffusion. (Project)
Zhenhua Yang, Qing Jiang
[Github]

segment-anything-u-specify: using sam+clip to segment any objs u specify with text prompts. (Project)
MaybeShewill-CV
[Github]

Segment Everything Everywhere All at Once
Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, Jianfeng Gao, Yong Jae Lee

[SEEM (Project)]
[Github]

SegDrawer: Simple static web-based mask drawer (Project)
Harry
[Github]

Magic Copy: a Chrome extension (Project)
Harry
[Github]

Track Anything: Segment Anything Meets Videos
Jinyu Yang, Mingqi Gao, Zhe Li, Shang Gao, Fangjing Wang, Feng Zheng

[Track-Anything (Project)]
[Github]
[Demo]

Count Anything (Project)
Liqi Yan
[Github]

Segment-and-Track-Anything (Project)
Zongxin Yang
[Github]

Pose for Everything: Towards Category-Agnostic Pose Estimation
Lumin Xu*, Sheng Jin*, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang
> CUHK, SenseTime
> ECCV’22 Oral

[Pose-for-Everything (Project)]
[Github]

Relate Anything Model (Project)
Zujin Guo*, Bo Li*, Jingkang Yang*, Zijian Zhou*, Ziwei Liu
> MMLab@NTU
> VisCom Lab, KCL/TongJi
[Github]

SegmentAnyRGBD (Project)
Jun Cen, Yizheng Wu, Xingyi Li, Jingkang Yang, Yixuan Pei, Lingdong Kong
> Visual Intelligence Lab@HKUST, HUST, MMLab@NTU, Smiles Lab@XJTU, NUS
[Github]



AnyGeneration

Title & Authors | Intro | Useful Links

High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer
> LMU München, Runway ML
> CVPR’22

[Stable-Diffusion (Project)]
[Github]
[Page]
[Demo]

Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang, Maneesh Agrawala
> Stanford University
> Preprint’23

[ControlNet (Project)]
[Github]
[Demo]

GigaGAN: Large-scale GAN for Text-to-Image Synthesis
Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park
> POSTECH, Carnegie Mellon University, Adobe Research
> CVPR’23
[Page]

Inpaint-Anything: Segment Anything Meets Image Inpainting (Project)
Tao Yu
[Github]

IEA: Image Editing Anything (Project)
Zhengcong Fei
[Github]

EditAnything (Project)
Shanghua Gao, Pan Zhou
[Github]

Segment Anything for Stable Diffusion Webui (Project)
Chengsong Zhang
[Github]

Segment Anything with Clip (Project)
Jinwoo Park
[Github]

ShowAnything: Edit and Generate Anything In Image and Video (Project)
Showlab, NUS
[Github]

Transfer-Any-Style: An interactive demo based on Segment-Anything for style transfer (Project)
LV-Lab, NUS
[Github]



Any3D

Title & Authors | Intro | Useful Links

Anything-3D: Segment-Anything + 3D, Let’s lift the anything to 3D (Project)
LV-Lab, NUS
[Github]

SAM 3D Selector: Utilizing segment-anything to help the region selection of 3D point cloud or mesh. (Project)
Nexuslrf
[Github]

3D-Box via Segment Anything. (Project)
dvlab-research
[Github]

Segment Anything 3D (Project)
Yunhan Yang, Xiaoyang Wu
[Github]



AnyModel

Title & Authors | Intro | Useful Links

DepGraph: Towards Any Structural Pruning
Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
> Learning and Vision Lab @ NUS
> CVPR’23

[Torch-Pruning (Project)]
[Github]
[Demo]

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li and Mingzhu Shen and Jian Ma and Yan Ren and Mingxin Zhao and Qi Zhang and Ruihao Gong and Fengwei Yu and Junjie Yan
> SenseTime Research
> NeurIPS’21

[MQBench (Project)]
[Github]
[Page]

OTOv2: Automatic, Generic, User-Friendly
Tianyi Chen, Luming Liang, Tianyu Ding, Ilya Zharkov
> Microsoft
> ICLR’23

[Only Train Once (Project)]
[Github]

Deep Model Reassembly
Xingyi Yang, Daquan Zhou, Songhua Liu, Jingwen Ye, Xinchao Wang
> LV Lab, NUS
> NeurIPS’22

[Deep Model Reassembly (Project)]
[Github]
[Page]



AnyTask

Title & AuthorsIntroUseful Links

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
> Zhejiang University, MSRA
> Preprint’23

[Jarvis (Project)]
[Github]
[Demo]

TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs
Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan
> Microsoft
> Preprint’23
[Github]

Generalized Decoding for Pixel, Image and Language
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao
> Microsoft
> CVPR’23

[X-Decoder (Project)]
[Github]
[Page]
[Demo]

Pre-Trained Image Processing Transformer
Chen, Hanting and Wang, Yunhe and Guo, Tianyu and Xu, Chang and Deng, Yiping and Liu, Zhenhua and Ma, Siwei and Xu, Chunjing and Xu, Chao and Gao, Wen
> Huawei-Noah
> CVPR’21

[Pretrained-IPT (Project)]
[Github]

OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge, Wenyue Hua, Jianchao Ji, Juntao Tan, Shuyuan Xu, Yongfeng Zhang
> Rutgers University
> Preprint’23

[OpenAGI (Project)]
[Github]



AnyX

Title & Authors | Intro | Useful Links

Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
> SUSTech VIP Lab
> Preprint’23

[Caption Anything (Project)]
[Github]
[Demo]

Image2Paragraph:Transform Image into Unique Paragraph (Project)
Jinpeng Wang
[Github]



Paper Summary

AnyObject

Paper | First Author | Venue | Topic
Segment Anything | Alexander Kirillov | Preprint’23 | Segmentation
Learning to Segment Every Thing | Ronghang Hu | CVPR’18 |
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Shilong Liu | Preprint’23 | Grounding+Detection
SegGPT: Segmenting Everything In Context | Xinlong Wang | Preprint’23 | Segmentation
V3Det: Vast Vocabulary Visual Detection Dataset | Jiaqi Wang | Preprint’23 | Dataset
Pose for Everything: Towards Category-Agnostic Pose Estimation | Lumin Xu | ECCV’22 Oral | Pose

AnyGeneration

Paper | First Author | Venue | Topic
High-Resolution Image Synthesis with Latent Diffusion Models | Robin Rombach | CVPR’22 | Text-to-Image Generation
Adding Conditional Control to Text-to-Image Diffusion Models | Lvmin Zhang | Preprint’23 | Controllable Generation
GigaGAN: Large-scale GAN for Text-to-Image Synthesis | Minguk Kang | CVPR’23 | Large-scale GAN
Inpaint Anything: Segment Anything Meets Image Inpainting | Tao Yu | Preprint’23 | Inpainting

AnyModel

Paper | First Author | Venue | Topic
DepGraph: Towards Any Structural Pruning | Gongfan Fang | CVPR’23 | Network Pruning
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark | Yuhang Li | NeurIPS’21 | Network Quantization
OTOv2: Automatic, Generic, User-Friendly | Tianyi Chen | ICLR’23 | Network Pruning
Deep Model Reassembly | Xingyi Yang | NeurIPS’22 | Model Reuse

AnyTask

Paper | First Author | Venue | Topic
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace | Yongliang Shen | Preprint’23 | Modelzoo + LLM
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs | Yaobo Liang | Preprint’23 | Modelzoo + LLM
Generalized Decoding for Pixel, Image and Language | Xueyan Zou | CVPR’23 | Multi Tasking
Pre-Trained Image Processing Transformer | Chen, Hanting | CVPR’21 | Low-level Vision
