FastChat 大模型部署推理；Baichuan2-13B-Chat测试、chatglm2-6b测试

news2025/4/17 19:47:00

参考：
https://github.com/lm-sys/FastChat
https://blog.csdn.net/qq128252/article/details/132759107

##安装
pip3 install "fschat[model_worker,webui]"

1、chatglm2-6b测试

python3 -m fastchat.serve.cli --model-path ./chatglm2-6b --num-gpus 2

在这里插入图片描述

web使用

1）启动控制器

python3 -m fastchat.serve.controller

2）启动模型工作

python3 -m fastchat.serve.model_worker --model-path ./chatglm2-6b --num-gpus 2  --host=0.0.0.0  --port=21002

3）web服务启动

python3 -m fastchat.serve.gradio_web_server

在这里插入图片描述
打开网址查看：

2、Baichuan2-13B-Chat测试

##运行命令：

python3 -m fastchat.serve.cli --model-path ./Baichuan2-13B-Chat --num-gpus 4

在这里插入图片描述

1）ValueError: Tokenizer class BaichuanTokenizer does not exist or is not currently imported. 2）offload报错，ValueError: The current `device_map` had weights offloaded to the disk. Please provide an `offload_folder` for them.也需要增加

按照报错信息需要更改：
/site-packages/fastchat/serve/inference.py

增加trust_remote_code=True
在这里插入图片描述

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.coloradmin.cn/o/1034178.html

如若内容造成侵权/违法违规/事实不符，请联系多彩编程网进行投诉反馈，一经查实，立即删除！

FastChat 大模型部署推理；Baichuan2-13B-Chat测试、chatglm2-6b测试

1、chatglm2-6b测试

web使用

2、Baichuan2-13B-Chat测试

1）ValueError: Tokenizer class BaichuanTokenizer does not exist or is not currently imported. 2）offload报错，ValueError: The current `device_map` had weights offloaded to the disk. Please provide an `offload_folder` for them.也需要增加

相关文章

【JVM内存区域及创建对象的过程】

Vue页面快速使用阿里巴巴矢量图标库

python小程序图书馆图书借阅借还管理系统 mbc21

科学数据分析和图形绘制软件GraphPad Prism 9 mac中文版特点介绍

如何利用物联网技术打造新型智能餐饮连锁店

flowable可使用元素介绍

差值结构的顺序偏好

Qt QCustomPlot介绍

gateway之整合sentinel流控降级

【c#-Nuget 包“在此源中不可用”】 Nuget package “Not available in this source“

这些代码转换工具太香了

rocketmq-spring-boot-starter 2.1.0 事务消息移除参数txProducerGroup

vcruntime140_1.dll 无法继续执行代码的修复方法分享

C语言自定义类型讲解：结构体，枚举，联合（1）

Windows 基于Visual Studio 开发Qt 6 注意事项

Matlab信号处理：FFT频谱分辨率

LwIP笔记02：

基于AlgoT1设备改进多源融合定位算法(GNSS+INS+VISION)

2023 年 Android 毕业设计选题推荐，200 道 Android 毕业设计题目，避免踩坑

【算法思想】排序

FastChat 大模型部署推理；Baichuan2-13B-Chat测试、chatglm2-6b测试

1、chatglm2-6b测试

web使用

2、Baichuan2-13B-Chat测试

1）ValueError: Tokenizer class BaichuanTokenizer does not exist or is not currently imported. 2）offload报错，ValueError: The current device_map had weights offloaded to the disk. Please provide an offload_folder for them.也需要增加

相关文章

1）ValueError: Tokenizer class BaichuanTokenizer does not exist or is not currently imported. 2）offload报错，ValueError: The current `device_map` had weights offloaded to the disk. Please provide an `offload_folder` for them.也需要增加