Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

xinference部署能使用deepspeed吗? #1503

Open
wangyongpenga opened this issue May 16, 2024 · 8 comments
Open

xinference部署能使用deepspeed吗? #1503

wangyongpenga opened this issue May 16, 2024 · 8 comments
Labels
question Further information is requested
Milestone

Comments

@wangyongpenga
Copy link

xinference部署能使用deepspeed或者设置参数能减小参数的使用?

@wangyongpenga wangyongpenga added the question Further information is requested label May 16, 2024
@XprobeBot XprobeBot added this to the v0.11.1 milestone May 16, 2024
@qinxuye
Copy link
Contributor

qinxuye commented May 16, 2024

使用 deepspeed 主要作用是?

@wangyongpenga
Copy link
Author

节省gpu

@qinxuye
Copy link
Contributor

qinxuye commented May 16, 2024

没明白,你准备用 deepspeed 拉 xinf 还是 xinf 拉 deepspeed ,要解决什么问题?

@wangyongpenga
Copy link
Author

就是如果资源比较紧张的话训练的时候,能使用deepspeed进行策略调用用比较少的gpu资源进行训练,推挤到推理的时候,我们部署推理模型的时候是不是也能使用deepspeed类似的策略进行gpu资源的合理使用

@Ilovecode93
Copy link

同问,我也遇到了同样的问题,xinf 中不知道如何使用deepspeed

@XprobeBot XprobeBot modified the milestones: v0.11.1, v0.11.2 May 17, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024
@SeesawLiu
Copy link

我也遇到这样的需求,需要使用deepspeed集合多张小显存卡共同进行推理比较大的模型

@qinxuye
Copy link
Contributor

qinxuye commented May 30, 2024

deepspeed怎么使用的能给个例子吗

@SeesawLiu
Copy link

deepspeed怎么使用的能给个例子吗

我具体也没有写过,只看到相关的文档 deepspeed tutorials ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning

@XprobeBot XprobeBot modified the milestones: v0.11.3, v0.11.4 May 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

5 participants