Can an xinference deployment use deepspeed? #1503
Comments
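What is the main purpose of using deepspeed here?
To save GPU resources.
I don't follow. Are you planning to launch xinf from deepspeed, or have xinf launch deepspeed? What problem are you trying to solve?
When resources are tight during training, deepspeed's strategies make it possible to train with fewer GPU resources. Extending that to inference: when we deploy a model for inference, can we use a similar deepspeed-style strategy to make sensible use of GPU resources?
Same question. I've run into the same problem: I don't know how to use deepspeed in xinf.
I have this requirement too: I need to use deepspeed to pool several small-memory GPUs so they can jointly run inference on a fairly large model.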
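To make the "several small-memory cards, one large model" requirement concrete, here is a rough back-of-the-envelope sketch in Python. It only estimates the memory taken by the weights when they are split evenly across cards. The function names and the `usable_fraction` headroom factor are assumptions made up for illustration; KV cache, activations, and framework overhead are not modelled, so real requirements will be higher.

```python
import math


def weights_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights alone, in GiB.

    bytes_per_param=2 corresponds to fp16/bf16 weights.
    """
    return num_params * bytes_per_param / 2**30


def min_gpus_for_weights(
    num_params: float,
    gpu_mem_gib: float,
    bytes_per_param: int = 2,
    usable_fraction: float = 0.8,  # assumed headroom, not a measured value
) -> int:
    """Smallest number of identical GPUs whose usable memory holds the
    weights, assuming they are sharded evenly across the GPUs."""
    usable_per_gpu = gpu_mem_gib * usable_fraction
    return math.ceil(weights_gib(num_params, bytes_per_param) / usable_per_gpu)


if __name__ == "__main__":
    # Hypothetical case: a 13B-parameter model in fp16 on 16 GiB cards.
    print(f"weights: {weights_gib(13e9):.1f} GiB")
    print(f"cards needed (weights only): {min_gpus_for_weights(13e9, 16)}")
```

A result greater than 1 means the weights alone cannot fit on a single card of that size, so some form of sharding or offloading is needed.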
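Could you give an example of how deepspeed is used?
I haven't actually written any code with it myself; I have only seen the related documentation: the deepspeed tutorials and "ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning".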
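For readers looking for a starting point, the following is a minimal sketch of a DeepSpeed configuration that enables ZeRO stage 3 with parameter offload, the mechanism the ZeRO-Infinity material describes. It only builds and prints the config dictionary. The helper name `build_zero3_offload_config` is hypothetical, and nothing in this thread establishes that xinference can consume such a config; it would be passed to DeepSpeed directly.

```python
import json
from typing import Optional


def build_zero3_offload_config(
    offload_device: str = "cpu",
    nvme_path: Optional[str] = None,
) -> dict:
    """Build a DeepSpeed config dict with ZeRO stage 3 parameter offload.

    Hypothetical helper for illustration; the keys follow the DeepSpeed
    configuration JSON format.
    """
    if offload_device not in ("cpu", "nvme"):
        raise ValueError("offload_device must be 'cpu' or 'nvme'")

    offload_param = {"device": offload_device, "pin_memory": True}
    if offload_device == "nvme":
        if nvme_path is None:
            raise ValueError("nvme_path is required when offloading to NVMe")
        offload_param["nvme_path"] = nvme_path

    return {
        "fp16": {"enabled": True},
        "zero_optimization": {
            "stage": 3,  # partition parameters across GPUs
            "offload_param": offload_param,  # keep parameters off-GPU until needed
        },
        # DeepSpeed expects a batch-size entry even when only running inference.
        "train_micro_batch_size_per_gpu": 1,
    }


if __name__ == "__main__":
    print(json.dumps(build_zero3_offload_config(), indent=2))
```

Offloading trades GPU memory for host-to-device transfer time, so it tends to suit throughput-oriented workloads better than latency-sensitive serving.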
Can an xinference deployment use deepspeed, or is there a setting that reduces the memory taken up by the model parameters?