InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 256
Star 2.9k

Code
Issues 141
Pull requests 25
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: InternLM/lmdeploy

[Benchmark] benchmarks on different cuda architecture with mo...

#815 opened Dec 11, 2023 by lvhan028

Open 6

报名参加书生·浦语大模型实战营——两周带你玩转微调部署评测全链路

#890 opened Dec 26, 2023 by vansin

Open

Labels 32 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

141 Open 763 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Bug] Turbomind 后端显存占用翻倍

#1758 opened Jun 11, 2024 by QwertyJack

2 tasks done

[Bug] 判断条件检查

#1757 opened Jun 11, 2024 by seetimee

1 of 2 tasks

[Bug] Key Error loading OpenGVLab/Mini-InternVL-Chat-4B-V1-5

#1756 opened Jun 11, 2024 by HaoLiuHust

2 tasks done

[Bug] tp=4 tp=8 no response

#1755 opened Jun 11, 2024 by zeroleavebaoyang

2 tasks done

[Bug] Official image doesn't work for 4090 on CUDA 12.3 (but works for all other CUDA versions, and works for 12.3 on other GPU types)

#1750 opened Jun 11, 2024 by josephrocca

2 tasks done

[Feature] Low priority: Allow specifying HuggingFace model/repo name in lmdeploy convert

#1749 opened Jun 10, 2024 by josephrocca

[Feature] Support for compact Vision-Language models

#1748 opened Jun 10, 2024 by vody-am

[Bug] xcomposer 4khd lora weight error in lmdeploy

#1747 opened Jun 8, 2024 by ztfmars

2 tasks done

[Feature] min_p sampling parameter

#1745 opened Jun 8, 2024 by josephrocca

[Bug] Many concurrent requests with --enable-prefix-caching AND --quant-policy 8 crashes with: CUDA runtime error: an illegal memory access was encountered /opt/lmdeploy/src/turbomind/utils/allocator.h:231

#1744 opened Jun 8, 2024 by josephrocca

2 tasks done

[Bug] Space is incorrectly removed from start of generated text for /v1/completion endpoint

#1743 opened Jun 8, 2024 by josephrocca

2 tasks done

logits输出有问题[Bug]

#1742 opened Jun 8, 2024 by GZL11

2 tasks done

[Docs] Guidance on setting num_tokens_per_iter and max_prefill_iters to optimal values

#1740 opened Jun 8, 2024 by josephrocca

[Bug] detokenize_incrementally: OverflowError: out of range integral type conversion attempted

#1739 opened Jun 7, 2024 by josephrocca

2 tasks done

[Feature] Speculative Decoding

#1738 opened Jun 7, 2024 by josephrocca

[Bug] 量化模型时无输出

#1735 opened Jun 7, 2024 by NB-Group

2 tasks done

[Feature Request] OpenAI-compatible stop param

#1731 opened Jun 7, 2024 by josephrocca

[Bug] 部署cogvlm2运行时，接受的多个并发之间存在干扰，后面的请求使用前面请求传的图像

#1730 opened Jun 7, 2024 by LRHstudy

1 of 2 tasks

High GPU memory for running InternVL-Chat-V1-5-AWQ awaiting response

#1728 opened Jun 7, 2024 by tairen99

[Feature] Support for THUDM/glm-4v-9b planned feature

#1726 opened Jun 6, 2024 by Iven2132

How to trace multiple GPUs using nsight system

#1722 opened Jun 6, 2024 by sleepwalker2017

[Bug] Mini-InternVL1.5-4B does not suceessfully initialized.

#1721 opened Jun 6, 2024 by cydiachen

1 of 2 tasks

[Bug] internlm2-chat-1_8b模型使用4bit KV量化的时候找不到key_stats.pth awaiting response

#1720 opened Jun 6, 2024 by jxfruit

2 tasks

[Bug] Why does prefix caching change the generated content

#1719 opened Jun 5, 2024 by DayDayupupupup

1 of 2 tasks

[Feature] 想问下有打算支持GLM4V模型吗

#1713 opened Jun 5, 2024 by will-wiki

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly