Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG: Accessing a wrong model can cause the entire Docker environment to crash. #1520

Open
majestichou opened this issue May 20, 2024 · 1 comment
Labels
bug Something isn't working gpu
Milestone

Comments

@majestichou
Copy link

Describe the bug

I downloaded the SFR-Embedding-Mistral model weight file from huggingface. I put the model weights in the /home/hw/embedding_and_rerank_model directory and then started the container on machine which ip is xxxx , with the following command
docker run -v /home/hw/embedding_and_rerank_model:/root/models -p 9998:9997 --gpus all xprobe/xinference:latest xinference-local -H 0.0.0.0
After that, I went to http://xxxx:9998/ui and registered the SFR-Embedding-Mistral model. And then I launched the model.
Then I used langchain to access http://xxxx:9998/v1/embeddings/SFR-Embedding-Mistral. Everything is OK. The embedding model worked well.
And then I accidentally wrote the model name wrong, the SFR-Embedding-Mistral name was misspelled as SFR-Embedding-MistraT. Then I used langchain to access http://xxxx:9998/v1/embeddings/SFR-Embedding-MistraT. Of course, I didn't have a normal access model. I then changed the name to the correct one (http://xxxx:9998/v1/embeddings/SFR-Embedding-Mistral)and reconnected the model. At this point, the unexpected happened: I couldn't access the model properly, even though I filled in the API correctly. I tried to access the embedding model with Dify, and it also failed.

Accessing a wrong model can cause the entire Docker environment to crash.

To Reproduce

  1. Xinference docker image--V0.11.0
  2. Dify 0.6.6
  3. langchain 0.1.13
    langchain-community 0.0.31
    langchain-core 0.1.40
    langchain-openai 0.0.5
    langchain-text-splitters 0.0.1
    langchain-together 0.1.0

Expected behavior

Accessing a wrong model can not cause the entire Docker environment to crash. After the model name is corected, the model can still be invoked.

Additional context

As far as I can tell, the wrong model was started using the terminal command line, and then the docker mapped port does not exist.

@XprobeBot XprobeBot added bug Something isn't working gpu labels May 20, 2024
@XprobeBot XprobeBot added this to the v0.11.2 milestone May 20, 2024
@qinxuye
Copy link
Contributor

qinxuye commented May 21, 2024

Thanks for your report, we will address it ASAP.

@XprobeBot XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.3, v0.11.4, v0.12.0, v0.12.1 May 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working gpu
Projects
None yet
Development

No branches or pull requests

3 participants