
Fixes bedrock modelId encoding for Inference Profiles #9123

Conversation

@omrishiv (Contributor) commented Mar 11, 2025

Title

Make sure to encode Bedrock model IDs if they are used as the modelId

Relevant issues

Fixes #8911

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory; adding at least 1 test is a hard requirement - see details
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests (make test-unit) - see https://docs.litellm.ai/docs/extras/contributing_code
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🐛 Bug Fix
✅ Test

Changes

Encodes the model if it is being used as the modelId.

[Screenshot: new test passing locally, 2025-03-10]


Signed-off-by: omrishiv <327609+omrishiv@users.noreply.github.com>
vercel bot commented Mar 11, 2025

litellm: ✅ Ready (preview updated Mar 11, 2025 3:58pm)

@omrishiv omrishiv changed the title 8911 fix model encoding Fixes bedrock modelId encoding for Inference Profiles Mar 11, 2025
@ishaan-jaff (Contributor) left a comment:

does this work for litellm.completion and litellm.acompletion ?

@ishaan-jaff (Contributor) reviewed:

import litellm


def test_encode_model_id_with_inference_profile():
Contributor:

can you add a mock e2e test for completion()? you can mock the http response - test_bedrock_completion.py has some examples

Contributor Author:

Not sure I understand; this is addressing an incorrect URL encoding. I am happy to update the now-failing tests that assert the incorrect URL. Does that work?
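
(For reference, a rough sketch of the kind of mocked completion() test being discussed, assuming an HTTPHandler can be passed via client= and its post() patched to capture the request URL; the handler import path, dummy credentials, and assertion below are illustrative and not taken from this PR:)

from unittest.mock import patch

import litellm
from litellm.llms.custom_httpx.http_handler import HTTPHandler


def test_completion_encodes_inference_profile_model_id():
    # Client whose post() we intercept so no real AWS call is made.
    client = HTTPHandler()

    with patch.object(client, "post", side_effect=RuntimeError("stop after URL is built")) as mock_post:
        try:
            litellm.completion(
                model="bedrock/us.meta.llama3-3-70b-instruct-v1:0",
                messages=[{"role": "user", "content": "hi"}],
                client=client,
                aws_access_key_id="fake",        # dummy credentials, only used for request signing
                aws_secret_access_key="fake",
                aws_region_name="us-east-1",
            )
        except Exception:
            pass  # we only care about the request URL, not the response

    mock_post.assert_called_once()
    call = mock_post.call_args
    url = call.kwargs.get("url") or (call.args[0] if call.args else "")
    # The ":" in the inference-profile model id should be percent-encoded.
    assert "us.meta.llama3-3-70b-instruct-v1%3A0" in url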

@@ -274,7 +274,7 @@ def completion(  # noqa: PLR0915
         if modelId is not None:
             modelId = self.encode_model_id(model_id=modelId)
         else:
-            modelId = model
+            modelId = self.encode_model_id(model_id=model)
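
(For context: encode_model_id essentially URL-encodes the model string before it is placed in the request path. A minimal sketch of that behavior, not necessarily the exact LiteLLM implementation:)

import urllib.parse


def encode_model_id(model_id: str) -> str:
    # Percent-encode reserved characters such as "/" and ":" so that an
    # inference-profile id (or ARN) stays a single path segment in the
    # Bedrock invoke URL.
    return urllib.parse.quote(model_id, safe="")


# encode_model_id("us.meta.llama3-3-70b-instruct-v1:0")
# -> "us.meta.llama3-3-70b-instruct-v1%3A0"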
Contributor:

if I pass model = "us.anthropicxxx" would that get encoded too? Is that intended ?

Contributor Author:

If I use us.amazon.nova-pro-v1:0 it still works

Contributor:

what does it get encoded to?

Contributor:

I'm concerned doing this for all models might have an unintended side effect. What do you think @omrishiv?

Contributor Author:

It looks like boto3 actually encodes all the requests:

https://bedrock-runtime.us-east-1.amazonaws.com/model/bedrock%2Fus.meta.llama3-3-70b-instruct-v1%3A0/invoke

This will require updating a lot of the test assertions to expect the new URL - is that ok?
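
(For illustration, the kind of change in expected URLs this implies; the base URL below is just the example from this thread:)

import urllib.parse

model = "us.meta.llama3-3-70b-instruct-v1:0"
base = "https://bedrock-runtime.us-east-1.amazonaws.com/model"

old_expected = f"{base}/{model}/invoke"                               # raw ":" in the path
new_expected = f"{base}/{urllib.parse.quote(model, safe='')}/invoke"  # ":" becomes %3A

# new_expected == "https://bedrock-runtime.us-east-1.amazonaws.com/model/us.meta.llama3-3-70b-instruct-v1%3A0/invoke"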

@omrishiv (Contributor Author):

> does this work for litellm.completion and litellm.acompletion ?

It does work with both completion and acompletion

@ishaan-jaff (Contributor) left a comment:

also can this be added to docs https://docs.litellm.ai/docs/providers/bedrock

@omrishiv (Contributor Author):

> also can this be added to docs https://docs.litellm.ai/docs/providers/bedrock

Do you mean using the inference profile? I'm not sure what needs to be updated; this is just making the URL encoding conform properly.

@omrishiv (Contributor Author):

@ishaan-jaff @krrishdholakia, please take a look; as discussed, I think this is ready

@krrishdholakia krrishdholakia changed the base branch from main to litellm_dev_03_12_2025_contributor_prs_p2 March 13, 2025 17:42
@krrishdholakia krrishdholakia merged commit 2c011d9 into BerriAI:litellm_dev_03_12_2025_contributor_prs_p2 Mar 13, 2025
2 checks passed
Successfully merging this pull request may close these issues.

[Bug]: Litellm does not work with AWS Bedrock Application Inference profiles