
#7809: Permit bedrock application inference profiles #7822

Open
wants to merge 1 commit into main

Conversation

@bioshazard commented Mar 10, 2025

Fixes #7809

Bedrock Chat Model did not have a way to invoke Application Inference Profiles. Solving this was tricky at first: how do you keep all of the modelId-derived metadata parsing?

So I added a modelAlias field that simply overrides this.model in the /invoke URL. It is expected to be URL-encoded. This worked great in my local testing. You can find me on Twitter @bios_hazard.
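
A minimal usage sketch of the proposed option (assuming it lands with the modelAlias name from this PR; the exact field name may change per the review below):

```ts
import { BedrockChat } from "@langchain/community/chat_models/bedrock";

// Sketch only: `modelAlias` is the field proposed in this PR, not a released API.
const chat = new BedrockChat({
  // Still provide a normal modelId so the modelId-derived metadata parsing keeps working.
  model: "anthropic.claude-3-sonnet-20240229-v1:0",
  // URL-encoded application inference profile ARN; it overrides `model` in the final /invoke URL.
  modelAlias: encodeURIComponent(
    "arn:aws:bedrock:us-east-1:1234567890:application-inference-profile/abcdefghi"
  ),
  region: "us-east-1",
});

const res = await chat.invoke("Hello from an application inference profile!");
console.log(res.content);
```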

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Mar 10, 2025
vercel bot commented Mar 10, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

| Name | Status | Preview | Updated (UTC) |
| --- | --- | --- | --- |
| langchainjs-docs | ✅ Ready (Inspect) | Visit Preview | Mar 10, 2025 5:21pm |

1 Skipped Deployment

| Name | Status | Updated (UTC) |
| --- | --- | --- |
| langchainjs-api-refs | ⬜️ Ignored (Inspect) | Mar 10, 2025 5:21pm |

@dosubot dosubot bot added the auto:nit Small modifications/deletions, fixes, deps or improvements to existing code or docs label Mar 10, 2025
@AllenFang (Contributor) commented

The invoke approach is complicated indeed, but this patch looks hacky, like a workaround. I will see what I can do when I have time, unless the LangChain team is okay with this patch. 🙏

@bioshazard (Author) commented Mar 12, 2025

Thanks, I think it's actually pretty clean given that the metadata is parsed from the model id. All of that is unavailable in the inference profile ARN. Curious to hear how you could do it another way with that limitation; I expect you'd need to do an intermediate lookup or supply a second field, as I have.
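
For comparison, a rough sketch of the intermediate-lookup alternative mentioned above, assuming the Bedrock control-plane GetInferenceProfile API via @aws-sdk/client-bedrock (parameter and response field names are my recollection of the SDK and may differ):

```ts
import { BedrockClient, GetInferenceProfileCommand } from "@aws-sdk/client-bedrock";

// Hypothetical helper: resolve the underlying modelId from an application
// inference profile ARN at runtime, so metadata parsing can stay modelId-driven.
async function resolveUnderlyingModelId(profileArn: string): Promise<string | undefined> {
  const client = new BedrockClient({ region: "us-east-1" });
  const profile = await client.send(
    new GetInferenceProfileCommand({ inferenceProfileIdentifier: profileArn })
  );
  // An application inference profile wraps one or more foundation model ARNs;
  // take the first and strip it down to the bare modelId.
  const modelArn = profile.models?.[0]?.modelArn;
  return modelArn?.split("/").pop();
}
```

The trade-off is an extra control-plane call (and the IAM permission for it) on construction, versus simply supplying a second field as this PR does.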

@jacoblee93 (Collaborator) left a comment


Thanks! Small question

```ts
  /**
   * For example, "arn%3Aaws%3Abedrock%3Aus-east-1%3A1234567890%3Aapplication-inference-profile%2Fabcdefghi"
   * will override this.model in the final /invoke URL call.
   * Must still provide `model` as a normal modelId to benefit from all the metadata.
   */
  modelAlias?: string;
```

Should we call this applicationInferenceProfile?

Labels
auto:nit Small modifications/deletions, fixes, deps or improvements to existing code or docs size:S This PR changes 10-29 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close issue #7809.

3 participants