feat(llm): Add prompt caching for Anthropic Claude models #12164

Seayon · 2024-12-27T09:43:59Z

Add prompt caching parameters for all Claude-3 series models, supporting tagged text caching to improve response speed. Each model can cache up to 4 text blocks.

Summary

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Resolves #7382

Screenshots

Before

After

Checklist

Important

Please review the checklist below before submitting your pull request.

This change requires a documentation update, included: Dify Document
I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
I've updated the documentation accordingly.
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

crazywoola · 2024-12-27T12:56:25Z

Please fix the lint errors.

Add prompt caching parameters for all Claude-3 series models, supporting tagged text caching to improve response speed. Each model can cache up to 4 text blocks.

Seayon · 2024-12-27T17:03:43Z

Please fix the lint errors.
@crazywoola
Done

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. ⚙️ feat:model-runtime labels Dec 27, 2024

Seayon force-pushed the support-prompt-caching branch from 4fea270 to 1c0365a Compare December 27, 2024 09:46

Seayon force-pushed the support-prompt-caching branch from 1c0365a to 1cecc25 Compare December 27, 2024 16:46

feat(llm): Add prompt caching for Anthropic Claude models

b09f068

Add prompt caching parameters for all Claude-3 series models, supporting tagged text caching to improve response speed. Each model can cache up to 4 text blocks.

Seayon force-pushed the support-prompt-caching branch from b8dc3e1 to b09f068 Compare December 27, 2024 16:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(llm): Add prompt caching for Anthropic Claude models #12164

feat(llm): Add prompt caching for Anthropic Claude models #12164

Seayon commented Dec 27, 2024

crazywoola commented Dec 27, 2024

Seayon commented Dec 27, 2024 •

edited

Loading

feat(llm): Add prompt caching for Anthropic Claude models #12164

Are you sure you want to change the base?

feat(llm): Add prompt caching for Anthropic Claude models #12164

Conversation

Seayon commented Dec 27, 2024

Summary

Screenshots

Before

After

Checklist

crazywoola commented Dec 27, 2024

Seayon commented Dec 27, 2024 • edited Loading

Seayon commented Dec 27, 2024 •

edited

Loading