FireAttention V2: 12x faster to make Long Contexts practical for Online Inference

By Dmytro Ivchenko | 6/20/2024