Black Forest Labs

FLUX.1 [schnell] FP8

accounts/fireworks/models/flux-1-schnell-fp8

Serverless, Image

FLUX.1 [schnell] is a 12-billion-parameter rectified flow transformer that generates images from text descriptions. The FP8 version uses reduced-precision numerics for 2x faster inference. FLUX.1 [schnell] FP8 is deployed as a Flumina app. For more details, see https://huggingface.co/fireworks-ai/FLUX.1-schnell-fp8-flumina

Serverless API

FLUX.1 [schnell] FP8 is available via Fireworks' serverless image API, where you pay per image step (per generated image) and do not need to configure GPUs.

The Fireworks image generation API can be called over HTTP from any language. The API examples below show how to call it. The Flumina repo linked above has the full API details and the implementation code.


API Examples

Generate an image using the text_to_image endpoint of flux-1-schnell-fp8.

import os

import requests

# The model ID appears once in the path, followed by the text_to_image workflow.
url = "https://api.fireworks.ai/inference/v1/workflows/accounts/fireworks/models/flux-1-schnell-fp8/text_to_image"
headers = {
    "Content-Type": "application/json",
    "Accept": "image/jpeg",
    # Read the key from the API_KEY environment variable;
    # "$API_KEY" is not expanded inside a Python string.
    "Authorization": f"Bearer {os.environ['API_KEY']}",
}
data = {
    "prompt": "A beautiful sunset over the ocean"
}

response = requests.post(url, headers=headers, json=data)

if response.status_code == 200:
    # The response body is the raw JPEG requested by the Accept header.
    with open("a.jpg", "wb") as f:
        f.write(response.content)
    print("Image saved as a.jpg")
else:
    print("Error:", response.status_code, response.text)
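
Because the API is plain HTTP, the same call can be wrapped in a small reusable helper. The following is a minimal sketch, not an official client: it uses only the Python standard library (urllib in place of requests), assumes the production host is api.fireworks.ai, and introduces the hypothetical names text_to_image_url, build_request, and generate_image purely for illustration. It sends only the prompt field shown in the example above.

```python
import json
import urllib.request

# Assumption: production host for the workflows API.
BASE_URL = "https://api.fireworks.ai/inference/v1/workflows"
# Model ID as listed on this page.
MODEL_ID = "accounts/fireworks/models/flux-1-schnell-fp8"


def text_to_image_url(model_id: str = MODEL_ID) -> str:
    """Build the text_to_image endpoint URL for a model ID (hypothetical helper)."""
    return f"{BASE_URL}/{model_id.strip('/')}/text_to_image"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Prepare the POST request without sending it (hypothetical helper)."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        text_to_image_url(),
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Accept": "image/jpeg",
            "Authorization": f"Bearer {api_key}",
        },
    )


def generate_image(prompt: str, out_path: str, api_key: str, timeout: float = 60.0) -> str:
    """Send the request and write the returned JPEG bytes to out_path."""
    request = build_request(prompt, api_key)
    # urlopen raises urllib.error.HTTPError for non-2xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        image_bytes = response.read()
    with open(out_path, "wb") as f:
        f.write(image_bytes)
    return out_path
```

A call such as generate_image("A beautiful sunset over the ocean", "a.jpg", api_key) would write the image to a.jpg, with api_key read from wherever the key is stored (for example an environment variable). Splitting out build_request keeps the URL and header construction checkable without making a network call.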

On-demand deployments

On-demand deployments let you run FLUX.1 [schnell] FP8 on dedicated GPUs using Fireworks' high-performance serving stack, with high reliability and no rate limits.

See the On-demand deployments guide for details.
