
Image & Video Models

Image and video generation models on OLLM, reachable through the OpenAI-compatible API but not wired into the AI SDK provider interface.

Image and video generation models produce visual media from text prompts (and, for some models, from reference images or video).

When to Use

  • Image generation: creating, editing, or inpainting images from a text prompt
  • Video generation: text-to-video, image-to-video, and motion-controlled clips

To understand an existing image rather than generate one, use a Vision model instead.

How to Access

Image and video models are not available through the AI SDK provider. Calling ollm.imageModel() throws a NoSuchModelError, and there is no AI SDK helper for video generation.

Image-output and video-output models are reachable through the OpenAI-compatible OLLM API over raw HTTP. They appear in ollm.listModels() results (for example with 'image' in output_modalities), so you can discover IDs at runtime, but the request itself must be made directly against the gateway endpoint rather than through generateText or streamText.
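
The discover-then-call flow described above might look roughly like the sketch below. It is a minimal illustration under stated assumptions, not the gateway's documented contract: the `id` field on a model record, the `/images/generations` path, and the `model`/`prompt` body fields follow the usual OpenAI Images API convention and are assumed here, and the base URL and model ID are placeholders you would replace with real values.

```typescript
// Assumed shape of a model record. Only output_modalities is named in the
// text above; the id field is an assumption for illustration.
interface ModelInfo {
  id: string;
  output_modalities?: string[];
}

// Return the IDs of models whose output_modalities include the given modality.
function filterByOutputModality(models: ModelInfo[], modality: string): string[] {
  return models
    .filter((m) => (m.output_modalities ?? []).includes(modality))
    .map((m) => m.id);
}

interface ImageRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Build (without sending) a raw HTTP request for image generation.
// Assumption: the gateway accepts an OpenAI-style POST to /images/generations.
function buildImageRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  prompt: string,
): ImageRequest {
  return {
    // Strip trailing slashes so the path is joined with exactly one "/".
    url: `${baseUrl.replace(/\/+$/, "")}/images/generations`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, prompt }),
    },
  };
}

// Send the request with the global fetch (Node 18+ or a browser) and return
// the parsed JSON body. The response shape is not specified here.
async function generateImage(
  baseUrl: string,
  apiKey: string,
  model: string,
  prompt: string,
): Promise<unknown> {
  const { url, init } = buildImageRequest(baseUrl, apiKey, model, prompt);
  const res = await fetch(url, init);
  if (!res.ok) {
    throw new Error(`Image request failed with status ${res.status}`);
  }
  return res.json();
}
```

In practice you would pass the array returned by ollm.listModels() to `filterByOutputModality(models, 'image')`, pick one of the returned IDs, and hand it to `generateImage` together with your gateway base URL and API key. Keeping the request builder separate from the network call makes the payload easy to inspect before anything is sent.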

TEE Catalog

Image generation models running in Trusted Execution Environments, on NEAR infrastructure with Intel TDX + NVIDIA H100 confidential compute.

Model              Provider    Infrastructure
Flux.2 Klein 4B    BFL         near

There are currently no video generation models in the TEE catalog.

ZDR Catalog

Image and video generation models running on Vercel's AI infrastructure with zero data retention provider agreements.

Image Generation

Model                       Provider
Flux Schnell                BFL
FLUX.1 Fill [pro]           BFL
FLUX.1 Kontext Max          BFL
FLUX.1 Kontext Pro          BFL
FLUX.2 [flex]               BFL
FLUX.2 [klein] 4B           BFL
FLUX.2 [klein] 9B           BFL
FLUX.2 [max]                BFL
FLUX.2 [pro]                BFL
FLUX1.1 [pro]               BFL
FLUX1.1 [pro] Ultra         BFL
GPT Image 1                 OpenAI
GPT Image 1 Mini            OpenAI
GPT Image 1.5               OpenAI
GPT Image 2                 OpenAI
Imagen 4                    Google
Imagen 4 Fast               Google
Imagen 4 Ultra              Google
Grok Imagine                xAI
Grok Imagine Image          xAI
Grok Imagine Image Pro      xAI
Recraft V2                  Recraft
Recraft V3                  Recraft
Recraft V4                  Recraft
Recraft V4 Pro              Recraft
Seedream 4.0                ByteDance
Seedream 4.5                ByteDance
Seedream 5.0 Lite           ByteDance

Several Google Gemini models also produce image output (for example Gemini 3 Pro Image, Gemini 3.1 Flash Image Preview, and Nano Banana).

Video Generation

Model                                  Provider
Veo 3.0                                Google
Veo 3.0 Fast Generate                  Google
Veo 3.1                                Google
Veo 3.1 Fast Generate                  Google
Kling v2.5 Turbo Image-to-Video        Kuaishou
Kling v2.5 Turbo Text-to-Video         Kuaishou
Kling v2.6 Image-to-Video              Kuaishou
Kling v2.6 Motion Control              Kuaishou
Kling v2.6 Text-to-Video               Kuaishou
Kling v3.0 Image-to-Video              Kuaishou
Kling v3.0 Text-to-Video               Kuaishou
Seedance 2.0                           ByteDance
Seedance 2.0 Fast                      ByteDance
Seedance v1.0 Lite Image-to-Video      ByteDance
Seedance v1.0 Lite Text-to-Video       ByteDance
Seedance v1.0 Pro                      ByteDance
Seedance v1.0 Pro Fast                 ByteDance
Seedance v1.5 Pro                      ByteDance
Wan v2.5 Text-to-Video Preview         Alibaba
Wan v2.6 Image-to-Video                Alibaba
Wan v2.6 Image-to-Video Flash          Alibaba
Wan v2.6 Reference-to-Video Flash      Alibaba
Wan v2.6 Text-to-Video                 Alibaba
