Text Generation (Vision)
Vision models behave similarly to text models, but they can accept and interpret images as well.
Model Spotlight: Qwen2.5-VL-7B-Instruct (Vision + Language)
Getting Started with Vision-Language Inference
Additional Parameter (for Vision-Language models)
Using the API Directly
cURL Example
Python Example
JavaScript Example
Model Identifier
Response Example (Non-Streaming)
Response Example (Streaming Enabled)
Last updated