Overview

Welcome to Sentry Block’s Serverless Endpoints

The fastest and easiest way to integrate powerful generative AI models—like Meta’s LLaMA and Stable Diffusion—into your applications. With Sentry Block, there’s no setup required. Endpoints are fully pre-configured and ready to go. Just sign up, log in, and start building.

You can explore and test models directly through our platform’s UI or integrate them into your own workflows using code snippets in curl, Python, or JavaScript.


Key Features

Instant Setup – No configuration needed. Start using endpoints immediately after logging in. ✅ Live Testing – Try models directly from the platform interface. ✅ Developer-Friendly Code – Quick-start examples in curl, Python, and JavaScript to speed up your integration. ✅ Model Variety – Access powerful models like LLaMA 3.1, Stable Diffusion XL, and more. ✅ Scalable Infrastructure – All endpoints run on Sentry Block’s GPU-accelerated backend. ✅ Flexible Configurations – Adjust settings to better fit your use case or workload requirements.


Prerequisites

  • A Sentry Block account

  • At least $0.01 credit balance in your account to start using services


Pricing and Billing

💳 Pay-as-you-go model – Charges are calculated based on usage (tokens processed). For full pricing details, visit our [Pricing Page].


Types of Inference

To get started with: → Text Generation, see [Text Generation] → Image Generation, see [Image Generation] → Embedding Generation, see [Embedding Generation]

Last updated