Skip to main content

Inference tutorials

These tutorials include quick-start examples, conversation patterns, streaming, structured JSON output, supported models, and production tips.

Chat completions

Learn how to maintain context for chatbot-like interactions, how to receive tokens as they are generated, and more.

Structured JSON output

Useful for apps where you want predictable, structured data.