Inference tutorials
These tutorials include quick-start examples, conversation patterns, streaming, structured JSON output, supported models, and production tips.
Chat completions
Learn how to maintain context for chatbot-like interactions, how to receive tokens as they are generated, and more.
Structured JSON output
Useful for apps where you want predictable, structured data.