> ## Documentation Index
> Fetch the complete documentation index at: https://docs.vaero.co/llms.txt
> Use this file to discover all available pages before exploring further.

# Styling

> Transform your AI style

## Basics

Use the inference endpoint to perform rewriting with your fine-tuned model. The inference endpoint has two modes: `full` and `rewrite`

### Full Mode

Full mode performs inference from a standard AI model such as `gpt-5.4` or `Claude 4.6 Sonnet` and then styles the output of the AI with your fine-tuned model. This provides a full end-to-end inference process that is similar to using the OpenAI or Anthropic API.

Specify the fine-tuned model ID in `model`.

Provide the list of messages in the converations in `messages`.

The `full_mode_options` parameter enables setting `base_model` and `base_temperature` to specify the model and temperature for the base AI model for inference.

### Rewrite Mode

Rewrite mode uses your fine-tuned model to rewrite text that you provide it. In this mode, you will have generated text using an AI model separately, and you submit this AI-generated text to our endpoint to be rewritten.

Specify the fine-tuned model ID in `model`.

Provide the input text as a string in `message`.

No conversational turns or instructions to the AI need to be submitted.

If you have a large block of multi-paragraph text, include all of it in a single API call. You don't need to segment the text into separate calls.

The styled content is returned at `response["choices"][0]["message"]["content"]`

## Streaming

Set the `stream` option to `True` to enable streaming responses. Streaming can be used in `full` or `rewrite` mode.