Infinite output

LLM providers limit how much output a model can generate from a single request. This is usually called the output token limit.

Aider is able to work around this limit with models that support “prefilling” the assistant response. When you use aider with a model that supports prefill, you will see “infinite output” noted in the announcement lines displayed at launch:

Aider v0.58.0
Main model: claude-3-5-sonnet-20240620 with diff edit format, prompt cache, infinite output

Models that support prefill can be primed to think they started their response with a specific piece of text. You can put words in their mouth, and they will continue generating text from that point forward.

When aider is collecting code edits from a model and it hits the output token limit, aider simply initiates another LLM request with the partial response prefilled. This prompts the model to continue where it left off, generating more of the desired response. This prefilling of the partially completed response can be repeated, allowing for very long outputs. Joining the text across these output limit boundaries requires some heuristics, but is typically fairly reliable.

Aider supports “infinite output” for models that support “prefill”, such as:

  • claude-3-5-sonnet-20240620
  • claude-3-haiku-20240307
  • claude-3-opus-20240229
  • claude-3-sonnet-20240229
  • codestral/codestral-2405
  • codestral/codestral-latest
  • deepseek-chat
  • deepseek-coder
  • mistral/codestral-2405
  • mistral/codestral-latest
  • mistral/codestral-mamba-latest
  • mistral/mistral-large-2402
  • mistral/mistral-large-2407
  • mistral/mistral-large-latest
  • mistral/mistral-medium
  • mistral/mistral-medium-2312
  • mistral/mistral-medium-latest
  • mistral/mistral-small
  • mistral/mistral-small-latest
  • mistral/mistral-tiny
  • mistral/open-codestral-mamba
  • mistral/open-mistral-7b
  • mistral/open-mistral-nemo
  • mistral/open-mistral-nemo-2407
  • mistral/open-mixtral-8x22b
  • mistral/open-mixtral-8x7b
  • mistral/pixtral-12b-2409
  • openrouter/anthropic/claude-3.5-sonnet
  • vertex_ai/claude-3-5-sonnet@20240620
  • vertex_ai/claude-3-haiku@20240307
  • vertex_ai/claude-3-opus@20240229
  • vertex_ai/claude-3-sonnet@20240229