Last updated July 17, 2024
In AI News

Anthropic Doubles Claude 3.5 Sonnet API’s Output Token Limit to 8K Tokens

This enhancement is now available to developers using the Anthropic API, allowing for longer and more comprehensive AI-generated responses.

Share

Published on July 17, 2024

by Shyam Nandan Upadhyay

Anthropic announced that it has expanded the capabilities of its Claude 3.5 Sonnet AI model by doubling the maximum output token limit from 4,096 to 8,192 tokens.

This enhancement is now available to developers using the Anthropic API, allowing for longer and more comprehensive AI-generated responses.

To access this expanded functionality, developers need to include a specific beta header in their API calls: “anthropic-beta”: “max-tokens-3-5-sonnet-2024-07-15”. This must be added to the extra_headers parameter when creating messages through the API.

The upgrade is reflected in the model parameters interface, which shows a maximum token sampling of 8,192. The temperature setting remains at 0.7, balancing creativity and coherence in outputs.

Developers can implement this feature by updating their API calls with the new header, as demonstrated in the provided code snippet:

This update gives developers greater flexibility in generating extended content, potentially improving applications in areas such as long-form text generation, detailed analysis, and complex problem-solving tasks.

The increased token limit allows Claude 3.5 Sonnet to produce more extensive and nuanced outputs without the need for multiple API calls or text truncation.

In response to user queries on social media, Anthropic’s DevRel team clarified that this change is currently limited to the API and has not yet been implemented on claude.ai. They also expressed hope for a future rollout to the web interface.

The increased token limit opens up new possibilities for developers working with large-scale text generation and complex language tasks.

📣 Want to advertise in AIM? Book here

Shyam Nandan Upadhyay

Shyam is a tech journalist with expertise in policy and politics, and exhibits a fervent interest in scrutinising the convergence of AI and analytics in society. In his leisure time, he indulges in anime binges and mountain hikes.