Wrong token calculation for Gemini API

For some reason, the Gemini API models (2.5 Pro and 2.0 Flash) are counting extra tokens.

A system prompt that counts around 10,000 tokens for other LLMs counts as roughly 40,000 tokens whenever I choose Gemini 2.5 Pro or 2.0 Flash!


Checked a few hours ago; those two APIs are still counting 4x as many tokens as the others. Is this an issue on Google’s end or MS?

@Kuane please retest. The Gemini 2.0 and 2.5 families of models have been fixed to report the token count (not the character count).
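A minimal sketch of one way to retest, using Google's own count_tokens endpoint via the google-generativeai Python SDK as ground truth. The model name, prompt file, and API key here are placeholders, not the original poster's setup:

```python
import google.generativeai as genai

# Assumes an API key is available; "gemini-2.0-flash" is just an example model name.
genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-2.0-flash")

# Load the same system prompt the client was reporting ~40,000 tokens for.
system_prompt = open("system_prompt.txt").read()

# Ask Gemini itself how many tokens the prompt is.
result = model.count_tokens(system_prompt)
print("Gemini-reported tokens:", result.total_tokens)

# Rough sanity check: the character count of English text is typically ~4x the
# token count, which matches the inflated numbers seen before the fix.
print("Character count:", len(system_prompt))
```

If the count the client displays now matches total_tokens rather than the character count, the fix is in effect.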


Yup, seems fixed. Thanks, Jerry!
