How to Check Gemini CLI Usage and Cost

Gemini CLI has no built-in token spend view — an open feature request at Google. Here is how to surface your usage and cost from the logs it already writes.

By Arham WaniLast updated June 2026 whoburnedmore guide

Quick answer

To check Gemini CLI token usage and cost, run npx whoburnedmore. Unlike Claude Code or Codex CLI, Gemini CLI has no built-in command to show your token spend — there is an open feature request for this on the Google GitHub repo. whoburnedmore fills the gap by reading Gemini CLI's local session logs and computing your total tokens and estimated cost per day and model. ⚡

Google launched Gemini CLI as a free, open-source terminal agent in mid-2025. It quickly attracted developers who wanted a generous free tier alongside Claude Code and Codex CLI. But there is a notable gap in the tooling: while Gemini CLI tracks usage internally to enforce rate limits, it does not expose a spend view to the user. You cannot type a command and see how many tokens you burned today or what the equivalent dollar cost would be at paid-tier rates. That gap is what whoburnedmore closes.

Why doesn't Gemini CLI show my token usage?

The short answer: it was not a priority at launch. The free tier is metered by requests per minute and per day rather than tokens, so the UI focused on request counts. Token spend only matters once you leave the free tier and start billing by the token — and by then Google expects you to use the Cloud Console billing dashboard or the AI Studio usage page.

The problem is that neither the Cloud Console nor AI Studio know about usage from the CLI unless you are authenticated with an API key. If you are using Gemini CLI on the free tier with OAuth login, your usage is invisible in every Google dashboard. The feature request tracking a native /usage command for Gemini CLI has been open since late 2025.

Free tier → paid tier surprises

Gemini CLI auto-routes heavy requests to larger models. If your session runs long or you are processing a large codebase, it may escalate from Gemini Flash to Gemini Pro mid-session. At paid API rates, the cost difference is significant. Without a spend view, you will not notice until a bill arrives. 📊

Where does Gemini CLI store its session logs?

Despite the missing UI, Gemini CLI does write structured logs to disk. By default, they live under ~/.gemini/sessions/ on macOS and Linux. Each session is a JSONL file with one record per turn, including the model name, token counts, and timestamp. whoburnedmore reads these to reconstruct your usage history:

Gemini CLI

writes logs locally

~/.gemini/sessions/

JSONL per session

whoburnedmore

parses + aggregates

Token + cost view

daily & model split

whoburnedmore bridges the gap Google's own UI leaves: log files exist, but there's no native viewer.

zsh — inspect Gemini CLI session logs

$ ls ~/.gemini/sessions/2026-06-14_session_a1b2c3.jsonl2026-06-13_session_d4e5f6.jsonl2026-06-12_session_g7h8i9.jsonl $ head -1 ~/.gemini/sessions/2026-06-14_session_a1b2c3.jsonl | python3 -m json.tool | grep -E 'model|token'    "model": "gemini-2.0-flash",    "inputTokenCount": 24381,    "outputTokenCount": 1847,

What counts as a Gemini CLI token?

Google uses the same transformer-style tokenization for Gemini models as other providers. Roughly speaking, one token maps to three or four characters of English text. Code is denser — a 200-line Python file typically encodes to 1,500–2,500 tokens depending on naming conventions and whitespace. Gemini 2.0's context window is 1M tokens, which sounds enormous until you realize a large codebase passed repeatedly to the model can fill it within a handful of sessions.

What does Gemini CLI actually cost?

On the free tier, cost in dollars is zero — up to the rate limits. Beyond that, billing switches to pay-per-token. whoburnedmore computes both the token total and what that usage would cost at the current Gemini API rates, so you can monitor whether you are approaching the threshold where billing kicks in or — if you are already on a paid plan — how your bills are accruing:

cost=in_Flash×r_Flash,in+out_Flash×r_Flash,out+in_Pro×r_Pro,in+out_Pro×r_Pro,out

Gemini Flash and Gemini Pro have separate input/output rates; the model split matters more than the total token count.

At mid-2026 rates: Gemini 2.0 Flash is around $0.075 per million input tokens and $0.30 per million output tokens — inexpensive for individual requests, but sessions that repeatedly send large context windows accumulate quickly. Gemini Pro is priced roughly 10x higher. A month where 20% of your requests were quietly routed to Pro instead of Flash can increase your bill by 2–3x.

zsh — ~/work

$ npx whoburnedmore↳ scanning ~/.gemini/sessions/…↳ found 412 entries across 31 sessions   GEMINI CLI USAGE — last 30 days  ─────────────────────────────────────────────  2026-06-14   Flash  1,841,200   Pro    287,400  2026-06-13   Flash  2,304,700   Pro     94,100  2026-06-12   Flash    987,300   Pro    341,900   MODEL BREAKDOWN  gemini-2.0-flash       28.1M tokens   $2.11  gemini-2.0-pro          4.0M tokens   $3.98   30-day total:  32.1M tokens   est. $6.09

Note June 12: Pro tokens were 35% of that day's total, which drove a disproportionate share of the estimated cost. Gemini CLI routed those requests automatically — whoburnedmore makes the split visible so you can decide whether to configure a model preference in your ~/.gemini/config.json.

Which Gemini CLI free-tier limits should I watch?

The free tier enforces two independent ceilings. Hitting either one pauses your requests until the window resets. Neither is visible in the CLI itself — you find out when requests start returning 429 RESOURCE_EXHAUSTED:

Requests per minute (RPM)

Gemini 2.0 Flash allows 15 requests per minute on the free tier. A rapid tool-call loop can hit this ceiling in under two minutes, causing cascading delays as the agent retries.

Requests per day (RPD)

Daily request limits vary by model. For Gemini 2.0 Flash the free ceiling is 1,500 requests per day. whoburnedmore tracks your daily request count alongside token totals so you can see which limit you are approaching.

Method	Token spend	Model split	Daily history	Free tier alerts
Gemini CLI (built-in)	—	—	—	—
Cloud Console / AI Studio	API key only	partial		—
whoburnedmore				via history

Google's own dashboards only see API-key usage. OAuth/free-tier CLI sessions are only visible through the local logs.

Compare against your other AI tools

Gemini CLI's low Flash pricing can make it look cheap — until you add it to the same table as your Claude Code and Codex spend. Run npx whoburnedmore for the combined view, or see the cross-tool AI usage guide to understand how all your agents add up.

Running whoburnedmore in local mode

1
Standard run (leaderboard)
npx whoburnedmore — scans all supported tools, prints the breakdown, and optionally submits your totals to the public leaderboard.
2
Local-only mode
npx whoburnedmore --local — opens a private browser dashboard with the same charts, nothing submitted. Useful if you want the visual without sharing.
3
Dry run to inspect what is shared
npx whoburnedmore --dry-run — prints the exact JSON payload that would be sent, so you can verify only aggregate totals are included, not prompts or file names. 🛡️

native spend commands in Gemini CLI

separate free-tier limits (RPM + RPD)

30+

days of history from local logs

Related guides

How to Check Your AI Coding Token Usage

The cross-tool overview: one command that totals your token usage and cost across every AI coding agent you run.

How to Check Codex CLI Usage and Token Cost

Codex CLI's /status only shows the current window — here's how to see your full history and cost.

AI Coding Cost: Claude Code vs Codex vs Gemini

A real cost comparison of the big three — measured from your logs, not marketing.

← Browse all whoburnedmore guides

How to Check Gemini CLI Usage and Cost

Gemini CLI has no built-in token spend view — an open feature request at Google. Here is how to surface your usage and cost from the logs it already writes.

By Arham WaniLast updated June 2026 whoburnedmore guide

Quick answer

Why doesn't Gemini CLI show my token usage?

Free tier → paid tier surprises

Where does Gemini CLI store its session logs?

Gemini CLI

writes logs locally

~/.gemini/sessions/

JSONL per session

whoburnedmore

parses + aggregates

Token + cost view

daily & model split

whoburnedmore bridges the gap Google's own UI leaves: log files exist, but there's no native viewer.

zsh — inspect Gemini CLI session logs

$ ls ~/.gemini/sessions/2026-06-14_session_a1b2c3.jsonl2026-06-13_session_d4e5f6.jsonl2026-06-12_session_g7h8i9.jsonl $ head -1 ~/.gemini/sessions/2026-06-14_session_a1b2c3.jsonl | python3 -m json.tool | grep -E 'model|token'    "model": "gemini-2.0-flash",    "inputTokenCount": 24381,    "outputTokenCount": 1847,

What counts as a Gemini CLI token?

What does Gemini CLI actually cost?

cost=in_Flash×r_Flash,in+out_Flash×r_Flash,out+in_Pro×r_Pro,in+out_Pro×r_Pro,out

Gemini Flash and Gemini Pro have separate input/output rates; the model split matters more than the total token count.

zsh — ~/work

$ npx whoburnedmore↳ scanning ~/.gemini/sessions/…↳ found 412 entries across 31 sessions   GEMINI CLI USAGE — last 30 days  ─────────────────────────────────────────────  2026-06-14   Flash  1,841,200   Pro    287,400  2026-06-13   Flash  2,304,700   Pro     94,100  2026-06-12   Flash    987,300   Pro    341,900   MODEL BREAKDOWN  gemini-2.0-flash       28.1M tokens   $2.11  gemini-2.0-pro          4.0M tokens   $3.98   30-day total:  32.1M tokens   est. $6.09

Which Gemini CLI free-tier limits should I watch?

Requests per minute (RPM)

Gemini 2.0 Flash allows 15 requests per minute on the free tier. A rapid tool-call loop can hit this ceiling in under two minutes, causing cascading delays as the agent retries.

Requests per day (RPD)

Method	Token spend	Model split	Daily history	Free tier alerts
Gemini CLI (built-in)	—	—	—	—
Cloud Console / AI Studio	API key only	partial		—
whoburnedmore				via history

Google's own dashboards only see API-key usage. OAuth/free-tier CLI sessions are only visible through the local logs.

Compare against your other AI tools

Running whoburnedmore in local mode

1
Standard run (leaderboard)
npx whoburnedmore — scans all supported tools, prints the breakdown, and optionally submits your totals to the public leaderboard.
2
Local-only mode
npx whoburnedmore --local — opens a private browser dashboard with the same charts, nothing submitted. Useful if you want the visual without sharing.
3
Dry run to inspect what is shared
npx whoburnedmore --dry-run — prints the exact JSON payload that would be sent, so you can verify only aggregate totals are included, not prompts or file names. 🛡️

native spend commands in Gemini CLI

separate free-tier limits (RPM + RPD)

30+

days of history from local logs

Related guides

How to Check Your AI Coding Token Usage

The cross-tool overview: one command that totals your token usage and cost across every AI coding agent you run.

How to Check Codex CLI Usage and Token Cost

Codex CLI's /status only shows the current window — here's how to see your full history and cost.

AI Coding Cost: Claude Code vs Codex vs Gemini

A real cost comparison of the big three — measured from your logs, not marketing.

← Browse all whoburnedmore guides