How to Check Kimi CLI Usage and Cost
Kimi CLI shows the tokens for the run you're staring at — and forgets them the moment you close the terminal. Here's how to recover the whole picture.
Quick answer
npx whoburnedmore— it reads Kimi CLI's local logs, totals your tokens by day and by model, estimates the cost, and ranks you on the leaderboard. Only daily aggregates leave your machine. Kimi's own readout shows just the current run, with no saved spend history. 🔥Kimi CLI is Moonshot AI's terminal coding agent, driven by the Kimi K2 family of models and wired for MCP servers and reusable skills. It is genuinely pleasant to drive from the prompt, but it is built like a tool, not like a billing console: it prints the token count for the turn it just finished, and that number evaporates the instant the session ends. There is no dashboard, no “this month” tab, and nothing that tells you whether last week's big agent loop quietly cost more than you expected.
How do I see my Kimi CLI token usage?
Kimi writes a structured log of each agent run to your machine. You do not have to parse it by hand — one command sweeps those logs and prints a real breakdown:
$ npx whoburnedmore↳ detecting installed coding agents…↳ kimi-cli logs found · 19 runs parsed KIMI CLI USAGE ──────────────────────────────────────── 2026-06-16 in 1,902,400 out 154,300 2026-06-15 in 2,640,100 out 221,800 2026-06-14 in 880,950 out 61,200 BY MODEL kimi-k2 11.4M tokens kimi-k2-turbo 3.2M tokens this month: 14.6M tokens est. cost shown inline
Because whoburnedmore lists Kimi among the agents it understands, you do not configure anything: it discovers the log directory, walks each recorded run, and folds the per-turn token fields into day-level and model-level totals. The K2 turbo variant and the standard K2 model are reported separately, since their per-token rates differ.
What Kimi's built-in readout gives you
Inside an active Kimi session you can glance at the tokens for the current exchange, and that is useful for catching a context window that is ballooning mid-task. What it cannot do is look backwards. Three gaps are worth naming:
- 1
No persistence across sessions
The live token figure belongs to the running process. Quit Kimi and the count resets to zero; yesterday's agent marathon leaves no on-screen trace. - 2
No per-day or per-model rollup
You see one run at a time, never “Tuesday burned 4M tokens” or “K2-turbo was 22% of my spend.” The aggregation simply isn't built in. - 3
No cost translation
Kimi reports tokens, not dollars. Turning K2 token counts into an estimated bill is left entirely to you — which is exactly the chore whoburnedmore automates.
What does Kimi CLI cost?
Moonshot bills the Kimi API by tokens, charging input and output at different rates, with the turbo tier priced apart from the standard model. whoburnedmore does not hard-code those numbers — see Moonshot's pricing for the current figures — but it applies the right rate to each model bucket and sums them so you get a single estimate:
The split matters more than people expect: output tokens are the pricier side, so a few verbose agent runs can dominate the bill even when input dwarfs output by raw count. The donut below shows how a typical month divides across the K2 models and the system prompts the agent keeps replaying:
- kimi-k262%
- kimi-k2-turbo24%
- tool + system replays14%
On a flat plan? The estimate still earns its keep
If you access Kimi through a bundled or subscription tier, the cost whoburnedmore shows is “what this would run at API rates.” That is the honest yardstick for whether your plan is a bargain — and it flags a runaway agent loop before it becomes a monthly habit. 💸How do I track Kimi usage over time?
Tracking trend is the whole point, and it is the one thing the in-session readout can never give you. Because whoburnedmore reads the saved run logs, it can rebuild history you never explicitly recorded — yesterday, last week, the whole month — in three steps:
- 1
Run it on a cadence
Dropnpx whoburnedmoreinto your shell once a day, or just run it whenever you're curious. Each run re-reads the full log, so you never miss a session. - 2
Read the day-by-day column
The daily table makes spikes obvious. A day of heavy MCP tool-calling, where the agent re-sends context on every loop, lights up immediately against quieter days. - 3
Compare model mix month over month
Watch the K2 vs K2-turbo ratio shift as your habits change. Leaning on turbo for speed shows up as a cost line you can actually see and decide about.
| Method | Past days | Cost estimate | Per-model split | Multi-agent |
|---|---|---|---|---|
| Kimi in-session readout | — | — | — | — |
| manual log grep | tedious | — | — | — |
| whoburnedmore | all your tools |
Already run other agents?
If Codex CLI, Gemini CLI, or Claude Code also live on this machine, the same command reports Kimi beside them in one stacked table. The cross-tool usage guide walks through the combined view.Is reading my Kimi logs private?
Your prompts, your file names, and your code never travel anywhere. whoburnedmore lifts only the numeric fields — token counts, model labels, timestamps — and the single thing that can leave your machine is the daily aggregate, and only if you opt to post it to the leaderboard. Want to be sure? Run npx whoburnedmore --dry-run to print exactly what would be submitted, or npx whoburnedmore --local to open a dashboard that never leaves your laptop. 🛡️
command, zero setup
models split out
rollups rebuilt
Kimi CLI gives you a fast, MCP-aware agent and a glimpse of the current run. whoburnedmore supplies the part Moonshot left out of the terminal: a durable, day-and-model view of what you actually burned — and a leaderboard spot to prove it.
Related guides
How to Check Your AI Coding Token Usage
The cross-tool overview: one command that totals your token usage and cost across every AI coding agent you run.
How to Check opencode Token Usage and Cost
opencode keeps detailed logs — here's how to turn them into a token + cost total.
The Best AI Coding Token Trackers in 2026
ccusage vs tokscale vs native dashboards vs whoburnedmore — a free, cross-tool comparison.