FinOps for AI coding tools

The AI coding meter for every agent you run.

$2,340/mo · across every account, Mac & cloud

Ottto reconciles subscriptions, API usage, credits, quota windows, and cloud bills into one number you can defend. No proxy in the request path, no added latency, no mystery totals.

Free to start · no credit card · 3 minutes to get started
ACCOUNTS & USAGE
  • Claude Max · ron@gmail.com · sub · $580
  • Claude Team · ron@northwind.com · 3 seats · $420
  • Claude API · pay-as-you-go · sk-…a3f · $340
  • Codex Pro · ron@gmail.com · sub · $360
  • Codex Business · ron@northwind.com · sub · $180
  • Cursor Pro · ron@gmail.com · sub · $60
CLOUD & CREDITS
  • AWS Bedrock · work · us-east-1 · cloud · $280
  • Vertex · work · gcp-proj · cloud · $120
  • Credits · Anthropic · promo · −$40
  • Credits · AWS · activate · −$200
$0 this month · all of it reconciled ✓
every agent & terminal you run - captured
Claude Code Codex Cursor Pi Gemini Ghostty iTerm cmux openclaw Hermes + more
Why we built this

We don't want to spend less on AI. We want to scale up without flying blind.

We found ourselves using 3x as much AI as the month before, across Codex, Claude Code, Pi, Bedrock, Vertex, subscriptions and APIs, on multiple Macs. The bill stopped being background noise. So we keep checking the same things, several times a day:

Where are we losing efficiency to huge context, bad caching, noisy MCP tools, retries, or sessions we reopen after days idle?

Advisor finds it

How much more are we paying now that agents run loops and spawn other agents, instead of us driving every step?

agent vs human

Which agents are expensive but not productive?

cost by tool

How much are we paying in APIs right now, and would a fourth $200 subscription be smarter?

subs vs PAYG

What is our total AI coding cost?

one number
One place, reconciled

Not per device. Not per account. One reconciled truth layer.

Ottto pulls usage from every agent, every account, on every machine - plus the cloud and your credits - then reconciles it into one trusted view. By tool, model, account, session, repo, and billing path. Subscriptions, API usage and credits, side by side.

  • All your accounts. A personal and a business Codex, two Claude plans + API, all deduped into one number.
  • All your agents and machines. Claude Code, Codex, Cursor, Pi - and the cloud - together, not one at a time.
  • Subscriptions, API and credits. Monthly plans, metered usage, and promo credits in the same picture.
  • Every number labeled. Exact where we know it, estimated where we don't.
  • Local sessions · your Macs
  • Live telemetry · accounts
  • Connectors · plans · API
  • Cloud bills · AWS/GCP
Reconciled · dedupe · price · label
  • Claude · 2 plans + API · $1,340 · exact
  • Codex · Pro + Business · $540 · exact
  • AWS Bedrock · work · $280 · est.
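
For the curious, the dedupe, price, and label steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Ottto's actual implementation: the `UsageRecord` fields, the source names, and the "prefer exact over estimated" rule are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UsageRecord:
    # All field names are hypothetical, chosen for illustration only.
    source: str        # e.g. "local-session", "connector", "cloud-bill"
    provider: str      # e.g. "Claude", "Codex", "Bedrock"
    account: str
    session_id: str
    cost_usd: float
    exact: bool        # True when the cost comes from a billed figure


def reconcile(records):
    """Dedupe records seen by several sources, then total and label per provider."""
    best = {}
    for r in records:
        key = (r.provider, r.account, r.session_id)
        kept = best.get(key)
        # Assumed rule: keep the first record seen, but let an exact
        # figure replace an estimated one for the same session.
        if kept is None or (r.exact and not kept.exact):
            best[key] = r

    totals = {}
    for r in best.values():
        entry = totals.setdefault(r.provider, {"cost_usd": 0.0, "label": "exact"})
        entry["cost_usd"] += r.cost_usd
        # One estimated record is enough to label the whole total "est."
        if not r.exact:
            entry["label"] = "est."
    return totals


records = [
    UsageRecord("local-session", "Claude", "ron@gmail.com", "s1", 12.0, False),
    UsageRecord("connector", "Claude", "ron@gmail.com", "s1", 12.5, True),
    UsageRecord("cloud-bill", "Bedrock", "work", "s2", 3.0, False),
]
totals = reconcile(records)
```

Here the same Claude session shows up twice, once estimated locally and once exact from a connector, and only the exact figure is counted.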
Find the leak

Most of your spend is productive. Some of it buys nothing.

You're not overpaying for speed; that's the point. But some spend buys nothing back: re-paid context from broken caching, a plan on the wrong tier, a 1M window where 200K would do, idle agents still looping. Ottto separates the spend that makes you faster from the spend that doesn't, and shows you exactly how much.

$620/mo found across your stack
  • Broken caching in one repo · $300/mo
  • Plan on the wrong tier · $140/mo
  • 1M context where 200K would do · $110/mo
  • Idle Codex Business plan · $70/mo
Advisor · every day

A short queue of fixes, not another dashboard.

Advisor watches quota windows, model/cache behavior, long-context sessions, agent-spawned work, and plan fit every day. It shows the dollars, the evidence, and the caveats before anything changes.

Advisor recommendation: Quota window reset changed the best place to run work

Codex API spend is rising while a Claude Max window has room.

Impact: +$140/mo · Confidence: exact
Route next run to the subscription window. Reset detected · room available · nothing changes without review
Approved action recorded · $140/mo tracked against real usage

Your daily digest · 5 things changed · 2 anomalies · today
+$620/mo if you approve all
  • Quota window reset before the scheduled time. Detector: route overflow away from paid API usage before the next reset. +$140/mo
  • Model cost and cache behavior regressed. Detector: reprice the same traffic with the cache hit-rate drop called out. +$160/mo
  • Long-context session cost jumped. Detector: spot sessions where 1M context is being used where a smaller window would do. +$110/mo
  • Agent-spawned sessions multiplied spend. Detector: separate human-driven work from agents spawning other agents. +$70/mo
  • Plan fit recommendation is ready. Autopilot: demote, promote, or switch only when the usage evidence supports it. +$140/mo
  • Analyze · every day
  • Flag · anomalies
  • Recommend · with the $
  • You approve · one click
  • Track · real savings

Not a scary autopilot that moves money on its own. It finds, explains, prices, asks, executes only approved actions, and tracks rollback state.

Subscriptions & limits

Know before you hit the wall.

Every plan and account you pay for, how much you actually use, and when the next limit resets - so you're never surprised. Which plan fits, what you'd save on pay-as-you-go, and when to cancel before a renewal you won't use.

  • Quota resets. "Claude Max - 78% of your 5-hour window. Resets 3:40pm."
  • Renewal warnings. "Codex Business renews Tuesday - idle 3 weeks. Cancel?"
  • Savings vs pay-as-you-go. The cross-account math vendor dashboards never show.
ottto.net/subscriptions
  • Claude Max · overpaying · $200/mo · 41% used
  • Cursor Pro · good fit · renews in 4d · 88% used
  • Codex Business · renews Tue · idle · $25/mo · 6% used
  • Codex Pro · resets 3:40pm · 5-hr window · 78% used
Switch Claude Max → Pro + API, cancel the idle Codex Business plan. Same usage, ~$160/mo less.
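
The subs vs pay-as-you-go comparison boils down to simple arithmetic. A minimal sketch, assuming you already have an estimate of what the same usage would cost at metered API rates; the function name, the 10% tolerance, and the dollar figures in the example are hypothetical, not Ottto's actual rules or real prices.

```python
def plan_fit(plan_price_usd, payg_equivalent_usd, tolerance=0.10):
    """Classify a flat-rate plan against the metered cost of the same usage.

    payg_equivalent_usd is an assumed input: what this month's usage
    would have cost on pay-as-you-go pricing.
    """
    if payg_equivalent_usd < plan_price_usd * (1 - tolerance):
        return "overpaying"   # metered usage would have been cheaper
    if payg_equivalent_usd > plan_price_usd * (1 + tolerance):
        return "good fit"     # the plan is cheaper than metered usage
    return "break-even"


def monthly_savings_if_switched(plan_price_usd, payg_equivalent_usd):
    """Positive when dropping the plan for pay-as-you-go would save money."""
    return plan_price_usd - payg_equivalent_usd


# Illustrative numbers only.
light_use = plan_fit(200, 80)
heavy_use = plan_fit(200, 450)
saved = monthly_savings_if_switched(200, 80)
```

The tolerance band keeps a plan from flipping between labels when usage hovers near the break-even point.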
Provider Impact

The moment a provider changes, you know what it costs you.

Anthropic ships a new default. OpenAI changes pricing. A model gets deprecated, a quota window shifts. Ottto watches provider changes and reconciles each one against your actual usage so a price hike or default swap shows up as "here's what it does to your bill."

  • Real-time alerts. Pricing, defaults, deprecations, quota and outage changes.
  • Tied to your spend. Not "something changed" - "this changed your bill by $X."
LIVE · Anthropic changed Claude Code's default model · detected 2 hours ago
Your sessions are 18% pricier since the switch.
The new default uses more expensive output tokens for the same work. We caught it the moment your usage shifted.
Affects 2 repos · $120 extra this week · since Tue 9:02am
Pin the previous model · See impact
Powered by our open feed - AI Provider Watch ↗
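
One way to picture "this changed your bill by $X" is to reprice the same token usage under the old and new rates. The sketch below assumes per-million-token pricing for input and output; the function names and every price in the example are hypothetical placeholders, not real provider rates or Ottto's method.

```python
def usage_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Cost in USD for a token count at per-million-token prices."""
    return (
        (input_tokens / 1_000_000) * input_price_per_m
        + (output_tokens / 1_000_000) * output_price_per_m
    )


def reprice_delta(input_tokens, output_tokens, old_prices, new_prices):
    """Return (dollar change, percent change) for identical usage.

    old_prices and new_prices are (input_price_per_m, output_price_per_m).
    """
    old = usage_cost(input_tokens, output_tokens, *old_prices)
    new = usage_cost(input_tokens, output_tokens, *new_prices)
    delta = new - old
    percent = (delta / old * 100) if old else 0.0
    return delta, percent


# Hypothetical week: 10M input and 2M output tokens, with only the
# output price changing from $15 to $20 per million tokens.
delta, percent = reprice_delta(
    10_000_000, 2_000_000,
    old_prices=(3.0, 15.0),
    new_prices=(3.0, 20.0),
)
```

Holding usage fixed isolates the provider's change from any change in how much you worked that week.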
Under the hood

Out of your way. Open by default.

  • No proxy · zero added latency
  • Open source · local-first daemon
  • ChatGPT + MCP · ask where you already work
See how it works →
Why ottto

The one place that sees all of it.

Free meters watch one machine. Gateways see only proxied API traffic and sit in your path. Vendor dashboards show one tool, one account. Ottto ties every device, agent and account together.

                                 | Free CLIs & menu bars | Gateways     | Vendor dashboards | ottto
All your devices                 | one machine           | proxied only | n/a               | yes
All accounts & agents            | one or a few          | API only     | one tool          | yes
Subs + API + credits             | quota only            | usage only   | its own           | all
Reconciles every source          | no                    | no           | no                | yes
Real-time provider-change alerts | no                    | no           | no                | yes
Recommends & helps act           | no                    | limits       | no                | yes
In your request path             | never                 | yes          | n/a               | never
Questions

The things you're wondering.

Q: Does Ottto slow down my coding tools?

No. Ottto is out of your request path - it reads telemetry your tools already emit. Zero added latency, and nothing breaks if Ottto is down.

Q: Can you see my code or prompts?

Ottto is built around usage, cost, and limit signals - not your source code. You choose what each source shares, and local data stays labeled and under your control.

Q: Multiple accounts and plans?

Yes - that's the point. A Pro and a Business Codex, two Claude plans, an API key and some credits all reconcile into one deduped picture across every Mac.

Q: Do I need an API key?

No. Ottto works with subscriptions too - Pro, Max, and the rest. Subscriptions, API and credits, all first-class.

Q: Is my subscription safe?

Yes. Ottto observes; it never proxies your requests or holds your login, so it isn't affected by the gateway restrictions vendors introduced in 2026.

Q: What is free and open source?

You can start free, no credit card. The runtime and provider feed are open source: ottto and ai-provider-watch.

Why I'm building Ottto

"Not because I want to slow down. Because I want to scale up without flying blind."

The bill stopped being background noise, so I'm building the tool I kept wishing I had: open-source building blocks for the basics, and one product for the actionable bottom line. Usage and cost intelligence for AI coding agents, so you can push harder, not slower.

Ron · building Ottto

See your real numbers in minutes.

No sales call. No proxy. No slowdown. Install the app, connect your accounts, and watch every agent, machine and credit reconcile into one picture.

Free to start · no credit card · Apple Silicon · macOS Sonoma+