Built for engineers who ship AI.
tokentrace gives you full visibility into every LLM call your app makes — spend, latency, prompts, and traces — without changing how you write code.
The problem we solve
Most teams flying blind with LLMs. They don't know which prompts cost the most, which model versions introduced a latency regression, or what the exact input/output was when a user reported a bad response.
tokentrace captures every LLM call — input, output, model, tokens, latency, cost — and makes it searchable in a dashboard your team can share.
Who uses tokentrace
Supported providers
OpenAI, Anthropic, and Google Gemini — with more providers on the roadmap. One SDK wraps all of them. Switch models; your traces follow automatically.
Open source core
The tokentrace SDK is MIT-licensed and lives on GitHub. You can read exactly what data is collected, fork it, extend it, or self-host the entire platform. We believe observability tooling should be auditable.