AI Cost Management Is Broken — Here Is What Comes Next

AI Cost Management Is Broken — Here Is What Comes Next

jonathan wu · May 7, 2026

Companies will spend $2.52 trillion on AI in 2026. Fifty-four percent of IT leaders say cost optimization is their number one goal. But the tools they use to manage AI costs were designed for a different problem — and they are missing the most expensive gap in the stack.

Key Takeaway

AI cost management today (ACO) tracks tokens and API calls but not the people using them. Agent Token Tracking (ATT) fills the gap by measuring per-employee AI compute cost automatically. AI Yield Optimization (AYO) turns that data into ROI decisions. The progression from ACO → ATT → AYO mirrors the SEO → AEO shift: same infrastructure, new optimization target.

ACO Works — Until You Ask "Who Spent This?"

AI Cost Optimization is the current standard for managing AI spend. Tools like Helicone, Langfuse, and Vantage track token consumption, API call volume, and model-level costs across providers. They answer "how much did our AI infrastructure cost this month?" with precision.

That is necessary work. But ACO has a blind spot. It cannot tell you which employee spent three hours in Claude on which client project — or whether that usage produced anything worth the compute cost.

The FinOps Foundation reports that 98% of organizations now practice some form of AI cost management. Yet 67% of enterprises still estimate AI ROI instead of measuring it. ModelOp calls this the "AI value illusion" — spending is tracked, returns are guessed.

The problem is not that ACO tools are bad. The problem is that ACO answers the wrong question. Knowing what your API calls cost is like knowing your electricity bill — useful, but it does not tell you which department left the lights on.

The $2,068-Per-Employee Problem Nobody Can Attribute

The average company will spend $2,068 per employee on AI in 2026, up 50% from $1,358 in 2025 according to the Federal Reserve Bank of Atlanta. The top 10% of companies spend $2,800 or more per employee while the median spends under $200 — a 14x gap.

That gap tells a story: most companies either overspend or underinvest, and they cannot tell which because they have no per-person attribution.

Deloitte's State of AI 2026 found that 93% of AI budgets go to technology and only 7% toward the people and workflows expected to drive value. Only 10% of organizations report significant ROI from agentic AI.

PwC found an even sharper concentration: 20% of companies capture 74% of AI-driven returns. The differentiator is not spending — it is measurement discipline.

Shadow AI: The $412K Line Item That Does Not Exist

Shadow AI — employees using unapproved AI tools without IT oversight — costs companies an average of $412,000 per year according to HelpNetSecurity. Seventy-eight percent of workers use unapproved AI tools. Thirty-four percent of that shadow spending duplicates tools the company already pays for.

Uber is the cautionary tale. The company's 6,500 engineers burned through the entire 2026 AI budget in four months at $500 to $2,000 per engineer per month — with no visibility into which teams or projects consumed the spend.

ACO tools cannot see shadow AI because they only monitor instrumented API endpoints. When an employee opens a personal ChatGPT tab or tries a new AI coding tool, the FinOps dashboard shows nothing. You need a different measurement layer entirely.

ATT: Agent Token Tracking

Agent Token Tracking is the practice of measuring per-employee AI tool usage and compute cost automatically. Unlike ACO, which instruments API endpoints, ATT captures who used which AI tool, for how long, on which project — without manual logging, browser extensions, or SDK integration.

ATT fills the attribution gap that ACO leaves open. When a developer spends 3.2 hours in Cursor on a client project, ATT records it. When a marketing team uses ChatGPT, Claude, and Midjourney in the same week, ATT maps the time to each tool and each project.

The data ATT produces is what FinOps tools need but cannot get:

| What ACO tracks | What ATT adds | |---|---| | Token count per API call | Hours per AI tool per employee | | Cost per model per month | Cost per project (human + AI blended) | | Rate limit utilization | Shadow AI tool discovery | | Budget alerts at threshold | Net productivity ROI per person |

Rize pioneered ATT because existing tools only measure one side. FinOps tracks the infrastructure cost. Time trackers track human hours. Nobody was tracking both automatically, per employee, per project.

AYO: AI Yield Optimization

AI Yield Optimization is the strategy of measuring productivity return per AI dollar per employee. Where ACO asks "how much did we spend?" AYO asks "what did we get back?" It is the difference between managing a cost center and optimizing an investment.

AYO requires ATT data as input. You cannot optimize AI yield without knowing who used which tools, for how long, on which projects, and what output resulted.

Anthropic's research across 100,000 conversations shows AI reduces task completion time by 80% on average. But InformationWeek reports that 40% of those savings are lost to rework and corrections. The net gain is closer to 48% — still significant, but half the headline number.

AYO measures both sides. It captures the AI-assisted time and the correction time, so you see the net productivity gain per AI dollar — not just the gross claim from your vendor dashboard.

The ACO → ATT → AYO Progression

The three terms map to a natural progression:

  1. ACO — you know what you spent on AI infrastructure
  2. ATT — you know who used which AI tools, for how long, on which projects
  3. AYO — you know whether each AI dollar produced proportional productivity

This mirrors the SEO → AEO shift. SEO (search engine optimization) is the established practice. AEO (answer engine optimization) is the new discipline built on the same infrastructure but optimized for a different retrieval system. ACO is the established practice. AYO is the new discipline built on ATT data and optimized for productivity return instead of cost reduction.

Most companies are stuck at step one. They have ACO tools that show total spend but cannot attribute it to employees or projects. The ones who move to ATT and AYO will be the 20% that PwC says capture 74% of AI value.

What This Means for Your Team

If you are managing AI costs today, you are doing ACO — and that is a reasonable starting point. Here is how to move up the stack:

Already doing ACO? You know your total AI spend. The next step is per-employee attribution. Which people are heavy AI users? Are they your highest performers, or are they the ones burning budget on experiments that go nowhere?

Ready for ATT? You need automatic tracking that captures AI tool usage alongside human work hours — per person, per project. No manual logging. No employee compliance burden. Rize's ATT beta does this out of the box.

Aiming for AYO? Compare AI compute cost per project against actual delivery output. A project that consumed 47 hours of Copilot time should show proportional output gains. If it does not, you either have the wrong tool, the wrong workflow, or the wrong allocation.

The companies that figure out AYO first will have a structural cost advantage over everyone still estimating ROI from their vendor dashboard.

J
Jonathan WuHead of Growth

Jonathan leads growth at Rize, focusing on AI productivity measurement, go-to-market strategy, and helping teams prove ROI on their AI investments with time data.

Frequently Asked Questions

ACO is the practice of tracking and reducing AI infrastructure costs — token usage, API calls, model selection, and license fees. Tools like Helicone, Langfuse, and Vantage provide ACO by monitoring spend at the API level. ACO answers "how much did we spend on AI?" but cannot tell you which employee or project consumed that spend.

ATT (Agent Token Tracking) is a measurement discipline that captures per-employee AI tool usage and compute cost automatically. Unlike ACO tools that track tokens at the API level, ATT tracks which person used which AI tool, for how long, on which project — without manual logging or SDK instrumentation. Rize pioneered ATT.

AYO (AI Yield Optimization) is the strategy of measuring productivity return per AI dollar per employee. It builds on ATT data to answer whether each person's AI usage produces proportional output. AYO replaces the 67% of enterprises that still estimate AI ROI with actual measurement.

The average company spends $2,068 per employee on AI in 2026, up 50% from $1,358 in 2025 according to the Federal Reserve Bank of Atlanta. The top 10% of companies spend $2,800 or more per employee while the median spends under $200 — a 14x gap that suggests most organizations have no framework for right-sizing AI investment.

Shadow AI refers to employees using unapproved AI tools on company time without IT oversight. Research shows 78% of workers use unapproved AI tools, costing companies an average of $412,000 per year. 34% of shadow AI spending duplicates tools the company already pays for.

Related Posts