iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
Amazfit Cheetah 2 Ultra: The Most Expensive Smartwatch Yet—Is It Worth the Price? New Automated Jailbreak Attack UNIATTACK Achieves High Success Rate Against Multi-Layered LLM Defenses UXBench: Measuring the Actionability of LLM-Generated UX Critiques LaWAM: Latent World Action Model Enables Efficient, Dynamics-Aware Robot Control with Low Latency Sub-Quadratic Vision Transformers Cut Self-Attention Cost for Faster Image Captioning NordVPN's Private Server Add-On Gives Enterprises Isolated Hardware and Static IP for Secure Remote Access India Soyabean Acreage Seen Rising Up to 10% on High Prices, Weak Monsoon Outlook FlowMPC: New Framework Combines Flow Matching and World Models to Improve Robot Manipulation DYNA Framework Uses Temporal Knowledge Graphs to Reduce LLM Forgetting Without Retraining RAMS: Resource-Adaptive Model Switching for Embedded Edge Perception Under Load Amazfit Cheetah 2 Ultra: The Most Expensive Smartwatch Yet—Is It Worth the Price? New Automated Jailbreak Attack UNIATTACK Achieves High Success Rate Against Multi-Layered LLM Defenses UXBench: Measuring the Actionability of LLM-Generated UX Critiques LaWAM: Latent World Action Model Enables Efficient, Dynamics-Aware Robot Control with Low Latency Sub-Quadratic Vision Transformers Cut Self-Attention Cost for Faster Image Captioning NordVPN's Private Server Add-On Gives Enterprises Isolated Hardware and Static IP for Secure Remote Access India Soyabean Acreage Seen Rising Up to 10% on High Prices, Weak Monsoon Outlook FlowMPC: New Framework Combines Flow Matching and World Models to Improve Robot Manipulation DYNA Framework Uses Temporal Knowledge Graphs to Reduce LLM Forgetting Without Retraining RAMS: Resource-Adaptive Model Switching for Embedded Edge Perception Under Load
Home ›› Technology ›› Ai ›› Llms ›› ‘Pretty Crazy’ Token Usage Tests Enterprise AI Bets as Companies Balance Costs and Gains

‘Pretty Crazy’ Token Usage Tests Enterprise AI Bets as Companies Balance Costs and Gains

Enterprise adoption of generative AI is driving a new focus on 'tokenomics'—managing the surging cost of tokens. While companies like 8x8 report $5M in savings from Claude, others like Cisco and RBC see token usage skyrocketing. Baseball Lifestyle 101 used Claude to land a $1M order via supply chain insights.

iG
iGEN Editorial
June 16, 2026
‘Pretty Crazy’ Token Usage Tests Enterprise AI Bets as Companies Balance Costs and Gains

As enterprises pour hundreds of millions into generative AI, a new cost metric—token usage—is reshaping budgets and strategy. Tokens represent the amount of content an AI model analyzes and generates, and according to WIRED, companies are increasingly grappling with the 'pretty crazy' consumption levels. Roughly 300 companies addressed token concerns on earnings calls in April and May 2026, up from just 93 a year ago.

The Rise of Tokenomics

Token usage is soaring across industries. Royal Bank of Canada’s CEO disclosed that token usage surged 500 percent over six months. At Cisco, CEO Chuck Robbins said on an earnings call that a third of employees use an internal AI chatbot daily, so 'the token usage is getting pretty, pretty crazy.' Amplitude CEO Spenser Skates noted that top engineers are 'spending thousands of dollars a month or more on tokens.' Box CEO Aaron Levine called token budgeting 'one of the most important' and 'heated' topics. WIRED’s review of transcripts from data provider AlphaStreet shows token mentions in earnings calls more than tripled year over year.

Savings and Supply Chain Wins

Despite cost concerns, some companies report net gains. 8x8, a software firm, uses Anthropic’s Claude for drafting emails, analyzing customer feedback, and writing code. Over 18 months, the company estimates saving $5 million annually by canceling dozens of software subscriptions partly replaced by Claude. Joel Neeb, 8x8’s chief transformation and business operations officer, says the annualized Claude bill is 'well below' that figure, which 'makes my chief financial officer happy.' He declined to share exact AI spending.

In a direct supply chain application, Long Island, New York-based clothing brand Baseball Lifestyle 101—expecting $250 million in sales this year—told 50 top managers to spend about 20 percent of their salary on AI tokens monthly. Cofounder Bill Rom says cost may exceed $100,000 a month by year-end, but Claude recently helped land a $1 million order by identifying that a retailer was running low on sizes of the popular ice-cream-patterned shorts. 'That’s a day and a half of work that can now happen' faster, WIRED reports.

Company Token Usage/Concern Business Impact
8x8 Annual Claude bill well below $5M saved $5M annual savings from cancelling subscriptions
Baseball Lifestyle 101 Monthly cost may exceed $100k $1M order from Claude-identified inventory shortage
Royal Bank of Canada Token usage up 500% in 6 months Cost concern
Cisco 1/3 employees use AI chatbot daily Usage 'pretty crazy'
Amplitude Engineers spending thousands/month on tokens High individual usage
Box Token budgeting a 'heated' topic Strategic focus

Managing the Token Budget

Executives at several companies are developing or buying systems to monitor token usage and choose the lowest-priced model for a given prompt. Others are still balancing hiring budgets with token allocations. Prices fluctuate and new, more expensive models are released monthly, creating planning challenges. Some companies, like 8x8, encourage more AI use despite costs; Neeb expects savings and costs to eventually even out as adoption spreads.

For enterprise technology decision-makers, tokenomics is now a core governance issue. While tools like Claude can drive supply chain wins—spotting inventory gaps that yield million-dollar orders—unchecked token consumption can erode ROI. The key is deploying AI where it delivers measurable outcomes, as demonstrated by 8x8’s savings and Baseball Lifestyle 101’s revenue capture.


Sources: WIRED – Top Stories

Keep Reading

Recommended Stories

ChatPlanner: LLM Framework Personalizes Public Transit Routing with Fine-Tuning and RAG Technology

ChatPlanner: LLM Framework Personalizes Public Transit Routing with Fine-Tuning and RAG

Researchers present ChatPlanner, a novel framework that leverages fine-tuned Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) to capture diverse user preferences for public transit routing. The system extracts routing parameters from natural language queries, integrates preferences into the routing algorithm, and generates feasible, personalized alternatives. Three experiments show that the combined fine-tuning and RAG approach achieves highest accuracy and uncovers valuable solutions overlooked by existing route planners.

June 16, 2026
Service-Induced Congestion Threatens LLM Serving Throughput, New Model Shows Technology

Service-Induced Congestion Threatens LLM Serving Throughput, New Model Shows

A new mathematical model from researchers at MIT and elsewhere shows that in large language model serving, persistent GPU memory consumption from key-value caches creates a 'service-induced congestion' effect. Under high concurrency, this can lead to instability and throughput losses as high as 50%. The paper identifies scheduling design principles to avoid these losses.

June 16, 2026
LLMs Struggle on Privacy-Constrained Industrial Tabular Data, Study Finds Technology

LLMs Struggle on Privacy-Constrained Industrial Tabular Data, Study Finds

A new study from arXiv compares large language models (LLMs) with classical machine learning on an industrial car retrofit prediction task, finding that while LLMs have niche uses, tree ensembles remain superior. The research highlights that on privacy-constrained tables, LLMs are more effective as complementary components than replacements.

June 16, 2026
Study Finds LLMs' Legal Reasoning Unfaithful: Scope Laundering and Formalization Flaws Identified Technology

Study Finds LLMs' Legal Reasoning Unfaithful: Scope Laundering and Formalization Flaws Identified

A study comparing LLM classification, LLM-based formal reasoning, and solver-based reasoning on ContractNLI finds that while formal reasoning improves accuracy, it does not guarantee faithfulness. Researchers identify three recurring failure modes: scope laundering, implicit constraint blindness, and program synthesis failures. The findings raise concerns about relying on LLM-based formal reasoning as a proxy for symbolic execution.

June 16, 2026