LIVE — Updated every 30 min

The SaaS & AI
News Wire

Breaking launches, pricing shakeups, funding rounds & shutdowns.
Tracked automatically. Analyzed by our AI editorial team.

999 Stories

21 Product Launch

16 Major Update

7 Pricing Change

3 Funding Round

2 Shutdown

Saturday, April 11, 2026

Pricing Change 09:22 PM

LLM API Prices Collapse by 90% in 2 Years, Reshaping AI Development

A new report from Fungies.io reveals a dramatic 90% price reduction in flagship LLM APIs since 2024, driven by aggressive competition from DeepSeek, OpenAI, Google, and Anthropic, fundamentally altering the economics of AI integration for businesses

SaaS buyers must now prioritize detailed LLM API cost-benefit analysis when evaluating AI-driven solutions. Focus on vendors transparent about their underlying LLM choices and ensure their model selection aligns with your specific workload's performance and cost requirements, rather than assuming all AI capabilities are priced equally. A misstep here can significantly inflate operational expenses.

Read full analysis

The cost of integrating advanced Large Language Models (LLMs) into applications has plummeted by an unprecedented margin, according to a new analysis by Fungies.io. Just two years ago, in early 2024, a leading LLM API typically cost $10 per million input tokens. By April 2026, the market has transformed so drastically that superior performance is available for a quarter of that price, with perfectly adequate models now costing as little as a hundredth of the 2024 benchmark.

This seismic shift was ignited by DeepSeek, which aggressively "blew up the pricing floor" with its highly competitive offerings. This move triggered a rapid response across the industry. OpenAI countered with "aggressive cuts across the GPT-5 family," while Google intensified its strategy by "dangling free tiers that actually work." Anthropic, known for its Claude series, significantly reduced its premium Opus model pricing by a substantial 67% and expanded its context window to an impressive 1 million tokens, enhancing its value proposition. The net effect is a market where the cost for comparable quality output can vary by a factor of 100x depending on model selection.

The ramifications of this pricing revolution are widespread, impacting virtually every segment of the AI ecosystem. Developers can now build more sophisticated AI applications with significantly lower operational costs, making experimental AI features economically viable. For businesses, from startups to large enterprises, the ability to process massive datasets, such as document processing pipelines handling millions of pages monthly, becomes economically feasible. However, this new landscape also introduces a critical decision point for AI product managers and strategists.

“Choosing the wrong model for your workload can cost you 100x more than necessary for the same quality output. Getting this wrong by even one tier can mean the difference between a profitable feature and one that bleeds cash every month.”
— Fungies.io Analysts

The pricing landscape in April 2026 is characterized by extreme variability. The core pricing structure for LLM APIs typically involves separate rates for input and output tokens. Here’s a snapshot of key offerings per 1 million tokens:

Model	Input Price	Output Price	Key Distinction
Gemini 2.5 Flash-Lite	$0.10	$0.40	Cheapest actively supported model
DeepSeek V3.2	$0.28	$0.42	Best value, 90% cache discounts
GPT-5.4	$2.50	$10.00	Best overall balance of capability and cost
Claude Opus 4.6	$5.00	$25.00	Premium accuracy, 1M context window

This thousand-fold spread between the cheapest and premium models means that a single request costing $0.0001 on Gemini Flash could cost upwards of $0.10 on a higher-tier model. The market now demands meticulous evaluation, balancing cost, performance, and specific use-case requirements to optimize profitability and innovation. As competition intensifies, we can expect further specialization and a continued focus on value, pushing the boundaries of what's possible with AI at scale.

Why this matters to you: As a SaaS buyer, understanding this dynamic pricing is crucial for selecting the right AI-powered tools and avoiding unnecessary costs, directly impacting your operational budget and competitive edge.

Announcement Pillar	Key Feature/Status	Enterprise Impact
Claude Managed Agents	Public Beta, Composable APIs	Accelerated AI agent deployment, reduced infrastructure overhead
Claude Cowork	General Availability (GA)	Enhanced enterprise readiness (RBAC, OpenTelemetry, Zoom MCP)
Claude Code	Significant Update	Improved governance, Bedrock integration, cost insights, performance

Feature	Gemini CLI	Claude Code	Codex
Model	Gemini 3.1 Pro	Claude Opus 4.6	GPT-5.4
Context	1M tokens	1M tokens	200K tokens
Free Tier	Yes (Flash)	No	No
Pricing	$20/mo (Pro)	$20/mo (Max)	$20/mo (Plus)

Feature	Claude for Word	Microsoft Copilot (Context)
Integration Method	Native Sidebar Add-in	Deep OS/App Integration
Edit Review	Tracked Changes ("Redlining")	Inline Suggestions / Drafts
Cross-App Context	Word, Excel, PowerPoint	Broader Microsoft 365 Suite

Plan	Monthly Price	Codex Usage (vs. Plus)	Target User
ChatGPT Plus	$20	1x	Standard users
ChatGPT Pro $100	$100	5x (10x promo)	Developers with growing needs
ChatGPT Pro $200	$200	20x	Heavy lifting, demanding workflows

Region	Agentic Booking Availability
United States	Existing
Australia	New
Canada	New
Hong Kong	New
India	New
New Zealand	New
Singapore	New
South Africa	New
United Kingdom	New

Feature	Traditional Method	Zendrop MCP Server
Task Management	Multiple dashboards, manual clicks	Natural language commands via AI
Data Access	Disparate APIs, screen scraping	Direct, permissioned live data access
Efficiency	Time-consuming, prone to errors	Instant, automated, conversational

Platform	Overall Score (out of 10)
C3 Code	9.2
Palantir	7.7
OpenAI Codex	6.0
Anthropic Claude Code	5.2

Cloud PC Tier	Old Monthly Price	New Monthly Price (20% Off)
Basic (2vCPU, 4GB RAM)	$31	$24.80
Standard (2vCPU, 8GB RAM)	$41	$32.80
Premium (4vCPU, 16GB RAM)	$66	$52.80

Shutdown Phase	Effective Date	User Impact
New Sign-ups Cease	Immediately	No new users can join
Data Export Recommended	Immediately	Users must manually export plans
Read-Only Access Begins	July 2014	No further plan changes possible
Reduced-Redundancy Mode	Immediately	Potential for unplanned outages

Metric	Instant 1.0 (Inactive App)	Traditional VM/Serverless (Inactive App)
Compute Cost	Zero	Variable (often non-zero)
Memory Cost	Zero	Variable (often non-zero)
Provisioning Model	Row-based multi-tenant	Individual VM/Function

Feature	Detail
Galileo Evaluation Metrics	20+ (e.g., Hallucination Detection, Context Adherence)
Supported AI Platforms	OpenAI, Anthropic, Azure OpenAI, AWS Bedrock
Acquisition Target Close	Q4 FY2026 (approx. July 2026)

Feature	Benefit for Developers	Impact on Workflow
Multigres Operator (Open Source)	Advanced Postgres management	Greater control, high availability
GitHub Integration (All Plans)	Automated migration deployment	Streamlined CI/CD, fewer errors
Stripe Projects Partnership	Simplified service provisioning	Faster project setup, less friction
Studio AI Integration	Instant troubleshooting/code help	Increased productivity, faster fixes

Subscription Tier	Estimated Monthly Cost	Primary Benefits
Basic/Consumer	Lower or Free	Standard access, limited usage
Mid-Tier (Codex)	$200 - $500	Enhanced API, priority access, advanced Codex features
Enterprise	Significantly Higher	Custom contracts, bespoke support, full integration

Editing Approach	Precision & Control	Efficiency
Traditional Prompting	Low (often inexact)	Moderate (requires re-prompting)
Firefly Precision Flow	High (slider-based refinement)	High (real-time visual iteration)

Feature	Open-Source Alternative	Claude Cowork (Commercial)
Operating Model	100% Local	Cloud-based
Software Cost	Free (Open-Source)	Subscription Fees Apply
Data Privacy	Full User Control	Relies on Vendor Policies
LLM Flexibility	Agnostic (Local/API)	Primarily Anthropic's Claude

Feature Status	Date	Key Benefit
Public Preview	Jan 2026	Initial testing & feedback
General Availability	Mar 30, 2026	Stable, production-ready feature

Aspect	Traditional Salesforce Dev Workflow	Web Console Workflow (Beta)
Tool Integration	Multiple external tools (IDE, Query Editor, Log Viewer)	Single, embedded browser-based IDE
Context Switching	Frequent switching between applications	Minimal, stay within Salesforce UI
Issue Resolution	Navigate, locate, fix, validate (multi-step)	Investigate, fix, validate (streamlined)

User Tier	Initial Access	Future Access
Google AI Plus, Pro, Ultra	Web (Immediate)	Mobile, more countries (Coming weeks)
Free Users	None	Web, Mobile, more countries (Coming weeks)

Subscription Tier	Price/Month	Included Usage (approx. per 5-hr window)
Claude Pro	$20	~44,000 tokens
Max 5x	$100	~88,000 tokens
Max 20x	$200	~220,000 tokens

Service/Cost	Per Minute Cost	Monthly Breakeven for $9/mo
Deepgram/PlayHT (STT/TTS)	$0.26 - $0.35	25 minutes
Standard Transcription	$0.024	375 minutes

Service/Model	Cost per Minute	Notes
OpenAI Whisper API	$0.006	25MB file limit, batch-oriented
Self-hosted Whisper	$0.50-$3.00/hr (GPU)	Infrastructure costs vary
Deepgram Nova-2	$0.0048	More affordable for high-volume English batch

Investment Focus	Amount (USD)	Details
Total Funding Round	$122 Billion	Post-money valuation: $852 Billion
Amazon Investment	$50 Billion	$35B contingent on IPO/AGI; involves AWS integration
Nvidia & SoftBank	$30 Billion Each	Key strategic investments
Individual Investors	>$3 Billion	First time direct investment via bank channels

Deepgram API Cost	Details
Standard Nova-3	~$0.46/hour ($0.0077/min)
Growth Plan	~$0.39/hour (with $4,000/year commit)
Voice Agent API	$0.075/min (WebSocket connection)
Free Credits	$200 (over 700 hours transcription)

Feature	@ai-sdk/deepgram	OpenAI Whisper	AssemblyAI
Real-time Latency	Sub-300ms	N/A (Batch only)	300ms
Pricing (STT)	$0.0077/min	$0.006/min	$0.15/hour (streaming)
Billing	Per-second	Per-request/minute	Usage-based
Customization	Keyterm prompting	Limited/None	LeMUR LLM framework

LLM API Prices Collapse by 90% in 2 Years, Reshaping AI Development

Anthropic Unveils Managed Agents, Claude Cowork GA in Strategic Triple Play

AI Coding Assistant Showdown: Gemini, Claude, Codex Benchmarked

Anthropic's Claude Embeds in Word, Challenges Microsoft Copilot

OpenAI Introduces $100/Month Pro Plan for Codex Users, Expanding Options

Google AI Mode Gets UI Refresh, Agentic Booking Goes Global

Zendrop Unveils AI-Powered Dropshipping Control with New MCP Server

Cohere and Aleph Alpha in Merger Talks: A New AI Powerhouse Emerges?

C3 AI Unveils C3 Code to Accelerate Enterprise AI Application Development

Microsoft Slashes Windows 365 Cloud PC Prices by 20 Percent

Gigantt Project Management Shuts Down, Citing Economic Non-Viability

Instant 1.0 Launches: Open-Source Backend Reimagines AI-Coded App Hosting

Cisco Acquires Galileo, Bolstering Splunk's AI Observability for Trust

Supabase Boosts Enterprise Features, Developer Flow in April Update

OpenAI Launches Mid-Tier Codex Subscription for Power Users

Adobe Firefly Unveils Precision Flow: From 'Almost There' to 'Exactly Right'

Open-Source AI Assistant Challenges Claude Cowork with Local-First Approach

Grafana Labs Unveils Smarter Visualization Suggestions for Faster Insights

Salesforce Unveils Web Console Beta: In-Platform IDE for Faster Development

Gemini App Introduces 'Notebooks' for Enhanced Chat & File Organization

Microsoft Launches Agent Framework 1.0: Unifying AI Development for Production

Claude Code Pricing: API vs. Subscriptions & The April 4 Shift

TestRail Unveils AI Test Script Generation to Streamline QA Automation

The $9/Month SaaS Model Is Dead: AI Costs Reshape the Market by 2026

OpenAI's GPT-5 Era: New Models & Unrivaled Transcription Emerge

OpenAI Secures $122 Billion Funding, Reaches $852 Billion Valuation

VS Code 1.115 Elevates Developer Workflow with AI, Terminal Enhancements

Vercel AI SDK Deepgram Module Updates, Enhancing Real-time Voice AI

Softr Unveils AI No-Code Platform for Production-Ready Business Tools

Gemma 4's 31B Model Challenges AI Giants, Reshaping SaaS Tool Choices

Airflow 3.2.0 Arrives, Boosting Data Orchestration for Enterprise Teams

Anthropic Bolsters Healthcare AI with US$400M Coefficient Bio Acquisition

Yuma AI Unveils Ask Yuma: Conversational AI Simplifies E-commerce Support Automation

Drift's Future Uncertain? Top Alternatives Emerge for 2026

Anthropic Simplifies AI Agent Development with Claude Managed Agents

Teamwork.com Unveils AI Teammates Scout & Flo in March 2026 Update

Adobe Launches Acrobat Spaces AI Tool: What Does It Do?

AI Startups Secure $221 Billion in Q1 2026, Reshaping SaaS Landscape

Block's Goose: An Open-Source AI Agent Redefining Engineering Automation

Atlassian Boosts Confluence with Visual AI and Third-Party Agents

Arcee Unveils Trinity Large Thinking: A New Open-Source AI Contender

Cursor 3 Reimagines Coding with Agent-First AI Workspace

Lucid Software Bridges Visuals and AI with New Claude Connector

GitLab Duo CLI Brings Agentic AI to the Terminal for Full DevSecOps

Anthropic Unveils Mythos AI, Partners with Apple on Cybersecurity

Swoogo Integrates AI Tools with New Native MCP Server

Google Unveils Free Offline AI Dictation App for iPhone

Google Adjusts Gemini Pricing for Diverse AI Workloads

Slack Unveils Major AI Overhaul, Transforms Slackbot into Desktop Agent