Free 30-min Web3 Consultation
Book Now
Smart Contract Security Audits
Learn More
Custom DeFi Protocol Development
Explore
Full-Stack Web3 dApp Development
View Services
Free 30-min Web3 Consultation
Book Now
Smart Contract Security Audits
Learn More
Custom DeFi Protocol Development
Explore
Full-Stack Web3 dApp Development
View Services
Free 30-min Web3 Consultation
Book Now
Smart Contract Security Audits
Learn More
Custom DeFi Protocol Development
Explore
Full-Stack Web3 dApp Development
View Services
Free 30-min Web3 Consultation
Book Now
Smart Contract Security Audits
Learn More
Custom DeFi Protocol Development
Explore
Full-Stack Web3 dApp Development
View Services
the-ethereum-roadmap-merge-surge-verge
Blog

Why Stateful Execution Is Hard to Optimize

Ethereum's performance ceiling isn't about consensus or data availability—it's the stateful execution model. This deep dive explains the fundamental constraints of the EVM, why parallelization is a band-aid, and how the Verge's stateless future is the only real fix.

introduction
THE STATE

The Bottleneck Isn't Where You Think

The primary constraint for scaling blockchains is not transaction throughput, but the exponential growth of state data that nodes must store and process.

State is the bottleneck. Execution speed is a solved problem; rollups like Arbitrum and Optimism already process thousands of transactions per second. The real cost is storing the resulting state changes, which grows linearly with usage and must be accessible for all future execution.

Statelessness is the goal. Protocols like Ethereum's Verkle Trees and Solana's State Compression aim to separate execution from storage. A validator only needs a cryptographic proof of state, not the entire database, radically reducing hardware requirements.

The evidence is in the data. A full Ethereum archive node requires over 12TB. Without state expiry or stateless clients, this growth creates centralization pressure, as only well-funded entities can afford to run infrastructure.

deep-dive
THE BOTTLENECK

Deconstructing the State Monster

Stateful execution, the core of smart contract platforms, inherently creates performance and decentralization trade-offs that limit scalability.

State is the bottleneck. Every transaction modifies a global database, requiring consensus and storage. This creates a fundamental throughput ceiling, as seen in Ethereum's 15-30 TPS limit.

Parallelization hits a wall. Optimistic (Solana) or deterministic (Sui, Aptos) parallel execution requires knowing which state a transaction will touch. Complex, composable interactions make this prediction impossible, forcing sequential fallback.

Storage costs compound. The state bloat problem forces nodes to prune history or increase hardware requirements, centralizing infrastructure. Solutions like EIP-4444 and stateless clients are multi-year endeavors.

Evidence: Arbitrum Nitro's 2M TPS benchmark is for compute, not state updates. Real-world throughput collapses under heavy state contention, proving the monster remains.

WHY STATEFUL EXECUTION IS HARD

Optimization Attempts vs. Fundamental Limits

A comparison of architectural approaches to scaling stateful execution, highlighting the trade-offs between performance, decentralization, and developer experience.

Core Constraint / MetricMonolithic L1 (e.g., Ethereum Mainnet)Modular Execution Layer (e.g., Arbitrum, Optimism)Parallel Execution Engine (e.g., Sui, Aptos)Stateless / ZK-Coprocessor (e.g., RISC Zero, Axiom)

Execution Throughput (TPS)

~15-30

~4,000-40,000

~100,000+ (theoretical)

Off-chain compute, ~1,000+ proofs/sec

State Growth per Block

~1 MB

~10-100 MB (compressed)

~50-500 MB

0 MB (state is an input, not stored)

Worst-Case Block Time

12 seconds

~1-2 seconds (to L1)

< 1 second

Proof generation time (~2-10 sec)

Developer Complexity (State Mgmt)

Manual optimization required

Similar to L1, but cheaper

Requires explicit ownership declarations

Pure functions; no persistent state

Cross-Shard/Chain Messaging Latency

N/A (single chain)

~1-20 minutes (L1 finality delay)

Instant (within shard), minutes (cross-shard)

Synchronous via proof verification

Hardware Requirements (Node)

~2 TB SSD, 16 GB RAM

~1-3 TB SSD, 32+ GB RAM

~500 GB-1 TB SSD, 64+ GB RAM

Verifier: minimal; Prover: 128+ GB RAM, high-end CPU

Inherent Centralization Pressure

Low (global consensus)

Medium (sequencer optionality)

High (validator performance critical)

Very High (proving is computationally intensive)

future-outlook
THE STATE CRISIS

The Path Through the Verge: Statelessness is the Only Exit

The exponential growth of execution state is the fundamental bottleneck that makes scaling blockchains via execution sharding or parallelization economically unsustainable.

Stateful execution creates unbounded costs. Every transaction modifies the global state, forcing every node to store and process the entire history. This makes scaling via parallel execution shards like Near or Zilliqa a temporary fix, as each new shard multiplies the total state burden.

Statelessness inverts the scaling paradigm. Instead of nodes carrying the full state, users provide cryptographic proofs of their state. This enables verification scaling, where nodes only verify proofs, not manage data. The Verge in Ethereum's roadmap is the canonical implementation of this principle.

Witness size is the new bottleneck. Stateless clients require witnesses—Merkle proofs of relevant state. Current binary Merkle trees produce proofs too large for mainnet. The shift to Verkle trees reduces witness size from ~1KB to ~150 bytes, making statelessness practical.

Ethereum's roadmap validates the thesis. The merge, surge, and scourge address throughput and cost. The verge (statelessness) is the final piece that makes scaling sustainable by eliminating the node's state management overhead, enabling true horizontal scaling.

takeaways
THE STATE COMPLEXITY TRAP

TL;DR for Protocol Architects

Stateful execution is the core bottleneck for scaling L1s and L2s. Here's why optimizing it is fundamentally hard.

01

The Problem: State Bloat is Exponential

Every transaction modifies the global state, which must be stored, proven, and made available forever. This creates a quadratic scaling problem for validators and nodes.\n- Storage Cost: A full Ethereum archive node requires ~15TB and grows by ~100GB/month.\n- Sync Time: Bootstrapping a new node can take weeks, centralizing network participation.

15TB+
Archive Size
Weeks
Sync Time
02

The Solution: Statelessness & State Expiry

Decouple execution from full state storage. Clients only hold state they need, verified via cryptographic proofs. Verkle trees (Ethereum) and stateless clients are the endgame.\n- Witness Size: Reduces proof size from MBs to KBs, enabling light clients.\n- Node Requirements: Lowers hardware needs from TB to GB, enabling broader decentralization.

GBs
Node Storage
KBs
Witness Size
03

The Problem: Parallelization is State-Contention Hell

Optimistic parallel execution (Solana, Aptos, Sui) hits diminishing returns due to access conflicts. Transactions touching the same state must be serialized, creating bottlenecks.\n- Contention Rate: Real-world workloads see >30% conflict rates, capping throughput.\n- Scheduler Overhead: Dynamic scheduling adds ~10-100ms latency per block, eating gains.

>30%
Conflict Rate
~100ms
Scheduler Tax
04

The Solution: Intent-Based & Specialized VMs

Move complexity off-chain. Let users express what they want, not how to do it. UniswapX, CowSwap, and Across use solvers. Specialized VMs like FuelVM and SVM optimize for specific state models.\n- Throughput: Solvers batch and optimize off-chain, enabling 10,000+ TPS equivalents.\n- Efficiency: Dedicated VMs reduce opcode overhead by ~40% vs. general-purpose EVM.

10k+
Effective TPS
-40%
Overhead
05

The Problem: Cross-Shard/Chain State Sync

Fragmented state across rollups or shards breaks composability and creates latency. Atomic cross-domain transactions require complex, slow bridging protocols.\n- Finality Delay: Moving assets via canonical bridges can take 10 mins to 7 days.\n- Security Risk: LayerZero, Wormhole-style bridges add new trust assumptions and attack surfaces.

10min-7d
Sync Latency
New Attack Surface
Security
06

The Solution: Shared Sequencing & Isolated State

Coordinate execution across domains via a shared sequencer (e.g., Espresso, Astria) or treat each app/chain as an isolated state machine with asynchronous messaging.\n- Atomicity: Enables sub-second cross-rollup composability.\n- Simplification: Isolated state (like Celestia rollups) reduces global sync requirements by >90%.

<1s
Cross-Domain Tx
-90%
Sync Burden
ENQUIRY

Get In Touch
today.

Our experts will offer a free quote and a 30min call to discuss your project.

NDA Protected
24h Response
Directly to Engineering Team
10+
Protocols Shipped
$20M+
TVL Overall
NDA Protected direct pipeline