FHE is computationally prohibitive. Encrypting, processing, and decrypting data with Fully Homomorphic Encryption requires orders of magnitude more compute than plaintext operations, creating an insurmountable cost barrier for on-chain inference.
Fully Homomorphic Encryption Is Overhyped for On-Chain AI
A technical analysis arguing that while FHE is a cryptographic marvel, its immense computational cost makes it impractical for verifying on-chain AI inference. Zero-Knowledge Proofs, particularly zkML, provide the necessary privacy and verifiability with orders of magnitude less overhead.
Introduction
FHE's computational overhead makes it impractical for most on-chain AI applications today, despite its theoretical promise.
The market misunderstands the use case. The primary demand is for verifiable inference, not private computation. Projects like EigenLayer's AVS for AI and Ritual's Infernet focus on proving correct execution, which is a more tractable problem.
Performance metrics are disqualifying. A single encrypted multiplication can be 1,000,000x slower than its plaintext equivalent. This makes real-time applications, like an AI agent on an Ethereum rollup, economically impossible with current FHE schemes like TFHE.
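To make the overhead concrete, here is a back-of-envelope latency model using the slowdown factors cited in this article. The 10 ms plaintext inference time is an illustrative assumption, not a measured figure.

```python
# Back-of-envelope model of FHE overhead, using the slowdown factors
# cited in this article. The 10 ms plaintext inference time is an
# illustrative assumption, not a measured figure.

PLAINTEXT_INFERENCE_S = 0.010  # assumed: 10 ms for one plaintext inference

def encrypted_latency(plaintext_s: float, overhead: float) -> float:
    """Projected wall-clock time for the same inference under FHE."""
    return plaintext_s * overhead

for overhead in (10_000, 100_000, 1_000_000):
    t = encrypted_latency(PLAINTEXT_INFERENCE_S, overhead)
    print(f"{overhead:>9,}x overhead -> {t:,.0f} s (~{t / 60:,.1f} min)")
```

Even at the optimistic end of the range, a 10 ms inference balloons past a minute; at 1,000,000x it takes hours, which is what rules out interactive agents.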
The Market's Misplaced Bet on FHE
Fully Homomorphic Encryption is being pitched as the holy grail for on-chain AI, but its computational overhead makes it a non-starter for real-time applications.
The Latency Lie: FHE vs. ZKML
FHE operations introduce a roughly 10,000x-1,000,000x slowdown versus plaintext computation, making interactive AI agents impossible. Zero-Knowledge Machine Learning (zkML) frameworks like EZKL and Giza verify execution off-chain rather than encrypting it, offering a pragmatic path forward.
- Key Benefit 1: ZK proofs verify a private inference in seconds, while FHE takes minutes to compute it.
- Key Benefit 2: Enables verifiable AI without the prohibitive runtime cost of homomorphic ops.
The Cost Chasm: Who Pays for the Overhead?
FHE's massive computational burden translates directly to gas costs. A single private inference could cost >$100 on Ethereum L1, destroying any viable economic model. Projects like Ritual (with its Infernet network) are exploring hybrid models that use trusted hardware or off-chain compute with on-chain settlement to bypass this.
- Key Benefit 1: Offloads FHE's heaviest work to specialized co-processors or networks.
- Key Benefit 2: Creates a sustainable fee market for private AI, unlike pure on-chain FHE.
The Privacy Mismatch: On-Chain Data is Already Public
FHE protects data while it is being computed on, but inputs and outputs must still cross the chain boundary. For most DeFi or gaming AI agents, the request and the result are the valuable secrets, and FHE alone does not hide them at those endpoints. Solutions like Automata use TEEs (Trusted Execution Environments) to guarantee private execution with integrity, a more complete answer for this case.
- Key Benefit 1: TEEs (e.g., Intel SGX) provide full data confidentiality and code integrity.
- Key Benefit 2: Avoids the cryptographic overhead of FHE for equivalent privacy guarantees.
The Infrastructure Gap: No FHE L2 Exists
Despite the hype, no production-grade FHE-optimized Layer 2 or co-processor network has meaningful adoption. Contrast this with the multi-billion-dollar TVL in ZK-Rollups like zkSync and Starknet, which have spent years optimizing provers. FHE requires entirely new hardware (e.g., FHE ASICs) not yet deployed at scale.
- Key Benefit 1: ZK-Rollups have proven scaling paths (recursive proofs, custom VMs).
- Key Benefit 2: Hardware acceleration for FHE is still a research project, not a product.
Core Argument: The Latency & Cost Chasm
FHE's computational overhead creates prohibitive latency and gas costs for on-chain AI inference, making it a non-starter for real-time applications.
FHE's computational overhead is immense. Encrypting, computing on, and decrypting data requires orders of magnitude more processing than plaintext operations, measured in seconds or minutes per inference.
On-chain gas costs become astronomical. A single FHE operation can cost hundreds of dollars in gas, versus cents for verifying a ZK-SNARK proof from a system like Risc Zero or a zkML framework.
This creates a fundamental latency chasm. Real-time applications like AI agents or on-chain games require sub-second responses; FHE's minute-long delays are a deal-breaker.
Evidence: A 2023 Zama benchmark showed a single encrypted inference on fhEVM took ~30 seconds and cost over $50 in gas, while a comparable Giza zkML proof finalized in ~2 seconds for under $1.
FHE vs. ZKP: A Performance Reality Check
A quantitative comparison of cryptographic primitives for private computation, focusing on the practical constraints for on-chain AI inference and model training.
| Feature / Metric | Fully Homomorphic Encryption (FHE) | Zero-Knowledge Proofs (ZKPs) | Trusted Execution Env. (TEE) |
|---|---|---|---|
| Computational Overhead (vs. plaintext) | 10,000x - 1,000,000x | 100x - 10,000x (proving) | 1.1x - 2x |
| Proof/Result Verification Time | < 1 sec (decryption) | Milliseconds (on-chain SNARK verify) | < 100 ms |
| On-Chain Gas Cost for 1 GPT-3.5 Inference | Prohibitive (infeasible on L1 today) | $50 - $200 (zkML) | $5 - $20 |
| Supports Arbitrary Computations | Yes (with bootstrapping) | Yes (as arithmetic circuits) | Yes (native code) |
| Requires Specialized Hardware (ASIC/FPGA) | Yes, for practicality (FHE ASICs not yet at scale) | Optional (GPU provers accelerate) | Yes (TEE-capable CPUs, e.g., Intel SGX) |
| Active On-Chain Projects | Fhenix, Inco | Modulus, EZKL, Giza | Phala, Oasis |
| Primary Use Case | Encrypted state & recurrent computations | Verifiable inference & one-off proofs | High-throughput private model serving |
Why ZKPs Are the Pragmatic Path for On-Chain AI
Fully Homomorphic Encryption is computationally prohibitive for real-time on-chain AI, making Zero-Knowledge Proofs the most viable scaling solution today.
FHE is computationally prohibitive. Performing inference on FHE-encrypted data requires orders of magnitude more compute than plaintext operations, making real-time on-chain AI applications economically impossible.
ZKPs separate proof from execution. Protocols like Risc Zero generate proofs of correct AI model execution off-chain, posting only a tiny cryptographic proof on-chain for verification; restaking-based AVSs on EigenLayer back the same pattern with economic security.
This mirrors the L2 scaling playbook. Just as Arbitrum and zkSync scale Ethereum by moving execution off-chain, ZKPs scale on-chain AI by moving its immense computational burden off-chain.
Evidence: A single FHE multiplication can be 1,000,000x slower than its plaintext equivalent, while a ZK-SNARK proof for a complex AI model can be verified on-chain in milliseconds for a few cents.
Protocols Building the ZK-Centric Future
FHE's computational overhead makes it impractical for on-chain AI inference; these protocols are building viable alternatives.
The Problem: FHE's Latency is a Dealbreaker
Fully Homomorphic Encryption requires ~1,000,000x more compute than plaintext operations. For real-time AI inference, this translates to latencies of minutes to hours, not milliseconds. This kills UX for any interactive application like on-chain trading agents or gaming AIs.
The Solution: ZKML with zkSNARKs
Protocols like Modulus, Giza, and EZKL use zkSNARKs to prove off-chain AI inference was performed correctly. This provides cryptographic integrity with sub-second verification on-chain. The model weights and input data can remain private or public, offering flexible trust models for oracles and autonomous agents.
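The prove-off-chain / verify-on-chain flow described above can be sketched structurally. The hash-based "proof" below is a stand-in for a real zkSNARK (e.g., one produced by EZKL, whose actual API this does not reproduce); it illustrates the data flow, not the cryptography.

```python
# Structural sketch of the zkML flow: heavy inference plus proving happen
# off-chain; the chain only runs a cheap verification. The hash-based
# "proof" is a stand-in for a real succinct SNARK — illustrative only.
import hashlib
from dataclasses import dataclass

@dataclass(frozen=True)
class Attestation:
    model_id: str
    input_hash: str
    output: float
    proof: str          # real systems: a succinct SNARK, not a hash

def _digest(*parts: str) -> str:
    return hashlib.sha256("|".join(parts).encode()).hexdigest()

def prove_inference(model_id: str, x: float) -> Attestation:
    """Off-chain: run the model and produce an attestation of the run."""
    y = 2.0 * x + 1.0                      # toy stand-in 'model'
    input_hash = _digest(str(x))
    proof = _digest(model_id, input_hash, str(y))
    return Attestation(model_id, input_hash, y, proof)

def verify_on_chain(att: Attestation) -> bool:
    """On-chain: an O(1) check whose cost is independent of model size."""
    return att.proof == _digest(att.model_id, att.input_hash, str(att.output))
```

The design point this captures is why verification stays cheap: the on-chain check never re-executes the model, so its cost does not grow with model size.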
The Solution: Optimistic ML with Fraud Proofs
Frameworks like Cartesi use optimistic fraud proofs for heavy computation. AI inference runs in a deterministic VM off-chain, with a ~1-week challenge period for disputes. This offers a pragmatic, cost-effective path for complex models where instant finality isn't required, trading latency for scalability.
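The optimistic pattern above can be sketched as a tiny state machine: a result is posted, watchers may dispute it within the window, and it finalizes only if unchallenged. The window length and single-shot dispute are illustrative simplifications; real systems bond claims and resolve disputes via interactive bisection.

```python
# Minimal sketch of optimistic settlement with a challenge window.
# Window length and the single-shot dispute are illustrative; real
# systems add bonding and interactive bisection to resolve disputes.
from dataclasses import dataclass

CHALLENGE_WINDOW_S = 7 * 24 * 3600  # ~1-week dispute period

@dataclass
class Claim:
    claimed_output: float
    posted_at: float
    disputed: bool = False

def post_claim(output: float, now: float) -> Claim:
    """An operator posts the inference result it claims is correct."""
    return Claim(claimed_output=output, posted_at=now)

def challenge(claim: Claim, recomputed_output: float) -> None:
    """A watcher re-executes the inference in the deterministic VM and
    disputes the claim if the posted result differs."""
    if recomputed_output != claim.claimed_output:
        claim.disputed = True

def is_final(claim: Claim, now: float) -> bool:
    """A claim settles only after surviving the full window unchallenged."""
    return (not claim.disputed) and (now - claim.posted_at >= CHALLENGE_WINDOW_S)
```

This makes the trade-off explicit: correctness rests on at least one honest watcher re-executing the VM, and finality waits out the full window.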
The Solution: Hybrid Trusted Execution (TEE + ZK)
Projects like Phala Network and Oasis combine Trusted Execution Environments (TEEs) with ZK proofs for scalable confidential compute. TEEs handle the heavy FHE-like computation privately, while ZK proofs attest to the integrity of the TEE's operation. This provides a balanced trade-off between performance and verifiable trust.
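The hybrid pattern above can be sketched as follows: a TEE returns a result plus a report binding that result to a known enclave measurement, and the verifier accepts only results from the expected enclave. Real attestation uses vendor-signed quotes (e.g., Intel SGX DCAP), not the bare hashes used here for illustration.

```python
# Sketch of TEE-style attestation checking: the verifier accepts a result
# only if the report (a) names the expected enclave measurement and
# (b) cryptographically binds this exact result. Real attestation uses
# vendor-signed quotes (e.g., Intel SGX DCAP), not bare hashes.
import hashlib

EXPECTED_MEASUREMENT = hashlib.sha256(b"model-serving-enclave-v1").hexdigest()

def enclave_run(x: float) -> tuple[float, dict]:
    """Inside the TEE: compute privately, emit result + attestation report."""
    y = x * x  # stand-in for the private model computation
    report = {
        "measurement": EXPECTED_MEASUREMENT,
        "result_hash": hashlib.sha256(str(y).encode()).hexdigest(),
    }
    return y, report

def verify_report(result: float, report: dict) -> bool:
    """Verifier: right enclave code AND report actually binds this result."""
    return (
        report["measurement"] == EXPECTED_MEASUREMENT
        and report["result_hash"] == hashlib.sha256(str(result).encode()).hexdigest()
    )
```

Note what is and isn't checked: the measurement attests to *which code* ran, while the result hash ties the output to that run; trust in the hardware vendor replaces FHE's cryptographic guarantee.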
The Problem: FHE's Cost Prohibits Scale
The astronomical gas cost of executing FHE operations on-chain would consume more resources than the underlying AI task is worth. At current Ethereum gas prices, a single FHE operation could cost hundreds of dollars, making decentralized AI agents economically non-viable compared to centralized APIs.
The Arbiter: Specialized Co-Processors
Networks like Espresso Systems with the Tiramisu data-availability layer and Celestia-inspired rollups are building ZK-optimized execution environments. These act as co-processors for AI, handling verification off the main L1. This architectural shift is essential to make ZKML and optimistic ML economically sustainable at scale.
Steelman: When FHE *Might* Matter (And Why It's Not Now)
FHE's utility is constrained to niche, high-value applications where on-chain privacy is non-negotiable.
Private On-Chain Auctions are the canonical use case. FHE enables sealed-bid mechanisms like Fhenix's confidential DeFi, preventing front-running and information leakage (Aztec pursues related privacy with ZK via Noir). The genuine FHE advantage is computing over shared state that stays hidden from every participant, which ZKPs alone cannot do.
Regulated Financial Primitives require FHE's data-in-use encryption. Projects like Zama and Inco Network target tokenized RWAs and compliant DeFi where sensitive data (KYC, credit scores) must be processed on-chain without exposure.
The performance tax is prohibitive. FHE operations run 1,000x to 1,000,000x slower than plaintext computation, depending on the scheme and operation. This confines FHE to low-frequency, high-value logic, not the high-throughput inference demanded by most on-chain AI agents.
ZKPs dominate for verification. For most AI applications, proving a model executed correctly (via EZKL, Giza) is sufficient. FHE's full encryption is overkill when the model weights or input data aren't a secret.
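The property the steelman leans on — computing on values no party sees in the clear — can be demonstrated with the additively homomorphic Paillier scheme. The toy parameters below are hopelessly insecure and purely illustrative; note also that a real sealed-bid auction needs encrypted *comparison*, which requires full FHE schemes like TFHE, while addition alone is the simplest homomorphic operation.

```python
# Toy Paillier (additively homomorphic) with tiny, insecure parameters,
# to show the property FHE generalizes: computing on ciphertexts.
# Real sealed-bid auctions need encrypted comparison, hence full FHE
# schemes like TFHE — addition, shown here, is the simplest instance.
from math import gcd

p, q = 293, 433                    # toy primes — never use sizes like this
n = p * q
n2 = n * n
g = n + 1
lam = (p - 1) * (q - 1) // gcd(p - 1, q - 1)   # lcm(p-1, q-1)

def _L(x: int) -> int:
    return (x - 1) // n

mu = pow(_L(pow(g, lam, n2)), -1, n)           # requires Python 3.8+

def encrypt(m: int, r: int = 5) -> int:
    """c = g^m * r^n mod n^2 (fixed r only to keep the demo deterministic)."""
    return pow(g, m, n2) * pow(r, n, n2) % n2

def decrypt(c: int) -> int:
    return _L(pow(c, lam, n2)) * mu % n

def add_encrypted(c1: int, c2: int) -> int:
    """Multiplying ciphertexts decrypts to the plaintext sum: the homomorphism."""
    return c1 * c2 % n2
```

Two encrypted bids can thus be aggregated by anyone holding only ciphertexts, and only the key holder can decrypt the total.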
TL;DR for CTOs & Architects
FHE is a cryptographic marvel, but its computational overhead makes it a poor fit for on-chain AI inference. Here's where to focus instead.
The Latency & Cost Problem
FHE operations are orders of magnitude slower than plaintext computation. On-chain AI inference is already a latency and gas-cost nightmare; FHE makes it economically impossible for real-time applications.
- Latency: FHE ops are ~10,000x to 1,000,000x slower than plaintext.
- Cost: A single encrypted inference could cost >$100 in gas, versus cents for a verifiable ZK proof.
The Verifiability Alternative (ZKML)
For on-chain AI, you don't need to hide the model and data during execution; you need to cryptographically verify the result. Zero-Knowledge Machine Learning (zkML) frameworks like EZKL or Giza prove a computation was performed correctly off-chain with ~100-1000x less overhead than FHE.
- Key Use: Model integrity, verifiable inference, and fair sequencing.
- Entities: Modulus Labs, Gensyn, Risc Zero.
The Privacy Alternative (TEEs & MPC)
If input/output privacy is the goal, use specialized hardware (TEEs) or multi-party computation. Trusted Execution Environments like Intel SGX provide a secure, isolated enclave with near-native speed. MPC distributes trust among parties.
- TEEs: ~native speed, trusted hardware requirement.
- MPC: Trustless but slower, ideal for high-value, low-frequency ops.
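The MPC option above rests on secret sharing. A minimal sketch of additive secret sharing over a prime field: each party holds a share that reveals nothing alone, yet parties can jointly compute a sum by adding shares locally. Real MPC protocols (e.g., SPDZ-style) add authentication MACs and multiplication; the field modulus here is an illustrative choice.

```python
# Minimal additive secret sharing over a prime field — the building block
# of MPC. Each party's share is individually meaningless, but shares of
# two secrets can be added locally to get shares of the sum. Illustrative
# only; real MPC protocols add MACs and multiplication sub-protocols.
import random

PRIME = 2_147_483_647  # Mersenne prime used as the field modulus (assumption)

def share(secret: int, n_parties: int, rng: random.Random) -> list[int]:
    """Split a secret into n additive shares that sum to it mod PRIME."""
    shares = [rng.randrange(PRIME) for _ in range(n_parties - 1)]
    shares.append((secret - sum(shares)) % PRIME)
    return shares

def reconstruct(shares: list[int]) -> int:
    """Only the full set of shares recovers the secret."""
    return sum(shares) % PRIME

def add_shared(a: list[int], b: list[int]) -> list[int]:
    """Each party adds its own two shares locally — no communication."""
    return [(x + y) % PRIME for x, y in zip(a, b)]
```

This is why MPC is "trustless but slower": no single party ever sees a secret, but anything beyond addition costs interaction rounds between the parties.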
The Real Niche: Encrypted State
FHE's killer app is not on-chain AI inference, but encrypted state and conditional logic. Projects like Fhenix and Inco use FHE to enable private smart contracts where state is always encrypted, enabling novel DeFi and gaming mechanics impossible with ZK or TEEs alone.
- Use Case: Private auctions, sealed-bid governance, hidden-game-state RPGs.
- Trade-off: Accept high cost for specific, high-value privacy guarantees.