Setting Up Data Availability for New Chains
Introduction to Data Availability for New Chains
When a new Layer 2 (L2) rollup or app-specific chain is launched, it must decide where to post its transaction data. This data is essential for anyone to reconstruct the chain's state and verify its correctness. The core problem is ensuring this data is available for download, not just promised. If data is withheld, the network cannot detect invalid state transitions, breaking its security model. This is known as the data availability problem.
Data availability (DA) is the guarantee that transaction data is published and accessible for network participants. For new blockchain layers, choosing a DA solution is a foundational architectural decision impacting security, cost, and scalability.
New chains have several architectural options for DA. The traditional method is to post all data directly to a Layer 1 (L1) like Ethereum, using its calldata. This provides the highest security by inheriting Ethereum's consensus but is expensive. Alternatives include using a validium, which posts data availability certificates or proofs to the L1 while keeping the actual data off-chain on a separate network. Another emerging category is modular DA layers, such as Celestia, EigenDA, or Avail, which are specialized blockchains built solely for cheap, high-throughput data publishing.
The choice involves a direct trade-off between security and cost. Full data on Ethereum (e.g., Optimism, Arbitrum) offers strong guarantees but high fees. Validiums (e.g., some StarkEx instances) significantly reduce cost but introduce a trust assumption in the off-chain DA committee or proof system. Modular DA layers aim for a middle ground, offering cryptographic or economic security that is weaker than Ethereum's but stronger than a simple sidechain, at a fraction of the cost.
Implementing a DA layer requires integrating with its publishing and sampling protocols. For example, a rollup using Celestia would submit its block data to Celestia's namespace, retrieve data availability proofs (like Data Availability Sampling certificates), and then post a compact proof of this publication to its settlement layer (like Ethereum). This allows the L1 to verify that the data is available on Celestia without downloading it all.
Key technical considerations include data blob formats (EIP-4844 blobs vs. raw transactions), data attestation proofs (e.g., KZG commitments, Merkle roots), and retrievability mechanisms. Developers must ensure their node software can connect to DA layer RPC endpoints, parse the returned data, and handle data withholding attacks by challenging unavailable data through fraud proofs or validity proofs.
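Blob formats become concrete with a small packing routine. The sketch below is a minimal illustration, assuming the EIP-4844 layout of 4096 field elements of 32 bytes per blob, with one byte per element left zero so each value stays below the BLS12-381 field modulus; production client libraries handle this encoding for you, so treat it as a mental model rather than a reference implementation.

```typescript
// Sketch: packing a raw transaction batch into EIP-4844-sized blobs.
// Assumes the standard layout of 4096 field elements x 32 bytes per blob,
// with 31 usable bytes per element (top byte left zero so the value stays
// below the field modulus). Other DA layers use different chunk sizes.

const FIELD_ELEMENTS_PER_BLOB = 4096;
const USABLE_BYTES_PER_ELEMENT = 31;
const USABLE_BYTES_PER_BLOB = FIELD_ELEMENTS_PER_BLOB * USABLE_BYTES_PER_ELEMENT; // 126,976

function packIntoBlobs(batch: Uint8Array): Uint8Array[] {
  const blobs: Uint8Array[] = [];
  for (let offset = 0; offset < batch.length; offset += USABLE_BYTES_PER_BLOB) {
    const chunk = batch.subarray(offset, offset + USABLE_BYTES_PER_BLOB);
    const blob = new Uint8Array(FIELD_ELEMENTS_PER_BLOB * 32); // zero-padded
    for (let i = 0; i < chunk.length; i++) {
      // Place each byte so every 32-byte field element keeps a zero top byte.
      const element = Math.floor(i / USABLE_BYTES_PER_ELEMENT);
      const byteInElement = i % USABLE_BYTES_PER_ELEMENT;
      blob[element * 32 + 1 + byteInElement] = chunk[i];
    }
    blobs.push(blob);
  }
  return blobs;
}
```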
The DA landscape is evolving rapidly with EIP-4844 proto-danksharding on Ethereum reducing L1 costs, and new modular DA providers competing on throughput and price. For a new chain, the optimal path depends on its value-at-risk, transaction volume, and time-to-market requirements. Starting with a modular DA layer can provide scalable launch economics, with the option to migrate data posting to a more secure layer as the chain matures and value locked increases.
Choosing a DA Layer: Key Considerations and Prerequisites
Choosing a data availability (DA) layer is a foundational decision for any new blockchain, directly impacting security, scalability, and decentralization. This guide outlines the key considerations and setup prerequisites.
Data availability ensures that all network participants can access and verify the data of newly proposed blocks. Without reliable DA, a chain is vulnerable to malicious validators hiding transaction data, enabling fraud. The core decision is whether to use on-chain DA (e.g., storing all data on the base layer like Ethereum) or an external DA layer (e.g., Celestia, EigenDA, Avail). On-chain DA offers maximum security but limits throughput and increases costs, while external layers provide scalable bandwidth at the cost of introducing a new trust assumption.
Before implementation, you must define your chain's requirements. Key metrics include throughput targets (bytes per second), cost sensitivity (cost per byte), and security model (economic security vs. cryptographic guarantees). For example, a high-frequency DeFi rollup may prioritize low-latency DA with fraud proofs, while a sovereign chain might value censorship resistance and opt for a DA layer with strong validator decentralization. Tools like the Celestia DA Light Client or EigenDA's Disperser have specific integration prerequisites that shape your node architecture.
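A quick sizing pass makes these metrics concrete. The numbers below are hypothetical placeholders, not benchmarks; substitute your own measured transaction sizes and current DA-layer pricing.

```typescript
// Back-of-the-envelope sizing for a DA requirements spec. All inputs here
// (TPS, average tx size, price per MB) are hypothetical placeholders --
// replace them with measured values for your chain and current DA pricing.

const targetTps = 100;         // transactions per second
const avgTxSizeBytes = 250;    // average compressed tx size
const daPricePerMbUsd = 0.10;  // assumed DA cost per MB

const bytesPerSecond = targetTps * avgTxSizeBytes;       // 25,000 B/s
const mbPerDay = (bytesPerSecond * 86_400) / 1_000_000;  // ~2,160 MB/day
const daCostPerDayUsd = mbPerDay * daPricePerMbUsd;      // ~$216/day

console.log({ bytesPerSecond, mbPerDay, daCostPerDayUsd });
```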
Your technical stack dictates integration complexity. EVM-based rollups using a settlement layer like Ethereum often integrate DA via smart contracts that verify data availability proofs, such as the KZG commitments carried by EIP-4844 blob transactions. For Cosmos SDK or Polygon CDK chains, native modules for DA layers like Celestia are available. You must prepare your node software to interface with the DA layer's RPC endpoints, typically requiring adaptations to your block production and verification logic to post and retrieve data blobs.
A critical prerequisite is establishing a robust data sampling and fraud proof system. Light nodes must be able to efficiently sample small, random portions of block data to verify its availability. This requires implementing Data Availability Sampling (DAS) protocols. If using an optimistic rollup framework like Arbitrum Nitro or Optimism Bedrock, you'll need to configure your fraud proof verifier to challenge missing data. For validity rollups using zkSync Era or Starknet, the zero-knowledge proof itself must attest to correct data availability, often by committing to the data root in the proof.
Finally, consider the operational and economic setup. Using an external DA layer requires staking or payment in the native token (e.g., TIA for Celestia, ETH for EigenDA). You must manage these funds for sequencer operations. Furthermore, you should plan for data retrieval guarantees; some layers offer archival services, while others may have shorter data retention periods, necessitating your own backup solutions. Testing your integration on a testnet (like Celestia's Mocha or EigenDA's Holesky testnet) is non-negotiable before mainnet deployment to validate performance and cost assumptions.
Key Concepts: Blobs, Data Availability Sampling, and Proofs
This guide explains the core components of modern data availability layers, focusing on how new blockchains can implement secure and scalable data publishing.
Data availability (DA) ensures that all transaction data within a block is published and accessible for network participants to verify. For new chains, especially Layer 2 rollups, a robust DA layer is critical. It prevents malicious validators from hiding transaction data, which could lead to invalid state transitions going unchallenged. The primary components of a modern DA system are blobs, Data Availability Sampling (DAS), and cryptographic proofs. Blobs are the fundamental unit of data storage, while DAS and proofs provide the mechanisms for efficient and trust-minimized verification.
Blobs are large, dedicated data packets designed for cost-effective temporary storage. Unlike calldata, which is processed and stored forever on-chain, blobs are intended to be available only for a short period (e.g., ~18 days in Ethereum's EIP-4844). This separation reduces long-term storage costs while guaranteeing data is available long enough for fraud or validity proofs to be submitted. For a new chain, implementing blob transactions involves defining a blob-carrying transaction type and integrating with a DA layer's blob submission and retrieval APIs.
Data Availability Sampling (DAS) is a technique that allows light nodes to verify data availability without downloading entire blocks. A node performs DAS by randomly sampling small chunks of the blob data. If all samples are successfully retrieved, the node can be statistically confident the entire data is available. This enables scalable verification. To support DAS, data must be erasure-coded, expanding the original data with redundant pieces using a scheme like Reed-Solomon. This ensures the data can be reconstructed even if some chunks are missing, making it impossible to hide data without failing a significant number of random samples.
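The statistics behind DAS are easy to check. Under the common model where an adversary must withhold at least half of the erasure-coded data to prevent reconstruction, each uniform random sample independently detects withholding with probability at least 1/2, so confidence grows exponentially in the sample count. The sketch below works through that arithmetic.

```typescript
// Sketch: confidence that withheld data is detected after k random samples.
// Standard DAS argument: with a 2D Reed-Solomon extension, an adversary must
// withhold a large fraction of the extended data (commonly modeled as >= 50%)
// to block reconstruction, so each uniform sample independently lands on a
// missing share with probability >= 0.5.

function dasConfidence(samples: number, withheldFraction = 0.5): number {
  // P(all k samples hit available shares) = (1 - withheldFraction)^k
  return 1 - Math.pow(1 - withheldFraction, samples);
}

// e.g. 20 samples -> 1 - 2^-20, i.e. ~0.999999 confidence of detection
console.log(dasConfidence(20));
```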
Proofs provide cryptographic guarantees about the data. The two main types are Data Availability Proofs and Data Availability Attestations. A Data Availability Proof, such as a KZG commitment or a Merkle proof, cryptographically commits to the blob's content. Clients can verify that a specific piece of data belongs to the committed blob without having the full dataset. For DAS, attestations are often used, where a committee of validators signs off on data availability. Validity and fraud proofs for rollups depend entirely on the underlying DA layer's guarantees.
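Merkle inclusion proofs are the easier of the two commitment schemes to illustrate (KZG verification requires pairing arithmetic and a trusted setup). The self-contained sketch below verifies that a leaf belongs to a committed root; it assumes SHA-256 and leaf-level-first proof ordering, both of which vary by implementation.

```typescript
// Sketch: verifying a Merkle inclusion proof against a committed root.
import { createHash } from "node:crypto";

function sha256(data: Uint8Array): Uint8Array {
  return new Uint8Array(createHash("sha256").update(data).digest());
}

function verifyMerkleProof(
  leaf: Uint8Array,
  proof: Uint8Array[], // sibling hashes, ordered from leaf level upward
  index: number,       // leaf position, used to order each hash pair
  root: Uint8Array,
): boolean {
  let hash = sha256(leaf);
  for (const sibling of proof) {
    const pair =
      index % 2 === 0
        ? Buffer.concat([hash, sibling])  // current node is the left child
        : Buffer.concat([sibling, hash]); // current node is the right child
    hash = sha256(pair);
    index = Math.floor(index / 2);
  }
  return Buffer.from(hash).equals(Buffer.from(root));
}
```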
Setting up DA for a new chain involves several steps. First, choose a DA provider (e.g., Celestia, EigenDA, Ethereum via blobs) and integrate its SDK. Your chain's sequencer or block producer must format transaction data into blobs, generate the requisite commitments (KZG or Merkle root), and post them to the DA network. You must then configure your node software to perform DAS or to verify DA attestations, ensuring your light clients can trust the chain's state without running a full archival node. Proper integration is essential for security and decentralization.
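One way to keep the provider choice swappable, and to simplify the testing discussed later, is to hide the DA integration behind a small interface. The sketch below is illustrative only; the method names and types are assumptions for this guide, not any provider's published API.

```typescript
// Illustrative abstraction over a DA provider. The shapes below are
// assumptions for this guide, not a published SDK interface.

interface BlobReceipt {
  commitment: Uint8Array; // KZG commitment or Merkle root over the blob
  height: bigint;         // DA-layer block height containing the blob
  proof: Uint8Array;      // inclusion/attestation proof for the settlement layer
}

interface DataAvailabilityLayer {
  // Post a blob and return what the settlement layer needs to verify it.
  submitBlob(namespace: Uint8Array, data: Uint8Array): Promise<BlobReceipt>;

  // Fetch blob bytes back, e.g. for state reconstruction or a DA bridge.
  retrieveBlob(namespace: Uint8Array, height: bigint): Promise<Uint8Array[]>;

  // Check availability (via DAS or attestations) without a full download.
  isAvailable(receipt: BlobReceipt): Promise<boolean>;
}
```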
Data Availability Layer Comparison
A comparison of core technical and economic characteristics for major data availability solutions.
| Feature / Metric | Celestia | EigenDA | Avail | Ethereum (Calldata) |
|---|---|---|---|---|
| Consensus Mechanism | Tendermint BFT | EigenLayer (restaked Ethereum) | Nominated Proof-of-Stake (NPoS) | Proof-of-Stake (Gasper) |
| Data Availability Sampling (DAS) | Yes (light-node sampling) | No (operator attestations) | Yes (light-client sampling) | No (full download required) |
| Blob Transaction Support | Blobstream | EigenDA Blobs | Data Availability (DA) Blocks | EIP-4844 Blobs |
| Data Blob Size Limit | 8 MB | 128 KB per operator | 4 MB | 128 KB per blob |
| Typical Cost per MB | $0.10 - $0.50 | $0.01 - $0.10 | $0.05 - $0.20 | $50 - $200 |
| Finality Time | ~15 seconds | ~6 minutes | ~20 seconds | ~12 minutes |
| Security Model | Native Token (TIA) | Restaked ETH | Native Token (AVAIL) | Ethereum Consensus |
| Interoperability | IBC, Blobstream to Ethereum | Native to Ethereum L2s | Polygon SDK, Substrate | Native Ethereum L1 |
Step-by-Step: Integrating Celestia DA
A practical guide to implementing Celestia's modular data availability layer for a new sovereign or rollup blockchain.
Celestia provides a modular data availability (DA) layer, allowing new blockchains to offload the responsibility of storing and broadcasting transaction data. This is a foundational step for building sovereign rollups or validiums that require high throughput and low costs. Instead of each chain replicating full nodes for data storage, they can post compressed data blobs to Celestia's network, where a global set of light nodes ensures the data is available for download and verification. This separation of execution from consensus and data availability is the core tenet of modular blockchain architecture.
The integration process begins with configuring your chain's node software. For a Cosmos SDK-based chain, this involves setting the correct AppVersion in your app.go file and configuring the PrepareProposal and ProcessProposal ABCI methods to handle Blob Transactions. You must also integrate a DA client, such as celestia-node, to interact with the Celestia network. Your chain's block producer (sequencer or validator) will use this client to submit data blobs containing the compressed transaction data (e.g., via SubmitPayForBlob) after building a block.
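As a minimal sketch of the submission step, the snippet below posts a blob through a local celestia-node JSON-RPC endpoint using the blob.Submit method. Parameter shapes (particularly the transaction options object) differ across celestia-node releases, so verify the call against the API reference for your version.

```typescript
// Minimal sketch: submitting a blob via a local celestia-node RPC endpoint.
// Assumes the node exposes the blob.Submit JSON-RPC method behind an auth
// token; check the exact parameter shapes for your celestia-node version.

const RPC_URL = "http://localhost:26658"; // default celestia-node RPC port
const AUTH_TOKEN = process.env.CELESTIA_NODE_AUTH_TOKEN ?? "";

async function submitBlob(namespaceB64: string, dataB64: string): Promise<number> {
  const res = await fetch(RPC_URL, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${AUTH_TOKEN}`,
    },
    body: JSON.stringify({
      jsonrpc: "2.0",
      id: 1,
      method: "blob.Submit",
      params: [
        [{ namespace: namespaceB64, data: dataB64, share_version: 0 }],
        { gas_price: 0.002 }, // tx options; shape is version-dependent
      ],
    }),
  });
  const body = await res.json();
  if (body.error) throw new Error(`blob.Submit failed: ${body.error.message}`);
  return body.result as number; // height of the Celestia block including the blob
}
```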
On the verification side, your chain's full nodes or light clients need a mechanism to confirm data availability. They do this by sampling the data posted to Celestia using Data Availability Sampling (DAS). Implement a DA bridge service that runs a Celestia light node. This service listens for new blocks from your chain, fetches the corresponding data blob from Celestia using the namespace ID and block height, and verifies its availability and correctness via cryptographic proofs. The Celestia documentation provides reference implementations for these components.
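A corresponding retrieval sketch for the DA bridge service, assuming celestia-node's blob.GetAll JSON-RPC method; as with submission, confirm the exact request and response shapes against your node version.

```typescript
// Sketch: fetching your chain's blobs back from Celestia by height and
// namespace, as a DA bridge service would. Assumes the blob.GetAll
// JSON-RPC method; verify shapes against your celestia-node version.

async function fetchBlobs(
  rpcUrl: string,
  authToken: string,
  height: number,
  namespaceB64: string,
): Promise<string[]> {
  const res = await fetch(rpcUrl, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${authToken}`,
    },
    body: JSON.stringify({
      jsonrpc: "2.0",
      id: 1,
      method: "blob.GetAll",
      params: [height, [namespaceB64]],
    }),
  });
  const body = await res.json();
  if (body.error) throw new Error(`blob.GetAll failed: ${body.error.message}`);
  // Each entry carries base64-encoded blob data plus its share commitment.
  return (body.result ?? []).map((b: { data: string }) => b.data);
}
```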
A critical step is choosing and configuring the blob format. You must decide how to serialize your block data (e.g., using Protobuf or simple RLP encoding) before submitting it as a blob. The Square size of the Celestia block you target will determine your cost and throughput; larger squares fit more data but require higher fees. You must also manage namespacing to ensure your chain's data is uniquely identifiable within Celestia's global blob space, preventing conflicts with other integrated chains.
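As a sketch of namespace construction: per Celestia's namespace specification, a namespace is 29 bytes (a version byte plus a 28-byte ID), and version-0 user namespaces require the first 18 ID bytes to be zero, leaving 10 bytes to identify your chain. The helper below encodes that rule; double-check it against the current specs.

```typescript
// Sketch: building a version-0 Celestia namespace. A namespace is 29 bytes:
// 1 version byte plus a 28-byte ID whose first 18 bytes must be zero for
// user (version-0) namespaces, leaving 10 user-chosen bytes. Verify against
// the current Celestia namespace specification.

function makeNamespaceV0(chainIdBytes: Uint8Array): Uint8Array {
  if (chainIdBytes.length > 10) {
    throw new Error("version-0 namespaces allow at most 10 user bytes");
  }
  const ns = new Uint8Array(29); // zero-filled: version 0 + 18 leading zero ID bytes
  ns.set(chainIdBytes, 29 - chainIdBytes.length); // right-align the user bytes
  return ns;
}

// e.g. derive the 10 user bytes from your chain's name
const namespace = makeNamespaceV0(new TextEncoder().encode("mychain-01"));
```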
Finally, test your integration thoroughly on Celestia's testnets (Mocha or Arabica) before mainnet deployment. Simulate high-throughput scenarios to ensure your DA bridge service is resilient and can keep up with block production. Monitor key metrics like blob submission latency, success rate, and verification time. Successful integration means your chain inherits Celestia's robust security model for data availability, allowing you to focus on execution-layer innovation while ensuring users can always reconstruct the chain state.
Step-by-Step: Integrating EigenDA
A technical guide for developers on implementing EigenDA as a modular data availability layer for new blockchain rollups and appchains.
EigenDA is a high-throughput data availability (DA) layer built on Ethereum using EigenLayer's restaking mechanism. Rather than competing for blockspace on a monolithic chain, rollups post transaction data to a dedicated, scalable marketplace of operators. For a new L2 rollup or appchain, integrating EigenDA involves configuring your node software to post data blobs to EigenDA's network of operators instead of directly to Ethereum's calldata. This reduces data posting costs by over 90% while maintaining strong cryptographic guarantees of data availability, a prerequisite for secure fraud or validity proofs.
The core integration point is your sequencer or block producer. You must modify it to batch transaction data into blobs and dispatch them to the EigenDA disperser; note that EigenDA uses its own KZG-committed blob format dispersed to operators, not Ethereum's EIP-4844 blobs. Start by importing the official EigenDA client libraries. The key steps are: 1) Blob Encoding: Use the eigenda CLI or SDK to encode your batch data into a blob. 2) Dispersal: Send the blob to the EigenDA disperser endpoint, which will distribute chunks to a committee of operators. 3) Confirmation: Receive a Data Availability Certificate (DA cert), a cryptographic commitment signed by the operator set, which you then post to your settlement layer (e.g., Ethereum) as proof.
A basic integration snippet using the TypeScript SDK illustrates the dispersal process. First, install the package (@eigenlayer/da-client). The following code shows how to configure the client and submit a blob:
```typescript
import { EigenDAClient } from '@eigenlayer/da-client';

const client = new EigenDAClient('https://disperser.eigenda.xyz');

const data = Buffer.from('your_rollup_batch_data');
const blob = await client.encodeBlob(data);
const daCertificate = await client.disperseBlob(blob);
// Post daCertificate to your settlement contract
```
The returned daCertificate contains the KZG commitment and quorum signatures needed for verification. Your rollup's bridge or verification contract on Ethereum must be able to validate this certificate against EigenDA's known operator set.
After configuring data posting, you must update your verification contracts. If you're a rollup using fraud proofs (e.g., an Optimium) or validity proofs (zk-rollup), your verifier contract on Ethereum no longer needs to directly access calldata. Instead, it checks the DA certificate. The contract verifies that the certificate is signed by a sufficient quorum of restaked EigenDA operators and that the KZG commitment matches the data root in your rollup state. EigenLayer's slashing conditions ensure operators are penalized for withholding data, making this trust-minimized. Reference EigenDA's audited contract templates for implementation.
Finally, test thoroughly on a testnet before mainnet deployment. Use EigenDA's Holesky testnet disperser and a local devnet for your chain. Monitor key metrics: dispersal latency, confirmation time, and cost per blob. Compare these against using Ethereum calldata directly. Remember that your node's data retrieval logic must also be updated; light nodes can fetch data chunks from EigenDA's nodes using the DA certificate as a lookup key, ensuring they can reconstruct the full batch if your sequencer is offline. This completes the shift to a modular data availability architecture.
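The retrieval path can be sketched abstractly. The retrieveBlob method below is a hypothetical name for illustration, not a documented SDK call; consult your client library for the actual retrieval API.

```typescript
// Hypothetical retrieval sketch: reconstructing batch data from EigenDA
// using the DA certificate posted to the settlement layer as the lookup
// key. retrieveBlob is an assumed method name for illustration only.

interface DACertificate {
  kzgCommitment: Uint8Array;
  quorumSignatures: Uint8Array[];
}

interface RetrievalClient {
  retrieveBlob(cert: DACertificate): Promise<Uint8Array>;
}

async function reconstructBatch(
  client: RetrievalClient,
  cert: DACertificate,
): Promise<Uint8Array> {
  // Operators serve the erasure-coded chunks referenced by the certificate;
  // the client reassembles and returns the original blob bytes.
  return client.retrieveBlob(cert);
}
```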
Essential Tools and SDKs
These tools and frameworks are critical for developers building or integrating with data availability layers like Celestia, EigenDA, and Avail.
Common Integration Issues and Troubleshooting
Resolve common challenges when setting up data availability for new blockchains, including RPC configuration, fee estimation, and network compatibility.
RPC sync failures are often caused by incorrect configuration or network incompatibility. The primary issues are:
- Incorrect Chain ID or Network ID: Your node's `chainId` must match the target chain's specification exactly (e.g., `1` for Ethereum mainnet, `42161` for Arbitrum One). Mismatches cause handshake failures.
- Unsupported RPC Methods: Chainscore DA requires specific JSON-RPC methods like `eth_getBlockByNumber`, `eth_getLogs`, and `debug_traceBlock`. Ensure your node (Geth, Erigon, Nethermind) exposes these endpoints.
- WebSocket vs. HTTP: The Chainscore relayer primarily uses HTTP. If you've configured a WebSocket URL (`ws://`), switch to HTTP (`http://` or `https://`).
- Node Synchronization State: Your archival node must be fully synced to the latest block. A lagging node will return incomplete data, causing transaction proofs to fail.

Checklist: Verify your `CHAINSCORE_RPC_URL` environment variable, confirm method availability via curl or the sketch below, and ensure your node is on the correct fork.
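A small script can run this checklist automatically. The sketch below uses only standard Ethereum JSON-RPC methods (eth_chainId, eth_getLogs); the expected chain ID is a placeholder you should set for your target network.

```typescript
// Sanity check for the items above: confirm the endpoint answers, reports
// the expected chain ID, and exposes a required method. Uses only standard
// JSON-RPC methods; CHAINSCORE_RPC_URL matches the checklist variable.

const rpcUrl = process.env.CHAINSCORE_RPC_URL!;
const EXPECTED_CHAIN_ID = 42161; // e.g. Arbitrum One; set for your target chain

async function rpc(method: string, params: unknown[] = []): Promise<any> {
  const res = await fetch(rpcUrl, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ jsonrpc: "2.0", id: 1, method, params }),
  });
  return res.json();
}

const chainId = parseInt((await rpc("eth_chainId")).result, 16);
if (chainId !== EXPECTED_CHAIN_ID) {
  throw new Error(`chainId mismatch: got ${chainId}, want ${EXPECTED_CHAIN_ID}`);
}

// JSON-RPC error code -32601 ("method not found") means the endpoint does
// not expose the method -- check your node's enabled API namespaces.
const logs = await rpc("eth_getLogs", [{ fromBlock: "latest", toBlock: "latest" }]);
if (logs.error?.code === -32601) {
  throw new Error("eth_getLogs not exposed by this endpoint");
}
```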
Cost Analysis: DA vs. Calldata on Ethereum
Comparison of transaction costs for posting data to Ethereum mainnet using blob-carrying transactions (EIP-4844) versus traditional calldata.
| Cost Factor | EIP-4844 Blob (Data Availability) | Calldata (Execution Layer) | Notes |
|---|---|---|---|
| Base Fee Unit | Blob Gas | Execution Gas | Separate fee markets after EIP-4844 |
| Typical Cost per Byte | $0.0001 - $0.001 | $0.01 - $0.10 | Blob cost is ~10-100x cheaper at scale |
| Data Capacity per Tx | Up to ~768 KB (6 blobs × 128 KB) | ~128 KB (typical node tx-size limit) | Calldata is limited by block gas; blobs have a dedicated per-block limit |
| Fee Market Volatility | Low to Medium | Very High | Blob gas targets are separate from execution gas, reducing correlation with NFT/DeFi activity |
| Long-Term Storage | Pruned after ~18 days | Permanently on-chain | Blobs are not stored by Ethereum nodes long-term; only commitments are |
| Developer Overhead | Requires 4844 client & blob API | Standard eth_sendTransaction | Integrating blobs requires updates to client libraries and RPC calls |
| Ideal Use Case | High-volume L2 batch posting, modular DA | Small data, permanent on-chain records, contract interactions | |
Official Documentation and Resources
Primary documentation sources for setting up data availability when launching or migrating a new blockchain. Each resource covers node architecture, publishing flows, and integration details required for production deployments.
Frequently Asked Questions
Common questions and troubleshooting for developers integrating data availability layers for new blockchain networks.
What is data availability (DA), and why does it matter for a new chain?
Data availability (DA) refers to the guarantee that all transaction data for a block is published and accessible to network participants. For new Layer 2s and app-chains, it's the foundation for security and trustlessness. Without reliable DA, validators cannot verify state transitions, and users cannot reconstruct the chain's history to detect fraud. Modern solutions like Celestia, EigenDA, and Avail separate DA from consensus, allowing chains to post data off their main execution layer. This reduces costs by over 90% compared to posting all data directly to Ethereum L1, while maintaining strong security guarantees for light clients and fraud/validity proofs.
Conclusion and Next Steps
You have now configured the core components for data availability on your new blockchain. This guide has covered the essential steps from selecting a DA layer to integrating it with your node software.
The choice of a Data Availability (DA) layer is a foundational architectural decision. Whether you selected a modular DA solution like Celestia, EigenDA, or Avail, or opted for an Ethereum-based approach using EIP-4844 blobs or a validium, your implementation directly impacts security, cost, and throughput. Ensure your chosen solution's consensus mechanism and data sampling assumptions align with your chain's trust model. The integration code you've written, particularly the DataAvailabilityLayer interface and the modifications to your block production logic, forms the critical bridge between your execution environment and the external DA network.
With the core integration complete, your next steps should focus on robust testing and monitoring. Deploy your modified node to a private testnet and simulate various failure modes:
- DA node unavailability during block proposal
- Malformed data submissions to the DA layer
- Network partition scenarios

Use the DA layer's native tools, like Celestia's celestia-node or EigenDA's disperser client, to verify data was correctly posted and is retrievable. Implement health checks and metrics (e.g., DA posting latency, confirmation times, cost per byte) in your node's observability stack to catch issues early.
Finally, consider the long-term operational and upgrade path. DA layers are rapidly evolving. Stay informed about protocol upgrades; for instance, monitor Ethereum's progress on full danksharding if using blobs. Plan for how your chain will handle a mandatory migration or a security incident in the underlying DA layer. Engage with the community of your chosen DA provider and contribute to its ecosystem. The security and scalability of your application chain now depend on this external subsystem, making its reliable operation a continuous priority.