What is a Data Feed?

definition

BLOCKCHAIN INFRASTRUCTURE

A data feed is a continuous, real-time stream of structured information from an external source to a blockchain or decentralized application.

In blockchain contexts, a data feed (often called an oracle) is a critical piece of infrastructure that provides smart contracts with access to verified, real-world data they cannot natively access, such as asset prices, weather conditions, or sports scores. This external data is essential for triggering contract execution, as on-chain logic can only react to information that exists within its own ledger. Without a reliable data feed, smart contracts for decentralized finance (DeFi), insurance, and supply chain applications would be unable to function autonomously.

The core challenge for any data feed is the oracle problem: ensuring the data's integrity, timeliness, and resistance to manipulation as it moves from an off-chain source to the on-chain environment. Solutions to this problem include using multiple data sources for aggregation, cryptographic proofs of data provenance, and decentralized oracle networks like Chainlink or API3. These systems employ consensus mechanisms among independent node operators to deliver a single, tamper-resistant data point, known as a price feed in financial applications.

Data feeds are categorized by their data type and delivery method. A price feed delivers real-time cryptocurrency or traditional asset prices, which is foundational for DeFi lending protocols and decentralized exchanges. An event-driven feed transmits specific real-world occurrences, such as election results or flight delays. Technically, feeds can be push-based (automatically sent when new data is available) or pull-based (retrieved by the smart contract when needed), each with distinct trade-offs in cost and latency.

The security and reliability of a data feed are paramount, as a compromised feed can lead to massive financial losses. High-value applications mitigate this risk through data aggregation (averaging inputs from numerous premium sources), cryptographic signing of data at the source, and decentralization of the oracle network itself. For example, a robust ETH/USD price feed might aggregate data from over 30 independent exchanges, with nodes cryptographically attesting to the data before it is finalized on-chain.

Beyond finance, data feeds enable a vast array of blockchain use cases. They are used in parametric insurance to verify natural disasters, in dynamic NFTs to reflect real-world events, and in enterprise supply chains to track IoT sensor data. As the blockchain ecosystem evolves, the demand for more sophisticated, low-latency, and privacy-preserving data feeds continues to grow, driving innovation in oracle design and verifiable computation techniques like zero-knowledge proofs.

how-it-works

MECHANISM

How a Data Feed Works

A data feed is a continuous, automated stream of information, typically from a source to a consumer. In blockchain and DeFi, this mechanism is critical for providing smart contracts with reliable, real-world data.

A data feed is a mechanism for the continuous, automated delivery of information from a source (like an API or oracle network) to a consumer (like a smart contract or application). In decentralized systems, this process is not a simple data transfer; it involves a multi-step workflow to ensure the data's integrity, timeliness, and reliability before it is used to trigger financial transactions or state changes on-chain. The core challenge is bridging the gap between off-chain data and the deterministic blockchain environment.

The workflow typically begins with data sourcing, where a node operator or oracle network retrieves raw data from multiple high-quality sources, such as cryptocurrency exchanges, traditional financial APIs, or IoT sensors. This is followed by data aggregation, where values from these disparate sources are combined using a specific method—like a median or volume-weighted average—to produce a single, tamper-resistant data point. This aggregation mitigates the risk of relying on any single, potentially faulty or manipulated source.

Once aggregated, the data must be validated and formatted for on-chain consumption. Oracle networks often use consensus mechanisms where multiple nodes independently fetch and aggregate data; only the agreed-upon result is submitted in a transaction. This transaction, containing the formatted data point (e.g., the current ETH/USD price), is then broadcast to the blockchain, where it is delivered on-chain to a specific smart contract address, often called a consumer contract or price feed consumer.

Finally, the on-chain verification and consumption phase occurs. The receiving smart contract verifies the data's authenticity, often by checking it was signed by a trusted oracle or came from a pre-authorized address. Once verified, the contract's internal logic is executed based on the new data—for example, liquidating an undercollateralized loan, settling a derivatives contract, or adjusting rewards in a liquidity pool. This entire cycle, from sourcing to consumption, is what constitutes a functional data feed in a decentralized ecosystem.

key-features

ORACLE ARCHITECTURE

Key Features of a Data Feed

A data feed is a continuous stream of validated information from external sources (off-chain) delivered to a blockchain (on-chain). Its core features define its reliability, security, and utility for smart contracts.

01

Decentralized Data Sourcing

A robust data feed aggregates price or event data from multiple, independent sources (e.g., centralized exchanges, DEXs, trading desks) to produce a single tamper-resistant value. This prevents manipulation from any single source and creates a consensus-based truth.

Example: A BTC/USD feed might aggregate data from Coinbase, Binance, and Kraken.
Mechanism: Uses a median or TWAP (Time-Weighted Average Price) function to filter outliers.

02

Cryptographic Proofs & Attestations

Data is cryptographically signed by the oracle node or network before being written on-chain, providing verifiable authenticity. This creates an immutable audit trail linking the reported data to a specific source and time.

Key Term: Attestation - A signed message from an oracle attesting to the validity of specific data.
Purpose: Enables downstream contracts or users to cryptographically verify the data's origin and integrity.

03

On-Chain Data Availability

The final, aggregated data point is published and stored directly on the blockchain. This makes it globally accessible, transparent, and immutable for any smart contract to consume within the same block or later.

Consumption: Smart contracts read the data from a predefined storage address (e.g., an oracle contract).
Critical Property: The data becomes part of the blockchain's state, inheriting its security guarantees.

04

Update Frequency & Latency

The rate at which a feed refreshes its on-chain value, balancing freshness with cost and network load. Low latency is critical for derivatives and lending protocols to reflect real-time prices.

Heartbeat: A minimum time between updates (e.g., every block, every 5 seconds).
Deviation Threshold: An update is triggered if the off-chain price moves beyond a set percentage, ensuring updates on volatility.

05

Decentralized Oracle Networks (DONs)

The infrastructure layer of independent nodes that operate the feed. A DON distributes the responsibilities of data sourcing, aggregation, validation, and on-chain reporting across multiple parties, removing single points of failure.

Security Model: Relies on a cryptoeconomic security where nodes stake collateral that can be slashed for malicious behavior.
Example Networks: Chainlink, API3, Witnet.

06

Data Integrity & Manipulation Resistance

The combined design that makes it economically and technically infeasible to corrupt the feed. This is achieved through cryptoeconomic incentives, decentralized sourcing, and cryptographic proofs.

Primary Threat: Flash loan attacks that manipulate a price on a single DEX.
Defense: Aggregating from many sources over time (TWAP) and slashing malicious node operators.

common-types

DATA FEED ARCHITECTURE

Common Types of Data Feeds

Data feeds are categorized by their underlying architecture, which determines their security model, data sources, and trust assumptions. The primary distinction is between on-chain and off-chain data sourcing.

01

On-Chain Price Feeds

These feeds derive price data directly from on-chain sources like Decentralized Exchange (DEX) liquidity pools. They calculate a volume-weighted average price (VWAP) or a time-weighted average price (TWAP) from the pool's trading history. This method is permissionless and transparent, but can be vulnerable to manipulation on low-liquidity assets.

Examples: Uniswap V3 TWAP oracles, Chainlink Data Streams (on-chain aggregation).
Key Mechanism: Uses a moving average over a specified time window to smooth out price volatility and flash crashes.

EXPLORE

02

Off-Chain Aggregator Feeds

The most common type of decentralized oracle feed. A network of independent node operators retrieves price data from off-chain centralized exchanges (CEXs) and other premium sources. The nodes submit data on-chain, where a smart contract aggregates the reports (e.g., by removing outliers and calculating the median) to produce a single, tamper-resistant data point.

Examples: Chainlink Data Feeds, Pyth Network (via pull oracle model).
Security Model: Relies on a decentralized network of nodes and cryptographic proofs to ensure data integrity and availability.

EXPLORE

03

Pull-Based Oracle Feeds

A design where data is not continuously pushed on-chain. Instead, the data is stored off-chain (or on a high-throughput chain like Pythnet) and cryptographically signed by publishers. Applications pull the verified data on-demand via a verifiable random function (VRF) or a permissionless relay, paying only for the data they consume. This reduces on-chain gas costs and latency.

Examples: Pyth Network's core design, API3's dAPIs.
Efficiency: Ideal for high-frequency data updates where constant on-chain storage is prohibitively expensive.

EXPLORE

04

Cross-Chain Data Feeds

Feeds designed to deliver the same canonical data point to multiple blockchain networks simultaneously. They solve the oracle problem across fragmented ecosystems, ensuring DeFi applications on different chains (e.g., Ethereum, Avalanche, Polygon) operate with the same price reference. This often involves a hub-and-spoke model with a primary data source and cross-chain messaging protocols for attestation relay.

Examples: Chainlink CCIP-enabled feeds, LayerZero's Oracle module.
Use Case: Essential for cross-chain lending, derivatives, and stablecoin parity.

EXPLORE

05

Proof of Reserve Feeds

A specialized data feed that provides cryptographic proof of an entity's collateral holdings. It verifies that assets backing a tokenized product (like a stablecoin or wrapped asset) are held in reserve at a 1:1 ratio. Oracles attest to the balance and custody of reserves, often by verifying Merkle proofs of exchange wallets or bank accounts.

Examples: Feeds verifying USDC reserves, proof of BTC backing for wrapped BTC.
Function: Provides transparency and auditability to mitigate counterparty and fractional reserve risk.

06

Event-Driven & Custom Feeds

Feeds that deliver data triggered by specific real-world events or provide non-financial data. This includes sports scores, election results, weather data, or IoT sensor readings. They enable parametric insurance, prediction markets, and dynamic NFTs. The data sourcing and attestation logic is highly customized for the specific use case.

Examples: Feeds for flight status (for flight delay insurance), randomness for gaming (VRF).
Complexity: Requires defining clear, objective data resolution criteria for the oracle network to evaluate.

DATA SOURCE COMPARISON

On-Chain Data vs. Oracle Data Feed

A comparison of two primary data sources for smart contracts, highlighting their distinct origins, trust models, and use cases.

Feature	On-Chain Data	Oracle Data Feed
Data Source	Data stored and verified on the blockchain itself	Data sourced from external, off-chain systems and APIs
Data Type	Native blockchain state (e.g., token balances, transaction history)	Real-world data (e.g., price feeds, weather, sports scores)
Trust Model	Inherently trustless, secured by blockchain consensus	Requires trust in the oracle network's security and data aggregation
Latency	Deterministic, available after block confirmation	Variable, depends on oracle update frequency and network latency
Manipulation Resistance	Highly resistant; requires attacking the underlying chain	Vulnerable to manipulation of off-chain sources or the oracle itself
Primary Use Case	Settling internal contract state and interactions	Connecting smart contracts to external events and information
Example	Checking an ERC-20 balance in a wallet	Executing a derivatives contract based on an ETH/USD price
Cost to Update	Gas fees for on-chain state changes	Oracle service fees and gas for reporting transactions

ecosystem-usage

DATA FEED

Protocols & Use Cases

Data feeds, or oracles, are critical infrastructure that securely deliver external, off-chain information to on-chain smart contracts, enabling them to execute based on real-world events and data.

01

Price Feeds

The most common oracle use case, providing real-time asset prices (e.g., ETH/USD) to DeFi protocols. These feeds are essential for:

Lending platforms to determine collateralization ratios and trigger liquidations.
Decentralized exchanges (DEXs) for accurate spot and perpetual trading.
Synthetic asset protocols to mint and settle assets pegged to real-world value. Key providers aggregate data from multiple centralized and decentralized exchanges to ensure tamper resistance and accuracy.

EXPLORE

02

Proof of Reserve

A specialized data feed that provides on-demand cryptographic proof of an institution's collateral holdings. This addresses counterparty risk by allowing users to verify that assets backing a tokenized product (like a stablecoin or wrapped asset) are fully collateralized.

Real-world example: A stablecoin issuer can use a Proof of Reserve feed to publicly and automatically attest that its USD reserves match or exceed its token supply.
Enhances transparency and trust minimization for centralized bridging services and custodians.

03

Verifiable Randomness

Provides a cryptographically secure, publicly verifiable source of randomness (VRF) for smart contracts. This is critical for applications where predictable outcomes would break the system.

Primary use cases:
- NFT minting and randomized attribute generation.
- Blockchain gaming for loot boxes or matchmaking.
- Governance for fair jury selection or leader election. The process involves generating randomness off-chain and delivering it with a cryptographic proof that the provider could not have manipulated the result.

04

Cross-Chain Data (CCIP)

An emerging standard for building generalized cross-chain applications. While often associated with token transfers, the Cross-Chain Interoperability Protocol (CCIP) is fundamentally a messaging layer that can transmit any data, including data feed updates, across blockchains.

Enables oracle networks to serve as a unified data layer for a multi-chain ecosystem.
Allows a smart contract on Chain A to trigger an action on Chain B based on an event or data point, creating complex cross-chain logic.

EXPLORE

05

Event-Driven Automation

Data feeds trigger trust-minimized automation via keeper networks. When a predefined condition (e.g., a price reaching a limit) is met by an oracle, an automated transaction is executed.

Key automation functions:
- Liquidation of undercollateralized loans.
- Rebalancing of portfolio management vaults.
- Limit order fulfillment on DEXs.
- Yield harvesting and compound interest accrual. This removes manual intervention and central points of failure for time-sensitive contract functions.

06

Real-World Asset (RWA) Data

Bridges traditional finance and blockchain by providing on-chain attestations for off-chain assets. This data enables the tokenization and on-chain management of Real-World Assets.

Example data types:
- Interest rates (e.g., SOFR, Fed Funds Rate) for RWA lending.
- Credit scores or KYC/AML status for on-chain credit markets.
- Supply chain data (temperature, location) for trade finance.
- Sports scores or election results for prediction markets. This expands smart contract utility far beyond native crypto-assets.

security-considerations

DATA FEED

Security Considerations & Risks

Data feeds (oracles) are critical infrastructure that bridge off-chain data with on-chain smart contracts, introducing unique security vectors that must be understood and mitigated.

01

Data Authenticity & Source Integrity

The primary risk is the ingestion of incorrect or manipulated data from the source. Attackers can target the data provider's API or the underlying data generation process (e.g., a compromised exchange). Mitigations include:

Using multiple, independent data sources.
Implementing cryptographic proofs of data origin where possible.
Employing data attestation from trusted entities.

02

Oracle Node Compromise

The security of the oracle network nodes themselves is paramount. A compromised node can report malicious data. Key considerations:

Decentralization: A network of independent, Sybil-resistant nodes reduces single points of failure.
Node Operator Reputation & Slashing: Systems that penalize (slash) node stake for malfeasance.
Private Key Security: Nodes must securely manage the keys used to sign data reports on-chain.

03

On-Chain Data Delivery & Manipulation

Even with correct off-chain data, the on-chain delivery mechanism can be attacked.

Transaction Ordering (MEV): Block builders can front-run or sandwich oracle update transactions to exploit price discrepancies.
Data Freshness (Staleness): Out-of-date data can be used if update frequency is insufficient or transactions are delayed.
Consensus Manipulation: In some designs, attackers could manipulate the on-chain aggregation logic (e.g., via flash loans to skew a TWAP).

04

Smart Contract Integration Risk

The consuming smart contract's design determines its vulnerability to oracle failures.

Lack of Circuit Breakers: Contracts without pause mechanisms or price change limits can be drained before bad data is detected.
Over-reliance on a Single Feed: Using one oracle creates a critical dependency.
Insufficient Heartbeat Checks: Not verifying that data is being updated regularly can leave a contract operating on stale values.

05

Cryptoeconomic & Governance Attacks

Attacks targeting the oracle's tokenomics or governance.

Stake Slashing Attacks: An attacker may intentionally cause honest nodes to be slashed to reduce network security.
Governance Takeovers: Acquiring enough governance tokens to maliciously change oracle parameters (e.g., data sources, node set).
Freezing Attacks: Exploiting design flaws to freeze funds that depend on the oracle, even without stealing them.

06

Real-World Incident: The Synthetix sKRW Oracle Attack

In June 2020, a price feed error from the Korean Won (KRW) data source provided by Chainlink to the Synthetix sKRW synth caused its price to be reported at roughly 1/1000th of its true value. This allowed an arbitrageur to purchase ~39 million sKRW for just 0.000001 cents each before the issue was corrected, netting a profit. The incident highlighted the criticality of data source validation and the need for rapid circuit breaker responses.

DATA FEEDS

Common Misconceptions

Clarifying widespread misunderstandings about blockchain oracles, price feeds, and data reliability.

No, blockchain data feeds, or oracles, differ significantly in their decentralization, data sourcing, and security models. A centralized oracle relies on a single data source and operator, creating a single point of failure. In contrast, a decentralized oracle network (DON) like Chainlink aggregates data from multiple independent node operators and sources, using cryptographic proofs and consensus mechanisms to deliver a single, validated data point on-chain. The security and reliability of a smart contract are directly tied to the oracle's design, making the choice between a centralized feed and a decentralized network critical.

DATA FEEDS

Frequently Asked Questions

Essential questions about blockchain data feeds, oracles, and how they securely connect smart contracts to the real world.

A blockchain data feed is a stream of external, real-world data (like asset prices, weather, or sports scores) delivered to a smart contract by a decentralized oracle network. It works by aggregating data from multiple independent node operators, applying consensus mechanisms to validate the information, and then posting the aggregated result on-chain in a format the smart contract can consume. This process bridges the gap between the deterministic blockchain environment and variable off-chain data sources, enabling applications like DeFi lending, prediction markets, and parametric insurance to function autonomously and reliably.

Data Feed