How Block Assembly Works: A Step-by-Step Guide for Developers

introduction

BLOCKCHAIN FUNDAMENTALS

Introduction to Block Assembly

Block assembly is the critical process where a network node collects, validates, and orders transactions into a candidate block for consensus.

At its core, block assembly is the act of constructing a candidate block. A node, typically a validator or miner, selects pending transactions from its mempool, executes them locally to verify state changes, and orders them into a data structure. This process determines the block's content, transaction fees, and gas usage before it is proposed to the network. The assembler's goal is to maximize the block's value, often by prioritizing transactions with the highest fees, while adhering to protocol rules like block gas limits and size constraints.

The assembler must handle transaction dependencies and nonce ordering. For example, if Transaction B spends an output created by Transaction A, both must be included with A coming first. Ethereum clients use algorithms to build a transaction pool where transactions are sorted by maxPriorityFeePerGas and maxFeePerGas (post-EIP-1559) or simply by gas price (pre-EIP-1559). Tools like Geth's txpool and the eth_getBlockByNumber RPC call allow developers to inspect pending transactions and see assembly logic in action.

In Proof-of-Work (PoW) systems like Bitcoin, miners perform assembly and then expend computational power to find a valid nonce. In Proof-of-Stake (PoS) systems like Ethereum, validators are randomly selected to propose a block; they run local execution clients (e.g., Geth, Erigon) for assembly and consensus clients (e.g., Prysm, Lighthouse) for coordination. The block is then broadcast as a block proposal to peer nodes, who verify its contents before attesting to its validity.

prerequisites

PREREQUISITES AND CORE CONCEPTS

How Block Assembly Works in Practice

A technical walkthrough of the process by which transactions are selected, ordered, and packaged into a new block by a network validator or miner.

Block assembly is the critical, real-time process where a network participant constructs the next block in the blockchain. It begins when a node, acting as a block producer (e.g., a miner in Proof-of-Work or a validator in Proof-of-Stake), is selected to create a new block. This node's primary task is to collect pending transactions from its mempool (memory pool), a local database of unconfirmed transactions broadcast across the network. The assembler must then select a subset of these transactions, order them, and execute them to compute a new valid state root.

The selection and ordering logic is governed by the node's transaction prioritization policy. While the base incentive is to maximize transaction fees, other factors influence the final order. These include Gas Price auctions in Ethereum-like chains, where users bid for inclusion, and MEV (Maximal Extractable Value) strategies, where searchers pay premiums to position transactions advantageously. A block assembler may run sophisticated algorithms to reorder transactions, inserting their own arbitrage or liquidations to capture value, a practice known as proposer-builder separation (PBS) in advanced designs.

During assembly, the node sequentially executes each chosen transaction within its local EVM or VM. This execution validates signatures, checks balances, runs smart contract code, and updates the world state. The output is a set of state changes and a list of transaction receipts. Crucially, the assembler must ensure the total computational load, measured in gas used, does not exceed the block's gas limit. Failed transactions are typically included but marked as reverted, with their gas still consumed.

The final assembled block contains a block header and the list of ordered transactions. The header includes metadata such as the parent block hash, the new state root, transaction root, receipts root, bloom filter, gas used, and a timestamp. For the block to be propagated and accepted by the network, it must satisfy all consensus rules (valid PoW, proper PoS signature) and execution-layer validity rules. Invalid blocks are rejected by honest peers.

In practice, modern block assembly is often outsourced to specialized block builders via markets like Flashbots Auction on Ethereum. Builders compete to create the most profitable block bundle and submit it to the winning validator. This separation enhances efficiency and can reduce negative MEV externalities. Understanding this process is essential for developers optimizing transaction submission, analysts tracking network activity, and anyone designing protocols sensitive to transaction ordering.

key-components

ARCHITECTURE

Key Components of a Block Assembly System

Block assembly is the critical process of constructing a valid block for the network. This system involves several specialized components working in concert.

Block Builder

The block builder is the core engine responsible for assembling transactions into a candidate block. It performs several key functions:

Transaction Selection: Chooses pending transactions from the mempool based on criteria like fee priority.
State Execution: Simulates the execution of selected transactions to compute the resulting state root.
Block Construction: Packages the transactions, block header, and execution results into a structured block proposal. Builders compete to create the most profitable or efficient block for validators.

EXPLORE

Relay

A relay acts as a trusted, neutral intermediary between block builders and validators (proposers). Its primary role is to prevent MEV (Maximal Extractable Value) exploitation and censorship. Key responsibilities include:

Bid Collection: Receives encrypted block bids from multiple builders.
Bid Auction: Hosts a first-price auction for the right to have a block included.
Payload Delivery: Decrypts the winning bid and delivers the block payload to the validator. Relays like the Flashbots Relay are essential for Proposer-Builder Separation (PBS).

EXPLORE

Mempool

The mempool (memory pool) is the network's waiting area for unconfirmed transactions. It is the primary data source for block builders. Important characteristics:

Transaction Propagation: Nodes broadcast signed transactions peer-to-peer, where they are gossiped across the network.
Fee Market: Transactions with higher gas fees are typically prioritized for inclusion.
Privacy Concerns: Public mempools expose transaction intent, leading to front-running risks, which has spurred the growth of private transaction channels like Flashbots Protect.

EXPLORE

Consensus Client

The consensus client (e.g., Prysm, Lighthouse) is run by validators to participate in the blockchain's proof-of-stake protocol. In block assembly, its key role is:

Proposer Selection: The beacon chain pseudo-randomly selects a validator to propose a block for a given slot.
Block Proposal: The selected validator's consensus client receives a block payload from a relay/builder and publishes it to the network.
Attestation: Other validators' clients attest to the validity of the proposed block. This client manages the validator's duties according to the consensus rules.

EXPLORE

Execution Client

The execution client (e.g., Geth, Erigon) is responsible for transaction execution and state management. It interacts closely with the block assembly process:

State Management: Maintains the entire current state of the Ethereum Virtual Machine (EVM), including account balances and smart contract storage.
Transaction Execution: When a builder assembles a block, it uses an execution client to simulate the transactions and generate a new state root.
Payload Generation: Produces the execution payload, which contains the transaction list and post-state data, for the consensus layer.

EXPLORE

MEV-Boost Middleware

MEV-Boost is middleware that allows Ethereum validators to outsource block building to a competitive marketplace. It is a practical implementation of Proposer-Builder Separation (PBS). How it works:

Interfaces with Relays: The validator's software connects to multiple, independent relays to source block bids.
Auction Participation: Relays run auctions where builders submit bids for their block proposals.
Profit Maximization: The validator automatically selects the highest-paying, valid block header, maximizing their rewards from MEV. Over 90% of Ethereum blocks are built using MEV-Boost.

EXPLORE

step-1-mempool

BLOCK ASSEMBLY

Step 1: Monitoring and Managing the Mempool

The mempool is the staging area for all pending transactions before they are included in a block. Effective monitoring and management are the first critical steps for any block builder.

A node's mempool (memory pool) is a dynamic, unsorted collection of all valid, signed transactions that have been broadcast to the network but not yet confirmed in a block. For block builders, this is the raw material. Monitoring involves subscribing to the peer-to-peer network via the mempool or tx message types to receive new transactions in real-time. Builders must validate each transaction against the current state—checking signatures, nonces, and gas—before adding it to their local pool. This creates a private, prioritized view of pending demand.

Managing this pool requires sophisticated logic. Simple first-in-first-out (FIFO) ordering is inefficient. Builders implement algorithms to sort transactions primarily by effective gas price (tip per unit of gas), as this directly impacts validator revenue. They must also consider dependencies (a transaction requiring the output of another unconfirmed transaction) and evict transactions that become invalid due to a conflicting transaction being mined. Tools like Geth's txpool API or a dedicated service like Flashbots Protect provide interfaces to inspect and interact with the mempool.

In practice, builders don't just watch the public mempool. To capture maximum value, they also monitor private transaction channels (e.g., Flashbots bundles, Taichi Network, or private RPC endpoints) where users and searchers submit transactions directly to avoid front-running and ensure inclusion. This creates a multi-source stream of transaction flow that the builder must merge and evaluate. The goal is to construct the most profitable candidate block from this combined set, which leads directly to the next step: constructing the block payload itself.

step-2-selection

BLOCK ASSEMBLY

Step 2: Transaction Selection Algorithms

After transactions are validated, the block builder must select which ones to include. This selection is governed by algorithms that optimize for fee revenue, network health, and fairness.

The primary goal for most block producers is maximizing fee revenue. This leads to the widespread use of a Greedy Fee Rate (GFR) algorithm, which simply sorts the mempool by fee-per-byte (or per-gas) in descending order and includes transactions until the block is full. This is computationally simple and aligns with economic incentives. In Ethereum, this is often referred to as a Priority Gas Auction (PGA) environment, where users compete by increasing transaction maxPriorityFee to get included faster.

However, pure GFR has drawbacks. It can lead to starvation for low-fee transactions and doesn't account for dependencies. A Child Pays for Parent (CPFP) is a dependency where a low-fee parent transaction is bundled with a high-fee child transaction to incentivize inclusion. More advanced algorithms, like those used in Bitcoin Core's block template creation, perform ancestor set scoring. Instead of evaluating single transactions, they evaluate packages (a transaction plus its unconfirmed ancestors) by their total fee rate, ensuring dependent transactions are mined together.

Beyond fees, selection algorithms must consider network rules and policy. Most nodes enforce minimum relay fees and rules against dust outputs. Builders also implement Replace-By-Fee (RBF) policies, allowing a higher-fee transaction to replace a lower-fee one. In Ethereum post-EIP-1559, the algorithm targets the base fee, and users specify a priority fee (tip) for the miner. Builders select transactions where maxFeePerGas >= baseFee + minTip, again typically choosing those with the highest priorityFee.

Mempool fragmentation presents a challenge. Not all nodes see the same set of transactions due to network propagation delays. Sophisticated builders run mempool monitoring services to gather a more complete view. In practice, builders like mev-geth or Flashbots builders use enhanced algorithms that also evaluate Maximal Extractable Value (MEV) opportunities, reordering or inserting transactions to capture arbitrage or liquidation profits, which significantly complicates the simple fee-sorting model.

The final output of the selection algorithm is an ordered list of transaction identifiers or raw transactions, forming the block's body. This list is then passed to the next step for execution and state commitment. The choice of algorithm directly impacts user experience (transaction confirmation time), network security (by disincentivizing spam), and the economic efficiency of the chain.

step-3-ordering

BLOCK ASSEMBLY

Step 3: Ordering Transactions Within the Block

After transactions are selected from the mempool, the block producer must order them to optimize for network efficiency and fee collection.

The order of transactions within a block is not arbitrary. Block producers, such as miners in Proof-of-Work or validators in Proof-of-Stake, sequence transactions to maximize their own revenue while adhering to protocol rules. This process is often called transaction ordering or block building. The primary goal is to include the set of transactions that yields the highest total fees, as block rewards are typically fixed. This creates a complex optimization problem, as transactions can have dependencies on each other's state changes.

A naive approach of simply ordering by highest fee-per-byte (gas price) can fail. Consider a DeFi arbitrage transaction that relies on a specific pool state created by a prior, lower-fee transaction. If the arbitrage tx is processed first, it will fail, wasting block space. Smart block builders use algorithms to construct a Directed Acyclic Graph (DAG) of transaction dependencies. They then topologically sort this graph to ensure all transactions execute successfully, while still prioritizing high-fee bundles. Builders on Ethereum and other EVM chains often use a greedy algorithm that starts with the highest fee transaction and incrementally adds compatible, high-fee dependent transactions.

In practice, this optimization is so critical that a specialized market for block building has emerged, especially post-EIP-1559. Entities known as block builders (like Flashbots' mev-boost relays) compete to create the most profitable block bundle for the proposing validator. They use sophisticated software that simulates transaction execution paths, bundles MEV (Maximal Extractable Value) opportunities like arbitrage and liquidations, and submits a complete, ordered block proposal. The validator then simply chooses the most profitable header.

For developers, understanding this order is crucial for transaction success. To increase the chance of your transaction being included in the optimal position, you can use techniques like fee bumping with tools like Ethereum's eth_maxPriorityFeePerGas or employ private transaction pools (e.g., via Flashbots Protect) to shield from front-running. The order also determines the state root for the block, as each transaction's execution modifies the global state sequentially.

Ultimately, the published block order is a solution to a real-time economic game. It balances the builder's profit motive with the network's need for valid state transitions. This step transforms a raw set of transactions into a deterministic, executable sequence that defines the new canonical state for all network participants.

step-4-construction

BLOCK ASSEMBLY

Step 4: Constructing the Block Header and Body

This step details the final assembly of a candidate block, combining the header metadata with the selected transactions into a single, immutable data structure.

The block header is an 80-byte data structure that cryptographically commits to the entire contents of the block. Its fields are derived from the work done in previous steps: the previous block hash links to the chain, the timestamp is set, the difficulty target is calculated, and the Merkle root is computed from the finalized transaction list in the block body. The nonce field is initialized to zero, ready for the Proof-of-Work mining process. This header acts as a unique fingerprint for the block's data.

The block body contains the actual payload: the ordered list of transactions selected for inclusion. In Bitcoin, this starts with the coinbase transaction, which creates new bitcoin and collects fees, followed by the prioritized transactions from the mempool. In Ethereum and other EVM chains, the body is more complex, containing the list of transactions and a list of ommers (uncle blocks). The body's serialization is protocol-specific but must be deterministic for all network participants.

The critical link between the header and body is the Merkle root (or Trie root in Ethereum). For Bitcoin, a binary Merkle tree is constructed from the transaction IDs. In Ethereum, three separate Patricia Merkle Tries are used for the state, transactions, and receipts. The root hash of the transactions trie is placed into the header's transactionsRoot field. This design allows nodes to verify that a transaction is included in a block by checking a small Merkle proof without downloading the entire chain.

Once assembled, the candidate block is broadcast to the network. Other nodes validate it by independently recalculating the Merkle roots, verifying all transactions against the current state, and checking the header's Proof-of-Work. A block is only added to the canonical chain once a majority of the network accepts it. This process of header-body construction and validation is the fundamental mechanism for achieving distributed consensus on a shared transaction history.

ARCHITECTURE

Comparison of Block Assembly Strategies

How different consensus and execution models determine the process of building and finalizing blocks.

Strategy	Sequential (Monolithic)	Parallel (Modular)	Optimistic (Rollup)	ZK (Rollup)
Execution Model	Single-threaded	Multi-threaded / Sharded	Single-threaded, off-chain	Single-threaded, off-chain
Block Producer	Validator	Proposer-Builder-Separation (PBS)	Sequencer	Sequencer
Finality Time	~12 sec (Ethereum)	~2 sec (Solana)	~12 sec + 7 day challenge	~12 sec + ~20 min proof
State Growth	Linear, on-chain	Exponential, sharded	Compressed, on-chain	Compressed, on-chain
Data Availability	On-chain	On-chain (shard data)	On-chain (calldata) or off-chain (DAC)	On-chain (calldata)
MEV Resistance	Low (public mempool)	Medium (PBS auctions)	Centralized sequencer risk	Centralized sequencer risk
Gas Fee Model	First-price auction	Priority fee + base fee	Batch-submitted to L1	Batch-submitted to L1 + proof cost
Primary Example	Ethereum PoS	Solana, Celestia	Arbitrum One, Optimism	zkSync Era, Starknet

code-walkthrough

BUILDING BLOCKS

Code Walkthrough: A Simplified Block Assembler

A practical guide to the core logic that selects and orders transactions for a new block.

A block assembler is the critical component within a node that constructs a candidate block for the network. Its primary job is to select pending transactions from the mempool, order them, and package them into a block that is valid, profitable, and ready for execution. This process involves evaluating transaction fees, dependencies, and resource limits like gas or compute units. In Proof-of-Work (PoW) systems, miners run assemblers to maximize fee revenue. In Proof-of-Stake (PooS) networks like Ethereum, validators perform this role. The assembler's output is the raw data that gets cryptographically signed and broadcast to peers.

Let's examine a simplified Python class that captures the assembler's core logic. We'll focus on a fee-based selection algorithm, which prioritizes transactions offering the highest fee per unit of gas (or similar resource). This model assumes a mempool of pending transactions and a block with a maximum gas limit.

python
class SimpleBlockAssembler:
    def __init__(self, gas_limit):
        self.gas_limit = gas_limit
        self.mempool = []  # List of (tx_hash, gas_used, fee)
    
    def add_to_mempool(self, tx):
        """Add a transaction to the pending pool."""
        self.mempool.append(tx)
    
    def assemble_block(self):
        """Select transactions for the next block."""
        # 1. Sort mempool by fee per gas (descending)
        sorted_txs = sorted(self.mempool, key=lambda x: x[2]/x[1], reverse=True)
        
        block_txs = []
        block_gas_used = 0
        
        # 2. Greedy selection loop
        for tx in sorted_txs:
            tx_hash, gas, fee = tx
            if block_gas_used + gas <= self.gas_limit:
                block_txs.append(tx_hash)
                block_gas_used += gas
            else:
                break  # Block is full
        
        return block_txs, block_gas_used

The assemble_block method implements a greedy algorithm. It first sorts all pending transactions by their effective gas price (fee/gas). It then iterates through this sorted list, adding transactions to the block as long as the cumulative gas does not exceed the gas_limit. This simple approach maximizes the total fee revenue for the block producer under the given constraints. However, real-world assemblers must handle far greater complexity: transaction dependencies (a later transaction may require an earlier one to succeed), state access lists to avoid conflicts, and MEV (Maximal Extractable Value) opportunities through sophisticated bundling and ordering strategies known as MEV-Boost on Ethereum.

In production systems like Geth (Ethereum) or Core Bitcoin, the assembler is tightly integrated with the node's transaction pool and execution engine. For instance, it must validate that each selected transaction executes successfully against the current state before finalizing the block. Advanced assemblers also implement replace-by-fee logic and manage orphaned transactions. The shift towards Proposer-Builder Separation (PBS), formalized in Ethereum's roadmap, outsources complex block construction to specialized builders, while validators simply choose the most profitable pre-built block. This walkthrough provides the foundational concept upon which these advanced, high-stakes systems are built.

resource-links

DEVELOPER REFERENCES

Resources and Further Reading

Primary documentation, research papers, and tooling used by client developers and infrastructure teams to understand how blocks are assembled, ordered, and proposed in production networks.

Ethereum Block Structure and Validation Rules

The Ethereum Yellow Paper and execution-layer specifications define how blocks are structured and validated at the protocol level. These resources explain exactly what must be included for a block to be considered valid by clients.

Key details covered:

Block header fields like parentHash, stateRoot, receiptsRoot, and gasLimit
Transaction ordering and gas accounting enforced during execution
State transition rules applied by every full node

Developers working on clients, indexers, or custom execution environments should read these specs to understand why blocks can be rejected even when transactions are individually valid.

Proposer-Builder Separation (PBS) and mev-boost

Proposer-Builder Separation changes how blocks are assembled on Ethereum post-merge. Instead of validators building blocks locally, specialized builders construct blocks and submit them to proposers via relays.

mev-boost is the reference implementation used on mainnet. Studying it helps explain real-world block assembly flow:

Builders assemble transaction bundles and MEV opportunities
Relays validate blocks and handle bid selection
Proposers choose the highest-paying valid block

Understanding PBS is essential for:

Validator operators configuring mev-boost
Researchers analyzing MEV extraction
Developers building alternative builder pipelines

The documentation and codebase illustrate message formats, timing constraints, and failure cases that directly impact block production.

EXPLORE

Geth: Transaction Pool and Block Building Logic

Geth remains the most widely used Ethereum execution client, and its codebase shows how block assembly works in practice at the client level.

Relevant components include:

TxPool logic for prioritizing pending transactions
Miner/Builder logic for selecting transactions based on gas price and constraints
State simulation used to ensure transactions execute successfully within block limits

Reading the Geth implementation helps developers understand:

Why some pending transactions are skipped
How gas limits and EIP-1559 base fee affect inclusion
How local block builders differ from PBS-based flows

This is particularly useful for teams building custom RPCs, private order flow systems, or modified execution clients.

EXPLORE

Consensus Layer Perspective: Slot Timing and Proposals

Block assembly does not happen in isolation. The consensus layer enforces strict slot timing that constrains how long builders and proposers have to act.

Consensus-layer documentation explains:

12-second slots and how proposal deadlines are enforced
How missed slots occur when blocks arrive late
The interaction between execution payloads and beacon blocks

Understanding this timing model is critical when:

Debugging missed or orphaned blocks
Designing builder or relay infrastructure
Analyzing proposer rewards and penalties

These specs bridge the gap between execution-layer block contents and consensus-layer finalization, clarifying why perfectly valid blocks can still fail to land on chain.

EXPLORE

DEVELOPER TROUBLESHOOTING

Frequently Asked Questions on Block Assembly

Common technical questions and solutions for developers working with block assembly, mempools, and transaction ordering.

Block assembly is the process where a block proposer (e.g., a validator or miner) selects and orders transactions from the mempool into a candidate block. Block production is the broader term that includes assembly plus the subsequent cryptographic work (Proof-of-Work or Proof-of-Stake signature) to finalize the block.

In practice, assembly involves:

Transaction Selection: Choosing which pending transactions to include based on fee priority, smart contract dependencies, and block gas/size limits.
Ordering: Determining the sequence of transactions, which can affect state changes and MEV opportunities.
Block Construction: Building the raw block data structure (header and body) before it's signed and broadcast.

Tools like the eth_getBlockByNumber RPC call return the final assembled block.

conclusion

PRACTICAL SUMMARY

Conclusion and Next Steps

This guide has walked through the critical stages of block assembly, from transaction selection to final block proposal. Here's a summary of the key takeaways and resources for further exploration.

Understanding block assembly is fundamental for developers building on or researching blockchain systems. The process is a complex interplay of consensus rules, economic incentives, and network performance. Key practical takeaways include: the mempool is the source of truth for pending transactions; priority ordering (often by fee) directly impacts user experience and miner/validator revenue; and the final block header, containing the Merkle root and previous block hash, cryptographically seals the entire assembly.

For those looking to implement or analyze this process, several resources are invaluable. Reviewing the core code is essential: for Ethereum, study the blockchain and miner packages in the Go-Ethereum (Geth) repository; for Solana, examine the bank and leader_schedule modules. Tools like Ethereum's block explorer or Solana's Explorer allow you to inspect real-time mempool data and dissect proposed blocks, providing concrete examples of the theory in action.

To deepen your expertise, consider these next steps. First, simulate a local network using frameworks like Ganache (Ethereum) or local Solana test-validator to observe assembly without cost. Second, analyze historical data to see how transaction fee markets evolve during network congestion. Finally, explore advanced topics like MEV (Maximal Extractable Value) strategies such as frontrunning and arbitrage, which represent sophisticated, often adversarial approaches to influencing transaction ordering within the block assembly process.