Setting Up a Rollup Monitoring and Analytics Dashboard

A comprehensive guide to building a real-time dashboard for tracking the health, performance, and security of your rollup.

Rollups are complex systems where sequencers, provers, and data availability layers must operate in sync. Without proper observability, issues like transaction backlogs, proof generation delays, or data availability failures can go unnoticed, leading to degraded performance or security risks. A dedicated monitoring dashboard provides a single pane of glass to track these critical metrics, enabling proactive maintenance and informed decision-making. This guide walks through building a dashboard using open-source tools like Prometheus, Grafana, and custom data collectors.
The foundation of any monitoring system is metrics collection. For a rollup, you need to gather data from multiple sources: the sequencer node (e.g., transaction throughput, pending queue size, gas usage), the prover (proof generation time, success rate, hardware utilization), and the underlying L1 (data posting costs, confirmation times, bridge state). Custom Prometheus exporters can be written to scrape this data from node APIs, RPC endpoints, and smart contracts. For example, you might track rollup_blocks_produced_total or da_batch_submission_cost_eth.
Once metrics are collected, you need a visualization layer. Grafana is the industry standard for this, allowing you to create dashboards with graphs, gauges, and alerts. You can build panels to visualize real-time TPS, average transaction latency over time, the cost of posting data to Ethereum, and the status of the bridge's withdrawal queue. Setting meaningful alert rules in Prometheus or Grafana is crucial; you should be notified if proof generation fails consecutively or if the sequencer's mempool exceeds a dangerous threshold.
For deeper analytics beyond basic metrics, consider integrating a specialized indexer or building custom queries. You might want to analyze user growth by tracking unique active addresses, monitor the economic security by calculating the total value locked (TVL) in bridges, or audit sequencer censorship resistance. Services like The Graph for subgraphs or direct queries to an indexed database (e.g., using Dune Analytics or a self-hosted Postgres instance) can power these insights. This analytical layer transforms raw data into actionable business intelligence.
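As an illustration of that analytical layer, the sketch below computes daily active addresses from a self-hosted Postgres index. The transactions table and its from_address and block_timestamp columns are assumptions about your indexer's schema, named here only for illustration.

```javascript
// Sketch: daily active addresses from a self-hosted Postgres index.
// The `transactions` table and its columns are assumptions about your indexer's schema.
const { Client } = require('pg');

async function dailyActiveAddresses(days = 30) {
  const db = new Client({ connectionString: process.env.DATABASE_URL });
  await db.connect();
  const { rows } = await db.query(
    `SELECT date_trunc('day', block_timestamp) AS day,
            COUNT(DISTINCT from_address)       AS active_addresses
       FROM transactions
      WHERE block_timestamp > now() - make_interval(days => $1)
      GROUP BY 1
      ORDER BY 1`,
    [days]
  );
  await db.end();
  return rows;
}
```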
Finally, the dashboard must be actionable. Integrate alert notifications to Slack, PagerDuty, or Telegram so your team can respond immediately to incidents. Use Grafana annotations to mark deployments or major network events, creating a timeline for post-mortem analysis. Regularly review and update your dashboard as the rollup evolves—new contracts, upgrade mechanisms, and fraud proof systems will introduce new critical metrics to monitor. A well-maintained dashboard is not a static project but a core component of your rollup's operational infrastructure.
Prerequisites
Before building a rollup monitoring dashboard, you need the right tools, infrastructure, and data sources. This section covers the essential components to have in place.
A functional rollup monitoring system requires a solid foundation. You'll need access to the rollup's execution client (e.g., an OP Stack node, Arbitrum Nitro node, or zkSync Era server) and its connected L1 data availability layer (like Ethereum). This provides the raw transaction data, block headers, and state roots. Simultaneously, you must configure access to the rollup's sequencer RPC endpoint, which is the primary gateway for submitting transactions and querying the latest state. For historical analysis, you'll also need access to an indexed archive node or a service like The Graph to efficiently query past events and transactions.
The core of your dashboard is the data pipeline. You will need to set up a process to extract, transform, and load (ETL) data from the sources mentioned above. This typically involves writing scripts or using a framework to listen for new blocks and events, parse the data into a structured format (like converting hex values to decimals), and load it into a time-series database. Popular choices for this backend include PostgreSQL with TimescaleDB, InfluxDB, or ClickHouse. These databases are optimized for storing and querying sequential metric data, which is essential for tracking trends like TPS, gas fees, and active addresses over time.
Your development environment must include the necessary libraries and SDKs. For most EVM-compatible rollups, you will need Node.js (v18+) or Python (3.9+) with the Ethers.js or Web3.py library to interact with RPC endpoints and smart contracts. To build the dashboard frontend, a framework like React or Next.js is common, paired with a charting library such as Recharts, Chart.js, or Apache ECharts for visualization. You should also be familiar with the rollup's specific bridge contracts and precompiles, as monitoring cross-chain message passing and proving activity is a critical function.
Finally, establish your alerting and logging infrastructure. Integrate a service like Prometheus to scrape metrics from your application and Grafana to build the visual dashboard and set up alerts. For logging errors and tracking application health, configure structured logging with a service like Loki or an APM tool. Ensure you have the rollup's contract addresses (e.g., L1 and L2 bridge addresses, sequencer inbox) and RPC URLs documented and accessible to your scripts. With these components ready, you can proceed to implement the specific data collectors and visualizations for your dashboard.
Key Monitoring Concepts
Essential metrics, tools, and frameworks for building a comprehensive rollup observability stack.
A production-grade rollup requires comprehensive observability to track performance, security, and user activity. This guide details the architecture for a real-time monitoring dashboard.
A robust rollup monitoring system aggregates data from multiple sources. The core components are: a data ingestion layer pulling from the rollup's sequencer, L1 settlement contracts, and RPC nodes; a time-series database like Prometheus or TimescaleDB for metrics storage; and a visualization layer such as Grafana. Key metrics to capture include transaction throughput (TPS), batch submission latency to L1, gas costs, and sequencer health status. For example, an Optimism rollup would monitor ovm_sequencer_tx_count and gas_used for each L1 batch.
Instrumenting your rollup node is the first implementation step. For a custom rollup client, you must expose metrics via an HTTP endpoint, typically /metrics. Using a library like the Prometheus client for Go (client_golang) or prom-client for Node.js, you can instrument key functions. Track counters for transactions processed, gauges for mempool size, and histograms for block production time. Existing stacks such as Arbitrum Nitro already expose Prometheus metrics once metrics are enabled in the node configuration. You then configure Prometheus to scrape these targets at a defined interval, such as every 15 seconds.
The dashboard must visualize both chain state and system health. Create Grafana panels for: Sequencer Performance (live TPS, pending queue), L1 Settlement (batch submission frequency, confirmation time, gas spend), and RPC Service (request latency, error rates). Implement alerts for critical failures, like sequencer downtime or a spike in failed transactions. Use subgraphs from The Graph to index and query complex event data, such as daily active addresses or popular smart contract interactions, enriching your analytics beyond basic node metrics.
For advanced analysis, integrate a dedicated analytics database. Stream raw transaction data and event logs to ClickHouse or Apache Pinot using a service like Vector or Fluentd. This enables complex SQL queries for business intelligence, like calculating user retention cohorts or identifying the most gas-intensive contract calls. Architecturally, this forms a second data pipeline separate from the real-time operational metrics, ensuring analytical queries don't impact monitoring system performance.
Security and access control are critical for a production dashboard. Secure Grafana and Prometheus endpoints behind authentication. Use Grafana's built-in roles or integrate with OAuth2 providers. For team access, consider exposing the dashboard via a secure tunnel like Cloudflare Tunnel instead of public IPs. Regularly back up your Grafana dashboards as JSON files to version control. This setup provides the single pane of glass needed to maintain 99.9% uptime and make data-driven decisions for rollup optimization.
Step 1: Build the Data Collector Service
The data collector is the foundational backend service that fetches, processes, and stores raw blockchain data from your rollup for analysis.
A robust data collector service is the backbone of any monitoring dashboard. Its primary function is to ingest raw data from your rollup's RPC endpoints and sequencer, then structure it into a queryable format. For an OP Stack rollup like Base or Optimism, you would typically poll the eth_getBlockByNumber and eth_getLogs JSON-RPC methods. This service runs on a schedule (e.g., every 12 seconds) to capture new blocks, transactions, and contract events, ensuring your analytics reflect near real-time chain activity.
You need to architect the service for reliability and idempotency. This means implementing robust error handling for RPC timeouts, managing chain reorganizations (reorgs), and ensuring no data is duplicated if the service restarts. A common pattern is to track the last processed block height in a database like PostgreSQL. The service logic should: 1) Poll for the latest block, 2) Fetch all transactions and logs for new blocks, 3) Parse and normalize the data, and 4) Commit it to your datastore in a single atomic transaction to maintain consistency.
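A minimal sketch of that loop, assuming a hypothetical sync_status checkpoint table and a hypothetical processBlock helper that does the actual fetching and inserting:

```javascript
// Sketch of the collector loop. `sync_status` and `processBlock` are placeholders
// for your own checkpoint table and per-block ingestion logic.
const { ethers } = require('ethers');
const { Client } = require('pg');

const provider = new ethers.JsonRpcProvider(process.env.ROLLUP_RPC_URL);
const db = new Client({ connectionString: process.env.DATABASE_URL });

async function runCollector() {
  await db.connect();
  while (true) {
    const head = await provider.getBlockNumber();
    const { rows } = await db.query('SELECT last_block FROM sync_status LIMIT 1');
    let next = rows.length ? Number(rows[0].last_block) + 1 : head;

    for (; next <= head; next++) {
      await db.query('BEGIN');
      try {
        await processBlock(next);                    // fetch + insert block, txs, logs
        await db.query('UPDATE sync_status SET last_block = $1', [next]);
        await db.query('COMMIT');                    // atomic: data + checkpoint together
      } catch (err) {
        await db.query('ROLLBACK');
        console.error(`block ${next} failed, will retry`, err);
        break;                                       // retry on the next poll instead of skipping
      }
    }
    await new Promise((r) => setTimeout(r, 12_000)); // poll roughly once per L1 slot
  }
}
```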
For processing, you'll extract key metrics from the raw data. Essential datasets to collect include:
- Transaction Volumes: Count and aggregate value per block.
- Gas Usage: Track gasUsed to monitor network congestion and fee trends.
- Contract Interactions: Decode event logs (e.g., Transfer events for ERC-20 tokens) to track token flows and popular dApps.
- Sequencer Metrics: Monitor batch submission transactions to L1 for latency and cost analysis.

Structuring this data into relational tables (e.g., blocks, transactions, events) is crucial for efficient querying in later steps.
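A minimal Postgres schema sketch for those tables is shown below; column names are illustrative and should be adapted to the data you actually collect.

```javascript
// Sketch: minimal schema for the collector (column names are illustrative).
const { Client } = require('pg');

async function migrate() {
  const db = new Client({ connectionString: process.env.DATABASE_URL });
  await db.connect();
  await db.query(`
    CREATE TABLE IF NOT EXISTS blocks (
      number       BIGINT PRIMARY KEY,
      hash         TEXT NOT NULL,
      parent_hash  TEXT,
      timestamp    BIGINT NOT NULL,      -- unix seconds, as returned by the RPC
      tx_count     INTEGER NOT NULL
    );
    CREATE TABLE IF NOT EXISTS transactions (
      hash          TEXT PRIMARY KEY,
      block_number  BIGINT REFERENCES blocks(number),
      from_address  TEXT,
      to_address    TEXT,
      value_wei     NUMERIC,
      gas_used      BIGINT
    );
    CREATE TABLE IF NOT EXISTS events (
      tx_hash    TEXT REFERENCES transactions(hash),
      log_index  INTEGER,
      address    TEXT,
      topic0     TEXT,
      data       TEXT,
      PRIMARY KEY (tx_hash, log_index)
    );
  `);
  await db.end();
}
```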
Here is a simplified code snippet in Node.js using Ethers.js to fetch and store block data. This example assumes you have a PostgreSQL client (pg) and a blocks table set up.
```javascript
const { ethers } = require('ethers');
const { Client } = require('pg');

const rpcUrl = 'https://your-rollup-rpc-url.com';
const provider = new ethers.JsonRpcProvider(rpcUrl);
const dbClient = new Client({ connectionString: process.env.DATABASE_URL });

async function collectBlockData(blockNumber) {
  const block = await provider.getBlock(blockNumber);

  // In ethers v6, block.transactions is an array of transaction hashes.
  const txs = await Promise.all(
    block.transactions.map((txHash) => provider.getTransaction(txHash))
  );

  // Insert into database (dbClient.connect() must have been called beforehand).
  await dbClient.query(
    'INSERT INTO blocks(number, hash, timestamp, tx_count) VALUES($1, $2, $3, $4) ON CONFLICT DO NOTHING',
    [block.number, block.hash, block.timestamp, txs.length]
  );
  console.log(`Collected block #${block.number}`);
}
```
This basic collector must be extended to handle reorgs by checking parent hashes and to process transaction receipts for logs.
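A sketch of that reorg check, reusing the dbClient and blocks table from the snippet above; the rewind strategy shown is one possible approach, not the only one.

```javascript
// Sketch: before storing block N, verify its parentHash matches the stored hash of N-1.
async function detectReorg(block) {
  const { rows } = await dbClient.query(
    'SELECT hash FROM blocks WHERE number = $1',
    [block.number - 1]
  );
  if (rows.length > 0 && rows[0].hash !== block.parentHash) {
    // A reorg happened: drop the stale branch so the collector re-fetches it.
    await dbClient.query('DELETE FROM blocks WHERE number >= $1', [block.number - 1]);
    return true; // caller should rewind its cursor to block.number - 1
  }
  return false;
}
```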
Finally, consider scaling and maintenance. As transaction volume grows, batch inserts and connection pooling become essential. For production, containerize the service with Docker and orchestrate it using Kubernetes or a process manager like PM2. Implement comprehensive logging (e.g., with Winston or Pino) and alerting for failed data-fetching cycles. The output of this service—a clean, timestamped dataset—is what powers all subsequent analytical models and dashboard visualizations.
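For reference, a docker-compose sketch that wires the collector to Postgres, Prometheus, and Grafana might look like the following; image tags, service names, and credentials are placeholders.

```yaml
# docker-compose.yml — example stack for the collector and monitoring backends
services:
  collector:
    build: .
    environment:
      DATABASE_URL: postgres://rollup:rollup@db:5432/rollup
      ROLLUP_RPC_URL: https://your-rollup-rpc-url.com
    depends_on: [db]
    restart: unless-stopped

  db:
    image: postgres:16
    environment:
      POSTGRES_USER: rollup
      POSTGRES_PASSWORD: rollup
      POSTGRES_DB: rollup
    volumes:
      - pgdata:/var/lib/postgresql/data

  prometheus:
    image: prom/prometheus:latest
    volumes:
      - ./prometheus.yml:/etc/prometheus/prometheus.yml

  grafana:
    image: grafana/grafana:latest
    ports:
      - "3000:3000"

volumes:
  pgdata:
```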
Step 2: Expose Metrics to Prometheus
Configure your rollup node to export structured performance and operational data for collection.
Prometheus operates on a pull-based model, meaning it periodically scrapes HTTP endpoints for metrics. Your rollup node must expose these metrics in Prometheus's specific text-based exposition format. For most blockchain clients built in Go, Rust, or Node.js, this is achieved by integrating a dedicated metrics library. The prom-client for Node.js, prometheus crate for Rust, and client_golang for Go are the standard choices. These libraries handle the formatting and provide a /metrics HTTP endpoint that serves the current snapshot of all registered metrics.
You must instrument your application code to track the data you care about. Common rollup-specific metrics include: rollup_blocks_proposed_total, rollup_transactions_processed, sequencer_batch_size_bytes, l1_submission_duration_seconds, and state_root_calculation_time. Each metric should be labeled with dimensions like chain_id or batch_type for granular analysis. For example, tracking l1_gas_used per transaction batch helps optimize cost efficiency. Avoid exposing sensitive information like private keys or raw transaction data in labels.
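A minimal instrumentation sketch with prom-client, using two of the metric names above; the chain_id value and the update call sites are illustrative.

```javascript
// Sketch: instrumenting a Node.js rollup service with prom-client.
const client = require('prom-client');
const http = require('http');

const register = new client.Registry();
client.collectDefaultMetrics({ register });

const blocksProposed = new client.Counter({
  name: 'rollup_blocks_proposed_total',
  help: 'Total L2 blocks proposed by this node',
  labelNames: ['chain_id'],
  registers: [register],
});

const l1SubmissionDuration = new client.Histogram({
  name: 'l1_submission_duration_seconds',
  help: 'Time taken to submit a batch to L1',
  labelNames: ['batch_type'],
  buckets: [1, 5, 15, 30, 60, 120],
  registers: [register],
});

// Serve the /metrics endpoint on a dedicated port.
http.createServer(async (req, res) => {
  if (req.url === '/metrics') {
    res.setHeader('Content-Type', register.contentType);
    res.end(await register.metrics());
  } else {
    res.statusCode = 404;
    res.end();
  }
}).listen(process.env.METRICS_PORT || 9090);

// Elsewhere in the application (example values):
// blocksProposed.inc({ chain_id: '42069' });
// l1SubmissionDuration.observe({ batch_type: 'calldata' }, 12.4);
```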
The metrics endpoint must be served on a dedicated port, separate from the node's main RPC API, for security and isolation. Configure your node's startup command or configuration file to enable metrics and define the bind address (e.g., --metrics, --metrics.addr 0.0.0.0, --metrics.port 6060). Use environment variables for port configuration to maintain flexibility across deployment environments (development, staging, production). Ensure this port is accessible from your Prometheus server's network.
Finally, verify the setup by querying the endpoint directly. Run your node and navigate to http://<your-node-ip>:<metrics-port>/metrics in a browser or use curl. You should see a plain-text response beginning with # HELP and # TYPE directives, followed by metric lines like rollup_blocks_proposed_total 142. This confirms your node is correctly exposing data. The next step is to configure Prometheus to scrape this target.
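On the Prometheus side, the scrape configuration for that target might look like the following sketch; job names, targets, and labels are examples.

```yaml
# prometheus.yml — scrape the rollup node's metrics endpoint (targets are examples)
global:
  scrape_interval: 15s

scrape_configs:
  - job_name: sequencer
    static_configs:
      - targets: ['rollup-node:6060']
        labels:
          chain_id: '42069'
  - job_name: collector
    static_configs:
      - targets: ['collector:9090']
```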
Step 3: Create the Grafana Dashboard
Import and configure a pre-built dashboard to visualize your rollup's real-time health and performance metrics.
With Prometheus scraping your node's metrics, you now need a visualization layer. Grafana dashboards transform raw time-series data into actionable charts and alerts. For this guide, we'll import the OP Stack Node Dashboard (Dashboard ID 18602), a community-maintained template designed specifically for OP Stack-based rollups. This dashboard provides a comprehensive view of key performance indicators (KPIs) like block production, transaction throughput, and system resource usage.
To import the dashboard, log into your Grafana instance (typically at http://localhost:3000). In the left sidebar, navigate to Dashboards > New > Import. In the Import via grafana.com field, enter the dashboard ID 18602 and click Load. You will be prompted to select a Prometheus data source; choose the prometheus source you configured in the previous step. Finally, click Import to create the dashboard.
The imported dashboard is organized into logical panels. Key sections to monitor include:
- Chain Data: Tracks l2geth block height, transaction count per block, and gas usage.
- System Metrics: Monitors CPU, memory, disk I/O, and network usage of your node's host machine.
- Batch Submitter/Proposer: Shows the health of the components that submit data and proofs to L1 (critical for sequencer operation).
- Database Performance: Graphs LevelDB read/write latencies, which can become a bottleneck.
You should customize the dashboard for your specific deployment. Edit the dashboard and modify panel queries to match your node's job labels (e.g., job="op-node"). Set up alert rules directly in Grafana or in Prometheus to notify you of critical issues, such as the block height stalling or system memory exceeding 90%. For production, configure persistent storage for Grafana and set up authentication.
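For production setups that keep dashboards as JSON in version control, Grafana's file-based provisioning can load them from disk automatically; a minimal provider definition might look like this sketch (paths are examples).

```yaml
# /etc/grafana/provisioning/dashboards/rollup.yaml — example provisioning config
apiVersion: 1

providers:
  - name: rollup-dashboards
    type: file
    disableDeletion: false
    updateIntervalSeconds: 30
    options:
      path: /var/lib/grafana/dashboards
```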
This dashboard provides the foundational observability needed to operate a rollup node reliably. By correlating chain activity with system resource graphs, you can diagnose performance bottlenecks, verify that batches are being submitted on schedule, and ensure overall network health. The next step is to configure alerting rules so that deviations from these baselines trigger notifications automatically.
Step 4: Configure Alerting Rules
Define conditions that trigger notifications when your rollup's health or performance deviates from expected baselines.
Alerting rules transform raw metrics into actionable intelligence. They are conditional statements evaluated at regular intervals by your monitoring system (like Prometheus with the Alertmanager). When a rule's condition is true for a sustained period, it fires an alert, which is then routed to configured channels like Slack, PagerDuty, or email. Effective rules act as an early warning system for issues like sequencer downtime, transaction backlogs, or liquidity depletion before they impact end-users.
Key Alert Categories for Rollups
You should configure rules across several critical dimensions:
- Sequencer Health: Alerts for sequencer process downtime, high error rates on RPC endpoints, or failure to submit batches to L1.
- Transaction Pipeline: Alerts for a growing mempool (pending_transactions), a sudden drop in transactions per second (TPS), or spikes in the failed transaction ratio.
- L1 Settlement: Alerts for missed batch submission deadlines, abnormally high L1 gas costs per batch, or failures in state root updates.
- Financial & State: Alerts for bridge contract balance thresholds, validator/staker slashing events, or a stalled state root finalization.
Here is an example Prometheus alerting rule for a sequencer heartbeat, written in YAML format for a prometheus-rules.yaml file:
```yaml
groups:
  - name: rollup_sequencer
    rules:
      - alert: SequencerDown
        expr: up{job="sequencer"} == 0
        for: 1m
        labels:
          severity: critical
        annotations:
          summary: "Sequencer instance {{ $labels.instance }} is down."
          description: "The sequencer has been unreachable for over 1 minute."
```
This rule uses the up metric, which is 1 for a healthy target and 0 when down. The for: 1m clause creates a waiting period to prevent false alarms from transient network blips.
Configure alert severity levels (e.g., warning, critical) and routing appropriately. A critical sequencer-down alert might page an on-call engineer, while a warning for elevated gas costs might only go to a Slack channel. Use annotations to include diagnostic information in notifications, such as the affected instance, current metric value, and a link to the relevant dashboard. Always test your rules by intentionally triggering the failure condition in a staging environment to validate the entire pipeline—from detection to notification.
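An Alertmanager routing sketch for that severity split; receiver names, the webhook URL, and the routing key are placeholders.

```yaml
# alertmanager.yml — route critical alerts to PagerDuty, warnings to Slack (examples)
route:
  receiver: slack-warnings
  group_by: ['alertname', 'instance']
  routes:
    - matchers:
        - severity="critical"
      receiver: pagerduty-oncall

receivers:
  - name: slack-warnings
    slack_configs:
      - api_url: https://hooks.slack.com/services/REPLACE_ME
        channel: '#rollup-alerts'
  - name: pagerduty-oncall
    pagerduty_configs:
      - routing_key: REPLACE_ME
```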
Avoid alert fatigue by tuning thresholds carefully. Start with conservative values and adjust based on historical data. Implement alert grouping and silencing in Alertmanager to prevent notification storms during a widespread incident. Finally, document each alert's purpose, expected response, and escalation path in a runbook. This ensures anyone on call understands the urgency and procedure when an alert fires.
Essential Rollup Monitoring Metrics
Critical metrics to track for operational health, security, and performance of a rollup.
| Metric Category | Specific Metric | Target / Healthy Range | Why It Matters |
|---|---|---|---|
| Sequencer Health | Block Production Interval | < 2 seconds | Indicates sequencer liveness and network stability. |
| Sequencer Health | Pending Queue Size | < 100 transactions | Shows transaction backlog; high values cause delays. |
| Data Availability | L1 Data Posting Latency | < Ethereum block time (12s) | Measures delay in publishing proofs, affecting finality. |
| Data Availability | Calldata Cost per Batch | $50 - $200 (varies with L1 gas) | Primary operational cost driver for the rollup. |
| State & Synchronization | Node Sync Time (Full) | < 1 hour | Time for a new node to sync, indicating chain efficiency. |
| State & Synchronization | State Growth Rate | Track weekly trend | Unbounded growth impacts node hardware requirements. |
| Financials & Fees | Avg Transaction Fee (USD) | $0.10 - $1.00 | Direct user cost and network congestion indicator. |
| Financials & Fees | Sequencer Profit Margin | | Sustainability metric for rollup operation. |
| Security & Decentralization | Active Proposers / Challengers | | Measures the health of the fraud/validity proof system. |
| Security & Decentralization | Time to Finality (L1 confirmation) | ~30 minutes to 1 week | Time for a withdrawal to be considered fully secure. |
Troubleshooting Common Issues
Common problems encountered when building or using rollup monitoring dashboards, with solutions for developers.
A blank dashboard is often caused by incorrect RPC endpoint configuration or a stalled data ingestion pipeline.
Check these points first:
- RPC Endpoint Health: Verify your L1 and L2 RPC URLs are correct and responsive. Test the endpoint with curl: curl -X POST -H "Content-Type: application/json" --data '{"jsonrpc":"2.0","method":"eth_blockNumber","params":[],"id":1}' YOUR_RPC_URL
- Indexer Status: If using The Graph, check the subgraph's syncing status in the hosted service dashboard or your Graph Node logs for errors.
- Time Range Filter: Ensure your dashboard's default time filter isn't set to a future date or a period before the rollup launched.
- Data Source Permissions: For cloud-based solutions like Google BigQuery or AWS, confirm the service account has the necessary read permissions on the data tables.
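To distinguish a stalled ingestion pipeline from an RPC problem, a quick script can compare the chain head against the newest indexed block; this sketch assumes the blocks table from Step 1.

```javascript
// Sketch: compare the RPC head with the newest indexed block to spot a stalled pipeline.
const { ethers } = require('ethers');
const { Client } = require('pg');

async function checkIngestionLag() {
  const provider = new ethers.JsonRpcProvider(process.env.ROLLUP_RPC_URL);
  const db = new Client({ connectionString: process.env.DATABASE_URL });
  await db.connect();

  const head = await provider.getBlockNumber();
  const { rows } = await db.query('SELECT MAX(number) AS last FROM blocks');
  const lastIndexed = Number(rows[0].last ?? 0);

  console.log(`RPC head: ${head}, last indexed: ${lastIndexed}, lag: ${head - lastIndexed}`);
  await db.end();
}

checkIngestionLag().catch(console.error);
```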
Tools and Resources
These tools are commonly used to build a production-grade rollup monitoring and analytics dashboard, each covering a concrete component: metrics collection, tracing, chain-level observability, and transaction-level debugging.
Frequently Asked Questions
Common questions and solutions for developers building and maintaining rollup monitoring dashboards.
Which metrics matter most for a rollup? Focus on sequencer health, data availability, and cost efficiency. Key metrics include:
- Sequencer Metrics: Block production rate, transaction inclusion latency, and uptime.
- L1-L2 Bridge: Deposit/withdrawal confirmation times, bridge transaction success rate, and gas costs.
- Data Availability: Calldata posted to L1 per batch, data availability layer latency (e.g., Celestia, EigenDA).
- Network State: Transactions per second (TPS), active addresses, and total value locked (TVL).
- Costs: Average transaction fee in USD and gas, and cost per batch posted to L1.
Track these using tools like Prometheus for collection and Grafana for visualization to identify bottlenecks.