How to Design a Digital Asset Custody Infrastructure

introduction

ARCHITECTURE GUIDE

How to Design a Custody Infrastructure for Digital Assets

A technical guide to designing secure, scalable, and compliant custody systems for institutions managing blockchain-based assets.

Digital asset custody infrastructure is the secure storage and management system for private keys that control on-chain assets. Unlike traditional finance where custody involves holding securities, blockchain custody is about key management. The core challenge is balancing security (protecting keys from theft or loss), accessibility (enabling legitimate transactions), and compliance (adhering to regulations like the SEC's Rule 206(4)-2 for investment advisers). A well-designed system must account for threat models including external hackers, insider threats, and operational errors.

The foundation of any custody design is the key storage architecture. Options range from simple, non-custodial hot wallets (software-connected) and cold storage (air-gapped hardware) to complex multi-party computation (MPC) and hardware security module (HSM) clusters. For institutions, a tiered model is standard: a deeply cold, offline vault for the majority of assets (the deep cold tier), a warmer, MPC-based layer for operational funds, and a hot wallet with strict limits for immediate liquidity. Each tier has progressively stricter controls; moving funds between them requires multi-signature (multisig) approval from authorized personnel.

Transaction authorization is governed by policy engines. These are programmable rulesets that define who can approve transactions, under what conditions, and for what amounts. For example, a policy might state: "Transfers > $1M require 3-of-5 approvals from the CFO, COO, and designated security officers." Modern custody platforms implement these policies using smart contracts on chains like Ethereum (via Safe{Wallet}) or through proprietary off-chain signing services. The policy engine must integrate with identity and access management (IAM) systems to authenticate users and enforce role-based permissions.

Operational security requires robust audit logging and monitoring. Every key generation, access attempt, transaction proposal, and approval must be immutably logged. These logs feed into real-time alerting systems for anomalous activity, such as a withdrawal request from an unusual IP address or an attempt to change authorization policies. Furthermore, the system must enable proof of reserves—cryptographically verifiable attestations that the custodian holds the assets it claims, without exposing private keys. This is often achieved through signed messages from custody addresses or using zero-knowledge proofs.

Finally, design must consider disaster recovery and inheritance. A key sharding scheme, like Shamir's Secret Sharing (SSS), can split a master recovery key among multiple trusted entities or geographic locations, ensuring the wallet can be reconstructed if primary hardware fails. Legal documents must clearly define succession protocols for accessing assets in unforeseen circumstances. The entire infrastructure should be regularly tested through penetration testing and red team exercises to identify vulnerabilities before attackers do.

prerequisites

FOUNDATION

Prerequisites and Core Assumptions

Before designing a custody system, you must establish the core principles and technical environment that will define its security and operational model.

Designing a digital asset custody infrastructure is a security-first engineering challenge. It begins with a clear definition of your threat model and trust assumptions. You must decide if you are building a self-custody solution for users, a custodial service for clients, or a hybrid model like multi-party computation (MPC). The core assumption is that private keys are the ultimate source of truth; their generation, storage, and usage define the system's entire security posture. This requires a deep understanding of cryptographic primitives like Elliptic Curve Digital Signature Algorithm (ECDSA) for Bitcoin/ Ethereum or EdDSA for networks like Solana.

A robust custody design assumes the inevitability of component failure and malicious actors. Therefore, the architecture must implement defense in depth. This involves layering security controls: physical security for hardware, network isolation (air-gapped systems), strict access controls, and comprehensive audit logging. You must also plan for key lifecycle management: secure generation (using Hardware Security Modules or HSMs), distribution (via sharding in MPC or Shamir's Secret Sharing), rotation, and eventual destruction. The system should never expose a complete private key in plaintext during any operation.

The technical stack forms another critical prerequisite. You will need to integrate with blockchain nodes or reliable RPC providers (e.g., Alchemy, Infura, QuickNode) for reading chain state and broadcasting transactions. For signing, you'll evaluate HSMs (e.g., from AWS CloudHSM, Thales, or Utimaco), Trusted Execution Environments (TEEs) like Intel SGX, or MPC libraries (e.g., ZenGo's tss-lib, Fireblocks' MPC-CMP). Your design must account for transaction construction, fee estimation, and handling various transaction types (simple transfers, smart contract interactions, staking operations).

Finally, establish clear operational and compliance guardrails. Define who can authorize transactions (policy engines), how they are authorized (multi-signature schemes with m-of-n thresholds), and the process for emergency scenarios (key recovery). Assume regulatory requirements like Travel Rule compliance and proof of reserves will be necessary. The system must produce an immutable audit trail for all actions, reconciling on-chain activity with internal logs. Starting with these prerequisites ensures the resulting infrastructure is built on a foundation of explicit, secure assumptions rather than hidden vulnerabilities.

key-concepts-text

CORE CONCEPTS

How to Design a Custody Infrastructure for Digital Assets

A systematic guide to architecting secure, scalable, and operationally sound systems for managing cryptographic keys and executing blockchain transactions.

The foundation of any digital asset custody system is key generation and storage. This process must prioritize cryptographic security from inception. For institutional use, keys are typically generated using Hardware Security Modules (HSMs) or specialized Multi-Party Computation (MPC) libraries like libsecp256k1 in an air-gapped environment. The generated private key material is never stored in plaintext; instead, it is split into cryptographic shares using algorithms like Shamir's Secret Sharing (SSS) or Threshold Signature Schemes (TSS). These shares are then distributed across geographically separate, access-controlled secure enclaves or dedicated HSMs, ensuring no single point of failure or compromise.

Once keys are secured, the next layer is transaction authorization and policy enforcement. This is governed by a policy engine that defines rules for transaction validity, such as multi-signature quorums (e.g., 2-of-3), daily withdrawal limits, allowed destination addresses (whitelists), and transaction co-signer hierarchies. Modern custody architectures implement this logic programmatically, often using smart contracts for on-chain enforcement (like a Gnosis Safe) or off-chain oracle networks and signing services that validate requests against a predefined policy file before any cryptographic operation is initiated. This layer transforms raw key material into a governed financial control system.

The transaction lifecycle—from construction to settlement—is the operational core. A transaction request is first parsed and validated against the policy engine. If approved, the required key shares are retrieved from their secure locations to collaboratively generate a signature, a process that occurs entirely within secure hardware for MPC or via coordinated signing ceremonies for multi-sig. The signed transaction is then broadcast to the appropriate blockchain network (e.g., Ethereum, Bitcoin) via redundant, monitored node infrastructure. Finally, the system must monitor for on-chain confirmation and update internal accounting ledgers, a process known as settlement. This entire flow must be auditable, with immutable logs for compliance and incident response.

Designing for security and operational resilience requires addressing specific threat models. This includes implementing quorum-based signing to mitigate insider threats, using hardware-enforced key isolation to prevent extraction, and maintaining cold storage vaults for long-term asset holdings. Furthermore, the infrastructure must have robust disaster recovery and key rotation procedures. Regular internal audits, penetration testing, and the use of formally verified cryptographic libraries are non-negotiable for institutional-grade custody. The system's security is only as strong as its weakest procedural or technical link.

Finally, the custody system must provide clear APIs and integration points for users and downstream applications. This includes a well-documented REST or gRPC API for submitting transaction requests, querying balances, and managing policies. For developers, providing Software Development Kits (SDKs) and webhook notifications for transaction status updates is essential. The infrastructure should abstract away blockchain complexity, allowing applications to interact with digital assets through simple, secure calls, while the custody platform handles key security, signing, and blockchain interaction seamlessly in the background.

core-components

DESIGN PRINCIPLES

Architectural Components of a Custody System

Building a secure custody infrastructure requires a modular approach. This guide breaks down the core technical components, from key management to transaction orchestration, that developers must integrate.

Hardware Security Modules (HSMs)

The physical root of trust for private key generation and storage. HSMs are FIPS 140-2 Level 3+ certified devices that perform cryptographic operations in a physically isolated, tamper-proof environment. They ensure keys are never exposed in plaintext to connected systems.

Key Functions: Secure key generation, signing, and storage.
Integration: Connect via PKCS#11 or vendor-specific APIs.
Examples: Thales, Utimaco, and cloud HSM services from AWS CloudHSM and Google Cloud KMS.

EXPLORE

Multi-Party Computation (MPC)

A cryptographic protocol that distributes signing authority across multiple parties. MPC eliminates single points of failure by splitting a private key into shares held by separate entities. Transactions require a threshold (e.g., 2-of-3) of parties to collaborate to sign, without any single party ever reconstructing the full key.

Use Case: Enterprise-grade wallets and exchange custody.
Benefits: No single point of failure, programmable signing policies.
Protocols: GG18, GG20, and implementations from Fireblocks, Curv, and ZenGo.

EXPLORE

Transaction Orchestration Engine

The software layer that constructs, validates, and routes transactions for approval. This engine interacts with HSMs or MPC nodes, enforces business logic (whitelists, limits), and handles nonce management across multiple blockchains.

Core Tasks: Fee estimation, payload construction, and sequential nonce management.
Challenge: Preventing double-spends and ensuring atomicity in DeFi operations.
Tools: Custom-built services or adapters for protocols like MetaMask Institutional.

Policy & Approval Workflows

The rule engine that codifies governance and risk controls. Policies define who can approve transactions, under what conditions, and with what limits. Workflows route transactions for multi-signature approval based on amount, destination, or asset type.

Examples: A >$1M ETH transfer requires 3-of-5 approvals; all withdrawals to a new address have a 24-hour delay.
Implementation: Often uses smart contracts for on-chain enforcement or off-chain policy servers.

Audit Logging & Monitoring

A comprehensive, immutable ledger of all custody-related events. Every key operation, transaction initiation, policy change, and access attempt must be logged. Real-time monitoring alerts on anomalous patterns, such as rapid-fire withdrawal attempts.

Data Captured: Timestamps, user IDs, transaction hashes, policy decisions, and system health metrics.
Compliance: Essential for SOC 2 Type II audits and regulatory reporting.
Integration: Feeds into SIEM tools like Splunk or Datadog.

Key Recovery & Inheritance

A secure process to regain access to assets if primary keys are lost. Social Recovery schemes use a pre-defined set of guardians (other devices or trusted entities) to reconstruct access. Inheritance solutions use time-locked contracts or decentralized protocols to transfer asset control upon verifiable events.

Techniques: Shamir's Secret Sharing, smart contract-based recovery (e.g., Safe{Wallet}).
Critical Design: Must balance security against the risk of permanent loss.

CUSTODY ARCHITECTURES

Key Management Scheme Comparison

A comparison of the primary cryptographic schemes for securing digital asset custody, evaluating trade-offs between security, operational complexity, and user experience.

Feature / Metric	Single-Party Custody	Multi-Party Computation (MPC)	Multi-Signature (Multisig)
Private Key Generation	Single device/wallet	Distributed across parties	Distributed across signers
Signing Process	Single signature	Collaborative computation (no full key)	Multiple independent signatures
Fault Tolerance	None (single point of failure)	Threshold schemes (e.g., 2-of-3)	Threshold schemes (e.g., 2-of-3)
Typical Transaction Latency	< 1 sec	2-5 sec (network rounds)	5-30 sec (sequential signing)
On-Chain Footprint	Standard transaction	Standard transaction	Larger transaction & gas cost
Key Refresh / Rotation	Impossible (new wallet)	Proactive secret sharing	Requires new wallet setup
Provider Lock-in Risk	High	Medium (protocol-dependent)	Low (standardized EIP-712)
Auditability & Transparency	Low	Medium (complex proofs)	High (on-chain verification)

transaction-engine-design

CUSTODY INFRASTRUCTURE

Designing the Transaction Engine

A secure transaction engine is the core of any digital asset custody solution, responsible for constructing, signing, and broadcasting transactions. This guide details the architectural patterns and security considerations for building one.

The primary function of a transaction engine is to orchestrate the signing flow for on-chain operations. It receives a transaction intent (e.g., 'send 1 ETH to address X'), fetches the latest network state (nonce, gas prices), constructs the raw transaction payload, and routes it to the secure signing environment. This separation of duties—between the frontend interface, the transaction builder, and the signing module—is a fundamental security principle. The engine should never have direct access to private keys.

For institutional custody, supporting multi-party computation (MPC) or multi-signature (multisig) schemes is essential. An MPC-based engine uses distributed key generation (DKG) to create signatures without ever assembling a complete private key in one location. A multisig engine, like one using Safe (formerly Gnosis Safe) smart contracts, constructs transactions that require M-of-N approvals. The engine must manage the signing ceremony, collect partial signatures from geographically distributed signers, and aggregate them into a valid final signature.

Transaction simulation and risk assessment are critical pre-signing steps. Before presenting a transaction for approval, the engine should simulate its execution using a service like Tenderly or a local node fork. This checks for reverts, unexpected side effects, or excessive gas costs. The engine can integrate threat feeds to flag interactions with known malicious addresses or newly deployed contracts. Implementing allow-lists for destination addresses and contract methods at the engine level adds another layer of operational security.

Here is a simplified code example of a transaction engine's core construction logic using Ethers.js, demonstrating parameter validation and payload building:

javascript
async function buildTransaction(fromAddress, to, value, data = '0x') {
  // 1. Fetch live network parameters
  const nonce = await provider.getTransactionCount(fromAddress);
  const gasPrice = await provider.getGasPrice();
  const chainId = (await provider.getNetwork()).chainId;

  // 2. Estimate gas limit
  const gasLimit = await provider.estimateGas({
    from: fromAddress,
    to: to,
    value: value,
    data: data
  });

  // 3. Construct the raw transaction object
  const txPayload = {
    nonce,
    gasPrice,
    gasLimit,
    to,
    value,
    data,
    chainId
  };

  // 4. Serialize for signing (RLP encoding)
  return ethers.utils.serializeTransaction(txPayload);
}

The engine must be network-agnostic, supporting EVM chains, Solana, Cosmos, and others through abstracted providers. It should handle chain-specific nuances like Solana's recent blockhash requirements or Cosmos SDK's memo fields. For scalability, the engine architecture should be stateless, with all persistent data (transaction queues, approval states) stored in a separate database. This allows for horizontal scaling and high availability, ensuring the custody service remains operational during peak loads or maintenance events.

Finally, comprehensive logging and auditing are non-negotiable. Every action—transaction construction, simulation result, approval request, and broadcast—must be immutably logged with relevant metadata (timestamp, user ID, originating IP). These logs feed into real-time monitoring dashboards and are essential for post-incident forensic analysis and regulatory compliance. The transaction engine is not just a technical component; it's the enforceable layer of your security and operational policy.

audit-logging-implementation

CUSTODY INFRASTRUCTURE

Implementing Immutable Audit Logging

A guide to designing tamper-proof audit trails for digital asset custody, covering core principles, architectural patterns, and implementation strategies using blockchain and cryptographic techniques.

Immutable audit logging is the foundational layer of accountability and transparency in digital asset custody. Unlike traditional databases where logs can be altered, an immutable log creates a permanent, cryptographically verifiable record of every action. This includes key lifecycle events like wallet creation, transaction signing, policy changes, and access attempts. The primary goal is to provide a single source of truth that can be independently audited to detect anomalies, prove compliance, and reconstruct events after an incident. This is non-negotiable for institutional custody, where regulators and clients demand provable security and operational integrity.

The core mechanism for achieving immutability is cryptographic hashing. Each log entry contains a timestamp, action, actor, and relevant metadata (e.g., transaction hash, wallet address). This entry is hashed using SHA-256. The critical step is linking entries: each new log's hash includes the hash of the previous entry, creating a hash chain or Merkle tree. Tampering with any historical entry would require recalculating all subsequent hashes, which is computationally infeasible. For enhanced security, this chain's latest hash can be periodically anchored to a public blockchain like Ethereum or Bitcoin via a transaction, providing a timestamp and proof of existence that is secured by the underlying blockchain's consensus.

Architecturally, the audit log should be a separate, write-only subsystem. Custody transaction engines and key management services must publish events to this log via a well-defined API before an action is considered complete. This enforces event sourcing, where the log becomes the primary state record. Implementations often use append-only databases, specialized log services like Apache Kafka with strict retention policies, or directly leverage a permissioned blockchain (e.g., a private Ethereum network, Hyperledger Fabric) as the ledger itself. Access to the log must be strictly controlled, with read-only interfaces for auditors and automated monitoring systems.

For developers, implementing this involves creating a structured log schema. Each entry should be a JSON object with consistent fields: sequenceId, timestamp, eventType (e.g., SIGNING_INITIATED, POLICY_UPDATED), actorId, subjectId (the asset or wallet involved), and metadata. Here's a conceptual code snippet for creating a hash chain in Node.js:

javascript
const crypto = require('crypto');
class AuditLog {
  constructor() { this.chain = []; }
  addEntry(data) {
    const prevHash = this.chain.length > 0 ? this.chain[this.chain.length-1].hash : '0';
    const entry = {
      data,
      prevHash,
      timestamp: new Date().toISOString(),
    };
    entry.hash = this.calculateHash(entry);
    this.chain.push(entry);
    return entry.hash; // Can be anchored to a blockchain
  }
  calculateHash(entry) {
    return crypto.createHash('sha256').update(JSON.stringify(entry)).digest('hex');
  }
}

Effective audit logging must be paired with real-time monitoring and alerting. Systems should parse the log stream to detect suspicious patterns, such as rapid-fire signing attempts, changes to withdrawal whitelists, or access from unauthorized geolocations. Tools like the ELK Stack (Elasticsearch, Logstash, Kibana) or Splunk can index and visualize log data for this purpose. Furthermore, the integrity of the entire log must be regularly verified. This involves recomputing the hash chain and comparing the latest hash with the value stored on the public blockchain anchor. This process provides cryptographic proof that the historical record has not been altered, fulfilling the core requirement of non-repudiation for custody providers.

client-interface-patterns

CUSTODY INFRASTRUCTURE

Client Interface and API Patterns

Designing a secure and scalable custody system requires robust interfaces for key management, transaction signing, and policy enforcement. These patterns define how applications interact with secure key storage.

Multi-Party Computation (MPC) Wallets

MPC wallets eliminate single points of failure by distributing private key shards across multiple parties. Signing requires a threshold of participants, enhancing security without a single seed phrase.

Key Generation: Keys are generated in a distributed manner using protocols like GG20.
Transaction Signing: Parties collaborate to produce a signature without reconstructing the full key.
Use Case: Institutional custody (Fireblocks, Curv) and wallet-as-a-service providers use MPC for scalable, policy-driven asset control.

EXPLORE

Hardware Security Module (HSM) Integration

HSMs are physical devices that provide FIPS 140-2 Level 3 certified secure key generation, storage, and cryptographic operations. They are the backbone of traditional financial custody.

API Pattern: Applications use PKCS#11 or vendor-specific REST APIs (e.g., Thales, Utimaco) to delegate signing.
Air-Gapped Signing: For highest security, HSMs can be kept offline, with transactions transferred via QR codes or USB drives.
Consideration: Higher latency and cost compared to cloud-based MPC, but offers proven physical security.

EXPLORE

Policy Engines and Transaction Pre-Screens

A policy engine evaluates transaction requests against a set of rules before signing is permitted. This is critical for compliance and risk management.

Rule Types: Whitelisted addresses, daily volume limits, time-of-day restrictions, and multi-approval workflows.
Implementation: Policies are often defined in JSON or a DSL and enforced by a separate service that interfaces with the signing module.
Example: A rule may require 2-of-3 administrator approvals for any transfer over 10 ETH.

Wallet Abstraction & Account Abstraction

ERC-4337 (Account Abstraction) and EIP-3074 allow smart contract wallets to manage assets with programmable logic, separating signer keys from the wallet address.

Custody Integration: Custody providers can act as the signer for an abstracted account, enabling social recovery, batched transactions, and gas sponsorship.
API Pattern: Custody service exposes an endpoint that returns signatures for user operations (UserOperation objects).
Benefit: Users retain a consistent smart contract wallet address while the underlying signer keys can be rotated or managed institutionally.

EXPLORE

Audit Logging and Non-Repudiation

A tamper-proof audit trail is non-negotiable for regulated custody. Every API call, key access attempt, and transaction signing event must be logged.

Immutable Storage: Logs should be written to a separate, append-only system or a blockchain (e.g., anchoring hashes to Ethereum or a private ledger).
Data Captured: Requestor ID, timestamp, operation type, affected addresses, amount, policy decision, and final transaction hash.
Compliance: Essential for SOC 2 Type II, ISO 27001, and financial regulations.

REST vs gRPC API Design

Choosing the right API protocol impacts performance, developer experience, and security.

REST (HTTP/JSON): Ubiquitous and easy to debug. Use for administrative tasks (policy management, user onboarding) and web clients. Implement strict rate limiting and authentication (OAuth 2.0, API keys).
gRPC (HTTP/2, Protobuf): Better for high-frequency, low-latency operations like real-time transaction streaming or internal service communication. Offers built-in streaming and smaller payloads.
Hybrid Approach: Many custody providers (e.g., Coinbase Prime) offer both, using gRPC for core signing services and REST for management APIs.

CUSTODY ARCHITECTURE

Security Controls and Threat Mitigation Matrix

Comparison of core security models for digital asset custody, detailing key controls and their effectiveness against common threats.

Security Control / Threat	Self-Custody (Hot Wallet)	Multi-Party Computation (MPC)	Hardware Security Module (HSM) Custody
Private Key Storage	Single device memory	Distributed key shards	FIPS 140-2 Level 3+ certified hardware
Signing Authority	Single point of failure	Threshold signatures (e.g., 2-of-3)	Quorum policies with hardware authentication
Insider Threat Mitigation
Remote Exploit Risk (e.g., phishing)	High	Medium (requires compromise of threshold)	Low (air-gapped signing)
Transaction Finality Speed	< 1 sec	2-5 sec (network coordination)	1-3 sec (HSM processing)
Disaster Recovery Complexity	High (seed phrase management)	Medium (shard backup procedures)	High (HSM cluster & key backup)
Audit Trail & Compliance	Limited	Comprehensive (on-chain proofs)	Comprehensive (HSM logs, CEX-ready)
Approximate Annual OpEx (for enterprise)	$1k-5k	$50k-200k+	$200k-1M+

scalability-considerations

SCALABILITY AND MULTI-CHAIN CONSIDERATIONS

How to Design a Custody Infrastructure for Digital Assets

A scalable custody system must manage assets across multiple blockchains while maintaining security, operational efficiency, and future flexibility. This guide outlines the architectural decisions and trade-offs involved.

Modern digital asset custody must support a multi-chain reality. A rigid, single-chain architecture is insufficient for institutions holding assets on Ethereum, Solana, Bitcoin, and emerging Layer 2 networks. The core design challenge is balancing security isolation with operational unification. A common pattern is the hierarchical deterministic (HD) wallet system, using a single seed phrase to derive unique key pairs for each supported chain via separate derivation paths (e.g., m/44'/60'/0'/0 for Ethereum, m/44'/501'/0'/0' for Solana). This provides a unified recovery mechanism while keeping chain-specific keys logically separated.

Scalability requires abstraction layers. The custody backend should interact with a generic BlockchainAdapter interface, not directly with each chain's RPC. For example, a signTransaction method would accept a chain-agnostic payload, which the appropriate adapter converts into a chain-specific transaction. This allows adding support for a new blockchain by implementing a single adapter module, without modifying core signing or security logic. Consider using established libraries like ethers.js for EVM chains and @solana/web3.js for Solana within these adapters to handle low-level details.

Transaction scheduling and fee management become complex at scale. A custody system must handle nonce management for EVM chains, recent blockhash requirements for Solana, and dynamic fee markets across all networks. Implement a transaction queue with chain-specific workers that monitor mempool status and network congestion. For fees, integrate with services like Gas Station Network (GSN) for Ethereum or use priority fee estimation for Solana to ensure timely processing without overpaying. Batch operations, where protocol-safe, can significantly reduce costs and load.

Security models must scale with the number of chains and assets. Multi-signature (multisig) schemes are essential, but their implementation varies: Ethereum uses smart contracts like Gnosis Safe, while Bitcoin uses native OP_CHECKMULTISIG and Solana uses the SPL Token multisig program. Your custody logic should abstract the approval policy (e.g., 2-of-3 signatures) from the chain-specific signing mechanics. Furthermore, hardware security module (HSM) integration must support the cryptographic algorithms (ECDSA, EdDSA) and curve types (secp256k1, ed25519) required by different blockchains.

Finally, design for observability and key lifecycle management. Implement comprehensive logging and monitoring for all cross-chain operations, tracking asset movements by chain ID and asset type. Key rotation and revocation procedures must account for each blockchain's derivation path and address format. Use off-chain computation for sensitive operations like key derivation, and ensure your disaster recovery process includes testing restoration for every supported chain. The infrastructure should be stateless where possible, with all critical data, including derivation paths and chain configurations, stored securely in an encrypted vault.

DEVELOPER FAQ

Frequently Asked Questions on Custody Design

Common technical questions and troubleshooting for building secure, scalable custody infrastructure for digital assets.

Multi-Party Computation (MPC) and multi-signature (multi-sig) wallets are both threshold signature schemes, but they differ fundamentally in architecture and on-chain footprint.

Multi-sig (e.g., Gnosis Safe) uses a smart contract on-chain. It requires M out of N predefined private key signatures to authorize a transaction. This is transparent and verifiable on-chain but incurs gas costs and exposes the participant count.

MPC (e.g., using libraries like ZenGo's tss-lib) generates a single signature through a distributed protocol off-chain. The private key is never assembled in one place. Only the final, standard ECDSA signature is broadcast, making transactions indistinguishable from a single-key wallet and cheaper on gas.

Key trade-off: Multi-sig offers on-chain auditability; MPC provides privacy and efficiency but requires a more complex, trusted off-chain coordination layer.

resource-links

CUSTODY ARCHITECTURE

Implementation Resources and Tools

Practical tools and reference architectures for designing a secure, auditable digital asset custody infrastructure. These resources focus on key management, transaction authorization, infrastructure security, and operational controls used by exchanges, funds, and on-chain applications.

Hardware Security Modules (HSMs) for Key Storage

Hardware Security Modules (HSMs) provide tamper-resistant environments for generating and storing private keys. They are a foundational component of institutional-grade custody.

Key implementation considerations:

Key generation inside the HSM to prevent plaintext key exposure
FIPS 140-2 Level 3 or higher certification for regulated environments
Support for secp256k1 and EdDSA curves used by Bitcoin, Ethereum, and Solana
Integration with signing services via PKCS#11 or REST APIs

Common deployment patterns:

Use HSMs for cold and warm wallets, while hot wallets rely on rate-limited signing
Combine with policy engines to enforce transaction limits and approvals

HSMs are typically used alongside MPC or multisig rather than as a standalone custody solution.

EXPLORE

Multi-Party Computation (MPC) Wallet Frameworks

Multi-Party Computation (MPC) splits private keys into multiple shares held by separate parties or services. No single system ever reconstructs the full key.

Why MPC is widely adopted:

Eliminates single points of compromise
Enables policy-based signing such as 2-of-3 or 3-of-5 approvals
Supports automated workflows without exposing private keys

Implementation details developers should evaluate:

Threshold schemes (GG18, GG20, or proprietary variants)
Latency impact during transaction signing
Key resharing for employee rotation and incident response

MPC is commonly used by centralized exchanges and custodians to secure billions in assets while maintaining operational speed.

EXPLORE

Cold Storage and Air-Gapped Wallet Design

Cold storage keeps private keys fully offline to minimize attack surface. Air-gapped systems are still the dominant approach for long-term treasury holdings.

Best practices for cold custody:

Generate keys on offline machines with verified OS images
Store key material using encrypted removable media or HSM-backed devices
Require multi-person physical access controls for key retrieval

Operational safeguards:

Documented key ceremonies with video and written records
Geographic distribution of key shards or backups
Regular recovery drills to validate procedures

Cold storage significantly reduces online attack risk but introduces operational friction. Most custody architectures combine cold storage for reserves with warm or hot wallets for liquidity.

Transaction Policy Engines and Approval Workflows

Transaction policy engines enforce rules before a signature is produced. They are critical for preventing unauthorized or erroneous transfers.

Typical controls include:

Per-transaction and daily transfer limits
Address allowlists and blocklists
Time delays for high-value withdrawals
Multi-role approval flows (operator, reviewer, approver)

Implementation notes:

Policies should be enforced before cryptographic signing, not after
All decisions must be logged and immutable for auditability
Integrate alerts for anomalous behavior or policy violations

Well-designed policy engines reduce reliance on human trust and align custody operations with internal risk management frameworks.

Auditing, Logging, and Compliance Tooling

Auditable custody systems require detailed logs of every key, policy, and transaction event. This is essential for internal controls and external regulators.

Core logging requirements:

Key lifecycle events such as generation, rotation, and decommissioning
All signing requests and approvals with timestamps and identities
System access logs for infrastructure and key services

Recommended practices:

Store logs in append-only systems such as WORM storage or immutable databases
Retain logs for multiple years to meet regulatory expectations
Regularly reconcile on-chain transactions with internal records

Strong observability allows teams to detect incidents early and demonstrate custody integrity during audits or investigations.