Backtesting

Backtesting runs your assertions against actual historical transactions from a specified block range. This helps ensure your assertions work correctly with real transaction patterns and don’t trigger false positives on legitimate protocol operations.

If you’re looking for the reference documentation on all configuration parameters, see the Backtesting Reference.

How It Works

Backtesting operates in two phases:

Transaction Fetching: Identifies all transactions to the target contract in the block range
Transaction Validation: Replays each transaction with your assertion enabled

The tool fetches transactions using one of two methods:

trace_filter API (recommended): Single RPC call for the entire block range, detects internal calls
Block scanning: One RPC call per block, detects only direct calls

For validation, the tool forks the EVM at each transaction to the target contract and replays it against the specified assertion.

Example

// SPDX-License-Identifier: MIT
pragma solidity ^0.8.20;

import {CredibleTestWithBacktesting} from "../src/backtesting/CredibleTestWithBacktesting.sol";
import {BacktestingTypes} from "../src/backtesting/BacktestingTypes.sol";
import {MyAssertion} from "../assertions/src/MyAssertion.a.sol";

contract MyBacktestingTest is CredibleTestWithBacktesting {
    function testHistoricalTransactions() public {
        BacktestingTypes.BacktestingResults memory results = executeBacktest(
            BacktestingTypes.BacktestingConfig({
                targetContract: 0x5fd84259d66Cd46123540766Be93DFE6D43130D7, // USDC on Optimism Sepolia
                endBlock: 31336940, // Latest block to test
                blockRange: 20, // Number of blocks to test
                assertionCreationCode: type(MyAssertion).creationCode,
                assertionSelector: MyAssertion.assertionInvariant.selector,
                rpcUrl: "https://sepolia.optimism.io",
                detailedBlocks: false,
                useTraceFilter: true,
                forkByTxHash: false
            })
        );

        // Check results
        assert(results.assertionFailures == 0, "Found protocol violations!");
    }
}

Backtesting tests inherit from CredibleTestWithBacktesting instead of CredibleTest. The executeBacktest function returns results in a BacktestingTypes.BacktestingResults struct.

Consider excluding backtesting from your CI/CD pipeline as it can take time to run. Run backtesting manually on demand to increase confidence in your assertions before deployment.

Common Configurations

Large Block Ranges (100+ blocks)

BacktestingTypes.BacktestingConfig({
    targetContract: TARGET_CONTRACT,
    endBlock: LATEST_BLOCK,
    blockRange: 100,
    assertionCreationCode: type(MyAssertion).creationCode,
    assertionSelector: MyAssertion.assertionInvariant.selector,
    rpcUrl: vm.envString("MAINNET_RPC_URL"),
    detailedBlocks: false,
    useTraceFilter: true,   // Fast fetching
    forkByTxHash: false     // Fast validation
})

Expected runtime: ~20 seconds for 100 blocks with 28 transactions

Debugging Failures

BacktestingTypes.BacktestingConfig({
    targetContract: TARGET_CONTRACT,
    endBlock: FAILING_BLOCK,
    blockRange: 5,  // Small range around failure
    assertionCreationCode: type(MyAssertion).creationCode,
    assertionSelector: MyAssertion.assertionInvariant.selector,
    rpcUrl: vm.envString("MAINNET_RPC_URL"),
    detailedBlocks: false,
    useTraceFilter: true,
    forkByTxHash: true      // Exact state for debugging
})

Use this when investigating why a specific transaction failed. The forkByTxHash: true setting replays all prior transactions to get the exact state just before execution. This is disabled by default, since it uses significantly more RPC calls and hence is slower. It is recommended to use this setting when debugging failures for concentrated block intervals.

Quick Tests (< 20 blocks)

BacktestingTypes.BacktestingConfig({
    targetContract: TARGET_CONTRACT,
    endBlock: RECENT_BLOCK,
    blockRange: 10,
    assertionCreationCode: type(MyAssertion).creationCode,
    assertionSelector: MyAssertion.assertionInvariant.selector,
    rpcUrl: vm.envString("MAINNET_RPC_URL"),
    detailedBlocks: false,
    useTraceFilter: false,  // Simple method sufficient
    forkByTxHash: false
})

For small ranges, the simpler block scanning method is adequate.

Testing Against a Specific Transaction

When you want to test your assertion against a specific known transaction (such as a historical exploit), set blockRange: 1 with the block number containing that transaction:

BacktestingTypes.BacktestingConfig({
    targetContract: BALANCER_V2_VAULT,
    endBlock: 23717632,     // Block containing the exploit transaction
    blockRange: 1,          // Only test this single block
    assertionCreationCode: type(BatchSwapDeltaAssertion).creationCode,
    assertionSelector: BatchSwapDeltaAssertion.assertionBatchSwapRateManipulation.selector,
    rpcUrl: vm.envString("MAINNET_RPC_URL"),
    detailedBlocks: false,
    useTraceFilter: true,
    forkByTxHash: true
})

This is particularly useful for:

Validating that your assertion catches known exploits
Testing edge cases found in production
Verifying assertion behavior on specific problematic transactions

See the Balancer V2 Rate Manipulation Exploit for a real-world example of using backtesting to verify an assertion catches a historical exploit transaction.

Understanding Results

When backtesting completes, transactions are categorized into different result types to help you quickly identify issues:

Result Categories

PASS - Assertion passed successfully

Transaction replayed and assertion validated without errors
Recommended Action: None - everything is working as expected

NEEDS_REVIEW - Transaction requires review

Transaction either called a function not monitored by your assertion, or failed to replay in the test environment
Common causes:
- Function mismatch: Assertion monitors batch() but transaction called call()
- State dependencies: Transaction depends on earlier transactions in the same block
- Context requirements: Transaction requires specific block state that isn’t present when forking at block level
Recommended Action:
- If many transactions show this status, enable forkByTxHash: true to fork at exact transaction state
- Review to ensure your assertion is triggering on the correct function
Note: These are NOT assertion violations - they’re informational and indicate transactions that either aren’t relevant to your assertion or fail before the assertion is executed due to wrong configuration of the backtest

ASSERTION_FAIL - Assertion reverted

Your assertion reverted when validating this transaction
Most Common Causes:
- False positive: Your assertion logic incorrectly flags legitimate protocol behavior (most common)
- Gas limit exceeded: Assertion ran out of gas (300k limit)
- Assertion bug: Logic error in your assertion code
- Known exploit: If testing against a historical exploit block, this is expected behavior
Recommended Action:
- Check if the transaction is a known exploit - if so, this confirms your assertion works correctly
- Review your assertion logic to ensure it properly handles this transaction pattern
- Check if the assertion is running out of gas and needs optimization
- Verify the transaction on a block explorer to understand what it does

UNKNOWN_ERROR - Unexpected failure

An error occurred that doesn’t fit other categories
May indicate RPC issues, assertion bugs, or unexpected contract behavior
Recommended Action: Check the error message and retry; if persistent, file a bug report.

Reading the Summary

At the end of each backtest run, you’ll see a summary like:

==========================================
           BACKTESTING SUMMARY
==========================================
Block Range: 23697580 - 23697590
Total Transactions: 15
Processed Transactions: 15
Successful Validations: 10
Failed Validations: 5

=== ERROR BREAKDOWN ===
Protocol Violations (Assertion Failures): 1
Needs Review (Selector Mismatch or Prestate Issues): 4

Success Rate: 66%
================================

Running Tests

# Set RPC URL environment variable
export RPC_URL="YOUR_RPC_URL"

# Run backtesting tests
pcl test --ffi --match-test testHistoricalTransactions

The --ffi flag is required to enable foreign function interface for RPC calls.

Foundry Configuration

Enable FFI in your Foundry profile:

[profile.backtest-assertions]
src = "assertions/src"
test = "assertions/test/backtest"
out = "assertions/out"
libs = ["lib"]
solc = "0.8.29"
optimizer = true
optimizer_runs = 200
ffi = true # To avoid setting --ffi in the test command

See CI/CD Integration for complete Foundry profile configuration.

Best Practices

Start small - Test with 10-20 blocks first to verify your setup
Use trace_filter - Set useTraceFilter: true if your RPC provider supports it
Keep forkByTxHash false - Enable if assertions are reverting and might be affected by txs earlier in a block
Use paid RPC providers - For block ranges over 1,000
Check assertion failures - Ensure results.assertionFailures == 0
Run before deployment - Catch false positives on real transactions
Run manually - Exclude from CI/CD to avoid long test runs

Learn More:

Backtesting Reference - Complete configuration details
Testing Assertions - Basic testing guide
Fuzz Testing - Test with random inputs
Balancer V2 Rate Manipulation Exploit

Getting Started

Architecture

Assertions

dApp

Guides

Reference

Resources

How It Works

Example

Common Configurations

Large Block Ranges (100+ blocks)

Debugging Failures

Quick Tests (< 20 blocks)

Testing Against a Specific Transaction

Understanding Results

Result Categories

Reading the Summary

Running Tests

Foundry Configuration

Best Practices

Getting Started

Architecture

Assertions

dApp

Guides

Reference

Resources

​How It Works

​Example

​Common Configurations

​Large Block Ranges (100+ blocks)

​Debugging Failures

​Quick Tests (< 20 blocks)

​Testing Against a Specific Transaction

​Understanding Results

​Result Categories

​Reading the Summary

​Running Tests

​Foundry Configuration

​Best Practices

How It Works

Example

Common Configurations

Large Block Ranges (100+ blocks)

Debugging Failures

Quick Tests (< 20 blocks)

Testing Against a Specific Transaction

Understanding Results

Result Categories

Reading the Summary

Running Tests

Foundry Configuration

Best Practices