Get Tokenomics

On-Chain Analytics for Tokenomists: Tools, Metrics, and Practice

On-chain analytics for tokenomists: Dune, Nansen, DefiLlama tools, holder concentration metrics, exchange flows, and wallet activity patterns.

A project claims: “we have 50,000 holders and a healthy distribution.” Open the blockchain — 10 wallets control 85% of supply, 40,000 addresses hold airdrop dust, and three whales dump $200K on DEX weekly. On-chain analytics turns declarations into facts. For a tokenomist, this isn’t an optional skill — it’s the primary tool for verification, design, and audit.

Why Tokenomists Need On-Chain Data

The blockchain is a public ledger. Everything is recorded: who owns what, when they bought, where they transferred, how much they staked. A tokenomist uses this data at three stages:

StageWhat we analyzeWhy
DesignComparables: how competitor tokens are distributed, staking ratios, holder concentrationCalibrating parameters: allocation, vesting, staking rewards
AuditFacts: does actual distribution match stated, is vesting executing, where do tokens flowVerifying a project’s claims
MonitoringDynamics: exchange flows, whale activity, concentration changesEarly warning of sell pressure
On-chain vs off-chain
On-chain data is objective but incomplete. It shows what happened (transfer, swap, stake), but not why. A whale moved 1M tokens to Binance — is it a sale? Rebalancing? Margin collateral? On-chain analytics provides facts; interpretation is up to the tokenomist.
On-chain analytics tools for tokenomistsCentral node — on-chain analytics, four tools: Dune, Nansen, Arkham, DefiLlamaOn-Chain Analyticsverification, audit, monitoringDune AnalyticsSQL queries on dataDefiLlamaTVL, yields, comparisonArkhamaddress deanonymizationNansenwallet labeling

Tools

Dune Analytics

What it is: a platform for SQL queries on on-chain data. Supports Ethereum, Polygon, Arbitrum, Optimism, Solana, Base, and dozens of other networks.

For tokenomists:

  • Custom queries: holder distribution, DEX volumes, flows
  • Public dashboards by other analysts
  • Real-time data visualization

Key tables:

TableContentsTypical query
tokens.transfersAll ERC-20 transfersFlows between wallets and exchanges
dex.tradesDEX tradesTrading volume, large swaps
staking.*Staking operationsPercentage of staked tokens
balancesCurrent balancesTop holders, concentration

Example query: top-10 token holders

SELECT
  address,
  balance,
  balance * 100.0 / SUM(balance) OVER() AS pct_supply
FROM tokens.balances
WHERE token_address = 0x...  -- token address
  AND blockchain = 'ethereum'
  AND balance > 0
ORDER BY balance DESC
LIMIT 10

This query is illustrative. Dune’s current tokens.balances spells expect a point-in-time filter (e.g., block_date = current_date or a block_time bound), so a direct copy-paste may need adjustment depending on the exact spell version in use.

Access: free plan (limited queries), Pro at $390/month per user (see dune.com/pricing).

Nansen

What it is: an analytics platform with wallet labeling. Nansen assigns labels to addresses: “exchange,” “fund,” “whale,” “smart money,” “market maker.”

For tokenomists:

  • Identifying holder types (not just addresses, but who’s behind them)
  • Tracking “smart money” — where capital flows
  • Exchange inflow/outflow analysis

Key features:

FeatureDescription
Token God ModeFull token analytics: holders, flows, concentration
Smart MoneyTracking fund, whale, and successful trader wallets
Exchange FlowToken inflow/outflow to CEXs
Wallet ProfilerIndividual address analysis: portfolio, activity, P&L

Access: Free tier and Pro at $49/month (annual) or $69/month (monthly); custom enterprise on request (see nansen.ai/plans).

Arkham Intelligence

What it is: a platform for deanonymization and visualization of on-chain data. Specialization — linking addresses to real entities (funds, exchanges, protocols, individuals).

For tokenomists:

  • Who’s behind large wallets
  • Flow visualization between entities (relationship graph)
  • Alerts on large transfers

Access: free plan with basic functionality.

DefiLlama

What it is: a DeFi protocol data aggregator. Open, free, no registration required.

For tokenomists:

  • TVL (Total Value Locked) of protocols — a trust and usage indicator
  • Pool and staking yields
  • Cross-network protocol comparison
  • Bridge flows

Key metrics:

MetricWhat it showsWhere to find
TVLHow much is locked in a protocoldefillama.com/protocol/[name]
Fees / RevenueProtocol incomedefillama.com/fees
YieldsPool yieldsdefillama.com/yields
StablecoinsStablecoin supply by networkdefillama.com/stablecoins
BridgesCross-chain flowsdefillama.com/bridges

Access: completely free, open API.

Etherscan and Block Explorers

What it is: the basic tool for viewing individual transactions, addresses, and contracts.

NetworkExplorer
EthereumEtherscan.io
BNB ChainBscScan.com
PolygonPolygonscan.com
ArbitrumArbiscan.io
SolanaSolscan.io
Cosmos ecosystem chainsMintscan.io (covers a family of Cosmos SDK chains)

For tokenomists:

  • Checking specific transactions and contracts
  • Reading smart contracts (verifying staking, vesting parameters)
  • Viewing holder lists (limited)

Tool Comparison

ToolData typeCostNetworksPrimary use
DuneSQL queriesFree / $390/mo (Pro)20+Custom analytics
NansenLabeled walletsFree / $49–$69/mo (Pro)10+Holder identification
ArkhamDeanonymizationFree+10+Who’s behind an address
DefiLlamaDeFi aggregatorFree100+TVL, yields, comparison
EtherscanBlock explorerFree1 per networkTransactions, contracts

Key Metrics for Tokenomists

1. Holder Concentration

Why it matters: high concentration = risk of price manipulation, centralized governance, mass sell-off.

Concentration Ratio

CR_n = Sum(Balance_top_n) / Supply × 100%
  • CR_n — share of supply controlled by the top-n wallets (computed)
  • CR_10 > 80% — critical concentration (author’s heuristic, not a standardized benchmark)
  • CR_10 < 50% — healthy distribution (author’s heuristic)
  • Thresholds assume team, treasury, foundation, and CEX custodial wallets are excluded — otherwise early-stage tokens trivially exceed 80% in their first years

Benchmarks:

CR_10InterpretationExample
< 30%Excellent distributionBitcoin (~6% — top-10 addresses, per Arkham/River/CoinLore 2026)
30–50%HealthyEthereum (~35%, excluding the Beacon Deposit Contract and CEX custodial wallets)
50–70%Moderate concentrationTypical DeFi token
70–90%High concentrationEarly-stage project
> 90%CriticalRed flag
Exclude contract addresses
When calculating concentration, exclude: exchange addresses (custodial), staking smart contracts, bridges, treasury wallets. Otherwise “top-10” will show exchanges and contracts, not real holders. A canonical example is Ethereum’s Beacon Deposit Contract — if counted as a single address, it alone holds a majority of ETH supply and dominates any “top-10” list, creating a misleading concentration picture. Nansen and Arkham help automatically categorize addresses. Also distinguish address-level from entity-level concentration: the same entity (exchange, ETF custodian, fund) often spreads balances across many addresses, so address-level top-10 understates real concentration, while entity-level top-10 overstates it if treated as “holders” rather than custodial aggregates. The Bitcoin figures illustrate this: top-10 addresses ≈ 6% of supply, but top-10 entities (including exchanges, ETFs, and MicroStrategy-style treasuries) exceed 25%.

Gini Coefficient

Gini = (2 × Sum(i × Balance_i)) / (n × Sum(Balance_i)) − (n + 1) / n
  • Balance_i — i-th balance in the sorted list (ascending: smallest first)
  • i — rank position from 1 to n
  • n — total number of addresses
  • Gini = 0 — perfect equality (everyone holds equal amounts)
  • Gini = 1 — perfect inequality (one address holds everything)
  • For free-float token holder distributions, typically observed: 0.85–0.99 (highly sensitive to methodology — wallet vs entity, inclusion of dust and contract addresses)

This is the canonical discrete rank-based Gini (equivalent to twice the area between the Lorenz curve and the line of equality). It requires balances to be sorted in ascending order before ranks i are assigned; with this convention the result is bounded in [0, 1]. The previously common form 1 − 2·(Σ i·x_i)/(n·Σx_i) + 1/n assumes descending sort and is algebraically equivalent to the canonical form above only with the correct sign conventions — we use the ascending form to avoid ambiguity.

The Gini coefficient for crypto assets is almost always > 0.9 (extreme inequality). This is normal: investors, the team, and treasury hold the bulk. What matters isn’t the absolute Gini, but its trend: decreasing = distribution improving, increasing = concentration growing.

2. Exchange Flows

Why it matters: token inflow to exchanges = potential sell pressure. Outflow = accumulation.

Net_flow = Inflow_CEX − Outflow_CEX
  • Net_flow > 0 — inflow to exchanges (classically read as bearish) (computed)
  • Net_flow < 0 — outflow from exchanges (classically read as bullish) (computed)
  • Look at the 7-day moving average, not individual days
This signal has weakened since 2023

The “inflow = bearish, outflow = bullish” rule of thumb was decisively muddied by several structural changes:

  • Spot ETFs (BTC, ETH) move large quantities on and off exchange wallets for custody rebalancing between authorized participants and custodians — these flows have nothing to do with retail sell pressure, and can run in either direction irrespective of market sentiment.
  • Staking and liquid staking — for PoS assets, outflows increasingly reflect deposits into staking contracts or LSTs, not long-term “cold storage” accumulation. Conversely, unstaking withdrawals may land on exchanges for hedging rather than immediate sale.
  • OTC desks and internal transfers — exchanges periodically reshuffle hot/cold wallets, which registers as flow without any underlying user activity.

Treat exchange net flow as a contextual signal, not a standalone trigger. Cross-reference with ETF creations/redemptions, staking deposit contract activity, and stablecoin flows before drawing conclusions.

SignalWhat’s happeningInterpretation
Large CEX inflowWhale transferred tokens to exchangePotential sale within hours/days
Sustained outflowTokens withdrawn to cold walletsAccumulation, long-term holding
Inflow after unlockInvestors withdraw unlocked tokens to exchangeExpected sell pressure

3. Wallet Activity

Why it matters: active address count shows real token usage (unlike trading volume, which is easily faked).

MetricDefinitionWhat it shows
Daily Active Addresses (DAA)Unique addresses with transactions per dayDaily activity
New addressesNew addresses that received the tokenAudience growth
Holding timeAverage token holding durationHolder loyalty
Coin age (dormancy)Average “age” of transferred coinsOld coins moving = large players acting

4. Staking and DeFi Positions

MetricFormula / sourceWhy
Staking ratioStaked_tokens / Total_supplyShows what share is locked and not pressuring the market
DeFi TVL in tokenDefiLlama / DuneToken used as collateral, liquidity
LP concentrationTop LPs’ share of the poolRisk of liquidity withdrawal
Unlock scheduleOn-chain vesting contractWhen to expect sell pressure

Practical Applications

Auditing a Project’s Tokenomics

When auditing an existing project, a tokenomist verifies on-chain data against a checklist:

On-chain audit checklist

  • Distribution matches stated: whitepaper allocation matches actual balances (treasury, team, investors)
  • Vesting is executing: vesting contract is functioning, unlocks happen on schedule
  • Concentration within limits: CR_10 < 70% (excluding exchanges and contracts)
  • No anomalous flows: large transfers to/from exchanges, unexplained treasury movements
  • Staking is healthy: staking ratio in target range, no dominance by 1–2 validators
  • Liquidity is sufficient: AMM pool depth can absorb vesting unlocks
  • Burn/emission matches model: if the model assumes burning — it's happening on-chain
  • Designing New Tokenomics

    When building tokenomics from scratch, on-chain data from comparable projects helps calibrate parameters:

    ParameterWhere to lookWhat to find
    Target staking ratioComparables in the same categoryMedian value for PoS/DeFi
    Liquidity pool sizeDEX pools of comparablesDepth relative to market cap
    Post-airdrop distribution speedDune: airdrop recipient behavior% who sold within 7/30 days
    Active usersDAA of comparablesRealistic forecast for the business plan
    Optimal liquidity allocation% of supply in DEX pools of comparables5–15% is a commonly observed range in our practice, but highly project-dependent

    Post-Launch Monitoring

    After TGE, on-chain monitoring provides early signals:

    SignalData sourceAction
    Rising CEX inflowNansen Exchange FlowPrepare liquidity, alert the MM
    Declining staking ratioDune: unstaking eventsInvestigate cause (yield, competitor, panic)
    Concentration increasingDune: top holdersWhale accumulating — assess intent
    DAA decliningDune: active addressesDeclining usage — product problem
    Large treasury mintEtherscanVerify alignment with governance decision

    Limitations of On-Chain Analytics

    Don't overestimate on-chain
    On-chain data is a powerful tool, but with limitations. Not all conclusions are obvious, and not all signals mean what they appear to.
    LimitationExplanation
    Incomplete attributionOne person can own hundreds of addresses. CR_10 may show 10 addresses belonging to a single whale
    CEX addresses = aggregatesAn exchange’s balance represents thousands of users. A large exchange wallet ≠ a large holder
    Private networksL2 data, private chains, and off-chain settlements may be unavailable
    Correlation ≠ causationInflow to exchange ≠ guaranteed sale. It could be rebalancing, margin collateral, OTC
    Historical biasOn-chain shows the past. Extrapolation is the analyst’s responsibility

    Holder Concentration Calculator

    Enter top-10 wallet balances and total supply to calculate CR_10.

    Holder Concentration Calculator (CR₁₀)
    Top-1, top 2–5, top 6–10, total supply — concentration and risk assessment
    Open calculator →

    Common Mistakes

    On-chain analytics pitfalls

  • Counting addresses as users: one person = many wallets. "100K holders" may be 5K real users with dust addresses from airdrops
  • Ignoring exchange addresses: including Binance Hot Wallet as "top-1 holder" distorts the concentration picture
  • Exchange inflow = sale: this is correlation, not causation. Check actual trades, not just transfers
  • Looking at absolute values: CR_10 = 60% for a post-TGE project is normal (team + investors). What matters is the trend: is concentration decreasing over time?
  • Extrapolating trends: "staking has grown for 3 months → it will grow forever" — no. Staking grows to equilibrium, then stabilizes
  • Need on-chain analytics for your tokenomics?

    We build monitoring frameworks, identify holder concentration risks, and track emission health for DeFi and GameFi projects.

    Get in touch