A project claims: “we have 50,000 holders and a healthy distribution.” Open the blockchain — 10 wallets control 85% of supply, 40,000 addresses hold airdrop dust, and three whales dump $200K on DEX weekly. On-chain analytics turns declarations into facts. For a tokenomist, this isn’t an optional skill — it’s the primary tool for verification, design, and audit.
Why Tokenomists Need On-Chain Data
The blockchain is a public ledger. Everything is recorded: who owns what, when they bought, where they transferred, how much they staked. A tokenomist uses this data at three stages:
| Stage | What we analyze | Why |
|---|---|---|
| Design | Comparables: how competitor tokens are distributed, staking ratios, holder concentration | Calibrating parameters: allocation, vesting, staking rewards |
| Audit | Facts: does actual distribution match stated, is vesting executing, where do tokens flow | Verifying a project’s claims |
| Monitoring | Dynamics: exchange flows, whale activity, concentration changes | Early warning of sell pressure |
Tools
Dune Analytics
What it is: a platform for SQL queries on on-chain data. Supports Ethereum, Polygon, Arbitrum, Optimism, Solana, Base, and dozens of other networks.
For tokenomists:
- Custom queries: holder distribution, DEX volumes, flows
- Public dashboards by other analysts
- Real-time data visualization
Key tables:
| Table | Contents | Typical query |
|---|---|---|
tokens.transfers | All ERC-20 transfers | Flows between wallets and exchanges |
dex.trades | DEX trades | Trading volume, large swaps |
staking.* | Staking operations | Percentage of staked tokens |
balances | Current balances | Top holders, concentration |
Example query: top-10 token holders
SELECT
address,
balance,
balance * 100.0 / SUM(balance) OVER() AS pct_supply
FROM tokens.balances
WHERE token_address = 0x... -- token address
AND blockchain = 'ethereum'
AND balance > 0
ORDER BY balance DESC
LIMIT 10
This query is illustrative. Dune’s current tokens.balances spells expect a point-in-time filter (e.g., block_date = current_date or a block_time bound), so a direct copy-paste may need adjustment depending on the exact spell version in use.
Access: free plan (limited queries), Pro at $390/month per user (see dune.com/pricing).
Nansen
What it is: an analytics platform with wallet labeling. Nansen assigns labels to addresses: “exchange,” “fund,” “whale,” “smart money,” “market maker.”
For tokenomists:
- Identifying holder types (not just addresses, but who’s behind them)
- Tracking “smart money” — where capital flows
- Exchange inflow/outflow analysis
Key features:
| Feature | Description |
|---|---|
| Token God Mode | Full token analytics: holders, flows, concentration |
| Smart Money | Tracking fund, whale, and successful trader wallets |
| Exchange Flow | Token inflow/outflow to CEXs |
| Wallet Profiler | Individual address analysis: portfolio, activity, P&L |
Access: Free tier and Pro at $49/month (annual) or $69/month (monthly); custom enterprise on request (see nansen.ai/plans).
Arkham Intelligence
What it is: a platform for deanonymization and visualization of on-chain data. Specialization — linking addresses to real entities (funds, exchanges, protocols, individuals).
For tokenomists:
- Who’s behind large wallets
- Flow visualization between entities (relationship graph)
- Alerts on large transfers
Access: free plan with basic functionality.
DefiLlama
What it is: a DeFi protocol data aggregator. Open, free, no registration required.
For tokenomists:
- TVL (Total Value Locked) of protocols — a trust and usage indicator
- Pool and staking yields
- Cross-network protocol comparison
- Bridge flows
Key metrics:
| Metric | What it shows | Where to find |
|---|---|---|
| TVL | How much is locked in a protocol | defillama.com/protocol/[name] |
| Fees / Revenue | Protocol income | defillama.com/fees |
| Yields | Pool yields | defillama.com/yields |
| Stablecoins | Stablecoin supply by network | defillama.com/stablecoins |
| Bridges | Cross-chain flows | defillama.com/bridges |
Access: completely free, open API.
Etherscan and Block Explorers
What it is: the basic tool for viewing individual transactions, addresses, and contracts.
| Network | Explorer |
|---|---|
| Ethereum | Etherscan.io |
| BNB Chain | BscScan.com |
| Polygon | Polygonscan.com |
| Arbitrum | Arbiscan.io |
| Solana | Solscan.io |
| Cosmos ecosystem chains | Mintscan.io (covers a family of Cosmos SDK chains) |
For tokenomists:
- Checking specific transactions and contracts
- Reading smart contracts (verifying staking, vesting parameters)
- Viewing holder lists (limited)
Tool Comparison
| Tool | Data type | Cost | Networks | Primary use |
|---|---|---|---|---|
| Dune | SQL queries | Free / $390/mo (Pro) | 20+ | Custom analytics |
| Nansen | Labeled wallets | Free / $49–$69/mo (Pro) | 10+ | Holder identification |
| Arkham | Deanonymization | Free+ | 10+ | Who’s behind an address |
| DefiLlama | DeFi aggregator | Free | 100+ | TVL, yields, comparison |
| Etherscan | Block explorer | Free | 1 per network | Transactions, contracts |
Key Metrics for Tokenomists
1. Holder Concentration
Why it matters: high concentration = risk of price manipulation, centralized governance, mass sell-off.
Concentration Ratio
- CR_n — share of supply controlled by the top-n wallets (computed)
- CR_10 > 80% — critical concentration (author’s heuristic, not a standardized benchmark)
- CR_10 < 50% — healthy distribution (author’s heuristic)
- Thresholds assume team, treasury, foundation, and CEX custodial wallets are excluded — otherwise early-stage tokens trivially exceed 80% in their first years
Benchmarks:
| CR_10 | Interpretation | Example |
|---|---|---|
| < 30% | Excellent distribution | Bitcoin (~6% — top-10 addresses, per Arkham/River/CoinLore 2026) |
| 30–50% | Healthy | Ethereum (~35%, excluding the Beacon Deposit Contract and CEX custodial wallets) |
| 50–70% | Moderate concentration | Typical DeFi token |
| 70–90% | High concentration | Early-stage project |
| > 90% | Critical | Red flag |
Gini Coefficient
- Balance_i — i-th balance in the sorted list (ascending: smallest first)
- i — rank position from 1 to n
- n — total number of addresses
- Gini = 0 — perfect equality (everyone holds equal amounts)
- Gini = 1 — perfect inequality (one address holds everything)
- For free-float token holder distributions, typically observed: 0.85–0.99 (highly sensitive to methodology — wallet vs entity, inclusion of dust and contract addresses)
This is the canonical discrete rank-based Gini (equivalent to twice the area between the Lorenz curve and the line of equality). It requires balances to be sorted in ascending order before ranks i are assigned; with this convention the result is bounded in [0, 1]. The previously common form 1 − 2·(Σ i·x_i)/(n·Σx_i) + 1/n assumes descending sort and is algebraically equivalent to the canonical form above only with the correct sign conventions — we use the ascending form to avoid ambiguity.
The Gini coefficient for crypto assets is almost always > 0.9 (extreme inequality). This is normal: investors, the team, and treasury hold the bulk. What matters isn’t the absolute Gini, but its trend: decreasing = distribution improving, increasing = concentration growing.
2. Exchange Flows
Why it matters: token inflow to exchanges = potential sell pressure. Outflow = accumulation.
- Net_flow > 0 — inflow to exchanges (classically read as bearish) (computed)
- Net_flow < 0 — outflow from exchanges (classically read as bullish) (computed)
- Look at the 7-day moving average, not individual days
The “inflow = bearish, outflow = bullish” rule of thumb was decisively muddied by several structural changes:
- Spot ETFs (BTC, ETH) move large quantities on and off exchange wallets for custody rebalancing between authorized participants and custodians — these flows have nothing to do with retail sell pressure, and can run in either direction irrespective of market sentiment.
- Staking and liquid staking — for PoS assets, outflows increasingly reflect deposits into staking contracts or LSTs, not long-term “cold storage” accumulation. Conversely, unstaking withdrawals may land on exchanges for hedging rather than immediate sale.
- OTC desks and internal transfers — exchanges periodically reshuffle hot/cold wallets, which registers as flow without any underlying user activity.
Treat exchange net flow as a contextual signal, not a standalone trigger. Cross-reference with ETF creations/redemptions, staking deposit contract activity, and stablecoin flows before drawing conclusions.
| Signal | What’s happening | Interpretation |
|---|---|---|
| Large CEX inflow | Whale transferred tokens to exchange | Potential sale within hours/days |
| Sustained outflow | Tokens withdrawn to cold wallets | Accumulation, long-term holding |
| Inflow after unlock | Investors withdraw unlocked tokens to exchange | Expected sell pressure |
3. Wallet Activity
Why it matters: active address count shows real token usage (unlike trading volume, which is easily faked).
| Metric | Definition | What it shows |
|---|---|---|
| Daily Active Addresses (DAA) | Unique addresses with transactions per day | Daily activity |
| New addresses | New addresses that received the token | Audience growth |
| Holding time | Average token holding duration | Holder loyalty |
| Coin age (dormancy) | Average “age” of transferred coins | Old coins moving = large players acting |
4. Staking and DeFi Positions
| Metric | Formula / source | Why |
|---|---|---|
| Staking ratio | Staked_tokens / Total_supply | Shows what share is locked and not pressuring the market |
| DeFi TVL in token | DefiLlama / Dune | Token used as collateral, liquidity |
| LP concentration | Top LPs’ share of the pool | Risk of liquidity withdrawal |
| Unlock schedule | On-chain vesting contract | When to expect sell pressure |
Practical Applications
Auditing a Project’s Tokenomics
When auditing an existing project, a tokenomist verifies on-chain data against a checklist:
On-chain audit checklist
Designing New Tokenomics
When building tokenomics from scratch, on-chain data from comparable projects helps calibrate parameters:
| Parameter | Where to look | What to find |
|---|---|---|
| Target staking ratio | Comparables in the same category | Median value for PoS/DeFi |
| Liquidity pool size | DEX pools of comparables | Depth relative to market cap |
| Post-airdrop distribution speed | Dune: airdrop recipient behavior | % who sold within 7/30 days |
| Active users | DAA of comparables | Realistic forecast for the business plan |
| Optimal liquidity allocation | % of supply in DEX pools of comparables | 5–15% is a commonly observed range in our practice, but highly project-dependent |
Post-Launch Monitoring
After TGE, on-chain monitoring provides early signals:
| Signal | Data source | Action |
|---|---|---|
| Rising CEX inflow | Nansen Exchange Flow | Prepare liquidity, alert the MM |
| Declining staking ratio | Dune: unstaking events | Investigate cause (yield, competitor, panic) |
| Concentration increasing | Dune: top holders | Whale accumulating — assess intent |
| DAA declining | Dune: active addresses | Declining usage — product problem |
| Large treasury mint | Etherscan | Verify alignment with governance decision |
Limitations of On-Chain Analytics
| Limitation | Explanation |
|---|---|
| Incomplete attribution | One person can own hundreds of addresses. CR_10 may show 10 addresses belonging to a single whale |
| CEX addresses = aggregates | An exchange’s balance represents thousands of users. A large exchange wallet ≠ a large holder |
| Private networks | L2 data, private chains, and off-chain settlements may be unavailable |
| Correlation ≠ causation | Inflow to exchange ≠ guaranteed sale. It could be rebalancing, margin collateral, OTC |
| Historical bias | On-chain shows the past. Extrapolation is the analyst’s responsibility |
Holder Concentration Calculator
Enter top-10 wallet balances and total supply to calculate CR_10.
Common Mistakes
On-chain analytics pitfalls
Need on-chain analytics for your tokenomics?
We build monitoring frameworks, identify holder concentration risks, and track emission health for DeFi and GameFi projects.
Get in touch