Overview
ClyptQ Data System integrates multi-asset class data and provides consistent data access from development to production through environment-based automatic routing.
Data Sources
Supported Asset Classes:
- Crypto: Binance, Coinbase, Gate.io (spot & futures)
- Stocks: Yahoo Finance, Tiingo
- Macro: FRED (economic indicators)
- On-Chain: CryptoQuant (on-chain data)
Supported Data Types:
- OHLCV: Price candles (1m, 5m, 1h, 1d, etc.)
- Orderbook: L2 order book snapshots
- Tick: Trade execution data
- Fundamentals: Financial data
- Alternative: News, social media
Observation Specs
Data is defined with Specs:
Storage Architecture
Environment-based Automatic Routing:

| Environment | Storage Backend | Path |
|---|---|---|
| dev | Local Parquet | ./data |
| prod | EFS Parquet | /efs/data |
| kernel | Backend API | HTTP |
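
The routing table above can be read as a simple lookup from environment name to storage target. The following is a minimal sketch of that idea, not ClyptQ's actual implementation: the `StorageTarget` class, the `resolve_storage` function, the backend identifiers, and the `CLYPTQ_ENV` variable name are all hypothetical and chosen only for illustration.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageTarget:
    backend: str   # hypothetical identifier, e.g. "local_parquet"
    location: str  # filesystem path, or "HTTP" for the API-backed kernel


# Mirrors the routing table; keys are the environment names.
ROUTES = {
    "dev": StorageTarget("local_parquet", "./data"),
    "prod": StorageTarget("efs_parquet", "/efs/data"),
    "kernel": StorageTarget("backend_api", "HTTP"),
}


def resolve_storage(env=None):
    """Pick the storage target for an environment.

    CLYPTQ_ENV is an assumed variable name used only in this sketch.
    """
    if env is None:
        env = os.environ.get("CLYPTQ_ENV", "dev")
    try:
        return ROUTES[env]
    except KeyError:
        raise ValueError(f"unknown environment: {env!r}") from None
```

Because strategy code would only ever call something like `resolve_storage()`, the same code path could run unchanged in dev, prod, and kernel environments.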
Data Loading
Backtest Mode:
Data Validation
ValidationReport: Automatic validation on all data loading:
- Coverage: All requested symbols loaded
- NaN Ratio: Ratio of invalid data
- Time Range: Covers requested period
- Timestamps: Sufficient data points
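
To make the four checks concrete, here is a rough sketch of how such a report might be computed. It is an illustration under stated assumptions rather than the real `ValidationReport`: the field names, the `validate` function, the `min_points` parameter, and the input shape (a dict mapping each symbol to a time-sorted list of `(timestamp, value)` pairs) are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class ValidationReport:
    missing_symbols: list  # Coverage: requested symbols that were not loaded
    nan_ratio: float       # NaN Ratio: share of loaded values that are NaN
    covers_range: bool     # Time Range: loaded data spans the requested period
    enough_points: bool    # Timestamps: each loaded symbol has enough rows


def validate(data, symbols, start, end, min_points=2):
    """Check loaded data against the request.

    `data` maps symbol -> list of (timestamp, value), sorted by timestamp.
    The range and point-count checks apply only to symbols that loaded.
    """
    missing = [s for s in symbols if not data.get(s)]
    loaded = [data[s] for s in symbols if data.get(s)]
    rows = [row for series in loaded for row in series]
    nan_count = sum(1 for _, value in rows if math.isnan(value))
    nan_ratio = nan_count / len(rows) if rows else 1.0
    covers = bool(loaded) and all(
        series[0][0] <= start and series[-1][0] >= end for series in loaded
    )
    enough = bool(loaded) and all(len(series) >= min_points for series in loaded)
    return ValidationReport(missing, nan_ratio, covers, enough)
```

A caller could then decide, based on its own thresholds, whether a given `nan_ratio` or a non-empty `missing_symbols` list should block execution.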
Data Preprocessing
Forward-Fill (FFill): Missing data is automatically FFilled:
- First missing: `valid=False` (cannot FFill)
- Subsequent missing: FFill with last valid value → `valid=True, updated=False`
- New value: `valid=True, updated=True`
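
The three cases above can be sketched as a small forward-fill routine. This is an illustrative sketch only: the `Observation` class and `ffill` function are hypothetical names, missing values are assumed to arrive as `None` or NaN, and setting `updated=False` for the unfillable first case is an assumption, since the rules above only state `valid=False` there.

```python
import math
from dataclasses import dataclass


@dataclass
class Observation:
    value: float
    valid: bool
    updated: bool


def ffill(values):
    """Forward-fill a sequence where None or NaN marks a missing value."""
    out = []
    last = None  # last valid value seen so far
    for v in values:
        missing = v is None or (isinstance(v, float) and math.isnan(v))
        if not missing:
            # New value: valid and freshly updated.
            last = v
            out.append(Observation(v, valid=True, updated=True))
        elif last is not None:
            # Subsequent missing: reuse the last valid value.
            out.append(Observation(last, valid=True, updated=False))
        else:
            # First missing: nothing to fill from yet.
            out.append(Observation(math.nan, valid=False, updated=False))
    return out
```

Keeping `updated` separate from `valid` lets a strategy distinguish a genuinely new tick from a carried-forward one, for example to avoid reacting twice to the same price.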
Live Data Handling
Warmup + Gap Filling:
Symbol-Source Mapping
SymbolSourceMap maps symbols to specs:
- BTC loads from both ohlcv_1m + orderbook
- ETH loads from ohlcv_1m only
- AAPL loads from stock_1d only
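
The mapping described above is essentially a symbol-to-spec-names table. The sketch below uses a plain dict as a stand-in for `SymbolSourceMap` (whose real constructor and methods are not shown here) and a hypothetical helper, `symbols_by_spec`, that inverts it so each spec knows which symbols to load.

```python
from collections import defaultdict

# Plain-dict stand-in for SymbolSourceMap: symbol -> spec names.
SYMBOL_SOURCES = {
    "BTC": ["ohlcv_1m", "orderbook"],
    "ETH": ["ohlcv_1m"],
    "AAPL": ["stock_1d"],
}


def symbols_by_spec(symbol_sources):
    """Invert the mapping so each spec lists the symbols it must load."""
    grouped = defaultdict(list)
    for symbol, specs in symbol_sources.items():
        for spec in specs:
            grouped[spec].append(symbol)
    return dict(grouped)
```

With the example mapping, `symbols_by_spec(SYMBOL_SOURCES)` groups BTC and ETH under `ohlcv_1m`, BTC alone under `orderbook`, and AAPL under `stock_1d`, so each source is queried once for exactly the symbols that need it.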
Redis Integration
Multi-Consumer Optimization: When multiple strategies use the same data:
- Single WebSocket connection
- Multiple strategies consume same data
- Reduced collector load
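
The fan-out pattern behind this optimization can be shown without Redis at all. The following in-process sketch is a stand-in for a pub/sub channel, not the Redis client API or ClyptQ's collector: the `FanOut` class and the `upstream_connections` counter are hypothetical and exist only to show that adding consumers does not add upstream connections.

```python
from collections import defaultdict


class FanOut:
    """In-process stand-in for a pub/sub channel (not Redis itself)."""

    def __init__(self):
        self._subscribers = defaultdict(list)
        self.upstream_connections = 0  # simulated WebSocket connections

    def subscribe(self, channel, callback):
        # Open the simulated upstream feed only for the first subscriber.
        if not self._subscribers[channel]:
            self.upstream_connections += 1
        self._subscribers[channel].append(callback)

    def publish(self, channel, message):
        # One message from the collector reaches every subscribed strategy.
        for callback in self._subscribers[channel]:
            callback(message)
        return len(self._subscribers[channel])
```

Subscribing two strategies to the same channel leaves `upstream_connections` at 1 while both callbacks receive every published message, which is the load reduction the list above describes.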
Metadata Catalog
Pre-Execution Validation: Check data availability before execution:
Best Practices
1. Pre-Execution Validation
Performance
Typical Sizes:
- 1 month 1m OHLCV, 100 symbols: ~200 MB
- 1 year 1d stock, 500 symbols: ~50 MB
- Live orderbook, 20 symbols: ~5 MB cache
Typical Latencies:
- Backtest (1 month, 100 symbols): 2-5 seconds
- Live warmup (100 ticks): 1-3 seconds
- Redis subscribe: less than 100ms
Ecosystem Value
For Builders:
- Immediate access to diverse data sources
- Production infrastructure provided
- Identical to development environment
- Data quality guaranteed
- Transparent validation process
- Reproducible results
