
Build dashboards, automate reports, and ask questions in plain English — all from your ClickHouse data, no complex infrastructure to maintain.
Want it to watch your ClickHouse data and act on its own? Meet the ClickHouse agent →
ClickHouse is a column-oriented OLAP database, and tap-clickhouse replicates its tables into your warehouse by reflecting the schema dynamically rather than pulling fixed business objects. It discovers every user table across non-system databases and supports both full-table and incremental sync on a monotonic key.
tap-clickhouse discovers all user-defined tables across non-system ClickHouse databases automatically through SQLAlchemy reflection; the system, information_schema, and INFORMATION_SCHEMA databases are excluded. Each table becomes a stream with columns mapped to JSON Schema types, including ClickHouse-specific handling for Enum, Nullable, LowCardinality, and DateTime64 columns. This lets you replicate whatever analytical or event data you already store in ClickHouse for downstream modeling and reporting.
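To make the column mapping concrete, here is a minimal Python sketch of how ClickHouse type strings could be turned into JSON Schema fragments. It is an illustration under stated assumptions, not the connector's actual code: the function name `clickhouse_type_to_json_schema` and the exact output shapes are hypothetical, and only a handful of types are covered.

```python
import re


def clickhouse_type_to_json_schema(ch_type: str) -> dict:
    """Map a ClickHouse type string to a JSON Schema fragment.

    Hypothetical helper for illustration only; the real connector may
    map types differently.
    """
    ch_type = ch_type.strip()
    nullable = False

    # Peel off wrapper types such as LowCardinality(Nullable(String)).
    while True:
        m = re.fullmatch(r"(LowCardinality|Nullable)\((.*)\)", ch_type, re.DOTALL)
        if not m:
            break
        if m.group(1) == "Nullable":
            nullable = True
        ch_type = m.group(2).strip()

    if re.fullmatch(r"Enum(8|16)?\(.*\)", ch_type, re.DOTALL):
        # Enum8('a' = 1, 'b' = 2) -> string restricted to the labels.
        labels = re.findall(r"'((?:[^'\\]|\\.)*)'\s*=\s*-?\d+", ch_type)
        schema = {"type": "string", "enum": labels}
    elif re.match(r"DateTime(64)?\b", ch_type):
        # DateTime and DateTime64(precision[, tz]) -> ISO 8601 string.
        schema = {"type": "string", "format": "date-time"}
    elif re.fullmatch(r"U?Int(8|16|32|64)", ch_type):
        schema = {"type": "integer"}
    elif re.fullmatch(r"Float(32|64)", ch_type):
        schema = {"type": "number"}
    else:
        # Assumption: anything unrecognized falls back to a string.
        schema = {"type": "string"}

    if nullable:
        schema["type"] = [schema["type"], "null"]
        if "enum" in schema:
            # A nullable enum must also accept null as a value.
            schema["enum"] = schema["enum"] + [None]
    return schema
```

For example, `clickhouse_type_to_json_schema("LowCardinality(Nullable(String))")` yields a schema that accepts either a string or null, because both wrappers are removed before the base type is mapped.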
Per-table incremental replication is driven by a monotonic replication key such as updated_at or created_at: records are ordered by that key and filtered against the stored bookmark, so only new or changed rows are pulled on each run. Full-table sync is used when no incremental key is configured. This keeps large ClickHouse tables synced efficiently for ongoing analytics without re-reading the full dataset each time.
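The bookmark logic can be sketched in a few lines of Python. This is a simplified in-memory model, not the connector's implementation: `sync_stream` is a hypothetical name, rows are plain dictionaries rather than query results, and the inclusive `>=` comparison is an assumption (it re-emits rows that share the bookmark value, so ties are never skipped).

```python
from typing import Any, Iterable, Optional


def sync_stream(
    rows: Iterable[dict],
    replication_key: Optional[str] = None,
    bookmark: Any = None,
) -> tuple[list[dict], Any]:
    """Return (records_to_emit, new_bookmark) for one sync run.

    Hypothetical helper for illustration only.
    """
    if replication_key is None:
        # No incremental key configured: full-table sync, no bookmark.
        return list(rows), None

    # Order records by the replication key.
    ordered = sorted(rows, key=lambda r: r[replication_key])

    # Keep only rows at or after the stored bookmark (assumed inclusive).
    if bookmark is not None:
        ordered = [r for r in ordered if r[replication_key] >= bookmark]

    # The highest key seen becomes the bookmark for the next run.
    new_bookmark = ordered[-1][replication_key] if ordered else bookmark
    return ordered, new_bookmark


# Example: two consecutive runs over a growing table.
table = [
    {"id": 1, "updated_at": "2024-01-01T00:00:00"},
    {"id": 2, "updated_at": "2024-01-03T00:00:00"},
    {"id": 3, "updated_at": "2024-01-02T00:00:00"},
]
first_run, state = sync_stream(table, "updated_at")
table.append({"id": 4, "updated_at": "2024-01-04T00:00:00"})
second_run, state = sync_stream(table, "updated_at", state)
```

In this example the first run emits all three rows in key order, and the second run emits only the row at the bookmark plus the newly added one. ISO 8601 timestamps in a uniform format compare correctly as strings, which is why plain string comparison is enough here.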
Authenticate ClickHouse in a few clicks. OAuth, API key, or IAM role — we handle secrets and rotation.
We pull every stream into your warehouse. CDC where the API supports it; full + incremental otherwise. Hourly-or-faster, row-level secure.
SQL, dashboards, or ask Fi in plain English. Your ClickHouse data lives next to every other source — ready to join.
Build your own with the Definite SDK, or request it. Most go live in days.
Join ClickHouse with the rest of your data, then ask Fi questions across all of it.