
Build dashboards, automate reports, and ask questions in plain English — all from your S3 data, no complex infrastructure to maintain.
Have multiple S3 accounts? Analytics across multiple S3 accounts →
Reads files stored in Amazon S3 (and S3‑compatible object stores) such as CSV, JSON Lines, Parquet, and Avro, applying glob/prefix filters and optional schema inference to turn files into tabular records. This enables centralizing raw data dumps, exports, logs, and other file-based datasets for downstream analytics.
A logical table composed of files in a bucket matched by a prefix or glob with optional schema inference. Enables analysis of dataset freshness, completeness, row counts, and ingestion latency across file-based exports and logs.
An individual S3 object (file) and its metadata such as size and last modified time. Supports monitoring of file delivery SLAs, volume growth, partition coverage, and file-level ingestion errors.
Row-level records parsed from file contents (CSV, JSON Lines, Parquet, Avro). Powers downstream KPIs and aggregations like volumes, trends, and cohort analyses once ingested.
The inferred columns and data types derived from the dataset’s files. Enables tracking of schema drift, type conformance, and data quality validation.
Uses your AWS Access Key ID and Secret (or an assumed IAM role) to authenticate; can read public buckets without credentials
Requires a S3 account to connect.
Operational data, performance metrics, and business insights.
Authenticate S3 in a few clicks. OAuth, API key, or IAM role — we handle secrets and rotation.
We pull every stream into your warehouse. CDC where the API supports it; full + incremental otherwise. Hourly-or-faster, row-level secure.
SQL, dashboards, or ask Fi in plain English. Your S3 data lives next to every other source — ready to join.
Build your own with the Definite SDK, or ask us — we add new connectors every week.
Join S3 with the rest of your data, then ask Fi questions across all of it.