Data Ingestion
Evaluate an agent's ability to ingest source data reliably and correctly.
What this competency is
Designing robust ingestion from source systems into data platforms, including batch, streaming, change data capture (CDC), and API-based integration.
Why it matters
Ingestion is where source reality enters the platform. Errors here propagate downstream and compromise trust across all analytics and ML workflows.
What to evaluate in agents
- Selection of ingestion pattern based on latency, volume, and source constraints.
- Handling of idempotency, deduplication, and late-arriving data.
- Approach to backfills, replay, and incremental loads.
- Clear treatment of source contracts and schema drift.
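Several of the behaviors above (idempotency, deduplication, incremental loads) can be illustrated with a minimal sketch. This is a hypothetical example, not a reference implementation: `ingest_increment`, the integer `updated_at` watermark, and the dict-backed target are all simplifying assumptions standing in for a real source query and sink.

```python
def ingest_increment(source_rows, watermark, target):
    """Incremental load sketch: process only rows newer than the
    watermark, and upsert by key so replays and duplicate deliveries
    are harmless (idempotent writes). Returns the advanced watermark,
    which the caller would persist as a checkpoint for the next run."""
    new_watermark = watermark
    for row in source_rows:
        if row["updated_at"] <= watermark:
            continue  # already covered by a previous run; skip on replay
        # Upsert: last write per key wins, so re-delivery deduplicates itself.
        target[row["id"]] = row
        if row["updated_at"] > new_watermark:
            new_watermark = row["updated_at"]
    return new_watermark
```

Because the write is a keyed upsert and the watermark filters already-seen rows, running the same batch twice leaves the target unchanged, which is the property an evaluator should look for.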
Strong signals
- Describes checkpoints or watermarks for incremental processing.
- Includes retry, dead-letter, and replay strategy.
- Distinguishes full reload from incremental and CDC paths.
- Addresses source rate limits and operational constraints.
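A "retry, dead-letter, and replay strategy" can be as small as bounded retries plus a dead-letter collection that preserves failed records for later reprocessing. The sketch below is illustrative; `process_with_retry`, its `handler` callback, and the list-based dead-letter queue are assumptions, not a real framework API.

```python
import time

def process_with_retry(records, handler, max_attempts=3, backoff_s=0.0):
    """Retry each record a bounded number of times; route permanent
    failures to a dead-letter list instead of dropping them or
    blocking the whole batch. Dead-lettered records stay replayable."""
    succeeded, dead_letter = [], []
    for rec in records:
        for attempt in range(1, max_attempts + 1):
            try:
                handler(rec)
                succeeded.append(rec)
                break
            except Exception:
                if attempt == max_attempts:
                    dead_letter.append(rec)  # exhausted retries: park for replay
                else:
                    time.sleep(backoff_s * attempt)  # linear backoff between attempts
    return succeeded, dead_letter
```

The design choice worth probing in an agent's answer is that a poison record lands in the dead-letter path rather than stalling the pipeline, and that the dead-letter output is structured enough to replay once the defect is fixed.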
Weak signals
- Assumes exactly-once semantics without a design that supports them (e.g., idempotent writes or transactional sinks).
- Omits replay and backfill behavior.
- Ignores source-side changes and contract breaks.
- Uses one ingestion pattern for all workloads without trade-off analysis.
Example evaluation prompts
- "Design CDC ingestion from Postgres into a lakehouse with hourly SLA."
- "Propose a resilient API ingestion pattern for rate-limited vendor data."
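For the second prompt, one shape of answer an evaluator might expect is cursor-based pagination with exponential backoff on rate-limit responses. The sketch below assumes a hypothetical `fetch_page(cursor)` client that returns `(rows, next_cursor)` and raises `RateLimited` on a 429; none of these names come from a real vendor SDK.

```python
import time

class RateLimited(Exception):
    """Stand-in for the client error raised when the vendor returns HTTP 429."""

def fetch_all(fetch_page, max_retries=3, backoff_base=0.0):
    """Drain a paginated, rate-limited API. On RateLimited, back off
    exponentially and retry the same page; give up after max_retries
    so a hard outage surfaces instead of looping forever."""
    rows, cursor = [], None
    while True:
        for attempt in range(max_retries + 1):
            try:
                page, cursor = fetch_page(cursor)
                break
            except RateLimited:
                if attempt == max_retries:
                    raise  # persistent throttling: fail loudly for the operator
                time.sleep(backoff_base * (2 ** attempt))  # exponential backoff
        rows.extend(page)
        if cursor is None:
            return rows  # final page reached
```

Strong answers also keep the cursor as a checkpoint (so an interrupted run resumes rather than restarting) and respect any `Retry-After` hint the vendor provides instead of a fixed backoff schedule.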