Catalog data sources

Published on April 9th, 2026

Overview

Data sources are how you pull structured data from the outside world into Roadie’s catalog datastore.

Each data source is part of an integration.

Data source runs can be scheduled so the underlying data stored in Roadie stays fresh before workflows read that data and turn it into catalog entities.

Open Data sources in the catalog administration experience to:

  • Create and edit sources that map integration responses into datastore objects
  • See object counts, last run times, and status at a glance
  • Drill into a source to test extraction and adjust configuration

Data sources are usually the upstream step in a generation pipeline: sync objects to a data source, then consume them with Data source nodes in Entity Workflows.

datasource-list

Further reading