Castor EDC connector

Set up the Castor EDC connector in Kaivo: authentication, configuration, the 16 BigQuery tables it syncs, and answers to common questions.

Written By Lauri Raivio

Last updated About 1 hour ago

Kaivo is a fully managed data platform that syncs your Castor EDC data into a Google BigQuery warehouse and keeps it up to date automatically. There is no pipeline to build and no infrastructure to run, so you can spend your time analysing your Castor EDC data instead of moving it.

What is the Castor EDC connector?

Sync your Castor EDC clinical trial data into BigQuery with Kaivo to analyse study data and form completeness.

Category: Tech
Status: Generally available
Authentication: API key
Setup: Self-service

Getting started with the Castor EDC connector

  1. Sign up for Kaivo and create a workspace.
  2. Connect your Castor EDC account.
  3. Choose which tables to sync.
  4. Wait for the initial sync to finish.
  5. Query your data in BigQuery or your favourite AI or BI tool.

Authenticating Castor EDC

Authenticate with your Client Secret.

Field | Description
Client Secret | Your Castor EDC API Client Secret, shown alongside the Client ID under Account → Settings → Castor EDC API.

Prerequisites

Castor EDC uses OAuth client credentials, so you need API credentials from your Castor EDC account:

  1. Sign in to your Castor EDC account, hover over Account and click Settings, then open the Castor EDC API tab.
  2. Generate a new API client and copy the Client ID and Client Secret.

Then enter those values in the connector setup form in Kaivo.
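
If you are curious what an OAuth client credentials exchange looks like, here is a minimal Python sketch. It only builds the request and does not send it. The host pattern `https://<region>.castoredc.com`, the `/oauth/token` path, and the helper name `build_token_request` are assumptions made for illustration and are not confirmed by this article, so check them against your Castor EDC account before relying on them.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_token_request(region: str, client_id: str, client_secret: str) -> Request:
    """Build (but do not send) an OAuth client credentials token request.

    Assumptions: the host is https://<region>.castoredc.com and the token
    endpoint is /oauth/token. Verify both for your own account.
    """
    body = urlencode(
        {
            "grant_type": "client_credentials",
            "client_id": client_id,
            "client_secret": client_secret,
        }
    ).encode("ascii")
    return Request(
        url=f"https://{region}.castoredc.com/oauth/token",
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


# Placeholder credentials; replace with the values from your Castor EDC API tab.
request = build_token_request("data", "my-client-id", "my-client-secret")
print(request.get_method(), request.full_url)
```

You do not need to run anything like this to use the connector; it is only meant to show why both the Client ID and the Client Secret are required.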

Need help? Reach our team via the chat in the bottom-right corner.

Configuring the Castor EDC connector

When you set up the connector, you provide:

Field | Description
URL Region | The Castor server region where your study is hosted (the subdomain of your Castor URL).
Client ID | Your Castor EDC API Client ID, generated under Account → Settings → Castor EDC API.
Start Date | Any data before this date will not be fetched.
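
To make the Start Date rule concrete, the sketch below models the three setup fields and the cutoff check in Python. The class `CastorConfig` and the function `is_fetched` are hypothetical names, and which timestamp the connector actually compares against the Start Date is an assumption that this article does not specify.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class CastorConfig:
    """Hypothetical container mirroring the three setup fields."""

    url_region: str   # subdomain of your Castor URL, for example "data"
    client_id: str    # generated under Account -> Settings -> Castor EDC API
    start_date: date  # data dated before this is not fetched


def is_fetched(record_date: date, config: CastorConfig) -> bool:
    """Return True when a record is dated on or after the start date."""
    return record_date >= config.start_date


config = CastorConfig(url_region="data", client_id="my-client-id", start_date=date(2024, 1, 1))
print(is_fetched(date(2023, 12, 31), config))  # False: before the start date
print(is_fetched(date(2024, 1, 1), config))    # True: on the start date
```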

Tables and columns synced from Castor EDC

Kaivo syncs 16 tables from Castor EDC into a dedicated dataset in your BigQuery warehouse.

How the Castor EDC sync works

After the first load, Kaivo keeps your BigQuery warehouse up to date for you. Where Castor EDC supports it, each sync pulls only new and changed records so it stays fast; otherwise it refreshes the whole table. Every record keeps its original ID, so you won't get duplicate rows.
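
The idea that records keep their original ID and therefore never duplicate can be sketched as a simple merge keyed on that ID. This is an illustration of the general technique, not Kaivo's actual implementation; the column name `id` and the function `upsert_by_id` are assumptions.

```python
def upsert_by_id(existing, changed):
    """Merge changed rows into existing rows, keyed on the record ID.

    A row whose ID already exists replaces the old version; a row with a
    new ID is appended. No ID can appear twice in the result.
    """
    merged = {row["id"]: row for row in existing}
    for row in changed:
        merged[row["id"]] = row
    return list(merged.values())


warehouse = [
    {"id": "a", "status": "open"},
    {"id": "b", "status": "open"},
]
# An incremental sync delivers one changed record and one new record.
delta = [
    {"id": "b", "status": "closed"},
    {"id": "c", "status": "open"},
]
result = upsert_by_id(warehouse, delta)
print(len(result))  # 3 rows: "b" was updated in place, "c" was added
```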

Frequently asked questions

How long does the initial sync take for Castor EDC?

It depends on how much history is in your Castor EDC account. Most initial syncs finish within minutes, while large accounts can take a few hours. After that, syncs only fetch new and changed records, so they're much faster.

Can I sync only some tables or columns?

Yes. You pick which tables to sync when you set up the connection and can change the selection later. Tables you don't select are never copied to your warehouse.

What happens when Castor EDC's schema changes?

New fields are never added automatically. You choose which fields to sync, so data you haven't selected (sensitive personal data, for example) never lands in your warehouse. When a new field appears, it becomes available for you to add. What happens to removed or renamed fields depends on a table's sync mode: full-refresh tables always match what's currently in Castor EDC, so dropped fields disappear, while incremental tables keep their existing columns and history, so an old field stays and newly added fields fill in over time.
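
The difference between the two sync modes can be modelled in a few lines of Python. This is a simplified sketch of the behaviour described above, not Kaivo's code; the function name `columns_after_sync`, the mode strings, and the field names are all hypothetical.

```python
def columns_after_sync(warehouse_columns, source_fields, selected_fields, mode):
    """Return the warehouse columns after a sync, following the rules above.

    Only fields you have selected are ever synced. Full-refresh tables
    mirror the source; incremental tables keep every existing column.
    """
    selected_in_source = [f for f in source_fields if f in selected_fields]
    if mode == "full_refresh":
        return selected_in_source
    if mode == "incremental":
        kept = list(warehouse_columns)
        kept += [f for f in selected_in_source if f not in kept]
        return kept
    raise ValueError(f"unknown sync mode: {mode}")


warehouse = ["id", "age", "weight"]
source = ["id", "age", "height"]       # "weight" was removed, "height" is new
selected = {"id", "age", "weight"}     # "height" has not been selected yet

print(columns_after_sync(warehouse, source, selected, "full_refresh"))  # ['id', 'age']
print(columns_after_sync(warehouse, source, selected, "incremental"))   # ['id', 'age', 'weight']
```

In both modes `height` stays out of the warehouse until you add it to your selection.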

How do I handle GDPR or data deletion requests?

Your data lives in your own Kaivo-managed BigQuery warehouse, so the most direct option is to delete or anonymise specific records right in BigQuery. If you delete data in Castor EDC instead, full-refresh tables drop it on the next sync, while incremental tables keep it, so you would need to remove the row in BigQuery yourself or ask us to run a full refresh. To remove everything, delete the Castor EDC connector in Kaivo and all of its synced data is deleted with it.
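
As a rough illustration of deleting specific records in BigQuery, the Python sketch below builds a parameterised DELETE statement without running it. The dataset name `castor_edc`, the table name `your_table`, the column name `id`, and the helper `build_delete_statement` are placeholders and assumptions; use the real names from your own warehouse.

```python
import re

# Table and column names cannot be passed as query parameters,
# so they are checked against a conservative pattern instead.
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def build_delete_statement(dataset, table, id_column, record_ids):
    """Return a BigQuery DELETE statement and its named parameter values."""
    for name in (dataset, table, id_column):
        if not _IDENTIFIER.fullmatch(name):
            raise ValueError(f"unsafe identifier: {name!r}")
    sql = (
        f"DELETE FROM `{dataset}.{table}` "
        f"WHERE {id_column} IN UNNEST(@record_ids)"
    )
    return sql, {"record_ids": list(record_ids)}


sql, params = build_delete_statement("castor_edc", "your_table", "id", ["rec-001", "rec-002"])
print(sql)
print(params)
```

Passing the record IDs as a named parameter rather than pasting them into the SQL text avoids quoting mistakes. Remember that the same record may appear in more than one synced table.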

Common use cases for Castor EDC data

Data completeness

Use study_fields and study_form to find missing or incomplete records across sites.
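
A completeness check boils down to comparing the fields a form expects with the values actually entered. The Python sketch below shows that logic on in-memory data; the keys `form_id` and `values`, the field names, and the function `missing_fields` are hypothetical and do not reflect the real columns of study_fields or study_form.

```python
def missing_fields(required_by_form, record):
    """List the required fields that are absent or empty in a record."""
    required = required_by_form.get(record["form_id"], [])
    values = record["values"]
    return [field for field in required if values.get(field) in (None, "")]


# Hypothetical data: which fields each form requires, and one entered record.
required_by_form = {"baseline": ["age", "weight", "consent"]}
record = {"form_id": "baseline", "values": {"age": 42, "weight": ""}}

print(missing_fields(required_by_form, record))  # ['weight', 'consent']
```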

Site reporting

Join study_site with study_statistics to compare data collection across sites.
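
The join itself is a lookup on a shared site key. Here is a small Python sketch of the idea, using made-up rows; the column names `site_id`, `name`, and `record_count`, and the function `records_per_site`, are assumptions rather than the actual schema of study_site and study_statistics.

```python
def records_per_site(sites, statistics):
    """Pair each site with its record count, largest first.

    Sites with no matching statistics row are reported with a count of 0.
    """
    counts = {row["site_id"]: row["record_count"] for row in statistics}
    report = [(site["name"], counts.get(site["site_id"], 0)) for site in sites]
    return sorted(report, key=lambda item: item[1], reverse=True)


sites = [
    {"site_id": 1, "name": "Site A"},
    {"site_id": 2, "name": "Site B"},
    {"site_id": 3, "name": "Site C"},
]
statistics = [
    {"site_id": 1, "record_count": 120},
    {"site_id": 2, "record_count": 340},
]
print(records_per_site(sites, statistics))
# [('Site B', 340), ('Site A', 120), ('Site C', 0)]
```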

Audit trail

Use audit_trial to track changes to study data over time.

Use Castor EDC data in your AI and BI tools

Once Castor EDC data lands in your Kaivo-managed BigQuery warehouse, you can explore it with AI tools or any BI tool that connects to BigQuery. Here's how the most common destinations work with Castor EDC data.

Claude

Use Kaivo's MCP server to give Claude secure, workspace-scoped access to your data.

Power BI

Microsoft's BI tool with a native BigQuery connector. Supports direct query and scheduled refresh.

Data Studio

Free Google BI tool with native BigQuery support. One-click connection to your Kaivo warehouse; great for SMB teams on Google Workspace.

Tableau

The premium analytics standard, with native BigQuery integration.

Google Sheets

Use Connected Sheets to query BigQuery directly from a spreadsheet, with no SQL.

Excel

Connect via Power Query's BigQuery connector.

Metabase

Open-source BI tool with strong BigQuery support.

See our pricing page for Castor EDC connector pricing and plan details.
