Supported Connectors
Google Drive
Google Docs, Slides, Sheets
Real-time sync via webhooks. Supports shared drives, nested folders, and collaborative documents.
Notion
Pages, Databases, Blocks
Instant sync of workspace content. Handles rich formatting, embeds, and database properties.
OneDrive
Word, Excel, PowerPoint
Scheduled sync every 4 hours. Supports personal and business accounts with file versioning.
Web Crawler
Web Pages, Documentation
Crawl websites automatically with robots.txt compliance. Scheduled recrawling keeps content up to date.
Quick Start
1. Create Connection
2. Handle OAuth Callback
After the user completes OAuth, the connection is established automatically and sync begins.
3. Monitor Sync Status
How Connectors Work
Authentication Flow
- Create Connection: Call /v3/connections/{provider} to get an OAuth URL (or a direct connection for web-crawler)
- User Authorization: Redirect the user to complete the OAuth flow (not required for web-crawler)
- Automatic Setup: Connection established, sync begins immediately
- Continuous Sync: Real-time updates via webhooks + scheduled sync every 4 hours (or scheduled recrawling for web-crawler)
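
The first step of the flow above can be sketched in Python. This is a minimal illustration, not the service's actual client: only the path /v3/connections/{provider} and the web-crawler name come from this page. The base URL, the POST method, the Bearer authorization header, the redirectUrl field, and the google-drive provider slug are all assumptions made for the example. The request is built but not sent, so the sketch runs without network access.

```python
import json
import urllib.request

# Hypothetical base URL; this page gives only the path /v3/connections/{provider}.
BASE_URL = "https://api.example.com"
# Providers that connect directly instead of going through OAuth (per the flow above).
NO_OAUTH_PROVIDERS = {"web-crawler"}


def needs_user_redirect(provider):
    """OAuth providers need a browser redirect; the web crawler does not."""
    return provider not in NO_OAUTH_PROVIDERS


def build_create_connection_request(provider, api_key, redirect_url=None):
    """Build (but do not send) the request that starts a connection."""
    payload = {}
    if needs_user_redirect(provider) and redirect_url:
        # Assumed field name for where the user lands after OAuth.
        payload["redirectUrl"] = redirect_url
    return urllib.request.Request(
        url=f"{BASE_URL}/v3/connections/{provider}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",  # assumed HTTP method
    )


# "google-drive" is an assumed provider slug used only for illustration.
req = build_create_connection_request(
    "google-drive",
    api_key="YOUR_API_KEY",
    redirect_url="https://yourapp.example/callback",
)
print(req.get_method(), req.full_url)
```

Passing the request to urllib.request.urlopen would send it. For an OAuth provider, the application would then redirect the user to whatever authorization URL the response contains; for web-crawler, no redirect step is needed.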
Document Processing Pipeline
Sync Mechanisms
| Provider | Real-time Sync | Scheduled Sync | Manual Sync |
|---|---|---|---|
| Google Drive | ✅ Webhooks (7-day expiry) | ✅ Every 4 hours | ✅ On-demand |
| Notion | ✅ Webhooks | ✅ Every 4 hours | ✅ On-demand |
| OneDrive | ✅ Webhooks (30-day expiry) | ✅ Every 4 hours | ✅ On-demand |
| Web Crawler | ❌ Not supported | ✅ Scheduled recrawling (7+ days) | ✅ On-demand |
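
The intervals in the table can be expressed as a small lookup, which makes it easy to reason about when a webhook registration lapses and when the next scheduled sync is due. The sketch below is illustrative only: the SyncPolicy class, the function names, and the provider keys are hypothetical, and treating the crawler's "7+ days" as a 7-day minimum is an assumption. The durations themselves are copied from the table.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional


@dataclass(frozen=True)
class SyncPolicy:
    realtime: bool                        # does the provider deliver webhooks?
    webhook_expiry: Optional[timedelta]   # None when the table gives no expiry
    scheduled_interval: timedelta         # gap between scheduled syncs


# Durations copied from the table; the dictionary keys are assumed slugs.
POLICIES = {
    "google-drive": SyncPolicy(True, timedelta(days=7), timedelta(hours=4)),
    "notion": SyncPolicy(True, None, timedelta(hours=4)),
    "onedrive": SyncPolicy(True, timedelta(days=30), timedelta(hours=4)),
    # "7+ days" is treated here as a minimum recrawl interval of 7 days.
    "web-crawler": SyncPolicy(False, None, timedelta(days=7)),
}


def webhook_expires_at(provider, registered_at):
    """Return when a webhook registration lapses, or None if not applicable."""
    policy = POLICIES[provider]
    if not policy.realtime or policy.webhook_expiry is None:
        return None
    return registered_at + policy.webhook_expiry


def next_scheduled_sync(provider, last_sync):
    """Return the earliest time the next scheduled sync would run."""
    return last_sync + POLICIES[provider].scheduled_interval


registered = datetime(2024, 1, 1, tzinfo=timezone.utc)
print(webhook_expires_at("google-drive", registered))
print(next_scheduled_sync("notion", registered))
```

Because every webhook-capable provider also has a 4-hour scheduled sync, a change missed by a lapsed webhook would still be picked up at the next scheduled run.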