The SharePoint connector synchronizes supported text files from SharePoint document libraries via the Microsoft Graph API.
Authentication
OAuth — requires scopes:
- Sites.Read.All
- Files.Read.All
Configuration
| Field | Type | Required | Description |
|---|---|---|---|
| Site URL | Input | Yes | SharePoint site URL |
| Folder Path | Input | No | Restricts the sync to a specific document library path |
| Max Files | Input | No | Maximum number of files to sync |
Supported File Types
.txt, .md, .html, .htm, .csv, .json, .xml, .yaml, .yml, .log, .rst, .tsv
Size Limit
10 MB per file.
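The extension list and size limit above can be combined into a single eligibility check. The following is a minimal sketch, not the connector's actual code: the function name `is_syncable` is hypothetical, and it assumes that extension matching is case-insensitive, that "10 MB" means 10 × 1024 × 1024 bytes, and that a file exactly at the limit is accepted.

```python
from pathlib import PurePosixPath

# Extensions taken from the "Supported File Types" list.
SUPPORTED_EXTENSIONS = {
    ".txt", ".md", ".html", ".htm", ".csv", ".json",
    ".xml", ".yaml", ".yml", ".log", ".rst", ".tsv",
}

# Assumption: "10 MB" is interpreted as 10 MiB.
MAX_FILE_SIZE_BYTES = 10 * 1024 * 1024


def is_syncable(name: str, size_bytes: int) -> bool:
    """Return True if the file has a supported extension and fits the size limit."""
    # Assumption: extensions are compared case-insensitively.
    extension = PurePosixPath(name).suffix.lower()
    return extension in SUPPORTED_EXTENSIONS and size_bytes <= MAX_FILE_SIZE_BYTES
```

Under these assumptions, `is_syncable("Notes.MD", 1024)` is accepted, while a PDF or an 11 MB CSV file is rejected.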
How It Works
- Site Resolution: Resolves the SharePoint site URL to a site ID
- Folder Traversal: Lists folders recursively, tracking its position with a pagination state cursor
- File Filtering: Keeps only files with supported text extensions
- Content Fetching: Downloads file content and converts HTML to plain text
- Deferred Content: Emits stubs during listing and fetches full content on demand
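Two of the steps above lend themselves to a short illustration: site resolution and HTML-to-text conversion. The sketch below is an assumption about how they could be done, not the connector's implementation. It builds the Microsoft Graph path-based site lookup URL (`/sites/{hostname}:/{path}`), whose response contains the site ID, and strips tags with Python's standard `html.parser`. The helper names `site_lookup_url` and `html_to_text` are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import quote, unquote, urlparse

GRAPH_BASE = "https://graph.microsoft.com/v1.0"


def site_lookup_url(site_url: str) -> str:
    """Build the Graph URL that resolves a SharePoint site URL to a site resource."""
    parsed = urlparse(site_url)
    if not parsed.hostname:
        raise ValueError(f"Not an absolute site URL: {site_url!r}")
    # Decode first so an already-encoded path is not encoded twice.
    path = quote(unquote(parsed.path.strip("/")))
    if not path:
        # Root site of the tenant: address it by hostname only.
        return f"{GRAPH_BASE}/sites/{parsed.hostname}"
    return f"{GRAPH_BASE}/sites/{parsed.hostname}:/{path}"


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self) -> None:
        super().__init__()
        self._chunks: list[str] = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self._chunks.append(data)

    def text(self) -> str:
        # Chunks are joined with spaces and whitespace is collapsed; this is a
        # simplification that can split words wrapped in inline tags.
        return " ".join(" ".join(self._chunks).split())


def html_to_text(html: str) -> str:
    """Convert an HTML document to whitespace-normalized plain text."""
    extractor = _TextExtractor()
    extractor.feed(html)
    extractor.close()
    return extractor.text()
```

For example, `site_lookup_url("https://contoso.sharepoint.com/sites/Marketing")` yields `https://graph.microsoft.com/v1.0/sites/contoso.sharepoint.com:/sites/Marketing`; the `id` field of the response to that request would be the site ID used in later calls.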
Pagination State
The cursor encodes {folderStack[], currentFolder, pageUrl}, so a recursive traversal can resume where the previous sync run stopped.
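A cursor with these three fields could be serialized and advanced roughly as follows. This is a sketch under stated assumptions: the base64-encoded JSON wire format, the depth-first visiting order, and the helper names `encode_cursor`, `decode_cursor`, and `advance` are all illustrative rather than taken from the connector. Only the field names come from the description above.

```python
import base64
import json
from dataclasses import asdict, dataclass, field
from typing import Optional


@dataclass
class Cursor:
    # Field names mirror the documented cursor; their exact semantics are assumed.
    folderStack: list = field(default_factory=list)  # folders still to visit
    currentFolder: Optional[str] = None               # folder being listed
    pageUrl: Optional[str] = None                     # next page within that folder


def encode_cursor(cursor: Cursor) -> str:
    """Serialize the cursor to an opaque string (assumed format: base64 JSON)."""
    raw = json.dumps(asdict(cursor), separators=(",", ":")).encode("utf-8")
    return base64.urlsafe_b64encode(raw).decode("ascii")


def decode_cursor(token: Optional[str]) -> Cursor:
    """Restore a cursor; an empty token starts a fresh traversal."""
    if not token:
        return Cursor()
    return Cursor(**json.loads(base64.urlsafe_b64decode(token)))


def advance(cursor: Cursor, subfolders: list, next_page_url: Optional[str]) -> Optional[Cursor]:
    """Return the cursor for the next request, or None when traversal is done."""
    # Subfolders found on the page just listed are queued for later visits.
    stack = cursor.folderStack + subfolders
    if next_page_url:
        # The current folder has more pages: stay in it.
        return Cursor(stack, cursor.currentFolder, next_page_url)
    if stack:
        # Current folder exhausted: descend into the most recently queued folder.
        return Cursor(stack[:-1], stack[-1], None)
    return None
```

Because the whole traversal state lives in the token, a sync run can stop after any page and a later run can continue from `decode_cursor(token)` without re-listing folders that were already visited.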
Metadata Tags
| Tag | Type | Description |
|---|---|---|
| Path | Text | Document path in SharePoint |
| Last Modified | Date | Last modification time |
| Created Date | Date | Creation date |
| Created By | Text | Original creator |
| File Size | Number | File size in bytes |
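As an illustration of where these tags might come from, the sketch below maps a Microsoft Graph `driveItem` JSON object to the five tags in the table. The field names (`name`, `size`, `createdDateTime`, `lastModifiedDateTime`, `createdBy.user.displayName`, `parentReference.path`) are standard `driveItem` properties, but the mapping itself, including how the Path tag is assembled, is an assumption rather than the connector's documented behavior. The function name `to_metadata_tags` is hypothetical.

```python
def to_metadata_tags(item: dict) -> dict:
    """Map a Graph driveItem dict to the connector's metadata tags (assumed mapping)."""
    parent_path = item.get("parentReference", {}).get("path", "")
    # Graph parent paths look like "/drives/{drive-id}/root:/Folder";
    # keep only the part after "root:" as the folder path.
    _, _, folder = parent_path.partition("root:")
    return {
        "Path": f"{folder}/{item['name']}",
        "Last Modified": item.get("lastModifiedDateTime"),  # ISO 8601 string
        "Created Date": item.get("createdDateTime"),        # ISO 8601 string
        "Created By": item.get("createdBy", {}).get("user", {}).get("displayName"),
        "File Size": item.get("size"),                      # bytes
    }
```

Missing optional fields yield `None` rather than raising, so a partially populated item still produces a complete tag dictionary.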

