Sync S3 Files

POST

integrations

files

curl --request POST \
  --url https://api.carbon.ai/integrations/s3/files \
  --header 'Content-Type: application/json' \
  --header 'authorization: <api-key>' \
  --data '{
  "ids": [
    {
      "id": "<string>",
      "bucket": "<string>",
      "prefix": "<string>"
    }
  ],
  "tags": {},
  "chunk_size": 123,
  "chunk_overlap": 123,
  "skip_embedding_generation": true,
  "embedding_model": "OPENAI",
  "generate_sparse_vectors": true,
  "prepend_filename_to_chunks": true,
  "max_items_per_chunk": 1,
  "set_page_as_boundary": false,
  "data_source_id": 123,
  "request_id": "<string>",
  "use_ocr": true,
  "parse_pdf_tables_with_ocr": true,
  "file_sync_config": {
    "auto_synced_source_types": [
      "ARTICLE"
    ],
    "sync_attachments": false,
    "detect_audio_language": false,
    "transcription_service": "assemblyai",
    "include_speaker_labels": false,
    "split_rows": false,
    "generate_chunks_only": false,
    "store_file_only": false,
    "skip_file_processing": false
  }
}'

{
  "success": true
}

Authorizations

authorization

string

header

required

token <token>, corresponds to temporary access tokens.

Body

application/json

ids

object[]

required

Each input should be one of the following: A bucket name, a bucket name and a prefix, or a bucket name and an object key. A prefix is the common path for all objects you want to sync. Paths should end with a forward slash.

Response

200

application/json

Successful Response

success

boolean

required

Sync S3 Connection List SharePoint Sites

curl --request POST \
  --url https://api.carbon.ai/integrations/s3/files \
  --header 'Content-Type: application/json' \
  --header 'authorization: <api-key>' \
  --data '{
  "ids": [
    {
      "id": "<string>",
      "bucket": "<string>",
      "prefix": "<string>"
    }
  ],
  "tags": {},
  "chunk_size": 123,
  "chunk_overlap": 123,
  "skip_embedding_generation": true,
  "embedding_model": "OPENAI",
  "generate_sparse_vectors": true,
  "prepend_filename_to_chunks": true,
  "max_items_per_chunk": 1,
  "set_page_as_boundary": false,
  "data_source_id": 123,
  "request_id": "<string>",
  "use_ocr": true,
  "parse_pdf_tables_with_ocr": true,
  "file_sync_config": {
    "auto_synced_source_types": [
      "ARTICLE"
    ],
    "sync_attachments": false,
    "detect_audio_language": false,
    "transcription_service": "assemblyai",
    "include_speaker_labels": false,
    "split_rows": false,
    "generate_chunks_only": false,
    "store_file_only": false,
    "skip_file_processing": false
  }
}'

{
  "success": true
}

API Documentation

Health

Auth

Files

User

Web Scrape

Data Source

Gitbook

S3

SharePoint

GitHub

Gmail

Slack

Outlook

Organizations

Tags

Chunks / Embeddings

Retrieval

Webhooks

White Labeling

CRM

Authorizations

Body

Response