POST
/
integrations
/
files
/
sync
curl --request POST \
  --url https://api.carbon.ai/integrations/files/sync \
  --header 'Content-Type: application/json' \
  --header 'authorization: <api-key>' \
  --data '{
  "data_source_id": 123,
  "ids": [
    "<string>"
  ],
  "tags": {},
  "chunk_size": 123,
  "chunk_overlap": 123,
  "skip_embedding_generation": true,
  "embedding_model": "OPENAI",
  "generate_sparse_vectors": true,
  "prepend_filename_to_chunks": true,
  "max_items_per_chunk": 1,
  "set_page_as_boundary": true,
  "request_id": "<string>",
  "use_ocr": true,
  "parse_pdf_tables_with_ocr": true
}'
{
  "success": true
}

Authorizations

authorization
string
headerrequired

token <token>, corresponds to temporary access tokens.

Body

application/json
data_source_id
integer
required
ids
required
tags
object | null
chunk_size
integer | null
chunk_overlap
integer | null
skip_embedding_generation
boolean | null
embedding_model
enum<string> | null
Available options:
OPENAI,
AZURE_OPENAI,
AZURE_ADA_LARGE_256,
AZURE_ADA_LARGE_1024,
AZURE_ADA_LARGE_3072,
AZURE_ADA_SMALL_512,
AZURE_ADA_SMALL_1536,
COHERE_MULTILINGUAL_V3,
VERTEX_MULTIMODAL,
OPENAI_ADA_LARGE_256,
OPENAI_ADA_LARGE_1024,
OPENAI_ADA_LARGE_3072,
OPENAI_ADA_SMALL_512,
OPENAI_ADA_SMALL_1536
generate_sparse_vectors
boolean | null
prepend_filename_to_chunks
boolean | null
max_items_per_chunk
integer | null

Number of objects per chunk. For csv, tsv, xlsx, and json files only.

set_page_as_boundary
boolean
default: false
request_id
string | null
use_ocr
boolean | null
parse_pdf_tables_with_ocr
boolean | null

Response

200 - application/json
success
boolean
required