Tutorial

This step-by-step tutorial will walk you through the process of connecting an user’s Confluence account to Carbon, syncing your files, and leveraging Carbon’s functionality for your RAG workflow.

Jupyter Notebook

Follow along with interactive code snippets corresponding to the tutorial.

Step 1: Generate OAuth URL

To initiate the integration process, follow these steps:

Visit the Carbon OAuth URL generation endpoint: /oauth_url
Generate an OAuth URL specifically for Confluence
Click on the generated URL to begin the authentication process with your user’s Confluence account

Step 2: Verify Account Connection

After successfully authenticating your Confluence account, verify the connection:

Navigate to the Carbon user data sources endpoint: /user_data_sources
Check if your user’s Confluence account is listed, indicating a successful connection

Step 3: Select Files to Sync

Specific connectors such as Google and Dropbox have their own file selectors, so this step is not required.

To sync specific files and folders from Confluence to Carbon, follow these steps:

Sync all items in a user’s data source connection: /integrations/items/sync
Browse your Confluence file directory using the Carbon file listing endpoint: /integrations/items/list
Select the desired files and folders you want to sync to Carbon
Pass the selected items to the Carbon file sync endpoint: /integrations/files/sync
The sync process will begin, syncing the content of the selected files to Carbon

The /integrations/items/sync endpoint syncs all items in a user’s data source connection. It only syncs the directory structure and accompanying metadata, not the content within the files.

Steps #1-3 above can be managed via our pre-built Carbon Connect React component as well.

Step 4: Access Synced Files

Once the sync process is complete, you can access the synced files in Carbon:

Visit the Carbon user files endpoint: /user_files_v2
- By setting include_raw_file to true, you can retrieve a pre-signed URL for the uploaded file in its original format (e.g., PDF, DOCX, etc.). This allows you to access and download the file as it was initially uploaded by the user.
- By setting include_parsed_text_file to true, you can retrieve a plain text version of the uploaded file. This can be useful when you need to process or analyze the content of the file without dealing with the original file format.
The synced files will maintain their original directory structure from Confluence

Step 5: Retrieve Embeddings

Carbon provides powerful embedding capabilities for your synced files:

Retrieve embeddings and content using the Carbon embeddings endpoint: /embeddings
Alternatively, you can retrieve the embeddings and chunks and store it in your own vector database: /list_chunks_and_embeddings

Step 6: Resync Files (Optional)

If you make changes to files in Confluence after the initial sync, you can resync them to update the versions in Carbon:

Use the Carbon file resync endpoint: /resync_file
Specify the individual files you want to resync

Step 7: Set Up Auto-Sync (Optional)

To keep your files automatically in sync between Confluence and Carbon, you can enable scheduled auto-sync:

Set up auto-sync at the organization level using the Carbon update organization endpoint: /organization/update
Alternatively, enable auto-sync for individual users using the Carbon update users endpoint: /update_users

By following these steps, you’ll be able to seamlessly integrate your Confluence account with Carbon, sync your files, and leverage Carbon’s advanced features to supercharge your development workflow. Happy building with Carbon!

Step 8: Add Webhooks (Optional)

For testing, you can create a tunnel to your localhost server using a tool like ngrok. For example: https://5f9f-98-14-198-85.ngrok-free.app/api/webhook

Add a URL to which webhooks should be sent. This can be done using the /add_webhook endpoint.
The response from the /add_webhook endpoint includes a signing_key. Save this key securely as it is used to validate the authenticity of received webhooks and cannot be retrieved again.
You can create a tunnel to your localhost server using a tool like ngrok. For example: https://5f9f-98-14-198-85.ngrok-free.app/api/webhook
At this point, all events will be sent to the URL specified in Step 1. An event - sent via an HTTP POST request - contains two important elements: a Carbon-Signature header and a body with a single key-value pair. You can validate the authenticity and integrity of the webhook by calculating its signature.
To ensure that the webhooks are genuine and have not been tampered with, you should validate the incoming webhooks using the signing_key from Step 2. You can do this manually or use our official webhook libraries here.
Once you receive and handle the webhook, you should respond to the POST request with a 200 status - otherwise, the webhook will be retried (up to three times).

Get Started

Source Connectors

Destination Connectors

Learn

Resources

Migrations

Jupyter Notebook

Step 1: Generate OAuth URL

Step 2: Verify Account Connection

Step 3: Select Files to Sync

Step 4: Access Synced Files

Step 5: Retrieve Embeddings

Step 6: Resync Files (Optional)

Step 7: Set Up Auto-Sync (Optional)

Step 8: Add Webhooks (Optional)

Get Started

Source Connectors

Destination Connectors

Learn

Resources

Migrations

Jupyter Notebook

​Step 1: Generate OAuth URL

​Step 2: Verify Account Connection

​Step 3: Select Files to Sync

​Step 4: Access Synced Files

​Step 5: Retrieve Embeddings

​Step 6: Resync Files (Optional)

​Step 7: Set Up Auto-Sync (Optional)

​Step 8: Add Webhooks (Optional)

Step 1: Generate OAuth URL

Step 2: Verify Account Connection

Step 3: Select Files to Sync

Step 4: Access Synced Files

Step 5: Retrieve Embeddings

Step 6: Resync Files (Optional)

Step 7: Set Up Auto-Sync (Optional)

Step 8: Add Webhooks (Optional)