If you’re new to Unstructured, read this note first.
Before you can create a source connector, you must first sign in to your Unstructured account:
- If you do not already have an Unstructured account, go to https://unstructured.io/contact and fill out the online form to indicate your interest.
- If you already have an Unstructured account, sign in by using the URL of the sign-in page that Unstructured provided to you when your Unstructured account was created. If you do not have this URL, contact Unstructured Sales at sales@unstructured.io.
- For the Unstructured UI or the Unstructured API, only Couchbase Capella clusters are supported.
- For Unstructured Ingest, Couchbase Capella clusters and local Couchbase server deployments are supported.
For Couchbase Capella, you will need:
- A Couchbase Capella account.
- A Couchbase Capella cluster.
- A bucket, scope, and collection on the cluster.
- The cluster’s public connection string.
- The cluster access name (username) and secret (password).
- Incoming IP address allowance for the cluster. To get Unstructured’s IP address ranges, go to https://assets.p6m.u10d.net/publicitems/ip-prefixes.json and allow all of the ip_prefix fields’ values that are listed. These IP address ranges are subject to change. You can always find the latest ones in the preceding file.
For a local Couchbase server, you will need:
- Installation of a local Couchbase server.
- Connection details to the local Couchbase server.
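For the Couchbase Capella IP address allowance described above, the ip_prefix values can be collected with a short script instead of being copied by hand. The following is a minimal sketch, not an official Unstructured tool. It assumes only that the file is JSON and that each range is stored under a key named ip_prefix; the overall layout of the file is not documented here, so the code searches the whole document. The function names collect_ip_prefixes and fetch_ip_prefixes are hypothetical.

```python
import json
from urllib.request import urlopen

# URL taken from the requirements above.
IP_PREFIXES_URL = "https://assets.p6m.u10d.net/publicitems/ip-prefixes.json"


def collect_ip_prefixes(node):
    """Recursively gather every string stored under an "ip_prefix" key.

    The exact layout of the JSON file is an assumption, so this walks
    nested dicts and lists instead of relying on a fixed structure.
    """
    found = []
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "ip_prefix" and isinstance(value, str):
                found.append(value)
            else:
                found.extend(collect_ip_prefixes(value))
    elif isinstance(node, list):
        for item in node:
            found.extend(collect_ip_prefixes(item))
    return found


def fetch_ip_prefixes(url=IP_PREFIXES_URL):
    """Download the published file and return its ip_prefix values."""
    with urlopen(url, timeout=30) as response:
        return collect_ip_prefixes(json.load(response))


if __name__ == "__main__":
    # Print one range per line, ready to paste into the cluster's allowed IP list.
    for prefix in fetch_ip_prefixes():
        print(prefix)
```

Because the ranges are subject to change, a script like this could be rerun periodically and its output compared against the cluster's current allowed IP list.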
To create the source connector:
- On the sidebar, click Connectors.
- Click Sources.
- Click New or Create Connector.
- Give the connector a unique Name.
- In the Provider area, click Couchbase.
- Click Continue.
- Follow the on-screen instructions to fill in the fields as described later on this page.
- Click Save and Test.
Fill in the following fields:
- Name (required): A unique name for this connector.
- Bucket (required): The name of the bucket to connect to on the cluster.
- Connection string (required): The connection string for the Couchbase cluster.
- Scope: The name of the scope in the bucket. The default is _default if not otherwise specified.
- Collection: The name of the collection in the scope. The default is _default if not otherwise specified.
- Batch Size: The maximum number of documents per transmitted batch. The default is 50 if not otherwise specified.
- Collection ID key (source connector only): The name of the collection field that contains the document ID. The default is id if not otherwise specified.
- Username (required): The cluster access name (username) that has access to the cluster.
- Password (required): The password for the corresponding username.
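The fields above, with their required flags and defaults, can be gathered in one place before being typed into the form. The following is a minimal sketch for illustration only. The class name CouchbaseSourceSettings and its attribute names are hypothetical and are not part of the Unstructured UI or API; only the defaults (_default, _default, 50, and id) come from the field descriptions above.

```python
from dataclasses import dataclass


@dataclass
class CouchbaseSourceSettings:
    """Hypothetical container mirroring the connector form fields."""

    # Required fields.
    name: str
    bucket: str
    connection_string: str
    username: str
    password: str
    # Optional fields, with the defaults stated in the field list.
    scope: str = "_default"
    collection: str = "_default"
    batch_size: int = 50
    collection_id: str = "id"

    def __post_init__(self):
        # Reject empty values for any field marked as required.
        required = ("name", "bucket", "connection_string", "username", "password")
        missing = [field for field in required if not getattr(self, field)]
        if missing:
            raise ValueError(f"Missing required fields: {', '.join(missing)}")
        # A batch must contain at least one document.
        if self.batch_size < 1:
            raise ValueError("batch_size must be a positive integer")


# Example with placeholder values; replace them with your own cluster details.
settings = CouchbaseSourceSettings(
    name="my-couchbase-source",
    bucket="my-bucket",
    connection_string="couchbases://cb.example.cloud.couchbase.com",
    username="my-username",
    password="my-password",
)
```

Leaving the optional fields unset keeps the documented defaults, which matches what happens when those fields are left blank in the form.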

