WebsitePlatform Login

Amazon S3

Configure S3 and S3-compatible storage

In meinGPT (UI)

For most teams, setup is done directly in meinGPT without editing local config files.

  1. Open admin settings in meinGPT
  2. Go to Data Pools / Data Sources
  3. Click Add Source and choose this source type
  4. Configure credentials and scope in the UI
  5. Save and trigger the first sync

If you do not run your own DataVault runtime, this is usually all you need.

On-Prem Runtime Configuration (Advanced)

data_pools:
  - id: s3-documents
    type: s3
    access_key_id: $AWS_ACCESS_KEY_ID
    secret_access_key: $AWS_SECRET_ACCESS_KEY
    endpoint: https://s3.amazonaws.com
    bucket_name: my-bucket
    provider: "AWS"

Configuration Options

FieldTypeDefaultRequiredDescription
idstring-Unique identifier for the data pool
typestring-Must be "s3"
access_key_idstring-AWS access key
secret_access_keystring-AWS secret key
endpointstring-S3 endpoint URL
bucket_namestring-S3 bucket name
providerstring"Other"Provider ("AWS", "MinIO", "DigitalOcean", "Other")
base_pathstring""Optional folder prefix

Synchronization

  • Vault connects to the configured bucket and syncs objects under base_path.
  • Subsequent runs are incremental and only process changes.
  • For large buckets, start with a narrow base_path for safer rollout.

Setup

  1. Create IAM user: AWS IAM Console
  2. Attach S3 policy: Give read-only access to your bucket
  3. Generate access keys: Copy Access Key ID and Secret
  4. Add to environment: AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY

Amazon S3 Documentation

On-prem only: this source page is relevant when you operate your own DataVault runtime and configure data_pools yourself.

On this page