🖼 Image Generation configuration

| Config variable | Values | Notes |
| --- | --- | --- |
| IMAGE_PROVIDER | dalle, huggingface, sdwebui | default: dalle |
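
As a rough illustration of how the default behaves, the sketch below resolves IMAGE_PROVIDER from the environment and falls back to dalle when the variable is unset. The function name `resolve_image_provider` is hypothetical and is not part of Auto-GPT; treating an empty value as unset is also an assumption.

```python
import os

# The three values listed in the table above.
VALID_PROVIDERS = ("dalle", "huggingface", "sdwebui")


def resolve_image_provider(env=None):
    """Return the configured image provider, defaulting to 'dalle'.

    Hypothetical helper for illustration; `env` is any mapping of
    environment variables (os.environ when omitted).
    """
    env = os.environ if env is None else env
    # Assumption: an empty value is treated the same as an unset one.
    provider = env.get("IMAGE_PROVIDER", "").strip() or "dalle"
    if provider not in VALID_PROVIDERS:
        raise ValueError(
            f"IMAGE_PROVIDER must be one of {VALID_PROVIDERS}, got {provider!r}"
        )
    return provider
```

Calling `resolve_image_provider({})` yields `"dalle"`, which matches leaving the variable commented out in .env.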

DALL-E

In .env, make sure IMAGE_PROVIDER is commented out (or set to dalle):

# IMAGE_PROVIDER=dalle    # this is the default

Further optional configuration:

| Config variable | Values | Notes |
| --- | --- | --- |
| IMAGE_SIZE | 256, 512, 1024 | default: 256 |
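
IMAGE_SIZE is a single number, which suggests square images. A minimal sketch of validating it and turning it into a `WIDTHxHEIGHT` string follows. The helper name `dalle_size_argument` is hypothetical, and the `"256x256"` string form is an assumption about how the value is eventually passed to the image API, not something this page states.

```python
import os

# The three sizes listed in the table above.
ALLOWED_IMAGE_SIZES = (256, 512, 1024)


def dalle_size_argument(env=None):
    """Validate IMAGE_SIZE and format it as 'NxN' (hypothetical helper)."""
    env = os.environ if env is None else env
    raw = env.get("IMAGE_SIZE", "256")  # documented default
    try:
        size = int(raw)
    except ValueError:
        raise ValueError(f"IMAGE_SIZE must be a number, got {raw!r}") from None
    if size not in ALLOWED_IMAGE_SIZES:
        raise ValueError(
            f"IMAGE_SIZE must be one of {ALLOWED_IMAGE_SIZES}, got {size}"
        )
    # Assumption: the image API expects a square 'WIDTHxHEIGHT' string.
    return f"{size}x{size}"
```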

Hugging Face

To use text-to-image models from Hugging Face, you need a Hugging Face API token. You can create one under Hugging Face > Settings > Tokens.

Once you have an API token, uncomment and adjust these variables in your .env:

IMAGE_PROVIDER=huggingface
HUGGINGFACE_API_TOKEN=your-huggingface-api-token

Further optional configuration:

| Config variable | Values | Notes |
| --- | --- | --- |
| HUGGINGFACE_IMAGE_MODEL | see available models | default: CompVis/stable-diffusion-v1-4 |
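
To show how the token and model name fit together, here is a sketch that builds (but does not send) a text-to-image request. It assumes the hosted Inference API endpoint `https://api-inference.huggingface.co/models/<model>` and a JSON body of the form `{"inputs": prompt}`. Both are assumptions not stated on this page, and `build_huggingface_request` is a hypothetical name.

```python
import json
import os
import urllib.request

DEFAULT_MODEL = "CompVis/stable-diffusion-v1-4"  # documented default
# Assumption: the hosted Inference API base URL; verify against current docs.
API_BASE = "https://api-inference.huggingface.co/models"


def build_huggingface_request(prompt, env=None):
    """Build a POST request for a text-to-image model (hypothetical helper)."""
    env = os.environ if env is None else env
    token = env.get("HUGGINGFACE_API_TOKEN")
    if not token:
        raise ValueError("HUGGINGFACE_API_TOKEN is not set")
    model = env.get("HUGGINGFACE_IMAGE_MODEL", DEFAULT_MODEL)
    return urllib.request.Request(
        url=f"{API_BASE}/{model}",
        data=json.dumps({"inputs": prompt}).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` would send it. Under the assumptions above, a successful response body would be the raw image bytes.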

Stable Diffusion WebUI

It is possible to use your own self-hosted Stable Diffusion WebUI with Auto-GPT:

IMAGE_PROVIDER=sdwebui

Note

Make sure you are running WebUI with --api enabled.

Further optional configuration:

| Config variable | Values | Notes |
| --- | --- | --- |
| SD_WEBUI_URL | URL to your WebUI | default: http://127.0.0.1:7860 |
| SD_WEBUI_AUTH | {username}:{password} | Note: do not copy the braces! |
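
The sketch below shows one way the two WebUI variables could be interpreted: the URL becomes the base of the API endpoint, and SD_WEBUI_AUTH is split into a username and password pair. The `/sdapi/v1/txt2img` path is the text-to-image route of the AUTOMATIC1111 WebUI API and is assumed here to be the relevant one. Using the credentials for HTTP Basic authentication is also an assumption, and both function names are hypothetical.

```python
import os

DEFAULT_SD_WEBUI_URL = "http://127.0.0.1:7860"  # documented default


def parse_sd_webui_auth(value):
    """Split 'username:password' into a tuple, or return None if unset."""
    if not value:
        return None
    # Split on the first colon only, so passwords may contain colons.
    username, sep, password = value.partition(":")
    if not sep or not username:
        raise ValueError("SD_WEBUI_AUTH must look like username:password")
    return username, password


def sd_webui_settings(env=None):
    """Collect the WebUI endpoint and credentials (hypothetical helper)."""
    env = os.environ if env is None else env
    base_url = env.get("SD_WEBUI_URL", DEFAULT_SD_WEBUI_URL).rstrip("/")
    return {
        # Assumption: the txt2img route exposed when WebUI runs with --api.
        "txt2img_url": f"{base_url}/sdapi/v1/txt2img",
        "auth": parse_sd_webui_auth(env.get("SD_WEBUI_AUTH")),
    }
```

The returned `auth` tuple has the shape most HTTP clients accept for Basic authentication.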

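Selenium

To run a Selenium-driven browser on a machine without a display, start a virtual X server with Xvfb and point your client at it: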

sudo Xvfb :10 -ac -screen 0 1024x768x24 & DISPLAY=:10 <YOUR_CLIENT>