Simple Visual Question Answering Example¶

This example demonstrates how to use the framework for visual question answering (VQA) tasks. The example code can be found in the examples/step1_simpleVQA directory.

   cd examples/step1_simpleVQA

Overview¶

This example implements a simple Visual Question Answering (VQA) workflow that consists of two main components:

Input Interface
Handles user input containing questions about images
Processes and manages image data
Extracts the user's questions/instructions
Simple VQA Processing
Takes the user input and image
Analyzes the image based on the user's question
Generates appropriate responses to visual queries

The workflow follows a straightforward sequence:

Prerequisites¶

Python 3.10+
Required packages installed (see requirements.txt)
Access to OpenAI API or compatible endpoint (see configs/llms/gpt.yml)
Redis server running locally or remotely
Conductor server running locally or remotely

Configuration¶

The container.yaml file is a configuration file that manages dependencies and settings for different components of the system, including Conductor connections, Redis connections, and other service configurations. To set up your configuration:

Generate the container.yaml file: bash python compile_container.py This will create a container.yaml file with default settings under examples/step1_simpleVQA.
Configure your LLM settings in configs/llms/gpt.yml:
Set your OpenAI API key or compatible endpoint through environment variable or by directly modifying the yml file bash export custom_openai_key="your_openai_api_key" export custom_openai_endpoint="your_openai_endpoint"
Configure other model settings like temperature as needed through environment variable or by directly modifying the yml file
Update settings in the generated container.yaml:
Modify Redis connection settings:
- Set the host, port and credentials for your Redis instance
- Configure both redis_stream_client and redis_stm_client sections
Update the Conductor server URL under conductor_config section
Adjust any other component settings as needed

Running the Example¶

Run the simple VQA example:

For terminal/CLI usage: bash python run_cli.py

For app/GUI usage: bash python run_app.py

Troubleshooting¶

If you encounter issues: - Verify Redis is running and accessible - Check your OpenAI API key is valid - Ensure all dependencies are installed correctly - Review logs for any error messages

Building the Example¶

Coming soon! This section will provide detailed instructions for building and packaging the step1_simpleVQA example step by step.