GPT Vision Node

This node in LiteGraph facilitates interactions with OpenAI’s vision models like GPT-4 Vision. It’s designed for analyzing images and extracting insights through natural language processing.

Inputs and Outputs

Input: Accepts URLs of images for processing by the vision model.

Output: Outputs the model’s analysis of the images in natural language.

Properties

Model: Choose from vision model options like ‘gpt-4-vision-preview’.

Secret: API key for accessing OpenAI services.

Functionalities

Includes functionalities for image content description, comparison, and detailed analysis.

Widgets

Model Selector: Choose the vision model for image analysis.

Secret Selector: Select the API key for access to OpenAI services.

Usage

1. Connect image URL inputs.

2. Choose the vision model and API key.

3. The node processes the images and outputs the analysis result in natural language.