Enum Modality
"Modality" refers to the type or form of data that a model is designed to process, either as input or output. In AI and machine learning contexts, modality describes the nature of the information being handled — such as text, image, audio, or video.
For example:
- A text-to-text model like GPT-4 processes text inputs and generates text outputs.
- A text-to-image model like DALL·E takes text prompts and produces images.
- A multimodal model like Gemini can process multiple types of data simultaneously, such as combining text and image inputs.
The concept of modality helps categorize models based on the kinds of sensory or informational data they handle, and is especially important for understanding the capabilities and limitations of a model.
Namespace: Glitch9.AIDevKit
Assembly: .dll
Syntax
[Flags]
public enum Modality
Fields
Name | Description |
---|---|
Audio | |
File | |
Image | |
Text | |
TextEmbedding | |
Video |