Image to Text Agent

On this page

Usage

Create a file image_to_text.py with the following code:

image_to_text.py

from pathlib import Path

from phi.agent import Agent
from phi.model.openai import OpenAIChat

agent = Agent(
    model=OpenAIChat(id="gpt-4o"),
    markdown=True,
)

image_path = Path(__file__).parent.joinpath("multimodal-agents.jpg")
agent.print_response(
    "Write a 3 sentence fiction story about the image",
    images=[str(image_path)],
)

Usage

Create a virtual environment

Open the Terminal and create a python virtual environment.

python3 -m venv ~/.venvs/aienv
source ~/.venvs/aienv/bin/activate

Install libraries

pip install openai phidata

Run the agent

python image_to_text.py

Cal.com Agent Research Agent

Examples

How To

Image to Text Agent

Usage

Examples

How To

​Usage

Usage