Introduction
Phidata is a framework for building multi-modal agents.
- Build multi-modal agents with memory, knowledge, tools and reasoning.
- Build teams of agents that can work together to solve problems.
- Chat with your agents using a beautiful Agent UI.
Key Features
- Simple & Elegant
- Powerful & Flexible
- Multi-Modal by default
- Multi-Agent orchestration
- A beautiful Agent UI to chat with your agents
- Agentic RAG built-in
- Structured outputs
- Reasoning built-in
- Monitoring & Debugging built-in
Install
Simple & Elegant
Phidata Agents are simple and elegant, resulting in minimal, beautiful code.
For example, you can create a web search agent in 10 lines of code.
Setup
Setup your virtual environment
Install libraries
Export your OpenAI key
Phidata works with most model providers but for these examples let’s use OpenAI.
You can get an API key from here.
Run the agent
Powerful & Flexible
Phidata agents can use multiple tools and follow instructions to achieve complex tasks.
For example, you can create a finance agent with tools to query financial data.
Create a finance agent
Run the agent
Install libraries
Run the agent
Multi-Modal by default
Phidata agents support text, images, audio and video.
For example, you can create an image agent that can understand images and make tool calls as needed
Create an image agent
Run the agent
Multi-Agent orchestration
Phidata agents can work together as a team to achieve complex tasks.
Create an agent team
Run the agent team
Run the agent team
Continue reading
- Chat with your Agents using a beautiful Agent UI.
- More examples
- Monitoring & Debugging
Was this page helpful?