Let’s run the AI API in production on AWS.

AWS Setup

1

Add Credentials

To run on AWS, you need one of the following:

  1. The ~/.aws/credentials file with your AWS credentials
  2. AWS_ACCESS_KEY_ID + AWS_SECRET_ACCESS_KEY environment variables

To create the credentials file, install the aws cli and run aws configure

2

Add Region and Subnets

Add 2 Subnets to the workspace/settings.py file (required for ECS services)

workspace/settings.py
ws_settings = WorkspaceSettings(
    ...
    # -*- AWS settings
    # Add your Subnet IDs here
    subnet_ids=["subnet-xyz", "subnet-xyz"],
    ...
)

Please check that the subnets belong to the selected aws_region

Update Secrets

1

RDS database password

Update the RDS database password in workspace/secrets/prd_db_secrets.yml

workspace/secrets/prd_db_secrets.yml
# Secrets used by prd RDS database
MASTER_USERNAME: api
MASTER_USER_PASSWORD: "api9999!!"
2

API Secrets

Add any other secrets used by your api to workspace/secrets/prd_api_secrets.yml

workspace/secrets/prd_api_secrets.yml
SECRET_KEY: "very_secret"
# OPENAI_API_KEY: "sk-***"

Create AWS resources

Create AWS resources using:

phi ws up --env prd --infra aws

This will create:

  1. ECS Cluster for the application.
  2. ECS Task Definitions and Services that run the application on the ECS cluster.
  3. LoadBalancer to route traffic to the application.
  4. Security Groups that control incoming and outgoing traffic.
  5. Secrets for managing application and database secrets.
  6. RDS Database for Knowledge Base and Storage.

Press Enter to confirm and grab a cup of coffee while the resources spin up.

  • The RDS database takes about 5 minutes to activate.
  • These resources are defined in the workspace/prd_resources.py file.
  • Use the ECS console to view services and logs.
  • Use the RDS console to view the database instance.

Production FastApi

1

Enable FastApi

Update the workspace/settings.py file and set prd_api_enabled=True

workspace/settings.py
...
ws_settings = WorkspaceSettings(
    ...
    # Uncomment the following line
    prd_api_enabled=True,
...
2

Create FastApi resources

phi ws up --env prd --infra aws --group api

Press Enter to confirm

3

View API Endpoints

  • Open the LoadBalancer DNS + the /docs endpoint to view the API Endpoints.
  • Load the knowledge base using /v1/assitants/load-knowledge-base
  • Test the v1/assitants/chat endpoint with
{
  "message": "How do I make pad thai?",
  "assistant": "AUTO_PDF"
}
  • Integrate with your front-end or product.

API Endpoints

Update Production

Follow this guide to update your production application. You'll need to:

  1. Create a new image
  2. Update the ECS Task Definition and Services.

Delete AWS resources

Play around and then delete AWS resources using:

phi ws down --env prd --infra aws

Next

Congratulations on running your AI API on AWS. Next Steps: