ML Deployment Template with FastAPI, Docker, Kubernetes

Introduction

This is an example project for ML deployment: a FastAPI app that serves ML model predictions for both single and batch requests. FastAPI is a strong fit for serving ML models because of its performance and its async support when run under Uvicorn. It is also a personal favorite of mine given its simplicity, which allows for very quick deployment of API endpoints.
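
A minimal sketch of that single/batch pattern is below. It assumes a scikit-learn style model pickled under pickles/; the endpoint paths, schema fields, and file name are illustrative, not necessarily this repo's actual code.

```python
import pickle
from typing import List

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

# Load the pickled model once at import time rather than on every request.
with open("pickles/model.pkl", "rb") as f:
    model = pickle.load(f)

class Features(BaseModel):
    values: List[float]

class BatchRequest(BaseModel):
    items: List[Features]

@app.post("/predict")
async def predict(features: Features):
    # Single prediction: scikit-learn models expect a 2D array of rows.
    prediction = model.predict([features.values])[0]
    return {"prediction": float(prediction)}

@app.post("/predict/batch")
async def predict_batch(batch: BatchRequest):
    # Batch prediction: score every row in one model call.
    rows = [item.values for item in batch.items]
    predictions = model.predict(rows)
    return {"predictions": [float(p) for p in predictions]}
```

An app like this is served with Uvicorn, e.g. uvicorn main:app --host 0.0.0.0 --port 8000.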

*I will use this in a blog post explaining the process in more detail... eventually.

Project Structure

  • App: The data models, routes, and services required to serve the API endpoints
  • Config: Configuration settings, including necessary file paths and environment variables
  • Kube: Deployment and service configuration files for scaling with Kubernetes
  • Pickles: .pkl files for the model, model variables, and scaler (see the loading sketch after this list)
  • Tests: Unit tests for the data models and endpoints
  • Training: A script covering the steps used during training, also usable for preprocessing request data
  • Dockerfile: The file for turning the API into a Docker image
  • requirements.txt: The project dependencies, generated with pip freeze
  • run_api.sh: A shell script for building and running the Docker image
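
Since serving reuses the training-time artifacts, the Pickles and Training pieces amount to a loader plus a preprocessing step like the sketch below; the file names and function names here are hypothetical, not the repo's actual code.

```python
import pickle

def load_artifacts(pickle_dir: str = "pickles"):
    """Load the model, its expected input variables, and the fitted scaler."""
    artifacts = {}
    for name in ("model", "model_variables", "scaler"):
        with open(f"{pickle_dir}/{name}.pkl", "rb") as f:
            artifacts[name] = pickle.load(f)
    return artifacts

def preprocess(raw_row: dict, model_variables, scaler):
    # Order the request fields exactly as the model saw them in training,
    # then apply the same scaler that was fit during training.
    ordered = [[raw_row[var] for var in model_variables]]
    return scaler.transform(ordered)
```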

Scaling With Kubernetes

Requirements:

  1. Deployment file: sets the container image to use, the number of replicas to create, and min/max resource requests and limits
  2. Metrics Server: collects the metrics (CPU utilization, memory utilization, request counts) that other functions like autoscaling and load balancing depend on
  3. Horizontal Pod Autoscaler (HPA): automatically scales the deployment up or down when a metric threshold is crossed (a programmatic sketch follows this list)
  4. Load balancer: distributes requests across the cluster to efficiently utilize the replicas
  5. Prometheus: set up for continuous collection of metrics over time (an instrumentation sketch also follows)
  6. Grafana: link it to the Prometheus data to visualize deployment metrics
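
The HPA in item 3 would normally live as a YAML manifest under Kube/, but the same object can be created programmatically. Here is a sketch using the official kubernetes Python client; the deployment name, namespace, and thresholds are placeholders, and it assumes Metrics Server is already running.

```python
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() inside a pod

# Scale a hypothetical "sf-api" Deployment between 2 and 10 replicas,
# targeting 70% average CPU utilization.
hpa = client.V1HorizontalPodAutoscaler(
    metadata=client.V1ObjectMeta(name="sf-api-hpa"),
    spec=client.V1HorizontalPodAutoscalerSpec(
        scale_target_ref=client.V1CrossVersionObjectReference(
            api_version="apps/v1", kind="Deployment", name="sf-api"
        ),
        min_replicas=2,
        max_replicas=10,
        target_cpu_utilization_percentage=70,
    ),
)

client.AutoscalingV1Api().create_namespaced_horizontal_pod_autoscaler(
    namespace="default", body=hpa
)
```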

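For item 5, one common way to expose app-level metrics for Prometheus to scrape is the prometheus-fastapi-instrumentator package; this is a suggestion, not necessarily what this repo uses.

```python
from fastapi import FastAPI
from prometheus_fastapi_instrumentator import Instrumentator

app = FastAPI()

# Adds default request metrics (latency, counts, in-progress) and exposes
# them at /metrics for Prometheus to scrape; Grafana can then chart them.
Instrumentator().instrument(app).expose(app)
```
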
Scaling With EKS

Requirements:

  1. Create an EKS cluster in the console, with the CLI, or with Terraform/CloudFormation (a boto3 sketch follows this list)
  2. An ELB (Elastic Load Balancer) is created automatically, but you can choose between a Classic, Network, or Application Load Balancer
  3. You can also use the Cluster Autoscaler for EKS
  4. Set up any storage needed for persistence
  5. Use Amazon CloudWatch for logging and monitoring
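
As a sketch of item 1 done programmatically, creating a cluster with boto3 looks roughly like this; the role ARN, subnet IDs, region, and cluster name are all placeholders to replace with your own.

```python
import boto3

eks = boto3.client("eks", region_name="us-east-1")

# Kick off cluster creation; EKS provisions the control plane asynchronously.
response = eks.create_cluster(
    name="sf-api-cluster",
    roleArn="arn:aws:iam::123456789012:role/eks-cluster-role",  # placeholder
    resourcesVpcConfig={
        "subnetIds": ["subnet-aaaa1111", "subnet-bbbb2222"],  # placeholders
    },
)
print(response["cluster"]["status"])  # typically "CREATING"
```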

Other Considerations

  • CI/CD and deployment strategies
  • Network security
