CV MLOps Architecture

Development architecture

The development stage architecture is set up to:

Development architecture Figure 1: Development stage architecture diagram

Toggle for description of the architecture diagram (Figure 1).

Dev Notebook Container

MLflow Tracking Server

Model Serving

Foundational libraries

Interactive development

Orchestration & resource management

Model training & data quality

Hyperparameter Tuning

Logs and tracks

Model Card

Summarizes model details, training dataset, and evaluation metrics (like Mean Precision, Recall, F1 Score)
Helps maintain model documentation and reproducibility

KServe

A standard model inference platform on Kubernetes, built for scalable, production-grade ML serving
Supports real-time inference via a REST API with a standardized protocol across ML frameworks
Enables modern serverless inference workloads with autoscaling (e.g., Scale to Zero, GPU-based)
Provides advanced deployment strategies such as canary rollouts, ensembles, and model transformers
Supports pre/post-processing, monitoring, and explainability
Integrates with ModelMesh for intelligent routing and high-density model serving

Data and metric flow are set up as follows:

Deployment architecture Figure 2: Deployment stage architecture diagram

Toggle for description of the architecture diagram (Figure 2).

AI Platform AKS Cluster

Kubeflow Pipeline

MLflow Integration

The architecture is deployed on an AI Platform Azure Kubernetes Service (AKS) cluster.
Kubernetes (K8S) Persistent Data Volume stores data used in Kubeflow pipeline stages.

Core workflow composed of three main stages: