Selected Projects

RAG Platform

Configurable Retrieval-Augmented Generation platform for deploying Q&A services on domain-specific documents.

OpenAIMilvusAzure SearchKubernetes
View Details

CDO Inferencing Services

Internal scalable inferencing endpoints for embedding models, classifiers, and LLMs for the Chief Data Office.

KServeNVIDIA TritonvLLMKubernetes
View Details

CDO Speech Services

Unified speech and translation services abstracted into a single API and Python client library.

NVIDIA RivaKubernetesPythonFastAPI
View Details

AutoGen Studio Deployment

Enterprise deployment of AutoGen Studio enabling multi-agent AI workflows with proper governance.

AutoGenPostgreSQLAzure ADPython
View Details

Data Insights Platform

Enterprise data platform using Azure Kubernetes and Databricks to support product strategies.

Azure K8sDatabricksSparkKafka
View Details

Route Optimization Platform

System for repair and install technicians using OR-Tools. Achieved $220M annual savings.

PythonOR-ToolsPostgreSQLRedis
View Details