Building Intelligent Systems at Scale
Architecting production ML pipelines, RAG systems, and MLOps infrastructure that power real-world AI applications serving thousands of users.
Expertise & Focus
AI/ML Engineering
RAG systems, AI agents, LLMs, prompt engineering, fine-tuning, and inference optimization for production environments
MLOps & Automation
End-to-end ML pipelines, CI/CD automation, model monitoring, and deployment orchestration at enterprise scale
Cloud Infrastructure
AWS, Azure, GCP services for ML workloads, infrastructure as code, and scalable cloud-native architectures
Data Engineering
Data pipelines, feature engineering, vector databases, dataset optimization, and distributed data processing
Engineering Philosophy
I believe in building ML systems that are not just accurate, but reliable, scalable, and maintainable. My approach combines rigorous engineering practices with practical AI implementation, cutting deployment times from hours to minutes while maintaining 99%+ uptime. Every system I build is designed to evolve, monitor itself, and serve real users in production.
Featured Work
Medical LLM Chatbot
Built and deployed a production-grade medical chatbot using LangChain, Groq LLM, and Tavily search with complete MLOps infrastructure. Implemented CI/CD pipeline with Jenkins, Docker, SonarQube, and AWS ECS Fargate, reducing deployment time from 2 hours to 15 minutes. A sketch of the serving layer follows the tag list below.
- LangChain
- Jenkins
- Docker
- AWS ECS
- FastAPI
- Prometheus
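A minimal sketch of the serving layer, assuming the langchain-groq and Tavily community packages alongside FastAPI; the model name, prompt, and /chat endpoint are illustrative rather than the production configuration.

```python
# Sketch: Groq-backed LangChain model with Tavily web search, served via FastAPI.
# Model name, prompt, and endpoint path are placeholders, not production values.
from fastapi import FastAPI
from pydantic import BaseModel
from langchain_groq import ChatGroq
from langchain_community.tools.tavily_search import TavilySearchResults

app = FastAPI()

# Assumes GROQ_API_KEY and TAVILY_API_KEY are set in the environment.
llm = ChatGroq(model="llama-3.1-8b-instant", temperature=0)  # placeholder model
search = TavilySearchResults(max_results=3)

class Query(BaseModel):
    question: str

@app.post("/chat")
def chat(query: Query) -> dict:
    # Pull fresh context from the web, then ground the answer in it.
    context = search.invoke(query.question)
    prompt = (
        "You are a cautious medical assistant. Using the context below, "
        "answer the question and advise consulting a clinician.\n\n"
        f"Context: {context}\n\nQuestion: {query.question}"
    )
    answer = llm.invoke(prompt)
    return {"answer": answer.content}
```

Grounding each answer in fresh search results keeps responses from relying solely on the model's stale parametric knowledge, which matters in a medical setting.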
Multilingual Document Chatbot
Led development of enterprise RAG-based chatbot serving 2,000+ employees across 8 data centers. Architected document retrieval system supporting 4 languages with vector database optimization, improving retrieval efficiency by 95% through advanced chunking strategies and embedding optimization. A sketch of the retrieval path follows the tag list below.
- RAG
- Vector DB
- Azure
- Kubernetes
- Python
- NLP
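A minimal sketch of the retrieval path: overlapping chunking, multilingual embeddings, and similarity search. FAISS and the sentence-transformers model stand in for the actual vector database and embedder, which are not specified here.

```python
# Sketch: chunk documents with overlap, embed them with a multilingual model,
# and retrieve by similarity. FAISS is a stand-in for the production vector DB.
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_community.vectorstores import FAISS
from langchain_huggingface import HuggingFaceEmbeddings

# Multilingual sentence embeddings so queries in different languages land in
# the same vector space (model choice is illustrative).
embeddings = HuggingFaceEmbeddings(
    model_name="sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2"
)

# Overlapping chunks preserve cross-sentence context without bloating the index.
splitter = RecursiveCharacterTextSplitter(chunk_size=800, chunk_overlap=100)

def build_index(documents: list[str]) -> FAISS:
    chunks = splitter.create_documents(documents)
    return FAISS.from_documents(chunks, embeddings)

def retrieve(index: FAISS, query: str, k: int = 4) -> list[str]:
    return [doc.page_content for doc in index.similarity_search(query, k=k)]
```

Chunk size and overlap are the kind of levers tuned in the chunking strategy described above; the exact values here are assumptions.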
Cancer Survival Prediction System
Orchestrated complete ML pipeline for colorectal cancer survival prediction on 167K+ patient records using Kubeflow on Minikube. Containerized data processing and model training components, reducing pipeline execution from 3 hours to 20 minutes. Integrated MLflow with DagsHub for experiment tracking and deployed Flask API for clinical decision support. A sketch of the pipeline definition follows the tag list below.
- Kubeflow
- MLflow
- Docker
- Flask
- Scikit-learn
- DagsHub
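A minimal sketch of the pipeline definition, assuming the Kubeflow Pipelines v2 SDK; the component bodies are placeholders for the real preprocessing and training steps, which log runs to MLflow with the tracking URI pointed at DagsHub.

```python
# Sketch: two containerized Kubeflow components chained into a pipeline and
# compiled to YAML for submission to the cluster (Minikube in this project).
from kfp import dsl, compiler

@dsl.component(base_image="python:3.11")
def preprocess(raw_path: str, processed: dsl.Output[dsl.Dataset]):
    # Placeholder: the real component cleans and feature-engineers the records.
    with open(processed.path, "w") as f:
        f.write(raw_path)

@dsl.component(base_image="python:3.11")
def train(processed: dsl.Input[dsl.Dataset], model: dsl.Output[dsl.Model]):
    # Placeholder: the real component fits a scikit-learn survival model and
    # logs parameters and metrics to MLflow (tracking URI set to DagsHub).
    with open(model.path, "w") as f:
        f.write("model")

@dsl.pipeline(name="crc-survival-pipeline")
def survival_pipeline(raw_path: str = "data/patients.csv"):
    prep = preprocess(raw_path=raw_path)
    train(processed=prep.outputs["processed"])

if __name__ == "__main__":
    compiler.Compiler().compile(survival_pipeline, "survival_pipeline.yaml")
```

Keeping each stage in its own container is what allows the steps to be cached and scaled independently, which is where the 3-hour-to-20-minute reduction comes from.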