RAG-OS Pipeline
Production-grade RAG system for technical HVAC manuals using Docling, Qdrant, and BGE embeddings. Adaptive chunking and cross-encoder reranking with rigorous LLM-as-Judge evaluation framework.
Hi, I'm
Applied ML Engineer & Data Scientist at University of Washington crafting intelligent systems with RAG, LLMs, Deep Learning, and Production ML.
I'm an Applied ML Engineer and Data Scientist with a passion for building intelligent systems that solve real-world problems. I recently graduated with my Master's from University of Washington, specializing in Retrieval-Augmented Generation (RAG), LLM, Agentic AI Systems, and Scalable ML Pipelines.
With experience at Uber, American Express, Contextual AI, AWS, Hitachi Vantara, and Microsoft, I bridge the gap between cutting-edge research and production-grade software. I don't just train models; I deploy them to create tangible impact.
Most recently at Contextual AI, I designed a multilingual multimodal RAG system with hybrid retrieval, cross-encoder reranking, and LLM-as-a-Judge evaluation, achieving 86.5% Recall@3 across 150+ technical manuals. I've also built deep learning models for real-time predictions, forecasting systems across global markets, and scalable ETL pipelines.
I'm a University Gold Medalist from GGSIPU with a perfect 4.0 GPA, and I believe in building ML systems that don't just work in notebooks—they work in production.
M.S. in Data Science
GPA: 3.9/4.0 • Seattle, WA
Sep 2024 – Mar 2026
B.Tech in Computer Science & Engineering (University Gold Medalist)
GPA: 4.0/4.0 • Delhi, India
Aug 2018 – Jun 2022
Built enterprise-grade AI workflows using Amazon Bedrock and SageMaker as part of a selective technical cohort, focusing on RAG patterns, foundation model selection, and inference tuning with cost-latency optimization.
Built a multilingual, multimodal RAG system for HVAC refrigerant recovery that supports WhatsApp voice and image queries, OCR-based equipment parsing, and grounded QA over 150+ technical manuals. Designed the retrieval stack with hybrid search, metadata-aware filtering, query expansion, and reranking, and benchmarked both open-source and Contextual AI pipelines on retrieval quality, latency, and hallucination detection.
Worked on delivery ETA and marketplace optimization at Uber by improving ATD prediction with ML models and production SQL pipelines, using statistical analysis to evaluate dispatch decisions under supply and food readiness constraints, and supporting A/B tests that improved ETA accuracy and reduced courier time per trip in undersupplied markets.
Improved fraud detection recall by 1.2% through XGBoost optimization, driving $20K in annual loss reduction. Developed LSTM-based cloud autoscaling models with 98.9% accuracy and built global BiLSTM forecasting systems for FX rates and inbound calls across 28 global markets.
Migrated 75% of global asset management data to a big data lake, optimizing ETL pipelines for 12% faster processing. Built high-throughput Java microservices routing 200K+ daily requests, reducing infrastructure costs by $10.2K annually.
Optimized IQ data pipelines by integrating U-Net autoencoder-based anomaly detection, improving data processing efficiency by 12.3% and enhancing real-time monitoring capabilities.
Developed a neural collaborative filtering recommender with constraint optimization for scheduling, achieving a 29% improvement in overall scheduling efficiency and system performance.
End-to-end ML systems built for production, not just prototypes.
Production-grade RAG system for technical HVAC manuals using Docling, Qdrant, and BGE embeddings. Adaptive chunking and cross-encoder reranking with rigorous LLM-as-Judge evaluation framework.
Agentic text-to-SQL Census assistant using Snowflake Cortex with cryptographic-style SQL validation, semantic vector guardrails, and a deterministic fallback router.
RAG-based investment analysis system on Databricks, serving Llama inference over Delta tables (5M+ rows) with demonstrated 60% end-to-end time reduction in manual vs. RAG-assisted benchmarks.
Interactive dashboard for breast cancer analysis featuring ResNet-based classification, Kaplan-Meier survival analysis, and exploratory data analysis with Streamlit interface.
I'm currently looking for opportunities in Applied AI/ML, Data Science, and ML Modelling. Whether you have a question, want to discuss a project, or just want to say hi, my inbox is always open!
Let's Connect