Projects
Selected engineering and applied machine learning projects spanning retrieval systems, geospatial modeling, and full-stack AI deployment.
Knowledge Distillation for Reasoning in MoE and Dense LLMs
Efficiently transferring reasoning from large LLMs to smaller models using parameter-efficient distillation
- Developed an offline knowledge distillation framework combining sequence-level and logit-level distillation to transfer reasoning capabilities from large teacher models to smaller student models.
- Leveraged teacher-generated chain-of-thought (CoT) reasoning traces and trained student models via supervised fine-tuning to replicate both intermediate reasoning steps and final answers.
- Applied Low-Rank Adaptation (LoRA) to enable parameter-efficient distillation, updating less than 1% of model weights while preserving performance and reducing compute requirements.
- Implemented logit-based distillation using temperature-scaled softmax and KL divergence loss to align student token distributions with teacher outputs via top-k probability reconstruction.
- Built an offline pipeline to extract teacher logits and map them to student token spaces, enabling scalable training without direct API dependency.
- Evaluated performance on SciBench and TheoremQA across multi-domain reasoning tasks, using accuracy and BERTScore to measure both correctness and reasoning alignment.
- Demonstrated that small LLMs can approximate both the reasoning process and output distributions of larger models with minimal parameter updates.
GenreBlender
Generative AI Music Mixer & Classifier
- Built a generative AI system that blends two music genres using a controllable slider (α ∈ [0,1]) and generates hybrid audio with Meta’s MusicGen.
- Designed a clean ML pipeline on the GTZAN dataset, including feature selection, scaling, label encoding, and reproducible preprocessing for inference.
- Trained a 4-layer PyTorch MLP (92% validation accuracy) using scikit-learn’s GroupShuffleSplit to prevent data leakage and predict genre probabilities from audio features.
- Developed a measurable genre-blending framework by comparing predicted probabilities against a weighted target distribution to quantify how well the blend worked via Target = (α·A) + (1−α)·B.
- Built and deployed an interactive Streamlit web application enabling real-time genre blending, probabilistic evaluation, and 10-second AI-generated audio output.
PocketRAG
Retrieval-Augmented Generation System
- Built a lightweight RAG web application enabling instant PDF summarization and document-based question answering.
- Implemented semantic search using FAISS vector indexing with embedding-based chunk retrieval.
- Integrated Gemini API for context-aware generation with dynamic prompt construction.
- Designed clean frontend UI for real-time query-response streaming and document upload.
Telematics UBI Prototype
Usage-Based Insurance Modeling Platform
- Developed a full-stack usage-based insurance system for telematics risk scoring and driver behavior analysis.
- Built XGBoost-based risk prediction models trained on telematics driving data with engineered behavioral features.
- Implemented FastAPI backend for model serving and REST endpoints with secure AWS deployment.
- Designed React frontend dashboard visualizing driver risk scores and prediction outputs.
This Website!
Personal Portfolio
- Designed and deployed a responsive personal portfolio website using Next.js, Tailwind CSS, and TypeScript to showcase my work in AI/ML
- Implemented GitHub-based CI/CD with Vercel for automated build and deployment on every commit.
- Leveraged modern React architecture (App Router) and optimized performance using built-in Next.js font and asset optimization features