Building production-grade ML, NLP, Time Series, and LLM systems — from raw data to deployed, measurable results.
// Domain interest: Finance · FinTech · Analytics-driven products
I graduated from Delhi Technological University in 2024 with a B.Tech in Mechanical Engineering (CGPA: 7.86). In my final year, I made a deliberate call to skip campus placements and pivot to Data Science — betting on skills over credentials.
Since then, I've built 5 production-deployed projects covering the full modern DS stack: classical ML with explainability, NLP with transformer models, time series forecasting, LLM-powered RAG systems, and an MLOps pipeline with Docker and CI/CD. Every project is live, measurable, and built to production standards.
My skills are domain-agnostic — the same tools that power churn prediction work for retention in EdTech; financial forecasting techniques apply to any sequential data problem. I have a particular interest in Finance and FinTech, but I bring full-stack DS capability to any data-intensive product.
Full ML lifecycle on 10,000 bank customer records — imbalanced data handling, Bayesian hyperparameter tuning, and SHAP-powered per-customer explainability. Business framing throughout: quantifying the real cost of missing a churn event.
Finance-domain NLP using FinBERT — a BERT model pre-trained on financial text. Goes beyond accuracy: ablation studies, a caught evaluation bug, and a statistically honest sentiment-price correlation study across 5 tickers and 2 years of data.
Dual-domain forecasting — 3M+ rows of retail data plus live NSE stock prices. Every modelling decision is documented and justified: why SARIMA beat Prophet here, what the earthquake spike means, why stock MAPE is higher than retail MAPE.
Production ML engineering from pipeline design to live deployment. A leak-proof sklearn pipeline, MLflow experiment tracking, FastAPI REST service, Docker container, and GitHub Actions CI/CD that auto-deploys on every push — with failing tests blocking deployment.
End-to-end Retrieval-Augmented Generation pipeline. Upload any PDF or text document, ask questions in natural language, get answers grounded strictly in the source content — with multi-turn conversation memory and zero hallucination by design.
Open to Data Scientist and Analyst roles at startups and growth-stage companies. Strong preference for data-intensive products — Finance, FinTech, EdTech, HealthTech, SaaS analytics. Available immediately.