// ml.engineer · builder
Nikhil Mourya
ML Engineer · Builder
I optimize models for production.
Production still finds a way to optimize me.
About
I'm Nikhil Mourya — I shrink models for a living; .from_pretrained() hands you weights, not a thesis.
I'm based out of BIT Mesra. Half the "RAG stacks" I meet are a vector DB cosplaying as a product requirement; the rest is just vibes billing rent.
I can hold unbroken eye contact with a single block of code for hours, like we have beef.
Model Compression
Pruning and LoRA are quiet admissions that full fine-tuning is often billing for capacity you never needed. I like models that shrink without forgetting what they're for.
NLP & Text Systems
Summarization, classification, pipelines where fluent isn't the same as faithful. Production NLP is mostly telling confident hallucinations they can't sit with us.
ML Engineering
Training loops, eval harnesses, Flask on GCP — the glue between notebook and someone else's pager. If it only runs on my laptop, it's a demo; if it survives Friday, it's work.
Competitive Programming
Codeforces Specialist — 1400+. Graphs have a way of humbling you on a schedule; the upside is you stop trusting clever one-liners without proof.
Specialist Codeforces 1400+
Still grinding rated rounds — the graphs are optional; the ego damage isn't. Proof you can think with a clock breathing down your neck.
peak: 1400+ · still climbing
Finalist SIH — Hospital Mgmt System
99.5% uptime with real models in the loop — the 0.5% was character development. Backend stayed polite even when the night shift wasn't.
99.5% uptime · production ML
Finalist IIIT Delhi — ResNet50
94% accuracy after grid search stopped me from brute-forcing the hyperparameter void. Sometimes the boring search is the clever move.
94% acc · −30% train time