Hi, my name is

Gaurav Batra.

I build scalable AI systems.

I'm a Master's student at UW-Madison specializing in Systems and AI/ML, with 3+ years of experience building production MLOps platforms and optimizing ML infrastructure.

Currently seeking full-time opportunities in AI Infrastructure, MLOps, and Distributed Systems for 2026.

About Me

Hello! I'm Gaurav, a Master's student in Computer Science at UW-Madison (GPA: 3.92/4), specializing in systems, distributed computing, and AI/ML.

I bring over 3.5 years of professional experience as an AI Platform Engineer at Couture.ai, where I built scalable MLOps infrastructure, and as an AI intern at NVIDIA, optimizing video compression with deep learning. Most recently, I was a Software Engineering Intern at Sigma Computing in Summer 2025.

I graduated with honors (CGPA: 9.48/10) from IIIT Hyderabad, where I researched bandit algorithms at the Machine Learning Lab under Prof. Naresh Manwani, publishing my work at PRICAI'21.

Beyond my academic and professional pursuits, I'm passionate about gaming. My favorites range from strategy classics like Age of Empires to action-adventure titles like Assassin's Creed.

Here are a few technologies I've been working with recently:

  • Python, C++, Golang, Rust
  • PyTorch, TensorFlow, Hugging Face
  • Kubernetes, Docker, AWS, GCP
  • FastAPI, PostgreSQL, Redis
  • Spark/PySpark, FUSE, Bash

Where I've Worked

Software Engineering Intern - AI @ Sigma Computing

May 2025 - Aug 2025
  • Built and launched a semantic search feature in Ask Sigma, allowing natural language queries to find workbooks, driving a 30% improvement over Sigma's older search system.
  • Deployed the feature as a scalable service on Kubernetes, optimizing for performance and reliability, resulting in 100+ daily queries served with 10% lower latency.
  • Partnered closely with the VP of AI and Co-founder to deploy the feature in production and showcase it in Sigma's 2025 AI product launch.
  • Tech Involved: Python, Kubernetes, Semantic Search, NLP, Production ML Systems.

Other Noteworthy Projects

Deduplication Pipeline & Multi-GPU LLM Fine-Tuning

Developed a high-throughput deduplication pipeline that uses conventional hashing and MinHash to eliminate exact and near-duplicates from over 10M documents across GitHub and Common Crawl sources. Built a multi-GPU pipeline with Hugging Face Accelerate and Transformers to fine-tune LLMs efficiently, with optimized I/O, cluster-level parallelism, checkpointing, and reproducible W&B logging.

  • Python
  • PyTorch
  • Hugging Face
  • Multi-GPU Training
  • MinHash
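
As a rough illustration of the two stages described above, here is a minimal sketch, not the project's actual code. It assumes character 5-gram shingles, 64 salted BLAKE2b hash functions, and a 0.8 similarity threshold. All function names and parameters are hypothetical choices for illustration.

```python
import hashlib


def shingles(text, k=5):
    # Character k-grams of lowercased, whitespace-normalized text (assumed scheme).
    t = " ".join(text.lower().split())
    return {t[i:i + k] for i in range(max(1, len(t) - k + 1))}


def minhash_signature(text, num_perm=64):
    # One minimum per salted hash function; the salt stands in for a permutation.
    sh = shingles(text)
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(s.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "big",
            )
            for s in sh
        ))
    return signature


def estimated_jaccard(sig_a, sig_b):
    # The fraction of matching positions estimates the Jaccard similarity.
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def deduplicate(docs, threshold=0.8):
    seen_exact = set()  # SHA-256 digests of documents already seen
    kept = []           # (document, signature) pairs that survive both stages
    for doc in docs:
        digest = hashlib.sha256(doc.encode("utf-8")).hexdigest()
        if digest in seen_exact:
            continue  # exact duplicate
        seen_exact.add(digest)
        sig = minhash_signature(doc)
        if any(estimated_jaccard(sig, other) >= threshold for _, other in kept):
            continue  # near-duplicate of a kept document
        kept.append((doc, sig))
    return [doc for doc, _ in kept]


docs = [
    "the quick brown fox jumps over the lazy dog",
    "the quick brown fox jumps over the lazy dog",
    "The quick brown fox jumps over the lazy dog",
    "a completely different sentence about filesystems",
]
unique_docs = deduplicate(docs)
```

The pairwise comparison in this sketch is quadratic in the number of kept documents. At the scale of millions of documents, signatures are usually bucketed with locality-sensitive hashing so that only likely matches are compared.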
Custom Filesystem using FUSE

Designed and implemented WFS, a custom block-based filesystem using the FUSE framework, supporting basic filesystem operations and RAID 0/1 configurations to enhance fault tolerance and performance. Built entirely in C with low-level block management.

  • C
  • FUSE
  • RAID
  • Systems Programming
Simple Token-Passing System and Parallelized Mergesort

Implemented a simple token-passing architecture and a parallelized version of mergesort in Erlang.

  • Erlang
  • Parallelized Mergesort
  • Token Passing
Gradient Descent and its variants

In this project we analyze how different variants of gradient descent minimize functions.

  • gradient-descent
  • polyak-momentum
  • nesterov
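
To make the comparison concrete, here is a minimal sketch of the three update rules named in the tags above. It is an illustration, not the project's code. The objective f(x) = 0.5 * (x - 3)^2, the learning rate, the momentum coefficient, and the step count are all assumed values.

```python
def grad(x):
    # Gradient of the illustrative objective f(x) = 0.5 * (x - 3)^2.
    return x - 3.0


def gradient_descent(x0, lr=0.1, steps=500):
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)  # plain step along the negative gradient
    return x


def polyak_momentum(x0, lr=0.1, beta=0.9, steps=500):
    x, v = x0, 0.0
    for _ in range(steps):
        v = beta * v - lr * grad(x)  # velocity accumulates past gradients
        x += v
    return x


def nesterov(x0, lr=0.1, beta=0.9, steps=500):
    x, v = x0, 0.0
    for _ in range(steps):
        # Gradient is evaluated at the look-ahead point x + beta * v.
        v = beta * v - lr * grad(x + beta * v)
        x += v
    return x


results = {
    "gradient-descent": gradient_descent(0.0),
    "polyak-momentum": polyak_momentum(0.0),
    "nesterov": nesterov(0.0),
}
```

On this one-dimensional quadratic all three variants approach the minimizer x = 3. The differences between them show up in the path taken and the rate of convergence, which depend on the function being minimized.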
Simple Server Architecture in Java

Uses RMI (Remote Method Invocation) in Java to implement a simple single-server architecture with support for multiple clients. The server exposes functions to store a graph, add edges to it, and compute its minimum spanning tree.

  • Minimum Spanning Tree
  • RMI
Implementation of Undo Logging and Recovery

This project contains an implementation of undo logging and recovery in Python 3.5.

  • database-systems
  • Undo-Logging and Recovery

What's Next?

Get In Touch

I'm currently seeking full-time opportunities in AI Infrastructure, MLOps, and Distributed Systems starting in 2026. Whether you have an opportunity, want to collaborate on a project, or just want to chat about ML systems and distributed computing, feel free to reach out!

My inbox is always open, and I'll do my best to get back to you!

Say Hello