Welcome to the Matrix

I'm

GAURAV SINGH

AWS CERTIFIED
AZURE AI CERTIFIED
ML CERTIFIED
IEEE PUBLISHED
MCP
LangGraph
RAG
Kafka
Azure AI
Python

ABOUT_ME

Gaurav Singh

AI Software Engineer

Software Engineer · AI Agents & Distributed Systems

Software Engineer with 4+ years shipping secure, high-availability cloud services and AI-driven developer features. Currently an AI Software Engineer at SUNY Buffalo, leading work on Multi-Agent Orchestration, Semantic Search, and production LLM Evaluation.

AI Agent Systems

MCP agents with LangGraph orchestration, RAG pipelines, and LLM eval harnesses that gate production releases.

40→8min Researcher lookup
90% Agent regressions caught
<100ms RAG on 1,000+ records

Distributed Systems

Event-driven microservices, Kafka pipelines, and Java / .NET platforms on AWS + Azure with on-call DRI rigor.

500K+ Daily transactions
99.99% SLA on Azure
0 Data loss at 10× spikes

Open Source & Research

Contributor to LiteLLM and AWSLabs agent-squad. Peer-reviewed paper in IEEE Xplore on agentic AI.

PR #21417 LiteLLM · 34 tests
100+ LLM APIs routed via MCP gateway
IEEE Xplore publication

Tech Stack

AI · Agents LangGraph LangChain MCP RAG Tool Calling Eval Harnesses
Languages Python TypeScript Java C# / .NET SQL
Frontend React Next.js
Data · Infra PostgreSQL Redis Kafka Docker Kubernetes AWS Azure GCP
Open to Software Engineer · AI/ML Engineer · Applied AI Engineer

PROJECTS

Innovative solutions and cutting-edge implementations

2026 · AI AGENTS

WeatherWise Agent

AI-powered weather assistant built with LangGraph and MCP (Model Context Protocol). Natural language queries for real-time weather, forecasts, air quality, and alerts. Deployed on GCP using Vertex AI with Dockerized microservices and network isolation.

LangGraph + MCP architecture SSE streaming responses GCP / Vertex AI deployed Docker microservices
FastAPI LangGraph MCP Next.js Vertex AI GCP Docker Gemini
2026 · HEALTHCARE AI

Clinical QA Intelligence

AI-powered clinical question-answering system that synthesizes evidence from ClinicalTrials.gov and PubMed. Delivers cited, confidence-scored answers to medical questions using Google Gemini for intelligent query routing and response generation.

Multi-source evidence synthesis Inline citations Confidence scoring GCP Cloud Run deployed
FastAPI Python React TypeScript PostgreSQL Google Gemini Docker GCP
2026 · ADTECH
ADTECH AI

Campaign Intelligence Assistant

AI-powered campaign analytics tool for adtech teams. Natural language chat interface that queries campaign databases, generates LCI attribution reports, compares campaigns, and recommends audience segments.

LangGraph agent w/ 5 tools Real-time SSE streaming pgvector semantic search Serverless (Vercel + Neon)
FastAPI LangGraph Next.js PostgreSQL pgvector Groq Gemini Vercel Neon
2025 · ENTERPRISE MCP
ENTERPRISE MCP

Agentic MCP Hub

Enterprise-grade MCP server that gives AI agents a unified interface to Jira, Slack, and SQL databases. Designed for secure, multi-step agentic workflows across enterprise systems.

Jira + Slack + SQL Secure workflows Enterprise-grade Multi-step agents
Python MCP SQLAlchemy Pydantic Enterprise
2025 · VOICE AI
VOICE AI

X-Voice - Voice Analytics AI

Deep learning system for predicting accent, age, and gender from voice data using pi-whisper and LoRA fine-tuning. Real-time speech analysis with high accuracy.

Real-time speech analysis LoRA fine-tuning Accent prediction Age & gender detection
Python Pi-whisper LoRA Deep Learning Speech Analysis
2024 · RESEARCH
RESEARCH

AutoRSR - Speech Disorder Detection

AI-powered system for early detection of speech disorders in children using advanced machine learning. Determines whether a child requires Speech-Language Pathologist (SLP) attention with high accuracy.

Early disorder detection SLP screening High accuracy ML Healthcare AI
Python Machine Learning Speech Analysis AI Detection Healthcare
ACTIVELY CONTRIBUTING IN 2026

OPEN_SOURCE

Real-world contributions to production open source projects

11 Pull Requests
4 Active Repos
46K+ Stars Combined
Shipped to Production

awslabs / agent-squad

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

7.5k+ Stars Python 3 PRs

BerriAI / LiteLLM

100 LLM APIs in one OpenAI-compatible interface — Python proxy, load balancing, spend tracking & observability. One of the most starred Python AI infra projects on GitHub.

39k+ Stars Python 5 PRs

aden-hive / hive

Open-source AI agent framework with parallel execution, memory management, and composable workflows for building production-grade autonomous agents.

Growing Python 2 PRs Merged · 1 Issue Filed

Pull Requests

fix(executor): enforce branch timeout and memory conflict strategy in parallel execution Bug Found & Fixed Merged
feat(events): emit GOAL_ACHIEVED event on terminal-node success Feature Proposed & Built Merged

  Wired unused branch_timeout_seconds and memory_conflict_strategy configs into the parallel execution path with 6 new tests. Also filed a feature request for GOAL_ACHIEVED event emission and implemented the solution in a separate PR.

shinzo-labs / shinzo

Complete observability platform for AI agents and MCP servers. Improve AI deployment outcomes, identify inference inefficiencies, and gain insights into real agent usage patterns.

Growing TypeScript 1 PR

Pull Requests

Session Replay & Analytics Feature

  Implemented full session replay & analytics backend — capturing MCP tool call sequences, token usage, latency, and session timelines for production AI agents.

BLOGS & POSTS

Posts and articles on AI agents, MCP, LLMOps, and the work behind them.

8 Total Pieces
6 LinkedIn Posts
2 Long-Form Articles
Active in 2026

SKILLS

Technology stack and expertise

AI · AGENTS
MCP
Advanced · 1+ Yr
AI · AGENTS
LangGraph
Advanced · 1+ Yr
LANGUAGES
Python
Expert · 4+ Yrs
LANGUAGES
TypeScript
Advanced · 3+ Yrs
LANGUAGES
Java
Expert · 4+ Yrs
CLOUD
AWS · Azure
Advanced · 3+ Yrs
FRONTEND
React / Next.js
Advanced · 3+ Yrs
DATA
PostgreSQL · Kafka
Advanced · 4+ Yrs

RESUME

Five-year career across AI systems, distributed services, and research

5+ Years
6 Roles
2 Countries
MS CS SUNY Buffalo
3.9 GPA

Education

MS in Computer Science & Engineering

Aug 2024 – Dec 2025

University at Buffalo - SUNY, New York, USA

GPA: 3.9 / 4.0

Coursework

Distributed Systems Operating Systems Deep Learning AI / ML Algorithms & Analysis Cloud Computing

Professional Experience

CURRENT

AI Software Engineer

Jan 2026 - Present

University at Buffalo, United States

Software Engineer Intern

May 2025 - Aug 2025

National AI Institute, USA

Research Assistant

Nov 2024 - Dec 2025

The Research Foundation for SUNY, United States

Software Engineer Intern

Jun 2024 - Aug 2024

Muni Health, USA

Software Engineer

Feb 2022 - Jun 2024

TCS

Software Engineer

Jan 2021 - Jan 2022

Netcore Solutions, India

TUTORIALS

Walkthroughs and explainers — Cloud, Algorithms, and DSA fundamentals.

WorkCode & Gaurav
4 Featured Videos · Active Channel
Visit Channel

TERMINAL

A scripted shell tour — whoami, education, stack, and what's running in 2026.

gaurav@portfolio · SYSTEM_INFO LIVE

CONNECT

Talk AI agents, distributed systems, or whatever you're shipping. The form, email, and LinkedIn all reach me.

Location

Buffalo, NY · USA

Open to relocate · remote-friendly

Email

ksingh.gav@gmail.com

Best for detailed conversations

Response

Usually within 24 hours

Mon–Fri · EST