Rohith Kandula rohithkandula19

🌌 WHO AM I

I'm an AI/ML Engineer who gets a kick out of taking ideas from zero to production. Not the kind who fine-tunes a model in a notebook and calls it a day. I mean actually shipping things: APIs, cloud deployments, real users, real data.

I've spent the last 4+ years obsessing over the full stack of AI: from training PyTorch models and building RAG pipelines to wiring up Kafka event streams and deploying on GCP and AWS. If it involves LLMs, agents, or real-time ML, I've probably broken it three times and shipped it on the fourth.

Right now I'm building toward roles at companies that actually push the frontier: Anthropic, OpenAI, Google DeepMind. Not because of the hype, but because I genuinely care about where this technology goes.

rohith = {
    "location"  : "Charlotte, NC",
    "education" : "MS Information Technology, University of Cincinnati (GPA: 3.89)",
    "cert"      : "AWS Solutions Architect Associate",
    "currently" : "Building production AI systems. Shipping. Repeating.",
    "targeting" : ["Anthropic", "OpenAI", "Google DeepMind", "Meta AI"],
    "status"    : "Open to AI Engineer | GenAI | LLM | MLOps (US)",
}

🗡️ OPEN SOURCE: RONIN

Ronin: a masterless, provider-agnostic AI coding agent

A Claude-Code-style terminal agent that reads, edits, and runs your code, every write behind a diff you approve. Built on a provider-agnostic framework, so the same agent runs on Claude for top quality or free on Gemini / Cerebras / Groq / Ollama (--offline strips every network tool for air-gapped coding). The multi-provider design unlocks things a single-vendor agent structurally can't:

Consensus: run a task across several models in parallel; a judge synthesizes one cross-checked answer
Dojo: rival models each attempt the same change in isolated git worktrees; a judge picks the best diff
Kaizen: the agent finds a weakness in its own source, fixes it in a worktree, and keeps the diff only if the test suite passes

Python  ->  7-package monorepo  ->  1,376 offline tests  ->  MCP + 200 plugins

🚀 THINGS I'VE BUILT AND SHIPPED

6 production systems. All live. All mine.

🧠 RO MedRAG

Medical research is buried in millions of papers. I built an agentic RAG system that searches PubMed in real-time, pulls the right papers, and synthesizes clinical answers using Claude Sonnet, all streamed live to the user.

LangGraph -> FAISS -> PubMed API -> Claude Sonnet
FastAPI -> GCP Cloud Run -> PostgreSQL -> SSE

🎯 RO AI Recommendation Engine

I got curious about how Netflix actually works under the hood. So I built it. A proper two-tower PyTorch model with BPR loss, 4-source candidate generation, LightGBM reranking, and a Kafka event pipeline on Kubernetes.

PyTorch BPR -> FAISS IVFPQ -> LightGBM LTR -> MMR
Kafka -> ClickHouse -> Kubernetes HPA -> Terraform EKS

💩 BullshiftDetector

I got tired of seeing "Humbled and excited to announce" for the 50th time. So I built a Claude-powered detector that scores LinkedIn posts for corporate cringe (0-100), generates a one-line roast, and rewrites it like a normal human. Because someone had to.

Claude API -> FastAPI -> Next.js 14
GCP Cloud Run -> Firebase Hosting

📊 ROVA AI Forecasting

A full ML forecasting platform with a proper pipeline: PyTorch NN + LSTM ensemble, 128-dimensional feature engineering, SHAP attributions for explainability, and KL-divergence drift detection that auto-retrains when the distribution shifts. 14 screens on GCP.

PyTorch NN + LSTM -> MLflow -> Celery + Redis
Prometheus + Grafana -> GCP Cloud Run

🔍 RO Fraud Detection

Enterprise fraud detection with LangGraph agents doing the heavy lifting: real-time risk scoring, multi-step reasoning on each transaction, full audit trail on every decision. Deployed on AWS EC2 with Nginx and SSL. Actually production-grade.

LangGraph Agents -> FastAPI -> SQLAlchemy
AWS EC2 -> Docker -> Nginx -> Let's Encrypt

🚌 MR Buses

A full interstate bus booking platform with an AI chatbot that actually knows the routes, schedules, and can help you book, not just answer FAQs. Google OAuth, real-time seat booking, admin dashboard. Live on GCP.

LangChain RAG -> FastAPI -> GCP Cloud Run
Cloud SQL PostgreSQL -> Firebase Hosting

🛰️ LIVE STATUS

"All live" isn't a slogan. A GitHub Action pings every deployment and rewrites this table. Receipts, not vibes.

System	Status	Response
RO MedRAG	🟢 LIVE	248 ms
BullshiftDetector	🟢 LIVE	37 ms
MR Buses	🟢 LIVE	61 ms
RO Fraud Detection	🟢 LIVE	251 ms

_{🤖 Checked automatically every 6 hours by GitHub Actions · last run 2026-06-18 10:46 UTC}

⚡ Recently shipped

Ronin · pushed 2026-06-18 · Masterless, terminal-native coding agent (Claude Code-style: reads, edits, runs code) f…
.github · pushed 2026-06-11 · Community health defaults
agentfaceoff · pushed 2026-06-11 · Live LLM battle arena · same prompt to two models, token-by-token split-screen streamin…
rovex-ai · pushed 2026-06-11 · Enterprise fraud detection · 4-model ensemble scoring with a LangGraph investigation ag…
prior-auth-agent · pushed 2026-06-11 · Prior-authorization agent · citation-grounded payer decisions.

🛠️ STACK

flowchart LR
    subgraph Data["📥 Data & Retrieval"]
        Docs[Docs & APIs] --> VS[(FAISS · pgvector · hybrid search)]
        Events[Events] --> Kafka[Kafka] --> CH[(ClickHouse / BigQuery)]
    end
    subgraph Brain["🧠 Agents & Models"]
        VS --> LG[LangGraph: supervisor · planner-executor · HITL]
        LG <--> Claude[Claude API]
        PT[PyTorch Two-Tower / LSTM] --> LGB[LightGBM Rerank]
    end
    subgraph Evals["🧪 Evals & Observability"]
        LG --> EV[RAGAS · golden datasets · LLM-as-judge]
        EV --> LF[Langfuse: traces · cost · drift]
    end
    subgraph Serve["🚀 Serving & Infra"]
        LG --> API[FastAPI + SSE]
        LGB --> API
        API --> UI[Next.js / React]
        API --> Cloud[GCP · AWS · Azure] --> K8s[Kubernetes + Terraform]
    end

Languages

LLMs & GenAI

Agents · RAG · Vectors

Evaluation & Observability

ML & DL

Voice & Real-Time

Backend · Cloud · Infra

Data & Frontend

📊 GITHUB STATS

🕹️ THE ARCADE

This profile is playable. Yes, really.

🏆 CREDENTIALS

Credential	Details
Claude with Google Cloud's Vertex AI	Anthropic Education · May 2026
Claude 101	Anthropic · Apr 2026
AI Fluency: Framework & Foundations	Anthropic · Apr 2026
AWS Certified Solutions Architect – Associate	Amazon Web Services · May 2025 (exp. May 2028)
MS, Information Technology	University of Cincinnati · GPA 3.89 · Dec 2024
Deep Learning Specialization	Andrew Ng / DeepLearning.AI · 2024
LangChain & LLM Agents for Production	DeepLearning.AI · 2024

_{🤖 The portfolio has an AI narrator and an "Ask Rohith" chat, and it rewrites itself depending on whether you're a recruiter, an engineer, or just curious. Go say hi.}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly