Skip to content
View arushahmd's full-sized avatar
  • CygnusPay
  • Lahore
  • 08:59 (UTC -12:00)

Highlights

  • Pro

Block or report arushahmd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
arushahmd/README.md

Aroosh Ahmad banner

Portfolio LinkedIn Email WhatsApp

Lahore, Pakistan Remote US hours friendly

πŸ‘‹ About Me

AI Engineer focused on building production-grade AI systems β€” including LLM-powered applications, voice agents, and OCR pipelines.

I specialize in designing deterministic + hybrid AI architectures that reduce hallucination, improve reliability, and scale in real-world environments.

My work combines:

  • LLMs + RAG systems
  • Voice AI (STT/TTS pipelines)
  • Computer Vision (OCR, detection)
  • Backend systems (FastAPI, Redis, Docker)

I focus on shipping systems, not just models.

Experience Snapshot

  • Cygnus Payments - AI / Backend Engineer (Aug 2025 - Present)
  • Independent Research - LLM Research Engineer (Jan 2025 - Present)
  • Center of Language Engineering - AI Research Officer (Nov 2023 - Feb 2025)
  • Nodlays - AI Engineer (Oct 2022 - Jan 2024)

πŸš€ Featured Projects

πŸ”Ή Compass Voice

Real-time AI voice ordering system using Twilio, Deepgram, FastAPI, and Redis.

  • Deterministic state machine + NLU pipeline (low hallucination design)
  • Real-time audio streaming with sub-second latency (<1s)
  • Production-oriented architecture (session management, state routing)

πŸ”Ή Urdu OCR Pipeline

CNN-LSTM based OCR + NLP pipeline for Urdu, Arabic, and Farsi.

  • Achieved ~98% accuracy with CER reduction (3.4% β†’ 2.3%)
  • End-to-end pipeline: image β†’ text β†’ structured output
  • Handles low-resource language challenges

πŸ”Ή MenuParser AI

OCR + LLM pipeline for structured menu extraction.

  • Converts unstructured menus into structured data
  • ~99% manual effort reduction
  • Combines CV + LLM reasoning

πŸ”Ή LLM Fine-Tuning Research

Instruction fine-tuning with semantic batching (Flan-T5 + LoRA).

  • FAISS-based grouping for stable training
  • Multi-seed reproducible experiments
  • Focus on convergence stability + generalization

🧠 Engineering Focus

  • Designing reliable AI systems (not just prototypes)
  • Reducing hallucination via deterministic pipelines
  • Building scalable backend + AI integrations
  • Experimentation + research (LLMs, training strategies)

πŸ›  Core Stack

Core stack icons

AI / ML: PyTorch, TensorFlow, Transformers, Hugging Face
Backend: FastAPI, Django, REST APIs
Infra: Docker, Redis, AWS, Azure
CV / NLP: OpenCV, YOLO, PaddleOCR, spaCy

Education

  • MPhil in Artificial Intelligence, PUCIT (2024-2026)
  • BS Computer Science, Lahore Garrison University (2017-2021)

🎯 Current Focus

  • Building production-ready LLM and voice AI systems
  • Advancing research in instruction fine-tuning
  • Preparing for high-performance engineering roles (AI/ML systems)

Pinned Loading

  1. compass-voice compass-voice Public

    This is the final project version of compass restaurant call order booking. You can call and book your order through your phone with just a phone call doing conversation with our AI.

    Python 1

  2. urdu-ocr-media-utils urdu-ocr-media-utils Public

    urdu pdfs to text using ocr and relevant media utilities

    Python

  3. pose-estimation-correction-ui-emgucv pose-estimation-correction-ui-emgucv Public

    Real-time pose estimation and feedback using Emgu CV and .NET UI

    C# 1

  4. Business-Card-Named-Entity-Recognition Business-Card-Named-Entity-Recognition Public

    End-to-end pipeline for extracting and labeling entities from business card images with spaCy NER.

    Jupyter Notebook 1

  5. image-captcha-solver image-captcha-solver Public

    Jupyter Notebook