Keyush

Keyush

Resume | Google Scholar | Github | LinkedIn | Email ID | Portfolio of Projects

⇧ Top

Hello folks! I am a Software Engineer who likes to build intelligent systems at scale. I have a strong background in product focussed software develeopment (now LLM-powered, ofcourse!), - designing RAG architectures, scalable ML pipelines,and production AI services that solve real problems for users and businesses or help in modernizing software systems.

I truly believe that writing code is not the bottleneck with the advancements in LLMs, but the real interesting problem lies at the intersection of architectural design, systems thinking and applied AI where architectural decisions determine whether a system feels magical or broken. I enjoy building and transitioning from research-grade protoype models to product-grade systems.

Previously, I used to work as an AI Researcher at the Computational Social Listening Lab and Penn Medicine where I was advised by Prof. Lyle Ungar and Prof. Sharath Guntuku.

What I Work On

LLM & RAG Systems
ML Engineering
Data Engineering & Cloud
Full-Stack AI Applications

I thrive at the intersection of applied AI and software engineering — whether optimizing inference latency, reducing pipeline runtimes, or deploying models that drive measurable business outcomes.

Open to opportunities in AI/ML Engineering, Data Science, and Software Engineering roles where I can build impactful, production-grade AI systems.

Kindly take a look at my Portfolio of Projects.

Professional Experience

Aug 2025 - Present

Omnicom

Software Engineer, ML
2024 - 2025

Penn Medicine

AI Researcher
2024

Universal Media

Data Science Intern
2022 - 2023

IIFL Finance Ltd

Assistant Manager

Research

Click the link below to take a look at my research interests and some questions that interest me.

Research Interests


Check out my ongoing projects in the section below.

Current Research Projects


Publication: Pre-prints


Portfolio of Selected Projects

ML Systems / MLOps
🤖 AI Engineering / LLMs
👁 Computer Vision
🎨 MultiModal / NLP
🖥 Systems & Data Engineering
📈 Probability & Statistical Modeling

Project Descriptions

Particle Agent

FastAPI · LangChain · Pinecone · React · TypeScript
  • Engineered RAG for product queries with OpenAI embeddings and Pinecone vector search for >97% retrieval accuracy and <200ms response latency.
  • Implemented session-based memory with >95% accurate follow-up handling and robust multi-turn interactions.
  • Developed a real-time chat frontend using React, Vite, and TypeScript.
Back to Portfolio

Ride Duration Prediction

XGBoost · Hyperopt · MLflow · Airflow · Docker · Flask
  • Tuned XGBoost and Random Forest models using Hyperopt, improving RMSE by ~30%.
  • Designed a modular ML pipeline using Apache Airflow with Docker + CeleryExecutor.
  • Deployed the trained model as a REST API using Flask, containerized with Docker.
Back to Portfolio

Image Reconstruction using Diffusion Transformers

PatchVAE · Diffusion Transformer · CelebA
  • Developed a PatchVAE model to encode facial features from the CelebA dataset, trained a Diffusion model on VAE latent representations.
  • Achieved an FID score of 14.2, producing high-quality realistic face images.
Back to Portfolio

Instance Segmentation: By Location

ResNet · FPN · SOLO · PyTorch
  • Implemented an instance segmentation framework inspired by SOLO with ResNet backbone and Feature Pyramid Network.
  • End-to-end trainable system eliminating the need for bounding boxes or complex post-processing.
Back to Portfolio

Improving Depth Estimation of DINOv2

DINOv2 · CNN Adapter · Phase Correlation · ORB
  • Integrated a CNN-based adapter achieving a 23.8% reduction in MSE, outperforming vanilla DINOv2-base while being faster.
  • Combined temporal information across frames to reduce per-frame errors and enhance depth map accuracy.
Back to Portfolio

FitBit ChatBot

Django · PostgreSQL · LangChain
  • Designed a Django-based AI chatbot for health-related conversations with PostgreSQL for patient data management.
  • Implemented dynamic entity extraction for medications and appointment preferences with automated escalation.
Back to Portfolio

Multithreaded Image Blurring with POSIX Threads

C++ · POSIX Threads · Shared Memory
  • Achieved a 2.8x speedup compared to the sequential baseline (from 3251 ms to 1165 ms) using 4 threads.
  • Designed lock-free parallelism via shared memory without mutexes, restricting writes to thread-local output regions.
Back to Portfolio

Scalable ETL Pipelines with Microsoft Azure

Azure Data Factory · Databricks · ADLS Gen2 · Synapse
  • Implemented a robust ETL pipeline using Azure Data Factory for automated data ingestion from HTTP and SQL sources.
  • Transformed data via Azure Databricks and loaded into Synapse Analytics for downstream analysis.
Back to Portfolio

Analyzing Consumer Behavior in Mobile Plan Selection Using Statistical Modeling

NBD · Gamma Distribution · Chi-Square · Q-Q Plots
  • Fitted Shifted NBD, Truncated NBD, and Gamma models to understand consumer behavior in mobile plan selection.
  • Evaluated models using Q-Q plots, chi-square likelihood ratio tests, and p-value assessments.
Back to Portfolio

Deepfake Detection

VideoLaMA · BLIP · LLaVA · Kendall Tau
  • Evaluated state-of-the-art Video Vision-Language Models for deepfake detection with synthetic data and annotated explanations.
  • Employed Kendall Tau’s correlation and reliability analysis to verify inter-annotator agreement on deformation labels.
Back to Portfolio

Traversability Estimation

Semantic Segmentation · Attention-enhanced FCN
  • Developed a terrain classification model using Semantic Segmentation and an attention-enhanced FCN, achieving a 2% improvement in IoU.
  • Enhanced off-road navigation for autonomous vehicles by optimizing path planning and terrain adaptability.
Back to Portfolio

Bangalore House Prediction

sklearn · Linear Regression · GridSearchCV · Flask
  • Built a ML model with linear regression and GridSearchCV for hyperparameter tuning, evaluated with k-fold cross-validation.
  • Deployed via Flask API with an HTML/CSS/JS frontend for dynamic price predictions.
Back to Portfolio

Topic Modelling with Latent Dirichlet Allocation

Gensim · NLTK · SpaCy · LDA
  • Implemented LDA for topic modeling, classifying text into topics based on underlying word distributions.
  • Computed coherence and perplexity scores to optimize topic count; generated interactive visualizations.
Back to Portfolio