Keyush

Keyush Shah

Resume | Google Scholar | Github | LinkedIn | Email ID | Portfolio of Projects

Hello, I work as an AI Researcher at the Computational Social Listening Lab where I am advised by Prof. Lyle Ungar and Prof. Sharath Guntuku.

My projects span high-impact domains including healthcare, finance, and social media, with a core emphasis on creating models that are not only accurate, but also explainable and deployable at scale. I’m passionate about leveraging ML, AI, and software engineering to deliver results that drive business value and societal impact.

🚀 I am currently actively seeking full-time opportunities in applied ML, AI-driven product development, or software engineering — especially within mission-driven and tech-forward teams.

A Sneak Peek into my Value Proposition

My academic journey began with a strong foundation Statistics and Mathematics where I developed the acumen in Analytics, Inference and data driven modeling - it laid the groundwork for my foray into Data Science. Now I am focused on exploring the vast potential of Generative AI and Machine Learning in solving real-world challenges and leveraging data to drive impactful insights.

  • I bring a strong track record of delivering real-world impact through data science and analytics across diverse industries. From optimizing marketing performance with media mix models to driving digital transformation through predictive modeling, my work has consistently translated data into strategic value.
  • I've built scalable data pipelines, deployed machine learning solutions in production, and developed dashboards that inform key business decisions. Whether it's enhancing campaign ROI, streamlining operations, or personalizing customer experiences, I focus on using data to solve high-impact problems and create measurable outcomes.
  • Besides, I have experience with data engineering tools and cloud platforms, particularly the Azure suite and AWS, utilizing services such as Azure Data Factory, Azure Databricks and AWS Sagemaker to build scalable data pipelines and ML models for AI-driven applications.
  • Kindly take a look at my Portfolio of Projects.

    Professional Experience

    Aug 2025 - Present

    Omnicom

    Software Engineer, ML
    2024 - 2025

    Penn Medicine

    AI Researcher
    2024

    Universal Media

    Data Science Intern
    2022 - 2023

    IIFL Finance Ltd

    Assistant Manager

    Research

    Click the link below to take a look at my research interests and some questions that interest me.

    Research Interests


    Check out my ongoing projects in the section below.

    Current Research Projects


    Publication: Pre-prints


    Portfolio of Selected Projects

    ML Systems / MLOps
    🤖 AI Engineering / LLMs
    👁 Computer Vision
    🎨 MultiModal / NLP
    🖥 Systems & Data Engineering
    📈 Probability & Statistical Modeling

    Project Descriptions

    Particle Agent

    FastAPI · LangChain · Pinecone · React · TypeScript
    • Engineered RAG for product queries with OpenAI embeddings and Pinecone vector search for >97% retrieval accuracy and <200ms response latency.
    • Implemented session-based memory with >95% accurate follow-up handling and robust multi-turn interactions.
    • Developed a real-time chat frontend using React, Vite, and TypeScript.
    Back to Portfolio

    Ride Duration Prediction

    XGBoost · Hyperopt · MLflow · Airflow · Docker · Flask
    • Tuned XGBoost and Random Forest models using Hyperopt, improving RMSE by ~30%.
    • Designed a modular ML pipeline using Apache Airflow with Docker + CeleryExecutor.
    • Deployed the trained model as a REST API using Flask, containerized with Docker.
    Back to Portfolio

    Image Reconstruction using Diffusion Transformers

    PatchVAE · Diffusion Transformer · CelebA
    • Developed a PatchVAE model to encode facial features from the CelebA dataset, trained a Diffusion model on VAE latent representations.
    • Achieved an FID score of 14.2, producing high-quality realistic face images.
    Back to Portfolio

    Instance Segmentation: By Location

    ResNet · FPN · SOLO · PyTorch
    • Implemented an instance segmentation framework inspired by SOLO with ResNet backbone and Feature Pyramid Network.
    • End-to-end trainable system eliminating the need for bounding boxes or complex post-processing.
    Back to Portfolio

    Improving Depth Estimation of DINOv2

    DINOv2 · CNN Adapter · Phase Correlation · ORB
    • Integrated a CNN-based adapter achieving a 23.8% reduction in MSE, outperforming vanilla DINOv2-base while being faster.
    • Combined temporal information across frames to reduce per-frame errors and enhance depth map accuracy.
    Back to Portfolio

    FitBit ChatBot

    Django · PostgreSQL · LangChain
    • Designed a Django-based AI chatbot for health-related conversations with PostgreSQL for patient data management.
    • Implemented dynamic entity extraction for medications and appointment preferences with automated escalation.
    Back to Portfolio

    Multithreaded Image Blurring with POSIX Threads

    C++ · POSIX Threads · Shared Memory
    • Achieved a 2.8x speedup compared to the sequential baseline (from 3251 ms to 1165 ms) using 4 threads.
    • Designed lock-free parallelism via shared memory without mutexes, restricting writes to thread-local output regions.
    Back to Portfolio

    Scalable ETL Pipelines with Microsoft Azure

    Azure Data Factory · Databricks · ADLS Gen2 · Synapse
    • Implemented a robust ETL pipeline using Azure Data Factory for automated data ingestion from HTTP and SQL sources.
    • Transformed data via Azure Databricks and loaded into Synapse Analytics for downstream analysis.
    Back to Portfolio

    Analyzing Consumer Behavior in Mobile Plan Selection Using Statistical Modeling

    NBD · Gamma Distribution · Chi-Square · Q-Q Plots
    • Fitted Shifted NBD, Truncated NBD, and Gamma models to understand consumer behavior in mobile plan selection.
    • Evaluated models using Q-Q plots, chi-square likelihood ratio tests, and p-value assessments.
    Back to Portfolio

    Deepfake Detection

    VideoLaMA · BLIP · LLaVA · Kendall Tau
    • Evaluated state-of-the-art Video Vision-Language Models for deepfake detection with synthetic data and annotated explanations.
    • Employed Kendall Tau’s correlation and reliability analysis to verify inter-annotator agreement on deformation labels.
    Back to Portfolio

    Traversability Estimation

    Semantic Segmentation · Attention-enhanced FCN
    • Developed a terrain classification model using Semantic Segmentation and an attention-enhanced FCN, achieving a 2% improvement in IoU.
    • Enhanced off-road navigation for autonomous vehicles by optimizing path planning and terrain adaptability.
    Back to Portfolio

    Bangalore House Prediction

    sklearn · Linear Regression · GridSearchCV · Flask
    • Built a ML model with linear regression and GridSearchCV for hyperparameter tuning, evaluated with k-fold cross-validation.
    • Deployed via Flask API with an HTML/CSS/JS frontend for dynamic price predictions.
    Back to Portfolio

    Topic Modelling with Latent Dirichlet Allocation

    Gensim · NLTK · SpaCy · LDA
    • Implemented LDA for topic modeling, classifying text into topics based on underlying word distributions.
    • Computed coherence and perplexity scores to optimize topic count; generated interactive visualizations.
    Back to Portfolio