Debanna Das

I am a Visiting Scholar at the University of Pennsylvania's Advanced Cardiovascular Imaging Lab working on cardiac imaging and machine learning. I'm fortunate to be advised by Prof. Walter Witschey and to be supported by the Reliance Foundation Undergraduate Scholars Program.

My work spans AI research at Penn and CMU alongside building production systems at startups, bridging rigorous experimentation with real-world deployment.

Research

My research contributes to AI4Healthcare and AI4Science.

Advanced Cardiovascular Imaging Lab, University of Pennsylvania

SUPERVISOR: Prof. Walter Witschey

Hemothorax—internal chest bleeding from trauma—requires rapid diagnosis, but manual CT analysis is slow and subject to human variability. I developed an AI-assisted detection system at Penn Medicine to help radiologists identify these critical bleeds more quickly and consistently. The system combines multiple deep learning architectures working together: 2D models analyze slice-by-slice patterns while 3D models capture full volumetric context. By merging their predictions, this ensemble approach outperformed both individual models and fine-tuned versions of existing medical imaging systems. The work shows that architectural diversity, not just more data, can drive better clinical AI performance. This reduces diagnostic delays in emergency settings and gives radiologists a reliable AI assistant for detecting these life-threatening bleeds.

Cryo-ET discovery visualization from CMU Xu Lab

Xu Lab, Carnegie Mellon University

SUPERVISOR: Prof. Min Xu

Cryo-electron tomography generates millions of 3D images of molecular structures inside cells, but manually labeling these volumes is prohibitively slow, which creates a bottleneck for biological discovery. I developed and compared two unsupervised discovery pipelines to automatically identify and group similar molecular structures without requiring human annotations. The first approach used transfer learning: a pretrained 3D ResNet-18 from video recognition extracted volumetric features, followed by t-SNE for dimensionality reduction and K-Means clustering to discover structural groups. The second approach trained a custom 3D SimCLR encoder from scratch using contrastive self-supervised learning on the cryo-ET data itself, then applied UMAP for manifold learning and HDBSCAN for density-based clustering that automatically handles noise and determines cluster counts. The self-supervised approach outperformed the transfer learning approach, nearly tripling cluster coherence and producing substantially better-separated molecular groupings. This enables high-throughput structural discovery in areas where expert annotation is the rate-limiting factor for scientific progress.

Industry Experience

I build production-grade AI systems, combining LLM workflows, real-time APIs, and ML pipelines with the infrastructure, evaluation, and reliability engineering needed to ship them to users.

MIT Media Lab workspace

MIT Media Lab

Technical Contributions

CI/CD Pipeline
Static Analysis
gRPC Backend
MPI Deployment
Test Coverage
Code Quality

Contributed to SONAR (Self-Organizing Network of Aggregated Representations) at MIT Media Lab, a decentralized federated learning framework enabling peer-to-peer collaborative model training across distributed nodes without centralizing raw data. Refactored core algorithm modules and utility services to enforce consistent coding standards, introducing static analysis tooling via Pylint and Pyright with enforced score thresholds and pre-commit hooks to catch regressions at the source. Hardened the CI/CD pipeline by authoring GitHub Actions workflows covering automated linting, type checking, and functional end-to-end training tests over the gRPC communication backend — ensuring new contributions remained reliable as the codebase scaled across both MPI and gRPC deployment environments.

GENIE AI workspace screenshot

GENIE AI

At GENIE AI, I worked across the full ML development lifecycle—from studying LLM architecture fundamentals (transformer attention mechanisms, positional encodings, and layer normalization strategies) to fine-tuning GPT models for domain-specific applications in healthcare diagnostics and educational content generation. Developed systematic evaluation frameworks that benchmarked fine-tuned outputs against ground-truth behaviors across precision, recall, and domain-specific accuracy metrics. Led the system architecture design for GenCanvas, defining the end-to-end data flow: user natural language inputs processed through spaCy NLP pipelines for entity extraction, routed to domain-specific GPT endpoints with custom system prompts, validated against JSON schemas, and rendered as interactive canvas elements via real-time WebSocket connections.

Routechef

Engineered two production-grade systems at Routechef. First, an AI-powered conversational travel assistant built on a LangGraph state-machine architecture, where user messages flow through modular pipeline nodes handling intent detection, structured query extraction via GPT-4o-mini, context-aware route caching with MD5-hashed keys and LLM-driven cache invalidation decisions, and real-time flight and train data retrieval over WebSocket APIs — with a two-tier geocoding system backed by a local database of 8,000+ Indian railway stations falling back to the Google Maps API. Second, a Python-based chat analytics engine that ingested raw conversation logs through a multi-stage ETL pipeline, performing session reconstruction, Unicode-range language detection for Hindi and Tamil, quantile-based user engagement bucketing, and rule-based intent classification — feeding downstream modules that trained a Random Forest response-type classifier, ran K-Means behavioral clustering across six user-level features with PCA visualization, and computed DAU/WAU/MAU stickiness metrics surfaced through interactive Plotly dashboards.

Projects

Beyond my core research and industry work, I build AI applications across diverse domains; from customer support to content creation, to explore new techniques and strengthen my engineering skills.

Customer Support Copilot

Built an AI assistant pipeline that classifies tickets, scores sentiment, and drafts grounded responses via a RAG workflow.

TikTok Content Analysis and Script Generation System

Developed an AI tool that monitors TikTok trends, mines transcripts, and outputs ready-to-use creator scripts.

AI-Driven Storytelling System

Created a multi-stage storytelling pipeline that chains GPT-4, LangChain, and automated consistency checks for polished narratives.

Leadership & Community Engagement

I led student technical and community organizations at BITS Pilani.

BITS Alumni Association (BITSAA International)

Team Lead

Led strategic initiatives to grow global alumni engagement across 50+ countries, coordinating a distributed team to launch mentorship programs and regional chapters that strengthened the BITS alumni network.

Team BITS

Technical Team Lead

Led a 12-person development team to deliver five flagship campus projects including event management systems and student portals, establishing code review processes and sprint planning that kept technical quality high as the team scaled.

Contact Me