Seeking Next Technical Challenge

MILAN BAROT

Let's Talk
6+ Years in Tech

Building Data Systems
That Are Fast & Reliable.

Senior Data Engineer with 6+ years building mission-critical pipelines across healthcare, real estate, and telecom. At Blue Cross Blue Shield, I led the deliveries and modernization of complex EDI and CMS workflows using Azure cloud and legacy infrastructure — in regulated environments where data accuracy is non-negotiable. I've shipped 30+ production deployments and, most recently, extended that foundation into AI engineering with production LLM apps featuring sub-second streaming.

Data Architecture

Query optimization and lakehouse design on Azure Databricks and AWS.

AI Integration

Production LLM applications with RAG pipelines and sub-second TTFT.

Technical Expertise

Specialized in scalable data engineering, cloud infrastructure, and AI-powered production systems.

Languages

  • Python (Data / AI)
  • SQL (Databricks / Postgres)
  • JavaScript (Node.js)
  • Bash / Shell

Cloud & Infra

  • Azure Databricks / ADF
  • AWS S3 / Lambda
  • Docker
  • GitHub Actions / CI-CD

AI & Pipelines

  • LLM / RAG Pipelines
  • Apache Airflow
  • Dagster
  • Pyspark / Pandas

Portfolio Showcase

GitHub Repositories
RealTime Context Engine
FastAPI Electron OpenAI

RealTime Context Engine

Real-time AI co-pilot for technical interviews. Pairs an Electron shell with a FastAPI backend to ingest live screen + audio, inject resume context via a custom RAG pipeline, and stream grounded answers with sub-second latency.

SmartScreen may prompt you — this app uses a local developer certificate. If Windows flags it, click More infoRun anyway to proceed.

Cross-Platform Music Sync
Dagster AWS S3 Python

Cross-Platform Music Sync

Automated ELT pipeline keeping YouTube Music and Spotify libraries in sync. Persists state to S3 to prevent duplicates; runs monthly via Dagster.

Spotify Analytics
Python Power BI Airflow/Dagster Docker

Spotify Liked Songs Analytics

Analytics pipeline surfacing listening patterns from Spotify audio features. Staged to PostgreSQL, transformed with Pandas, visualized in Power BI dashboards, and orchestrated with Airflow/Dagster within Docker containers.

Let's Collaborate.

Open to technical discussions about data engineering and AI, as well as challenging collaboration opportunities.