Seeking Next Technical Challenge

MILAN BAROT

Let's Talk
6+ Years in Tech

Building Data Systems
That Are Fast & Reliable.

Senior Data Engineer with 6+ years building mission-critical pipelines across healthcare, real estate, and telecom. At Blue Cross Blue Shield, I led the deliveries and modernization of complex EDI and CMS workflows using Azure cloud and legacy infrastructure — in regulated environments where data accuracy is non-negotiable. I've shipped 30+ production deployments and, most recently, extended that foundation into AI engineering with production LLM apps featuring sub-second streaming.

Data Architecture

Query optimization and lakehouse design on Azure Databricks and AWS.

AI Integration

Production LLM applications with RAG pipelines and sub-second TTFT.

Technical Expertise

Specialized in scalable data engineering, cloud infrastructure, and AI-powered production systems.

Languages

  • Python (Data / AI)
  • SQL (Databricks / Postgres)
  • JavaScript (Node.js)
  • Bash / Shell

Cloud & Infra

  • Azure Databricks / ADF
  • AWS S3 / Lambda
  • Docker
  • GitHub Actions / CI-CD

AI & Pipelines

  • LLM / RAG Pipelines
  • Apache Airflow
  • Dagster
  • Pyspark / Pandas

Portfolio Showcase

GitHub Repositories
Kalshi Custom Predictions
PySpark DuckDB RAG / LLM Docker

Kalshi Custom Predictions

Real-time data engineering pipeline identifying mispriced contracts on the Kalshi prediction market. Ingests live RSS feeds and GDELT events through a Medallion architecture (Bronze → Silver → Gold), indexes news via ChromaDB RAG, and issues Buy/Sell signals using Groq's Llama 3.3 with Kelly Criterion position sizing.

RealTime Context Engine
FastAPI Electron OpenAI

RealTime Context Engine

Real-time AI co-pilot for technical interviews. Pairs an Electron shell with a FastAPI backend to ingest live screen + audio, inject resume context via a custom RAG pipeline, and stream grounded answers with sub-second latency.

SmartScreen may prompt you — this app uses a local developer certificate. If Windows flags it, click More infoRun anyway to proceed.

Cross-Platform Music Sync
Dagster AWS S3 Python

Cross-Platform Music Sync

Automated ELT pipeline keeping YouTube Music and Spotify libraries in sync. Persists state to S3 to prevent duplicates; runs monthly via Dagster.

Spotify Analytics
Python Power BI Airflow/Dagster Docker

Spotify Liked Songs Analytics

Analytics pipeline surfacing listening patterns from Spotify audio features. Staged to PostgreSQL, transformed with Pandas, visualized in Power BI dashboards, and orchestrated with Airflow/Dagster within Docker containers.

BCBS System Architecture
Azure ADF SSIS EDI 834/837 SQL Server

Healthcare EDI & CMS Compliance Platform

Architected a consolidated EDI 834/837 framework at BCBS Arizona, automating generation and delivery of 35+ distinct CMS file types through three core SSIS packages. Orchestrated legacy payer data migration to HealthRules Payer on Azure via ADF, restored critical pipelines after the 2024 CHC system breach, and maintained 100% federal compliance across 25+ production deployments in a regulated healthcare environment.

Let's Collaborate.

Open to technical discussions about data engineering and AI, as well as challenging collaboration opportunities.