Principal-level Architect & Data Engineer with 8+ years designing, building, and scaling mission-critical platforms. Expert in distributed systems, AI infrastructure, and the design of data-intensive applications, translating complex challenges into scalable, revenue-generating solutions. Experienced across full-stack data architecture — from real-time APIs to enterprise data lakehouse — with consistent delivery of business impact in multiple domains.
Real-Time Peer Discovery Engine
• Re-engineered the flagship peer matching system that slashed execution time from 7 minutes
to 10,000+ users
• Migrated from Elasticsearch to Qdrant vector database achieving 10x faster similarity search
across 20M+ embeddings
• Built enterprise web scraping platform processing 8M+ websites with structured data extraction
capabilities
Enterprise-Scale AI Infrastructure
• Architected, built, and deployed a distributed multi-cloud pipeline (Azure + GCP) with 80
concurrent batch jobs across 10 regions
• Processed 120B tokens in 2 hours using Gemini Flash 2.5 on Vertex AI - a scale previously
impractical, unlocking new business opportunities such as semantic enrichment of millions of
companies, real-time peer discovery, supplier/ownership mapping, product tagging and much
more
Data Lakehouse Architecture
• Built ACID-compliant data lakehouse using Spark, Delta Lake, Hive, and Azure Blob Storage
on Kubernetes
• Implemented medallion architecture (bronze -> silver -> gold -> platinum -> diamond) for datalake
• Automated schema onboarding and data quality validation reducing debugging effort by 60%
and trusted reports for customers
API Performance Engineering
• Reduced API response latency from 6s to replacing it with targeted, efficient data access patterns
• Streamlined authentication by shifting from DB lookups to lightweight token-based validation,
cutting unnecessary database load
• Consolidated nested queries and applied selective caching, removing redundant data fetches
• Achieved these improvements in just 2 weeks, surpassing a year of prior optimization efforts
and delivering a step-change in user experience
Architecture & Design: Multi-Tenant SaaS, Distributed Systems, Data Lakehouse (Medallion), Event-Driven Architecture, Real-Time APIs, System Optimization