Available for consulting

Designing Scalable Data Systems & Building Practical AI Products

Data Architect with 13+ years of experience in distributed data platforms, real-time pipelines, and cloud-native architectures. I build systems that scale — and tools that solve real-world problems.

13+ Years of experience
50+ Data systems delivered
100% SLA adherence track record

What I Do

🏗️

Data Architecture

Design scalable batch and streaming systems using Spark, Kafka, Iceberg, and modern lakehouse architectures across cloud platforms.

🚀

Engineering Leadership

Define engineering standards, CI/CD pipelines, and lead teams delivering production-grade data platforms with high reliability.

🤖

Applied AI Products

Build practical AI tools that automate real workflows — from hiring intelligence to data-driven decision systems.

Selected Work

Spark · Finance

Data Platform Architecture

Led end-to-end architecture of a Spark-based reconciliation platform handling financial data with strict SLA requirements.

Kafka · AWS Lambda

Real-Time Pipelines

Built event-driven orchestration using Kafka-triggered AWS Lambda controlling Airflow workflows with 100% SLA adherence.

DBT · Incremental

Incremental Data Processing

Designed DBT-based incremental pipelines with attribute-level change detection across millions of records.

Insights

Product · AI

How I Built RecruitAI

Turning resume screening into a structured decision system using LLMs and custom scoring logic.

Read more →

Engineering · Spark

Failures in Spark Systems

Real production issues, root causes, and practical patterns to prevent them in large-scale pipelines.

Read more →

Architecture

Designing Data Pipelines

Lessons from building large-scale batch & streaming systems across manufacturing and finance domains.

Read more →
JS

JD Satapathy

I'm a Data Architect with 13+ years of experience building and scaling distributed data platforms across manufacturing and finance domains. My work focuses on designing reliable, SLA-driven systems and applying AI to solve practical engineering problems.

Apache Spark Apache Kafka AWS Snowflake DBT Airflow Python Applied AI

Let's Build Something Scalable

Open to discussions on data architecture, system design, and applied AI.