Available for technical advisory

Designing Scalable Data Systems & Building Practical AI Products

Data Architect with 13+ years of experience in distributed data platforms, real-time pipelines, and cloud-native architectures. I build systems that scale — and tools that solve real-world problems.

Explore My Work View Products →

13+ Years of experience

50+ Data systems delivered

100% SLA adherence track record

Expertise

What I Do

🏗️

Data Architecture

Design scalable batch and streaming systems using Spark, Kafka, Iceberg, and modern lakehouse architectures across cloud platforms.

🚀

Engineering Leadership

Define engineering standards, CI/CD pipelines, and lead teams delivering production-grade data platforms with high reliability.

🤖

Applied AI Products

Build practical AI tools that automate real workflows — from hiring intelligence to data-driven decision systems.

What I've Built

AI Product

RecruitAI

AI-powered resume screener that evaluates and ranks candidates against job descriptions with structured scoring, strengths & gaps analysis, and hiring recommendations.

Candidate ranking with scoring logic
Strengths & gap analysis per candidate
Role-fit assessment for data engineering roles

Try RecruitAI →

⚡

Portfolio

Selected Work

Spark · Finance

Data Platform Architecture

Led end-to-end architecture of a Spark-based reconciliation platform handling financial data with strict SLA requirements.

Kafka · AWS Lambda

Real-Time Pipelines

Built event-driven orchestration using Kafka-triggered AWS Lambda controlling Airflow workflows with 100% SLA adherence.

DBT · Incremental

Incremental Data Processing

Designed DBT-based incremental pipelines with attribute-level change detection across millions of records.

Blog

Insights

Product · AI

How I Built RecruitAI

Turning resume screening into a structured decision system using LLMs and custom scoring logic.

Engineering · SQL

Recursive CTE

How to traverse a hierarchy using Recursive CTE. Example problem: Get managers all the way to CEO

Architecture

Designing Data Pipelines

Lessons from building large-scale batch & streaming systems across manufacturing and finance domains.

Show More →

About Me

JD Satapathy

I'm a Data Architect with 13+ years of experience building and scaling distributed data platforms across manufacturing and finance domains. My work focuses on designing reliable, SLA-driven systems and applying AI to solve practical engineering problems.

Apache Spark Apache Kafka AWS Snowflake DBT Airflow Python Applied AI

Let's Build Something Scalable

Open to discussions on data architecture, system design, and applied AI.

Lets Discuss LinkedIn ↗