Inside DrMark’s Lab

Inside DrMark’s Lab

Big Data, MapReduce, and the File Systems That Make It Work

Notes on Lecture 6: the only safe space here is HDFS replication with processing micro-batches, not microaggressions.

The Unshielded Mind's avatar
The Unshielded Mind
Sep 16, 2025
∙ Paid

This article distills the core ideas from my lecture on big data distributed computing, including MapReduce, Google-style file systems, and Hadoop’s YARN. It keeps the good stuff like numbers, failure modes, and concrete examples, and trims the hand-waving. If you have ever kicked off a job and then watched your cluster flicker with tasks while you prayed nothing melted, this is for you.

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Markgrechanik@gmail.com · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture