Introduction to 14 412 Simple Hash Join
Exploring 14 412 Simple Hash Join reveals several interesting facts. My book "Patterns in Data Management" is now available both as an ebook or a print book (with color graphics!).
14 412 Simple Hash Join Comprehensive Overview
In this informative video, we explore one of the key concepts in Apache Spark's data processing engine, the Shuffle Andy Pavlo (https://www.cs.cmu.edu/~pavlo/) Slides: https://15445.courses.cs.cmu.edu/fall2025/slides/12- There are three key algorithms use to combine rows from two tables: * Nested Loops *
Video lecture, part of the "DB2" course, U Tübingen, summer semester 2020. Read by Torsten Grust.
Summary & Highlights for 14 412 Simple Hash Join
- Hash
- Prof. Andy Pavlo (http://www.cs.cmu.edu/~pavlo/) Slides: https://15721.courses.cs.cmu.edu/spring2020/slides/17-hashjoins.pdf ...
- The classical wisdom is that
- ... divided our problem into separate partitions that we can now join up using essentially naive
- Prof. Andy Pavlo (https://www.cs.cmu.edu/~pavlo/) Slides: https://15721.courses.cs.cmu.edu/spring2023/slides/11-hashjoins.pdf ...
Stay tuned for more updates related to 14 412 Simple Hash Join.