Puzzle 5: Broadcast
Overview
Implement a kernel that broadcast-adds vector a and vector b and stores the result in the 2D matrix out.
Note: You have more threads than positions, so each thread must check that its row and column are in bounds before writing.
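To make the task concrete, here is a minimal CPU sketch in plain Python, not the puzzle's actual kernel. It assumes that a varies along columns and b along rows, so that out[row][col] = a[col] + b[row]; that orientation is an assumption, as are the helper names `broadcast_add_kernel` and `launch`. The nested loops stand in for a 2D thread grid that is deliberately larger than the matrix.

```python
def broadcast_add_kernel(out, a, b, size, thread_x, thread_y):
    """Work done by one simulated thread at (thread_x, thread_y)."""
    col, row = thread_x, thread_y
    # Guard: threads outside the size x size matrix must do nothing.
    if row < size and col < size:
        # Assumed orientation: a is indexed by column, b by row.
        out[row][col] = a[col] + b[row]


def launch(a, b, threads_per_side):
    """Run the kernel once per simulated thread (hypothetical helper)."""
    size = len(a)  # assumes len(a) == len(b)
    out = [[0] * size for _ in range(size)]
    for thread_y in range(threads_per_side):
        for thread_x in range(threads_per_side):
            broadcast_add_kernel(out, a, b, size, thread_x, thread_y)
    return out


# 3 x 3 simulated threads for a 2 x 2 output: five threads are out of bounds.
result = launch([0, 1], [0, 10], threads_per_side=3)
print(result)  # [[0, 1], [10, 11]]
```

Without the bounds check, the extra threads would index past the end of a, b, and out.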
Key concepts
- Broadcasting vectors to matrix
- 2D thread management
- Mixed dimension operations
- Memory layout patterns
Implementation approaches
🔰 Raw memory approach
Learn how to handle broadcasting with manual memory indexing.
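As a rough illustration of manual indexing, the sketch below stores the matrix in a flat buffer and computes each offset by hand. It assumes a row-major layout (offset = row * width + col) and the same orientation as above, with a indexed by column and b by row. Both are assumptions, and `flat_index` and `broadcast_add_flat` are hypothetical names.

```python
def flat_index(row, col, width):
    """Offset of (row, col) in a row-major buffer with the given width."""
    return row * width + col


def broadcast_add_flat(a, b):
    """Broadcast-add two equal-length vectors into a flat size*size buffer."""
    size = len(a)  # assumes len(a) == len(b)
    out = [0] * (size * size)
    for row in range(size):
        for col in range(size):
            # Each vector is read with a single index; only out needs both.
            out[flat_index(row, col, size)] = a[col] + b[row]
    return out


flat = broadcast_add_flat([0, 1], [0, 10])
print(flat)  # [0, 1, 10, 11]
```

The index arithmetic is the part that is easy to get wrong by hand: swapping row and col in `flat_index` silently transposes the result.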
📐 LayoutTensor Version
Use LayoutTensor to handle mixed-dimension operations.
💡 Note: Compare how LayoutTensor simplifies broadcasting relative to manual indexing.