Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 13: Reductions and their Implementation
Abstract
Structuring Parallel Algorithms
Topics:
- Parallel Reductions
- Parallel Prefix Sum
- Relevance of Scan
- Application of Scan
- Scan on the CPU
- First attempt Parallel Scan Algorithm
- Work efficiency considerations
- Improving Efficiency
- Use Padding to reduce bank conflicts
- Global Synchronization in CUDA
Credits
These lectures were breezed by Carl Pearson and Daniel Borup and then reviewed, edited, and uploaded by Omar Sobh.
Sponsored by
NCN@illinois
Cite this work
Researchers should cite this work as follows: