“Accelerating DSP/ML Tasks with CUDA: A Comparison of Single-Core and Parallel Processing Implementations”

For this assignment, select a computationally intensive two- or three-dimensional DSP/ML task, such as image convolution, and implement it both on a single-core processor and on parallel processors using CUDA.
Measure the speed-up achieved by the CUDA implementation, and compare the power consumption of the two implementations.
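
As a starting point for the single-core side, the following is a minimal C++ sketch of a 2D convolution baseline with wall-clock timing and a speed-up ratio. The names `conv2d_cpu`, `time_ms`, and `speedup` are hypothetical and not part of the assignment. The "valid" output size (no padding, stride 1) and the row-major `float` layout are assumptions made for illustration.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <vector>

// Single-threaded "valid" 2D convolution (no padding, stride 1).
// Assumed layout: image is H x W and kernel is K x K, both row-major.
// The output is (H - K + 1) x (W - K + 1), also row-major.
std::vector<float> conv2d_cpu(const std::vector<float>& image,
                              std::size_t H, std::size_t W,
                              const std::vector<float>& kernel,
                              std::size_t K) {
    assert(image.size() == H * W);
    assert(kernel.size() == K * K);
    assert(K >= 1 && K <= H && K <= W);

    const std::size_t outH = H - K + 1;
    const std::size_t outW = W - K + 1;
    std::vector<float> out(outH * outW, 0.0f);

    for (std::size_t y = 0; y < outH; ++y) {
        for (std::size_t x = 0; x < outW; ++x) {
            float acc = 0.0f;
            for (std::size_t ky = 0; ky < K; ++ky) {
                for (std::size_t kx = 0; kx < K; ++kx) {
                    // True convolution flips the kernel in both axes;
                    // drop the flip to get cross-correlation instead.
                    acc += image[(y + ky) * W + (x + kx)] *
                           kernel[(K - 1 - ky) * K + (K - 1 - kx)];
                }
            }
            out[y * outW + x] = acc;
        }
    }
    return out;
}

// Wall-clock time of one call to f, in milliseconds.
template <typename F>
double time_ms(F&& f) {
    const auto start = std::chrono::steady_clock::now();
    f();
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Speed-up of the parallel version relative to the single-core baseline.
double speedup(double cpu_ms, double gpu_ms) {
    return cpu_ms / gpu_ms;
}
```

A CUDA kernel would typically assign one output pixel to each thread and be timed with CUDA events (`cudaEventRecord`, `cudaEventElapsedTime`). Its output can be checked against `conv2d_cpu` before the two timings are passed to `speedup`. Whether host-to-device transfer time counts toward the GPU figure is a choice worth stating alongside the reported speed-up.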
