5/5 - (1 vote)

1 Matrix Multiplication

Name: [Solved] EE451 Homework5- Matrix Multiplication
Brand: Assignment Chef
SKU: [Solved] EE451 Homework5- Matrix Multiplication
Price: 25 USD
Availability: InStock
Rating: 5 (1 reviews)

In the lecture and discussion, we discussed two approaches to compute matrix multiplication (C = AB) using CUDA: (1) unoptimized implementation using global memory only and (2) block matrix multiplication using shared memory.

In this assignment, your task is implementing 1024 1024 matrix multiplication using these two approaches.

Approach 1 (unoptimized implementation using global memory only):
- Name this program as p1.cu
- The value of each element of A is 1
- The value of each element of B is 2
- Thread block configuration: 16 16
- Grid configuration: 64 64
- After computation, print the value of C[451][451]
Approach 2 (block matrix multiplication using shared memory):
- Name this program as p2.cu
- The value of each element of A is 1
- The value of each element of B is 2
- Thread block configuration: 32 32
- Grid configuration: 32 32
- More details of this algorithm can be found in the paper Matrix Multiplication with CUDA under the Readings category of blackboard.
- After computation, print the value of C[451][451]
Report: measure the execution time of the kernel of Approach 1 and Approach 2, respectively. Briefly discuss your observations.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Whatsapp Us

[Solved] EE451 Homework5- Matrix Multiplication

1 Matrix Multiplication

Reviews

Related products

[Solved] EE451 Homework2-Example Program

[Solved] EE451 Homework6- K-means Clustering

[Solved] EE451 Homework1- Matrix Multiplication

[Solved] EE451 Homework3-parallelize a serial program

[Solved] EE451 Homework4- Pass Message in a Ring