# Day 5

The focus of **Day 5** is **cuBLAS Matrix Multiplication**

---

## Code Descriptions

### `cublas_matmul.cu` (Matrix Multiplication using cuBLAS)
- Implements **matrix multiplication** using the cuBLAS `cublasSgemm` operation.
- Avoids explicit `cudaMemcpy` by utilizing `cublasSetMatrix` and `cublasGetMatrix`.
- Uses the **cuBLAS API** for optimized **GPU-accelerated BLAS operations**.
- Performs **C = A × B** efficiently with `cublasSgemm`.

## Profiling and Running

To compile and run the CUDA program, use:

```sh
nvcc -o cublas_matmul cublas_matmul.cu -lcublas -lcurand
./cublas_matmul