Click here to Skip to main content
15,884,628 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
See more:
Multiplying two Matrices A[4x3] and B[3x4].
Resultant Matrix C[4x4].

no of parallel operations per block is 4.
Therefore,
THREADS_PER_BLOCK = 4

XML
dim3 dimBlock(THREADS_PER_BLOCK, THREADS_PER_BLOCK);
dim3 dimGrid(B.width/dimBlock.x, A.height/dimBlock.y);
MatrixMultKernel<<<dimGrid, dimBlock>>>(d_A, d_B, d_C);


CUDA reports "invalid configuration error". need help in this regard.
Posted
Updated 3-Apr-10 8:44am
v2

Using CUBLAS we can multiply two arbitrarily sized Matrices.
For more details CUBLAS[^]
 
Share this answer
 
v2
Comments
Addy Tas 13-Jan-12 17:07pm    
Seems a bit late but i figured; while I'm reading it, why not fix the link.
Cheers, AT
try this link[^]

Hope it helps
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900