Submission 19465 - Judge Duck Online

User	Problem	Status	Score	Time	Memory	Language	Code length
Conical	mmmd1k. Test your double-precision matrix multiplication-1k	Compile Error	0	0 ns	0 KB	C++11	1.62 KB

Submission time	Judging time
2023-05-12 16:09:06	2023-05-12 16:09:06

Code

const char* sgemm_desc = "Simple blocked sgemm.";

#if !defined(BLOCK_SIZE)
#define BLOCK_SIZE 41
#endif

#define min(a,b) (((a)<(b))?(a):(b))

/* This auxiliary subroutine performs a smaller dgemm (double-precision) operation
 *  C := C + A * B
 * where C is M-by-N, A is M-by-K, and B is K-by-N. */
static void do_block (int lda, int M, int N, int K, double* A, double* B, double* C)
{
    /* For each row i of A */
    for (int i = 0; i < M; ++i)
        /* For each column j of B */ 
        for (int j = 0; j < N; ++j) 
        {
            /* Compute C(i,j) */
            /* Accumulate in double: a float accumulator would discard precision */
            double cij = C[i+j*lda];
            for (int k = 0; k < K; ++k)
                cij += A[i+k*lda] * B[k+j*lda];
            C[i+j*lda] = cij;
        }
}

/* This routine performs a dgemm (double-precision) operation
 *  C := C + A * B
 * where A, B, and C are lda-by-lda matrices stored in column-major format. 
 * On exit, A and B maintain their input values. */  
void matrix_multiply (int lda, double* A, double* B, double* C)
{
    /* For each block-row of A */ 
    for (int i = 0; i < lda; i += BLOCK_SIZE)
        /* For each block-column of B */
        for (int j = 0; j < lda; j += BLOCK_SIZE)
            /* Accumulate block dgemms into block of C */
            for (int k = 0; k < lda; k += BLOCK_SIZE)
            {
                /* Correct block dimensions if block "goes off edge of" the matrix */
                int M = min (BLOCK_SIZE, lda-i);
                int N = min (BLOCK_SIZE, lda-j);
                int K = min (BLOCK_SIZE, lda-k);

                /* Perform individual block dgemm */
                do_block(lda, M, N, K, A + i + k*lda, B + k + j*lda, C + i + j*lda);
            }
}

Judging result

Compilation

N/A

Compile Error

Score: N/A

/usr/bin/ld: jp_data/tasks/43cf4d50f09c11ed953b9d510784bac5/lib.o: in function `main':
lib.cpp:(.text.startup+0x3b): undefined reference to `matrix_multiply(int, double const*, double const*, double*)'
collect2: error: ld returned 1 exit status
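
The linker message names the symbol it was looking for: `matrix_multiply(int, double const*, double const*, double*)`. The submitted definition takes `double* A, double* B` without `const`, and in C++ the parameter types are part of the mangled symbol name, so the two do not match. The following is a minimal sketch of a definition whose signature matches that message. It assumes the judge's `lib.cpp` declares the function exactly as the error text shows (the harness source is not part of this record). `kBlockSize` and `matches_naive` are hypothetical names introduced here for illustration only.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Same value as BLOCK_SIZE in the submission; the name is hypothetical.
constexpr int kBlockSize = 41;

// C := C + A * B on one block. All matrices are column-major with
// leading dimension lda. Inputs are const so const pointers can be passed in.
static void do_block(int lda, int M, int N, int K,
                     const double* A, const double* B, double* C)
{
    for (int i = 0; i < M; ++i)
        for (int j = 0; j < N; ++j) {
            double cij = C[i + j * lda];  // double accumulator
            for (int k = 0; k < K; ++k)
                cij += A[i + k * lda] * B[k + j * lda];
            C[i + j * lda] = cij;
        }
}

// Signature taken from the linker message: both inputs are const double*.
void matrix_multiply(int lda, const double* A, const double* B, double* C)
{
    for (int i = 0; i < lda; i += kBlockSize)
        for (int j = 0; j < lda; j += kBlockSize)
            for (int k = 0; k < lda; k += kBlockSize) {
                // Shrink the block where it would run past the matrix edge.
                int M = std::min(kBlockSize, lda - i);
                int N = std::min(kBlockSize, lda - j);
                int K = std::min(kBlockSize, lda - k);
                do_block(lda, M, N, K,
                         A + i + k * lda, B + k + j * lda, C + i + j * lda);
            }
}

// Hypothetical self-check, not part of the judge: compares the blocked
// result against a plain triple loop on an n-by-n test matrix.
bool matches_naive(int n)
{
    assert(n > 0);
    std::vector<double> A(n * n), B(n * n), C(n * n, 0.0), R(n * n, 0.0);
    for (int idx = 0; idx < n * n; ++idx) {
        A[idx] = 0.5 * idx + 1.0;
        B[idx] = 1.0 / (idx + 1);
    }
    matrix_multiply(n, A.data(), B.data(), C.data());
    for (int j = 0; j < n; ++j)
        for (int k = 0; k < n; ++k)
            for (int i = 0; i < n; ++i)
                R[i + j * n] += A[i + k * n] * B[k + j * n];
    for (int idx = 0; idx < n * n; ++idx)
        if (std::fabs(C[idx] - R[idx]) > 1e-9 * (1.0 + std::fabs(R[idx])))
            return false;
    return true;
}
```

Calling `matches_naive(50)` exercises the edge-block handling, since 50 is not a multiple of the block size 41. Whether this version would pass the judge's time limit is not something the record above can confirm; the sketch only addresses the link error.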