Single point of access to the national research repositories

Subject: HPC, CPU, Cache, Memory, Superlinear Speedup

Year: 2013

Type: Proceeding article

Title: Intel vs AMD: Matrix Multiplication Performance

Author: Anchev, Nenad
Author: Gushev, Marjan
Author: Ristov, Sashko
Author: Atanasovski, Blagoj

Abstract: Matrix-Matrix multiplication (MMM) is widely used algorithm in today’s computations and researches. Many techniques exist to speed up its execution. In this paper, we analyze the performance of MMM varying matrix size in order to determine its behavior and the region where it provides the best performance. We also determine the best speedup and efficiency in parallel implementation for different CPU architectures since cache architecture and organization is very important for MMM performance. Intel i7 and AMD Opteron CPUs are used as an environment. Several achieved results are expected, but there are also many unexpected. Superlinear speedup (speedup greater than the number of used threads) and the efficiency greater than 100% are achieved for each parallel implementation only on AMD Opteron. We observe regions with performance discrepancy for all three parameters for both CPUs.

Publisher: IEEE

Relation: 2013 36th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

Identifier: oai:repository.ukim.mk:20.500.12188/17251
Identifier: http://hdl.handle.net/20.500.12188/17251

Title	Date	Views
Intel vs AMD: Matrix Multiplication Performance	2013	19