Subject: HPC, CPU, Cache, Memory, Superlinear Speedup
Year: 2013
Type: Proceeding article
Title: Intel vs AMD: Matrix Multiplication Performance
Author: Anchev, Nenad
Author: Gushev, Marjan
Author: Ristov, Sashko
Author: Atanasovski, Blagoj
Abstract: Matrix-Matrix multiplication (MMM) is widely used algorithm in today’s computations and researches. Many techniques exist to speed up its execution. In this paper, we analyze the performance of MMM varying matrix size in order to determine its behavior and the region where it provides the best performance. We also determine the best speedup and efficiency in parallel implementation for different CPU architectures since cache architecture and organization is very important for MMM performance. Intel i7 and AMD Opteron CPUs are used as an environment. Several achieved results are expected, but there are also many unexpected. Superlinear speedup (speedup greater than the number of used threads) and the efficiency greater than 100% are achieved for each parallel implementation only on AMD Opteron. We observe regions with performance discrepancy for all three parameters for both CPUs.
Publisher: IEEE
Relation: 2013 36th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)
Identifier: oai:repository.ukim.mk:20.500.12188/17251
Identifier: http://hdl.handle.net/20.500.12188/17251