Recent progress with the top to bottom approach to vectorization in GeantV

Guilherme Amadio; Ananya; John Apostolakis; Marilena Bandieramonte; Shiba Behera; Abhijit Bhattacharyya; René Brun; Philippe Canal; Federico Carminati; Gabriele Cosmo; Vitaliy Drohan; Daniel Elvira; Krzysztof Genser; Andrei Gheata; Gheata Mi-haela; Ilias Goulas; Farah Hariri; Vladimir Ivanchenko; Przemislaw Karpinski; Gulrukh Khattak; Dmitri Konstantinov; Harphool Kumawat; Guilherme Lima; Jesús Martínez Castro; Patricia Mendez; Aldo Miranda Aguillar; Katalin Nikolics; Mihaly Novak; Elena Orlova; Kevin Pedro; Witold Pokorski; Alberto Ribon; Dmitry Savin; Ryan Schmitz; Raman Sehgal; Oksana Shadura; Shruti Sharan; Sofia Vallecorsa; Sandro Wenzel; Soon Yung Jun

doi:10.1051/epjconf/201921402007

Proceedings

Open Access

EPJ Web of Conferences 214, 02007 (2019)
https://doi.org/10.1051/epjconf/201921402007

Recent progress with the top to bottom approach to vectorization in GeantV

Guilherme Amadio¹, Ananya, John Apostolakis¹, Marilena Bandieramonte¹^,2, Shiba Behera³, Abhijit Bhattacharyya³, René Brun¹, Philippe Canal⁴, Federico Carminati¹, Gabriele Cosmo¹, Vitaliy Drohan, Daniel Elvira⁴, Krzysztof Genser⁴, Andrei Gheata¹^*, Gheata Mi-haela¹^,5, Ilias Goulas¹, Farah Hariri¹, Vladimir Ivanchenko¹^,6, Przemislaw Karpinski, Gulrukh Khattak¹, Dmitri Konstantinov¹^,8, Harphool Kumawat³, Guilherme Lima⁴, Jesús Martínez Castro⁷, Patricia Mendez¹, Aldo Miranda Aguillar⁷, Katalin Nikolics¹, Mihaly Novak¹, Elena Orlova, Kevin Pedro⁴, Witold Pokorski¹, Alberto Ribon¹, Dmitry Savin, Ryan Schmitz, Raman Sehgal³, Oksana Shadura¹, Shruti Sharan, Sofia Vallecorsa¹, Sandro Wenzel¹ and Soon Yung Jun⁴

¹ European Organization for Nuclear Research (CERN), Switzerland
² University of Pittsburgh, 4200 Fifth Avenue, Pittsburgh, PA 15260, USA
³ Bhabha Atomic Research Centre (BARC), India
⁴ Fermi National Accelerator Laboratory (FNAL), USA
⁵ Institute of Space Science (ISS), Romania
⁶ Tomsk State University, Tomsk, Russia
⁷ Centro de Investigación en Computación (CIC-IPN), Mexico
⁸ NRC Kurchatov Institute (IHEP) Protvino, Russia

^* e-mail: andrei.gheata@cern.ch

Published online: 17 September 2019

Abstract

SIMD acceleration can potentially boost by factors the application throughput. Achieving efficient SIMD vectorization for scalar code with complex data flow and branching logic, goes however way beyond breaking some loop dependencies and relying on the compiler. Since the refactoring effort scales with the number of lines of code, it is important to understand what kind of performance gains can be expected in such complex cases. We started to investigate a couple of years ago a top to bottom vectorization approach to particle transport simulation. Percolating vector data to algorithms was mandatory since not all the components can internally vectorize. Vectorizing low-level algorithms is certainly necessary, but not sufficient to achieve relevant SIMD gains. In addition, the overheads for maintaining the concurrent vector data flow and copy data have to be minimized. In the context of a vectorization R&D for simulation we developed a framework to allow different categories of scalar and vectorized components to co-exist, dealing with data flow management and real-time heuristic optimizations. The paper describes our approach on coordinating SIMD vectorization at framework level, making a detailed quantitative analysis of the SIMD gain versus overheads, with a breakdown by components in terms of geometry, physics and magnetic field propagation. We also present the more general context of this R&D work and goals for 2018.

© The Authors, published by EDP Sciences, 2019

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.