Copyright and Licensing
Articles accepted for publication will be licensed under the Creative Commons BY-NC-SA. Authors must sign a non-exclusive distribution agreement after article acceptance.
Bioinformatics and Computational Biology are disciplines that have used GPUs for over two decades to accelerate data processing in computational applications, where CUDA has been the most utilized programming language. However, the exclusive use of CUDA presents a portability issue, as it is only compatible with NVIDIA GPUs and not with other heterogeneous architectures, such as AMD or Intel GPUs, or any other type of accelerator. To address this portability challenge, the Khronos Group recently introduced the SYCL standard, a multi-platform programming model that offers a high-level programming interface. This standard facilitates the development of portable applications that can efficiently leverage the capabilities of different hardware devices, such as NVIDIA, Intel, and AMD GPUs, without needing significant modifications to the source code. Therefore, this thesis set out as its general objective to evaluate the feasibility of SYCL as a unified, portable, and efficient heterogeneous programming model for the design and development of computationally demanding applications on heterogeneous GPU-based systems, specifically in the field of bioinformatics.
Initially, a detailed investigation was conducted about heterogeneous programming models, performance metrics, and bioinformatics concepts, in order to establish the theoretical foundations of this thesis. Then, the SW# suite was chosen as the case study, as it represents a clear example of a CUDA-based bioinformatics application for biological sequence alignment. Using the SYCLomatic tool, a complete migration of the CUDA code to SYCL was carried out, which involved modifying the generated code and solving runtime errors. Additionally, the functionality was verified, optimizations were applied, and the resulting SYCL code was standardized to be compatible with other SYCL implementations.
Subsequently, multiple experiments were conducted to evaluate the functionality and performance portability of the software migrated to SYCL. These experiments involved running the application on a wide variety of HPC platforms, including different CPUs and GPUs from various manufacturers. The results obtained demonstrated performance comparable to CUDA in most configurations, confirming the effectiveness of SYCL. Moreover, good performance portability across platforms was observed due to SYCL's ability to run on various hardware combinations. Furthermore, performance remained consistent when switching SYCL implementations. In conclusion, this study demonstrates that SYCL is a viable alternative as a unified, portable, and efficient programming model in the context of heterogeneous computing with GPUs for bioinformatics applications. The findings of this study lay the groundwork for the transition of legacy applications and the development of new solutions leveraging the capabilities of SYCL.
M. Costanzo, E. Rucci, C. García-Sanchez, M. Naiouf, and M. Prieto-Matías, «Assessing opportunities of SYCL for biological sequence alignment on GPU-based systems», J Supercomput, feb. 2024, doi: 10.1007/s11227-024-05907-2.
M. Costanzo, E. Rucci, C. García-Sánchez, M. Naiouf, and M. Prieto-Matías, «Migrating CUDA to oneAPI: A Smith-Waterman Case Study», en Bioinformatics and Biomedical Engineering. IWBBIO 2022. Lecture Notes in Computer Science., I. Rojas, O. Valenzuela, F. Rojas, L. J. Herrera, y F. Ortuño, Eds., in Lecture Notes in Computer Science, vol. 13347. Cham: Springer International Publishing, jun. 2022, pp. 103-116. doi: 10.1007/978- 3-031-07802-6_9.
M. Costanzo, E. Rucci, C. García-Sánchez, M. Naiouf, and M. Prieto-Matías, «Comparing Performance and Portability Between CUDA and SYCL for Protein Database Search on NVIDIA, AMD, and Intel GPUs», in 2023 IEEE 35th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Porto Alegre, Brasil: IEEE, oct. 2023, pp. 141-148. doi: 10.1109/SBAC-PAD59825.2023.00023.
M. Costanzo, E. Rucci, C. García-Sánchez, and M. Naiouf, «Brief performance portability analysis of a matrix multiplication kernel on multiple vendor GPUs», en Short Papers of the 11th Conference on Cloud Computing Conference, Big Data & Emerging Topics (JCC-BD&ET 2023), jul. 2023, pp. 13-18. [Online]. Available at: http://sedici.unlp.edu.ar/handle/10915/155420
M. Costanzo, E. Rucci, C. García-Sánchez, and M. Naiouf, «Early Experiences Migrating CUDA codes to oneAPI», in Short papers of the 9th Conference on Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET 2021), jul. 2021, pp. 14-18. [Online]. Available: http://sedici.unlp.edu.ar/handle/10915/125138
Copyright (c) 2024 Manuel Costanzo
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Articles accepted for publication will be licensed under the Creative Commons BY-NC-SA. Authors must sign a non-exclusive distribution agreement after article acceptance.
Review Stats:
Mean Time to First Response: 89 days
Mean Time to Acceptance Response: 114 days
Member of:
ISSN
1666-6038 (Online)
1666-6046 (Print)