1. Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, and Leonel Sousa (2021). Fourth-Order Exhaustive Epistasis Detection for the xPU Era. 50th International Conference on Parallel Processing. Association for Computing Machinery, New York, NY, USA, Article 27, 1–10.
DOI: https://doi.org/10.1145/3472456.3472509.
2. Sunidhi Dhandhania, Akshay Deodhar, Konstantin Pogorelov, Swarnendu Biswas and Johannes Langguth (2021). Explaining the Performance of Supervised and Semi-Supervised Methods for Automated Sparse Matrix Format Selection, 50th International Conference on Parallel Processing Workshop, pp. 1-10.
DOI: https://doi.org/10.1145/3458744.3474049.
3. Diogo Marques, Aleksandar Ilic, and Leonel Sousa (2021). Mansard Roofline Model: Reinforcing the Accuracy of the Roofs. ACM Trans. Model. Perform. Eval. Comput. Syst. 6, 2, Article 7.
DOI: https://doi.org/10.1145/3475866.
4. Luk Burchard, Johannes Moe, Daniel Thilo Schroeder, Konstantin Pogorelov, and Johannes Langguth (2021). iPUG: Accelerating Breadth-First Graph Traversals Using Manycore Graphcore IPUs. High Performance Computing. ISC High Performance 2021. Lecture Notes in Computer Science, vol 12728. Springer, Cham.
DOI: https://doi.org/10.1007/978-3-030-78713-4_16.
5. Amro Alabsi Aljundi, Taha Atahan Akyildiz, and Kamer Kaya (2021). Boosting Graph Embedding on a Single GPU. IEEE Transactions on Parallel and Distributed Systems, vol. 33, no. 11, pp. 3092-3105.
DOI: https://doi.org/10.1109/TPDS.2021.3129617.
6. Luk Burchard, Xing Cai, and Johannes Langguth (2021). iPUG for Multiple Graphcore IPUs: Optimizing Performance and Scalability of Parallel Breadth-First Search. IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC), pp. 162-171.
DOI: https://doi.org/10.1109/HiPC53243.2021.00030.
7. Diogo Marques, Rafael Campos, Sergio Santander-Jiménez, Zakhar Matveev, Leonel Sousa, and Aleksandar Ilic (2022). Unlocking Personalized Healthcare on Modern CPUs/GPUs: Three-way Gene Interaction Study. 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 146-156.
DOI: https://doi.org/10.1109/IPDPS53621.2022.00023.
8. Amro Alabsi Aljundi, Taha Atahan Akyıldız, and Kamer Kaya (2022). Degree-Aware Kernels for Computing Jaccard Weights on GPUs. 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 897-907.
DOI: http://doi.org/10.1109/IPDPS53621.2022.00092.
9. Gökhan Göktürk and Kamer Kaya (2022). Fast and High-Quality Influence Maximization on Multiple GPUs. 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 897-907.
DOI: http://doi.org/10.1109/IPDPS53621.2022.00093.
10. Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, and Leonel Sousa (2022). Tensor-Accelerated Fourth-Order Epistasis Detection on GPUs. 51st International Conference on Parallel Processing (ICPP ’22), August 29-September 1, 2022, Bordeaux, France. ACM, New York, NY, USA, 11 pages.
DOI: https://doi.org/10.1145/3545008.3545066.
11. Erhan Tezcan, Tugba Torun, Fahrican Koşar, Kamer Kaya, and Didem Unat (2022). Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision Selection. 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Bordeaux, France, 2022, pp. 31-40. DOI: 10.1109/SBAC-PAD55451.2022.00014.
12. Andreas Thune, Sven-Arne Reinemo, Tor Skeie and Xing Cai (2023). Detailed Modeling of Heterogeneous and Contention-Constrained Point-to-Point MPI Communication. IEEE Transactions on Parallel and Distributed Systems. DOI: 10.1109/TPDS.2023.3253881.
13. Luk Burchard, Kristian Gregorius Hustad, Johannes Langguth and Xing Cai (2023). Enabling Unstructured-Mesh Computation on Massively Tiled AI-Processors: An Example of Accelerating In-Silico Cardiac Simulation. Front. Phys. Sec. Statistical and Computational Physics.
DOI: 10.3389/fphy.2023.979699.
14. Muhammad Aditya Sasongko, Milind Chabbi, Paul H. J. Kelly and Didem Unat (2023). Precise event sampling-based data locality tools for AMD multicore architectures. Concurrency Computat Pract Exper.
DOI: 10.1002/cpe.7707.
15. Sergej Breiter, Josef Weidendorfer, Minh Thanh Chung and Karl Fürlinger (2023). A Profiling-Based Approach to Cache Partitioning of Program Data. Parallel and Distributed Computing, Applications and Technologies. PDCAT 2022. Lecture Notes in Computer Science, vol 13798. Springer, Cham.
DOI: 10.1007/978-3-031-29927-8_35.
16. Alexandre Rodrigues, Leonel Sousa and Alexandar Ilic (2023). Performance Modelling-driven Optimization of RISC-V Hardware for Efficient SpMV. Proceedings of the International workshop on RISC-V for HPC. ISC High Performance Conference, May 2023.
DOI: 10.1007/978-3-031-40843-4_36.
17. Miguel Graça, Diogo Marques, Sergio Santander-Jiménez, Leonel Sousa, and Aleksandar Ilic. “Interpreting High Order Epistasis Using Sparse Transformers.” In 2023 IEEE/ACM Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE), pp. 114-125. IEEE, 2023.
DOI: 10.1145/3580252.3586982.
18. Alexandre Rodrigues, Leonel Sousa and Aleksandar Ilic . “A Performance Modelling-driven Approach to Hardware Resource Scaling”, In the 3rd International Workshop on Performance and Energy-efficiency in Concurrent and Distributed Systems (PECS/EuroPar), 2023 (to be published).
19. Afonso Coutinho, Diogo Marques, Leonel Sousa and Aleksandar Ilic. “Sparse-aware CARM: Rooflining locality of sparse computations”, In the 1st International Workshop on Tools for Data Locality, Power and Performance (TDLPP/EuroPar), 2023 (to be published).