Towards Iterative Relational Algebra on the GPU Publications 2023 USENIX Annual Technical Conference (USENIX ATC 23)
Dask-Extended External Tasks for HPC/ML in Transit Workflows Publications SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis
LM4HPC: Towards Effective Language Model Application in High-Performance Computing Publications IWOMP 2023: OpenMP: Advanced Task-Based, Device and Compiler Programming
TrainBF: High-Performance DNN Training Engine Using BFloat16 on AI Accelerators Publications Euro-Par 2023: Parallel Processing
A Survey of Techniques for Optimizing Transformer Inference Publications Journal of Systems Architecture