Dask-Extended External Tasks for HPC/ML in Transit Workflows Publications SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis
LM4HPC: Towards Effective Language Model Application in High-Performance Computing Publications IWOMP 2023: OpenMP: Advanced Task-Based, Device and Compiler Programming
TrainBF: High-Performance DNN Training Engine Using BFloat16 on AI Accelerators Publications Euro-Par 2023: Parallel Processing
A Survey of Techniques for Optimizing Transformer Inference Publications Journal of Systems Architecture
Differentiable Neural Architecture, Mixed Precision and Accelerator Co-Search Publications IEEE Access
HPC-GPT: Integrating Large Language Model for High-Performance Computing Publications SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis
Characterizing the Performance of Triangle Counting on Graphcore's IPU Architecture Publications SC-W '23: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis
Pynta─An Automated Workflow for Calculation of Surface and Gas–Surface Kinetics Publications Journal of Chemical Information and Modeling