WActiGrad: Structured Pruning for Efficient Finetuning and Inference of Large Language Models on AI Accelerators Publications Euro-Par 2024: Parallel Processing
Portable Cross-Facility Workflows for X-Ray Ptychography Publications SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
Equilibrium and Nonequilibrium Ensemble Methods for Accurate, Precise and Reproducible Absolute Binding Free Energy Calculations Publications Journal of Chemical Theory and Computation
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators Publications SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
Comet: Development of an Unstructured Mesh Direct Simulation Monte Carlo Code on GPUs Publications AIAA Aviation Forum and Ascend 2024
Nonequilibrium Dynamics of Electron Emission from Cold and Hot Graphene Under Proton Irradiation Publications Nano Letters
Acceleration of the Particle-in-Cell Code OSIRIS with Graphics Processing Units Publications Journal of Plasma Physics
GPU-Accelerated Solution of the Bethe–Salpeter Equation for Large and Heterogeneous Systems Publications Journal of Chemical Theory and Computation
The Plasmodium falciparum NCR1 Transporter Is an Antimalarial Target that Exports Cholesterol from the Parasite’s Plasma Membrane Publications Science Advances
Cryo-EM Structures of the Human Band 3 Transporter Indicate a Transport Mechanism Involving the Coupled Movement of Chloride and Bicarbonate Ions Publications PLOS Biology