This research proposes to develop machine learning-based autotuning techniques for collective communication algorithm selection. Building on existing dynamic pipelining runtime infrastructure, the work will focus on creating performance models that can recommend suitable collective algorithms for common communication patterns and message sizes. The investigation will primarily center on offline autotuning through systematic benchmarking on representative HPC systems, generating training data to guide algorithm selection. The research will explore how historical performance profiles can inform collective operation decisions at runtime, with the goal of improving performance for typical scientific applications without requiring manual configuration by end users.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Intern
Education Level
No Education Listed