BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Denver
X-LIC-LOCATION:America/Denver
BEGIN:DAYLIGHT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
TZNAME:MDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
TZNAME:MST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260422T000712Z
LOCATION:507
DTSTART;TZID=America/Denver:20231113T120000
DTEND;TZID=America/Denver:20231113T122000
UID:submissions.supercomputing.org_SC23_sess444_ws_waccpd108@linklings.com
SUMMARY:Analysis of MURaM – A Solar Physics Application, for Scalability, 
 Performance, and Portability
DESCRIPTION:Eric F. Wright (University of Delaware), Cena Brown (National 
 Center for Atmospheric Research (NCAR)), Damien Przybylski (MPS Solar Syst
 em Research), Matthias Rempel and Supreeth Suresh (National Center for Atm
 ospheric Research (NCAR)), and Sunita Chandrasekaran (University of Delawa
 re)\n\nWith the advent of GPUs in parallel computing, several languages,
  tools, and compilers are being developed. Many impactful applications
  can benefit from the performance capabilities these GPUs provide, but
  moving large, complex code bases to GPU execution often poses many
  hurdles and growing pains as developers adapt to unfamiliar programming
  models and interface with increasingly complex but powerful hardware.
  Our work discusses experi
 ences using OpenACC to bring GPU acceleration to MURaM, a state-of-the-art
  solar physics application, including various problems we have explored an
 d overcome to bring better performance portability to the code within the 
 limitations of the programming model. We then provide scaling results and
  findings from transitioning to current-generation GPU architectures,
  with strong and weak scaling on up to 512 NVIDIA A100 GPUs, observing
  one A100 GPU as comparable to 90-100 CPU cores, and GPUs scaling much
  further than the CPU runs are capable of.\n\nTag: Accelerators,
  Artificial Intelligence/Machine Learning, Algorithms, Applications,
  Compilers, Data Movement and Memory, H
 eterogeneous Computing, Modeling and Simulation, Performance Optimization,
  Programming Frameworks and System Software, Runtime Systems\n\nRegistrati
 on Category: Workshop Reg Pass\n\nSession Chairs: Maciej Cytowski (Pawsey 
 Supercomputing Research Centre; Commonwealth Scientific and Industrial Res
 earch Organisation (CSIRO), Australia); Verónica G. Melesse Vergara (Oak R
 idge National Laboratory (ORNL)); and Jose Manuel Monsalve Diaz (Advanced 
 Micro Devices (AMD))\n\n
END:VEVENT
END:VCALENDAR
