BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Denver
X-LIC-LOCATION:America/Denver
BEGIN:DAYLIGHT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
TZNAME:MDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
TZNAME:MST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260422T000712Z
LOCATION:605
DTSTART;TZID=America/Denver:20231113T092200
DTEND;TZID=America/Denver:20231113T094100
UID:submissions.supercomputing.org_SC23_sess441_ws_p3hpc128@linklings.com
SUMMARY:Performance Evaluation of Heterogeneous GPU Programming Frameworks
  for Hemodynamic Simulations
DESCRIPTION:Aristotle Martin (Duke University); Geng Liu (Argonne National
  Laboratory (ANL)); William Ladd (Duke University); Seyong Lee, John Gounl
 ey, and Jeffrey Vetter (Oak Ridge National Laboratory (ORNL)); Saumil Pate
 l, Silvio Rizzi, Victor Mateevitsi, and Joseph Insley (Argonne National La
 boratory (ANL)); and Amanda Randles (Duke University)\n\nPreparing for the
  deployment of large scientific and engineering codes on upcoming exascale
  systems with GPU-dense nodes is made challenging by the unprecedented div
 ersity of device architectures and heterogeneous programming models. In th
 is work, we evaluate the process of porting a massively parallel, fluid dy
 namics code written in CUDA to SYCL, HIP, and Kokkos with a range of backe
 nds, using a combination of automated tools and manual tuning. We use a pr
 oxy application along with a custom performance model to inform the result
 s and identify additional optimization strategies. At scale performance of
  the programming model implementations are evaluated on pre-production GPU
  node architectures for Frontier and Aurora, as well as on current NVIDIA 
 device-based systems Summit and Polaris. Real-world workloads representing
  3D blood flow calculations in complex vasculature are assessed. Our analy
 sis highlights critical trade-offs between code performance, portability, 
 and development time.\n\nTag: Performance Measurement, Modeling, and Tools
 , Performance Optimization\n\nRegistration Category: Workshop Reg Pass\n\n
 Session Chairs: Judith C. Hill (Lawrence Livermore National Laboratory (LL
 NL)), CJ Newburn (NVIDIA Corporation), Scott J. Parker (Argonne National L
 aboratory (ANL)), and John Pennycook (Intel Corporation)\n\n
END:VEVENT
END:VCALENDAR
