BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Denver
X-LIC-LOCATION:America/Denver
BEGIN:DAYLIGHT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
TZNAME:MDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
TZNAME:MST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260422T000712Z
LOCATION:401-402
DTSTART;TZID=America/Denver:20231114T160000
DTEND;TZID=America/Denver:20231114T163000
UID:submissions.supercomputing.org_SC23_sess179_pap501@linklings.com
SUMMARY:HPAC-Offload: Accelerating HPC Applications with Portable Approxim
 ate Computing on the GPU
DESCRIPTION:Zane Fink (Lawrence Livermore National Laboratory (LLNL), Univ
 ersity of Illinois) and Konstantinos Parasyris, Giorgis Georgakoudis, and 
 Harshitha Menon (Lawrence Livermore National Laboratory (LLNL))\n\nThe end
  of Dennard scaling and the slowdown of Moore's law led to a shift in tech
 nology trends toward parallel architectures, particularly in HPC systems. 
 To continue providing performance benefits, HPC should embrace Approximate
  Computing (AC), which trades application quality loss for improved perfor
 mance. However, existing AC techniques have not been extensively applied a
 nd evaluated in state-of-the-art hardware architectures such as GPUs, the 
 primary execution vehicle for HPC applications today.\n\nThis paper presen
 ts HPAC-Offload, a pragma-based programming model that extends OpenMP offl
 oad applications to support AC techniques, allowing portable approximation
 s across different GPU architectures. We conduct a comprehensive performan
 ce analysis of HPAC-Offload across GPU-accelerated HPC applications, revea
 ling that AC techniques can significantly accelerate HPC applications (1.6
 4x LULESH on AMD, 1.57x NVIDIA) with minimal quality loss (0.1%). Our anal
 ysis offers deep insights into the performance of GPU-based AC that guide 
 the future development of AC algorithms and systems for these architecture
 s.\n\nTag: Accelerators, Distributed Computing, Middleware and System Soft
 ware, Performance Measurement, Modeling, and Tools, Post-Moore Computing\n
 \nRegistration Category: Tech Program Reg Pass\n\nAward Finalist: Best Pap
 er Finalist\n\nReproducibility Badges: Artifact Available, Artifact Functi
 onal, Results Reproduced\n\nSession Chair: Hari Subramoni (The Ohio State 
 University)\n\n
END:VEVENT
END:VCALENDAR
