BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Denver
X-LIC-LOCATION:America/Denver
BEGIN:DAYLIGHT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
TZNAME:MDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
TZNAME:MST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260422T000712Z
LOCATION:DEF Concourse
DTSTART;TZID=America/Denver:20231114T100000
DTEND;TZID=America/Denver:20231114T170000
UID:submissions.supercomputing.org_SC23_sess291_rpost166@linklings.com
SUMMARY:The Impact of Process Topology on RMA Programming Models:  A Study
  on NERSC Perlmutter
DESCRIPTION:Nikodemos Koutsoheras (Pacific Northwest National Laboratory (
 PNNL)), Sayan Ghosh (University of Maryland), Nathan Tallent and Joshua Su
 etterlein (Pacific Northwest National Laboratory (PNNL)), and Abhinav Bhat
 ele (University of Maryland)\n\nRemote Memory Access (RMA) provides an alt
 ernate mechanism for data movement by separating communication with synchr
 onization, exposing remote memory access features via one-sided communicat
 ion semantics to a global address space. Performance of the most popular a
 synchronous RMA interfaces like MPI RMA and SHMEM has steadily improved ov
 er the past years due to better software/hardware support from the vendors
  and community-driven programming model standardization efforts. \n    \nC
 urrent RMA benchmarking efforts are mostly focused on investigating elemen
 tary data movement overheads between a process-pair within and across node
 s, not considering a specific process topology. Distributed-memory applica
 tions on the other hand must deal with overlapped data distributions, whic
 h governs the underlying topology of the processes. We discuss the perform
 ance of SHMEM and MPI RMA (in comparison with MPI point-to-point) for grid
  and graph process topologies on NERSC Perlmutter supercomputer, demonstra
 ting average and 99th percentile latencies.\n\nTag: Artificial Intelligenc
 e/Machine Learning, Architecture and Networks, Heterogeneous Computing, I/
 O and File Systems, Performance Measurement, Modeling, and Tools, Post-Moo
 re Computing, Programming Frameworks and System Software, Quantum Computin
 g\n\nRegistration Category: Tech Program Reg Pass, Exhibits Reg Pass\n\n
END:VEVENT
END:VCALENDAR
