BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Denver
X-LIC-LOCATION:America/Denver
BEGIN:DAYLIGHT
TZOFFSETFROM:-0700
TZOFFSETTO:-0600
TZNAME:MDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0600
TZOFFSETTO:-0700
TZNAME:MST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260422T000713Z
LOCATION:301-302-303
DTSTART;TZID=America/Denver:20231114T110000
DTEND;TZID=America/Denver:20231114T113000
UID:submissions.supercomputing.org_SC23_sess172_pap241@linklings.com
SUMMARY:Itoyori: Reconciling Global Address Space and Global Fork-Join Tas
 k Parallelism
DESCRIPTION:Shumpei Shiina and Kenjiro Taura (University of Tokyo)\n\nThis
  paper introduces Itoyori, a task-parallel runtime system designed to tack
 le the challenge of scaling task parallelism (more specifically, nested fo
 rk-join parallelism) beyond a single node. The partitioned global address 
 space (PGAS) model is often employed in task-parallel systems, but naively
  combining them can lead to poor performance due to fine-grained and redun
 dant remote memory accesses. Itoyori addresses this issue by automatically
  caching global memory accesses at runtime, enabling efficient cache shari
 ng among parallel tasks running on the same processor. As a real-world cas
 e study, we ported an existing task-parallel implementation of the Fast Mu
 ltipole Method (FMM) to distributed memory with Itoyori and achieved a 7.5
 x speedup when scaled from a single node to 12 nodes and up to 6.0x faster
  performance than without caching. This study demonstrates that global-vie
 w fork-join programming can be made practical and scalable, while requirin
 g minimal changes to the shared-memory code.\n\nTag: Heterogeneous Computi
 ng, Programming Frameworks and System Software, Task Parallelism\n\nRegist
 ration Category: Tech Program Reg Pass\n\nReproducibility Badges: Artifact
  Available, Artifact Functional, Results Reproduced\n\nSession Chair: Clau
 dia Fohry (University of Kassel, Germany)
END:VEVENT
END:VCALENDAR
