Workshop: Fourth International Symposium on Checkpointing for Supercomputing (SuperCheck-SC23)
Session Chairs
Gene Cooperman
Northeastern University
Bogdan Nicolae
Argonne National Laboratory (ANL)
Illinois Institute of Technology
Rebecca Hartman-Baker
National Energy Research Scientific Computing Center (NERSC)
Lawrence Berkeley National Laboratory (LBNL)
Donglai Dai
Advanced Micro Devices (AMD)
Event Type
Workshop
Time
Sunday, 12 November 2023
2pm - 5:30pm
Location
710
Tags
Fault Handling and Tolerance
Registration Categories
W
Presentations
2:00pm - 2:05pm
Welcome to SuperCheck-SC23
Presenter
Rebecca Hartman-Baker
2:05pm - 2:50pm
AI-Augmented SWARM Based Resilience for Integrated Research Infrastructures
Presenter
Franck Cappello
2:50pm - 3:00pm
Lightning Talk: Diaspora – Resilient Event Processing for Irregular, Distributed Scientific Applications
Presenter
Justin Wozniak
3:00pm - 3:25pm
SuperCheck-SC23 – Afternoon Break
3:25pm - 3:50pm
Checkpoint/Restart for CUDA Kernels
Author/Presenters
Niklas Eiling
Stefan Lankes
Antonello Monti
3:50pm - 4:15pm
Implementation-Oblivious Transparent Checkpoint-Restart for MPI
Author/Presenters
Yao Xu
Leonid Belyaev
Twinkle Jain
Derek Schafer
Anthony Skjellum
Gene Cooperman
4:15pm - 4:40pm
Asynchronous Multi-Level Checkpointing: An Enabler of Reproducibility using Checkpoint History Analytics
Author/Presenters
Kevin Assogba
Bogdan Nicolae
Huub Van Dam
M. Mustafa Rafique
4:40pm - 4:50pm
Lightning Talk: Update on Checkpointing and Localized Recovery for Nested Fork-Join Programs
Presenter
Claudia Fohry
4:50pm - 5:00pm
Lightning Talk: Toward Efficient Asynchronous Checkpointing for Large-Language Models
Presenter
Avinash Maurya
5:00pm - 5:10pm
Lightning Talk: Inherent Checkpointing Properties of Nested Parallelism
Presenter
Stanislav Bratanov
5:10pm - 5:20pm
Lightning Talk: Trade-Offs For Developing File Aggregated I/O For Asynchronous Checkpointing
Presenter
Mikaila Gossman
5:20pm - 5:30pm
Lightning Talk: Datastates for Debugging – Using Productive Checkpointing for Improved Debugging
Presenter
Robert Underwood