
Email Announcement Archive

[Users] Call for Participation: First International Symposium on Checkpointing for Supercomputing (SuperCheck21)

Author: Zhengji Zhao <zzhao_at_lbl.gov>
Date: 2020-09-24 15:28:54

Dear NERSC users,

We would like to invite you to participate in the first International Symposium on Checkpointing for Supercomputing (SuperCheck21), which will be held February 4-5, 2021. The Call for Participation is attached below.

Best regards,
Zhengji Zhao <https://www.nersc.gov/about/nersc-staff/user-engagement/zhengji-zhao/>, NERSC at Lawrence Berkeley National Laboratory (LBNL)
Rebecca Hartman-Baker <https://www.nersc.gov/about/nersc-staff/user-engagement/rebecca-hartman-baker/>, NERSC at LBNL
Gene Cooperman <https://www.khoury.northeastern.edu/people/gene-cooperman/>, Northeastern University
Devesh Tiwari <https://coe.northeastern.edu/people/tiwari-devesh/>, Northeastern University

___________________________

First International Symposium on Checkpointing for Supercomputing (SuperCheck21)
Call for Participation

NERSC <https://www.nersc.gov> is hosting the First International Symposium on Checkpointing for Supercomputing, which will be held February 4-5, 2021. This free event will be held online and will feature the latest work in checkpoint/restart research, tools development and production use.

About the Symposium

Checkpoint/Restart (C/R) is critical for fault-tolerant computing in high-performance computing (HPC). While there has been much research and development on C/R and C/R tools, few HPC end users are able to use these tools in production workloads. Although research codes often demonstrate promising C/R capabilities, there are no feasible C/R options for diverse production workloads, especially on cutting-edge HPC systems. In this symposium, we will bring together C/R researchers, practitioners, application developers, and end users to share both the latest research results and experiences on adopting C/R tools in production. The goal of this symposium is to showcase the latest research on C/R, motivate the development of usable C/R tools, and boost the adoption of C/R tools in HPC production workloads.
This symposium features up-to-the-minute, original and high-quality work, and will be presentation only (no papers). Authors are required to submit a two-page extended abstract for peer review. Accepted abstracts will be published at arxiv.org, and the authors will be invited to present their work at the symposium. The presentation slides and recordings along with the presenter profiles will be posted on the symposium website <https://easychair.org/my/conference?conf=supercheck21#>. We encourage participation from researchers, end users, professionals and students.

Topics of Interest

We welcome any and all aspects of checkpointing for science and engineering in the high-performance computing (HPC) context, including the latest research results and development, deployment, and application experiences. The symposium scope includes but is not limited to:

C/R research and tools development:
- C/R targeting the full range of supercomputing software, including MPI, OpenMP, GPGPU software, FPGAs, cloud, and container applications, etc.
- Both pure and hybrid approaches to transparent checkpointing (examples of hybrid approaches include application-specific plugins to aid in checkpointing, and integrated modules for transparent checkpointing as part of larger scientific/engineering toolkits)
- Frameworks for multi-level checkpointing
- The development of new methods for low-overhead checkpointing, newer fundamental algorithms, software development methods, the impact of future supercomputer hardware, performance evaluation, reproducibility, and fault recovery
- Research on C/R scheduling and intervals

C/R use in production (including all levels of checkpointing: application, job, and system levels):
- The adoption of transparent C/R tools in production workloads (C/R use cases)
- The application-initiated use of C/R tools (alternative to built-in internal checkpointing)
- C/R applications and support on HPC systems (e.g., resource scheduling, system utilization, batch system integration, best practices, etc.)

Submission

We invite authors to submit their original, high-quality work. All submissions should be made electronically through the SuperCheck21 submissions website <https://easychair.org/my/conference?conf=supercheck21#>.

Submissions must be double blind, i.e., authors should remove their names, institutions, and any identifying hints found in references to earlier work. When discussing past work, they need to refer to themselves in the third person, as if they were discussing another researcher’s work. Furthermore, authors must identify any conflict of interest with the PC chair or PC members.

Authors are required to submit an abstract of fewer than 150 words (this will be used for the symposium website), a two-page extended abstract, and presenter bios (fewer than 300 words). The page limit includes figures and tables, but does not include references, for which there is no page limit. Extended abstracts should be submitted in the IEEE conference format as a PDF.

Click here to submit your abstracts.
<https://easychair.org/my/conference?conf=supercheck21#>

Upon Acceptance

The symposium will feature a keynote speaker, an invited talk, a panel discussion, and technical talks, a mix of 10-minute and 25-minute talks, each with 5 minutes of discussion.

If your submission is accepted, you will have about one month to finalize your talk. All presentations will be pre-recorded. Presenters will receive instructions and more information on recording and uploading their presentations, which are due January 22, 2021. All presentation slides, recordings, and presenter bios will be included in the technical program archive on the SuperCheck21 website.

Participation

The symposium will be held on February 4-5, 2021, 8:00am–12:45pm Pacific Time. All participants, including presenters, are required to register. Registration is free. Click here to register for the symposium <https://ckpt-symposium.lbl.gov/registration>

Important Dates
- Call for Participation Release: September 24, 2020
- Abstract Submission Due: December 7, 2020 (AoE)
- Acceptance Notification: December 20, 2020
- Presentation Submission Due: January 22, 2021 (AoE)
- Symposium: February 4-5, 2021

Organizers
- Zhengji Zhao <https://www.nersc.gov/about/nersc-staff/user-engagement/zhengji-zhao/>, National Energy Research Scientific Computing Center (NERSC) at Lawrence Berkeley National Laboratory (LBNL)
- Rebecca Hartman-Baker <https://www.nersc.gov/about/nersc-staff/user-engagement/rebecca-hartman-baker/>, NERSC at LBNL
- Gene Cooperman <https://www.khoury.northeastern.edu/people/gene-cooperman/>, Northeastern University
- Devesh Tiwari <https://coe.northeastern.edu/people/tiwari-devesh/>, Northeastern University

Contact: Zhengji Zhao, zzhao@lbl.gov

--
Zhengji Zhao, Ph.D.
User Engagement Group - HPC consultant
National Energy Research Scientific Computing Center at Lawrence Berkeley National Laboratory
1 Cyclotron RD, M/S 59R4010A, Berkeley, CA 94720
zzhao@lbl.gov | phone: (510) 631-5025 | fax: 510-486-6459

_______________________________________________
Users mailing list
Users@nersc.gov
