Kempner Institute Spring 2024 Compute Workshop#
Date: March 28, 2024
Time: 1:00 - 4:00 PM
Location: SEC 2.118
Presenters: Ella Batty, Naeem Khoshnevis, Max Shad
Welcome to the Kempner Institute Spring 2024 Compute Workshop! This workshop is designed to provide an introduction to High-Performance Computing (HPC) and the Kempner Institute AI cluster. The workshop will cover the basics of HPC, including an overview of the Kempner Institute AI cluster architecture and storage tiers. We will also discuss data transfer methods, code synchronization, and software modules. The workshop will include an introduction to job management and monitoring, advanced computing techniques, and support and troubleshooting.
Infrastructure Orientation#
Welcome and Introduction
Cluster Access
Overview of the Kempner Institute Cluster Architecture
Understanding Storage Tiers
Shared Open-Source Data Repositories on Cluster
Good Citizenship on the Cluster
Development#
Cluster Access
SSH Access
ssh <username>@login.rc.fas.harvard.edu
Open OnDemand (demo)
See Accessing the Cluster for full details.
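Optionally, you can add a host entry to your local SSH configuration so you do not have to type the full hostname each time; this also simplifies remote development with VSCode later. A minimal sketch (the alias fasrc and <username> are placeholders):
# ~/.ssh/config on your laptop
Host fasrc
    HostName login.rc.fas.harvard.edu
    User <username>
With this in place, ssh fasrc connects you to a login node.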
Software Modules on the AI Cluster
Software modules via module load
module avail
module load python
See Software Modules for full details.
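As a rough sketch of a typical module workflow (the exact module names and versions available on the cluster may differ; check module avail first):
module avail python      # list Python-related modules
module load python       # load the default Python module
module list               # show currently loaded modules
module purge              # unload all modules when finished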
Conda/mamba environments
Why use conda environments?
What is mamba? FASRC uses mamba, a drop-in replacement for conda that is generally much faster.
Try it yourself
Try creating a conda environment called myenv in your home directory by following these steps. Make it usable in Jupyter notebooks with one additional step.
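A rough sketch of those steps (the module name, Python version, and the ipykernel step are assumptions; follow the linked instructions for the exact commands):
module load python                                 # provides mamba/conda on the cluster
mamba create -n myenv python=3.10 -y               # create the environment
mamba activate myenv                               # activate it
mamba install -y ipykernel                         # needed for Jupyter support
python -m ipykernel install --user --name myenv    # register the environment as a Jupyter kernel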
Spack
See Spack Package Manager for full details.
Code Synchronization
Using Git:
Step 1: Create a folder for the workshop exercise and navigate to it.
Step 2: Clone the repository:
git clone https://github.com/KempnerInstitute/intro-compute-march-2024.git
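For example, the two steps together might look like this (the folder name workshop_exercise is just a placeholder):
mkdir workshop_exercise && cd workshop_exercise
git clone https://github.com/KempnerInstitute/intro-compute-march-2024.git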
VSCode
Try it yourself
Set up remote development using VSCode by following these steps.
Data Transfer
scp/rsync: See Data Transfer for full details.
Try it yourself
Navigate to the Data_transfer_example folder here and download data.npy to your computer. Use scp or rsync to transfer this data to your home directory on the cluster.
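A rough sketch of the transfer, run from your laptop (the local path is a placeholder; replace <username> with your cluster account):
scp ~/Downloads/data.npy <username>@login.rc.fas.harvard.edu:~/
# or, equivalently, with rsync:
rsync -avP ~/Downloads/data.npy <username>@login.rc.fas.harvard.edu:~/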
Globus: Follow the steps in Globus to set up endpoints on the cluster and your laptop.
Job Management and Monitoring#
Fairshare Policy and Job Priority Basics (Max)
Example: Check your lab Fairshare score
sshare --account=kempner_grads --all
Example: Check your jobs' fairshare and priority in the queue
sprio -l | head -1 && sprio -l | grep $USER
Example: Check all jobs running on kempner partitions
squeue -p kempner -o "%.18i %.9P %.20u %.50j %.8T %.10M %.5D %.20R" | sort -n -k 7
squeue -p kempner_requeue -o "%.18i %.9P %.20u %.50j %.8T %.10M %.5D %.20R" | sort -n -k 7
Example: Fairshare score calculations
scalc
Example: Monitor Fairshare progress through Grafana
SLURM Partitions
FASRC SLURM Partitions
Example: Check SLURM partition settings
scontrol show partition kempner
scontrol show partition kempner_requeue
Example: Check status of all Kempner partitions
spart | awk 'NR==1 || /kempner/'
Example: Check status of nodes within a Kempner partition
lsload | head -n 1 && lsload | grep "8a19"
lsload | head -n 1 && lsload | grep "8a17"
SLURM Interactive Jobs via Open OnDemand and VSCode
Open OnDemand: See Open OnDemand.
VSCode: See Connecting to the FASRC cluster (Compute node).
SLURM Batch Job Submission Basics
See Batch Jobs.
Try it yourself
Navigate to the SLURM_example_1 directory. Here we have a Python script that simply occupies the CPU and memory for a certain amount of time. Take a look at the job submission script run.sh and the Python script cpu_mem_occupy.py.
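The actual run.sh in the repository may differ, but a submission script for a job like this typically looks roughly like the following sketch (partition, account, resources, and time limit are assumptions):
#!/bin/bash
#SBATCH --job-name=cpu_mem_occupy
#SBATCH --account=kempner_grads      # use your own lab's account
#SBATCH --partition=kempner          # partition is an assumption
#SBATCH --nodes=1
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=1
#SBATCH --mem=4G
#SBATCH --time=00:10:00
#SBATCH --output=%x_%j.out           # %x = job name, %j = job ID

python cpu_mem_occupy.py             # arguments controlling the duration, if any, go here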
Test the job submission script:
You can test the job submission by adding the following directive to the run.sh script:
#SBATCH --test-only
This reports what would happen if you submitted the job, including an estimated start time, without actually submitting it. (Try it!)
Submit the job:
Drop the --test-only flag, set the duration to 300 seconds, and submit the job using the following command:
sbatch run.sh
Check the job status:
You can check the status of the job using the following command:
squeue -u <username>
or
squeue -u $USER
or
squeue --me
Note that the squeue wrapper command used on the cluster updates job status with some delay.
Cancel the job:
Resubmit the job and try cancelling it using the following commands.
Cancel the job using the job id:
scancel <job_id>
Cancel all jobs of the user:
scancel -u <username>
Cancel only pending jobs:
scancel --state=pending -u <username>
SLURM Batch Job Submission Advanced
Array Jobs
See Array Jobs.
Try it yourself
Navigate to the SLURM_example_2 directory. Take a look at the job submission script run_array_job.sh, the Python script hyperparameter_tuning.py, and the CSV file hyperparemters.csv. Can you figure out what would happen if you run this job? (A rough sketch of a similar array script appears after this exercise.)
Submit the array job:
sbatch run_array_job.sh
Check the status of the job. Look at the output files created (once it runs). Do they match what you would expect?
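The repository's run_array_job.sh may differ, but a minimal sketch of an array job that hands each task one row of the hyperparameter CSV could look like this (the array size and the Python script's command-line interface are assumptions):
#!/bin/bash
#SBATCH --job-name=hyperparameter_tuning
#SBATCH --account=kempner_grads      # use your own lab's account
#SBATCH --partition=kempner          # partition is an assumption
#SBATCH --array=1-4                  # one task per CSV row (count is an assumption)
#SBATCH --time=00:10:00
#SBATCH --mem=4G
#SBATCH --output=%x_%A_%a.out        # %A = array job ID, %a = task ID

# Each task reads the CSV row matching its index (skipping the header)
# and passes it to the Python script.
ROW=$(sed -n "$((SLURM_ARRAY_TASK_ID + 1))p" hyperparemters.csv)
python hyperparameter_tuning.py "$ROW"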
Useful SLURM commands
Monitoring Job Status and Utilization
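A rough sketch of commands for checking a job's status and resource utilization (<job_id> is a placeholder):
sacct -j <job_id> --format=JobID,JobName,State,Elapsed,MaxRSS,AllocCPUS   # accounting info while running or after completion
seff <job_id>                                                             # CPU and memory efficiency summary after the job completes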
Advanced Computing Techniques#
Best practices for HPC efficiency
Introduction to parallel computing
Containerization with Singularity
Distributed Computing and Training
Support and Troubleshooting#
Troubleshooting Common Issues
Support Framework: FASRC and Kempner Engineering Team
Send a ticket to FASRC (rchelp [at] rc.fas.harvard.edu)
Closing Remarks and Q&A Session