Cheaha Quick Start
https://docs.rc.uab.edu/
Please use the new documentation url https://docs.rc.uab.edu/ for all Research Computing documentation needs.
As a result of this move, we have deprecated use of this wiki for documentation. We are providing read-only access to the content to facilitate migration of bookmarks and to serve as an historical record. All content updates should be made at the new documentation site. The original wiki will not receive further updates.
Thank you,
The Research Computing Team
NOTE: This page is still under development. Please refer to Getting Started page for detailed documentation.
Cheaha is a shared cluster computing environment for UAB researchers. Cheaha offers total 110 TFLOPS compute power, 4.7 PB high-performance storage and 20 TB memory. See Hardware for more details on compute platform, but first let's get started with an example and see how easy it is to use.
If you have any questions about Cheaha usage then please contact Research Computing team at support@vo.uabgrid.uab.edu .
Logging In
More detailed login instructions are also available.
Most users will authenticate to Cheaha using their BlazerID and associated password using an SSH (Secure Shell) client. The basic syntax is as follows:
ssh BLAZERID@cheaha.rc.uab.edu
Hello Cheaha!
A shared cluster environment like Cheaha uses a job scheduler to run tasks on the cluster to provide optimal resource sharing among users. Cheaha uses a job scheduling system call SGE to schedule and manage jobs. A user needs to tell SGE about resource requirements (e.g. CPU, memory) so that it can schedule jobs effectively. These resource requirements along with actual application code can be specified in a single file commonly referred as 'Job Script/File'. Following is a simple job script that prints job number and hostname.
#!/bin/bash # #SBATCH --job-name=test #SBATCH --output=res.txt #SBATCH --ntasks=1 #SBATCH --time=10:00 #SBATCH --mem-per-cpu=100 #SBATCH --mail-type=FAIL #SBATCH --mail-user=$USER@uab.edu srun hostname srun sleep 60
Lines starting with '#SBATCH' have a special meaning in the SLURM world. SLURM specific configuration options are specified after the '#SBATCH' characters. Above configuration options are useful for most job scripts and for additional configuration options refer to SLURM commands manual. A job script is submitted to the cluster using SLURM specific commands. There are many commands available, but following three commands are the most common:
- sbatch - to submit job
- scancel - to delete job
- squeue - to view job status
We can submit above job script using sbatch command:
$ sbatch HelloCheaha.sh Submitted batch job 52707
When the job script is submitted, SLURM queues it up and assigns it a job number (e.g. 52707 in above example). The job number is available inside job script using environment variable $JOB_ID. This variable can be used inside job script to create job related directory structure or file names.
Software
Cheaha's software stack includes many scientific computing softwares. Below is list of popular softwares available on Cheaha:
These softwares can be included in a job environment using environment modules. Environment modules make environment variables modification easy and repeatable. Please refer to Cheaha_Quick_Start_Softwares page for more details.
Storage
Need to talk about GPFS here
Graphical Interface
Some applications use graphical interface to perform certain actions (e.g. submit buttons, file selections etc.). Cheaha supports graphical applications using an interactive X-Windows session with SLURMS sinteractive command. This will allow you to run graphical applications like MATLAB or AFNI on Cheaha. Refer to Cheaha_Quick_Start_Interactive_Jobs for details on running graphical X-Windows applications.
Scheduling Policies
Support
If you have any questions about our documentation or need any help with Cheaha then please contact us on support@vo.uabgrid.uab.edu . Cheaha is maintained by UAB IT's Research Computing team.