Cheaha: Difference between revisions

From Cheaha
Jump to navigation Jump to search
m (→‎Meta-Scheduling Framework: Remove image link)
(Update url for new docs site and use consistent text for obsolete reference.)
 
(133 intermediate revisions by 11 users not shown)
Line 1: Line 1:
'''Cheaha''' is a shared high-performance computing (HPC) resource sponsored by [http://www.uab.edu/it UAB Information Technology (UAB IT)]. Cheaha is available to members of the UAB community to support research activities in need of enhanced computational capacity.
The Cheaha overview page has moved to Research Computing's new documentation site.  Please visit https://docs.rc.uab.edu/ for information on Cheaha.


Cheaha includes a dedicated pool of local compute resources and can provide seamless access to remote compute resources through the use of grid computing technologies.  The local compute pool contains two resource banks based on the [http://en.wikipedia.org/wiki/X86_64 x86_64 64-bit architecture]. 192 3.0Ghz cores and 120 1.6Ghz cores combine to provide nearly 3TFlops of dedicated computing power. 
The obsolete content of the original page can be found at [[Obsolete: Cheaha]] for historical reference.
 
Use of the local compute pool is controlled via scheduling policies designed to maximize availability of total capacity and ensure guaranteed access to reserved resources.  Use of the remote compute pool is contingent upon allocations for individual users on specific remote resources. Incorporation of remote resources enables simplified account management for HPC users and can significantly increase the total compute capacity available for HPC jobs.
 
Cheaha is located in the UAB Shared Computing facility in BEC. Resource design and development is lead by UAB IT Infrastructure Services in open collaboration with community members.  Development effort is coordinated though a [http://projects.uabgrid.uab.edu/cheaha dedicated project web site].  Operational support is provided by the [http://me.eng.uab.edu/wiki/index.php?title=Main_Page UAB School of Engineering's cluster support group].
 
Cheaha is named in honor of [http://en.wikipedia.org/wiki/Cheaha_Mountain Cheaha Mountain], the highest peak in the state of Alabama.  Cheaha's summit offers panoramic vistas of the surrounding landscape ([http://www.flickr.com/search/?q=cheaha Cheaha photo-stream on Flikr]).
 
== History ==
 
=== Initial Acquisition ===
 
In 2002 UAB was awarded an infrastructure development grant through the NSF EPsCoR program.  This led to the 2005 acquisition of a 64 node compute cluster with 2 AMD Opteron 242 1.6Ghz CPUs per node (128 total cores).  This cluster was named Cheaha.  Cheaha expanded the compute capacity available at UAB and was the first general-access resource for the community. It lead to expanded roles for UAB IT in research computing support through the development of the UAB Shared HPC Facility in BEC and provided further engagement in Globus-based grid computing resource development on campus via UABgrid and regionally via [http://www.suragrid.org SURAgrid].
 
=== 2008 Hardware Upgrade ===
 
In 2008, money was allocated for hardware upgrades which lead to the acquisition of an additional 192 cores based on the Intel Quad-Core E5450 3.0Ghz CPU in August of 2008.  This hardware represented a major technology upgrade that included space for additional expansion to address over-all capacity demand and enable resource reservation. 
 
This upgrade also included enhancements to enable access to the aggregate compute power available to the UAB community and improve management of compute jobs across clusters that are part of the UABgrid computing infrastructure.
 
10Gigabit Ethernet connectivity to the UABgrid Research Network supports high speed data transfers between clusters connected to this network, enabling efficient job staging on multiple resources. [http://www.gridway.org GridWay-based] meta-scheduling of enables management of compute jobs across cluster boundaries and brings grid-computing into production.
 
=== Continuous Resource Improvement ===
 
The 2008 upgrade began a phased development approach for Cheaha with on-going increases in capacity and feature enhancements being brought into production via an [http://projects.uabgrid.uab.edu/cheaha open community process].  The first two phases highlighting the logic connectivity between resources are represented in the diagram on the right.  Phase 1 is scheduled for production in January 2009.
 
== Availability ==
 
== Meta-Scheduling Framework ==
 
== Support ==

Latest revision as of 20:13, 31 August 2022

The Cheaha overview page has moved to Research Computing's new documentation site. Please visit https://docs.rc.uab.edu/ for information on Cheaha.

The obsolete content of the original page can be found at Obsolete: Cheaha for historical reference.