SG Call March 5, 2012
Application Discovery and Deployment Strategies for SURAgrid
Discovering where users can run applications on OSG resources supporting the SURAgrid VO is key to making the grid easier to use. Mapping the availability of applications across resources is a key function of the SURAgrid community.
Gabriele Garzoglio and Marko Slyz from the OSG VO forum will give an overview of software distribution practices of various VOs on OSG to help us understand options available as we build a solution for SURAgrid VO.
Steve Johnson (TAMU) and David Mathews-Morgan (UGA) will provide an over of early work on an approach to advertise site applications.
Please join us for this important community call on March 5th at 2pm Eastern and help shape SURAgrid operations to benefit the science at your campus.
- VO Application Management in OSG - Presented by Gabriele Garzoglio (FNAL) and Marko Slyz (FNAL) of the OSG VO Forum
- Preliminary SURAgrid Application Discovery - Presented by Steve Johnson(TAMU) and David Mathews-Morgan (UGA). An overview of the of the proposed solution that use's OSG's BDII database.
Conference Bridge: 800-377-8846 Pin: 14421498
Discussion Summary / Notes from the Community
Gabriele Garzoglio (FNAL) presented an overview of OSG and provided guidance on a variety of solutions in place within OSG for how VO's manage their applications from individual jobs simply including the application within the job (for smaller apps, less than 50-100MB in size, just send an app tarball and unpack it in the job) to VO-level solutions where the VO's application group maintains an application set across OSG resources by manual effort a select sites or automated special jobs across all resources to install applications in the $OSG_APP area of resource.
A new tool that is proving useful for solutions in the latter category is the CERN VM Filesystem (CVMFS) which packages a common application namespace across multiple sites.
The application distribution solutions can also integrate with the popular GlideWMS job distribution framework used across. This framework sends probe jobs to compute resources on behalf of the VO and collects available resources into a common pool of compute nodes. This framework leverages Condor rescheduling semantics and can scan a site's $OSG_APP directory for the VO to advertise the list of applications available at specific sites in order to match a job to compute resources that support application requirements defined by the job.
Other points raised:
- Applications should only assume only a simple base OS install, most commonly the RedHat EL5 deritives, Scientific Linux 5 and CentOS5.
- VOs should consider the level of site support for their VO in choosing an application distribution method. Typically, sites on OSG do not support VOs directly and its incumbant on the VO to maintain the applications across resources.
- We should feel welcome to contact firstname.lastname@example.org to answer any specific questions we have aout implementing this solution.
Steve Johnson (TAMU) presented the initial work on explore BDII as an information repository for application information at a specific site. In this scenario, sites would register their provided applications to the BDII database and then application users could query this data set to determine the resources which can support the application.
An interesting potential of an application database or common application advertising solution would be to offer site-optimized applications with the potential for some very tuned software configurations. The apps definitions could also be a virtual machine.
Steve solicited participation and encouraged engagement in refining this solution into a comprehensive solution for SURAgrid.