DMTCP Checkpointing
Revision as of 19:21, 14 September 2017 by Curtish@uab.edu (talk | contribs) (add keyword checkpoint)
Attention: Research Computing Documentation has Moved
https://docs.rc.uab.edu/
Please use the new documentation url https://docs.rc.uab.edu/ for all Research Computing documentation needs.
As a result of this move, we have deprecated use of this wiki for documentation. We are providing read-only access to the content to facilitate migration of bookmarks and to serve as an historical record. All content updates should be made at the new documentation site. The original wiki will not receive further updates.
Thank you,
The Research Computing Team
DMTCP: Distributed MultiThreaded CheckPointing
Search keywords: checkpoint check point
DMTCP FAQ: http://dmtcp.sourceforge.net/FAQ.html
Available on Cheaha.RC
- module load DMTCP/2.4.5
- module load DMTCP/2.5.0
Example SLURM jobs: https://github.com/dmtcp/dmtcp/tree/master/plugin/batch-queue/job_examples
- Whether to use "srun" or not: it is optional, and is useful only for better reporting via sacct -j ### / sstat -j ###
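A minimal sketch of such a job may help make the notes above concrete. The snippet below writes a small job script to dmtcp_job.sh. It is an illustration only, loosely modelled on the approach in the linked job examples, not a tested Cheaha recipe. The program name ./my_app, the job name, the time limit, and the one-hour checkpoint interval are all placeholders, and putting srun in front of dmtcp_launch is an assumption about how the srun note is meant to be applied.

```shell
#!/bin/bash
# Write a minimal DMTCP job script to dmtcp_job.sh (illustrative sketch only).
cat > dmtcp_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=dmtcp_demo
#SBATCH --ntasks=1
#SBATCH --time=02:00:00

module load DMTCP/2.5.0

# Start a coordinator in the background on an OS-chosen port (-p 0)
# and have it record the chosen port in a file.
port_file=$(mktemp)
dmtcp_coordinator --daemon --exit-on-last -p 0 --port-file "$port_file"

# Wait briefly until the coordinator has written its port.
for _ in 1 2 3 4 5 6 7 8 9 10; do
    [ -s "$port_file" ] && break
    sleep 1
done
coord_host=$(hostname)
coord_port=$(cat "$port_file")

# ./my_app is a placeholder program; -i 3600 asks for a checkpoint every hour.
# The leading srun is optional and only improves sacct/sstat reporting.
srun dmtcp_launch -h "$coord_host" -p "$coord_port" -i 3600 ./my_app
EOF
```

The generated file would then be submitted with sbatch dmtcp_job.sh. When a checkpoint is taken, DMTCP writes ckpt_*.dmtcp image files together with a dmtcp_restart_script.sh that can be used to resume the run.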