Dmtcp Checkpointing
Revision as of 19:17, 14 September 2017 by Curtish@uab.edu (talk | contribs) (Curtish@uab.edu moved page Dmtcp to Dmtcp Checkpointing: make this page searchable under the keyword checkpoint)
Attention: Research Computing Documentation has Moved
https://docs.rc.uab.edu/
https://docs.rc.uab.edu/
Please use the new documentation url https://docs.rc.uab.edu/ for all Research Computing documentation needs.
As a result of this move, we have deprecated use of this wiki for documentation. We are providing read-only access to the content to facilitate migration of bookmarks and to serve as an historical record. All content updates should be made at the new documentation site. The original wiki will not receive further updates.
Thank you,
The Research Computing Team
DMTCP: Distributed MultiThreaded CheckPointing
http://dmtcp.sourceforge.net/FAQ.html
Available on Cheaha.RC
- module load DMTCP/2.4.5
- module load DMTCP/2.5.0
example SLURM Job: https://github.com/dmtcp/dmtcp/tree/master/plugin/batch-queue/job_examples
- to use "srun" or not? - just for better reporting via sacct -j ### / sstat -j ###