Dmtcp Checkpointing

From Cheaha
Revision as of 16:56, 7 February 2017 by Curtish@uab.edu (talk | contribs)
Jump to navigation Jump to search


Attention: Research Computing Documentation has Moved
https://docs.rc.uab.edu/


Please use the new documentation url https://docs.rc.uab.edu/ for all Research Computing documentation needs.


As a result of this move, we have deprecated use of this wiki for documentation. We are providing read-only access to the content to facilitate migration of bookmarks and to serve as an historical record. All content updates should be made at the new documentation site. The original wiki will not receive further updates.

Thank you,

The Research Computing Team

DMTCP: Distributed MultiThreaded CheckPointing

Coming soon to Cheaha.RC as a module: http://dmtcp.sourceforge.net/FAQ.html

example SLURM Job: https://github.com/dmtcp/dmtcp/tree/master/plugin/batch-queue/job_examples

  • to use "srun" or not? - just for better reporting via sacct -j ### / sstat -j ###