DMTCP Checkpointing

From Cheaha
Revision as of 19:21, 14 September 2017 by Curtish@uab.edu (talk | contribs) (add keyword checkpoint)



DMTCP: Distributed MultiThreaded CheckPointing

Search keywords: checkpoint check point

DMTCP FAQ: http://dmtcp.sourceforge.net/FAQ.html

Available on Cheaha through the following modules (load one):

  • module load DMTCP/2.4.5
  • module load DMTCP/2.5.0

Example SLURM jobs: https://github.com/dmtcp/dmtcp/tree/master/plugin/batch-queue/job_examples

  • Should the job use "srun"? It is optional; its only benefit is better reporting via sacct -j ### / sstat -j ###
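
The points above can be put together in a minimal job script sketch. It is an illustration under stated assumptions, not a tested Cheaha recipe: the application path ./my_app, the one-hour checkpoint interval, the #SBATCH values, and the helper name build_launch_cmd are all hypothetical, and placing "srun" in front of dmtcp_launch is only one possible arrangement for a single-task job. For multi-node or MPI jobs, follow the upstream job examples linked above instead.

```shell
#!/bin/bash
#SBATCH --job-name=dmtcp-demo
#SBATCH --ntasks=1
#SBATCH --time=02:00:00

# Hypothetical settings; override them from the environment as needed.
APP="${APP:-./my_app}"                  # placeholder application (no spaces in the path)
CKPT_INTERVAL="${CKPT_INTERVAL:-3600}"  # seconds between automatic checkpoints
USE_SRUN="${USE_SRUN:-1}"               # 1 = launch through srun, 0 = launch directly

# Print the launch command, with or without the srun prefix.
build_launch_cmd() {
    local prefix=""
    if [ "$USE_SRUN" = "1" ]; then
        prefix="srun "
    fi
    echo "${prefix}dmtcp_launch --interval ${CKPT_INTERVAL} ${APP}"
}

# Load DMTCP only where the module system is present.
if command -v module >/dev/null 2>&1; then
    module load DMTCP/2.5.0
fi

# Run the job if DMTCP is on the PATH; otherwise only show the command.
if command -v dmtcp_launch >/dev/null 2>&1; then
    cmd="$(build_launch_cmd)"
    $cmd    # relies on word splitting, hence the no-spaces assumption above
else
    echo "dmtcp_launch not found; would run: $(build_launch_cmd)"
fi
```

Submitting with USE_SRUN=0 in the environment runs the same job without the srun wrapper, which makes it easy to compare what sacct -j ### and sstat -j ### report in each case. After a checkpoint has been written, DMTCP normally leaves a dmtcp_restart_script.sh next to the checkpoint images; a follow-up job can run that script to resume.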