Checkpoint/restore tool v1.0
Checkpoint/restore tool v1.0
Posted Nov 25, 2013 19:14 UTC (Mon)
by arekm (guest, #4846)
[Link] (5 responses)
Posted Nov 25, 2013 19:14 UTC (Mon) by arekm (guest, #4846) [Link] (5 responses)
criu --check could actually tell what's missing since that is not obvious:
$ uname -a
Linux t400 3.12.1 #73 SMP PREEMPT Wed Nov 20 22:46:34 CET 2013 x86_64 Intel(R)_Core(TM)2_Duo_CPU_____T9400__@_2.53GHz PLD Linux
$ sudo criu check
/proc/<pid>/map_files directory is missing.
(00.001273) Error (sk-unix.c:353): Can't stat socket 0x31e00f(./tmp/ksocket-arekm/plasma-desktophK7986.slave-socket): No such file or directory
/proc/sys/kernel/ns_last_pid sysctl is missing.
System call kcmp is not supported
prctl: PR_GET_TID_ADDRESS is not supported
/proc/sys/kernel/sem_next_id sysctl is missing.
(00.002998) Warn (cr-check.c:514): Dirty tracking is OFF. Memory snapshot will not work.
/proc/<pid>/timers file is missing.
Checkpoint/restore tool v1.0
Posted Nov 25, 2013 19:31 UTC (Mon)
by rmini (subscriber, #4991)
[Link] (3 responses)
Posted Nov 25, 2013 19:31 UTC (Mon) by rmini (subscriber, #4991) [Link] (3 responses)
Checkpoint/restore tool v1.0
Posted Nov 25, 2013 19:42 UTC (Mon)
by arekm (guest, #4846)
[Link] (2 responses)
Posted Nov 25, 2013 19:42 UTC (Mon) by arekm (guest, #4846) [Link] (2 responses)
Checkpoint/restore tool v1.0
Posted Nov 25, 2013 20:19 UTC (Mon)
by arekm (guest, #4846)
[Link] (1 responses)
Posted Nov 25, 2013 20:19 UTC (Mon) by arekm (guest, #4846) [Link] (1 responses)
dumping mc:
$ sudo criu dump -t 6089
(00.002344) Error (sk-unix.c:353): Can't stat socket 0x66bf(./tmp/ksocket-arekm/kmailoy5661.slave-socket): No such file or directory
(00.002694) Error (sk-unix.c:353): Can't stat socket 0x45f6(./tmp/ksocket-arekm/plasma-desktopKS5342.slave-socket): No such file or directory
(00.009186) Error (tty.c:203): tty: Can't obtain ptmx index: Inappropriate ioctl for device
(00.009231) Error (cr-dump.c:1491): Dump files (pid: 6089) failed with -1
(00.009661) Error (cr-dump.c:1811): Dumping FAILED.
restoring bash:
$ sudo criu restore
(00.004443) 6216: Error (tty.c:178): tty: Found slave peer index 4 without correspond master peer
(00.004621) Error (cr-restore.c:1062): 6216 exited, status=1
(00.004636) Error (cr-restore.c:1597): Restoring FAILED
dumping && restoring "sleep 1000 < /dev/null > /dev/null &"
$ sudo criu dump -t 6280 --shell-job
(00.002147) Error (sk-unix.c:353): Can't stat socket 0x45f6(./tmp/ksocket-arekm/plasma-desktopKS5342.slave-socket): No such file or directory
$ sudo criu restore
(00.018195) 6280: Error (tty.c:178): tty: Found slave peer index 4 without correspond master peer
(00.018406) Error (cr-restore.c:1062): 6280 exited, status=1
(00.018430) Error (cr-restore.c:1597): Restoring FAILED.
Checkpoint/restore tool v1.0
Posted Nov 25, 2013 21:06 UTC (Mon)
by mathstuf (subscriber, #69389)
[Link]
Posted Nov 25, 2013 21:06 UTC (Mon) by mathstuf (subscriber, #69389) [Link]
> (00.004443) 6216: Error (tty.c:178): tty: Found slave peer index 4 without correspond master peer
You're missing `--shell-job` here (from my reading of the docs[1]). Also probably needed for `mc` dumping as well.
For the socket issues, try `--ext-unix-sk`[2].
[1]http://criu.org/Advanced_usage#Shell_jobs_C.2FR
[2]http://criu.org/Advanced_usage#External_UNIX_sockets
Checkpoint/restore tool v1.0
Posted Nov 25, 2013 19:38 UTC (Mon)
by mathstuf (subscriber, #69389)
[Link]
Posted Nov 25, 2013 19:38 UTC (Mon) by mathstuf (subscriber, #69389) [Link]
Checkpoint/restore tool v1.0
Posted Nov 26, 2013 11:42 UTC (Tue)
by Anssi (subscriber, #52242)
[Link]
Posted Nov 26, 2013 11:42 UTC (Tue) by Anssi (subscriber, #52242) [Link]
I guess this new solution is considerably more robust :)
Checkpoint/restore tool v1.0
Posted Nov 27, 2013 22:03 UTC (Wed)
by gvy (guest, #11981)
[Link]
Posted Nov 27, 2013 22:03 UTC (Wed) by gvy (guest, #11981) [Link]
Checkpoint/restore tool v1.0
Posted Nov 27, 2013 23:04 UTC (Wed)
by kolyshkin (guest, #34342)
[Link]
Posted Nov 27, 2013 23:04 UTC (Wed) by kolyshkin (guest, #34342) [Link]
Also, it seems like many people do not quite grasp why CRIU fails to checkpoint something (usually "an external resource"). So here's the article that tries to explain it http://criu.org/What_cannot_be_checkpointed
Hibernation
Posted Nov 28, 2013 2:49 UTC (Thu)
by bojan (subscriber, #14302)
[Link] (3 responses)
Posted Nov 28, 2013 2:49 UTC (Thu) by bojan (subscriber, #14302) [Link] (3 responses)
Hibernation
Posted Nov 28, 2013 3:36 UTC (Thu)
by mjg59 (subscriber, #23239)
[Link]
Posted Nov 28, 2013 3:36 UTC (Thu) by mjg59 (subscriber, #23239) [Link]
The audio case could be fixed by special casing Pulseaudio, but you have similar issues with some other hardware. Fixing it would be a pretty significant amount of effort.
Hibernation
Posted Nov 28, 2013 3:40 UTC (Thu)
by raven667 (subscriber, #5198)
[Link]
Posted Nov 28, 2013 3:40 UTC (Thu) by raven667 (subscriber, #5198) [Link]
Hibernation
Posted Nov 28, 2013 5:47 UTC (Thu)
by dlang (guest, #313)
[Link]
Posted Nov 28, 2013 5:47 UTC (Thu) by dlang (guest, #313) [Link]
you may want to move some apps to a different machine, or just free up the resources that some app is using without loosing the work that it's done.
Also hibernation doesn't allow for you to change kernels in the stop/start process and it keeps kernel state.
they may sound similar, but since hibernation and checkpoint/restore have different scope to what they deal with, the resulting uses are very different.