Troubleshooting a failed backup restore on the Checkmk appliance

The restore of a backup on the Checkmk Appliance fails.

LAST TESTED ON CHECKMK 2.1.0P17

Getting Started

Background information regarding this subject is available on our:

Problem

You may receive error messages like the following:

Dec  8 14:41:46 hostname webconfd[827]: 127.0.0.1 "GET /ajax_backup_job_state.py?job=restore&_ajaxid=1670506906 HTTP/1.1" 200 2207
Dec  8 14:41:47 hostname mkbackup: Finished system data restore
Dec  8 14:41:47 hostname mkbackup: Performing site restore: test
Dec  8 14:41:47 hostname mkbackup: Error while sending data to restore process: [Errno 32] Broken pipe
Dec  8 14:41:47 hostname webconfd[827]: 127.0.0.1 "GET /ajax_backup_job_state.py?job=restore&_ajaxid=1670506907 HTTP/1.1" 200 3052


2022-12-08 14:41:28 --- Starting restore (Check_MK_Appliance-hostname-daily_full-complete) ---
2022-12-08 14:41:28 Cleaning up Check_MK processess and temporary filesystems
[...]
2022-12-08 14:41:30 Cleaning up system processess
2022-12-08 14:41:30 Performing system restore (system.tar)
2022-12-08 14:41:30 Cleaning up /rw/var/lib/heartbeat/log_daemon
2022-12-08 14:41:30 Cleaning up /rw/var/lib/samba/winbindd_privileged/pipe
2022-12-08 14:41:30 Cleaning up /rw/var/lib/samba/private/msg.sock/870
2022-12-08 14:41:30 Cleaning up /rw/var/lib/samba/private/msg.sock/817
2022-12-08 14:41:30 Finished system restore
2022-12-08 14:41:30 Setting up standalone device environment
2022-12-08 14:41:30 Performing system data restore (system-data.tar)
2022-12-08 14:41:30 Cleaning up /omd/apache
2022-12-08 14:41:30 Cleaning up /omd/versions
2022-12-08 14:41:32 Cleaning up /omd/sites
2022-12-08 14:41:47 Finished system data restore
2022-12-08 14:41:47 Performing site restore: test
2022-12-08 14:41:47 Error while sending data to restore process: [Errno 32] Broken pipe
2022-12-08 14:41:47 b'Failed to open the backup: invalid header\n'
An exception occured:
Traceback (most recent call last):
  File "/usr/sbin/mkbackup", line 1550, in mode_restore
    do_restore(opts, info)
  File "/usr/sbin/mkbackup", line 1581, in do_restore
    do_system_restore(opts, info)
  File "/usr/sbin/mkbackup", line 1701, in do_system_restore
    do_site_restore(info, site=site_id_)
  File "/usr/sbin/mkbackup", line 1655, in do_site_restore
    log(p.stderr.read())
  File "/usr/sbin/mkbackup", line 119, in log
    syslog.syslog(s)
TypeError: [priority,] message string
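
The traceback at the end appears to be a follow-up error rather than the root cause. The line before it shows the restore process's message as a bytes literal (b'...'), and Python 3's syslog.syslog() accepts only str, so logging that message raises the TypeError. The message to act on is "Failed to open the backup: invalid header". The following is a minimal sketch of that mismatch, assuming the stderr output of the restore process is read as bytes; to_log_text is a hypothetical helper for illustration and not part of mkbackup.

```python
def to_log_text(message):
    """Return message as str so it can be passed to a logger that only accepts str.

    Hypothetical helper for illustration; it is not taken from mkbackup.
    """
    if isinstance(message, bytes):
        # Decode the raw subprocess output and drop the trailing newline.
        return message.decode("utf-8", errors="replace").rstrip("\n")
    return str(message)


# The message as it appears in the restore log above (a bytes object).
stderr_output = b"Failed to open the backup: invalid header\n"

# After decoding, the text is a plain str that a str-only logger accepts.
print(to_log_text(stderr_output))
```

With the bytes decoded first, only the "invalid header" message would remain in the log, which is the error the solution below addresses.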

Solution

  1. Log in to the appliance as root.

  2. Now set the default Checkmk version to the version of the site to be restored.

    # Example: for a Checkmk Enterprise Edition site with version 2.1.0p17, the command would be:
    #   omd setversion 2.1.0p17.cee
    omd setversion $VERSION.$EDITION
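
    The argument to omd setversion is the version number and the edition suffix joined by a dot. The following is a small sketch of how that string is composed, using the example values from this article; setversion_command is a hypothetical helper that only builds the command and does not run it.

```python
def setversion_command(version, edition):
    """Build the argument list for `omd setversion VERSION.EDITION`.

    Hypothetical helper for illustration; it only assembles the command.
    """
    return ["omd", "setversion", f"{version}.{edition}"]


# Example values from this article: version 2.1.0p17, Enterprise Edition (cee).
cmd = setversion_command("2.1.0p17", "cee")
print(" ".join(cmd))
```

    The printed line is the command to run as root on the appliance.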