[Dirvish] One of the vaults broken - init enough?
Isaac W Traxler
traxler at lsu.edu
Wed Sep 8 13:14:19 UTC 2021
Presumably you had a working backup at some point in the past and something went wrong. You now probably have numerous backup directories, some of which are still good and some of which contain little or no data.
The first goal is to get a good current backup. Before anything else, make sure no backups are running for home (as suggested by others). Then run a backup init to get a good backup of home. If this fails, tell us what the log files say. Once that works, your future backups should be fine.
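As a rough sketch of that step (not a tested procedure for your setup): the helper below refuses to start while the lock file from your error message is still present, and otherwise asks dirvish for a fresh initial image. The function name init_vault and the DIRVISH override are made up for illustration; the vault name and bank path are taken from the messages quoted below.

```shell
# Hypothetical helper: make the "nothing else is running" check explicit,
# then create a new initial (full) image for the vault.
# Usage: init_vault VAULT BANK_DIR
init_vault() {
    vault=$1
    bank=$2
    # Lock file location as shown in the quoted error message.
    lock="$bank/$vault/dirvish/lock_file"
    if [ -e "$lock" ]; then
        echo "lock file present: $lock (is a backup still running?)" >&2
        return 1
    fi
    # DIRVISH can be overridden (for example DIRVISH=echo) for a dry run.
    ${DIRVISH:-dirvish} --vault "$vault" --init
}

# Example (as root):
#   init_vault backup-home /mnt/backups
```

Only remove a leftover lock file by hand once you are sure no dirvish or rsync process for that vault is still alive.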
This leaves the question of the existing backup directories. One approach would be to do nothing and simply let them eventually get purged. A better approach would be to identify the failed backups and eliminate them. I go to the vault directory for home and do a "du -sh *". This command will take a while, but it will report the amount of space that each backup is taking. It should be obvious which ones failed (they take much less space). You can confirm they failed by looking at the summary files. I normally delete failed backup directories (so that I never assume they are there to restore from).
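The size check and the look at the summary files can be combined in one pass. The following is only a sketch: report_images is a hypothetical name, and it assumes each image's summary file contains a line of the form "Status: ...", which you should verify against one of your own good images first.

```shell
# Hypothetical helper: for every image directory in a vault, print its
# disk usage in KiB, its name, and the Status line from its summary file,
# smallest image first.
# Usage: report_images VAULT_DIR
report_images() {
    vault_dir=$1
    for img in "$vault_dir"/*/; do
        [ -d "$img" ] || continue
        name=$(basename "$img")
        # Skip the vault's own configuration directory; it is not an image.
        [ "$name" = dirvish ] && continue
        size=$(du -sk "$img" | cut -f1)
        # Assumption: summary has a line such as "Status: success".
        status=$(sed -n 's/^Status:[[:space:]]*//p' "$img/summary" 2>/dev/null | head -n 1)
        printf '%s\t%s\t%s\n' "$size" "$name" "${status:-no summary}"
    done | sort -n
}

# Example:
#   report_images /mnt/backups/backup-home
```

Because du is run once per image here, hard-linked files are counted in every image that contains them, so each good image shows roughly its full size and the failed ones stand out at the top of the list. A single "du -sh *" counts shared files only once, in the first image where it sees them.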
The next issue is disk usage. Each init does a "full" backup and does not take advantage of dirvish's (rsync's) ability to back up only the changes. This results in duplicate copies of files. If space is not an issue, you need not worry about this (eventually, as the images age out, the problem will fix itself). When this happens, I go to the directory above (my tree looks like /backups/server/file_system, so I "cd /backups/server") and then run "hardlink file_system" and wait for it to finish (this may also take a long time). The hardlink command will identify all duplicate files and replace the duplicates with hard links to one copy of the data (essentially what Dirvish/rsync does when making the backups). This will recover any "lost" space due to duplicate files.
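To illustrate what replacing a duplicate with a hard link means, here is a minimal sketch for a single pair of files. It is not how the hardlink tool is implemented: the function names are hypothetical, it compares contents only, and it ignores ownership, permissions and timestamps, which the real tool may also take into account.

```shell
# True when both paths refer to the same inode.
# Simplification: assumes both files are on the same filesystem.
same_inode() {
    [ "$(ls -i "$1" | awk '{print $1}')" = "$(ls -i "$2" | awk '{print $1}')" ]
}

# Hypothetical helper: if DUP has the same contents as KEEP, replace DUP
# with a hard link to KEEP so the data is stored only once.
# Usage: link_if_identical KEEP DUP
link_if_identical() {
    keep=$1
    dup=$2
    # Already linked: nothing to do.
    same_inode "$keep" "$dup" && return 0
    # cmp -s is silent and succeeds only when the files match byte for byte.
    if cmp -s "$keep" "$dup"; then
        ln -f "$keep" "$dup"
    fi
}
```

For a whole vault, use the hardlink command itself as described above; many versions offer a dry-run option, so check the man page of the one you have installed before the first real run.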
In fact, I make it a practice to periodically run hardlink on every vault to eliminate any duplicate files as a normal part of maintenance.
Isaac Traxler
LSU/LONI HPC Storage & Infrastructure Manager
traxler at lsu.edu
________________________________
From: dirvish <dirvish-bounces at lists.dirvish.org> on behalf of Rolf-Werner Eilert <rwe-sse at osnanet.de>
Sent: Wednesday, September 8, 2021 5:17 AM
To: dirvish at dirvish.org <dirvish at dirvish.org>
Subject: [Dirvish] One of the vaults broken - init enough?
I hope this list is still alive...
Some time ago I received a
21:45:44 /usr/sbin/dirvish --vault backup-home
backup-home:default:20210727-2145:
/mnt/backups/backup-home/dirvish/lock_file already exists
on the /home vault. After some days I looked it up and yes, I found a
lock file and deleted it, but it had no effect. So I decided to initiate
the /home vault again, then let it backup (only /home) once and went
into holidays.
Since then I get
21:24:51 /usr/sbin/dirvish --vault backup-home
backup-home:default:20210830-2124: ERROR: no images for branch default found
Guess I messed it up, but how can I re-initiate the /home? Of course,
the cronjob starts dirvish-runall, so do I have to expire all of them
first, then initiate again?
Thank you for your help!
Rolf