[Dirvish] One of the vaults broken - init enough?

Rolf-Werner Eilert rwe-sse at osnanet.de
Wed Sep 8 15:13:50 UTC 2021


Thank you for your comprehensive answers, Isaac and Paul,

I'll look it up and be back in couple of days.

Regards
Rolf

Am 08.09.21 um 15:14 schrieb Isaac W Traxler:
> Presumably you had a working abckup at some point in the past and 
> something went wrong. You now probably have numerous backup directories 
> where a portion are still good but some number have littlle of no data.
> 
> The first goal is to get a good current backup. Before anythong elase, 
> make sure no backups are running for home (as suggested by others). Now 
> run a backup  init to get a good backup of home. If this fails, tell us 
> ehat the log  files say. Once that works, tyour future backups should be 
> fine.
> 
> This leaves the question of the exising backup directories. One approach 
> would be to do nothing and simply let them eventually get purged out. A 
> better approach would be to identify the failed backups and eliminate 
> them (I go to the home directory and do a "du -sh *".  This command will 
> take a while, but it will report the amount of space that each backup is 
> taking. It should be obvious which ones failed (much smaller in space). 
> You can confirmthey failed by looking at the summary files. I normally 
> delete failed backup directories (so I don't assume they are there to 
> restore from).
> 
> The next issue is disk usage. Each init does a "full" backup and does 
> not take advantage of dirvish's (rsync's) ability to simply backup 
> changes. This results in duplicate copies of files. If space is of no 
> issue, you can not worry about this (and eventually as they age out this 
> problem will fix itself). When this issue happens, I go to the directory 
> above (my tree looks like /backups/server/file_system -- so I "cd 
> /backups/server"). and then I run "hardlink file_systyem" and wait for 
> it to finish (this may take a long time also). the hardlink command will 
> identify all duplicate files and replace duplicates with hardlinks to 
> one copy of the data (essentially what Dirvish/rsync does when making 
> the backups). This will recover any "lost" space due to duplicate files.
> 
> In fact, I make it a practice to periodcally run hardlink on every vault 
> to eliminate any duplicate files as a normal part of maintenance.
> 
> Isaac Traxler
> LSU/LONI HPC Storage & Infrastructure Manager
> traxler at lsu.edu
> 
> ------------------------------------------------------------------------
> *From:* dirvish <dirvish-bounces at lists.dirvish.org> on behalf of 
> Rolf-Werner Eilert <rwe-sse at osnanet.de>
> *Sent:* Wednesday, September 8, 2021 5:17 AM
> *To:* dirvish at dirvish.org <dirvish at dirvish.org>
> *Subject:* [Dirvish] One of the vaults broken - init enough?
> I hope this list is still alive...
> 
> Some time ago I received a
> 
> 21:45:44 /usr/sbin/dirvish --vault backup-home
> backup-home:default:20210727-2145:
> /mnt/backups/backup-home/dirvish/lock_file already exists
> 
> on the /home vault. After some days I looked it up and yes, I found a
> lock file and deleted it, but it had no effect. So I decided to initiate
> the /home vault again, then let it backup (only /home) once and went
> into holidays.
> 
> Since then I get
> 
> 21:24:51 /usr/sbin/dirvish --vault backup-home
> backup-home:default:20210830-2124: ERROR: no images for branch default found
> 
> Guess I messed it up, but how can I re-initiate the /home? Of course,
> the cronjob starts dirvish-runall, so do I have to expire all of them
> first, then initiate again?
> 
> Thank you for your help!
> 
> Rolf
> 
> 
> _______________________________________________
> dirvish mailing list
> dirvish at lists.dirvish.org
> https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.dirvish.org%2Flistinfo%2Fdirvish&data=04%7C01%7Ctraxler%40lsu.edu%7Cb71e92782b944fece56d08d972b1d5b2%7C2d4dad3f50ae47d983a09ae2b1f466f8%7C0%7C0%7C637666930839365745%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=bqzuMRKhLDAtRMs8%2BjCgyi5Uc%2FXbmhzGTfhzFaIbEa8%3D&reserved=0 
> <https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.dirvish.org%2Flistinfo%2Fdirvish&data=04%7C01%7Ctraxler%40lsu.edu%7Cb71e92782b944fece56d08d972b1d5b2%7C2d4dad3f50ae47d983a09ae2b1f466f8%7C0%7C0%7C637666930839365745%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=bqzuMRKhLDAtRMs8%2BjCgyi5Uc%2FXbmhzGTfhzFaIbEa8%3D&reserved=0>
> 
> _______________________________________________
> dirvish mailing list
> dirvish at lists.dirvish.org
> https://lists.dirvish.org/listinfo/dirvish
> 




More information about the dirvish mailing list