[Digikam-users] backup and data integrity

Jakob Østergaard Hegelund joe at evalesco.com
Wed Jan 16 21:53:24 GMT 2008


On Wednesday 16 January 2008, Arnd Baecker wrote:
> Hi,
>
> one important issue with digital images is the
> question of backup (eg. CD/DVD, optical media, ..
> several separate hard-disks, off-site hard-disks, ...).

I use rsync to an off-site server.

My mother uses rsync to a removable disk.

I work for a company who will be providing affordable remote backup as a 
service to end users.

> Another (maybe often over-looked?) aspect is whether
> the data (both on the master disk and the backups)
> are still correct.
> It may well happen that files just get corrupted
> on the hard-disk.

Sure, that's unlikely but not impossible.

> (I recently had such an experience, where
> fortunately an old back-up on CD allowed to recover
> the few files).
>
> So my question is:
> How do you ensure the correctness of your data?

Use proper hardware with ECC memory.

No overclocked/non-ECC systems.

> What methods are useable and could one maybe
> integrate/provide part of the needed tools
> inside digikam?
>
> One approach might be to use a hash value (eg. md5):
>
> - Digikam could compute a hash value for every
>   image and store it inside of digikam's database.
>
>   This would allow, by an additional tool, to periodically (etc.)
>   check for any possible changes (=corruption) of images.

But the application should not do this.

The reason why we use a modern OS on modern hardware is, that the OS and 
the hardware will work together to provide us with everything from 
hardware abstraction to error correction.

ZFS provides the extra layer of error detection along with error 
correction that you seek. I believe time would be better spent lobbying 
for a port of that technology to Linux (one way or another - yes I am 
aware that there are complications), rather than trying to patch in a 
little bit of that functionality into a single application (which will 
not help the general case).

Anyway, that's my take on the issue. I believe any improvement that 
would be needed, is needed on a general system-wide scale, not on a 
single-application scale.

-- 
Jakob Østergaard Hegelund



More information about the Digikam-users mailing list