borg analyze

borg [common options] analyze [options]

Common options

Archive filters — Archive filters can be applied to repository targets.

-a PATTERN, --match-archives PATTERN

only consider archives matching all patterns. see “borg help match-archives”.

--sort-by KEYS

Comma-separated list of sorting keys; valid keys are: timestamp, archive, name, id, tags, host, user; default is: timestamp

--first N

consider first N archives after other filters were applied

--last N

consider last N archives after other filters were applied

--oldest TIMESPAN

consider archives between the oldest archive’s timestamp and (oldest + TIMESPAN), e.g. 7d or 12m.

--newest TIMESPAN

consider archives between the newest archive’s timestamp and (newest - TIMESPAN), e.g. 7d or 12m.

--older TIMESPAN

consider archives older than (now - TIMESPAN), e.g. 7d or 12m.

--newer TIMESPAN

consider archives newer than (now - TIMESPAN), e.g. 7d or 12m.

Description

Analyze archives to find “hot spots”.

Borg analyze relies on the usual archive matching options to select the archives that should be considered for analysis (e.g. -a series_name). Then it iterates over all matching archives, over all contained files and collects information about chunks stored in all directories it encountered.

It considers chunk IDs and their plaintext sizes (we don’t have the compressed size in the repository easily available) and adds up added/removed chunks’ sizes per direct parent directory and outputs a list of “directory: size”.

You can use that list to find directories with a lot of “activity” - maybe some of these are temporary or cache directories you did forget to exclude.

To not have these unwanted directories in your backups, you could carefully exclude these in borg create (for future backups) or use borg recreate to re-create existing archives without these.