- parallel: Utilize your multiple {core,cpu}s!
- fdupes: "Finds duplicate files in a given set of directories"
- s3cmd sync: Like rsync, but with S3 as the file/dir destination
I used parallel to speed up batches of S3 operations. Each request has noticeable latency, so a large batch run one request at a time takes a long time.
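To illustrate why running requests concurrently helps when latency dominates, here is a minimal Python sketch of the same idea. It is not how parallel itself works internally, and `fake_s3_request` and `run_batch` are hypothetical names invented for this example; the sleep simply stands in for the round trip of one S3 operation.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_s3_request(key, latency=0.05):
    """Hypothetical stand-in for one S3 operation; sleeps to mimic network latency."""
    time.sleep(latency)
    return f"done:{key}"


def run_batch(keys, worker=fake_s3_request, jobs=8):
    """Apply worker to every key with up to `jobs` requests in flight.

    Results are returned in the same order as the input keys.
    """
    with ThreadPoolExecutor(max_workers=jobs) as pool:
        return list(pool.map(worker, keys))


if __name__ == "__main__":
    keys = [f"file{i}.dat" for i in range(16)]
    start = time.perf_counter()
    results = run_batch(keys)
    elapsed = time.perf_counter() - start
    print(f"{len(results)} requests finished in {elapsed:.2f}s")
```

With 16 simulated requests and 8 workers, the batch should finish in roughly two latency periods rather than sixteen, which is the kind of gain I was after with parallel.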
Note that s3cmd sync will not copy symbolic links to S3, but it will copy hard links, so the same content gets uploaded once per link. fdupes -H treats hard-linked files as duplicates, so it can be used to find these redundant files and avoid uploading them.
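To make the hard-link case concrete, here is a rough Python sketch that groups paths sharing the same inode under a directory tree. It only detects hard links (same device and inode number) and does not compare file contents the way fdupes does; `find_hard_link_groups` is a hypothetical helper written for this example.

```python
import os
import stat
from collections import defaultdict


def find_hard_link_groups(root):
    """Return lists of paths under root that are hard links to the same file."""
    groups = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            st = os.lstat(path)  # lstat so symbolic links are not followed
            # Only regular files with more than one link can be hard-link duplicates.
            if stat.S_ISREG(st.st_mode) and st.st_nlink > 1:
                groups[(st.st_dev, st.st_ino)].append(path)
    # Keep only inodes reachable by at least two paths inside this tree.
    return [sorted(paths) for paths in groups.values() if len(paths) > 1]


if __name__ == "__main__":
    for group in find_hard_link_groups("."):
        print(" ".join(group))
```

Every path after the first in each group is a candidate to leave out of the upload.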