diff --git a/knowledge base/wget.md b/knowledge base/wget.md new file mode 100644 index 0000000..05fdbe1 --- /dev/null +++ b/knowledge base/wget.md @@ -0,0 +1,45 @@ +# `wget` + +## Table of contents + +1. [TL;DR](#tldr) +1. [Further readings](#further-readings) +1. [Sources](#sources) + +## TL;DR + +```sh +# Download all the pictures from a webpage. +# Limit yourself to JPG files from the domain storing them. +# Save them in a single directory. +wget 'https://www.theskyfolk.com/photo' -Hcr -D'images.squarespace-cdn.com' \ + --e'robots=off' -t'3' -w'1' -A'jpg' -nc -nd -np --xattr +wget 'https://www.theskyfolk.com/photo' \ + --span-hosts --continue --recursive \ + --domains 'images.squarespace-cdn.com' --execute 'robots=off' \ + --tries '3' --wait '1' \ + --accept 'jpg' --no-clobber --no-directories --no-parent --xattr +``` + +## Further readings + +- [Manual] + +## Sources + +All the references in the [further readings] section, plus the following: + +- [Ský Fólk] + + + + +[manual]: https://www.gnu.org/software/wget/manual/wget.html + + +[further readings]: #further-readings + + +[ský fólk]: https://www.theskyfolk.com