mirror of
https://gitea.com/mcereda/oam.git
synced 2026-02-09 05:44:23 +00:00
feat: download images from a webpage with wget
This commit is contained in:
45
knowledge base/wget.md
Normal file
45
knowledge base/wget.md
Normal file
@@ -0,0 +1,45 @@
|
|||||||
|
# `wget`
|
||||||
|
|
||||||
|
## Table of contents <!-- omit in toc -->
|
||||||
|
|
||||||
|
1. [TL;DR](#tldr)
|
||||||
|
1. [Further readings](#further-readings)
|
||||||
|
1. [Sources](#sources)
|
||||||
|
|
||||||
|
## TL;DR
|
||||||
|
|
||||||
|
```sh
|
||||||
|
# Download all the pictures from a webpage.
|
||||||
|
# Limit yourself to JPG files from the domain storing them.
|
||||||
|
# Save them in a single directory.
|
||||||
|
wget 'https://www.theskyfolk.com/photo' -Hcr -D'images.squarespace-cdn.com' \
|
||||||
|
--e'robots=off' -t'3' -w'1' -A'jpg' -nc -nd -np --xattr
|
||||||
|
wget 'https://www.theskyfolk.com/photo' \
|
||||||
|
--span-hosts --continue --recursive \
|
||||||
|
--domains 'images.squarespace-cdn.com' --execute 'robots=off' \
|
||||||
|
--tries '3' --wait '1' \
|
||||||
|
--accept 'jpg' --no-clobber --no-directories --no-parent --xattr
|
||||||
|
```
|
||||||
|
|
||||||
|
## Further readings
|
||||||
|
|
||||||
|
- [Manual]
|
||||||
|
|
||||||
|
## Sources
|
||||||
|
|
||||||
|
All the references in the [further readings] section, plus the following:
|
||||||
|
|
||||||
|
- [Ský Fólk]
|
||||||
|
|
||||||
|
<!--
|
||||||
|
References
|
||||||
|
-->
|
||||||
|
|
||||||
|
<!-- Upstream -->
|
||||||
|
[manual]: https://www.gnu.org/software/wget/manual/wget.html
|
||||||
|
|
||||||
|
<!-- In-article sections -->
|
||||||
|
[further readings]: #further-readings
|
||||||
|
|
||||||
|
<!-- Others -->
|
||||||
|
[ský fólk]: https://www.theskyfolk.com
|
||||||
Reference in New Issue
Block a user