diff options
author | CoprDistGit <infra@openeuler.org> | 2023-05-31 06:59:52 +0000 |
---|---|---|
committer | CoprDistGit <infra@openeuler.org> | 2023-05-31 06:59:52 +0000 |
commit | 342b69d8e6f2f039c14d04d74f61e8bba5351790 (patch) | |
tree | b18c63ad480be86b0c6ba07c3acda9bf2359731e | |
parent | 3dc1f672420fffaf27b3913e0e8f052ab7398199 (diff) |
automatic import of python-baiduimagesdownload
-rw-r--r-- | .gitignore | 1 | ||||
-rw-r--r-- | python-baiduimagesdownload.spec | 469 | ||||
-rw-r--r-- | sources | 1 |
3 files changed, 471 insertions, 0 deletions
@@ -0,0 +1 @@ +/BaiduImagesDownload-2.0.0.tar.gz diff --git a/python-baiduimagesdownload.spec b/python-baiduimagesdownload.spec new file mode 100644 index 0000000..acf1f9b --- /dev/null +++ b/python-baiduimagesdownload.spec @@ -0,0 +1,469 @@ +%global _empty_manifest_terminate_build 0 +Name: python-BaiduImagesDownload +Version: 2.0.0 +Release: 1 +Summary: download image from Baidu Image +License: MIT +URL: https://github.com/YXL76/BaiduImagesDownload +Source0: https://mirrors.nju.edu.cn/pypi/web/packages/d2/89/356b6d76558ceeac7add69a0669ee8cd4bdc795a7fd0965b3dd9983961ca/BaiduImagesDownload-2.0.0.tar.gz +BuildArch: noarch + +Requires: python3-aiodns +Requires: python3-aiohttp +Requires: python3-cchardet +Requires: python3-tqdm + +%description +# BaiduImagesDownload + +[](https://github.com/YXL76/BaiduImagesDownload/actions) +[](https://codecov.io/gh/YXL76/BaiduImagesDownload) +[](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade) + +> `BaiduImagesDownload`是一个快速、简单百度图片爬取工具 + +```python +from BaiduImagesDownload import Crawler + +net, num, urls = Crawler.get_images_url('二次元', 20) +Crawler.download_images(urls) +``` + +目录 + +- [BaiduImagesDownload](#baiduimagesdownload) + - [安装](#安装) + - [使用](#使用) + - [基本](#基本) + - [下载设置](#下载设置) + - [文档](#文档) + - [get_images_url](#get_images_url) + - [download_images](#download_images) + - [日志](#日志) + - [许可](#许可) + +## 安装 + +```bash +pip install BaiduImagesDownload +``` + +## 使用 + +### 基本 + +```python +from BaiduImagesDownload import Crawler + +# original为True代表优先下载原图 +net, num, urls = Crawler.get_images_url('二次元', 20, original=True) +Crawler.download_images(urls) +``` + +### 下载设置 + +```python +from BaiduImagesDownload import Crawler + +# rule设置允许的图片格式,默认为('.png', '.jpg') +# timeout为超时时间,默认为60(s) +net, num, urls = Crawler.get_images_url('二次元', 20) +Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60) +``` + +## 文档 + +### get_images_url + +```python +class Crawler: + + @staticmethod + def get_images_url(word: str, num: int, original: bool = True, + timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list): +``` + +参数 + +- `word: str`: 搜索关键词 +- `num: int`: 搜索数量 +- `original: bool, optional`:是否下原图,默认为`True` +- `timeout: int, optional`: 请求 timeout, 默认为`60(s)` + +返回 + +- `net: bool`: 网络连接是否成功,成功为 True,失败为 False +- `num: bool`: 图片数量是否满足,满足为 True,不足为 False +- `urls: list`: 获取的 urls,每项为一个`dict`,其中有两个键`obj_url`,`from_url`。`obj_url`为对应图片的`url`,`from_url`为`Referer` + +### download_images + +```python +class Crawler: + + @staticmethod + def download_images(urls: list, rule: tuple = ('.png', '.jpg'), + path: str = 'download', timeout: int = __CONCURRENT_TIMEOUT, + concurrent: int = __CONCURRENT_NUM, command: bool = True) -> (int, int): +``` + +参数 + +- `urls: list`: 需要爬的图片列表,格式与`get_images_url`返回的相同 +- `rule: tuple, optional`: 允许下载的格式,默认为`('.png', '.jpg')` +- `path: str, optional`: 图片下载的路径,默认为`'download'` +- `timeout: int, optional`: 请求 timeout, 默认为`60(s)` +- `concurrent: int, optional`: 并行下载的数量,默认为`100` +- `command: bool, optional`: 是否在控制台显示进度条,默认为`True` + +返回 + +- `success: int`: 下载成功的数量 +- `failed: int`: 下载失败的数量 + +### 日志 + +可以设置日志的等级以及输出,具体请查看[logging](https://docs.python.org/3.8/library/logging.html) + +```python +import logging +from BaiduImagesDownload import logger + +# 设置日志的等级为DEBUG +# 默认为INFO +logger.setLevel(logging.DEBUG) + +# 设置输出到文件 +file_handler = logging.FileHandler('~/BaiduImagesDownload.log') +file_handler.setFormatter(logging.Formatter( + '[%(asctime)s] [%(levelname)s] %(message)s')) # 设置输出格式 +logger.addHandler(file_handler) +``` + +## 许可 + +[](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE) + + + + +%package -n python3-BaiduImagesDownload +Summary: download image from Baidu Image +Provides: python-BaiduImagesDownload +BuildRequires: python3-devel +BuildRequires: python3-setuptools +BuildRequires: python3-pip +%description -n python3-BaiduImagesDownload +# BaiduImagesDownload + +[](https://github.com/YXL76/BaiduImagesDownload/actions) +[](https://codecov.io/gh/YXL76/BaiduImagesDownload) +[](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade) + +> `BaiduImagesDownload`是一个快速、简单百度图片爬取工具 + +```python +from BaiduImagesDownload import Crawler + +net, num, urls = Crawler.get_images_url('二次元', 20) +Crawler.download_images(urls) +``` + +目录 + +- [BaiduImagesDownload](#baiduimagesdownload) + - [安装](#安装) + - [使用](#使用) + - [基本](#基本) + - [下载设置](#下载设置) + - [文档](#文档) + - [get_images_url](#get_images_url) + - [download_images](#download_images) + - [日志](#日志) + - [许可](#许可) + +## 安装 + +```bash +pip install BaiduImagesDownload +``` + +## 使用 + +### 基本 + +```python +from BaiduImagesDownload import Crawler + +# original为True代表优先下载原图 +net, num, urls = Crawler.get_images_url('二次元', 20, original=True) +Crawler.download_images(urls) +``` + +### 下载设置 + +```python +from BaiduImagesDownload import Crawler + +# rule设置允许的图片格式,默认为('.png', '.jpg') +# timeout为超时时间,默认为60(s) +net, num, urls = Crawler.get_images_url('二次元', 20) +Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60) +``` + +## 文档 + +### get_images_url + +```python +class Crawler: + + @staticmethod + def get_images_url(word: str, num: int, original: bool = True, + timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list): +``` + +参数 + +- `word: str`: 搜索关键词 +- `num: int`: 搜索数量 +- `original: bool, optional`:是否下原图,默认为`True` +- `timeout: int, optional`: 请求 timeout, 默认为`60(s)` + +返回 + +- `net: bool`: 网络连接是否成功,成功为 True,失败为 False +- `num: bool`: 图片数量是否满足,满足为 True,不足为 False +- `urls: list`: 获取的 urls,每项为一个`dict`,其中有两个键`obj_url`,`from_url`。`obj_url`为对应图片的`url`,`from_url`为`Referer` + +### download_images + +```python +class Crawler: + + @staticmethod + def download_images(urls: list, rule: tuple = ('.png', '.jpg'), + path: str = 'download', timeout: int = __CONCURRENT_TIMEOUT, + concurrent: int = __CONCURRENT_NUM, command: bool = True) -> (int, int): +``` + +参数 + +- `urls: list`: 需要爬的图片列表,格式与`get_images_url`返回的相同 +- `rule: tuple, optional`: 允许下载的格式,默认为`('.png', '.jpg')` +- `path: str, optional`: 图片下载的路径,默认为`'download'` +- `timeout: int, optional`: 请求 timeout, 默认为`60(s)` +- `concurrent: int, optional`: 并行下载的数量,默认为`100` +- `command: bool, optional`: 是否在控制台显示进度条,默认为`True` + +返回 + +- `success: int`: 下载成功的数量 +- `failed: int`: 下载失败的数量 + +### 日志 + +可以设置日志的等级以及输出,具体请查看[logging](https://docs.python.org/3.8/library/logging.html) + +```python +import logging +from BaiduImagesDownload import logger + +# 设置日志的等级为DEBUG +# 默认为INFO +logger.setLevel(logging.DEBUG) + +# 设置输出到文件 +file_handler = logging.FileHandler('~/BaiduImagesDownload.log') +file_handler.setFormatter(logging.Formatter( + '[%(asctime)s] [%(levelname)s] %(message)s')) # 设置输出格式 +logger.addHandler(file_handler) +``` + +## 许可 + +[](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE) + + + + +%package help +Summary: Development documents and examples for BaiduImagesDownload +Provides: python3-BaiduImagesDownload-doc +%description help +# BaiduImagesDownload + +[](https://github.com/YXL76/BaiduImagesDownload/actions) +[](https://codecov.io/gh/YXL76/BaiduImagesDownload) +[](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade) + +> `BaiduImagesDownload`是一个快速、简单百度图片爬取工具 + +```python +from BaiduImagesDownload import Crawler + +net, num, urls = Crawler.get_images_url('二次元', 20) +Crawler.download_images(urls) +``` + +目录 + +- [BaiduImagesDownload](#baiduimagesdownload) + - [安装](#安装) + - [使用](#使用) + - [基本](#基本) + - [下载设置](#下载设置) + - [文档](#文档) + - [get_images_url](#get_images_url) + - [download_images](#download_images) + - [日志](#日志) + - [许可](#许可) + +## 安装 + +```bash +pip install BaiduImagesDownload +``` + +## 使用 + +### 基本 + +```python +from BaiduImagesDownload import Crawler + +# original为True代表优先下载原图 +net, num, urls = Crawler.get_images_url('二次元', 20, original=True) +Crawler.download_images(urls) +``` + +### 下载设置 + +```python +from BaiduImagesDownload import Crawler + +# rule设置允许的图片格式,默认为('.png', '.jpg') +# timeout为超时时间,默认为60(s) +net, num, urls = Crawler.get_images_url('二次元', 20) +Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60) +``` + +## 文档 + +### get_images_url + +```python +class Crawler: + + @staticmethod + def get_images_url(word: str, num: int, original: bool = True, + timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list): +``` + +参数 + +- `word: str`: 搜索关键词 +- `num: int`: 搜索数量 +- `original: bool, optional`:是否下原图,默认为`True` +- `timeout: int, optional`: 请求 timeout, 默认为`60(s)` + +返回 + +- `net: bool`: 网络连接是否成功,成功为 True,失败为 False +- `num: bool`: 图片数量是否满足,满足为 True,不足为 False +- `urls: list`: 获取的 urls,每项为一个`dict`,其中有两个键`obj_url`,`from_url`。`obj_url`为对应图片的`url`,`from_url`为`Referer` + +### download_images + +```python +class Crawler: + + @staticmethod + def download_images(urls: list, rule: tuple = ('.png', '.jpg'), + path: str = 'download', timeout: int = __CONCURRENT_TIMEOUT, + concurrent: int = __CONCURRENT_NUM, command: bool = True) -> (int, int): +``` + +参数 + +- `urls: list`: 需要爬的图片列表,格式与`get_images_url`返回的相同 +- `rule: tuple, optional`: 允许下载的格式,默认为`('.png', '.jpg')` +- `path: str, optional`: 图片下载的路径,默认为`'download'` +- `timeout: int, optional`: 请求 timeout, 默认为`60(s)` +- `concurrent: int, optional`: 并行下载的数量,默认为`100` +- `command: bool, optional`: 是否在控制台显示进度条,默认为`True` + +返回 + +- `success: int`: 下载成功的数量 +- `failed: int`: 下载失败的数量 + +### 日志 + +可以设置日志的等级以及输出,具体请查看[logging](https://docs.python.org/3.8/library/logging.html) + +```python +import logging +from BaiduImagesDownload import logger + +# 设置日志的等级为DEBUG +# 默认为INFO +logger.setLevel(logging.DEBUG) + +# 设置输出到文件 +file_handler = logging.FileHandler('~/BaiduImagesDownload.log') +file_handler.setFormatter(logging.Formatter( + '[%(asctime)s] [%(levelname)s] %(message)s')) # 设置输出格式 +logger.addHandler(file_handler) +``` + +## 许可 + +[](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE) + + + + +%prep +%autosetup -n BaiduImagesDownload-2.0.0 + +%build +%py3_build + +%install +%py3_install +install -d -m755 %{buildroot}/%{_pkgdocdir} +if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi +if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi +if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi +if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi +pushd %{buildroot} +if [ -d usr/lib ]; then + find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/lib64 ]; then + find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/bin ]; then + find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/sbin ]; then + find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst +fi +touch doclist.lst +if [ -d usr/share/man ]; then + find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst +fi +popd +mv %{buildroot}/filelist.lst . +mv %{buildroot}/doclist.lst . + +%files -n python3-BaiduImagesDownload -f filelist.lst +%dir %{python3_sitelib}/* + +%files help -f doclist.lst +%{_docdir}/* + +%changelog +* Wed May 31 2023 Python_Bot <Python_Bot@openeuler.org> - 2.0.0-1 +- Package Spec generated @@ -0,0 +1 @@ +9b67a21a90a930c303597f9c2aca4a26 BaiduImagesDownload-2.0.0.tar.gz |