author CoprDistGit <infra@openeuler.org> 2023-05-31 06:59:52 +0000
committer CoprDistGit <infra@openeuler.org> 2023-05-31 06:59:52 +0000
commit 342b69d8e6f2f039c14d04d74f61e8bba5351790
tree b18c63ad480be86b0c6ba07c3acda9bf2359731e
parent 3dc1f672420fffaf27b3913e0e8f052ab7398199
automatic import of python-baiduimagesdownload
-rw-r--r-- .gitignore 1
-rw-r--r-- python-baiduimagesdownload.spec 469
-rw-r--r-- sources 1
3 files changed, 471 insertions, 0 deletions
diff --git a/.gitignore b/.gitignore
index e69de29..af527ba 100644
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1 @@
+/BaiduImagesDownload-2.0.0.tar.gz
diff --git a/python-baiduimagesdownload.spec b/python-baiduimagesdownload.spec
new file mode 100644
index 0000000..acf1f9b
--- /dev/null
+++ b/python-baiduimagesdownload.spec
@@ -0,0 +1,469 @@
+%global _empty_manifest_terminate_build 0
+Name: python-BaiduImagesDownload
+Version: 2.0.0
+Release: 1
+Summary: Download images from Baidu Images
+License: MIT
+URL: https://github.com/YXL76/BaiduImagesDownload
+Source0: https://mirrors.nju.edu.cn/pypi/web/packages/d2/89/356b6d76558ceeac7add69a0669ee8cd4bdc795a7fd0965b3dd9983961ca/BaiduImagesDownload-2.0.0.tar.gz
+BuildArch: noarch
+
+Requires: python3-aiodns
+Requires: python3-aiohttp
+Requires: python3-cchardet
+Requires: python3-tqdm
+
+%description
+# BaiduImagesDownload
+
+[![Python package](https://github.com/YXL76/BaiduImagesDownload/workflows/Python%20package/badge.svg)](https://github.com/YXL76/BaiduImagesDownload/actions)
+[![codecov](https://codecov.io/gh/YXL76/BaiduImagesDownload/branch/master/graph/badge.svg)](https://codecov.io/gh/YXL76/BaiduImagesDownload)
+[![Codacy Badge](https://app.codacy.com/project/badge/Grade/0dce5ee6b45f427fa5aa782907408d19)](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade)
+
+> `BaiduImagesDownload` is a fast, simple tool for crawling Baidu Images
+
+```python
+from BaiduImagesDownload import Crawler
+
+net, num, urls = Crawler.get_images_url('二次元', 20)
+Crawler.download_images(urls)
+```
+
+Table of contents
+
+- [BaiduImagesDownload](#baiduimagesdownload)
+  - [Installation](#installation)
+  - [Usage](#usage)
+    - [Basic](#basic)
+    - [Download settings](#download-settings)
+  - [Documentation](#documentation)
+    - [get_images_url](#get_images_url)
+    - [download_images](#download_images)
+    - [Logging](#logging)
+  - [License](#license)
+
+## Installation
+
+```bash
+pip install BaiduImagesDownload
+```
+
+## Usage
+
+### Basic
+
+```python
+from BaiduImagesDownload import Crawler
+
+# original=True means the original (full-size) image is preferred
+net, num, urls = Crawler.get_images_url('二次元', 20, original=True)
+Crawler.download_images(urls)
+```
+
+### Download settings
+
+```python
+from BaiduImagesDownload import Crawler
+
+# rule sets the allowed image formats; the default is ('.png', '.jpg')
+# timeout is the timeout in seconds; the default is 60
+net, num, urls = Crawler.get_images_url('二次元', 20)
+Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60)
+```
+
+## Documentation
+
+### get_images_url
+
+```python
+class Crawler:
+
+    @staticmethod
+    def get_images_url(word: str, num: int, original: bool = True,
+                       timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list):
+```
+
+Parameters
+
+- `word: str`: the search keyword
+- `num: int`: the number of images to search for
+- `original: bool, optional`: whether to download the original image; defaults to `True`
+- `timeout: int, optional`: the request timeout; defaults to `60` (seconds)
+
+Returns
+
+- `net: bool`: whether the network connection succeeded (`True` on success, `False` on failure)
+- `num: bool`: whether enough images were found (`True` if the requested number was met, `False` if not)
+- `urls: list`: the URLs that were found; each item is a `dict` with two keys, `obj_url` and `from_url`. `obj_url` is the URL of the image and `from_url` is the corresponding `Referer`
+
+### download_images
+
+```python
+class Crawler:
+
+    @staticmethod
+    def download_images(urls: list, rule: tuple = ('.png', '.jpg'),
+                        path: str = 'download', timeout: int = __CONCURRENT_TIMEOUT,
+                        concurrent: int = __CONCURRENT_NUM, command: bool = True) -> (int, int):
+```
+
+Parameters
+
+- `urls: list`: the list of images to download, in the same format as returned by `get_images_url`
+- `rule: tuple, optional`: the formats allowed for download; defaults to `('.png', '.jpg')`
+- `path: str, optional`: the directory the images are downloaded to; defaults to `'download'`
+- `timeout: int, optional`: the request timeout; defaults to `60` (seconds)
+- `concurrent: int, optional`: the number of parallel downloads; defaults to `100`
+- `command: bool, optional`: whether to show a progress bar in the console; defaults to `True`
+
+Returns
+
+- `success: int`: the number of successful downloads
+- `failed: int`: the number of failed downloads
+
+### Logging
+
+The log level and output can be configured; see [logging](https://docs.python.org/3.8/library/logging.html) for details
+
+```python
+import logging
+import os
+from BaiduImagesDownload import logger
+
+# Set the log level to DEBUG (the default is INFO)
+logger.setLevel(logging.DEBUG)
+
+# Write the log to a file (FileHandler does not expand '~', so expand it explicitly)
+file_handler = logging.FileHandler(os.path.expanduser('~/BaiduImagesDownload.log'))
+file_handler.setFormatter(logging.Formatter(
+    '[%(asctime)s] [%(levelname)s] %(message)s'))  # set the output format
+logger.addHandler(file_handler)
+```
+
+## License
+
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE)
+
+
+
+
+%package -n python3-BaiduImagesDownload
+Summary: Download images from Baidu Images
+Provides: python-BaiduImagesDownload
+BuildRequires: python3-devel
+BuildRequires: python3-setuptools
+BuildRequires: python3-pip
+%description -n python3-BaiduImagesDownload
+# BaiduImagesDownload
+
+[![Python package](https://github.com/YXL76/BaiduImagesDownload/workflows/Python%20package/badge.svg)](https://github.com/YXL76/BaiduImagesDownload/actions)
+[![codecov](https://codecov.io/gh/YXL76/BaiduImagesDownload/branch/master/graph/badge.svg)](https://codecov.io/gh/YXL76/BaiduImagesDownload)
+[![Codacy Badge](https://app.codacy.com/project/badge/Grade/0dce5ee6b45f427fa5aa782907408d19)](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade)
+
+> `BaiduImagesDownload` is a fast, simple tool for crawling Baidu Images
+
+```python
+from BaiduImagesDownload import Crawler
+
+net, num, urls = Crawler.get_images_url('二次元', 20)
+Crawler.download_images(urls)
+```
+
+Table of contents
+
+- [BaiduImagesDownload](#baiduimagesdownload)
+  - [Installation](#installation)
+  - [Usage](#usage)
+    - [Basic](#basic)
+    - [Download settings](#download-settings)
+  - [Documentation](#documentation)
+    - [get_images_url](#get_images_url)
+    - [download_images](#download_images)
+    - [Logging](#logging)
+  - [License](#license)
+
+## Installation
+
+```bash
+pip install BaiduImagesDownload
+```
+
+## Usage
+
+### Basic
+
+```python
+from BaiduImagesDownload import Crawler
+
+# original=True means the original (full-size) image is preferred
+net, num, urls = Crawler.get_images_url('二次元', 20, original=True)
+Crawler.download_images(urls)
+```
+
+### Download settings
+
+```python
+from BaiduImagesDownload import Crawler
+
+# rule sets the allowed image formats; the default is ('.png', '.jpg')
+# timeout is the timeout in seconds; the default is 60
+net, num, urls = Crawler.get_images_url('二次元', 20)
+Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60)
+```
+
+## Documentation
+
+### get_images_url
+
+```python
+class Crawler:
+
+    @staticmethod
+    def get_images_url(word: str, num: int, original: bool = True,
+                       timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list):
+```
+
+Parameters
+
+- `word: str`: the search keyword
+- `num: int`: the number of images to search for
+- `original: bool, optional`: whether to download the original image; defaults to `True`
+- `timeout: int, optional`: the request timeout; defaults to `60` (seconds)
+
+Returns
+
+- `net: bool`: whether the network connection succeeded (`True` on success, `False` on failure)
+- `num: bool`: whether enough images were found (`True` if the requested number was met, `False` if not)
+- `urls: list`: the URLs that were found; each item is a `dict` with two keys, `obj_url` and `from_url`. `obj_url` is the URL of the image and `from_url` is the corresponding `Referer`
+
+### download_images
+
+```python
+class Crawler:
+
+    @staticmethod
+    def download_images(urls: list, rule: tuple = ('.png', '.jpg'),
+                        path: str = 'download', timeout: int = __CONCURRENT_TIMEOUT,
+                        concurrent: int = __CONCURRENT_NUM, command: bool = True) -> (int, int):
+```
+
+Parameters
+
+- `urls: list`: the list of images to download, in the same format as returned by `get_images_url`
+- `rule: tuple, optional`: the formats allowed for download; defaults to `('.png', '.jpg')`
+- `path: str, optional`: the directory the images are downloaded to; defaults to `'download'`
+- `timeout: int, optional`: the request timeout; defaults to `60` (seconds)
+- `concurrent: int, optional`: the number of parallel downloads; defaults to `100`
+- `command: bool, optional`: whether to show a progress bar in the console; defaults to `True`
+
+Returns
+
+- `success: int`: the number of successful downloads
+- `failed: int`: the number of failed downloads
+
+### Logging
+
+The log level and output can be configured; see [logging](https://docs.python.org/3.8/library/logging.html) for details
+
+```python
+import logging
+import os
+from BaiduImagesDownload import logger
+
+# Set the log level to DEBUG (the default is INFO)
+logger.setLevel(logging.DEBUG)
+
+# Write the log to a file (FileHandler does not expand '~', so expand it explicitly)
+file_handler = logging.FileHandler(os.path.expanduser('~/BaiduImagesDownload.log'))
+file_handler.setFormatter(logging.Formatter(
+    '[%(asctime)s] [%(levelname)s] %(message)s'))  # set the output format
+logger.addHandler(file_handler)
+```
+
+## License
+
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE)
+
+
+
+
+%package help
+Summary: Development documents and examples for BaiduImagesDownload
+Provides: python3-BaiduImagesDownload-doc
+%description help
+# BaiduImagesDownload
+
+[![Python package](https://github.com/YXL76/BaiduImagesDownload/workflows/Python%20package/badge.svg)](https://github.com/YXL76/BaiduImagesDownload/actions)
+[![codecov](https://codecov.io/gh/YXL76/BaiduImagesDownload/branch/master/graph/badge.svg)](https://codecov.io/gh/YXL76/BaiduImagesDownload)
+[![Codacy Badge](https://app.codacy.com/project/badge/Grade/0dce5ee6b45f427fa5aa782907408d19)](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade)
+
+> `BaiduImagesDownload` is a fast, simple tool for crawling Baidu Images
+
+```python
+from BaiduImagesDownload import Crawler
+
+net, num, urls = Crawler.get_images_url('二次元', 20)
+Crawler.download_images(urls)
+```
+
+Table of contents
+
+- [BaiduImagesDownload](#baiduimagesdownload)
+  - [Installation](#installation)
+  - [Usage](#usage)
+    - [Basic](#basic)
+    - [Download settings](#download-settings)
+  - [Documentation](#documentation)
+    - [get_images_url](#get_images_url)
+    - [download_images](#download_images)
+    - [Logging](#logging)
+  - [License](#license)
+
+## Installation
+
+```bash
+pip install BaiduImagesDownload
+```
+
+## Usage
+
+### Basic
+
+```python
+from BaiduImagesDownload import Crawler
+
+# original=True means the original (full-size) image is preferred
+net, num, urls = Crawler.get_images_url('二次元', 20, original=True)
+Crawler.download_images(urls)
+```
+
+### Download settings
+
+```python
+from BaiduImagesDownload import Crawler
+
+# rule sets the allowed image formats; the default is ('.png', '.jpg')
+# timeout is the timeout in seconds; the default is 60
+net, num, urls = Crawler.get_images_url('二次元', 20)
+Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60)
+```
+
+## Documentation
+
+### get_images_url
+
+```python
+class Crawler:
+
+    @staticmethod
+    def get_images_url(word: str, num: int, original: bool = True,
+                       timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list):
+```
+
+Parameters
+
+- `word: str`: the search keyword
+- `num: int`: the number of images to search for
+- `original: bool, optional`: whether to download the original image; defaults to `True`
+- `timeout: int, optional`: the request timeout; defaults to `60` (seconds)
+
+Returns
+
+- `net: bool`: whether the network connection succeeded (`True` on success, `False` on failure)
+- `num: bool`: whether enough images were found (`True` if the requested number was met, `False` if not)
+- `urls: list`: the URLs that were found; each item is a `dict` with two keys, `obj_url` and `from_url`. `obj_url` is the URL of the image and `from_url` is the corresponding `Referer`
+
+### download_images
+
+```python
+class Crawler:
+
+    @staticmethod
+    def download_images(urls: list, rule: tuple = ('.png', '.jpg'),
+                        path: str = 'download', timeout: int = __CONCURRENT_TIMEOUT,
+                        concurrent: int = __CONCURRENT_NUM, command: bool = True) -> (int, int):
+```
+
+Parameters
+
+- `urls: list`: the list of images to download, in the same format as returned by `get_images_url`
+- `rule: tuple, optional`: the formats allowed for download; defaults to `('.png', '.jpg')`
+- `path: str, optional`: the directory the images are downloaded to; defaults to `'download'`
+- `timeout: int, optional`: the request timeout; defaults to `60` (seconds)
+- `concurrent: int, optional`: the number of parallel downloads; defaults to `100`
+- `command: bool, optional`: whether to show a progress bar in the console; defaults to `True`
+
+Returns
+
+- `success: int`: the number of successful downloads
+- `failed: int`: the number of failed downloads
+
+### Logging
+
+The log level and output can be configured; see [logging](https://docs.python.org/3.8/library/logging.html) for details
+
+```python
+import logging
+import os
+from BaiduImagesDownload import logger
+
+# Set the log level to DEBUG (the default is INFO)
+logger.setLevel(logging.DEBUG)
+
+# Write the log to a file (FileHandler does not expand '~', so expand it explicitly)
+file_handler = logging.FileHandler(os.path.expanduser('~/BaiduImagesDownload.log'))
+file_handler.setFormatter(logging.Formatter(
+    '[%(asctime)s] [%(levelname)s] %(message)s'))  # set the output format
+logger.addHandler(file_handler)
+```
+
+## License
+
+[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE)
+
+
+
+
+%prep
+%autosetup -n BaiduImagesDownload-2.0.0
+
+%build
+%py3_build
+
+%install
+%py3_install
+install -d -m755 %{buildroot}/%{_pkgdocdir}
+if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
+if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
+if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
+if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
+pushd %{buildroot}
+if [ -d usr/lib ]; then
+ find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/lib64 ]; then
+ find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/bin ]; then
+ find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/sbin ]; then
+ find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+touch doclist.lst
+if [ -d usr/share/man ]; then
+ find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
+fi
+popd
+mv %{buildroot}/filelist.lst .
+mv %{buildroot}/doclist.lst .
+
+%files -n python3-BaiduImagesDownload -f filelist.lst
+%dir %{python3_sitelib}/*
+
+%files help -f doclist.lst
+%{_docdir}/*
+
+%changelog
+* Wed May 31 2023 Python_Bot <Python_Bot@openeuler.org> - 2.0.0-1
+- Package Spec generated
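
Note on the `%install` section of the spec above: it builds `filelist.lst` by running `find <dir> -type f -printf "/%h/%f\n"` over `usr/lib`, `usr/lib64`, `usr/bin` and `usr/sbin` inside the buildroot. The Python sketch below mirrors that logic for illustration only; the function name `list_installed_files` is hypothetical and is not part of the spec or of any RPM tooling, and the order of the returned entries may differ from what `find` prints.

```python
import os
from pathlib import Path
from typing import Iterable, List


def list_installed_files(buildroot: Path, subdirs: Iterable[str]) -> List[str]:
    """Collect '/usr/...' style paths for regular files under buildroot/<subdir>."""
    entries: List[str] = []
    for sub in subdirs:
        top = buildroot / sub
        if not top.is_dir():
            # Same guard as the shell's `if [ -d usr/lib ]; then ... fi`
            continue
        for dirpath, _dirnames, filenames in os.walk(top):
            for name in filenames:
                full = Path(dirpath) / name
                # `find -type f` matches regular files only, not symlinks
                if full.is_file() and not full.is_symlink():
                    entries.append("/" + full.relative_to(buildroot).as_posix())
    return entries
```

Writing the result to disk with `Path("filelist.lst").write_text("\n".join(entries) + "\n")` would give a file in the shape that `%files -f filelist.lst` reads, one path per line.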
diff --git a/sources b/sources
new file mode 100644
index 0000000..1ca38dd
--- /dev/null
+++ b/sources
@@ -0,0 +1 @@
+9b67a21a90a930c303597f9c2aca4a26 BaiduImagesDownload-2.0.0.tar.gz
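
Note on the `sources` entry above: it pairs a 32-hex-digit value with the tarball name, which has the shape of an MD5 digest followed by a file name. Assuming it is MD5, a downloaded tarball could be checked against that line roughly as follows. The helper names (`parse_sources_line`, `md5_of`, `verify`) are hypothetical and chosen for this sketch; they are not part of any openEuler tooling.

```python
import hashlib
from pathlib import Path
from typing import Tuple


def parse_sources_line(line: str) -> Tuple[str, str]:
    """Split a '<hex digest> <file name>' line into (digest, name)."""
    digest, name = line.split(maxsplit=1)
    return digest.lower(), name.strip()


def md5_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    h = hashlib.md5()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def verify(line: str, directory: Path) -> bool:
    """Check the file named on a sources line against its recorded digest."""
    digest, name = parse_sources_line(line)
    return md5_of(directory / name) == digest
```

For example, `verify("9b67a21a90a930c303597f9c2aca4a26 BaiduImagesDownload-2.0.0.tar.gz", Path("."))` would return `True` only if the tarball in the current directory hashes to the recorded value.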