%global _empty_manifest_terminate_build 0
Name:		python-BaiduImagesDownload
Version:	2.0.0
Release:	1
Summary:	Download images from Baidu Images
License:	MIT
URL:		https://github.com/YXL76/BaiduImagesDownload
Source0:	https://mirrors.nju.edu.cn/pypi/web/packages/d2/89/356b6d76558ceeac7add69a0669ee8cd4bdc795a7fd0965b3dd9983961ca/BaiduImagesDownload-2.0.0.tar.gz
BuildArch:	noarch

Requires:	python3-aiodns
Requires:	python3-aiohttp
Requires:	python3-cchardet
Requires:	python3-tqdm

%description
# BaiduImagesDownload

[![Python package](https://github.com/YXL76/BaiduImagesDownload/workflows/Python%20package/badge.svg)](https://github.com/YXL76/BaiduImagesDownload/actions)
[![codecov](https://codecov.io/gh/YXL76/BaiduImagesDownload/branch/master/graph/badge.svg)](https://codecov.io/gh/YXL76/BaiduImagesDownload)
[![Codacy Badge](https://app.codacy.com/project/badge/Grade/0dce5ee6b45f427fa5aa782907408d19)](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade)

> `BaiduImagesDownload` is a fast, simple tool for crawling Baidu Images

```python
from BaiduImagesDownload import Crawler

net, num, urls = Crawler.get_images_url('二次元', 20)
Crawler.download_images(urls)
```

Contents

- [BaiduImagesDownload](#baiduimagesdownload)
  - [Installation](#installation)
  - [Usage](#usage)
    - [Basic](#basic)
    - [Download settings](#download-settings)
  - [Documentation](#documentation)
    - [get_images_url](#get_images_url)
    - [download_images](#download_images)
    - [Logging](#logging)
  - [License](#license)

## Installation

```bash
pip install BaiduImagesDownload
```

## Usage

### Basic

```python
from BaiduImagesDownload import Crawler

# original=True means the original image is downloaded in preference
net, num, urls = Crawler.get_images_url('二次元', 20, original=True)
Crawler.download_images(urls)
```

### Download settings

```python
from BaiduImagesDownload import Crawler

# rule sets the allowed image formats, defaults to ('.png', '.jpg')
# timeout is the request timeout, defaults to 60 (s)
net, num, urls = Crawler.get_images_url('二次元', 20)
Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60)
```

## Documentation

### get_images_url

```python
class Crawler:

    @staticmethod
    def get_images_url(word: str, num: int, original: bool = True,
                       timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list):
```

Parameters

- `word: str`: search keyword
- `num: int`: number of images to search for
- `original: bool, optional`: whether to download the original image, defaults to `True`
- `timeout: int, optional`: request timeout, defaults to `60` (s)

Returns

- `net: bool`: whether the network connection succeeded; `True` on success, `False` on failure
- `num: bool`: whether the requested number of images was found; `True` if it was met, `False` if not
- `urls: list`: the URLs obtained; each item is a `dict` with two keys, `obj_url` and `from_url`. `obj_url` is the `url` of the image and `from_url` is its `Referer`

### download_images

```python
class Crawler:

    @staticmethod
    def download_images(urls: list, rule: tuple = ('.png', '.jpg'),
                        path: str = 'download',
                        timeout: int = __CONCURRENT_TIMEOUT,
                        concurrent: int = __CONCURRENT_NUM,
                        command: bool = True) -> (int, int):
```

Parameters

- `urls: list`: list of images to crawl, in the same format as returned by `get_images_url`
- `rule: tuple, optional`: formats allowed for download, defaults to `('.png', '.jpg')`
- `path: str, optional`: directory the images are downloaded to, defaults to `'download'`
- `timeout: int, optional`: request timeout, defaults to `60` (s)
- `concurrent: int, optional`: number of parallel downloads, defaults to `100`
- `command: bool, optional`: whether to show a progress bar in the console, defaults to `True`

Returns

- `success: int`: number of successful downloads
- `failed: int`: number of failed downloads

### Logging

The log level and output can be configured; see [logging](https://docs.python.org/3.8/library/logging.html) for details.

```python
import logging
import os

from BaiduImagesDownload import logger

# Set the log level to DEBUG
# The default is INFO
logger.setLevel(logging.DEBUG)

# Send output to a file; FileHandler does not expand '~' itself,
# so the path is expanded explicitly
file_handler = logging.FileHandler(
    os.path.expanduser('~/BaiduImagesDownload.log'))
# Set the output format
file_handler.setFormatter(logging.Formatter(
    '[%(asctime)s] [%(levelname)s] %(message)s'))
logger.addHandler(file_handler)
```

## License

[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE)

%package -n python3-BaiduImagesDownload
Summary:	Download images from Baidu Images
Provides:	python-BaiduImagesDownload
BuildRequires:	python3-devel
BuildRequires:	python3-setuptools
BuildRequires:	python3-pip

%description -n python3-BaiduImagesDownload
# BaiduImagesDownload

[![Python package](https://github.com/YXL76/BaiduImagesDownload/workflows/Python%20package/badge.svg)](https://github.com/YXL76/BaiduImagesDownload/actions)
[![codecov](https://codecov.io/gh/YXL76/BaiduImagesDownload/branch/master/graph/badge.svg)](https://codecov.io/gh/YXL76/BaiduImagesDownload)
[![Codacy Badge](https://app.codacy.com/project/badge/Grade/0dce5ee6b45f427fa5aa782907408d19)](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade)

> `BaiduImagesDownload` is a fast, simple tool for crawling Baidu Images

```python
from BaiduImagesDownload import Crawler

net, num, urls = Crawler.get_images_url('二次元', 20)
Crawler.download_images(urls)
```

Contents

- [BaiduImagesDownload](#baiduimagesdownload)
  - [Installation](#installation)
  - [Usage](#usage)
    - [Basic](#basic)
    - [Download settings](#download-settings)
  - [Documentation](#documentation)
    - [get_images_url](#get_images_url)
    - [download_images](#download_images)
    - [Logging](#logging)
  - [License](#license)

## Installation

```bash
pip install BaiduImagesDownload
```

## Usage

### Basic

```python
from BaiduImagesDownload import Crawler

# original=True means the original image is downloaded in preference
net, num, urls = Crawler.get_images_url('二次元', 20, original=True)
Crawler.download_images(urls)
```

### Download settings

```python
from BaiduImagesDownload import Crawler

# rule sets the allowed image formats, defaults to ('.png', '.jpg')
# timeout is the request timeout, defaults to 60 (s)
net, num, urls = Crawler.get_images_url('二次元', 20)
Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60)
```

## Documentation

### get_images_url

```python
class Crawler:

    @staticmethod
    def get_images_url(word: str, num: int, original: bool = True,
                       timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list):
```

Parameters

- `word: str`: search keyword
- `num: int`: number of images to search for
- `original: bool, optional`: whether to download the original image, defaults to `True`
- `timeout: int, optional`: request timeout, defaults to `60` (s)

Returns

- `net: bool`: whether the network connection succeeded; `True` on success, `False` on failure
- `num: bool`: whether the requested number of images was found; `True` if it was met, `False` if not
- `urls: list`: the URLs obtained; each item is a `dict` with two keys, `obj_url` and `from_url`. `obj_url` is the `url` of the image and `from_url` is its `Referer`

### download_images

```python
class Crawler:

    @staticmethod
    def download_images(urls: list, rule: tuple = ('.png', '.jpg'),
                        path: str = 'download',
                        timeout: int = __CONCURRENT_TIMEOUT,
                        concurrent: int = __CONCURRENT_NUM,
                        command: bool = True) -> (int, int):
```

Parameters

- `urls: list`: list of images to crawl, in the same format as returned by `get_images_url`
- `rule: tuple, optional`: formats allowed for download, defaults to `('.png', '.jpg')`
- `path: str, optional`: directory the images are downloaded to, defaults to `'download'`
- `timeout: int, optional`: request timeout, defaults to `60` (s)
- `concurrent: int, optional`: number of parallel downloads, defaults to `100`
- `command: bool, optional`: whether to show a progress bar in the console, defaults to `True`

Returns

- `success: int`: number of successful downloads
- `failed: int`: number of failed downloads

### Logging

The log level and output can be configured; see [logging](https://docs.python.org/3.8/library/logging.html) for details.

```python
import logging
import os

from BaiduImagesDownload import logger

# Set the log level to DEBUG
# The default is INFO
logger.setLevel(logging.DEBUG)

# Send output to a file; FileHandler does not expand '~' itself,
# so the path is expanded explicitly
file_handler = logging.FileHandler(
    os.path.expanduser('~/BaiduImagesDownload.log'))
# Set the output format
file_handler.setFormatter(logging.Formatter(
    '[%(asctime)s] [%(levelname)s] %(message)s'))
logger.addHandler(file_handler)
```

## License

[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE)

%package help
Summary:	Development documents and examples for BaiduImagesDownload
Provides:	python3-BaiduImagesDownload-doc

%description help
# BaiduImagesDownload

[![Python package](https://github.com/YXL76/BaiduImagesDownload/workflows/Python%20package/badge.svg)](https://github.com/YXL76/BaiduImagesDownload/actions)
[![codecov](https://codecov.io/gh/YXL76/BaiduImagesDownload/branch/master/graph/badge.svg)](https://codecov.io/gh/YXL76/BaiduImagesDownload)
[![Codacy Badge](https://app.codacy.com/project/badge/Grade/0dce5ee6b45f427fa5aa782907408d19)](https://www.codacy.com/manual/YXL76/BaiduImagesDownload?utm_source=github.com&utm_medium=referral&utm_content=YXL76/BaiduImagesDownload&utm_campaign=Badge_Grade)

> `BaiduImagesDownload` is a fast, simple tool for crawling Baidu Images

```python
from BaiduImagesDownload import Crawler

net, num, urls = Crawler.get_images_url('二次元', 20)
Crawler.download_images(urls)
```

Contents

- [BaiduImagesDownload](#baiduimagesdownload)
  - [Installation](#installation)
  - [Usage](#usage)
    - [Basic](#basic)
    - [Download settings](#download-settings)
  - [Documentation](#documentation)
    - [get_images_url](#get_images_url)
    - [download_images](#download_images)
    - [Logging](#logging)
  - [License](#license)

## Installation

```bash
pip install BaiduImagesDownload
```

## Usage

### Basic

```python
from BaiduImagesDownload import Crawler

# original=True means the original image is downloaded in preference
net, num, urls = Crawler.get_images_url('二次元', 20, original=True)
Crawler.download_images(urls)
```

### Download settings

```python
from BaiduImagesDownload import Crawler

# rule sets the allowed image formats, defaults to ('.png', '.jpg')
# timeout is the request timeout, defaults to 60 (s)
net, num, urls = Crawler.get_images_url('二次元', 20)
Crawler.download_images(urls, rule=('.png', '.jpg'), timeout=60)
```

## Documentation

### get_images_url

```python
class Crawler:

    @staticmethod
    def get_images_url(word: str, num: int, original: bool = True,
                       timeout: int = __CONCURRENT_TIMEOUT) -> (bool, bool, list):
```

Parameters

- `word: str`: search keyword
- `num: int`: number of images to search for
- `original: bool, optional`: whether to download the original image, defaults to `True`
- `timeout: int, optional`: request timeout, defaults to `60` (s)

Returns

- `net: bool`: whether the network connection succeeded; `True` on success, `False` on failure
- `num: bool`: whether the requested number of images was found; `True` if it was met, `False` if not
- `urls: list`: the URLs obtained; each item is a `dict` with two keys, `obj_url` and `from_url`. `obj_url` is the `url` of the image and `from_url` is its `Referer`

### download_images

```python
class Crawler:

    @staticmethod
    def download_images(urls: list, rule: tuple = ('.png', '.jpg'),
                        path: str = 'download',
                        timeout: int = __CONCURRENT_TIMEOUT,
                        concurrent: int = __CONCURRENT_NUM,
                        command: bool = True) -> (int, int):
```

Parameters

- `urls: list`: list of images to crawl, in the same format as returned by `get_images_url`
- `rule: tuple, optional`: formats allowed for download, defaults to `('.png', '.jpg')`
- `path: str, optional`: directory the images are downloaded to, defaults to `'download'`
- `timeout: int, optional`: request timeout, defaults to `60` (s)
- `concurrent: int, optional`: number of parallel downloads, defaults to `100`
- `command: bool, optional`: whether to show a progress bar in the console, defaults to `True`

Returns

- `success: int`: number of successful downloads
- `failed: int`: number of failed downloads

### Logging

The log level and output can be configured; see [logging](https://docs.python.org/3.8/library/logging.html) for details.

```python
import logging
import os

from BaiduImagesDownload import logger

# Set the log level to DEBUG
# The default is INFO
logger.setLevel(logging.DEBUG)

# Send output to a file; FileHandler does not expand '~' itself,
# so the path is expanded explicitly
file_handler = logging.FileHandler(
    os.path.expanduser('~/BaiduImagesDownload.log'))
# Set the output format
file_handler.setFormatter(logging.Formatter(
    '[%(asctime)s] [%(levelname)s] %(message)s'))
logger.addHandler(file_handler)
```

## License

[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://github.com/YXL76/BaiduImagesDownload/blob/master/LICENSE)

%prep
%autosetup -n BaiduImagesDownload-2.0.0

%build
%py3_build

%install
%py3_install
install -d -m755 %{buildroot}/%{_pkgdocdir}
if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
pushd %{buildroot}
if [ -d usr/lib ]; then
	find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/lib64 ]; then
	find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/bin ]; then
	find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/sbin ]; then
	find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
fi
touch doclist.lst
if [ -d usr/share/man ]; then
	find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
fi
popd
mv %{buildroot}/filelist.lst .
mv %{buildroot}/doclist.lst .

%files -n python3-BaiduImagesDownload -f filelist.lst
%dir %{python3_sitelib}/*

%files help -f doclist.lst
%{_docdir}/*

%changelog
* Wed May 31 2023 Python_Bot - 2.0.0-1
- Package Spec generated