automatic import of python-facebook-scraper

author: CoprDistGit <infra@openeuler.org> 2023-04-11 14:38:34 +0000
committer: CoprDistGit <infra@openeuler.org> 2023-04-11 14:38:34 +0000
commit: 2fa72953076f4a52c48b739a1adaadaa8e537656 (patch)
tree: d9539fc33efcff7a7bbfd2f3b54867cba455473d /python-facebook-scraper.spec
parent: 02e856558ea810aceda1d582e0a30b9a577fff9b (diff)
1 files changed, 752 insertions, 0 deletions
diff --git a/python-facebook-scraper.spec b/python-facebook-scraper.spec
new file mode 100644
index 0000000..3f3805f
--- /dev/null
+++ b/python-facebook-scraper.spec
@@ -0,0 +1,752 @@
+%global _empty_manifest_terminate_build 0
+Name:		python-facebook-scraper
+Version:	0.2.59
+Release:	1
+Summary:	Scrape Facebook public pages without an API key
+License:	MIT
+URL:		https://github.com/kevinzg/facebook-scraper
+Source0:	https://mirrors.nju.edu.cn/pypi/web/packages/b7/4f/2011a76880e16fced40e24d34a09afacfec236d7b339177ad3b6ce31ce24/facebook-scraper-0.2.59.tar.gz
+BuildArch:	noarch
+
+Requires:	python3-requests-html
+Requires:	python3-youtube_dl
+Requires:	python3-browser-cookie3
+Requires:	python3-dateparser
+Requires:	python3-demjson3
+
+%description
+# Facebook Scraper
+
+[![PyPI download month](https://img.shields.io/pypi/dm/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI download week](https://img.shields.io/pypi/dw/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI download day](https://img.shields.io/pypi/dd/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+
+[![PyPI version](https://img.shields.io/pypi/v/facebook-scraper?color=blue)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI pyversions](https://img.shields.io/pypi/pyversions/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![GitHub commits since tagged version](https://img.shields.io/github/commits-since/kevinzg/facebook-scraper/v0.2.59)](https://github.com/kevinzg/facebook-scraper/commits/)
+
+[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
+
+
+Scrape Facebook public pages without an API key. Inspired by [twitter-scraper](https://github.com/kennethreitz/twitter-scraper).
+
+
+## Install
+
+To install the latest release from PyPI:
+
+```sh
+pip install facebook-scraper
+```
+
+Or, to install the latest master branch:
+
+```sh
+pip install git+https://github.com/kevinzg/facebook-scraper.git
+```
+
+## Usage
+
+Send the unique **page name, profile name, or ID** as the first parameter and you're good to go:
+
+```python
+>>> from facebook_scraper import get_posts
+
+>>> for post in get_posts('nintendo', pages=1):
+...     print(post['text'][:50])
+...
+The final step on the road to the Super Smash Bros
+We’re headed to PAX East 3/28-3/31 with new games
+```
+
+
+### Optional parameters
+
+*(For the `get_posts` function)*.
+
+- **group**: group id, to scrape groups instead of pages. Default is `None`.
+- **pages**: how many pages of posts to request, the first 2 pages may have no results, so try with a number greater than 2. Default is 10.
+- **timeout**: how many seconds to wait before timing out. Default is 30.
+- **credentials**: tuple of user and password to login before requesting the posts. Default is `None`.
+- **extra_info**: bool, if true the function will try to do an extra request to get the post reactions. Default is False.
+- **youtube_dl**: bool, use Youtube-DL for (high-quality) video extraction. You need to have youtube-dl installed on your environment. Default is False.
+- **post_urls**: list, URLs or post IDs to extract posts from. Alternative to fetching based on username.
+- **cookies**: One of:
+  - The path to a file containing cookies in Netscape or JSON format. You can extract cookies from your browser after logging into Facebook with an extension like [Get Cookies.txt (Chrome)](https://chrome.google.com/webstore/detail/get-cookiestxt/bgaddhkoddajcdgocldbbfleckgcbcid?hl=en) or [Cookie Quick Manager (Firefox)](https://addons.mozilla.org/en-US/firefox/addon/cookie-quick-manager/). Make sure that you include both the c_user cookie and the xs cookie, you will get an InvalidCookies exception if you don't.
+  - A [CookieJar](https://docs.python.org/3.9/library/http.cookiejar.html#http.cookiejar.CookieJar)
+  - A dictionary that can be converted to a CookieJar with [cookiejar_from_dict](https://2.python-requests.org/en/master/api/#requests.cookies.cookiejar_from_dict)
+  - The string `"from_browser"` to try extract Facebook cookies from your browser
+- **options**: Dictionary of options. Set `options={"comments": True}` to extract comments, set `options={"reactors": True}` to extract the people reacting to the post.
+Both `comments` and `reactors` can also be set to a number to set a limit for the amount of comments/reactors to retrieve.
+Set `options={"progress": True}` to get a `tqdm` progress bar while extracting comments and replies.
+Set `options={"allow_extra_requests": False}` to disable making extra requests when extracting post data (required for some things like full text and image links).
+Set `options={"posts_per_page": 200}` to request 200 posts per page. The default is 4.
+
+## CLI usage
+
+```sh
+$ facebook-scraper --filename nintendo_page_posts.csv --pages 10 nintendo
+```
+
+Run `facebook-scraper --help` for more details on CLI usage.
+
+**Note:** If you get a `UnicodeEncodeError` try adding `--encoding utf-8`.
+
+
+## Post example
+
+```python
+{'available': True,
+ 'comments': 459,
+ 'comments_full': None,
+ 'factcheck': None,
+ 'fetched_time': datetime.datetime(2021, 4, 20, 13, 39, 53, 651417),
+ 'image': 'https://scontent.fhlz2-1.fna.fbcdn.net/v/t1.6435-9/fr/cp0/e15/q65/58745049_2257182057699568_1761478225390731264_n.jpg?_nc_cat=111&ccb=1-3&_nc_sid=8024bb&_nc_ohc=ygH2fPmfQpAAX92ABYY&_nc_ht=scontent.fhlz2-1.fna&tp=14&oh=7a8a7b4904deb55ec696ae255fff97dd&oe=60A36717',
+ 'images': ['https://scontent.fhlz2-1.fna.fbcdn.net/v/t1.6435-9/fr/cp0/e15/q65/58745049_2257182057699568_1761478225390731264_n.jpg?_nc_cat=111&ccb=1-3&_nc_sid=8024bb&_nc_ohc=ygH2fPmfQpAAX92ABYY&_nc_ht=scontent.fhlz2-1.fna&tp=14&oh=7a8a7b4904deb55ec696ae255fff97dd&oe=60A36717'],
+ 'is_live': False,
+ 'likes': 3509,
+ 'link': 'https://www.nintendo.com/amiibo/line-up/',
+ 'post_id': '2257188721032235',
+ 'post_text': 'Don’t let this diminutive version of the Hero of Time fool you, '
+              'Young Link is just as heroic as his fully grown version! Young '
+              'Link joins the Super Smash Bros. series of amiibo figures!\n'
+              '\n'
+              'https://www.nintendo.com/amiibo/line-up/',
+ 'post_url': 'https://facebook.com/story.php?story_fbid=2257188721032235&id=119240841493711',
+ 'reactions': {'haha': 22, 'like': 2657, 'love': 706, 'sorry': 1, 'wow': 123}, # if `extra_info` was set
+ 'reactors': None,
+ 'shared_post_id': None,
+ 'shared_post_url': None,
+ 'shared_text': '',
+ 'shared_time': None,
+ 'shared_user_id': None,
+ 'shared_username': None,
+ 'shares': 441,
+ 'text': 'Don’t let this diminutive version of the Hero of Time fool you, '
+         'Young Link is just as heroic as his fully grown version! Young Link '
+         'joins the Super Smash Bros. series of amiibo figures!\n'
+         '\n'
+         'https://www.nintendo.com/amiibo/line-up/',
+ 'time': datetime.datetime(2019, 4, 30, 5, 0, 1),
+ 'user_id': '119240841493711',
+ 'username': 'Nintendo',
+ 'video': None,
+ 'video_id': None,
+ 'video_thumbnail': None,
+ 'w3_fb_url': 'https://www.facebook.com/Nintendo/posts/2257188721032235'}
+```
+
+
+### Notes
+
+- There is no guarantee that every field will be extracted (they might be `None`).
+- Group posts may be missing some fields like `time` and `post_url`.
+- Group scraping may return only one page and not work on private groups.
+- If you scrape too much, Facebook might temporarily ban your IP.
+- The vast majority of unique IDs on facebook (post IDs, video IDs, photo IDs, comment IDs, profile IDs, etc) can be appended to https://www.facebook.com/ to result in a redirect to the corresponding object.
+- Some functions (such as extracting reactions) require you to be logged into Facebook (pass cookies). If something isn't working as expected, try pass cookies and see if that fixes it.
+
+## Profiles
+
+The `get_profile` function can extract information from a profile's about section. Pass in the account name or ID as the first parameter.  
+Note that Facebook serves different information depending on whether you're logged in (cookies parameter), such as Date of birth and Gender. Usage:
+
+```python
+from facebook_scraper import get_profile
+get_profile("zuck") # Or get_profile("zuck", cookies="cookies.txt")
+```
+Outputs:
+```python
+{'About': "I'm trying to make the world a more open place.",
+ 'Education': 'Harvard University\n'
+              'Computer Science and Psychology\n'
+              '30 August 2002 - 30 April 2004\n'
+              'Phillips Exeter Academy\n'
+              'Classics\n'
+              'School year 2002\n'
+              'Ardsley High School\n'
+              'High School\n'
+              'September 1998 - June 2000',
+ 'Favourite Quotes': '"Fortune favors the bold."\n'
+                     '- Virgil, Aeneid X.284\n'
+                     '\n'
+                     '"All children are artists. The problem is how to remain '
+                     'an artist once you grow up."\n'
+                     '- Pablo Picasso\n'
+                     '\n'
+                     '"Make things as simple as possible but no simpler."\n'
+                     '- Albert Einstein',
+ 'Name': 'Mark Zuckerberg',
+ 'Places lived': [{'link': '/profile.php?id=104022926303756&refid=17',
+                   'text': 'Palo Alto, California',
+                   'type': 'Current town/city'},
+                  {'link': '/profile.php?id=105506396148790&refid=17',
+                   'text': 'Dobbs Ferry, New York',
+                   'type': 'Home town'}],
+ 'Work': 'Chan Zuckerberg Initiative\n'
+         '1 December 2015 - Present\n'
+         'Facebook\n'
+         'Founder and CEO\n'
+         '4 February 2004 - Present\n'
+         'Palo Alto, California\n'
+         'Bringing the world closer together.'}
+```
+
+To extract friends, pass the argument `friends=True`, or to limit the amount of friends retrieved, set `friends` to the desired number.
+
+## Group info
+
+The `get_group_info` function can extract info about a group. Pass in the group name or ID as the first parameter.  
+Note that in order to see the list of admins, you need to be logged in (cookies parameter).
+
+Usage:
+
+```python
+from facebook_scraper import get_group_info
+get_group_info("makeupartistsgroup") # or get_group_info("makeupartistsgroup", cookies="cookies.txt")
+```
+
+Output:
+
+```python
+{'admins': [{'link': '/africanstylemagazinecom/?refid=18',
+             'name': 'African Style Magazine'},
+            {'link': '/connectfluencer/?refid=18',
+             'name': 'Everythingbrightandbeautiful'},
+            {'link': '/Kaakakigroup/?refid=18', 'name': 'Kaakaki Group'},
+            {'link': '/opentohelp/?refid=18', 'name': 'Open to Help'}],
+ 'id': '579169815767106',
+ 'members': 6814229,
+ 'name': 'HAIRSTYLES',
+ 'type': 'Public group'}
+```
+
+## To-Do
+
+- Async support
+- ~~Image galleries~~ (`images` entry)
+- ~~Profiles or post authors~~ (`get_profile()`)
+- ~~Comments~~ (with `options={'comments': True}`)
+
+
+## Alternatives and related projects
+
+- [facebook-post-scraper](https://github.com/brutalsavage/facebook-post-scraper). Has comments. Uses Selenium.
+- [facebook-scraper-selenium](https://github.com/apurvmishra99/facebook-scraper-selenium). "Scrape posts from any group or user into a .csv file without needing to register for any API access".
+- [Ultimate Facebook Scraper](https://github.com/harismuneer/Ultimate-Facebook-Scraper).  "Scrapes almost everything about a Facebook user's profile". Uses Selenium.
+- [Unofficial APIs](https://github.com/Rolstenhouse/unofficial-apis). List of unofficial APIs for various services, none for Facebook for now, but might be worth to check in the future.
+- [major-scrapy-spiders](https://github.com/talhashraf/major-scrapy-spiders). Has a profile spider for Scrapy.
+- [facebook-page-post-scraper](https://github.com/minimaxir/facebook-page-post-scraper). Seems abandoned.
+    - [FBLYZE](https://github.com/isaacmg/fb_scraper). Fork (?).
+- [RSSHub](https://github.com/DIYgod/RSSHub/blob/master/lib/routes/facebook/page.js). Generates an RSS feed from Facebook pages.
+- [RSS-Bridge](https://github.com/RSS-Bridge/rss-bridge/blob/master/bridges/FacebookBridge.php). Also generates RSS feeds from Facebook pages.
+
+
+%package -n python3-facebook-scraper
+Summary:	Scrape Facebook public pages without an API key
+Provides:	python-facebook-scraper
+BuildRequires:	python3-devel
+BuildRequires:	python3-setuptools
+BuildRequires:	python3-pip
+%description -n python3-facebook-scraper
+# Facebook Scraper
+
+[![PyPI download month](https://img.shields.io/pypi/dm/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI download week](https://img.shields.io/pypi/dw/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI download day](https://img.shields.io/pypi/dd/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+
+[![PyPI version](https://img.shields.io/pypi/v/facebook-scraper?color=blue)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI pyversions](https://img.shields.io/pypi/pyversions/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![GitHub commits since tagged version](https://img.shields.io/github/commits-since/kevinzg/facebook-scraper/v0.2.59)](https://github.com/kevinzg/facebook-scraper/commits/)
+
+[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
+
+
+Scrape Facebook public pages without an API key. Inspired by [twitter-scraper](https://github.com/kennethreitz/twitter-scraper).
+
+
+## Install
+
+To install the latest release from PyPI:
+
+```sh
+pip install facebook-scraper
+```
+
+Or, to install the latest master branch:
+
+```sh
+pip install git+https://github.com/kevinzg/facebook-scraper.git
+```
+
+## Usage
+
+Send the unique **page name, profile name, or ID** as the first parameter and you're good to go:
+
+```python
+>>> from facebook_scraper import get_posts
+
+>>> for post in get_posts('nintendo', pages=1):
+...     print(post['text'][:50])
+...
+The final step on the road to the Super Smash Bros
+We’re headed to PAX East 3/28-3/31 with new games
+```
+
+
+### Optional parameters
+
+*(For the `get_posts` function)*.
+
+- **group**: group id, to scrape groups instead of pages. Default is `None`.
+- **pages**: how many pages of posts to request, the first 2 pages may have no results, so try with a number greater than 2. Default is 10.
+- **timeout**: how many seconds to wait before timing out. Default is 30.
+- **credentials**: tuple of user and password to login before requesting the posts. Default is `None`.
+- **extra_info**: bool, if true the function will try to do an extra request to get the post reactions. Default is False.
+- **youtube_dl**: bool, use Youtube-DL for (high-quality) video extraction. You need to have youtube-dl installed on your environment. Default is False.
+- **post_urls**: list, URLs or post IDs to extract posts from. Alternative to fetching based on username.
+- **cookies**: One of:
+  - The path to a file containing cookies in Netscape or JSON format. You can extract cookies from your browser after logging into Facebook with an extension like [Get Cookies.txt (Chrome)](https://chrome.google.com/webstore/detail/get-cookiestxt/bgaddhkoddajcdgocldbbfleckgcbcid?hl=en) or [Cookie Quick Manager (Firefox)](https://addons.mozilla.org/en-US/firefox/addon/cookie-quick-manager/). Make sure that you include both the c_user cookie and the xs cookie, you will get an InvalidCookies exception if you don't.
+  - A [CookieJar](https://docs.python.org/3.9/library/http.cookiejar.html#http.cookiejar.CookieJar)
+  - A dictionary that can be converted to a CookieJar with [cookiejar_from_dict](https://2.python-requests.org/en/master/api/#requests.cookies.cookiejar_from_dict)
+  - The string `"from_browser"` to try extract Facebook cookies from your browser
+- **options**: Dictionary of options. Set `options={"comments": True}` to extract comments, set `options={"reactors": True}` to extract the people reacting to the post.
+Both `comments` and `reactors` can also be set to a number to set a limit for the amount of comments/reactors to retrieve.
+Set `options={"progress": True}` to get a `tqdm` progress bar while extracting comments and replies.
+Set `options={"allow_extra_requests": False}` to disable making extra requests when extracting post data (required for some things like full text and image links).
+Set `options={"posts_per_page": 200}` to request 200 posts per page. The default is 4.
+
+## CLI usage
+
+```sh
+$ facebook-scraper --filename nintendo_page_posts.csv --pages 10 nintendo
+```
+
+Run `facebook-scraper --help` for more details on CLI usage.
+
+**Note:** If you get a `UnicodeEncodeError` try adding `--encoding utf-8`.
+
+
+## Post example
+
+```python
+{'available': True,
+ 'comments': 459,
+ 'comments_full': None,
+ 'factcheck': None,
+ 'fetched_time': datetime.datetime(2021, 4, 20, 13, 39, 53, 651417),
+ 'image': 'https://scontent.fhlz2-1.fna.fbcdn.net/v/t1.6435-9/fr/cp0/e15/q65/58745049_2257182057699568_1761478225390731264_n.jpg?_nc_cat=111&ccb=1-3&_nc_sid=8024bb&_nc_ohc=ygH2fPmfQpAAX92ABYY&_nc_ht=scontent.fhlz2-1.fna&tp=14&oh=7a8a7b4904deb55ec696ae255fff97dd&oe=60A36717',
+ 'images': ['https://scontent.fhlz2-1.fna.fbcdn.net/v/t1.6435-9/fr/cp0/e15/q65/58745049_2257182057699568_1761478225390731264_n.jpg?_nc_cat=111&ccb=1-3&_nc_sid=8024bb&_nc_ohc=ygH2fPmfQpAAX92ABYY&_nc_ht=scontent.fhlz2-1.fna&tp=14&oh=7a8a7b4904deb55ec696ae255fff97dd&oe=60A36717'],
+ 'is_live': False,
+ 'likes': 3509,
+ 'link': 'https://www.nintendo.com/amiibo/line-up/',
+ 'post_id': '2257188721032235',
+ 'post_text': 'Don’t let this diminutive version of the Hero of Time fool you, '
+              'Young Link is just as heroic as his fully grown version! Young '
+              'Link joins the Super Smash Bros. series of amiibo figures!\n'
+              '\n'
+              'https://www.nintendo.com/amiibo/line-up/',
+ 'post_url': 'https://facebook.com/story.php?story_fbid=2257188721032235&id=119240841493711',
+ 'reactions': {'haha': 22, 'like': 2657, 'love': 706, 'sorry': 1, 'wow': 123}, # if `extra_info` was set
+ 'reactors': None,
+ 'shared_post_id': None,
+ 'shared_post_url': None,
+ 'shared_text': '',
+ 'shared_time': None,
+ 'shared_user_id': None,
+ 'shared_username': None,
+ 'shares': 441,
+ 'text': 'Don’t let this diminutive version of the Hero of Time fool you, '
+         'Young Link is just as heroic as his fully grown version! Young Link '
+         'joins the Super Smash Bros. series of amiibo figures!\n'
+         '\n'
+         'https://www.nintendo.com/amiibo/line-up/',
+ 'time': datetime.datetime(2019, 4, 30, 5, 0, 1),
+ 'user_id': '119240841493711',
+ 'username': 'Nintendo',
+ 'video': None,
+ 'video_id': None,
+ 'video_thumbnail': None,
+ 'w3_fb_url': 'https://www.facebook.com/Nintendo/posts/2257188721032235'}
+```
+
+
+### Notes
+
+- There is no guarantee that every field will be extracted (they might be `None`).
+- Group posts may be missing some fields like `time` and `post_url`.
+- Group scraping may return only one page and not work on private groups.
+- If you scrape too much, Facebook might temporarily ban your IP.
+- The vast majority of unique IDs on facebook (post IDs, video IDs, photo IDs, comment IDs, profile IDs, etc) can be appended to https://www.facebook.com/ to result in a redirect to the corresponding object.
+- Some functions (such as extracting reactions) require you to be logged into Facebook (pass cookies). If something isn't working as expected, try pass cookies and see if that fixes it.
+
+## Profiles
+
+The `get_profile` function can extract information from a profile's about section. Pass in the account name or ID as the first parameter.  
+Note that Facebook serves different information depending on whether you're logged in (cookies parameter), such as Date of birth and Gender. Usage:
+
+```python
+from facebook_scraper import get_profile
+get_profile("zuck") # Or get_profile("zuck", cookies="cookies.txt")
+```
+Outputs:
+```python
+{'About': "I'm trying to make the world a more open place.",
+ 'Education': 'Harvard University\n'
+              'Computer Science and Psychology\n'
+              '30 August 2002 - 30 April 2004\n'
+              'Phillips Exeter Academy\n'
+              'Classics\n'
+              'School year 2002\n'
+              'Ardsley High School\n'
+              'High School\n'
+              'September 1998 - June 2000',
+ 'Favourite Quotes': '"Fortune favors the bold."\n'
+                     '- Virgil, Aeneid X.284\n'
+                     '\n'
+                     '"All children are artists. The problem is how to remain '
+                     'an artist once you grow up."\n'
+                     '- Pablo Picasso\n'
+                     '\n'
+                     '"Make things as simple as possible but no simpler."\n'
+                     '- Albert Einstein',
+ 'Name': 'Mark Zuckerberg',
+ 'Places lived': [{'link': '/profile.php?id=104022926303756&refid=17',
+                   'text': 'Palo Alto, California',
+                   'type': 'Current town/city'},
+                  {'link': '/profile.php?id=105506396148790&refid=17',
+                   'text': 'Dobbs Ferry, New York',
+                   'type': 'Home town'}],
+ 'Work': 'Chan Zuckerberg Initiative\n'
+         '1 December 2015 - Present\n'
+         'Facebook\n'
+         'Founder and CEO\n'
+         '4 February 2004 - Present\n'
+         'Palo Alto, California\n'
+         'Bringing the world closer together.'}
+```
+
+To extract friends, pass the argument `friends=True`, or to limit the amount of friends retrieved, set `friends` to the desired number.
+
+## Group info
+
+The `get_group_info` function can extract info about a group. Pass in the group name or ID as the first parameter.  
+Note that in order to see the list of admins, you need to be logged in (cookies parameter).
+
+Usage:
+
+```python
+from facebook_scraper import get_group_info
+get_group_info("makeupartistsgroup") # or get_group_info("makeupartistsgroup", cookies="cookies.txt")
+```
+
+Output:
+
+```python
+{'admins': [{'link': '/africanstylemagazinecom/?refid=18',
+             'name': 'African Style Magazine'},
+            {'link': '/connectfluencer/?refid=18',
+             'name': 'Everythingbrightandbeautiful'},
+            {'link': '/Kaakakigroup/?refid=18', 'name': 'Kaakaki Group'},
+            {'link': '/opentohelp/?refid=18', 'name': 'Open to Help'}],
+ 'id': '579169815767106',
+ 'members': 6814229,
+ 'name': 'HAIRSTYLES',
+ 'type': 'Public group'}
+```
+
+## To-Do
+
+- Async support
+- ~~Image galleries~~ (`images` entry)
+- ~~Profiles or post authors~~ (`get_profile()`)
+- ~~Comments~~ (with `options={'comments': True}`)
+
+
+## Alternatives and related projects
+
+- [facebook-post-scraper](https://github.com/brutalsavage/facebook-post-scraper). Has comments. Uses Selenium.
+- [facebook-scraper-selenium](https://github.com/apurvmishra99/facebook-scraper-selenium). "Scrape posts from any group or user into a .csv file without needing to register for any API access".
+- [Ultimate Facebook Scraper](https://github.com/harismuneer/Ultimate-Facebook-Scraper).  "Scrapes almost everything about a Facebook user's profile". Uses Selenium.
+- [Unofficial APIs](https://github.com/Rolstenhouse/unofficial-apis). List of unofficial APIs for various services, none for Facebook for now, but might be worth to check in the future.
+- [major-scrapy-spiders](https://github.com/talhashraf/major-scrapy-spiders). Has a profile spider for Scrapy.
+- [facebook-page-post-scraper](https://github.com/minimaxir/facebook-page-post-scraper). Seems abandoned.
+    - [FBLYZE](https://github.com/isaacmg/fb_scraper). Fork (?).
+- [RSSHub](https://github.com/DIYgod/RSSHub/blob/master/lib/routes/facebook/page.js). Generates an RSS feed from Facebook pages.
+- [RSS-Bridge](https://github.com/RSS-Bridge/rss-bridge/blob/master/bridges/FacebookBridge.php). Also generates RSS feeds from Facebook pages.
+
+
+%package help
+Summary:	Development documents and examples for facebook-scraper
+Provides:	python3-facebook-scraper-doc
+%description help
+# Facebook Scraper
+
+[![PyPI download month](https://img.shields.io/pypi/dm/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI download week](https://img.shields.io/pypi/dw/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI download day](https://img.shields.io/pypi/dd/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+
+[![PyPI version](https://img.shields.io/pypi/v/facebook-scraper?color=blue)](https://pypi.python.org/pypi/facebook-scraper/)
+[![PyPI pyversions](https://img.shields.io/pypi/pyversions/facebook-scraper.svg)](https://pypi.python.org/pypi/facebook-scraper/)
+[![GitHub commits since tagged version](https://img.shields.io/github/commits-since/kevinzg/facebook-scraper/v0.2.59)](https://github.com/kevinzg/facebook-scraper/commits/)
+
+[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
+
+
+Scrape Facebook public pages without an API key. Inspired by [twitter-scraper](https://github.com/kennethreitz/twitter-scraper).
+
+
+## Install
+
+To install the latest release from PyPI:
+
+```sh
+pip install facebook-scraper
+```
+
+Or, to install the latest master branch:
+
+```sh
+pip install git+https://github.com/kevinzg/facebook-scraper.git
+```
+
+## Usage
+
+Send the unique **page name, profile name, or ID** as the first parameter and you're good to go:
+
+```python
+>>> from facebook_scraper import get_posts
+
+>>> for post in get_posts('nintendo', pages=1):
+...     print(post['text'][:50])
+...
+The final step on the road to the Super Smash Bros
+We’re headed to PAX East 3/28-3/31 with new games
+```
+
+
+### Optional parameters
+
+*(For the `get_posts` function)*.
+
+- **group**: group id, to scrape groups instead of pages. Default is `None`.
+- **pages**: how many pages of posts to request, the first 2 pages may have no results, so try with a number greater than 2. Default is 10.
+- **timeout**: how many seconds to wait before timing out. Default is 30.
+- **credentials**: tuple of user and password to login before requesting the posts. Default is `None`.
+- **extra_info**: bool, if true the function will try to do an extra request to get the post reactions. Default is False.
+- **youtube_dl**: bool, use Youtube-DL for (high-quality) video extraction. You need to have youtube-dl installed on your environment. Default is False.
+- **post_urls**: list, URLs or post IDs to extract posts from. Alternative to fetching based on username.
+- **cookies**: One of:
+  - The path to a file containing cookies in Netscape or JSON format. You can extract cookies from your browser after logging into Facebook with an extension like [Get Cookies.txt (Chrome)](https://chrome.google.com/webstore/detail/get-cookiestxt/bgaddhkoddajcdgocldbbfleckgcbcid?hl=en) or [Cookie Quick Manager (Firefox)](https://addons.mozilla.org/en-US/firefox/addon/cookie-quick-manager/). Make sure that you include both the c_user cookie and the xs cookie, you will get an InvalidCookies exception if you don't.
+  - A [CookieJar](https://docs.python.org/3.9/library/http.cookiejar.html#http.cookiejar.CookieJar)
+  - A dictionary that can be converted to a CookieJar with [cookiejar_from_dict](https://2.python-requests.org/en/master/api/#requests.cookies.cookiejar_from_dict)
+  - The string `"from_browser"` to try extract Facebook cookies from your browser
+- **options**: Dictionary of options. Set `options={"comments": True}` to extract comments, set `options={"reactors": True}` to extract the people reacting to the post.
+Both `comments` and `reactors` can also be set to a number to set a limit for the amount of comments/reactors to retrieve.
+Set `options={"progress": True}` to get a `tqdm` progress bar while extracting comments and replies.
+Set `options={"allow_extra_requests": False}` to disable making extra requests when extracting post data (required for some things like full text and image links).
+Set `options={"posts_per_page": 200}` to request 200 posts per page. The default is 4.
+
+## CLI usage
+
+```sh
+$ facebook-scraper --filename nintendo_page_posts.csv --pages 10 nintendo
+```
+
+Run `facebook-scraper --help` for more details on CLI usage.
+
+**Note:** If you get a `UnicodeEncodeError` try adding `--encoding utf-8`.
+
+
+## Post example
+
+```python
+{'available': True,
+ 'comments': 459,
+ 'comments_full': None,
+ 'factcheck': None,
+ 'fetched_time': datetime.datetime(2021, 4, 20, 13, 39, 53, 651417),
+ 'image': 'https://scontent.fhlz2-1.fna.fbcdn.net/v/t1.6435-9/fr/cp0/e15/q65/58745049_2257182057699568_1761478225390731264_n.jpg?_nc_cat=111&ccb=1-3&_nc_sid=8024bb&_nc_ohc=ygH2fPmfQpAAX92ABYY&_nc_ht=scontent.fhlz2-1.fna&tp=14&oh=7a8a7b4904deb55ec696ae255fff97dd&oe=60A36717',
+ 'images': ['https://scontent.fhlz2-1.fna.fbcdn.net/v/t1.6435-9/fr/cp0/e15/q65/58745049_2257182057699568_1761478225390731264_n.jpg?_nc_cat=111&ccb=1-3&_nc_sid=8024bb&_nc_ohc=ygH2fPmfQpAAX92ABYY&_nc_ht=scontent.fhlz2-1.fna&tp=14&oh=7a8a7b4904deb55ec696ae255fff97dd&oe=60A36717'],
+ 'is_live': False,
+ 'likes': 3509,
+ 'link': 'https://www.nintendo.com/amiibo/line-up/',
+ 'post_id': '2257188721032235',
+ 'post_text': 'Don’t let this diminutive version of the Hero of Time fool you, '
+              'Young Link is just as heroic as his fully grown version! Young '
+              'Link joins the Super Smash Bros. series of amiibo figures!\n'
+              '\n'
+              'https://www.nintendo.com/amiibo/line-up/',
+ 'post_url': 'https://facebook.com/story.php?story_fbid=2257188721032235&id=119240841493711',
+ 'reactions': {'haha': 22, 'like': 2657, 'love': 706, 'sorry': 1, 'wow': 123}, # if `extra_info` was set
+ 'reactors': None,
+ 'shared_post_id': None,
+ 'shared_post_url': None,
+ 'shared_text': '',
+ 'shared_time': None,
+ 'shared_user_id': None,
+ 'shared_username': None,
+ 'shares': 441,
+ 'text': 'Don’t let this diminutive version of the Hero of Time fool you, '
+         'Young Link is just as heroic as his fully grown version! Young Link '
+         'joins the Super Smash Bros. series of amiibo figures!\n'
+         '\n'
+         'https://www.nintendo.com/amiibo/line-up/',
+ 'time': datetime.datetime(2019, 4, 30, 5, 0, 1),
+ 'user_id': '119240841493711',
+ 'username': 'Nintendo',
+ 'video': None,
+ 'video_id': None,
+ 'video_thumbnail': None,
+ 'w3_fb_url': 'https://www.facebook.com/Nintendo/posts/2257188721032235'}
+```
+
+
+### Notes
+
+- There is no guarantee that every field will be extracted (they might be `None`).
+- Group posts may be missing some fields like `time` and `post_url`.
+- Group scraping may return only one page and not work on private groups.
+- If you scrape too much, Facebook might temporarily ban your IP.
+- The vast majority of unique IDs on facebook (post IDs, video IDs, photo IDs, comment IDs, profile IDs, etc) can be appended to https://www.facebook.com/ to result in a redirect to the corresponding object.
+- Some functions (such as extracting reactions) require you to be logged into Facebook (pass cookies). If something isn't working as expected, try pass cookies and see if that fixes it.
+
+## Profiles
+
+The `get_profile` function can extract information from a profile's about section. Pass in the account name or ID as the first parameter.  
+Note that Facebook serves different information depending on whether you're logged in (cookies parameter), such as Date of birth and Gender. Usage:
+
+```python
+from facebook_scraper import get_profile
+get_profile("zuck") # Or get_profile("zuck", cookies="cookies.txt")
+```
+Outputs:
+```python
+{'About': "I'm trying to make the world a more open place.",
+ 'Education': 'Harvard University\n'
+              'Computer Science and Psychology\n'
+              '30 August 2002 - 30 April 2004\n'
+              'Phillips Exeter Academy\n'
+              'Classics\n'
+              'School year 2002\n'
+              'Ardsley High School\n'
+              'High School\n'
+              'September 1998 - June 2000',
+ 'Favourite Quotes': '"Fortune favors the bold."\n'
+                     '- Virgil, Aeneid X.284\n'
+                     '\n'
+                     '"All children are artists. The problem is how to remain '
+                     'an artist once you grow up."\n'
+                     '- Pablo Picasso\n'
+                     '\n'
+                     '"Make things as simple as possible but no simpler."\n'
+                     '- Albert Einstein',
+ 'Name': 'Mark Zuckerberg',
+ 'Places lived': [{'link': '/profile.php?id=104022926303756&refid=17',
+                   'text': 'Palo Alto, California',
+                   'type': 'Current town/city'},
+                  {'link': '/profile.php?id=105506396148790&refid=17',
+                   'text': 'Dobbs Ferry, New York',
+                   'type': 'Home town'}],
+ 'Work': 'Chan Zuckerberg Initiative\n'
+         '1 December 2015 - Present\n'
+         'Facebook\n'
+         'Founder and CEO\n'
+         '4 February 2004 - Present\n'
+         'Palo Alto, California\n'
+         'Bringing the world closer together.'}
+```
+
+To extract friends, pass the argument `friends=True`, or to limit the amount of friends retrieved, set `friends` to the desired number.
+
+## Group info
+
+The `get_group_info` function can extract info about a group. Pass in the group name or ID as the first parameter.  
+Note that in order to see the list of admins, you need to be logged in (cookies parameter).
+
+Usage:
+
+```python
+from facebook_scraper import get_group_info
+get_group_info("makeupartistsgroup") # or get_group_info("makeupartistsgroup", cookies="cookies.txt")
+```
+
+Output:
+
+```python
+{'admins': [{'link': '/africanstylemagazinecom/?refid=18',
+             'name': 'African Style Magazine'},
+            {'link': '/connectfluencer/?refid=18',
+             'name': 'Everythingbrightandbeautiful'},
+            {'link': '/Kaakakigroup/?refid=18', 'name': 'Kaakaki Group'},
+            {'link': '/opentohelp/?refid=18', 'name': 'Open to Help'}],
+ 'id': '579169815767106',
+ 'members': 6814229,
+ 'name': 'HAIRSTYLES',
+ 'type': 'Public group'}
+```
+
+## To-Do
+
+- Async support
+- ~~Image galleries~~ (`images` entry)
+- ~~Profiles or post authors~~ (`get_profile()`)
+- ~~Comments~~ (with `options={'comments': True}`)
+
+
+## Alternatives and related projects
+
+- [facebook-post-scraper](https://github.com/brutalsavage/facebook-post-scraper). Has comments. Uses Selenium.
+- [facebook-scraper-selenium](https://github.com/apurvmishra99/facebook-scraper-selenium). "Scrape posts from any group or user into a .csv file without needing to register for any API access".
+- [Ultimate Facebook Scraper](https://github.com/harismuneer/Ultimate-Facebook-Scraper).  "Scrapes almost everything about a Facebook user's profile". Uses Selenium.
+- [Unofficial APIs](https://github.com/Rolstenhouse/unofficial-apis). List of unofficial APIs for various services, none for Facebook for now, but might be worth to check in the future.
+- [major-scrapy-spiders](https://github.com/talhashraf/major-scrapy-spiders). Has a profile spider for Scrapy.
+- [facebook-page-post-scraper](https://github.com/minimaxir/facebook-page-post-scraper). Seems abandoned.
+    - [FBLYZE](https://github.com/isaacmg/fb_scraper). Fork (?).
+- [RSSHub](https://github.com/DIYgod/RSSHub/blob/master/lib/routes/facebook/page.js). Generates an RSS feed from Facebook pages.
+- [RSS-Bridge](https://github.com/RSS-Bridge/rss-bridge/blob/master/bridges/FacebookBridge.php). Also generates RSS feeds from Facebook pages.
+
+
+%prep
+%autosetup -n facebook-scraper-0.2.59
+
+%build
+%py3_build
+
+%install
+%py3_install
+install -d -m755 %{buildroot}/%{_pkgdocdir}
+if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
+if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
+if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
+if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
+pushd %{buildroot}
+if [ -d usr/lib ]; then
+	find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/lib64 ]; then
+	find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/bin ]; then
+	find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/sbin ]; then
+	find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+touch doclist.lst
+if [ -d usr/share/man ]; then
+	find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
+fi
+popd
+mv %{buildroot}/filelist.lst .
+mv %{buildroot}/doclist.lst .
+
+%files -n python3-facebook-scraper -f filelist.lst
+%dir %{python3_sitelib}/*
+
+%files help -f doclist.lst
+%{_docdir}/*
+
+%changelog
+* Tue Apr 11 2023 Python_Bot <Python_Bot@openeuler.org> - 0.2.59-1
+- Package Spec generated
author	CoprDistGit <infra@openeuler.org>	2023-04-11 14:38:34 +0000
committer	CoprDistGit <infra@openeuler.org>	2023-04-11 14:38:34 +0000
commit	2fa72953076f4a52c48b739a1adaadaa8e537656 (patch)
tree	d9539fc33efcff7a7bbfd2f3b54867cba455473d /python-facebook-scraper.spec
parent	02e856558ea810aceda1d582e0a30b9a577fff9b (diff)