summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorCoprDistGit <infra@openeuler.org>2023-05-05 06:08:58 +0000
committerCoprDistGit <infra@openeuler.org>2023-05-05 06:08:58 +0000
commit5acb10d59dae77ba000c6c35579fe51d969426a0 (patch)
tree0afed18c5d61c253642794f25ac13235ef9043fa
parentefe4a97c4ed65907d36131c5c6561df41d365217 (diff)
automatic import of python-socid-extractoropeneuler20.03
-rw-r--r--.gitignore1
-rw-r--r--python-socid-extractor.spec324
-rw-r--r--sources1
3 files changed, 326 insertions, 0 deletions
diff --git a/.gitignore b/.gitignore
index e69de29..64be847 100644
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1 @@
+/socid-extractor-0.0.23.tar.gz
diff --git a/python-socid-extractor.spec b/python-socid-extractor.spec
new file mode 100644
index 0000000..4760f33
--- /dev/null
+++ b/python-socid-extractor.spec
@@ -0,0 +1,324 @@
+%global _empty_manifest_terminate_build 0
+Name: python-socid-extractor
+Version: 0.0.23
+Release: 1
+Summary: Extract accounts' identifiers from personal pages on various platforms
+License: GPL-3.0
+URL: https://github.com/soxoj/socid-extractor
+Source0: https://mirrors.nju.edu.cn/pypi/web/packages/a3/a9/076be3ccadc93c8a16f362aad8f235abbc5314e92864ed8aae353587134b/socid-extractor-0.0.23.tar.gz
+BuildArch: noarch
+
+Requires: python3-requests
+Requires: python3-dateutil
+Requires: python3-beautifulsoup4
+
+%description
+# socid_extractor
+
+Extract information about a user from profile webpages / API responses and save it in machine-readable format.
+
+## When it may be useful
+
+- Getting all available info by the username or/and account UID. Examples: [Week in OSINT](https://medium.com/week-in-osint/getting-a-grasp-on-googleids-77a8ab707e43), [OSINTCurious](https://osintcurio.us/2019/10/01/searching-instagram-part-2/)
+- Users tracking, checking that the account was previously known (by ID) even if all public info has changed. Examples: [Aware Online](https://www.aware-online.com/en/importance-of-user-ids-in-social-media-investigations/)
+- Searching by commonly used cross-service UIDs (GAIA ID, Facebook UID, Yandex Public ID, etc.)
+ - DB leaks of forums and platforms in SQL format
+ - Indexed links that contain target profile ID
+- Searching for tracking data by comparison with other IDs - [how it works](https://www.eff.org/wp/behind-the-one-way-mirror), [how can it be used](https://www.nytimes.com/interactive/2019/12/19/opinion/location-tracking-cell-phone.html).
+- Law enforcement online requests
+
+## Tools using socid_extractor
+
+[Maigret](https://github.com/soxoj/maigret) - powerful namechecker, generate a report with all available info from accounts found.
+
+[YaSeeker](https://github.com/HowToFind-bot/YaSeeker) - tool to gather all available information about Yandex account by login/email.
+
+## Installation
+
+ $ pip3 install socid-extractor
+
+The latest development version can be installed directly from GitHub:
+
+ $ pip3 install -U git+https://github.com/soxoj/socid_extractor.git
+
+## Using
+
+As a command-line tool:
+```
+$ socid_extractor --url https://www.deviantart.com/muse1908
+country: France
+created_at: 2005-06-16 18:17:41
+gender: female
+username: Muse1908
+website: www.patreon.com/musemercier
+links: ['https://www.facebook.com/musemercier', 'https://www.instagram.com/muse.mercier/', 'https://www.patreon.com/musemercier']
+tagline: Nothing worth having is easy...
+```
+
+Without installing:
+```
+$ ./run.py --url https://www.deviantart.com/muse1908
+```
+
+As a Python library:
+```
+>>> import socid_extractor, requests
+>>> r = requests.get('https://www.patreon.com/annetlovart')
+>>> socid_extractor.extract(r.text)
+{'patreon_id': '33913189', 'patreon_username': 'annetlovart', 'fullname': 'Annet Lovart', 'links': "['https://www.facebook.com/322598031832479', 'https://www.instagram.com/annet_lovart', 'https://twitter.com/annet_lovart', 'https://youtube.com/channel/UClDg4ntlOW_1j73zqSJxHHQ']"}
+```
+
+## Sites
+
+- Google (all documents pages, maps contributions), cookies required
+- Yandex (disk, albums, znatoki, music, realty, collections), cookies required to prevent captcha blocks
+- Facebook (user & group pages)
+- Instagram
+- Reddit
+- Medium
+- Flickr
+- Tumblr
+- TikTok
+- GitHub
+- VK (user page)
+- OK (user page)
+- Mail.ru (my.mail.ru user mainpage, photo, video, games, communities)
+
+...and many others.
+
+Check [tests file](./tests/test_e2e.py) for extracted data examples, [schemes file](./socid_extractor/schemes.py) to check all supported sites.
+
+
+## Testing
+
+```sh
+python3 -m pytest tests/test_e2e.py -n 10 -k 'not cookies' -m 'not github_failed and not rate_limited'
+```
+
+
+
+
+%package -n python3-socid-extractor
+Summary: Extract accounts' identifiers from personal pages on various platforms
+Provides: python-socid-extractor
+BuildRequires: python3-devel
+BuildRequires: python3-setuptools
+BuildRequires: python3-pip
+%description -n python3-socid-extractor
+# socid_extractor
+
+Extract information about a user from profile webpages / API responses and save it in machine-readable format.
+
+## When it may be useful
+
+- Getting all available info by the username or/and account UID. Examples: [Week in OSINT](https://medium.com/week-in-osint/getting-a-grasp-on-googleids-77a8ab707e43), [OSINTCurious](https://osintcurio.us/2019/10/01/searching-instagram-part-2/)
+- Users tracking, checking that the account was previously known (by ID) even if all public info has changed. Examples: [Aware Online](https://www.aware-online.com/en/importance-of-user-ids-in-social-media-investigations/)
+- Searching by commonly used cross-service UIDs (GAIA ID, Facebook UID, Yandex Public ID, etc.)
+ - DB leaks of forums and platforms in SQL format
+ - Indexed links that contain target profile ID
+- Searching for tracking data by comparison with other IDs - [how it works](https://www.eff.org/wp/behind-the-one-way-mirror), [how can it be used](https://www.nytimes.com/interactive/2019/12/19/opinion/location-tracking-cell-phone.html).
+- Law enforcement online requests
+
+## Tools using socid_extractor
+
+[Maigret](https://github.com/soxoj/maigret) - powerful namechecker, generate a report with all available info from accounts found.
+
+[YaSeeker](https://github.com/HowToFind-bot/YaSeeker) - tool to gather all available information about Yandex account by login/email.
+
+## Installation
+
+ $ pip3 install socid-extractor
+
+The latest development version can be installed directly from GitHub:
+
+ $ pip3 install -U git+https://github.com/soxoj/socid_extractor.git
+
+## Using
+
+As a command-line tool:
+```
+$ socid_extractor --url https://www.deviantart.com/muse1908
+country: France
+created_at: 2005-06-16 18:17:41
+gender: female
+username: Muse1908
+website: www.patreon.com/musemercier
+links: ['https://www.facebook.com/musemercier', 'https://www.instagram.com/muse.mercier/', 'https://www.patreon.com/musemercier']
+tagline: Nothing worth having is easy...
+```
+
+Without installing:
+```
+$ ./run.py --url https://www.deviantart.com/muse1908
+```
+
+As a Python library:
+```
+>>> import socid_extractor, requests
+>>> r = requests.get('https://www.patreon.com/annetlovart')
+>>> socid_extractor.extract(r.text)
+{'patreon_id': '33913189', 'patreon_username': 'annetlovart', 'fullname': 'Annet Lovart', 'links': "['https://www.facebook.com/322598031832479', 'https://www.instagram.com/annet_lovart', 'https://twitter.com/annet_lovart', 'https://youtube.com/channel/UClDg4ntlOW_1j73zqSJxHHQ']"}
+```
+
+## Sites
+
+- Google (all documents pages, maps contributions), cookies required
+- Yandex (disk, albums, znatoki, music, realty, collections), cookies required to prevent captcha blocks
+- Facebook (user & group pages)
+- Instagram
+- Reddit
+- Medium
+- Flickr
+- Tumblr
+- TikTok
+- GitHub
+- VK (user page)
+- OK (user page)
+- Mail.ru (my.mail.ru user mainpage, photo, video, games, communities)
+
+...and many others.
+
+Check [tests file](./tests/test_e2e.py) for extracted data examples, [schemes file](./socid_extractor/schemes.py) to check all supported sites.
+
+
+## Testing
+
+```sh
+python3 -m pytest tests/test_e2e.py -n 10 -k 'not cookies' -m 'not github_failed and not rate_limited'
+```
+
+
+
+
+%package help
+Summary: Development documents and examples for socid-extractor
+Provides: python3-socid-extractor-doc
+%description help
+# socid_extractor
+
+Extract information about a user from profile webpages / API responses and save it in machine-readable format.
+
+## When it may be useful
+
+- Getting all available info by the username or/and account UID. Examples: [Week in OSINT](https://medium.com/week-in-osint/getting-a-grasp-on-googleids-77a8ab707e43), [OSINTCurious](https://osintcurio.us/2019/10/01/searching-instagram-part-2/)
+- Users tracking, checking that the account was previously known (by ID) even if all public info has changed. Examples: [Aware Online](https://www.aware-online.com/en/importance-of-user-ids-in-social-media-investigations/)
+- Searching by commonly used cross-service UIDs (GAIA ID, Facebook UID, Yandex Public ID, etc.)
+ - DB leaks of forums and platforms in SQL format
+ - Indexed links that contain target profile ID
+- Searching for tracking data by comparison with other IDs - [how it works](https://www.eff.org/wp/behind-the-one-way-mirror), [how can it be used](https://www.nytimes.com/interactive/2019/12/19/opinion/location-tracking-cell-phone.html).
+- Law enforcement online requests
+
+## Tools using socid_extractor
+
+[Maigret](https://github.com/soxoj/maigret) - powerful namechecker, generate a report with all available info from accounts found.
+
+[YaSeeker](https://github.com/HowToFind-bot/YaSeeker) - tool to gather all available information about Yandex account by login/email.
+
+## Installation
+
+ $ pip3 install socid-extractor
+
+The latest development version can be installed directly from GitHub:
+
+ $ pip3 install -U git+https://github.com/soxoj/socid_extractor.git
+
+## Using
+
+As a command-line tool:
+```
+$ socid_extractor --url https://www.deviantart.com/muse1908
+country: France
+created_at: 2005-06-16 18:17:41
+gender: female
+username: Muse1908
+website: www.patreon.com/musemercier
+links: ['https://www.facebook.com/musemercier', 'https://www.instagram.com/muse.mercier/', 'https://www.patreon.com/musemercier']
+tagline: Nothing worth having is easy...
+```
+
+Without installing:
+```
+$ ./run.py --url https://www.deviantart.com/muse1908
+```
+
+As a Python library:
+```
+>>> import socid_extractor, requests
+>>> r = requests.get('https://www.patreon.com/annetlovart')
+>>> socid_extractor.extract(r.text)
+{'patreon_id': '33913189', 'patreon_username': 'annetlovart', 'fullname': 'Annet Lovart', 'links': "['https://www.facebook.com/322598031832479', 'https://www.instagram.com/annet_lovart', 'https://twitter.com/annet_lovart', 'https://youtube.com/channel/UClDg4ntlOW_1j73zqSJxHHQ']"}
+```
+
+## Sites
+
+- Google (all documents pages, maps contributions), cookies required
+- Yandex (disk, albums, znatoki, music, realty, collections), cookies required to prevent captcha blocks
+- Facebook (user & group pages)
+- Instagram
+- Reddit
+- Medium
+- Flickr
+- Tumblr
+- TikTok
+- GitHub
+- VK (user page)
+- OK (user page)
+- Mail.ru (my.mail.ru user mainpage, photo, video, games, communities)
+
+...and many others.
+
+Check [tests file](./tests/test_e2e.py) for extracted data examples, [schemes file](./socid_extractor/schemes.py) to check all supported sites.
+
+
+## Testing
+
+```sh
+python3 -m pytest tests/test_e2e.py -n 10 -k 'not cookies' -m 'not github_failed and not rate_limited'
+```
+
+
+
+
+%prep
+%autosetup -n socid-extractor-0.0.23
+
+%build
+%py3_build
+
+%install
+%py3_install
+install -d -m755 %{buildroot}/%{_pkgdocdir}
+if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
+if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
+if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
+if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
+pushd %{buildroot}
+if [ -d usr/lib ]; then
+ find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/lib64 ]; then
+ find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/bin ]; then
+ find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/sbin ]; then
+ find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+touch doclist.lst
+if [ -d usr/share/man ]; then
+ find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
+fi
+popd
+mv %{buildroot}/filelist.lst .
+mv %{buildroot}/doclist.lst .
+
+%files -n python3-socid-extractor -f filelist.lst
+%dir %{python3_sitelib}/*
+
+%files help -f doclist.lst
+%{_docdir}/*
+
+%changelog
+* Fri May 05 2023 Python_Bot <Python_Bot@openeuler.org> - 0.0.23-1
+- Package Spec generated
diff --git a/sources b/sources
new file mode 100644
index 0000000..760b3f1
--- /dev/null
+++ b/sources
@@ -0,0 +1 @@
+a75ad01c463aa103f7648aa211a23dfd socid-extractor-0.0.23.tar.gz