diff options
| author | CoprDistGit <infra@openeuler.org> | 2023-05-05 06:08:58 +0000 |
|---|---|---|
| committer | CoprDistGit <infra@openeuler.org> | 2023-05-05 06:08:58 +0000 |
| commit | 5acb10d59dae77ba000c6c35579fe51d969426a0 (patch) | |
| tree | 0afed18c5d61c253642794f25ac13235ef9043fa | |
| parent | efe4a97c4ed65907d36131c5c6561df41d365217 (diff) | |
automatic import of python-socid-extractoropeneuler20.03
| -rw-r--r-- | .gitignore | 1 | ||||
| -rw-r--r-- | python-socid-extractor.spec | 324 | ||||
| -rw-r--r-- | sources | 1 |
3 files changed, 326 insertions, 0 deletions
@@ -0,0 +1 @@ +/socid-extractor-0.0.23.tar.gz diff --git a/python-socid-extractor.spec b/python-socid-extractor.spec new file mode 100644 index 0000000..4760f33 --- /dev/null +++ b/python-socid-extractor.spec @@ -0,0 +1,324 @@ +%global _empty_manifest_terminate_build 0 +Name: python-socid-extractor +Version: 0.0.23 +Release: 1 +Summary: Extract accounts' identifiers from personal pages on various platforms +License: GPL-3.0 +URL: https://github.com/soxoj/socid-extractor +Source0: https://mirrors.nju.edu.cn/pypi/web/packages/a3/a9/076be3ccadc93c8a16f362aad8f235abbc5314e92864ed8aae353587134b/socid-extractor-0.0.23.tar.gz +BuildArch: noarch + +Requires: python3-requests +Requires: python3-dateutil +Requires: python3-beautifulsoup4 + +%description +# socid_extractor + +Extract information about a user from profile webpages / API responses and save it in machine-readable format. + +## When it may be useful + +- Getting all available info by the username or/and account UID. Examples: [Week in OSINT](https://medium.com/week-in-osint/getting-a-grasp-on-googleids-77a8ab707e43), [OSINTCurious](https://osintcurio.us/2019/10/01/searching-instagram-part-2/) +- Users tracking, checking that the account was previously known (by ID) even if all public info has changed. Examples: [Aware Online](https://www.aware-online.com/en/importance-of-user-ids-in-social-media-investigations/) +- Searching by commonly used cross-service UIDs (GAIA ID, Facebook UID, Yandex Public ID, etc.) + - DB leaks of forums and platforms in SQL format + - Indexed links that contain target profile ID +- Searching for tracking data by comparison with other IDs - [how it works](https://www.eff.org/wp/behind-the-one-way-mirror), [how can it be used](https://www.nytimes.com/interactive/2019/12/19/opinion/location-tracking-cell-phone.html). +- Law enforcement online requests + +## Tools using socid_extractor + +[Maigret](https://github.com/soxoj/maigret) - powerful namechecker, generate a report with all available info from accounts found. + +[YaSeeker](https://github.com/HowToFind-bot/YaSeeker) - tool to gather all available information about Yandex account by login/email. + +## Installation + + $ pip3 install socid-extractor + +The latest development version can be installed directly from GitHub: + + $ pip3 install -U git+https://github.com/soxoj/socid_extractor.git + +## Using + +As a command-line tool: +``` +$ socid_extractor --url https://www.deviantart.com/muse1908 +country: France +created_at: 2005-06-16 18:17:41 +gender: female +username: Muse1908 +website: www.patreon.com/musemercier +links: ['https://www.facebook.com/musemercier', 'https://www.instagram.com/muse.mercier/', 'https://www.patreon.com/musemercier'] +tagline: Nothing worth having is easy... +``` + +Without installing: +``` +$ ./run.py --url https://www.deviantart.com/muse1908 +``` + +As a Python library: +``` +>>> import socid_extractor, requests +>>> r = requests.get('https://www.patreon.com/annetlovart') +>>> socid_extractor.extract(r.text) +{'patreon_id': '33913189', 'patreon_username': 'annetlovart', 'fullname': 'Annet Lovart', 'links': "['https://www.facebook.com/322598031832479', 'https://www.instagram.com/annet_lovart', 'https://twitter.com/annet_lovart', 'https://youtube.com/channel/UClDg4ntlOW_1j73zqSJxHHQ']"} +``` + +## Sites + +- Google (all documents pages, maps contributions), cookies required +- Yandex (disk, albums, znatoki, music, realty, collections), cookies required to prevent captcha blocks +- Facebook (user & group pages) +- Instagram +- Reddit +- Medium +- Flickr +- Tumblr +- TikTok +- GitHub +- VK (user page) +- OK (user page) +- Mail.ru (my.mail.ru user mainpage, photo, video, games, communities) + +...and many others. + +Check [tests file](./tests/test_e2e.py) for extracted data examples, [schemes file](./socid_extractor/schemes.py) to check all supported sites. + + +## Testing + +```sh +python3 -m pytest tests/test_e2e.py -n 10 -k 'not cookies' -m 'not github_failed and not rate_limited' +``` + + + + +%package -n python3-socid-extractor +Summary: Extract accounts' identifiers from personal pages on various platforms +Provides: python-socid-extractor +BuildRequires: python3-devel +BuildRequires: python3-setuptools +BuildRequires: python3-pip +%description -n python3-socid-extractor +# socid_extractor + +Extract information about a user from profile webpages / API responses and save it in machine-readable format. + +## When it may be useful + +- Getting all available info by the username or/and account UID. Examples: [Week in OSINT](https://medium.com/week-in-osint/getting-a-grasp-on-googleids-77a8ab707e43), [OSINTCurious](https://osintcurio.us/2019/10/01/searching-instagram-part-2/) +- Users tracking, checking that the account was previously known (by ID) even if all public info has changed. Examples: [Aware Online](https://www.aware-online.com/en/importance-of-user-ids-in-social-media-investigations/) +- Searching by commonly used cross-service UIDs (GAIA ID, Facebook UID, Yandex Public ID, etc.) + - DB leaks of forums and platforms in SQL format + - Indexed links that contain target profile ID +- Searching for tracking data by comparison with other IDs - [how it works](https://www.eff.org/wp/behind-the-one-way-mirror), [how can it be used](https://www.nytimes.com/interactive/2019/12/19/opinion/location-tracking-cell-phone.html). +- Law enforcement online requests + +## Tools using socid_extractor + +[Maigret](https://github.com/soxoj/maigret) - powerful namechecker, generate a report with all available info from accounts found. + +[YaSeeker](https://github.com/HowToFind-bot/YaSeeker) - tool to gather all available information about Yandex account by login/email. + +## Installation + + $ pip3 install socid-extractor + +The latest development version can be installed directly from GitHub: + + $ pip3 install -U git+https://github.com/soxoj/socid_extractor.git + +## Using + +As a command-line tool: +``` +$ socid_extractor --url https://www.deviantart.com/muse1908 +country: France +created_at: 2005-06-16 18:17:41 +gender: female +username: Muse1908 +website: www.patreon.com/musemercier +links: ['https://www.facebook.com/musemercier', 'https://www.instagram.com/muse.mercier/', 'https://www.patreon.com/musemercier'] +tagline: Nothing worth having is easy... +``` + +Without installing: +``` +$ ./run.py --url https://www.deviantart.com/muse1908 +``` + +As a Python library: +``` +>>> import socid_extractor, requests +>>> r = requests.get('https://www.patreon.com/annetlovart') +>>> socid_extractor.extract(r.text) +{'patreon_id': '33913189', 'patreon_username': 'annetlovart', 'fullname': 'Annet Lovart', 'links': "['https://www.facebook.com/322598031832479', 'https://www.instagram.com/annet_lovart', 'https://twitter.com/annet_lovart', 'https://youtube.com/channel/UClDg4ntlOW_1j73zqSJxHHQ']"} +``` + +## Sites + +- Google (all documents pages, maps contributions), cookies required +- Yandex (disk, albums, znatoki, music, realty, collections), cookies required to prevent captcha blocks +- Facebook (user & group pages) +- Instagram +- Reddit +- Medium +- Flickr +- Tumblr +- TikTok +- GitHub +- VK (user page) +- OK (user page) +- Mail.ru (my.mail.ru user mainpage, photo, video, games, communities) + +...and many others. + +Check [tests file](./tests/test_e2e.py) for extracted data examples, [schemes file](./socid_extractor/schemes.py) to check all supported sites. + + +## Testing + +```sh +python3 -m pytest tests/test_e2e.py -n 10 -k 'not cookies' -m 'not github_failed and not rate_limited' +``` + + + + +%package help +Summary: Development documents and examples for socid-extractor +Provides: python3-socid-extractor-doc +%description help +# socid_extractor + +Extract information about a user from profile webpages / API responses and save it in machine-readable format. + +## When it may be useful + +- Getting all available info by the username or/and account UID. Examples: [Week in OSINT](https://medium.com/week-in-osint/getting-a-grasp-on-googleids-77a8ab707e43), [OSINTCurious](https://osintcurio.us/2019/10/01/searching-instagram-part-2/) +- Users tracking, checking that the account was previously known (by ID) even if all public info has changed. Examples: [Aware Online](https://www.aware-online.com/en/importance-of-user-ids-in-social-media-investigations/) +- Searching by commonly used cross-service UIDs (GAIA ID, Facebook UID, Yandex Public ID, etc.) + - DB leaks of forums and platforms in SQL format + - Indexed links that contain target profile ID +- Searching for tracking data by comparison with other IDs - [how it works](https://www.eff.org/wp/behind-the-one-way-mirror), [how can it be used](https://www.nytimes.com/interactive/2019/12/19/opinion/location-tracking-cell-phone.html). +- Law enforcement online requests + +## Tools using socid_extractor + +[Maigret](https://github.com/soxoj/maigret) - powerful namechecker, generate a report with all available info from accounts found. + +[YaSeeker](https://github.com/HowToFind-bot/YaSeeker) - tool to gather all available information about Yandex account by login/email. + +## Installation + + $ pip3 install socid-extractor + +The latest development version can be installed directly from GitHub: + + $ pip3 install -U git+https://github.com/soxoj/socid_extractor.git + +## Using + +As a command-line tool: +``` +$ socid_extractor --url https://www.deviantart.com/muse1908 +country: France +created_at: 2005-06-16 18:17:41 +gender: female +username: Muse1908 +website: www.patreon.com/musemercier +links: ['https://www.facebook.com/musemercier', 'https://www.instagram.com/muse.mercier/', 'https://www.patreon.com/musemercier'] +tagline: Nothing worth having is easy... +``` + +Without installing: +``` +$ ./run.py --url https://www.deviantart.com/muse1908 +``` + +As a Python library: +``` +>>> import socid_extractor, requests +>>> r = requests.get('https://www.patreon.com/annetlovart') +>>> socid_extractor.extract(r.text) +{'patreon_id': '33913189', 'patreon_username': 'annetlovart', 'fullname': 'Annet Lovart', 'links': "['https://www.facebook.com/322598031832479', 'https://www.instagram.com/annet_lovart', 'https://twitter.com/annet_lovart', 'https://youtube.com/channel/UClDg4ntlOW_1j73zqSJxHHQ']"} +``` + +## Sites + +- Google (all documents pages, maps contributions), cookies required +- Yandex (disk, albums, znatoki, music, realty, collections), cookies required to prevent captcha blocks +- Facebook (user & group pages) +- Instagram +- Reddit +- Medium +- Flickr +- Tumblr +- TikTok +- GitHub +- VK (user page) +- OK (user page) +- Mail.ru (my.mail.ru user mainpage, photo, video, games, communities) + +...and many others. + +Check [tests file](./tests/test_e2e.py) for extracted data examples, [schemes file](./socid_extractor/schemes.py) to check all supported sites. + + +## Testing + +```sh +python3 -m pytest tests/test_e2e.py -n 10 -k 'not cookies' -m 'not github_failed and not rate_limited' +``` + + + + +%prep +%autosetup -n socid-extractor-0.0.23 + +%build +%py3_build + +%install +%py3_install +install -d -m755 %{buildroot}/%{_pkgdocdir} +if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi +if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi +if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi +if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi +pushd %{buildroot} +if [ -d usr/lib ]; then + find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/lib64 ]; then + find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/bin ]; then + find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/sbin ]; then + find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst +fi +touch doclist.lst +if [ -d usr/share/man ]; then + find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst +fi +popd +mv %{buildroot}/filelist.lst . +mv %{buildroot}/doclist.lst . + +%files -n python3-socid-extractor -f filelist.lst +%dir %{python3_sitelib}/* + +%files help -f doclist.lst +%{_docdir}/* + +%changelog +* Fri May 05 2023 Python_Bot <Python_Bot@openeuler.org> - 0.0.23-1 +- Package Spec generated @@ -0,0 +1 @@ +a75ad01c463aa103f7648aa211a23dfd socid-extractor-0.0.23.tar.gz |
