%global _empty_manifest_terminate_build 0 Name: python-streamcorpus_pipeline Version: 0.7.21 Release: 1 Summary: Tools for building streamcorpus objects, such as those used in TREC. License: MIT/X11 license http://opensource.org/licenses/MIT URL: http://github.com/trec-kba/streamcorpus-pipeline Source0: https://mirrors.nju.edu.cn/pypi/web/packages/c4/4a/5d99efe84c4fd702db21044cf76d15149eb997aad0610d637de1e9ff6528/streamcorpus_pipeline-0.7.21.tar.gz BuildArch: noarch %description streamcorpus_pipeline is a document processing pipeline that assembles streamcorpus objects from raw data sets. The streamcorpus_pipeline python module contains tools for processing streamcorpus.StreamItem objects stored in Chunks. It includes transform functions for getting clean_html, clean_visible, creating labels from hyperlinks to particular sites (e.g. Wikipedia), and taggers like LingPipe, Serif, and Factorie, which make Tokens and Sentences. Read more at [streamcorpus.org](http://streamcorpus.org/) %package -n python3-streamcorpus_pipeline Summary: Tools for building streamcorpus objects, such as those used in TREC. Provides: python-streamcorpus_pipeline BuildRequires: python3-devel BuildRequires: python3-setuptools BuildRequires: python3-pip %description -n python3-streamcorpus_pipeline streamcorpus_pipeline is a document processing pipeline that assembles streamcorpus objects from raw data sets. The streamcorpus_pipeline python module contains tools for processing streamcorpus.StreamItem objects stored in Chunks. It includes transform functions for getting clean_html, clean_visible, creating labels from hyperlinks to particular sites (e.g. Wikipedia), and taggers like LingPipe, Serif, and Factorie, which make Tokens and Sentences. Read more at [streamcorpus.org](http://streamcorpus.org/) %package help Summary: Development documents and examples for streamcorpus_pipeline Provides: python3-streamcorpus_pipeline-doc %description help streamcorpus_pipeline is a document processing pipeline that assembles streamcorpus objects from raw data sets. The streamcorpus_pipeline python module contains tools for processing streamcorpus.StreamItem objects stored in Chunks. It includes transform functions for getting clean_html, clean_visible, creating labels from hyperlinks to particular sites (e.g. Wikipedia), and taggers like LingPipe, Serif, and Factorie, which make Tokens and Sentences. Read more at [streamcorpus.org](http://streamcorpus.org/) %prep %autosetup -n streamcorpus_pipeline-0.7.21 %build %py3_build %install %py3_install install -d -m755 %{buildroot}/%{_pkgdocdir} if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi pushd %{buildroot} if [ -d usr/lib ]; then find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/lib64 ]; then find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/bin ]; then find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/sbin ]; then find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst fi touch doclist.lst if [ -d usr/share/man ]; then find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst fi popd mv %{buildroot}/filelist.lst . mv %{buildroot}/doclist.lst . %files -n python3-streamcorpus_pipeline -f filelist.lst %dir %{python3_sitelib}/* %files help -f doclist.lst %{_docdir}/* %changelog * Fri May 05 2023 Python_Bot - 0.7.21-1 - Package Spec generated