%global _empty_manifest_terminate_build 0
Name: python-SoMaJo
Version: 2.2.3
Release: 1
Summary: A tokenizer and sentence splitter for German and English web and social media texts.
License: GNU General Public License v3 or later (GPLv3+)
URL: https://github.com/tsproisl/SoMaJo
Source0: https://mirrors.nju.edu.cn/pypi/web/packages/90/40/6f501c33e95f3952e3f8e8ca4436a88fb5e3788e1c98c8224e5e6722be7c/SoMaJo-2.2.3.tar.gz
BuildArch: noarch
Requires: python3-regex
%description
SoMaJo is a state-of-the-art tokenizer for German and English web and
social media texts. It won the `EmpiriST 2015 shared task
`_ on automatic
linguistic annotation of computer-mediated communication / social
media. As such, it is particularly well-suited to perform tokenization
on all kinds of written discourse, for example chats, forums, wiki
talk pages, tweets, blog comments, social networks, SMS and WhatsApp
dialogues.
More detailed documentation is available `here
`_.
%package -n python3-SoMaJo
Summary: A tokenizer and sentence splitter for German and English web and social media texts.
Provides: python-SoMaJo
BuildRequires: python3-devel
BuildRequires: python3-setuptools
BuildRequires: python3-pip
%description -n python3-SoMaJo
SoMaJo is a state-of-the-art tokenizer for German and English web and
social media texts. It won the `EmpiriST 2015 shared task
`_ on automatic
linguistic annotation of computer-mediated communication / social
media. As such, it is particularly well-suited to perform tokenization
on all kinds of written discourse, for example chats, forums, wiki
talk pages, tweets, blog comments, social networks, SMS and WhatsApp
dialogues.
More detailed documentation is available `here
`_.
%package help
Summary: Development documents and examples for SoMaJo
Provides: python3-SoMaJo-doc
%description help
SoMaJo is a state-of-the-art tokenizer for German and English web and
social media texts. It won the `EmpiriST 2015 shared task
`_ on automatic
linguistic annotation of computer-mediated communication / social
media. As such, it is particularly well-suited to perform tokenization
on all kinds of written discourse, for example chats, forums, wiki
talk pages, tweets, blog comments, social networks, SMS and WhatsApp
dialogues.
More detailed documentation is available `here
`_.
%prep
%autosetup -n SoMaJo-2.2.3
%build
%py3_build
%install
%py3_install
install -d -m755 %{buildroot}/%{_pkgdocdir}
if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
pushd %{buildroot}
if [ -d usr/lib ]; then
find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/lib64 ]; then
find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/bin ]; then
find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/sbin ]; then
find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
fi
touch doclist.lst
if [ -d usr/share/man ]; then
find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
fi
popd
mv %{buildroot}/filelist.lst .
mv %{buildroot}/doclist.lst .
%files -n python3-SoMaJo -f filelist.lst
%dir %{python3_sitelib}/*
%files help -f doclist.lst
%{_docdir}/*
%changelog
* Mon May 15 2023 Python_Bot - 2.2.3-1
- Package Spec generated