diff options
author | CoprDistGit <infra@openeuler.org> | 2023-05-05 06:34:28 +0000 |
---|---|---|
committer | CoprDistGit <infra@openeuler.org> | 2023-05-05 06:34:28 +0000 |
commit | cbf555f526557e6c6a9485a612ede2fcb5d5595f (patch) | |
tree | 078e01beaa282cd712527df42f06a681badd5930 | |
parent | 7fa147f2c1c286239adaceec0595573c6f2d78ff (diff) |
automatic import of python-folia-toolsopeneuler20.03
-rw-r--r-- | .gitignore | 1 | ||||
-rw-r--r-- | python-folia-tools.spec | 162 | ||||
-rw-r--r-- | sources | 1 |
3 files changed, 164 insertions, 0 deletions
@@ -0,0 +1 @@ +/FoLiA-tools-2.5.4.tar.gz diff --git a/python-folia-tools.spec b/python-folia-tools.spec new file mode 100644 index 0000000..fc1ca93 --- /dev/null +++ b/python-folia-tools.spec @@ -0,0 +1,162 @@ +%global _empty_manifest_terminate_build 0 +Name: python-FoLiA-tools +Version: 2.5.4 +Release: 1 +Summary: FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation) +License: GPL +URL: https://proycon.github.io/folia +Source0: https://mirrors.nju.edu.cn/pypi/web/packages/0e/ab/eef9d57cabb8604535b36b17c16a10c9630c277e24f5395ec999dbfaba7e/FoLiA-tools-2.5.4.tar.gz +BuildArch: noarch + + +%description +A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available: +- ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below. +- ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language). +- ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against + a gold standard reference or as a measure of inter-annotated agreement. +- ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text + from any FoLiA document. +- ``folia2annotatedtxt`` -- Like above, but produces output simple + token annotations inline, by appending them directly to the word using a specific delimiter. +- ``folia2columns`` -- This conversion tool reads a FoLiA XML document + and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format! +- ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations. +- ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi) +- ``foliatree`` -- Outputs the hierarchy of a FoLiA document. +- ``foliacat`` -- Concatenate multiple FoLiA documents. +- ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements. +- ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it. +- ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document. +- ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures +- ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements. +- ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents. +- ``foliamerge`` -- Merges annotations from two or more FoLiA documents. +- ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level). +- ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version. +- ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML +- ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML +- ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML. +- ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_. +- ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below. +- ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below. +All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses. + +%package -n python3-FoLiA-tools +Summary: FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation) +Provides: python-FoLiA-tools +BuildRequires: python3-devel +BuildRequires: python3-setuptools +BuildRequires: python3-pip +%description -n python3-FoLiA-tools +A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available: +- ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below. +- ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language). +- ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against + a gold standard reference or as a measure of inter-annotated agreement. +- ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text + from any FoLiA document. +- ``folia2annotatedtxt`` -- Like above, but produces output simple + token annotations inline, by appending them directly to the word using a specific delimiter. +- ``folia2columns`` -- This conversion tool reads a FoLiA XML document + and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format! +- ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations. +- ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi) +- ``foliatree`` -- Outputs the hierarchy of a FoLiA document. +- ``foliacat`` -- Concatenate multiple FoLiA documents. +- ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements. +- ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it. +- ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document. +- ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures +- ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements. +- ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents. +- ``foliamerge`` -- Merges annotations from two or more FoLiA documents. +- ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level). +- ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version. +- ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML +- ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML +- ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML. +- ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_. +- ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below. +- ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below. +All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses. + +%package help +Summary: Development documents and examples for FoLiA-tools +Provides: python3-FoLiA-tools-doc +%description help +A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available: +- ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below. +- ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language). +- ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against + a gold standard reference or as a measure of inter-annotated agreement. +- ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text + from any FoLiA document. +- ``folia2annotatedtxt`` -- Like above, but produces output simple + token annotations inline, by appending them directly to the word using a specific delimiter. +- ``folia2columns`` -- This conversion tool reads a FoLiA XML document + and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format! +- ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations. +- ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi) +- ``foliatree`` -- Outputs the hierarchy of a FoLiA document. +- ``foliacat`` -- Concatenate multiple FoLiA documents. +- ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements. +- ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it. +- ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document. +- ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures +- ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements. +- ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents. +- ``foliamerge`` -- Merges annotations from two or more FoLiA documents. +- ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level). +- ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version. +- ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML +- ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML +- ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML. +- ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_. +- ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below. +- ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below. +All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses. + +%prep +%autosetup -n FoLiA-tools-2.5.4 + +%build +%py3_build + +%install +%py3_install +install -d -m755 %{buildroot}/%{_pkgdocdir} +if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi +if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi +if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi +if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi +pushd %{buildroot} +if [ -d usr/lib ]; then + find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/lib64 ]; then + find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/bin ]; then + find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/sbin ]; then + find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst +fi +touch doclist.lst +if [ -d usr/share/man ]; then + find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst +fi +popd +mv %{buildroot}/filelist.lst . +mv %{buildroot}/doclist.lst . + +%files -n python3-FoLiA-tools -f filelist.lst +%dir %{python3_sitelib}/* + +%files help -f doclist.lst +%{_docdir}/* + +%changelog +* Fri May 05 2023 Python_Bot <Python_Bot@openeuler.org> - 2.5.4-1 +- Package Spec generated @@ -0,0 +1 @@ +ad2500917d9e4d271585b9783e81cf4d FoLiA-tools-2.5.4.tar.gz |