summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorCoprDistGit <infra@openeuler.org>2023-05-05 06:34:28 +0000
committerCoprDistGit <infra@openeuler.org>2023-05-05 06:34:28 +0000
commitcbf555f526557e6c6a9485a612ede2fcb5d5595f (patch)
tree078e01beaa282cd712527df42f06a681badd5930
parent7fa147f2c1c286239adaceec0595573c6f2d78ff (diff)
automatic import of python-folia-toolsopeneuler20.03
-rw-r--r--.gitignore1
-rw-r--r--python-folia-tools.spec162
-rw-r--r--sources1
3 files changed, 164 insertions, 0 deletions
diff --git a/.gitignore b/.gitignore
index e69de29..1325e76 100644
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1 @@
+/FoLiA-tools-2.5.4.tar.gz
diff --git a/python-folia-tools.spec b/python-folia-tools.spec
new file mode 100644
index 0000000..fc1ca93
--- /dev/null
+++ b/python-folia-tools.spec
@@ -0,0 +1,162 @@
+%global _empty_manifest_terminate_build 0
+Name: python-FoLiA-tools
+Version: 2.5.4
+Release: 1
+Summary: FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation)
+License: GPL
+URL: https://proycon.github.io/folia
+Source0: https://mirrors.nju.edu.cn/pypi/web/packages/0e/ab/eef9d57cabb8604535b36b17c16a10c9630c277e24f5395ec999dbfaba7e/FoLiA-tools-2.5.4.tar.gz
+BuildArch: noarch
+
+
+%description
+A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available:
+- ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below.
+- ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language).
+- ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against
+ a gold standard reference or as a measure of inter-annotated agreement.
+- ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text
+ from any FoLiA document.
+- ``folia2annotatedtxt`` -- Like above, but produces output simple
+ token annotations inline, by appending them directly to the word using a specific delimiter.
+- ``folia2columns`` -- This conversion tool reads a FoLiA XML document
+ and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format!
+- ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations.
+- ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi)
+- ``foliatree`` -- Outputs the hierarchy of a FoLiA document.
+- ``foliacat`` -- Concatenate multiple FoLiA documents.
+- ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements.
+- ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it.
+- ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document.
+- ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures
+- ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements.
+- ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents.
+- ``foliamerge`` -- Merges annotations from two or more FoLiA documents.
+- ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level).
+- ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version.
+- ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML
+- ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML
+- ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML.
+- ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_.
+- ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below.
+- ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below.
+All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses.
+
+%package -n python3-FoLiA-tools
+Summary: FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation)
+Provides: python-FoLiA-tools
+BuildRequires: python3-devel
+BuildRequires: python3-setuptools
+BuildRequires: python3-pip
+%description -n python3-FoLiA-tools
+A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available:
+- ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below.
+- ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language).
+- ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against
+ a gold standard reference or as a measure of inter-annotated agreement.
+- ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text
+ from any FoLiA document.
+- ``folia2annotatedtxt`` -- Like above, but produces output simple
+ token annotations inline, by appending them directly to the word using a specific delimiter.
+- ``folia2columns`` -- This conversion tool reads a FoLiA XML document
+ and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format!
+- ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations.
+- ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi)
+- ``foliatree`` -- Outputs the hierarchy of a FoLiA document.
+- ``foliacat`` -- Concatenate multiple FoLiA documents.
+- ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements.
+- ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it.
+- ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document.
+- ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures
+- ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements.
+- ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents.
+- ``foliamerge`` -- Merges annotations from two or more FoLiA documents.
+- ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level).
+- ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version.
+- ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML
+- ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML
+- ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML.
+- ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_.
+- ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below.
+- ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below.
+All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses.
+
+%package help
+Summary: Development documents and examples for FoLiA-tools
+Provides: python3-FoLiA-tools-doc
+%description help
+A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available:
+- ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below.
+- ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language).
+- ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against
+ a gold standard reference or as a measure of inter-annotated agreement.
+- ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text
+ from any FoLiA document.
+- ``folia2annotatedtxt`` -- Like above, but produces output simple
+ token annotations inline, by appending them directly to the word using a specific delimiter.
+- ``folia2columns`` -- This conversion tool reads a FoLiA XML document
+ and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format!
+- ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations.
+- ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi)
+- ``foliatree`` -- Outputs the hierarchy of a FoLiA document.
+- ``foliacat`` -- Concatenate multiple FoLiA documents.
+- ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements.
+- ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it.
+- ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document.
+- ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures
+- ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements.
+- ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents.
+- ``foliamerge`` -- Merges annotations from two or more FoLiA documents.
+- ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level).
+- ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version.
+- ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML
+- ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML
+- ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML.
+- ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_.
+- ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below.
+- ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below.
+All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses.
+
+%prep
+%autosetup -n FoLiA-tools-2.5.4
+
+%build
+%py3_build
+
+%install
+%py3_install
+install -d -m755 %{buildroot}/%{_pkgdocdir}
+if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
+if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
+if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
+if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
+pushd %{buildroot}
+if [ -d usr/lib ]; then
+ find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/lib64 ]; then
+ find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/bin ]; then
+ find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+if [ -d usr/sbin ]; then
+ find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
+fi
+touch doclist.lst
+if [ -d usr/share/man ]; then
+ find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
+fi
+popd
+mv %{buildroot}/filelist.lst .
+mv %{buildroot}/doclist.lst .
+
+%files -n python3-FoLiA-tools -f filelist.lst
+%dir %{python3_sitelib}/*
+
+%files help -f doclist.lst
+%{_docdir}/*
+
+%changelog
+* Fri May 05 2023 Python_Bot <Python_Bot@openeuler.org> - 2.5.4-1
+- Package Spec generated
diff --git a/sources b/sources
new file mode 100644
index 0000000..1bf42a1
--- /dev/null
+++ b/sources
@@ -0,0 +1 @@
+ad2500917d9e4d271585b9783e81cf4d FoLiA-tools-2.5.4.tar.gz