%global _empty_manifest_terminate_build 0 Name: python-FoLiA-tools Version: 2.5.4 Release: 1 Summary: FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation) License: GPL URL: https://proycon.github.io/folia Source0: https://mirrors.nju.edu.cn/pypi/web/packages/0e/ab/eef9d57cabb8604535b36b17c16a10c9630c277e24f5395ec999dbfaba7e/FoLiA-tools-2.5.4.tar.gz BuildArch: noarch %description A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available: - ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below. - ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language). - ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against a gold standard reference or as a measure of inter-annotated agreement. - ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text from any FoLiA document. - ``folia2annotatedtxt`` -- Like above, but produces output simple token annotations inline, by appending them directly to the word using a specific delimiter. - ``folia2columns`` -- This conversion tool reads a FoLiA XML document and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format! - ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations. - ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi) - ``foliatree`` -- Outputs the hierarchy of a FoLiA document. - ``foliacat`` -- Concatenate multiple FoLiA documents. - ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements. - ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it. - ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document. - ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures - ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements. - ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents. - ``foliamerge`` -- Merges annotations from two or more FoLiA documents. - ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level). - ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version. - ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML - ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML - ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML. - ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_. - ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below. - ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below. All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses. %package -n python3-FoLiA-tools Summary: FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation) Provides: python-FoLiA-tools BuildRequires: python3-devel BuildRequires: python3-setuptools BuildRequires: python3-pip %description -n python3-FoLiA-tools A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available: - ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below. - ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language). - ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against a gold standard reference or as a measure of inter-annotated agreement. - ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text from any FoLiA document. - ``folia2annotatedtxt`` -- Like above, but produces output simple token annotations inline, by appending them directly to the word using a specific delimiter. - ``folia2columns`` -- This conversion tool reads a FoLiA XML document and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format! - ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations. - ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi) - ``foliatree`` -- Outputs the hierarchy of a FoLiA document. - ``foliacat`` -- Concatenate multiple FoLiA documents. - ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements. - ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it. - ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document. - ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures - ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements. - ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents. - ``foliamerge`` -- Merges annotations from two or more FoLiA documents. - ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level). - ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version. - ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML - ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML - ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML. - ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_. - ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below. - ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below. All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses. %package help Summary: Development documents and examples for FoLiA-tools Provides: python3-FoLiA-tools-doc %description help A number of command-line tools are readily available for working with FoLiA, to various ends. The following tools are currently available: - ``foliavalidator`` -- Tests if documents are valid FoLiA XML. **Always use this to test your documents if you produce your own FoLiA documents!**. See the extra documentation in the dedicated scetion below. - ``foliaquery`` -- Advanced query tool that searches FoLiA documents for a specified pattern, or modifies a document according to the query. Supports FQL (FoLiA Query Language) and CQL (Corpus Query Language). - ``foliaeval`` -- Evaluation tool, can compute various evaluation metrics for selected annotation types, either against a gold standard reference or as a measure of inter-annotated agreement. - ``folia2txt`` -- Convert FoLiA XML to plain text (pure text, without any annotations). Use this to extract plain text from any FoLiA document. - ``folia2annotatedtxt`` -- Like above, but produces output simple token annotations inline, by appending them directly to the word using a specific delimiter. - ``folia2columns`` -- This conversion tool reads a FoLiA XML document and produces a simple columned output format (including CSV) in which each token appears on one line. Note that only simple token annotations are supported and a lot of FoLiA data can not be intuitively expressed in a simple columned format! - ``folia2html`` -- Converts a FoLiA document to a semi-interactive HTML document, with limited support for certain token annotations. - ``folia2dcoi`` -- Convert FoLiA XML to D-Coi XML (only for annotations supported by D-Coi) - ``foliatree`` -- Outputs the hierarchy of a FoLiA document. - ``foliacat`` -- Concatenate multiple FoLiA documents. - ``foliacount`` -- This script reads a FoLiA XML document and counts certain structure elements. - ``foliacorrect`` -- A tool to deal with corrections in FoLiA, can automatically accept suggestions or strip all corrections so parsers that don't know how to handle corrections can process it. - ``foliaerase`` -- Erases one or more specified annotation types from the FoLiA document. - ``folialangid`` -- Does language detection on FoLiA documents, assigns language identifiers to different substructures - ``foliaid`` -- Assigns IDs to elements in FoLiA documents. Use this to automatically generate identifiers on certain (or all) elements. - ``foliafreqlist`` -- Output a frequency list on tokenised FoLiA documents. - ``foliamerge`` -- Merges annotations from two or more FoLiA documents. - ``foliatextcontent`` -- A tool for adding or stripping text redundancy (i.e. text associated with multiple structural levels), supports computing and adding offset information. Use this if you want to have text available on a different level (e.g. the global text level). - ``foliaupgrade`` -- Upgrades a document to the latest FoLiA version. - ``alpino2folia`` -- Convert Alpino-DS XML to FoLiA XML - ``dcoi2folia`` -- Convert D-Coi XML to FoLiA XML - ``conllu2folia`` -- Convert files in the `CONLL-U format <http://http://universaldependencies.org/format.html>`_ to FoLiA XML. - ``rst2folia`` -- Convert ReStructuredText, a lightweight non-intrusive text markup language, to FoLiA, using `docutils <http://docutils.sourceforge.net/>`_. - ``tei2folia`` -- Convert a subset of TEI to FoLiA. See the extra documentation in the section below. - ``folia2salt`` -- Convert FoLiA XML to `Salt <https://corpus-tools.org/salt/>`_, which in turn enables further conversions (annis, paula, TCF, TigerXML, and others) through `Pepper <https://corpus-tools.org/pepper/>`_. See the extra documentation in the dedicated section below. All of these tools are written in Python, and thus require a Python 3 installation to run. More tools are added as time progresses. %prep %autosetup -n FoLiA-tools-2.5.4 %build %py3_build %install %py3_install install -d -m755 %{buildroot}/%{_pkgdocdir} if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi pushd %{buildroot} if [ -d usr/lib ]; then find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/lib64 ]; then find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/bin ]; then find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/sbin ]; then find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst fi touch doclist.lst if [ -d usr/share/man ]; then find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst fi popd mv %{buildroot}/filelist.lst . mv %{buildroot}/doclist.lst . %files -n python3-FoLiA-tools -f filelist.lst %dir %{python3_sitelib}/* %files help -f doclist.lst %{_docdir}/* %changelog * Fri May 05 2023 Python_Bot <Python_Bot@openeuler.org> - 2.5.4-1 - Package Spec generated