%global _empty_manifest_terminate_build 0
Name:           python-tangled-up-in-unicode
Version:        0.2.0
Release:        1
Summary:        Access to the Unicode Character Database (UCD)
License:        BSD License
URL:            https://github.com/dylan-profiler/tangled-up-in-unicode
Source0:        https://mirrors.nju.edu.cn/pypi/web/packages/15/59/f6ae6399c25b61719ebcca245d9be0805ce0767fd12879b75f6d0b04331e/tangled_up_in_unicode-0.2.0.tar.gz
BuildArch:      noarch

%description
# Tangled up in Unicode

This module provides access to character properties for all Unicode characters, from the Unicode Character Database (UCD).

This module provides an alternative to Python's standard library [`unicodedata`](https://docs.python.org/3/library/unicodedata.html).
`Tangled up in Unicode` provides four main benefits compared to the standard library:

- The [latest version](http://www.unicode.org/versions/latest/) of the Unicode database is used.
- Adds human-readable class names (Property value aliases).
- Extends the properties to use more potential of the database.
- UCD version independent of Python version (Python 3.6 has UCD 9.0, 3.7 has UCD 11.0.0, 3.8 has 12.0.1, 3.9 has 13.0.0)

Note that Python 3 added Unicode support, but this is different from the UCD.
Unicode support handles storing and manipulating Unicode characters, while this package aims to provide properties of specific characters.
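
The dependence of the bundled UCD version on the interpreter can be checked with the standard library alone. The following is a minimal sketch that uses only the stdlib `unicodedata` module, not this package; `stdlib_properties` is a hypothetical helper name introduced here for illustration.

```python
import sys
import unicodedata


def stdlib_properties(char):
    """Collect the properties the standard library exposes for one character."""
    return {
        "Name": unicodedata.name(char),
        "Category (Short)": unicodedata.category(char),
        "Bidirectional (Short)": unicodedata.bidirectional(char),
        "Combining": unicodedata.combining(char),
        "Mirrored": unicodedata.mirrored(char),
        "East Asian Width (Short)": unicodedata.east_asian_width(char),
        "Decomposition": unicodedata.decomposition(char),
    }


# The UCD version is fixed by the interpreter that runs this script.
python_version = f"{sys.version_info.major}.{sys.version_info.minor}"
print(f"Python {python_version} bundles UCD {unicodedata.unidata_version}")

# Short codes only: the standard library has no long aliases, Script or Block.
for prop, value in stdlib_properties("$").items():
    print(f"{prop}: {value!r}")
```

Running this under different Python versions should report different UCD versions, which is the coupling this package aims to remove.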

## Example

The default lookup in `unicodedata` for `$`:

| Property                  | Value             |
|---------------------------|-------------------|
| Name                      | Dollar Sign       |
| Category (Short)          | Sc                |
| Bidirectional (Short)     | ET                |
| Combining                 | 0                 |
| Mirrored                  | 0                 |
| East Asian Width (Short)  | Na                |
| Decomposition             |                   |

Extra information provided by this package:

| Property                      | Value                 |
|-------------------------------|-----------------------|
| Category Alias (Long)         | Currency_Symbol       |
| Bidirectional Alias (Long)    | European_Terminator   |
| East Asian Width Alias (Long) | Narrow                |
| Script (Long)                 | Common                |
| Script (Short)                | Zyyy                  |
| Block (Long)                  | Basic_Latin           |
| Block (Short)                 | ASCII                 |
| PropList                      | Pattern_Syntax        |
| Uppercase Character           |                       |
| Lowercase Character           |                       |
| Titlecase Character           |                       |

## Properties comparison

| Property                  | `tangled-up-in-unicode`       | `unicodedata`         |
|---------------------------|-------------------------------|-----------------------|
| Name                      | ☑                             | ☑                     |
| Decimal                   | ☑                             | ☑                     |
| Digit                     | ☑                             | ☑                     |
| Numeric                   | ☑                             | ☑                     |
| Combining                 | ☑ + alias                     | ☑                     |
| Mirrored                  | ☑                             | ☑                     |
| Decomposition             | ☑                             | ☑                     |
| Category                  | ☑ + alias                     | ☑                     |
| Bidirectional             | ☑ + alias                     | ☑                     |
| East Asian Width          | ☑ + alias                     | ☑                     |
| Script                    | ☑ + alias                     | -                     |
| Block                     | ☑ + alias                     | -                     |
| Age                       | ☑ + alias                     | -                     |
| Binary Property Values    | ☑                             | -                     |
| Version                   | 14.0.0 ([latest](http://www.unicode.org/versions/latest/)) | 12.0.1 |

_Table 1: presence of properties is denoted by ☑ (Unicode Character 'BALLOT BOX WITH CHECK' (U+2611))._

## Usage

```python
import tangled_up_in_unicode as unicodedata
```

The package can be installed via pip:

```
pip install tangled-up-in-unicode
```

## Performance

The module is written in Python.
It can be compiled with Cython to gain [competitive performance](# "Meaning the null hypothesis of the two libraries having the same average runtime could not be rejected.") with the native library.

## Unsupported features

Some of the features in `unicodedata` are not supported.

| Feature               | `tangled-up-in-unicode`       | `unicodedata`         |
|-----------------------|-------------------------------|-----------------------|
| lookup                | -                             | ☑                     |
| normalize             | -                             | ☑                     |
| ucd_3_2_0             | -                             | ☑                     |

## Acknowledgements

Where possible, code and documentation of the original module are used.
This repository is part of the Dylan Profiling project.

%package -n python3-tangled-up-in-unicode
Summary:        Access to the Unicode Character Database (UCD)
Provides:       python-tangled-up-in-unicode
BuildRequires:  python3-devel
BuildRequires:  python3-setuptools
BuildRequires:  python3-pip

%description -n python3-tangled-up-in-unicode
# Tangled up in Unicode

This module provides access to character properties for all Unicode characters, from the Unicode Character Database (UCD).

This module provides an alternative to Python's standard library [`unicodedata`](https://docs.python.org/3/library/unicodedata.html).
`Tangled up in Unicode` provides four main benefits compared to the standard library:

- The [latest version](http://www.unicode.org/versions/latest/) of the Unicode database is used.
- Adds human-readable class names (Property value aliases).
- Extends the properties to use more potential of the database.
- UCD version independent of Python version (Python 3.6 has UCD 9.0, 3.7 has UCD 11.0.0, 3.8 has 12.0.1, 3.9 has 13.0.0)

Note that Python 3 added Unicode support, but this is different from the UCD.
Unicode support handles storing and manipulating Unicode characters, while this package aims to provide properties of specific characters.

## Example

The default lookup in `unicodedata` for `$`:

| Property                  | Value             |
|---------------------------|-------------------|
| Name                      | Dollar Sign       |
| Category (Short)          | Sc                |
| Bidirectional (Short)     | ET                |
| Combining                 | 0                 |
| Mirrored                  | 0                 |
| East Asian Width (Short)  | Na                |
| Decomposition             |                   |

Extra information provided by this package:

| Property                      | Value                 |
|-------------------------------|-----------------------|
| Category Alias (Long)         | Currency_Symbol       |
| Bidirectional Alias (Long)    | European_Terminator   |
| East Asian Width Alias (Long) | Narrow                |
| Script (Long)                 | Common                |
| Script (Short)                | Zyyy                  |
| Block (Long)                  | Basic_Latin           |
| Block (Short)                 | ASCII                 |
| PropList                      | Pattern_Syntax        |
| Uppercase Character           |                       |
| Lowercase Character           |                       |
| Titlecase Character           |                       |

## Properties comparison

| Property                  | `tangled-up-in-unicode`       | `unicodedata`         |
|---------------------------|-------------------------------|-----------------------|
| Name                      | ☑                             | ☑                     |
| Decimal                   | ☑                             | ☑                     |
| Digit                     | ☑                             | ☑                     |
| Numeric                   | ☑                             | ☑                     |
| Combining                 | ☑ + alias                     | ☑                     |
| Mirrored                  | ☑                             | ☑                     |
| Decomposition             | ☑                             | ☑                     |
| Category                  | ☑ + alias                     | ☑                     |
| Bidirectional             | ☑ + alias                     | ☑                     |
| East Asian Width          | ☑ + alias                     | ☑                     |
| Script                    | ☑ + alias                     | -                     |
| Block                     | ☑ + alias                     | -                     |
| Age                       | ☑ + alias                     | -                     |
| Binary Property Values    | ☑                             | -                     |
| Version                   | 14.0.0 ([latest](http://www.unicode.org/versions/latest/)) | 12.0.1 |

_Table 1: presence of properties is denoted by ☑ (Unicode Character 'BALLOT BOX WITH CHECK' (U+2611))._

## Usage

```python
import tangled_up_in_unicode as unicodedata
```

The package can be installed via pip:

```
pip install tangled-up-in-unicode
```

## Performance

The module is written in Python.
It can be compiled with Cython to gain [competitive performance](# "Meaning the null hypothesis of the two libraries having the same average runtime could not be rejected.") with the native library.

## Unsupported features

Some of the features in `unicodedata` are not supported.

| Feature               | `tangled-up-in-unicode`       | `unicodedata`         |
|-----------------------|-------------------------------|-----------------------|
| lookup                | -                             | ☑                     |
| normalize             | -                             | ☑                     |
| ucd_3_2_0             | -                             | ☑                     |

## Acknowledgements

Where possible, code and documentation of the original module are used.
This repository is part of the Dylan Profiling project.

%package help
Summary:        Development documents and examples for tangled-up-in-unicode
Provides:       python3-tangled-up-in-unicode-doc

%description help
# Tangled up in Unicode

This module provides access to character properties for all Unicode characters, from the Unicode Character Database (UCD).

This module provides an alternative to Python's standard library [`unicodedata`](https://docs.python.org/3/library/unicodedata.html).
`Tangled up in Unicode` provides four main benefits compared to the standard library:

- The [latest version](http://www.unicode.org/versions/latest/) of the Unicode database is used.
- Adds human-readable class names (Property value aliases).
- Extends the properties to use more potential of the database.
- UCD version independent of Python version (Python 3.6 has UCD 9.0, 3.7 has UCD 11.0.0, 3.8 has 12.0.1, 3.9 has 13.0.0)

Note that Python 3 added Unicode support, but this is different from the UCD.
Unicode support handles storing and manipulating Unicode characters, while this package aims to provide properties of specific characters.

## Example

The default lookup in `unicodedata` for `$`:

| Property                  | Value             |
|---------------------------|-------------------|
| Name                      | Dollar Sign       |
| Category (Short)          | Sc                |
| Bidirectional (Short)     | ET                |
| Combining                 | 0                 |
| Mirrored                  | 0                 |
| East Asian Width (Short)  | Na                |
| Decomposition             |                   |

Extra information provided by this package:

| Property                      | Value                 |
|-------------------------------|-----------------------|
| Category Alias (Long)         | Currency_Symbol       |
| Bidirectional Alias (Long)    | European_Terminator   |
| East Asian Width Alias (Long) | Narrow                |
| Script (Long)                 | Common                |
| Script (Short)                | Zyyy                  |
| Block (Long)                  | Basic_Latin           |
| Block (Short)                 | ASCII                 |
| PropList                      | Pattern_Syntax        |
| Uppercase Character           |                       |
| Lowercase Character           |                       |
| Titlecase Character           |                       |

## Properties comparison

| Property                  | `tangled-up-in-unicode`       | `unicodedata`         |
|---------------------------|-------------------------------|-----------------------|
| Name                      | ☑                             | ☑                     |
| Decimal                   | ☑                             | ☑                     |
| Digit                     | ☑                             | ☑                     |
| Numeric                   | ☑                             | ☑                     |
| Combining                 | ☑ + alias                     | ☑                     |
| Mirrored                  | ☑                             | ☑                     |
| Decomposition             | ☑                             | ☑                     |
| Category                  | ☑ + alias                     | ☑                     |
| Bidirectional             | ☑ + alias                     | ☑                     |
| East Asian Width          | ☑ + alias                     | ☑                     |
| Script                    | ☑ + alias                     | -                     |
| Block                     | ☑ + alias                     | -                     |
| Age                       | ☑ + alias                     | -                     |
| Binary Property Values    | ☑                             | -                     |
| Version                   | 14.0.0 ([latest](http://www.unicode.org/versions/latest/)) | 12.0.1 |

_Table 1: presence of properties is denoted by ☑ (Unicode Character 'BALLOT BOX WITH CHECK' (U+2611))._

## Usage

```python
import tangled_up_in_unicode as unicodedata
```

The package can be installed via pip:

```
pip install tangled-up-in-unicode
```

## Performance

The module is written in Python.
It can be compiled with Cython to gain [competitive performance](# "Meaning the null hypothesis of the two libraries having the same average runtime could not be rejected.") with the native library.

## Unsupported features

Some of the features in `unicodedata` are not supported.

| Feature               | `tangled-up-in-unicode`       | `unicodedata`         |
|-----------------------|-------------------------------|-----------------------|
| lookup                | -                             | ☑                     |
| normalize             | -                             | ☑                     |
| ucd_3_2_0             | -                             | ☑                     |

## Acknowledgements

Where possible, code and documentation of the original module are used.
This repository is part of the Dylan Profiling project.

%prep
%autosetup -n tangled-up-in-unicode-0.2.0

%build
%py3_build

%install
%py3_install
install -d -m755 %{buildroot}/%{_pkgdocdir}
if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi
if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi
if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi
if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi
pushd %{buildroot}
if [ -d usr/lib ]; then
    find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/lib64 ]; then
    find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/bin ]; then
    find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst
fi
if [ -d usr/sbin ]; then
    find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst
fi
touch doclist.lst
if [ -d usr/share/man ]; then
    find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst
fi
popd
mv %{buildroot}/filelist.lst .
mv %{buildroot}/doclist.lst .

%files -n python3-tangled-up-in-unicode -f filelist.lst
%dir %{python3_sitelib}/*

%files help -f doclist.lst
%{_docdir}/*

%changelog
* Mon Apr 10 2023 Python_Bot - 0.2.0-1
- Package Spec generated