diff options
| -rw-r--r-- | .gitignore | 1 | ||||
| -rw-r--r-- | python-wn.spec | 511 | ||||
| -rw-r--r-- | sources | 1 |
3 files changed, 513 insertions, 0 deletions
@@ -0,0 +1 @@ +/wn-0.9.4.tar.gz diff --git a/python-wn.spec b/python-wn.spec new file mode 100644 index 0000000..ee90d45 --- /dev/null +++ b/python-wn.spec @@ -0,0 +1,511 @@ +%global _empty_manifest_terminate_build 0 +Name: python-wn +Version: 0.9.4 +Release: 1 +Summary: Wordnet interface library +License: None +URL: https://pypi.org/project/wn/ +Source0: https://mirrors.nju.edu.cn/pypi/web/packages/4d/a9/612e69cf7621bf0e9f805c0e9b77ff6f931a88147872c769b4416d58ed62/wn-0.9.4.tar.gz +BuildArch: noarch + +Requires: python3-requests +Requires: python3-tomli +Requires: python3-typing-extensions +Requires: python3-sphinx +Requires: python3-furo +Requires: python3-sphinx-copybutton +Requires: python3-wn-editor +Requires: python3-pytest +Requires: python3-mypy +Requires: python3-flake8 +Requires: python3-nox +Requires: python3-types-requests +Requires: python3-starlette + +%description +Wn is a Python library for exploring information in wordnets. Install +it from PyPI and download some data: +```console +$ pip install wn +$ python -m wn download oewn:2022 # the Open English WordNet 2022 +``` +Then start exploring: +```python +>>> import wn +>>> en = wn.Wordnet('oewn:2022') # Create Wordnet object to query +>>> ss = en.synsets('win', pos='v')[0] # Get the first synset for 'win' +>>> ss.definition() # Get the synset's definition +'be the winner in a contest or competition; be victorious' +``` +## Features +- Multilingual by design; first-class support for wordnets in any language +- Interlingual queries via the [Collaborative Interlingual Index](https://github.com/globalwordnet/cili/) +- Six [similarity metrics](https://wn.readthedocs.io/en/latest/api/wn.similarity.html) +- Functions for [exploring taxonomies](https://wn.readthedocs.io/en/latest/api/wn.taxonomy.html) +- Support for [lemmatization] ([Morphy] for English is built-in) and unicode [normalization] +- Full support of the [WN-LMF 1.1](https://globalwordnet.github.io/schemas/) format, including word pronunciations and lexicon extensions +- SQL-based backend offers very fast startup and improved performance on many kinds of queries +[lemmatization]: https://wn.readthedocs.io/en/latest/guides/lemmatization.html#lemmatization +[normalization]: https://wn.readthedocs.io/en/latest/guides/lemmatization.html#normalization +[Morphy]: https://wn.readthedocs.io/en/latest/api/wn.morphy.html +## Available Wordnets +Any WN-LMF-formatted wordnet can be added to Wn's database from a local +file or remote URL, but Wn also maintains an index (see +[wn/index.toml](https://github.com/goodmami/wn/blob/main/wn/index.toml)) +of available projects, similar to a package manager for software, to aid +in the discovery and downloading of new wordnets. The projects in this +index are listed below. +### English Wordnets +There are several English wordnets available. In general it is +recommended to use the latest [Open English Wordnet], but if you have +stricter compatibility needs for, e.g., experiment replicability, you +may try the [OMW English Wordnet based on WordNet 3.0] (compatible with +the Princeton WordNet 3.0 and with the [NLTK]), or [OpenWordnet-EN] (for +use with the Portuguese wordnet [OpenWordnet-PT]). +| Name | Specifier | # Synsets | Notes | +| ------------------------------------------ | ---------------------- | --------: | ----- | +| [Open English WordNet] | `oewn:2022`<br/> `oewn:2021`<br/> `ewn:2020`<br/> `ewn:2019` | 120068<br/>120039<br/>120053<br/>117791 | Recommended<br/> <br/> <br/> | +| [OMW English Wordnet based on WordNet 3.0] | `omw-en:1.4` | 117659 | Included with `omw:1.4` | +| [OMW English Wordnet based on WordNet 3.1] | `omw-en31:1.4` | 117791 | | +| [OpenWordnet-EN] | `own-en:1.0.0` | 117659 | Included with `own:1.0.0` | +[Open English WordNet]: https://en-word.net +[Open Multilingual Wordnet]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.0]: https://github.com/omwn/omw-data +[OMW English Wordnet based on WordNet 3.1]: https://github.com/omwn/omw-data +[OpenWordnet-EN]: https://github.com/own-pt/openWordnet-PT +[OpenWordnet-PT]: https://github.com/own-pt/openWordnet-PT +[NLTK]: https://www.nltk.org/ +### Other Wordnets and Collections +These are standalone non-English wordnets and collections. The wordnets +of each collection are listed further down. +| Name | Specifier | # Synsets | Language | +| ------------------------------------------ | ----------------------------- | --------------: | ---------------- | +| [Open Multilingual Wordnet] | `omw:1.4` | n/a | multiple [[mul]] | +| [Open German WordNet] | `odenet:1.4`<br/>`odenet:1.3` | 36268<br/>36159 | German [de] | +| [Open Wordnets for Portuguese and English] | `own:1.0.0` | n/a | multiple [[mul]] | +| [KurdNet] | `kurdnet:1.0` | 2144 | Kurdish [ckb] | +[Open English WordNet]: https://github.com/globalwordnet/english-wordnet +[Open Multilingual Wordnet]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.0]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.1]: https://github.com/omwn +[Open German WordNet]: https://github.com/hdaSprachtechnologie/odenet +[Open Wordnets for Portuguese and English]: https://github.com/own-pt +[mul]: https://iso639-3.sil.org/code/mul +[KurdNet]: https://sinaahmadi.github.io/resources/kurdnet.html +### Open Multilingual Wordnet (OMW) Collection +The *Open Multilingual Wordnet* collection (`omw:1.4`) installs the +following lexicons (from +[here](https://github.com/omwn/omw-data/releases/tag/v1.4)) which can +also be downloaded and installed independently: +| Name | Specifier | # Synsets | Language | +| ---------------------------------------- | -------------- | --------: | -------------------------------- | +| Albanet | `omw-sq:1.4` | 4675 | Albanian [sq] | +| Arabic WordNet (AWN v2) | `omw-arb:1.4` | 9916 | Arabic [arb] | +| BulTreeBank Wordnet (BTB-WN) | `omw-bg:1.4` | 4959 | Bulgarian [bg] | +| Chinese Open Wordnet | `omw-cmn:1.4` | 42312 | Mandarin (Simplified) [cmn-Hans] | +| Croatian Wordnet | `omw-hr:1.4` | 23120 | Croatian [hr] | +| DanNet | `omw-da:1.4` | 4476 | Danish [da] | +| FinnWordNet | `omw-fi:1.4` | 116763 | Finnish [fi] | +| Greek Wordnet | `omw-el:1.4` | 18049 | Greek [el] | +| Hebrew Wordnet | `omw-he:1.4` | 5448 | Hebrew [he] | +| IceWordNet | `omw-is:1.4` | 4951 | Icelandic [is] | +| Italian Wordnet | `omw-iwn:1.4` | 15563 | Italian [it] | +| Japanese Wordnet | `omw-ja:1.4` | 57184 | Japanese [ja] | +| Lithuanian WordNet | `omw-lt:1.4` | 9462 | Lithuanian [lt] | +| Multilingual Central Repository | `omw-ca:1.4` | 45826 | Catalan [ca] | +| Multilingual Central Repository | `omw-eu:1.4` | 29413 | Basque [eu] | +| Multilingual Central Repository | `omw-gl:1.4` | 19312 | Galician [gl] | +| Multilingual Central Repository | `omw-es:1.4` | 38512 | Spanish [es] | +| MultiWordNet | `omw-it:1.4` | 35001 | Italian [it] | +| Norwegian Wordnet | `omw-nb:1.4` | 4455 | Norwegian (Bokmål) [nb] | +| Norwegian Wordnet | `omw-nn:1.4` | 3671 | Norwegian (Nynorsk) [nn] | +| OMW English Wordnet based on WordNet 3.0 | `omw-en:1.4` | 117659 | English [en] | +| Open Dutch WordNet | `omw-nl:1.4` | 30177 | Dutch [nl] | +| OpenWN-PT | `omw-pt:1.4` | 43895 | Portuguese [pt] | +| plWordNet | `omw-pl:1.4` | 33826 | Polish [pl] | +| Romanian Wordnet | `omw-ro:1.4` | 56026 | Romanian [ro] | +| Slovak WordNet | `omw-sk:1.4` | 18507 | Slovak [sk] | +| sloWNet | `omw-sl:1.4` | 42583 | Slovenian [sl] | +| Swedish (SALDO) | `omw-sv:1.4` | 6796 | Swedish [sv] | +| Thai Wordnet | `omw-th:1.4` | 73350 | Thai [th] | +| WOLF (Wordnet Libre du Français) | `omw-fr:1.4` | 59091 | French [fr] | +| Wordnet Bahasa | `omw-id:1.4` | 38085 | Indonesian [id] | +| Wordnet Bahasa | `omw-zsm:1.4` | 36911 | Malaysian [zsm] | +### Open Wordnet (OWN) Collection +The *Open Wordnets for Portuguese and English* collection (`own:1.0.0`) +installs the following lexicons (from +[here](https://github.com/own-pt/openWordnet-PT/releases/tag/v1.0.0)) +which can also be downloaded and installed independently: +| Name | Specifier | # Synsets | Language | +| -------------- | -------------- | --------: | --------------- | +| OpenWordnet-PT | `own-pt:1.0.0` | 52670 | Portuguese [pt] | +| OpenWordnet-EN | `own-en:1.0.0` | 117659 | English [en] | +### Collaborative Interlingual Index +While not a wordnet, the [Collaborative Interlingual Index] (CILI) +represents the interlingual backbone of many wordnets. Wn, including +interlingual queries, will function without CILI loaded, but adding it +to the database makes available the full list of concepts, their status +(active, deprecated, etc.), and their definitions. +| Name | Specifier | # Concepts | +| ---------------------------------- | ---------- | ---------: | +| [Collaborative Interlingual Index] | `cili:1.0` | 117659 | +[Collaborative Interlingual Index]: https://github.com/globalwordnet/cili/ +## Changes to the Index +### `ewn` → `oewn` +The 2021 version of the *Open English WordNet* (`oewn:2021`) has +changed its lexicon ID from `ewn` to `oewn`, so the index is updated +accordingly. The previous versions are still available as `ewn:2019` +and `ewn:2020`. +### `pwn` → `omw-en`, `omw-en31` +The wordnet formerly called the *Princeton WordNet* (`pwn:3.0`, +`pwn:3.1`) is now called the *OMW English Wordnet based on WordNet +3.0* (`omw-en`) and the *OMW English Wordnet based on WordNet 3.1* +(`omw-en31`). This is more accurate, as it is a OMW-produced +derivative of the original WordNet data, and it also avoids license or +trademark issues. +### `*wn` → `omw-*` for OMW wordnets +All OMW wordnets have changed their ID scheme from `...wn` to `omw-..` and the version no longer +includes `+omw` (e.g., `bulwn:1.3+omw` is now `omw-bg:1.4`). + +%package -n python3-wn +Summary: Wordnet interface library +Provides: python-wn +BuildRequires: python3-devel +BuildRequires: python3-setuptools +BuildRequires: python3-pip +%description -n python3-wn +Wn is a Python library for exploring information in wordnets. Install +it from PyPI and download some data: +```console +$ pip install wn +$ python -m wn download oewn:2022 # the Open English WordNet 2022 +``` +Then start exploring: +```python +>>> import wn +>>> en = wn.Wordnet('oewn:2022') # Create Wordnet object to query +>>> ss = en.synsets('win', pos='v')[0] # Get the first synset for 'win' +>>> ss.definition() # Get the synset's definition +'be the winner in a contest or competition; be victorious' +``` +## Features +- Multilingual by design; first-class support for wordnets in any language +- Interlingual queries via the [Collaborative Interlingual Index](https://github.com/globalwordnet/cili/) +- Six [similarity metrics](https://wn.readthedocs.io/en/latest/api/wn.similarity.html) +- Functions for [exploring taxonomies](https://wn.readthedocs.io/en/latest/api/wn.taxonomy.html) +- Support for [lemmatization] ([Morphy] for English is built-in) and unicode [normalization] +- Full support of the [WN-LMF 1.1](https://globalwordnet.github.io/schemas/) format, including word pronunciations and lexicon extensions +- SQL-based backend offers very fast startup and improved performance on many kinds of queries +[lemmatization]: https://wn.readthedocs.io/en/latest/guides/lemmatization.html#lemmatization +[normalization]: https://wn.readthedocs.io/en/latest/guides/lemmatization.html#normalization +[Morphy]: https://wn.readthedocs.io/en/latest/api/wn.morphy.html +## Available Wordnets +Any WN-LMF-formatted wordnet can be added to Wn's database from a local +file or remote URL, but Wn also maintains an index (see +[wn/index.toml](https://github.com/goodmami/wn/blob/main/wn/index.toml)) +of available projects, similar to a package manager for software, to aid +in the discovery and downloading of new wordnets. The projects in this +index are listed below. +### English Wordnets +There are several English wordnets available. In general it is +recommended to use the latest [Open English Wordnet], but if you have +stricter compatibility needs for, e.g., experiment replicability, you +may try the [OMW English Wordnet based on WordNet 3.0] (compatible with +the Princeton WordNet 3.0 and with the [NLTK]), or [OpenWordnet-EN] (for +use with the Portuguese wordnet [OpenWordnet-PT]). +| Name | Specifier | # Synsets | Notes | +| ------------------------------------------ | ---------------------- | --------: | ----- | +| [Open English WordNet] | `oewn:2022`<br/> `oewn:2021`<br/> `ewn:2020`<br/> `ewn:2019` | 120068<br/>120039<br/>120053<br/>117791 | Recommended<br/> <br/> <br/> | +| [OMW English Wordnet based on WordNet 3.0] | `omw-en:1.4` | 117659 | Included with `omw:1.4` | +| [OMW English Wordnet based on WordNet 3.1] | `omw-en31:1.4` | 117791 | | +| [OpenWordnet-EN] | `own-en:1.0.0` | 117659 | Included with `own:1.0.0` | +[Open English WordNet]: https://en-word.net +[Open Multilingual Wordnet]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.0]: https://github.com/omwn/omw-data +[OMW English Wordnet based on WordNet 3.1]: https://github.com/omwn/omw-data +[OpenWordnet-EN]: https://github.com/own-pt/openWordnet-PT +[OpenWordnet-PT]: https://github.com/own-pt/openWordnet-PT +[NLTK]: https://www.nltk.org/ +### Other Wordnets and Collections +These are standalone non-English wordnets and collections. The wordnets +of each collection are listed further down. +| Name | Specifier | # Synsets | Language | +| ------------------------------------------ | ----------------------------- | --------------: | ---------------- | +| [Open Multilingual Wordnet] | `omw:1.4` | n/a | multiple [[mul]] | +| [Open German WordNet] | `odenet:1.4`<br/>`odenet:1.3` | 36268<br/>36159 | German [de] | +| [Open Wordnets for Portuguese and English] | `own:1.0.0` | n/a | multiple [[mul]] | +| [KurdNet] | `kurdnet:1.0` | 2144 | Kurdish [ckb] | +[Open English WordNet]: https://github.com/globalwordnet/english-wordnet +[Open Multilingual Wordnet]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.0]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.1]: https://github.com/omwn +[Open German WordNet]: https://github.com/hdaSprachtechnologie/odenet +[Open Wordnets for Portuguese and English]: https://github.com/own-pt +[mul]: https://iso639-3.sil.org/code/mul +[KurdNet]: https://sinaahmadi.github.io/resources/kurdnet.html +### Open Multilingual Wordnet (OMW) Collection +The *Open Multilingual Wordnet* collection (`omw:1.4`) installs the +following lexicons (from +[here](https://github.com/omwn/omw-data/releases/tag/v1.4)) which can +also be downloaded and installed independently: +| Name | Specifier | # Synsets | Language | +| ---------------------------------------- | -------------- | --------: | -------------------------------- | +| Albanet | `omw-sq:1.4` | 4675 | Albanian [sq] | +| Arabic WordNet (AWN v2) | `omw-arb:1.4` | 9916 | Arabic [arb] | +| BulTreeBank Wordnet (BTB-WN) | `omw-bg:1.4` | 4959 | Bulgarian [bg] | +| Chinese Open Wordnet | `omw-cmn:1.4` | 42312 | Mandarin (Simplified) [cmn-Hans] | +| Croatian Wordnet | `omw-hr:1.4` | 23120 | Croatian [hr] | +| DanNet | `omw-da:1.4` | 4476 | Danish [da] | +| FinnWordNet | `omw-fi:1.4` | 116763 | Finnish [fi] | +| Greek Wordnet | `omw-el:1.4` | 18049 | Greek [el] | +| Hebrew Wordnet | `omw-he:1.4` | 5448 | Hebrew [he] | +| IceWordNet | `omw-is:1.4` | 4951 | Icelandic [is] | +| Italian Wordnet | `omw-iwn:1.4` | 15563 | Italian [it] | +| Japanese Wordnet | `omw-ja:1.4` | 57184 | Japanese [ja] | +| Lithuanian WordNet | `omw-lt:1.4` | 9462 | Lithuanian [lt] | +| Multilingual Central Repository | `omw-ca:1.4` | 45826 | Catalan [ca] | +| Multilingual Central Repository | `omw-eu:1.4` | 29413 | Basque [eu] | +| Multilingual Central Repository | `omw-gl:1.4` | 19312 | Galician [gl] | +| Multilingual Central Repository | `omw-es:1.4` | 38512 | Spanish [es] | +| MultiWordNet | `omw-it:1.4` | 35001 | Italian [it] | +| Norwegian Wordnet | `omw-nb:1.4` | 4455 | Norwegian (Bokmål) [nb] | +| Norwegian Wordnet | `omw-nn:1.4` | 3671 | Norwegian (Nynorsk) [nn] | +| OMW English Wordnet based on WordNet 3.0 | `omw-en:1.4` | 117659 | English [en] | +| Open Dutch WordNet | `omw-nl:1.4` | 30177 | Dutch [nl] | +| OpenWN-PT | `omw-pt:1.4` | 43895 | Portuguese [pt] | +| plWordNet | `omw-pl:1.4` | 33826 | Polish [pl] | +| Romanian Wordnet | `omw-ro:1.4` | 56026 | Romanian [ro] | +| Slovak WordNet | `omw-sk:1.4` | 18507 | Slovak [sk] | +| sloWNet | `omw-sl:1.4` | 42583 | Slovenian [sl] | +| Swedish (SALDO) | `omw-sv:1.4` | 6796 | Swedish [sv] | +| Thai Wordnet | `omw-th:1.4` | 73350 | Thai [th] | +| WOLF (Wordnet Libre du Français) | `omw-fr:1.4` | 59091 | French [fr] | +| Wordnet Bahasa | `omw-id:1.4` | 38085 | Indonesian [id] | +| Wordnet Bahasa | `omw-zsm:1.4` | 36911 | Malaysian [zsm] | +### Open Wordnet (OWN) Collection +The *Open Wordnets for Portuguese and English* collection (`own:1.0.0`) +installs the following lexicons (from +[here](https://github.com/own-pt/openWordnet-PT/releases/tag/v1.0.0)) +which can also be downloaded and installed independently: +| Name | Specifier | # Synsets | Language | +| -------------- | -------------- | --------: | --------------- | +| OpenWordnet-PT | `own-pt:1.0.0` | 52670 | Portuguese [pt] | +| OpenWordnet-EN | `own-en:1.0.0` | 117659 | English [en] | +### Collaborative Interlingual Index +While not a wordnet, the [Collaborative Interlingual Index] (CILI) +represents the interlingual backbone of many wordnets. Wn, including +interlingual queries, will function without CILI loaded, but adding it +to the database makes available the full list of concepts, their status +(active, deprecated, etc.), and their definitions. +| Name | Specifier | # Concepts | +| ---------------------------------- | ---------- | ---------: | +| [Collaborative Interlingual Index] | `cili:1.0` | 117659 | +[Collaborative Interlingual Index]: https://github.com/globalwordnet/cili/ +## Changes to the Index +### `ewn` → `oewn` +The 2021 version of the *Open English WordNet* (`oewn:2021`) has +changed its lexicon ID from `ewn` to `oewn`, so the index is updated +accordingly. The previous versions are still available as `ewn:2019` +and `ewn:2020`. +### `pwn` → `omw-en`, `omw-en31` +The wordnet formerly called the *Princeton WordNet* (`pwn:3.0`, +`pwn:3.1`) is now called the *OMW English Wordnet based on WordNet +3.0* (`omw-en`) and the *OMW English Wordnet based on WordNet 3.1* +(`omw-en31`). This is more accurate, as it is a OMW-produced +derivative of the original WordNet data, and it also avoids license or +trademark issues. +### `*wn` → `omw-*` for OMW wordnets +All OMW wordnets have changed their ID scheme from `...wn` to `omw-..` and the version no longer +includes `+omw` (e.g., `bulwn:1.3+omw` is now `omw-bg:1.4`). + +%package help +Summary: Development documents and examples for wn +Provides: python3-wn-doc +%description help +Wn is a Python library for exploring information in wordnets. Install +it from PyPI and download some data: +```console +$ pip install wn +$ python -m wn download oewn:2022 # the Open English WordNet 2022 +``` +Then start exploring: +```python +>>> import wn +>>> en = wn.Wordnet('oewn:2022') # Create Wordnet object to query +>>> ss = en.synsets('win', pos='v')[0] # Get the first synset for 'win' +>>> ss.definition() # Get the synset's definition +'be the winner in a contest or competition; be victorious' +``` +## Features +- Multilingual by design; first-class support for wordnets in any language +- Interlingual queries via the [Collaborative Interlingual Index](https://github.com/globalwordnet/cili/) +- Six [similarity metrics](https://wn.readthedocs.io/en/latest/api/wn.similarity.html) +- Functions for [exploring taxonomies](https://wn.readthedocs.io/en/latest/api/wn.taxonomy.html) +- Support for [lemmatization] ([Morphy] for English is built-in) and unicode [normalization] +- Full support of the [WN-LMF 1.1](https://globalwordnet.github.io/schemas/) format, including word pronunciations and lexicon extensions +- SQL-based backend offers very fast startup and improved performance on many kinds of queries +[lemmatization]: https://wn.readthedocs.io/en/latest/guides/lemmatization.html#lemmatization +[normalization]: https://wn.readthedocs.io/en/latest/guides/lemmatization.html#normalization +[Morphy]: https://wn.readthedocs.io/en/latest/api/wn.morphy.html +## Available Wordnets +Any WN-LMF-formatted wordnet can be added to Wn's database from a local +file or remote URL, but Wn also maintains an index (see +[wn/index.toml](https://github.com/goodmami/wn/blob/main/wn/index.toml)) +of available projects, similar to a package manager for software, to aid +in the discovery and downloading of new wordnets. The projects in this +index are listed below. +### English Wordnets +There are several English wordnets available. In general it is +recommended to use the latest [Open English Wordnet], but if you have +stricter compatibility needs for, e.g., experiment replicability, you +may try the [OMW English Wordnet based on WordNet 3.0] (compatible with +the Princeton WordNet 3.0 and with the [NLTK]), or [OpenWordnet-EN] (for +use with the Portuguese wordnet [OpenWordnet-PT]). +| Name | Specifier | # Synsets | Notes | +| ------------------------------------------ | ---------------------- | --------: | ----- | +| [Open English WordNet] | `oewn:2022`<br/> `oewn:2021`<br/> `ewn:2020`<br/> `ewn:2019` | 120068<br/>120039<br/>120053<br/>117791 | Recommended<br/> <br/> <br/> | +| [OMW English Wordnet based on WordNet 3.0] | `omw-en:1.4` | 117659 | Included with `omw:1.4` | +| [OMW English Wordnet based on WordNet 3.1] | `omw-en31:1.4` | 117791 | | +| [OpenWordnet-EN] | `own-en:1.0.0` | 117659 | Included with `own:1.0.0` | +[Open English WordNet]: https://en-word.net +[Open Multilingual Wordnet]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.0]: https://github.com/omwn/omw-data +[OMW English Wordnet based on WordNet 3.1]: https://github.com/omwn/omw-data +[OpenWordnet-EN]: https://github.com/own-pt/openWordnet-PT +[OpenWordnet-PT]: https://github.com/own-pt/openWordnet-PT +[NLTK]: https://www.nltk.org/ +### Other Wordnets and Collections +These are standalone non-English wordnets and collections. The wordnets +of each collection are listed further down. +| Name | Specifier | # Synsets | Language | +| ------------------------------------------ | ----------------------------- | --------------: | ---------------- | +| [Open Multilingual Wordnet] | `omw:1.4` | n/a | multiple [[mul]] | +| [Open German WordNet] | `odenet:1.4`<br/>`odenet:1.3` | 36268<br/>36159 | German [de] | +| [Open Wordnets for Portuguese and English] | `own:1.0.0` | n/a | multiple [[mul]] | +| [KurdNet] | `kurdnet:1.0` | 2144 | Kurdish [ckb] | +[Open English WordNet]: https://github.com/globalwordnet/english-wordnet +[Open Multilingual Wordnet]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.0]: https://github.com/omwn +[OMW English Wordnet based on WordNet 3.1]: https://github.com/omwn +[Open German WordNet]: https://github.com/hdaSprachtechnologie/odenet +[Open Wordnets for Portuguese and English]: https://github.com/own-pt +[mul]: https://iso639-3.sil.org/code/mul +[KurdNet]: https://sinaahmadi.github.io/resources/kurdnet.html +### Open Multilingual Wordnet (OMW) Collection +The *Open Multilingual Wordnet* collection (`omw:1.4`) installs the +following lexicons (from +[here](https://github.com/omwn/omw-data/releases/tag/v1.4)) which can +also be downloaded and installed independently: +| Name | Specifier | # Synsets | Language | +| ---------------------------------------- | -------------- | --------: | -------------------------------- | +| Albanet | `omw-sq:1.4` | 4675 | Albanian [sq] | +| Arabic WordNet (AWN v2) | `omw-arb:1.4` | 9916 | Arabic [arb] | +| BulTreeBank Wordnet (BTB-WN) | `omw-bg:1.4` | 4959 | Bulgarian [bg] | +| Chinese Open Wordnet | `omw-cmn:1.4` | 42312 | Mandarin (Simplified) [cmn-Hans] | +| Croatian Wordnet | `omw-hr:1.4` | 23120 | Croatian [hr] | +| DanNet | `omw-da:1.4` | 4476 | Danish [da] | +| FinnWordNet | `omw-fi:1.4` | 116763 | Finnish [fi] | +| Greek Wordnet | `omw-el:1.4` | 18049 | Greek [el] | +| Hebrew Wordnet | `omw-he:1.4` | 5448 | Hebrew [he] | +| IceWordNet | `omw-is:1.4` | 4951 | Icelandic [is] | +| Italian Wordnet | `omw-iwn:1.4` | 15563 | Italian [it] | +| Japanese Wordnet | `omw-ja:1.4` | 57184 | Japanese [ja] | +| Lithuanian WordNet | `omw-lt:1.4` | 9462 | Lithuanian [lt] | +| Multilingual Central Repository | `omw-ca:1.4` | 45826 | Catalan [ca] | +| Multilingual Central Repository | `omw-eu:1.4` | 29413 | Basque [eu] | +| Multilingual Central Repository | `omw-gl:1.4` | 19312 | Galician [gl] | +| Multilingual Central Repository | `omw-es:1.4` | 38512 | Spanish [es] | +| MultiWordNet | `omw-it:1.4` | 35001 | Italian [it] | +| Norwegian Wordnet | `omw-nb:1.4` | 4455 | Norwegian (Bokmål) [nb] | +| Norwegian Wordnet | `omw-nn:1.4` | 3671 | Norwegian (Nynorsk) [nn] | +| OMW English Wordnet based on WordNet 3.0 | `omw-en:1.4` | 117659 | English [en] | +| Open Dutch WordNet | `omw-nl:1.4` | 30177 | Dutch [nl] | +| OpenWN-PT | `omw-pt:1.4` | 43895 | Portuguese [pt] | +| plWordNet | `omw-pl:1.4` | 33826 | Polish [pl] | +| Romanian Wordnet | `omw-ro:1.4` | 56026 | Romanian [ro] | +| Slovak WordNet | `omw-sk:1.4` | 18507 | Slovak [sk] | +| sloWNet | `omw-sl:1.4` | 42583 | Slovenian [sl] | +| Swedish (SALDO) | `omw-sv:1.4` | 6796 | Swedish [sv] | +| Thai Wordnet | `omw-th:1.4` | 73350 | Thai [th] | +| WOLF (Wordnet Libre du Français) | `omw-fr:1.4` | 59091 | French [fr] | +| Wordnet Bahasa | `omw-id:1.4` | 38085 | Indonesian [id] | +| Wordnet Bahasa | `omw-zsm:1.4` | 36911 | Malaysian [zsm] | +### Open Wordnet (OWN) Collection +The *Open Wordnets for Portuguese and English* collection (`own:1.0.0`) +installs the following lexicons (from +[here](https://github.com/own-pt/openWordnet-PT/releases/tag/v1.0.0)) +which can also be downloaded and installed independently: +| Name | Specifier | # Synsets | Language | +| -------------- | -------------- | --------: | --------------- | +| OpenWordnet-PT | `own-pt:1.0.0` | 52670 | Portuguese [pt] | +| OpenWordnet-EN | `own-en:1.0.0` | 117659 | English [en] | +### Collaborative Interlingual Index +While not a wordnet, the [Collaborative Interlingual Index] (CILI) +represents the interlingual backbone of many wordnets. Wn, including +interlingual queries, will function without CILI loaded, but adding it +to the database makes available the full list of concepts, their status +(active, deprecated, etc.), and their definitions. +| Name | Specifier | # Concepts | +| ---------------------------------- | ---------- | ---------: | +| [Collaborative Interlingual Index] | `cili:1.0` | 117659 | +[Collaborative Interlingual Index]: https://github.com/globalwordnet/cili/ +## Changes to the Index +### `ewn` → `oewn` +The 2021 version of the *Open English WordNet* (`oewn:2021`) has +changed its lexicon ID from `ewn` to `oewn`, so the index is updated +accordingly. The previous versions are still available as `ewn:2019` +and `ewn:2020`. +### `pwn` → `omw-en`, `omw-en31` +The wordnet formerly called the *Princeton WordNet* (`pwn:3.0`, +`pwn:3.1`) is now called the *OMW English Wordnet based on WordNet +3.0* (`omw-en`) and the *OMW English Wordnet based on WordNet 3.1* +(`omw-en31`). This is more accurate, as it is a OMW-produced +derivative of the original WordNet data, and it also avoids license or +trademark issues. +### `*wn` → `omw-*` for OMW wordnets +All OMW wordnets have changed their ID scheme from `...wn` to `omw-..` and the version no longer +includes `+omw` (e.g., `bulwn:1.3+omw` is now `omw-bg:1.4`). + +%prep +%autosetup -n wn-0.9.4 + +%build +%py3_build + +%install +%py3_install +install -d -m755 %{buildroot}/%{_pkgdocdir} +if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi +if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi +if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi +if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi +pushd %{buildroot} +if [ -d usr/lib ]; then + find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/lib64 ]; then + find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/bin ]; then + find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/sbin ]; then + find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst +fi +touch doclist.lst +if [ -d usr/share/man ]; then + find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst +fi +popd +mv %{buildroot}/filelist.lst . +mv %{buildroot}/doclist.lst . + +%files -n python3-wn -f filelist.lst +%dir %{python3_sitelib}/* + +%files help -f doclist.lst +%{_docdir}/* + +%changelog +* Wed May 10 2023 Python_Bot <Python_Bot@openeuler.org> - 0.9.4-1 +- Package Spec generated @@ -0,0 +1 @@ +2bbb09c9995677cf3e101d319e61ed2f wn-0.9.4.tar.gz |
