diff options
| author | CoprDistGit <infra@openeuler.org> | 2023-04-10 15:02:24 +0000 |
|---|---|---|
| committer | CoprDistGit <infra@openeuler.org> | 2023-04-10 15:02:24 +0000 |
| commit | a4260d08cd7117e27359418bb69d4f9c8a076558 (patch) | |
| tree | e5cce581049136d352f93c5de58c84e430dbdb67 /python-mecab-python3.spec | |
| parent | 82138923b7cc1ebcad051bde120eb9e8742d53a4 (diff) | |
automatic import of python-mecab-python3
Diffstat (limited to 'python-mecab-python3.spec')
| -rw-r--r-- | python-mecab-python3.spec | 464 |
1 files changed, 464 insertions, 0 deletions
diff --git a/python-mecab-python3.spec b/python-mecab-python3.spec new file mode 100644 index 0000000..57fa9aa --- /dev/null +++ b/python-mecab-python3.spec @@ -0,0 +1,464 @@ +%global _empty_manifest_terminate_build 0 +Name: python-mecab-python3 +Version: 1.0.6 +Release: 1 +Summary: Python wrapper for the MeCab morphological analyzer for Japanese +License: BSD +URL: https://github.com/SamuraiT/mecab-python3 +Source0: https://mirrors.nju.edu.cn/pypi/web/packages/94/02/6a7146d3a4b4a47fb4c67346e25624cbcd72cefbe6d08107b1449e262abc/mecab-python3-1.0.6.tar.gz + + +%description +This is a Python wrapper for the [MeCab][] morphological analyzer for Japanese +text. It works with Python 3.6 and greater; if you need to use Python 2.7, use +v1.0.2. + +**Note:** If using MacOS Big Sur, you'll need to upgrade pip to version 20.3 or +higher to use wheels due to a pip issue. + +**issueを英語で書く必要はありません。** + +[MeCab]: https://taku910.github.io/mecab/ + +Note that Windows wheels require a [Microsoft Visual C++ +Redistributable][msvc], so be sure to install that. + +[msvc]: https://support.microsoft.com/en-us/help/2977003/the-latest-supported-visual-c-downloads + +# Basic usage + +```py +>>> import MeCab +>>> wakati = MeCab.Tagger("-Owakati") +>>> wakati.parse("pythonが大好きです").split() +['python', 'が', '大好き', 'です'] + +>>> tagger = MeCab.Tagger() +>>> print(tagger.parse("pythonが大好きです")) +python python python python 名詞-普通名詞-一般 +が ガ ガ が 助詞-格助詞 +大好き ダイスキ ダイスキ 大好き 形状詞-一般 +です デス デス です 助動詞 助動詞-デス 終止形-一般 +EOS +``` + +The API for `mecab-python3` closely follows the API for MeCab itself, +even when this makes it not very “Pythonic.” Please consult the [official MeCab +documentation][mecab-docs] for more information. + +[mecab-docs]: https://taku910.github.io/mecab/ + +# Installation + +Binary wheels are available for MacOS X, Linux, and Windows (64bit) are +installed by default when you use `pip`: + +```sh +pip install mecab-python3 +``` + +These wheels include a copy of the MeCab library, but not a dictionary. In +order to use MeCab you'll need to install a dictionary. `unidic-lite` is a good +one to start with: + +```sh +pip install unidic-lite +``` + +To build from source using pip, + +```sh +pip install --no-binary :all: mecab-python3 +``` + +## Dictionaries + +In order to use MeCab, you must install a dictionary. There are many different dictionaries available for MeCab. These UniDic packages, which include slight modifications for ease of use, are recommended: + +- [unidic](https://github.com/polm/unidic-py): The latest full UniDic. +- [unidic-lite](https://github.com/polm/unidic-lite): A slightly modified UniDic 2.1.2, chosen for its small size. + +The dictionaries below are not recommended due to being unmaintained for many years, but they are available for use with legacy applications. + +- [ipadic](https://github.com/polm/ipadic-py) +- [jumandic](https://github.com/polm/jumandic-py) + +For more details on the differences between dictionaries see [here](https://www.dampfkraft.com/nlp/japanese-tokenizer-dictionaries.html). + +# Common Issues + +If you get a `RuntimeError` when you try to run MeCab, here are some things to check: + +## Windows Redistributable + +You have to install [this][msvc] to use this package on Windows. + +## Installing a Dictionary + +Run `pip install unidic-lite` and confirm that works. If that fixes your +problem, you either don't have a dictionary installed, or you need to specify +your dictionary path like this: + + tagger = MeCab.Tagger('-r /dev/null -d /usr/local/lib/mecab/dic/mydic') + +Note: on Windows, use `nul` instead of `/dev/null`. Alternately, if you have a +`mecabrc` you can use the path after `-r`. + +## Specifying a mecabrc + +If you get this error: + + error message: [ifs] no such file or directory: /usr/local/etc/mecabrc + +You need to specify a `mecabrc` file. It's OK to specify an empty file, it just +has to exist. You can specify a `mecabrc` with `-r`. This may be necessary on +Debian or Ubuntu, where the `mecabrc` is in `/etc/mecabrc`. + +You can specify an empty `mecabrc` like this: + + tagger = MeCab.Tagger('-r/dev/null -d/home/hoge/mydic') + +## Using Unsupported Output Modes like `-Ochasen` + +Chasen output is not a built-in feature of MeCab, you must specify it in your +`dicrc` or `mecabrc`. Notably, Unidic does not include Chasen output format. +Please see [the MeCab documentation](https://taku910.github.io/mecab/#format). + +# Alternatives + +- [fugashi](https://github.com/polm/fugashi) is a Cython wrapper for MeCab with a Pythonic interface, by the current maintainer of this library +- [SudachiPy](https://github.com/WorksApplications/sudachi.rs) is a modern tokenizer with an actively maintained dictionary +- [pymecab-ko](https://github.com/NoUnique/pymecab-ko) is a wrapper of the Korean MeCab fork [mecab-ko](https://bitbucket.org/eunjeon/mecab-ko/src/master/) based on mecab-python3 +- [KoNLPy](https://konlpy.org/en/latest/) is a library for Korean NLP that includes a MeCab wrapper + +# Licensing + +Like MeCab itself, `mecab-python3` is copyrighted free software by +Taku Kudo <taku@chasen.org> and Nippon Telegraph and Telephone Corporation, +and is distributed under a 3-clause BSD license (see the file `BSD`). +Alternatively, it may be redistributed under the terms of the +GNU General Public License, version 2 (see the file `GPL`) or the +GNU Lesser General Public License, version 2.1 (see the file `LGPL`). + + +%package -n python3-mecab-python3 +Summary: Python wrapper for the MeCab morphological analyzer for Japanese +Provides: python-mecab-python3 +BuildRequires: python3-devel +BuildRequires: python3-setuptools +BuildRequires: python3-pip +BuildRequires: python3-cffi +BuildRequires: gcc +BuildRequires: gdb +%description -n python3-mecab-python3 +This is a Python wrapper for the [MeCab][] morphological analyzer for Japanese +text. It works with Python 3.6 and greater; if you need to use Python 2.7, use +v1.0.2. + +**Note:** If using MacOS Big Sur, you'll need to upgrade pip to version 20.3 or +higher to use wheels due to a pip issue. + +**issueを英語で書く必要はありません。** + +[MeCab]: https://taku910.github.io/mecab/ + +Note that Windows wheels require a [Microsoft Visual C++ +Redistributable][msvc], so be sure to install that. + +[msvc]: https://support.microsoft.com/en-us/help/2977003/the-latest-supported-visual-c-downloads + +# Basic usage + +```py +>>> import MeCab +>>> wakati = MeCab.Tagger("-Owakati") +>>> wakati.parse("pythonが大好きです").split() +['python', 'が', '大好き', 'です'] + +>>> tagger = MeCab.Tagger() +>>> print(tagger.parse("pythonが大好きです")) +python python python python 名詞-普通名詞-一般 +が ガ ガ が 助詞-格助詞 +大好き ダイスキ ダイスキ 大好き 形状詞-一般 +です デス デス です 助動詞 助動詞-デス 終止形-一般 +EOS +``` + +The API for `mecab-python3` closely follows the API for MeCab itself, +even when this makes it not very “Pythonic.” Please consult the [official MeCab +documentation][mecab-docs] for more information. + +[mecab-docs]: https://taku910.github.io/mecab/ + +# Installation + +Binary wheels are available for MacOS X, Linux, and Windows (64bit) are +installed by default when you use `pip`: + +```sh +pip install mecab-python3 +``` + +These wheels include a copy of the MeCab library, but not a dictionary. In +order to use MeCab you'll need to install a dictionary. `unidic-lite` is a good +one to start with: + +```sh +pip install unidic-lite +``` + +To build from source using pip, + +```sh +pip install --no-binary :all: mecab-python3 +``` + +## Dictionaries + +In order to use MeCab, you must install a dictionary. There are many different dictionaries available for MeCab. These UniDic packages, which include slight modifications for ease of use, are recommended: + +- [unidic](https://github.com/polm/unidic-py): The latest full UniDic. +- [unidic-lite](https://github.com/polm/unidic-lite): A slightly modified UniDic 2.1.2, chosen for its small size. + +The dictionaries below are not recommended due to being unmaintained for many years, but they are available for use with legacy applications. + +- [ipadic](https://github.com/polm/ipadic-py) +- [jumandic](https://github.com/polm/jumandic-py) + +For more details on the differences between dictionaries see [here](https://www.dampfkraft.com/nlp/japanese-tokenizer-dictionaries.html). + +# Common Issues + +If you get a `RuntimeError` when you try to run MeCab, here are some things to check: + +## Windows Redistributable + +You have to install [this][msvc] to use this package on Windows. + +## Installing a Dictionary + +Run `pip install unidic-lite` and confirm that works. If that fixes your +problem, you either don't have a dictionary installed, or you need to specify +your dictionary path like this: + + tagger = MeCab.Tagger('-r /dev/null -d /usr/local/lib/mecab/dic/mydic') + +Note: on Windows, use `nul` instead of `/dev/null`. Alternately, if you have a +`mecabrc` you can use the path after `-r`. + +## Specifying a mecabrc + +If you get this error: + + error message: [ifs] no such file or directory: /usr/local/etc/mecabrc + +You need to specify a `mecabrc` file. It's OK to specify an empty file, it just +has to exist. You can specify a `mecabrc` with `-r`. This may be necessary on +Debian or Ubuntu, where the `mecabrc` is in `/etc/mecabrc`. + +You can specify an empty `mecabrc` like this: + + tagger = MeCab.Tagger('-r/dev/null -d/home/hoge/mydic') + +## Using Unsupported Output Modes like `-Ochasen` + +Chasen output is not a built-in feature of MeCab, you must specify it in your +`dicrc` or `mecabrc`. Notably, Unidic does not include Chasen output format. +Please see [the MeCab documentation](https://taku910.github.io/mecab/#format). + +# Alternatives + +- [fugashi](https://github.com/polm/fugashi) is a Cython wrapper for MeCab with a Pythonic interface, by the current maintainer of this library +- [SudachiPy](https://github.com/WorksApplications/sudachi.rs) is a modern tokenizer with an actively maintained dictionary +- [pymecab-ko](https://github.com/NoUnique/pymecab-ko) is a wrapper of the Korean MeCab fork [mecab-ko](https://bitbucket.org/eunjeon/mecab-ko/src/master/) based on mecab-python3 +- [KoNLPy](https://konlpy.org/en/latest/) is a library for Korean NLP that includes a MeCab wrapper + +# Licensing + +Like MeCab itself, `mecab-python3` is copyrighted free software by +Taku Kudo <taku@chasen.org> and Nippon Telegraph and Telephone Corporation, +and is distributed under a 3-clause BSD license (see the file `BSD`). +Alternatively, it may be redistributed under the terms of the +GNU General Public License, version 2 (see the file `GPL`) or the +GNU Lesser General Public License, version 2.1 (see the file `LGPL`). + + +%package help +Summary: Development documents and examples for mecab-python3 +Provides: python3-mecab-python3-doc +%description help +This is a Python wrapper for the [MeCab][] morphological analyzer for Japanese +text. It works with Python 3.6 and greater; if you need to use Python 2.7, use +v1.0.2. + +**Note:** If using MacOS Big Sur, you'll need to upgrade pip to version 20.3 or +higher to use wheels due to a pip issue. + +**issueを英語で書く必要はありません。** + +[MeCab]: https://taku910.github.io/mecab/ + +Note that Windows wheels require a [Microsoft Visual C++ +Redistributable][msvc], so be sure to install that. + +[msvc]: https://support.microsoft.com/en-us/help/2977003/the-latest-supported-visual-c-downloads + +# Basic usage + +```py +>>> import MeCab +>>> wakati = MeCab.Tagger("-Owakati") +>>> wakati.parse("pythonが大好きです").split() +['python', 'が', '大好き', 'です'] + +>>> tagger = MeCab.Tagger() +>>> print(tagger.parse("pythonが大好きです")) +python python python python 名詞-普通名詞-一般 +が ガ ガ が 助詞-格助詞 +大好き ダイスキ ダイスキ 大好き 形状詞-一般 +です デス デス です 助動詞 助動詞-デス 終止形-一般 +EOS +``` + +The API for `mecab-python3` closely follows the API for MeCab itself, +even when this makes it not very “Pythonic.” Please consult the [official MeCab +documentation][mecab-docs] for more information. + +[mecab-docs]: https://taku910.github.io/mecab/ + +# Installation + +Binary wheels are available for MacOS X, Linux, and Windows (64bit) are +installed by default when you use `pip`: + +```sh +pip install mecab-python3 +``` + +These wheels include a copy of the MeCab library, but not a dictionary. In +order to use MeCab you'll need to install a dictionary. `unidic-lite` is a good +one to start with: + +```sh +pip install unidic-lite +``` + +To build from source using pip, + +```sh +pip install --no-binary :all: mecab-python3 +``` + +## Dictionaries + +In order to use MeCab, you must install a dictionary. There are many different dictionaries available for MeCab. These UniDic packages, which include slight modifications for ease of use, are recommended: + +- [unidic](https://github.com/polm/unidic-py): The latest full UniDic. +- [unidic-lite](https://github.com/polm/unidic-lite): A slightly modified UniDic 2.1.2, chosen for its small size. + +The dictionaries below are not recommended due to being unmaintained for many years, but they are available for use with legacy applications. + +- [ipadic](https://github.com/polm/ipadic-py) +- [jumandic](https://github.com/polm/jumandic-py) + +For more details on the differences between dictionaries see [here](https://www.dampfkraft.com/nlp/japanese-tokenizer-dictionaries.html). + +# Common Issues + +If you get a `RuntimeError` when you try to run MeCab, here are some things to check: + +## Windows Redistributable + +You have to install [this][msvc] to use this package on Windows. + +## Installing a Dictionary + +Run `pip install unidic-lite` and confirm that works. If that fixes your +problem, you either don't have a dictionary installed, or you need to specify +your dictionary path like this: + + tagger = MeCab.Tagger('-r /dev/null -d /usr/local/lib/mecab/dic/mydic') + +Note: on Windows, use `nul` instead of `/dev/null`. Alternately, if you have a +`mecabrc` you can use the path after `-r`. + +## Specifying a mecabrc + +If you get this error: + + error message: [ifs] no such file or directory: /usr/local/etc/mecabrc + +You need to specify a `mecabrc` file. It's OK to specify an empty file, it just +has to exist. You can specify a `mecabrc` with `-r`. This may be necessary on +Debian or Ubuntu, where the `mecabrc` is in `/etc/mecabrc`. + +You can specify an empty `mecabrc` like this: + + tagger = MeCab.Tagger('-r/dev/null -d/home/hoge/mydic') + +## Using Unsupported Output Modes like `-Ochasen` + +Chasen output is not a built-in feature of MeCab, you must specify it in your +`dicrc` or `mecabrc`. Notably, Unidic does not include Chasen output format. +Please see [the MeCab documentation](https://taku910.github.io/mecab/#format). + +# Alternatives + +- [fugashi](https://github.com/polm/fugashi) is a Cython wrapper for MeCab with a Pythonic interface, by the current maintainer of this library +- [SudachiPy](https://github.com/WorksApplications/sudachi.rs) is a modern tokenizer with an actively maintained dictionary +- [pymecab-ko](https://github.com/NoUnique/pymecab-ko) is a wrapper of the Korean MeCab fork [mecab-ko](https://bitbucket.org/eunjeon/mecab-ko/src/master/) based on mecab-python3 +- [KoNLPy](https://konlpy.org/en/latest/) is a library for Korean NLP that includes a MeCab wrapper + +# Licensing + +Like MeCab itself, `mecab-python3` is copyrighted free software by +Taku Kudo <taku@chasen.org> and Nippon Telegraph and Telephone Corporation, +and is distributed under a 3-clause BSD license (see the file `BSD`). +Alternatively, it may be redistributed under the terms of the +GNU General Public License, version 2 (see the file `GPL`) or the +GNU Lesser General Public License, version 2.1 (see the file `LGPL`). + + +%prep +%autosetup -n mecab-python3-1.0.6 + +%build +%py3_build + +%install +%py3_install +install -d -m755 %{buildroot}/%{_pkgdocdir} +if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi +if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi +if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi +if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi +pushd %{buildroot} +if [ -d usr/lib ]; then + find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/lib64 ]; then + find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/bin ]; then + find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/sbin ]; then + find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst +fi +touch doclist.lst +if [ -d usr/share/man ]; then + find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst +fi +popd +mv %{buildroot}/filelist.lst . +mv %{buildroot}/doclist.lst . + +%files -n python3-mecab-python3 -f filelist.lst +%dir %{python3_sitearch}/* + +%files help -f doclist.lst +%{_docdir}/* + +%changelog +* Mon Apr 10 2023 Python_Bot <Python_Bot@openeuler.org> - 1.0.6-1 +- Package Spec generated |
