diff options
| author | CoprDistGit <infra@openeuler.org> | 2023-04-10 09:06:01 +0000 |
|---|---|---|
| committer | CoprDistGit <infra@openeuler.org> | 2023-04-10 09:06:01 +0000 |
| commit | 8863553e1a8066a100ed9944f12093b9e86acd6b (patch) | |
| tree | 10bec48f472d187fe0c89aa6f2452d3d3c6dd1a1 | |
| parent | 7fea41ee9434fb760ef286264521cfdee70aec94 (diff) | |
automatic import of python-koalas
| -rw-r--r-- | .gitignore | 1 | ||||
| -rw-r--r-- | python-koalas.spec | 385 | ||||
| -rw-r--r-- | sources | 1 |
3 files changed, 387 insertions, 0 deletions
@@ -0,0 +1 @@ +/koalas-1.8.2.tar.gz diff --git a/python-koalas.spec b/python-koalas.spec new file mode 100644 index 0000000..cf8754d --- /dev/null +++ b/python-koalas.spec @@ -0,0 +1,385 @@ +%global _empty_manifest_terminate_build 0 +Name: python-koalas +Version: 1.8.2 +Release: 1 +Summary: Koalas: pandas API on Apache Spark +License: http://www.apache.org/licenses/LICENSE-2.0 +URL: https://github.com/databricks/koalas +Source0: https://mirrors.nju.edu.cn/pypi/web/packages/b6/c4/81dce696d90ac382e31ded2d8ba09205f8cb4633aa0a6fbb23c8d4874047/koalas-1.8.2.tar.gz +BuildArch: noarch + +Requires: python3-pandas +Requires: python3-pyarrow +Requires: python3-numpy +Requires: python3-matplotlib +Requires: python3-mlflow +Requires: python3-plotly +Requires: python3-pyspark + +%description +**NOTE**: Koalas supports Apache Spark 3.1 and below as it will be [officially included to PySpark in the upcoming Apache Spark 3.2](http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-SPIP-Support-pandas-API-layer-on-PySpark-td30996.html). This repository is now in maintenance mode. For Apache Spark 3.2 and above, please use PySpark directly. + +<p align="center"> + <img src="https://raw.githubusercontent.com/databricks/koalas/master/icons/koalas-logo.png" width="140"/> +</p> + +<p align="center"> + pandas API on Apache Spark + <br/> + <a href="https://koalas.readthedocs.io/en/latest/?badge=latest"><strong>Explore Koalas docs »</strong></a> + <br/> + <br/> + <a href="https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb">Live notebook</a> + · + <a href="https://github.com/databricks/koalas/issues">Issues</a> + · + <a href="https://groups.google.com/forum/#!forum/koalas-dev">Mailing list</a> + <br/> + <strong><a href="https://www.gofundme.com/f/help-thirsty-koalas-devastated-by-recent-fires">Help Thirsty Koalas Devastated by Recent Fires</a></strong> +</p> + +The Koalas project makes data scientists more productive when interacting with big data, by implementing the pandas DataFrame API on top of Apache Spark. + +pandas is the de facto standard (single-node) DataFrame implementation in Python, while Spark is the de facto standard for big data processing. With this package, you can: + - Be immediately productive with Spark, with no learning curve, if you are already familiar with pandas. + - Have a single codebase that works both with pandas (tests, smaller datasets) and with Spark (distributed datasets). + +We would love to have you try it and give us feedback, through our [mailing lists](https://groups.google.com/forum/#!forum/koalas-dev) or [GitHub issues](https://github.com/databricks/koalas/issues). + +Try the Koalas 10 minutes tutorial on a live Jupyter notebook [here](https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb). The initial launch can take up to several minutes. + +[](https://github.com/databricks/koalas/actions) +[](https://codecov.io/gh/databricks/koalas) +[](https://koalas.readthedocs.io/en/latest/?badge=latest) +[](https://pypi.org/project/koalas/) +[](https://anaconda.org/conda-forge/koalas) +[](https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb) +[](https://pepy.tech/project/koalas) + + +## Getting Started + +Koalas can be installed in many ways such as Conda and pip. + +```bash +# Conda +conda install koalas -c conda-forge +``` + +```bash +# pip +pip install koalas +``` + +See [Installation](https://koalas.readthedocs.io/en/latest/getting_started/install.html) for more details. + +For Databricks Runtime, Koalas is pre-installed in Databricks Runtime 7.1 and above. Try [Databricks Community Edition](https://community.cloud.databricks.com/) for free. You can also follow these [steps](https://docs.databricks.com/libraries/index.html) to manually install a library on Databricks. + +Lastly, if your PyArrow version is 0.15+ and your PySpark version is lower than 3.0, it is best for you to set `ARROW_PRE_0_15_IPC_FORMAT` environment variable to `1` manually. +Koalas will try its best to set it for you but it is impossible to set it if there is a Spark context already launched. + +Now you can turn a pandas DataFrame into a Koalas DataFrame that is API-compliant with the former: + +```python +import databricks.koalas as ks +import pandas as pd + +pdf = pd.DataFrame({'x':range(3), 'y':['a','b','b'], 'z':['a','b','b']}) + +# Create a Koalas DataFrame from pandas DataFrame +df = ks.from_pandas(pdf) + +# Rename the columns +df.columns = ['x', 'y', 'z1'] + +# Do some operations in place: +df['x2'] = df.x * df.x +``` + +For more details, see [Getting Started](https://koalas.readthedocs.io/en/latest/getting_started/index.html) and [Dependencies](https://koalas.readthedocs.io/en/latest/getting_started/install.html#dependencies) in the official documentation. + + +## Contributing Guide + +See [Contributing Guide](https://koalas.readthedocs.io/en/latest/development/contributing.html) and [Design Principles](https://koalas.readthedocs.io/en/latest/development/design.html) in the official documentation. + + +## FAQ + +See [FAQ](https://koalas.readthedocs.io/en/latest/user_guide/faq.html) in the official documentation. + + +## Best Practices + +See [Best Practices](https://koalas.readthedocs.io/en/latest/user_guide/best_practices.html) in the official documentation. + + +## Koalas Talks and Blogs + +See [Koalas Talks and Blogs](https://koalas.readthedocs.io/en/latest/getting_started/videos_blogs.html) in the official documentation. + + + + +%package -n python3-koalas +Summary: Koalas: pandas API on Apache Spark +Provides: python-koalas +BuildRequires: python3-devel +BuildRequires: python3-setuptools +BuildRequires: python3-pip +%description -n python3-koalas +**NOTE**: Koalas supports Apache Spark 3.1 and below as it will be [officially included to PySpark in the upcoming Apache Spark 3.2](http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-SPIP-Support-pandas-API-layer-on-PySpark-td30996.html). This repository is now in maintenance mode. For Apache Spark 3.2 and above, please use PySpark directly. + +<p align="center"> + <img src="https://raw.githubusercontent.com/databricks/koalas/master/icons/koalas-logo.png" width="140"/> +</p> + +<p align="center"> + pandas API on Apache Spark + <br/> + <a href="https://koalas.readthedocs.io/en/latest/?badge=latest"><strong>Explore Koalas docs »</strong></a> + <br/> + <br/> + <a href="https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb">Live notebook</a> + · + <a href="https://github.com/databricks/koalas/issues">Issues</a> + · + <a href="https://groups.google.com/forum/#!forum/koalas-dev">Mailing list</a> + <br/> + <strong><a href="https://www.gofundme.com/f/help-thirsty-koalas-devastated-by-recent-fires">Help Thirsty Koalas Devastated by Recent Fires</a></strong> +</p> + +The Koalas project makes data scientists more productive when interacting with big data, by implementing the pandas DataFrame API on top of Apache Spark. + +pandas is the de facto standard (single-node) DataFrame implementation in Python, while Spark is the de facto standard for big data processing. With this package, you can: + - Be immediately productive with Spark, with no learning curve, if you are already familiar with pandas. + - Have a single codebase that works both with pandas (tests, smaller datasets) and with Spark (distributed datasets). + +We would love to have you try it and give us feedback, through our [mailing lists](https://groups.google.com/forum/#!forum/koalas-dev) or [GitHub issues](https://github.com/databricks/koalas/issues). + +Try the Koalas 10 minutes tutorial on a live Jupyter notebook [here](https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb). The initial launch can take up to several minutes. + +[](https://github.com/databricks/koalas/actions) +[](https://codecov.io/gh/databricks/koalas) +[](https://koalas.readthedocs.io/en/latest/?badge=latest) +[](https://pypi.org/project/koalas/) +[](https://anaconda.org/conda-forge/koalas) +[](https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb) +[](https://pepy.tech/project/koalas) + + +## Getting Started + +Koalas can be installed in many ways such as Conda and pip. + +```bash +# Conda +conda install koalas -c conda-forge +``` + +```bash +# pip +pip install koalas +``` + +See [Installation](https://koalas.readthedocs.io/en/latest/getting_started/install.html) for more details. + +For Databricks Runtime, Koalas is pre-installed in Databricks Runtime 7.1 and above. Try [Databricks Community Edition](https://community.cloud.databricks.com/) for free. You can also follow these [steps](https://docs.databricks.com/libraries/index.html) to manually install a library on Databricks. + +Lastly, if your PyArrow version is 0.15+ and your PySpark version is lower than 3.0, it is best for you to set `ARROW_PRE_0_15_IPC_FORMAT` environment variable to `1` manually. +Koalas will try its best to set it for you but it is impossible to set it if there is a Spark context already launched. + +Now you can turn a pandas DataFrame into a Koalas DataFrame that is API-compliant with the former: + +```python +import databricks.koalas as ks +import pandas as pd + +pdf = pd.DataFrame({'x':range(3), 'y':['a','b','b'], 'z':['a','b','b']}) + +# Create a Koalas DataFrame from pandas DataFrame +df = ks.from_pandas(pdf) + +# Rename the columns +df.columns = ['x', 'y', 'z1'] + +# Do some operations in place: +df['x2'] = df.x * df.x +``` + +For more details, see [Getting Started](https://koalas.readthedocs.io/en/latest/getting_started/index.html) and [Dependencies](https://koalas.readthedocs.io/en/latest/getting_started/install.html#dependencies) in the official documentation. + + +## Contributing Guide + +See [Contributing Guide](https://koalas.readthedocs.io/en/latest/development/contributing.html) and [Design Principles](https://koalas.readthedocs.io/en/latest/development/design.html) in the official documentation. + + +## FAQ + +See [FAQ](https://koalas.readthedocs.io/en/latest/user_guide/faq.html) in the official documentation. + + +## Best Practices + +See [Best Practices](https://koalas.readthedocs.io/en/latest/user_guide/best_practices.html) in the official documentation. + + +## Koalas Talks and Blogs + +See [Koalas Talks and Blogs](https://koalas.readthedocs.io/en/latest/getting_started/videos_blogs.html) in the official documentation. + + + + +%package help +Summary: Development documents and examples for koalas +Provides: python3-koalas-doc +%description help +**NOTE**: Koalas supports Apache Spark 3.1 and below as it will be [officially included to PySpark in the upcoming Apache Spark 3.2](http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-SPIP-Support-pandas-API-layer-on-PySpark-td30996.html). This repository is now in maintenance mode. For Apache Spark 3.2 and above, please use PySpark directly. + +<p align="center"> + <img src="https://raw.githubusercontent.com/databricks/koalas/master/icons/koalas-logo.png" width="140"/> +</p> + +<p align="center"> + pandas API on Apache Spark + <br/> + <a href="https://koalas.readthedocs.io/en/latest/?badge=latest"><strong>Explore Koalas docs »</strong></a> + <br/> + <br/> + <a href="https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb">Live notebook</a> + · + <a href="https://github.com/databricks/koalas/issues">Issues</a> + · + <a href="https://groups.google.com/forum/#!forum/koalas-dev">Mailing list</a> + <br/> + <strong><a href="https://www.gofundme.com/f/help-thirsty-koalas-devastated-by-recent-fires">Help Thirsty Koalas Devastated by Recent Fires</a></strong> +</p> + +The Koalas project makes data scientists more productive when interacting with big data, by implementing the pandas DataFrame API on top of Apache Spark. + +pandas is the de facto standard (single-node) DataFrame implementation in Python, while Spark is the de facto standard for big data processing. With this package, you can: + - Be immediately productive with Spark, with no learning curve, if you are already familiar with pandas. + - Have a single codebase that works both with pandas (tests, smaller datasets) and with Spark (distributed datasets). + +We would love to have you try it and give us feedback, through our [mailing lists](https://groups.google.com/forum/#!forum/koalas-dev) or [GitHub issues](https://github.com/databricks/koalas/issues). + +Try the Koalas 10 minutes tutorial on a live Jupyter notebook [here](https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb). The initial launch can take up to several minutes. + +[](https://github.com/databricks/koalas/actions) +[](https://codecov.io/gh/databricks/koalas) +[](https://koalas.readthedocs.io/en/latest/?badge=latest) +[](https://pypi.org/project/koalas/) +[](https://anaconda.org/conda-forge/koalas) +[](https://mybinder.org/v2/gh/databricks/koalas/master?filepath=docs%2Fsource%2Fgetting_started%2F10min.ipynb) +[](https://pepy.tech/project/koalas) + + +## Getting Started + +Koalas can be installed in many ways such as Conda and pip. + +```bash +# Conda +conda install koalas -c conda-forge +``` + +```bash +# pip +pip install koalas +``` + +See [Installation](https://koalas.readthedocs.io/en/latest/getting_started/install.html) for more details. + +For Databricks Runtime, Koalas is pre-installed in Databricks Runtime 7.1 and above. Try [Databricks Community Edition](https://community.cloud.databricks.com/) for free. You can also follow these [steps](https://docs.databricks.com/libraries/index.html) to manually install a library on Databricks. + +Lastly, if your PyArrow version is 0.15+ and your PySpark version is lower than 3.0, it is best for you to set `ARROW_PRE_0_15_IPC_FORMAT` environment variable to `1` manually. +Koalas will try its best to set it for you but it is impossible to set it if there is a Spark context already launched. + +Now you can turn a pandas DataFrame into a Koalas DataFrame that is API-compliant with the former: + +```python +import databricks.koalas as ks +import pandas as pd + +pdf = pd.DataFrame({'x':range(3), 'y':['a','b','b'], 'z':['a','b','b']}) + +# Create a Koalas DataFrame from pandas DataFrame +df = ks.from_pandas(pdf) + +# Rename the columns +df.columns = ['x', 'y', 'z1'] + +# Do some operations in place: +df['x2'] = df.x * df.x +``` + +For more details, see [Getting Started](https://koalas.readthedocs.io/en/latest/getting_started/index.html) and [Dependencies](https://koalas.readthedocs.io/en/latest/getting_started/install.html#dependencies) in the official documentation. + + +## Contributing Guide + +See [Contributing Guide](https://koalas.readthedocs.io/en/latest/development/contributing.html) and [Design Principles](https://koalas.readthedocs.io/en/latest/development/design.html) in the official documentation. + + +## FAQ + +See [FAQ](https://koalas.readthedocs.io/en/latest/user_guide/faq.html) in the official documentation. + + +## Best Practices + +See [Best Practices](https://koalas.readthedocs.io/en/latest/user_guide/best_practices.html) in the official documentation. + + +## Koalas Talks and Blogs + +See [Koalas Talks and Blogs](https://koalas.readthedocs.io/en/latest/getting_started/videos_blogs.html) in the official documentation. + + + + +%prep +%autosetup -n koalas-1.8.2 + +%build +%py3_build + +%install +%py3_install +install -d -m755 %{buildroot}/%{_pkgdocdir} +if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi +if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi +if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi +if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi +pushd %{buildroot} +if [ -d usr/lib ]; then + find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/lib64 ]; then + find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/bin ]; then + find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst +fi +if [ -d usr/sbin ]; then + find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst +fi +touch doclist.lst +if [ -d usr/share/man ]; then + find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst +fi +popd +mv %{buildroot}/filelist.lst . +mv %{buildroot}/doclist.lst . + +%files -n python3-koalas -f filelist.lst +%dir %{python3_sitelib}/* + +%files help -f doclist.lst +%{_docdir}/* + +%changelog +* Mon Apr 10 2023 Python_Bot <Python_Bot@openeuler.org> - 1.8.2-1 +- Package Spec generated @@ -0,0 +1 @@ +423c3b93f51d076df668e1980987d6df koalas-1.8.2.tar.gz |
