%global _empty_manifest_terminate_build 0
Name: python-transbigdata
Version: 0.4.17
Release: 1
Summary: A Python package developed for transportation spatio-temporal big data processing and analysis.
License: BSD
URL: https://github.com/ni1o1/transbigdata
Source0: https://mirrors.nju.edu.cn/pypi/web/packages/18/94/fd5784ab7c74eba7c5fbac9167d552ff92236b049693b382f63a2b3790a1/transbigdata-0.4.17.tar.gz
BuildArch: noarch
Requires: python3-numpy
Requires: python3-pandas
Requires: python3-shapely
Requires: python3-geopandas
Requires: python3-scipy
Requires: python3-matplotlib
%description
English [中文版](README-zh_CN.md)
# TransBigData
[](https://transbigdata.readthedocs.io/en/latest/?badge=latest) [](https://pepy.tech/project/transbigdata) [](https://pepy.tech/project/transbigdata) [](https://github.com/ni1o1/transbigdata/actions/workflows/tests.yml) [](https://codecov.io/gh/ni1o1/transbigdata)
## Introduction
`TransBigData` is a Python package developed for transportation spatio-temporal big data processing, analysis and visualization. `TransBigData` provides fast and concise methods for processing common transportation spatio-temporal big data such as Taxi GPS data, bicycle sharing data and bus GPS data. `TransBigData` provides a variety of processing methods for each stage of transportation spatio-temporal big data analysis. The code with `TransBigData` is clean, efficient, flexible, and easy to use, allowing complex data tasks to be achieved with concise code.
For some specific types of data, `TransBigData` also provides targeted tools for specific needs, such as extraction of Origin and Destination(OD) of taxi trips from taxi GPS data and identification of arrival and departure information from bus GPS data. The latest stable release of the software can be installed via pip and full documentation
can be found at https://transbigdata.readthedocs.io/en/latest/. Introduction PPT can be found [here](https://github.com/ni1o1/transbigdata/blob/main/introduction/IntroductionofTransBigData.pdf) and [here(in Chinese)](https://github.com/ni1o1/transbigdata/blob/main/introduction/gridbasedframework.pdf)
### Target Audience
The target audience of `TransBigData` includes:
- Data science researchers and data engineers in the field of transportation big data, smart transportation systems, and urban computing, particularly those who want to integrate innovative algorithms into intelligent trasnportation systems
- Government, enterprises, or other entities who expect efficient and reliable management decision support through transportation spatio-temporal data analysis.
### Technical Features
* Provide a variety of processing methods for each stage of transportation spatio-temporal big data analysis.
* The code with `TransBigData` is clean, efficient, flexible, and easy to use, allowing complex data tasks to be achieved with concise code.
### Main Functions
Currently, `TransBigData` mainly provides the following methods:
* **Data Quality**: Provides methods to quickly obtain the general information of the dataset, including the data amount the time period and the sampling interval.
* **Data Preprocess**: Provides methods to clean multiple types of data error.
* **Data Gridding**: Provides methods to generate multiple types of geographic grids (Rectangular grids, Hexagonal grids) in the research area. Provides fast algorithms to map GPS data to the generated grids.
* **Data Aggregating**: Provides methods to aggregate GPS data and OD data into geographic polygon.
* **Data Visualization**: Built-in visualization capabilities leverage the visualization package keplergl to interactively visualize data on Jupyter notebook with simple code.
* **Trajectory Processing**: Provides methods to process trajectory data, including generating trajectory linestring from GPS points, and trajectory densification, etc.
* **Basemap Loading**: Provides methods to display Mapbox basemap on matplotlib figures
## Installation
`TransBigData` support Python >= 3.6
### Using pypi [](https://badge.fury.io/py/transbigdata)
`TransBigData` can be installed by using `pip install`. Before installing `TransBigData`, make sure that you have installed the available [geopandas package](https://geopandas.org/en/stable/getting_started/install.html). If you already have geopandas installed, run the following code directly from the command prompt to install `TransBigData`:
pip install transbigdata
### Using conda-forge [](https://anaconda.org/conda-forge/transbigdata) [](https://anaconda.org/conda-forge/transbigdata)
You can also install `TransBigData` by `conda-forge`, this will automaticaly solve the dependency, it can be installed with:
conda install -c conda-forge transbigdata
## Contributing to TransBigData [](https://github.com/ni1o1/transbigdata/graphs/contributors) [](https://gitter.im/transbigdata/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge) 
All contributions, bug reports, bug fixes, documentation improvements, enhancements and ideas are welcome. A detailed overview on how to contribute can be found in the [contributing guide](https://github.com/ni1o1/transbigdata/blob/master/CONTRIBUTING.md) on GitHub.
## Examples
### Example of data visualization
#### Visualize trajectories (with keplergl)

#### Visualize data distribution (with keplergl)

#### Visualize OD (with keplergl)

### Example of taxi GPS data processing
The following example shows how to use the `TransBigData` to perform data gridding, data aggregating and data visualization for taxi GPS data.
#### Read the data
```python
import transbigdata as tbd
import pandas as pd
#Read taxi gps data
data = pd.read_csv('TaxiData-Sample.csv',header = None)
data.columns = ['VehicleNum','time','lon','lat','OpenStatus','Speed']
data
```
| VehicleNum | time | lon | lat | OpenStatus | Speed | |
|---|---|---|---|---|---|---|
| 0 | 34745 | 20:27:43 | 113.806847 | 22.623249 | 1 | 27 |
| 1 | 34745 | 20:24:07 | 113.809898 | 22.627399 | 0 | 0 |
| 2 | 34745 | 20:24:27 | 113.809898 | 22.627399 | 0 | 0 |
| 3 | 34745 | 20:22:07 | 113.811348 | 22.628067 | 0 | 0 |
| 4 | 34745 | 20:10:06 | 113.819885 | 22.647800 | 0 | 54 |
| ... | ... | ... | ... | ... | ... | ... |
| 544994 | 28265 | 21:35:13 | 114.321503 | 22.709499 | 0 | 18 |
| 544995 | 28265 | 09:08:02 | 114.322701 | 22.681700 | 0 | 0 |
| 544996 | 28265 | 09:14:31 | 114.336700 | 22.690100 | 0 | 0 |
| 544997 | 28265 | 21:19:12 | 114.352600 | 22.728399 | 0 | 0 |
| 544998 | 28265 | 19:08:06 | 114.137703 | 22.621700 | 0 | 0 |
544999 rows × 6 columns
[](https://transbigdata.readthedocs.io/en/latest/?badge=latest) [](https://pepy.tech/project/transbigdata) [](https://pepy.tech/project/transbigdata) [](https://github.com/ni1o1/transbigdata/actions/workflows/tests.yml) [](https://codecov.io/gh/ni1o1/transbigdata)
## Introduction
`TransBigData` is a Python package developed for transportation spatio-temporal big data processing, analysis and visualization. `TransBigData` provides fast and concise methods for processing common transportation spatio-temporal big data such as Taxi GPS data, bicycle sharing data and bus GPS data. `TransBigData` provides a variety of processing methods for each stage of transportation spatio-temporal big data analysis. The code with `TransBigData` is clean, efficient, flexible, and easy to use, allowing complex data tasks to be achieved with concise code.
For some specific types of data, `TransBigData` also provides targeted tools for specific needs, such as extraction of Origin and Destination(OD) of taxi trips from taxi GPS data and identification of arrival and departure information from bus GPS data. The latest stable release of the software can be installed via pip and full documentation
can be found at https://transbigdata.readthedocs.io/en/latest/. Introduction PPT can be found [here](https://github.com/ni1o1/transbigdata/blob/main/introduction/IntroductionofTransBigData.pdf) and [here(in Chinese)](https://github.com/ni1o1/transbigdata/blob/main/introduction/gridbasedframework.pdf)
### Target Audience
The target audience of `TransBigData` includes:
- Data science researchers and data engineers in the field of transportation big data, smart transportation systems, and urban computing, particularly those who want to integrate innovative algorithms into intelligent trasnportation systems
- Government, enterprises, or other entities who expect efficient and reliable management decision support through transportation spatio-temporal data analysis.
### Technical Features
* Provide a variety of processing methods for each stage of transportation spatio-temporal big data analysis.
* The code with `TransBigData` is clean, efficient, flexible, and easy to use, allowing complex data tasks to be achieved with concise code.
### Main Functions
Currently, `TransBigData` mainly provides the following methods:
* **Data Quality**: Provides methods to quickly obtain the general information of the dataset, including the data amount the time period and the sampling interval.
* **Data Preprocess**: Provides methods to clean multiple types of data error.
* **Data Gridding**: Provides methods to generate multiple types of geographic grids (Rectangular grids, Hexagonal grids) in the research area. Provides fast algorithms to map GPS data to the generated grids.
* **Data Aggregating**: Provides methods to aggregate GPS data and OD data into geographic polygon.
* **Data Visualization**: Built-in visualization capabilities leverage the visualization package keplergl to interactively visualize data on Jupyter notebook with simple code.
* **Trajectory Processing**: Provides methods to process trajectory data, including generating trajectory linestring from GPS points, and trajectory densification, etc.
* **Basemap Loading**: Provides methods to display Mapbox basemap on matplotlib figures
## Installation
`TransBigData` support Python >= 3.6
### Using pypi [](https://badge.fury.io/py/transbigdata)
`TransBigData` can be installed by using `pip install`. Before installing `TransBigData`, make sure that you have installed the available [geopandas package](https://geopandas.org/en/stable/getting_started/install.html). If you already have geopandas installed, run the following code directly from the command prompt to install `TransBigData`:
pip install transbigdata
### Using conda-forge [](https://anaconda.org/conda-forge/transbigdata) [](https://anaconda.org/conda-forge/transbigdata)
You can also install `TransBigData` by `conda-forge`, this will automaticaly solve the dependency, it can be installed with:
conda install -c conda-forge transbigdata
## Contributing to TransBigData [](https://github.com/ni1o1/transbigdata/graphs/contributors) [](https://gitter.im/transbigdata/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge) 
All contributions, bug reports, bug fixes, documentation improvements, enhancements and ideas are welcome. A detailed overview on how to contribute can be found in the [contributing guide](https://github.com/ni1o1/transbigdata/blob/master/CONTRIBUTING.md) on GitHub.
## Examples
### Example of data visualization
#### Visualize trajectories (with keplergl)

#### Visualize data distribution (with keplergl)

#### Visualize OD (with keplergl)

### Example of taxi GPS data processing
The following example shows how to use the `TransBigData` to perform data gridding, data aggregating and data visualization for taxi GPS data.
#### Read the data
```python
import transbigdata as tbd
import pandas as pd
#Read taxi gps data
data = pd.read_csv('TaxiData-Sample.csv',header = None)
data.columns = ['VehicleNum','time','lon','lat','OpenStatus','Speed']
data
```
| VehicleNum | time | lon | lat | OpenStatus | Speed | |
|---|---|---|---|---|---|---|
| 0 | 34745 | 20:27:43 | 113.806847 | 22.623249 | 1 | 27 |
| 1 | 34745 | 20:24:07 | 113.809898 | 22.627399 | 0 | 0 |
| 2 | 34745 | 20:24:27 | 113.809898 | 22.627399 | 0 | 0 |
| 3 | 34745 | 20:22:07 | 113.811348 | 22.628067 | 0 | 0 |
| 4 | 34745 | 20:10:06 | 113.819885 | 22.647800 | 0 | 54 |
| ... | ... | ... | ... | ... | ... | ... |
| 544994 | 28265 | 21:35:13 | 114.321503 | 22.709499 | 0 | 18 |
| 544995 | 28265 | 09:08:02 | 114.322701 | 22.681700 | 0 | 0 |
| 544996 | 28265 | 09:14:31 | 114.336700 | 22.690100 | 0 | 0 |
| 544997 | 28265 | 21:19:12 | 114.352600 | 22.728399 | 0 | 0 |
| 544998 | 28265 | 19:08:06 | 114.137703 | 22.621700 | 0 | 0 |
544999 rows × 6 columns
[](https://transbigdata.readthedocs.io/en/latest/?badge=latest) [](https://pepy.tech/project/transbigdata) [](https://pepy.tech/project/transbigdata) [](https://github.com/ni1o1/transbigdata/actions/workflows/tests.yml) [](https://codecov.io/gh/ni1o1/transbigdata)
## Introduction
`TransBigData` is a Python package developed for transportation spatio-temporal big data processing, analysis and visualization. `TransBigData` provides fast and concise methods for processing common transportation spatio-temporal big data such as Taxi GPS data, bicycle sharing data and bus GPS data. `TransBigData` provides a variety of processing methods for each stage of transportation spatio-temporal big data analysis. The code with `TransBigData` is clean, efficient, flexible, and easy to use, allowing complex data tasks to be achieved with concise code.
For some specific types of data, `TransBigData` also provides targeted tools for specific needs, such as extraction of Origin and Destination(OD) of taxi trips from taxi GPS data and identification of arrival and departure information from bus GPS data. The latest stable release of the software can be installed via pip and full documentation
can be found at https://transbigdata.readthedocs.io/en/latest/. Introduction PPT can be found [here](https://github.com/ni1o1/transbigdata/blob/main/introduction/IntroductionofTransBigData.pdf) and [here(in Chinese)](https://github.com/ni1o1/transbigdata/blob/main/introduction/gridbasedframework.pdf)
### Target Audience
The target audience of `TransBigData` includes:
- Data science researchers and data engineers in the field of transportation big data, smart transportation systems, and urban computing, particularly those who want to integrate innovative algorithms into intelligent trasnportation systems
- Government, enterprises, or other entities who expect efficient and reliable management decision support through transportation spatio-temporal data analysis.
### Technical Features
* Provide a variety of processing methods for each stage of transportation spatio-temporal big data analysis.
* The code with `TransBigData` is clean, efficient, flexible, and easy to use, allowing complex data tasks to be achieved with concise code.
### Main Functions
Currently, `TransBigData` mainly provides the following methods:
* **Data Quality**: Provides methods to quickly obtain the general information of the dataset, including the data amount the time period and the sampling interval.
* **Data Preprocess**: Provides methods to clean multiple types of data error.
* **Data Gridding**: Provides methods to generate multiple types of geographic grids (Rectangular grids, Hexagonal grids) in the research area. Provides fast algorithms to map GPS data to the generated grids.
* **Data Aggregating**: Provides methods to aggregate GPS data and OD data into geographic polygon.
* **Data Visualization**: Built-in visualization capabilities leverage the visualization package keplergl to interactively visualize data on Jupyter notebook with simple code.
* **Trajectory Processing**: Provides methods to process trajectory data, including generating trajectory linestring from GPS points, and trajectory densification, etc.
* **Basemap Loading**: Provides methods to display Mapbox basemap on matplotlib figures
## Installation
`TransBigData` support Python >= 3.6
### Using pypi [](https://badge.fury.io/py/transbigdata)
`TransBigData` can be installed by using `pip install`. Before installing `TransBigData`, make sure that you have installed the available [geopandas package](https://geopandas.org/en/stable/getting_started/install.html). If you already have geopandas installed, run the following code directly from the command prompt to install `TransBigData`:
pip install transbigdata
### Using conda-forge [](https://anaconda.org/conda-forge/transbigdata) [](https://anaconda.org/conda-forge/transbigdata)
You can also install `TransBigData` by `conda-forge`, this will automaticaly solve the dependency, it can be installed with:
conda install -c conda-forge transbigdata
## Contributing to TransBigData [](https://github.com/ni1o1/transbigdata/graphs/contributors) [](https://gitter.im/transbigdata/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge) 
All contributions, bug reports, bug fixes, documentation improvements, enhancements and ideas are welcome. A detailed overview on how to contribute can be found in the [contributing guide](https://github.com/ni1o1/transbigdata/blob/master/CONTRIBUTING.md) on GitHub.
## Examples
### Example of data visualization
#### Visualize trajectories (with keplergl)

#### Visualize data distribution (with keplergl)

#### Visualize OD (with keplergl)

### Example of taxi GPS data processing
The following example shows how to use the `TransBigData` to perform data gridding, data aggregating and data visualization for taxi GPS data.
#### Read the data
```python
import transbigdata as tbd
import pandas as pd
#Read taxi gps data
data = pd.read_csv('TaxiData-Sample.csv',header = None)
data.columns = ['VehicleNum','time','lon','lat','OpenStatus','Speed']
data
```
| VehicleNum | time | lon | lat | OpenStatus | Speed | |
|---|---|---|---|---|---|---|
| 0 | 34745 | 20:27:43 | 113.806847 | 22.623249 | 1 | 27 |
| 1 | 34745 | 20:24:07 | 113.809898 | 22.627399 | 0 | 0 |
| 2 | 34745 | 20:24:27 | 113.809898 | 22.627399 | 0 | 0 |
| 3 | 34745 | 20:22:07 | 113.811348 | 22.628067 | 0 | 0 |
| 4 | 34745 | 20:10:06 | 113.819885 | 22.647800 | 0 | 54 |
| ... | ... | ... | ... | ... | ... | ... |
| 544994 | 28265 | 21:35:13 | 114.321503 | 22.709499 | 0 | 18 |
| 544995 | 28265 | 09:08:02 | 114.322701 | 22.681700 | 0 | 0 |
| 544996 | 28265 | 09:14:31 | 114.336700 | 22.690100 | 0 | 0 |
| 544997 | 28265 | 21:19:12 | 114.352600 | 22.728399 | 0 | 0 |
| 544998 | 28265 | 19:08:06 | 114.137703 | 22.621700 | 0 | 0 |
544999 rows × 6 columns