%global _empty_manifest_terminate_build 0 Name: python-autoviz Version: 0.1.58 Release: 1 Summary: Automatically Visualize any dataset, any size with a single line of code License: Apache License 2.0 URL: https://github.com/AutoViML/AutoViz Source0: https://mirrors.nju.edu.cn/pypi/web/packages/a4/48/7e29c710c9cb8cf0972d760ae591daced8eaa4ec49a9a626720dfccac459/autoviz-0.1.58.tar.gz BuildArch: noarch Requires: python3-bokeh Requires: python3-emoji Requires: python3-fsspec Requires: python3-holoviews Requires: python3-hvplot Requires: python3-ipython Requires: python3-jupyter Requires: python3-matplotlib Requires: python3-nltk Requires: python3-numpy Requires: python3-pandas Requires: python3-panel Requires: python3-pyamg Requires: python3-scikit-learn Requires: python3-seaborn Requires: python3-statsmodels Requires: python3-textblob Requires: python3-typing-extensions Requires: python3-wordcloud Requires: python3-xgboost Requires: python3-xlrd %description # AutoViz Automatically Visualize any dataset, any size with a single line of code. Now you can save these interactive charts as HTML files automatically with the `"html"` setting. ### Sep-2022 Update: AutoViz now provides data cleansing suggestions! #autoviz #datacleaning From version 0.1.50 onwards, AutoViz now automatically analyzes your dataset and provides suggestions for how to clean your data set. It detects missing values, identifies rare categories, finds infinite values, detects mixed data types, and so much more. This will help you tremendously speed up your data cleaning activities. If you have suggestions to add more data cleaning steps please file an `Issue` in our GitHub and we will gladly consider it. Here is an example of how data cleaning suggestions look:
In order to get this latest function, you must upgrade autoviz to the latest version by: ``` pip install autoviz --upgrade ``` In the same version, you can also get data suggestions by using `AV.AutoViz(......, verbose=1)` or by simply importing it:
``` from autoviz import data_cleaning_suggestions data_cleaning_suggestions(df) ``` ### Dec-23-2021 Update: AutoViz now does Wordclouds! #autoviz #wordcloud AutoViz can now create Wordclouds automatically for your NLP variables in data. It detects NLP variables automatically and creates wordclouds for them. See Colab notebook for example: [AutoViz Demo with HTML setting](https://colab.research.google.com/drive/1r5QqESRZDY98FFfDOgVtMAVA_oaGtqqx?usp=sharing) ### Dec 21, 2021: AutoViz now runs on Docker containers as part of MLOps pipelines. Check out Orchest.io We are excited to announce that AutoViz and Deep_AutoViML are now available as containerized applications on Docker. This means that you can build data pipelines using a fantastic tool like [orchest.io](orchest.io) to build MLOps pipelines visually. Here are two sample pipelines we have created: AutoViz pipeline: https://lnkd.in/g5uC-z66 Deep_AutoViML pipeline: https://lnkd.in/gdnWTqCG You can find more examples and a wonderful video on [orchest's web site](https://github.com/orchest/orchest-examples) ![banner](https://github.com/rsesha/autoviz_pipeline/blob/main/autoviz_orchest.png) ### Dec-17-2021 AutoViz now uses HoloViews to display dashboards with Bokeh and save them as Dynamic HTML for web serving #HTML #Bokeh #Holoviews Now you can use AutoViz to create Interactive Bokeh charts and dashboards (see below) either in Jupyter Notebooks or in the browser. Use chart_format as follows: - `chart_format='bokeh'`: interactive Bokeh dashboards are plotted in Jupyter Notebooks. - `chart_format='server'`, dashboards will pop up for each kind of chart on your web browser. - `chart_format='html'`, interactive Bokeh charts will be silently saved as Dynamic HTML files under `AutoViz_Plots` directory ![banner](intro2.gif) [![Pepy Downloads](https://pepy.tech/badge/autoviz)](https://pepy.tech/project/autoviz) [![Pepy Downloads per week](https://pepy.tech/badge/autoviz/week)](https://pepy.tech/project/autoviz) [![Pepy Downloads per month](https://pepy.tech/badge/autoviz/month)](https://pepy.tech/project/autoviz) [![standard-readme compliant](https://img.shields.io/badge/standard--readme-OK-green.svg)](https://github.com/RichardLitt/standard-readme) [![Python Versions](https://img.shields.io/pypi/pyversions/autoviz.svg)](https://pypi.org/project/autoviz) [![PyPI Version](https://img.shields.io/pypi/v/autoviz.svg)](https://pypi.org/project/autoviz) [![PyPI License](https://img.shields.io/pypi/l/autoviz.svg)](https://github.com/AutoViML/AutoViz/blob/master/LICENSE) AutoViz performs automatic visualization of any dataset with one line of code. Give it any input file (CSV, txt or json format) of any size and AutoViz will visualize it, provided you set the `max_rows_analyzed` and `max_cols_analyzed` setting within the bounds of your machine's memory limit. AutoViz can now create charts in multiple formats using the `chart_format` setting: - If `chart_format ='png'` or `'svg'` or `'jpg'`: Matplotlib charts are plotted inline. * Can be saved locally (using `verbose=2` setting) or displayed (`verbose=1`) in Jupyter Notebooks. * This is the default behavior for AutoViz. - If `chart_format='bokeh'`: Interactive Bokeh charts are plotted in Jupyter Notebooks. - If `chart_format='server'`, dashboards will pop up for each kind of chart on your browser. - If `chart_format='html'`, interactive Bokeh charts will be created and silently saved as HTML files under the `AutoViz_Plots` directory (under working folder) or any other directory that you specify using the `save_plot_dir` setting (during input). ## Table of Contents - [Install](#install) - [Usage](#usage) - [API](#api) - [Maintainers](#maintainers) - [Contributing](#contributing) - [License](#license) ## Install **Prerequsites** - [Anaconda](https://docs.anaconda.com/anaconda/install/) To clone AutoViz, it's better to create a new environment, and install the required dependencies: To install from PyPi: ```sh conda create -n python=3.7 anaconda conda activate # ON WINDOWS: `source activate ` pip install autoviz ``` To install from source: ```sh cd git clone git@github.com:AutoViML/AutoViz.git # or download and unzip https://github.com/AutoViML/AutoViz/archive/master.zip conda create -n python=3.7 anaconda conda activate # ON WINDOWS: `source activate ` cd AutoViz pip install -r requirements.txt ``` ## Usage Read this Medium article to know how to use [AutoViz](https://towardsdatascience.com/autoviz-a-new-tool-for-automated-visualization-ec9c1744a6ad). In the AutoViz directory, open a Jupyter Notebook and use this line to instantiate the library ```py from autoviz.AutoViz_Class import AutoViz_Class AV = AutoViz_Class() ``` Load a dataset (any CSV or text file) into a Pandas dataframe or give the name of the path and filename you want to visualize. If you don't have a filename, you can simply assign the filename argument `""` (empty string). Call AutoViz using the filename (or dataframe) along with the separator and the name of the target variable in the input. ```py filename = "" sep = "," dft = AV.AutoViz( filename, sep=",", depVar="", dfte=None, header=0, verbose=0, lowess=False, chart_format="svg", max_rows_analyzed=150000, max_cols_analyzed=30, save_plot_dir=None ) ``` AutoViz will do the rest. You will see charts and plots on your screen. ![var_charts](var_charts.JPG) `AV.AutoViz` is the main plotting function in AV. Depending on what `chart_format` you choose, AutoViz will automatically call either the `AutoViz_Main` function or `AutoViz_Holo` function. **Notes:** * AutoViz will visualize any sized file using a statistically valid sample. * `COMMA` is assumed as default separator in file. But you can change it. * Assumes first row as header in file but you can change it. - `verbose` option - if 0, display minimal information but displays charts on your notebook - if 1, print extra information on the notebook and also display charts - if 2, will not display any charts, it will simply save them in your local machine under `AutoViz_Plots` directory under your current working folder. - `chart_format` option - if `'svg','jpg' or 'png'`, displays all charts or saves them depending on verbose option. - if `'bokeh'`, plots interactive charts using Bokeh on your Jupyter Notebook - if `'server'`, will display charts on your browser with one chart type in each tab - if `'html'`, will create bokeh interactive charts and silently save them under `AutoViz_Plots` directory or any directory you specify in the `save_plot_dir` setting. ![bokeh_charts](bokeh_charts.JPG) ## API **Arguments** - `filename` - Make sure that you give filename as empty string ("") if there is no filename associated with this data and you want to use a dataframe, then use dfte to give the name of the dataframe. Otherwise, fill in the file name and leave dfte as empty string. Only one of these two is needed to load the data set. - `sep` - this is the separator in the file. It can be comma, semi-colon or tab or any value that you see in your file that separates each column. - `depVar` - target variable in your dataset. You can leave it as empty string if you don't have a target variable in your data. - `dfte` - this is the input dataframe in case you want to load a pandas dataframe to plot charts. In that case, leave filename as an empty string. - `header` - the row number of the header row in your file. If it is the first row, then this must be zero. - `verbose` - it has 3 acceptable values: 0, 1 or 2. With zero, you get all charts but limited info. With 1 you get all charts and more info. With 2, you will not see any charts but they will be quietly generated and save in your local current directory under the AutoViz_Plots directory which will be created. Make sure you delete this folder periodically, otherwise, you will have lots of charts saved here if you used verbose=2 option a lot. - `lowess` - this option is very nice for small datasets where you can see regression lines for each pair of continuous variable against the target variable. Don't use this for large data sets (that is over 100,000 rows) - `chart_format` - this can be `'svg', 'png', 'jpg'` or `'bokeh'` or `'server'` or `'html'`. You will get charts generated (inline with `verbose=0` or `1` option). Instead you can silently save them in multiple formats if you used `verbose=2` option. The latter options are useful for interactive charts. - `max_rows_analyzed` - limits the max number of rows that is used to display charts. If you have a very large data set with millions of rows, then use this option to limit the amount of time it takes to generate charts. We will take a statistically valid sample. - `max_cols_analyzed` - limits the number of continuous vars that can be analyzed - `save_plot_dir` - directory you want the plots to be saved. Default is None which means it is saved under the current directory under a sub-folder named `AutoViz_Plots`. If the `save_plot_dir` does not exist, it creates it. ![server_charts](server_charts.JPG) ## Maintainers * [@AutoViML](https://github.com/AutoViML) * [@morenoh149](https://github.com/morenoh149) * [@hironroy](https://github.com/hironroy) ## Contributing See [the contributing file](contributing.md)! PRs accepted. ## License Apache License, Version 2.0 ## DISCLAIMER This project is not an official Google project. It is not supported by Google and Google specifically disclaims all warranties as to its quality, merchantability, or fitness for a particular purpose. %package -n python3-autoviz Summary: Automatically Visualize any dataset, any size with a single line of code Provides: python-autoviz BuildRequires: python3-devel BuildRequires: python3-setuptools BuildRequires: python3-pip %description -n python3-autoviz # AutoViz Automatically Visualize any dataset, any size with a single line of code. Now you can save these interactive charts as HTML files automatically with the `"html"` setting. ### Sep-2022 Update: AutoViz now provides data cleansing suggestions! #autoviz #datacleaning From version 0.1.50 onwards, AutoViz now automatically analyzes your dataset and provides suggestions for how to clean your data set. It detects missing values, identifies rare categories, finds infinite values, detects mixed data types, and so much more. This will help you tremendously speed up your data cleaning activities. If you have suggestions to add more data cleaning steps please file an `Issue` in our GitHub and we will gladly consider it. Here is an example of how data cleaning suggestions look:
In order to get this latest function, you must upgrade autoviz to the latest version by: ``` pip install autoviz --upgrade ``` In the same version, you can also get data suggestions by using `AV.AutoViz(......, verbose=1)` or by simply importing it:
``` from autoviz import data_cleaning_suggestions data_cleaning_suggestions(df) ``` ### Dec-23-2021 Update: AutoViz now does Wordclouds! #autoviz #wordcloud AutoViz can now create Wordclouds automatically for your NLP variables in data. It detects NLP variables automatically and creates wordclouds for them. See Colab notebook for example: [AutoViz Demo with HTML setting](https://colab.research.google.com/drive/1r5QqESRZDY98FFfDOgVtMAVA_oaGtqqx?usp=sharing) ### Dec 21, 2021: AutoViz now runs on Docker containers as part of MLOps pipelines. Check out Orchest.io We are excited to announce that AutoViz and Deep_AutoViML are now available as containerized applications on Docker. This means that you can build data pipelines using a fantastic tool like [orchest.io](orchest.io) to build MLOps pipelines visually. Here are two sample pipelines we have created: AutoViz pipeline: https://lnkd.in/g5uC-z66 Deep_AutoViML pipeline: https://lnkd.in/gdnWTqCG You can find more examples and a wonderful video on [orchest's web site](https://github.com/orchest/orchest-examples) ![banner](https://github.com/rsesha/autoviz_pipeline/blob/main/autoviz_orchest.png) ### Dec-17-2021 AutoViz now uses HoloViews to display dashboards with Bokeh and save them as Dynamic HTML for web serving #HTML #Bokeh #Holoviews Now you can use AutoViz to create Interactive Bokeh charts and dashboards (see below) either in Jupyter Notebooks or in the browser. Use chart_format as follows: - `chart_format='bokeh'`: interactive Bokeh dashboards are plotted in Jupyter Notebooks. - `chart_format='server'`, dashboards will pop up for each kind of chart on your web browser. - `chart_format='html'`, interactive Bokeh charts will be silently saved as Dynamic HTML files under `AutoViz_Plots` directory ![banner](intro2.gif) [![Pepy Downloads](https://pepy.tech/badge/autoviz)](https://pepy.tech/project/autoviz) [![Pepy Downloads per week](https://pepy.tech/badge/autoviz/week)](https://pepy.tech/project/autoviz) [![Pepy Downloads per month](https://pepy.tech/badge/autoviz/month)](https://pepy.tech/project/autoviz) [![standard-readme compliant](https://img.shields.io/badge/standard--readme-OK-green.svg)](https://github.com/RichardLitt/standard-readme) [![Python Versions](https://img.shields.io/pypi/pyversions/autoviz.svg)](https://pypi.org/project/autoviz) [![PyPI Version](https://img.shields.io/pypi/v/autoviz.svg)](https://pypi.org/project/autoviz) [![PyPI License](https://img.shields.io/pypi/l/autoviz.svg)](https://github.com/AutoViML/AutoViz/blob/master/LICENSE) AutoViz performs automatic visualization of any dataset with one line of code. Give it any input file (CSV, txt or json format) of any size and AutoViz will visualize it, provided you set the `max_rows_analyzed` and `max_cols_analyzed` setting within the bounds of your machine's memory limit. AutoViz can now create charts in multiple formats using the `chart_format` setting: - If `chart_format ='png'` or `'svg'` or `'jpg'`: Matplotlib charts are plotted inline. * Can be saved locally (using `verbose=2` setting) or displayed (`verbose=1`) in Jupyter Notebooks. * This is the default behavior for AutoViz. - If `chart_format='bokeh'`: Interactive Bokeh charts are plotted in Jupyter Notebooks. - If `chart_format='server'`, dashboards will pop up for each kind of chart on your browser. - If `chart_format='html'`, interactive Bokeh charts will be created and silently saved as HTML files under the `AutoViz_Plots` directory (under working folder) or any other directory that you specify using the `save_plot_dir` setting (during input). ## Table of Contents - [Install](#install) - [Usage](#usage) - [API](#api) - [Maintainers](#maintainers) - [Contributing](#contributing) - [License](#license) ## Install **Prerequsites** - [Anaconda](https://docs.anaconda.com/anaconda/install/) To clone AutoViz, it's better to create a new environment, and install the required dependencies: To install from PyPi: ```sh conda create -n python=3.7 anaconda conda activate # ON WINDOWS: `source activate ` pip install autoviz ``` To install from source: ```sh cd git clone git@github.com:AutoViML/AutoViz.git # or download and unzip https://github.com/AutoViML/AutoViz/archive/master.zip conda create -n python=3.7 anaconda conda activate # ON WINDOWS: `source activate ` cd AutoViz pip install -r requirements.txt ``` ## Usage Read this Medium article to know how to use [AutoViz](https://towardsdatascience.com/autoviz-a-new-tool-for-automated-visualization-ec9c1744a6ad). In the AutoViz directory, open a Jupyter Notebook and use this line to instantiate the library ```py from autoviz.AutoViz_Class import AutoViz_Class AV = AutoViz_Class() ``` Load a dataset (any CSV or text file) into a Pandas dataframe or give the name of the path and filename you want to visualize. If you don't have a filename, you can simply assign the filename argument `""` (empty string). Call AutoViz using the filename (or dataframe) along with the separator and the name of the target variable in the input. ```py filename = "" sep = "," dft = AV.AutoViz( filename, sep=",", depVar="", dfte=None, header=0, verbose=0, lowess=False, chart_format="svg", max_rows_analyzed=150000, max_cols_analyzed=30, save_plot_dir=None ) ``` AutoViz will do the rest. You will see charts and plots on your screen. ![var_charts](var_charts.JPG) `AV.AutoViz` is the main plotting function in AV. Depending on what `chart_format` you choose, AutoViz will automatically call either the `AutoViz_Main` function or `AutoViz_Holo` function. **Notes:** * AutoViz will visualize any sized file using a statistically valid sample. * `COMMA` is assumed as default separator in file. But you can change it. * Assumes first row as header in file but you can change it. - `verbose` option - if 0, display minimal information but displays charts on your notebook - if 1, print extra information on the notebook and also display charts - if 2, will not display any charts, it will simply save them in your local machine under `AutoViz_Plots` directory under your current working folder. - `chart_format` option - if `'svg','jpg' or 'png'`, displays all charts or saves them depending on verbose option. - if `'bokeh'`, plots interactive charts using Bokeh on your Jupyter Notebook - if `'server'`, will display charts on your browser with one chart type in each tab - if `'html'`, will create bokeh interactive charts and silently save them under `AutoViz_Plots` directory or any directory you specify in the `save_plot_dir` setting. ![bokeh_charts](bokeh_charts.JPG) ## API **Arguments** - `filename` - Make sure that you give filename as empty string ("") if there is no filename associated with this data and you want to use a dataframe, then use dfte to give the name of the dataframe. Otherwise, fill in the file name and leave dfte as empty string. Only one of these two is needed to load the data set. - `sep` - this is the separator in the file. It can be comma, semi-colon or tab or any value that you see in your file that separates each column. - `depVar` - target variable in your dataset. You can leave it as empty string if you don't have a target variable in your data. - `dfte` - this is the input dataframe in case you want to load a pandas dataframe to plot charts. In that case, leave filename as an empty string. - `header` - the row number of the header row in your file. If it is the first row, then this must be zero. - `verbose` - it has 3 acceptable values: 0, 1 or 2. With zero, you get all charts but limited info. With 1 you get all charts and more info. With 2, you will not see any charts but they will be quietly generated and save in your local current directory under the AutoViz_Plots directory which will be created. Make sure you delete this folder periodically, otherwise, you will have lots of charts saved here if you used verbose=2 option a lot. - `lowess` - this option is very nice for small datasets where you can see regression lines for each pair of continuous variable against the target variable. Don't use this for large data sets (that is over 100,000 rows) - `chart_format` - this can be `'svg', 'png', 'jpg'` or `'bokeh'` or `'server'` or `'html'`. You will get charts generated (inline with `verbose=0` or `1` option). Instead you can silently save them in multiple formats if you used `verbose=2` option. The latter options are useful for interactive charts. - `max_rows_analyzed` - limits the max number of rows that is used to display charts. If you have a very large data set with millions of rows, then use this option to limit the amount of time it takes to generate charts. We will take a statistically valid sample. - `max_cols_analyzed` - limits the number of continuous vars that can be analyzed - `save_plot_dir` - directory you want the plots to be saved. Default is None which means it is saved under the current directory under a sub-folder named `AutoViz_Plots`. If the `save_plot_dir` does not exist, it creates it. ![server_charts](server_charts.JPG) ## Maintainers * [@AutoViML](https://github.com/AutoViML) * [@morenoh149](https://github.com/morenoh149) * [@hironroy](https://github.com/hironroy) ## Contributing See [the contributing file](contributing.md)! PRs accepted. ## License Apache License, Version 2.0 ## DISCLAIMER This project is not an official Google project. It is not supported by Google and Google specifically disclaims all warranties as to its quality, merchantability, or fitness for a particular purpose. %package help Summary: Development documents and examples for autoviz Provides: python3-autoviz-doc %description help # AutoViz Automatically Visualize any dataset, any size with a single line of code. Now you can save these interactive charts as HTML files automatically with the `"html"` setting. ### Sep-2022 Update: AutoViz now provides data cleansing suggestions! #autoviz #datacleaning From version 0.1.50 onwards, AutoViz now automatically analyzes your dataset and provides suggestions for how to clean your data set. It detects missing values, identifies rare categories, finds infinite values, detects mixed data types, and so much more. This will help you tremendously speed up your data cleaning activities. If you have suggestions to add more data cleaning steps please file an `Issue` in our GitHub and we will gladly consider it. Here is an example of how data cleaning suggestions look:
In order to get this latest function, you must upgrade autoviz to the latest version by: ``` pip install autoviz --upgrade ``` In the same version, you can also get data suggestions by using `AV.AutoViz(......, verbose=1)` or by simply importing it:
``` from autoviz import data_cleaning_suggestions data_cleaning_suggestions(df) ``` ### Dec-23-2021 Update: AutoViz now does Wordclouds! #autoviz #wordcloud AutoViz can now create Wordclouds automatically for your NLP variables in data. It detects NLP variables automatically and creates wordclouds for them. See Colab notebook for example: [AutoViz Demo with HTML setting](https://colab.research.google.com/drive/1r5QqESRZDY98FFfDOgVtMAVA_oaGtqqx?usp=sharing) ### Dec 21, 2021: AutoViz now runs on Docker containers as part of MLOps pipelines. Check out Orchest.io We are excited to announce that AutoViz and Deep_AutoViML are now available as containerized applications on Docker. This means that you can build data pipelines using a fantastic tool like [orchest.io](orchest.io) to build MLOps pipelines visually. Here are two sample pipelines we have created: AutoViz pipeline: https://lnkd.in/g5uC-z66 Deep_AutoViML pipeline: https://lnkd.in/gdnWTqCG You can find more examples and a wonderful video on [orchest's web site](https://github.com/orchest/orchest-examples) ![banner](https://github.com/rsesha/autoviz_pipeline/blob/main/autoviz_orchest.png) ### Dec-17-2021 AutoViz now uses HoloViews to display dashboards with Bokeh and save them as Dynamic HTML for web serving #HTML #Bokeh #Holoviews Now you can use AutoViz to create Interactive Bokeh charts and dashboards (see below) either in Jupyter Notebooks or in the browser. Use chart_format as follows: - `chart_format='bokeh'`: interactive Bokeh dashboards are plotted in Jupyter Notebooks. - `chart_format='server'`, dashboards will pop up for each kind of chart on your web browser. - `chart_format='html'`, interactive Bokeh charts will be silently saved as Dynamic HTML files under `AutoViz_Plots` directory ![banner](intro2.gif) [![Pepy Downloads](https://pepy.tech/badge/autoviz)](https://pepy.tech/project/autoviz) [![Pepy Downloads per week](https://pepy.tech/badge/autoviz/week)](https://pepy.tech/project/autoviz) [![Pepy Downloads per month](https://pepy.tech/badge/autoviz/month)](https://pepy.tech/project/autoviz) [![standard-readme compliant](https://img.shields.io/badge/standard--readme-OK-green.svg)](https://github.com/RichardLitt/standard-readme) [![Python Versions](https://img.shields.io/pypi/pyversions/autoviz.svg)](https://pypi.org/project/autoviz) [![PyPI Version](https://img.shields.io/pypi/v/autoviz.svg)](https://pypi.org/project/autoviz) [![PyPI License](https://img.shields.io/pypi/l/autoviz.svg)](https://github.com/AutoViML/AutoViz/blob/master/LICENSE) AutoViz performs automatic visualization of any dataset with one line of code. Give it any input file (CSV, txt or json format) of any size and AutoViz will visualize it, provided you set the `max_rows_analyzed` and `max_cols_analyzed` setting within the bounds of your machine's memory limit. AutoViz can now create charts in multiple formats using the `chart_format` setting: - If `chart_format ='png'` or `'svg'` or `'jpg'`: Matplotlib charts are plotted inline. * Can be saved locally (using `verbose=2` setting) or displayed (`verbose=1`) in Jupyter Notebooks. * This is the default behavior for AutoViz. - If `chart_format='bokeh'`: Interactive Bokeh charts are plotted in Jupyter Notebooks. - If `chart_format='server'`, dashboards will pop up for each kind of chart on your browser. - If `chart_format='html'`, interactive Bokeh charts will be created and silently saved as HTML files under the `AutoViz_Plots` directory (under working folder) or any other directory that you specify using the `save_plot_dir` setting (during input). ## Table of Contents - [Install](#install) - [Usage](#usage) - [API](#api) - [Maintainers](#maintainers) - [Contributing](#contributing) - [License](#license) ## Install **Prerequsites** - [Anaconda](https://docs.anaconda.com/anaconda/install/) To clone AutoViz, it's better to create a new environment, and install the required dependencies: To install from PyPi: ```sh conda create -n python=3.7 anaconda conda activate # ON WINDOWS: `source activate ` pip install autoviz ``` To install from source: ```sh cd git clone git@github.com:AutoViML/AutoViz.git # or download and unzip https://github.com/AutoViML/AutoViz/archive/master.zip conda create -n python=3.7 anaconda conda activate # ON WINDOWS: `source activate ` cd AutoViz pip install -r requirements.txt ``` ## Usage Read this Medium article to know how to use [AutoViz](https://towardsdatascience.com/autoviz-a-new-tool-for-automated-visualization-ec9c1744a6ad). In the AutoViz directory, open a Jupyter Notebook and use this line to instantiate the library ```py from autoviz.AutoViz_Class import AutoViz_Class AV = AutoViz_Class() ``` Load a dataset (any CSV or text file) into a Pandas dataframe or give the name of the path and filename you want to visualize. If you don't have a filename, you can simply assign the filename argument `""` (empty string). Call AutoViz using the filename (or dataframe) along with the separator and the name of the target variable in the input. ```py filename = "" sep = "," dft = AV.AutoViz( filename, sep=",", depVar="", dfte=None, header=0, verbose=0, lowess=False, chart_format="svg", max_rows_analyzed=150000, max_cols_analyzed=30, save_plot_dir=None ) ``` AutoViz will do the rest. You will see charts and plots on your screen. ![var_charts](var_charts.JPG) `AV.AutoViz` is the main plotting function in AV. Depending on what `chart_format` you choose, AutoViz will automatically call either the `AutoViz_Main` function or `AutoViz_Holo` function. **Notes:** * AutoViz will visualize any sized file using a statistically valid sample. * `COMMA` is assumed as default separator in file. But you can change it. * Assumes first row as header in file but you can change it. - `verbose` option - if 0, display minimal information but displays charts on your notebook - if 1, print extra information on the notebook and also display charts - if 2, will not display any charts, it will simply save them in your local machine under `AutoViz_Plots` directory under your current working folder. - `chart_format` option - if `'svg','jpg' or 'png'`, displays all charts or saves them depending on verbose option. - if `'bokeh'`, plots interactive charts using Bokeh on your Jupyter Notebook - if `'server'`, will display charts on your browser with one chart type in each tab - if `'html'`, will create bokeh interactive charts and silently save them under `AutoViz_Plots` directory or any directory you specify in the `save_plot_dir` setting. ![bokeh_charts](bokeh_charts.JPG) ## API **Arguments** - `filename` - Make sure that you give filename as empty string ("") if there is no filename associated with this data and you want to use a dataframe, then use dfte to give the name of the dataframe. Otherwise, fill in the file name and leave dfte as empty string. Only one of these two is needed to load the data set. - `sep` - this is the separator in the file. It can be comma, semi-colon or tab or any value that you see in your file that separates each column. - `depVar` - target variable in your dataset. You can leave it as empty string if you don't have a target variable in your data. - `dfte` - this is the input dataframe in case you want to load a pandas dataframe to plot charts. In that case, leave filename as an empty string. - `header` - the row number of the header row in your file. If it is the first row, then this must be zero. - `verbose` - it has 3 acceptable values: 0, 1 or 2. With zero, you get all charts but limited info. With 1 you get all charts and more info. With 2, you will not see any charts but they will be quietly generated and save in your local current directory under the AutoViz_Plots directory which will be created. Make sure you delete this folder periodically, otherwise, you will have lots of charts saved here if you used verbose=2 option a lot. - `lowess` - this option is very nice for small datasets where you can see regression lines for each pair of continuous variable against the target variable. Don't use this for large data sets (that is over 100,000 rows) - `chart_format` - this can be `'svg', 'png', 'jpg'` or `'bokeh'` or `'server'` or `'html'`. You will get charts generated (inline with `verbose=0` or `1` option). Instead you can silently save them in multiple formats if you used `verbose=2` option. The latter options are useful for interactive charts. - `max_rows_analyzed` - limits the max number of rows that is used to display charts. If you have a very large data set with millions of rows, then use this option to limit the amount of time it takes to generate charts. We will take a statistically valid sample. - `max_cols_analyzed` - limits the number of continuous vars that can be analyzed - `save_plot_dir` - directory you want the plots to be saved. Default is None which means it is saved under the current directory under a sub-folder named `AutoViz_Plots`. If the `save_plot_dir` does not exist, it creates it. ![server_charts](server_charts.JPG) ## Maintainers * [@AutoViML](https://github.com/AutoViML) * [@morenoh149](https://github.com/morenoh149) * [@hironroy](https://github.com/hironroy) ## Contributing See [the contributing file](contributing.md)! PRs accepted. ## License Apache License, Version 2.0 ## DISCLAIMER This project is not an official Google project. It is not supported by Google and Google specifically disclaims all warranties as to its quality, merchantability, or fitness for a particular purpose. %prep %autosetup -n autoviz-0.1.58 %build %py3_build %install %py3_install install -d -m755 %{buildroot}/%{_pkgdocdir} if [ -d doc ]; then cp -arf doc %{buildroot}/%{_pkgdocdir}; fi if [ -d docs ]; then cp -arf docs %{buildroot}/%{_pkgdocdir}; fi if [ -d example ]; then cp -arf example %{buildroot}/%{_pkgdocdir}; fi if [ -d examples ]; then cp -arf examples %{buildroot}/%{_pkgdocdir}; fi pushd %{buildroot} if [ -d usr/lib ]; then find usr/lib -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/lib64 ]; then find usr/lib64 -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/bin ]; then find usr/bin -type f -printf "/%h/%f\n" >> filelist.lst fi if [ -d usr/sbin ]; then find usr/sbin -type f -printf "/%h/%f\n" >> filelist.lst fi touch doclist.lst if [ -d usr/share/man ]; then find usr/share/man -type f -printf "/%h/%f.gz\n" >> doclist.lst fi popd mv %{buildroot}/filelist.lst . mv %{buildroot}/doclist.lst . %files -n python3-autoviz -f filelist.lst %dir %{python3_sitelib}/* %files help -f doclist.lst %{_docdir}/* %changelog * Tue Apr 11 2023 Python_Bot - 0.1.58-1 - Package Spec generated