The purpose of this article is to explain the QQ Plot.
What is QQ Plot in the Data Science Toolkit?
Purpose:
The QQ plot is used to check whether a column of data follows a normal distribution. Many predictive models assume that the data is normally distributed and may perform poorly when that assumption does not hold. The plot compares the quantiles of the data against the quantiles of a normal distribution: if the points fall close to a straight line, the data is approximately normal. This helps a user decide whether the data is suitable for a given predictive model.
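To make the idea concrete, here is a minimal Python sketch of how the points of a normal QQ plot can be computed. It is not the Data Science Toolkit's implementation: the function name `qq_points`, the sample data, and the plotting position `(i - 0.5) / n` are all assumptions made for illustration (other plotting positions are also in common use).

```python
from statistics import NormalDist


def qq_points(values):
    """Return (normal quantile, sample quantile) pairs for a QQ plot.

    Hypothetical helper for illustration only.
    """
    ordered = sorted(values)          # sample quantiles are the sorted data
    n = len(ordered)
    std_normal = NormalDist()         # standard normal: mean 0, std dev 1
    # ASSUMPTION: plotting positions (i - 0.5) / n, which keep every
    # probability strictly between 0 and 1 so inv_cdf is defined.
    normal_q = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(normal_q, ordered))


# Made-up example column
column = [2.1, 3.4, 1.9, 5.6, 4.2, 3.8, 2.7]
points = qq_points(column)
for normal_quantile, sample_quantile in points:
    print(f"{normal_quantile:7.3f}  {sample_quantile:5.1f}")
```

Plotting the first value of each pair on the x-axis and the second on the y-axis gives the scatter plot. The closer the points lie to a straight line, the closer the column is to a normal distribution.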
Output:
- Single scatter plot visualization
- Cross table for P90, P10, P10/P90, mean, median, and Swanson's mean
- Normal quantiles are on the x-axis and sample quantiles are on the y-axis
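The statistics in the cross table can be sketched in a few lines of Python. This is an illustrative assumption about how such values might be computed, not the toolkit's own code: `summary_table` is a hypothetical name, P10 and P90 are taken here as the 10th and 90th percentiles (some industries use the reverse, exceedance-based convention, which would invert the ratio), and Swanson's mean uses the standard 30-40-30 weighting of P10, P50, and P90.

```python
from statistics import mean, median, quantiles


def summary_table(values):
    """Compute cross-table style statistics for one column.

    Hypothetical helper for illustration only.
    """
    # quantiles(..., n=10) returns the nine cut points at 10%, 20%, ..., 90%.
    deciles = quantiles(values, n=10, method="inclusive")
    # ASSUMPTION: P10 = 10th percentile, P90 = 90th percentile.
    p10, p90 = deciles[0], deciles[-1]
    p50 = median(values)
    return {
        "P90": p90,
        "P10": p10,
        "P10/P90": p10 / p90,
        "mean": mean(values),
        "median": p50,
        # Swanson's mean: 0.3 * P10 + 0.4 * P50 + 0.3 * P90
        "swanson_mean": 0.3 * p10 + 0.4 * p50 + 0.3 * p90,
    }


# Made-up example column: the integers 1 through 11
table = summary_table(list(range(1, 12)))
for name, value in table.items():
    print(f"{name}: {value:.3f}")
```

Because the weights on P10 and P90 are equal, Swanson's mean is the same under either percentile convention. Only the P10/P90 ratio changes.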
Example:
Data Science Toolkit QQ Plot User Guide: How to set up QQ plot
See the RAI QQ Plot video below.
Data Science Toolkit - QQ Plot from Ruths.ai on Vimeo.
For additional RAI Data Science Toolkit documentation, click here.