The purpose of this article is to explain how to setup Scatterplot Matrix


Scatter Plot Matrices are a great way to roughly determine if you have a linear correlation between multiple variables. This is particularly helpful in pinpointing specific variables that might have a similar correlation. Data is divided into groups with each groups containing a specified number of traits. These traits are displayed in a matrix format so that traits can be compared. Along the diagonal, there are histograms for each trait. Above the cells of the histogram are the labels for the traits. Above the cells of the scatterplots is the Pearson Coefficient Score. Groups are colored differently.



Purpose: Use the scatterplot matrix extension to understand the relationship between parameters. This visualization creates a scatter plot per each combination of variable allowing got identify correlation. 



Steps to run:


    1. Click the Scatterplot icon on the toolbar


 Another way to access Scatterplot Matrix Visualization

  1. Click "Insert" on the top menu bar. 
  2. Go to "Visualization" 
  3. Click Scatterplot Matrix.


 

    2. Click on the Properties menu on the top right corner of the visualization, and select the parameters.


Properties Menu Inputs

  1. General 
    •  Title:  the title of the visualization
    •  Description: Add a description of this visualization

  2. Data
    • Data Table: Add the data table.
    • Filtering, Marking, or limit by marking.

  3. Appearance
    • Show/Hide Legend
    • Show/Hide Pearson Correlation
    • Show/Hide Tooltips
    • Number of Bin for the Histogram(Traits) - How many bars on the Histogram
    • Text scale - Size of the text ( Pearson Correlation Score and Traits title)

  4. Selection Mode
    •  Choose either Pointer or Rectangle.
    •  The pointer arrow ONLY selects the Histogram visualization
    •  The rectangle setting is able to select parts of the scatterplot for further analysis and also able to mark on histograms.

  5. Column: Group Selection
    • Choose the column group
    • This setting will specify what to use to distinguish between groups

  6. Column: Trait Selection
    • Number of traits: This is where you can add more traits to your visualization.
    • Note when adding more or fewer traits click save for the visualizations to save the change, then go back to add the traits.
    • Traits 1,2,3, etc. - These are the traits that are going to be shown on the visualization. (HIstogram, Diagonal)

  7. Click Save after selecting all the settings. 




Outputs:

  • One Scatterplot Matrix Visualization 
  • Histogram (Diagonal) - Traits 
  • ScatterPlot - correlation between the multiple variables.
    • (Above each ScatterPlot - Pearson Coefficient Score)


Example: This Image was taken from "Scatterplot Matrix Example" template from the Exchange.ai Portal. - Below is a link.





How do I mark on the visualization?


To mark on the Scatterplot visualization, the settings in the properties menu under Selection Mode must be on the rectangle.

  • This will draw a rectangle box when selecting the on the scatterplot. 
  • To know this is activated there will be a (+) when panning across scatterplot visualization. 


To mark on the Histogram, it could either be pointer or rectangle settings in the properties menu under Selection Mode. 

  • Pointer/Rectangle selects one bar on the histogram. 




How to filter a subset of data:

  1. Open the filter panel by clicking the filter icon on the top bar.
  2. Choose the correct filtering scheme.
    • This will automatically filter the visualization upon selection. 





Link for Template example on the portal below:


Scatter Plot Matrix Template Example