The purpose of this article is to explain how to set up Training Set


Purpose

Splits the data into two groups for training and testing purposes. The split allows evaluating model performance using a dataset the model haven't learned.

Steps to run

  1. Go to the Tools menu
  2. Select the Data Science option
  3. Select the Training Set
  4. Select Data Table, Holdout (%), Seed, and the name of the column.

Training Set Inputs

  1. Data Table - Data table used for the set.
  2. Holdout (%) -  part of the training dataset that is set aside.
  3. Seed - is a number or other value that is generated 
  4. Name - Name of the column

            Note: After clicking OK, a new column will be added to the data table for either true or false.

Output

  • One Pie Chart Visualization 
  • Green - True
  • Blue - False

Example

For additional information on RAI Data Science Toolkit documentation, click here.