The purpose of this article is to explain how to setup Training Set




Purpose: Splits the data into two groups for training and testing purposes. The split allows to evaluate model performance using a dataset the model haven't learned.




Steps to run:

  1. Go to the Tools menu
  2. Select the Data Science option
  3. Select the Training Set
  4. Select Data Table, Holdout (%), Seed, and the name of the column.







Inputs:

  1. Data Table - Data table used for the set.
  2. Holdout (%) -  part of the training dataset that is set aside.
  3. Seed - is a number or other value that is generated 
  4. Name - Name of the column

            Note: The results (True/False) will be appended to the original data column.









Output:

  • One Pie Chart Visualization 
  • Green - True
  • Blue - False



Example: