Course Overview

This course demonstrates how to develop models to predict categorical and continuous outcomes, using such techniques as neural networks, decision trees, logistic regression, support vector machines, and Bayesian network models. Use of the binary classifier and numeric predictor nodes to automate model selection is included. Feature selection and detection of outliers are discussed. Expert options for each modeling node are reviewed in detail and advice is provided on when and how to use each model. You will also learn how to combine two or more models to improve prediction.

Who Should Attend

This advanced course follows either ‘Introduction to IBM SPSS Modeler and Data Mining’ or Advanced Data Preparation with IBM SPSS Modeler is essential for anyone who wishes to become familiar with the full range of modeling techniques available in IBM SPSS Modeler to create predictive models.

Course Certifications

This course is part of the following Certifications:

IBM Certified Specialist – SPSS Modeler Professional v2

Prerequisites

Prerequisites
General computer literacy
Experience using IBM SPSS Modeler (formerly Clementine), including familiarity with the IBM SPSS Modeler environment, creating streams, reading in data files, assessing data quality and handling missing data (including the type and data audit nodes), basic data manipulation (including the derive and select nodes), and creation of models.
Prior completion of Introduction to IBM SPSS Modeler and Data Mining is required and completion of Advanced Data Preparation with IBM SPSS Modeler is strongly encouraged.
An introductory course in statistics, or equivalent experience, would be helpful for the statistics-based modeling techniques.
Introduction to IBM SPSS Modeler and Data Mining (V14.2)
Introduction to IBM SPSS Modeler and Data Mining (V15)
Advanced Data Preparation Using IBM SPSS Modeler (V15)

Course Content

  • Preparing data for modeling
  • Searching for data anomalies
  • Selecting predictors
  • Data reduction with principal components
  • Neural networks
  • Support vector machines
  • Cox regression
  • Time series analysis
  • Decision trees
  • Linear regression
  • Logistic regression
  • Discriminant analysis
  • Bayesian networks
  • Numeric Predictor node
  • Binary Classifier Node
  • Combining models to improve performance
  • Getting the most from models
  • Appendix A: Decision List