TRAINING COURSE "SUPERVISED NEURAL NETWORKS"

 

Supervised Neural networks are simply the best regression models at this time, thanks to their capacity to approximate just about any regression function. Of course, this capacity translates into exceptional performances in probabilistic classification. These qualities are now widely acknowledged, and just about every major Data Mining software now incorporates supervised Neural Networks.

Yet, this very power is a double-edged sword, and many Neural Networks-based projects end up in closets because of a lack of understanding of their basic inner workings. As a consequence, these truly wondeful tools are widely underused, and their reputation is not up their true capabilities.


Depending on previous background, this 1, 2 or 3 days training (see outline below) will bring you up to the necessary level for fully taking advantage of the power of the most popular supervised Neural networks. The training is illustrated with many demonstrations done with commercial software.
 

Course outline

General principles

The concept of "universal approximation"

Universal approximation and regression

Regression and probabilistic classification

The Multilayer Perceptron (MLP)

The "standard" neuron

Activation function

Role of the weights : slope and orientation

Threshold (or bias) : a special weight

Architecture and main properties of the MLP

Role of the hidden layer

How many neurons in the hidden layer ?

Classification : output layer and a posteriori probabilities

MLP and parcimonious approximation

Training of a MLP

Chosing an error function in classification

Initialisation before training

Descent optimization and local minima

First order training algorithms ("BackProp")

Second order training algorithms (Quasi-Newton, Levenberg-Marquardt, Conjugate Gradients)

Overparametrization and overtraining, validation set

RBF networks

The notion of "influence zone"

What are "Radial Basis Functions" ?

Architecture and main properties of RBF networks

Training of RBF networks

Gradient descent methods

Heuristics

MLP or RBF : which one should you chose ?

Training

Generalization capacity

How to use (supervised) Neural networks

When should you use (or not use) Neural networks ?

Comparison of Neural networks with other regression techniques (Linear Regression, Multiple Linear Regression), and classification techniques (Discriminant Analysis, Logistic Regression, Decision Trees)

How do you estimate the generalization capacity of a Neural Network ?

Validation, multiple validation

Re-sampling methods : cross-validation, "leave-one-out", bootstrap

Leverage and confidence interval. Simulated  Leave-one-out

How can you improve the performance of a Neural Network ?

Chosing the right variables, dimensionality reduction

How to optimize the hidden layer

"Early stopping", regularization, pruning

Cooperation between several networks in regression and classification