Machine_Learning_Guidance

DNN Troubleshooting

OpenAI Talk

Same talk @ IBM

Basic

Intuition

                          Tune hyperparameter
                                  |
Start simple -> Implement & Debug -> Evaluate -> ?
                                  |
                         Improve model & Data

Details

Start simple

Architecture

Arch Start here Consider this afterwards
Images LetNet-like ResNet
Sequences LSTM with one hidden layer
Temporal Convs
Attention model
WaveNet-like
Others MLP with one hidden layer Problem-dependent

Defaults

Data

Normalize scale of input data

Simply the problem itself

Implement

Most common DL bugs

Let the model start Running

Error analysis

Evaluation

Apply the bias-variance decomposition

Error source Value Analysis
Goal performance 1%  
Train error 20% Train - Goal = 19%
Underfit
Validation error 27% Val - Train = 7%
Overfit
Test error 28% Test - Val = 1%
Val overfit

Choose Hyperparameter

Improve model/data