TODO preprocessor - subtract mean, divide by std dev weights - normalRandom(n) * sqrt(2.0/n) batch normalization l2, l1 regularization dropout cross entropy