Comment on the disadvantage of using linear functions as activation functions for multilayer neural networks.”