( PDF) A penalty function method for exploratory adaptive-critic #NeuralNetwork control gianluca di muro - Academia .edu