palavras-chave Black Box Optimization Dynamic Movement Primitives Parametrized Policies Path Integral Reinforcement Learning Robotics