Reinforcement Learning and Robotics – Part II: Policy Gradient Control for Mobile Robotics Application
Gustavo Andrade – C2SR, SYSTEC, FEUP
Room B009