Pricing and hedging financial derivatives with reinforcement learning methods