Gaussian Processes for Fast Policy Optimisation of POMDP-based Dialogue Managers