Training and evaluation of an MDP model for social multi-user human-robot interaction