A Deep Reinforcement Learning technique incorporating Actor Critic framework and hybrid value networks to improve the dialogue policy.