Reinforcement Learning Deep-Q Learning