Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning