A Q-learning-based Multipath Scheduler for Data Transmission Optimization in Heterogeneous Wireless Networks