Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms (Foundations and Trends® in Optimization)

R 2,761
or 4 x payments of R690.25 with Payflex

Availability: Currently in Stock
Delivery: 10-20 working days
