Bellman
Jupyter Notebooks
Approximating MDPs
Learning from samples
Trajectory Optimisation
API Reference
bellman
bellman.agents
bellman.agents.background_planning
bellman.agents.decision_time_planning
bellman.agents.mbpo
bellman.agents.mepo
bellman.agents.pets
bellman.agents.trpo
bellman.benchmark
bellman.distributions
bellman.drivers
bellman.environments
bellman.harness
bellman.networks
bellman.policies
bellman.training
bellman.trajectory_optimisers
Bellman
»
bellman
»
bellman.agents
»
bellman.agents.trpo
bellman.agents.trpo
ΒΆ