方略学科导航

搜索结果: 1-15 共查到“知识库 Reinforcement learning”相关记录16条 . 查询时间(0.093 秒)

VISIBLE ROUTES IN 3D DENSE CITY USING REINFORCEMENT LEARNING 3D GIS Visibility Routes Reinforcement learning 2018/11/9

In the last few years, the 3D GIS domain has developed rapidly, and has become increasingly accessible to different disciplines. 3D Spatial analysis of Built-up areas seems to be one of the most chall...

存档附件原文地址

Kernel-Based Reinforcement Learning in Average-Cost Problems Average–cost problem dynamic programming kernel smoothing local averaging Markov decision process (MDP) 2015/7/8

Reinforcement learning (RL) is concerned with the identification of optimal controls in Markov decision processes (MDPs) where no explicit model of the transition probabilities is available. Many exis...

存档附件原文地址

ADAPTIVE STEP-SIZES FOR REINFORCEMENT LEARNING reinforcement learning machine learning step-size learning rate evaluation adaptive 2014/12/18

The central theme motivating this dissertation is the desire to develop reinforcement learning algorithms that “just work” regardless of the domain in which they are applied. The largest impediment to...

存档附件原文地址

Reinforcement Learning for Mapping Instructions to Actions Actions Mapping Instructions 2014/11/26

Reinforcement Learning for Mapping Instructions to Actions。

存档附件原文地址

Electric Power Market Modeling with Multi-Agent Reinforcement Learning Electric Power Market Modeling Multi-Agent Reinforcement Learning 2014/10/22

Agent-based modeling (ABM) is a relatively new tool for use in electric power market research. At heart are software agents representing real-world stakeholders in the industry: utilities, power produ...

存档附件原文地址

Reinforcement Learning for the Soccer Dribbling Task Reinforcement Learning Soccer Dribbling Task 2013/6/17

We propose a reinforcement learning solution to the \emph{soccer dribbling task}, a scenario in which a soccer agent has to go from the beginning to the end of a region keeping possession of the ball,...

存档附件原文地址

Cover Tree Bayesian Reinforcement Learning Cover Tree Bayesian Learning 2013/6/14

This paper proposes an online tree-based Bayesian approach for reinforcement learning. For inference, we employ a generalised context tree model. This defines a distribution on multivariate Gaussian p...

存档附件原文地址

Regret Bounds for Reinforcement Learning with Policy Advice Regret Bounds Reinforcement LearningPolicy Advice 2013/6/13

In some reinforcement learning problems an agent may be provided with a set of input policies, perhaps learned from prior experience or provided by advisors. We present a reinforcement learning with p...

存档附件原文地址

ABC Reinforcement Learning ABC Reinforcement Learning 2013/4/28

This paper introduces a simple, general framework for likelihood-free Bayesian reinforcement learning, through Approximate Bayesian Computation (ABC). The main advantage is that we only require a prio...

存档附件原文地址

Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems Efficient Reinforcement Learning High Dimensional Linear Quadratic Systems 2013/4/28

We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control ...

存档附件原文地址

A Greedy Approximation of Bayesian Reinforcement Learning with Probably Optimistic Transition Model Reinforcement Learning Uncertain Knowledge Probabilistic Reasoning Optimal Behavior in Polynomial Time 2013/5/2

Bayesian Reinforcement Learning (RL) is capable of not only incorporating domain knowledge, but also solving the exploration-exploitation dilemma in a natural way. As Bayesian RL is intractable except...

存档附件原文地址

Monte-Carlo utility estimates for Bayesian reinforcement learning Monte-Carlo estimates Bayesian reinforcement learning 2013/5/2

This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. Firstly, Monte-Carlo estimation of upper bounds on the Bayes-optimal value function is employed to construct ...

存档附件原文地址

Bayesian multitask inverse reinforcement learning Bayesian inference multitask learning inverse reinforce-ment learning 2011/7/6

We generalise the problem of inverse reinforcement learning to multiple tasks, from a set of demonstrations. Each demonstration may represent one expert trying to solve a different task.

存档附件原文地址

Optimal Reinforcement Learning for Gaussian Systems Optimal Reinforcement Learning Gaussian Systems 2011/7/5

The exploration-exploitation tradeoff is among the central challenges of reinforcement learning. A hypothetical exact Bayesian learner would provide the optimal solution, but is intractable in general...

存档附件原文地址

Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's Dilemma Numerical analysis reinforcement learning model 2011/1/5

Humans and other animals can adapt their social behavior in response to environmental cues including the feedback obtained through experience. Nevertheless, the effects of the experience-based learnin...

存档附件原文地址

中国研究生教育排行榜-条

正在加载...

中国学术期刊排行榜-条

正在加载...

世界大学科研机构排行榜-条

正在加载...

中国大学排行榜-条

正在加载...

人　物-篇

正在加载...

课　件-篇

正在加载...

视听资料-篇

正在加载...

研招资料 -篇

正在加载...

知识要闻-篇

正在加载...

国际动态-篇

正在加载...

会议中心-篇

正在加载...

学术指南-篇

正在加载...

学术站点-篇

正在加载...

中国研究生教育排行榜-条

中国学术期刊排行榜-条

世界大学科研机构排行榜-条

中国大学排行榜-条

人 物-篇

课 件-篇

视听资料-篇

知识库-篇

研招资料 -篇

知识要闻-篇

国际动态-篇

会议中心-篇

学术指南-篇

学术站点-篇

人　物-篇

课　件-篇