policy iteration

policy iteration
мат. итерация по стратегиям (метод последовательных приближений в пространстве стратегий)

Большой англо-русский и русско-английский словарь. 2001.

Игры ⚽ Нужно сделать НИР?

Смотреть что такое "policy iteration" в других словарях:

  • Markov decision process — Markov decision processes (MDPs), named after Andrey Markov, provide a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker. MDPs are useful for… …   Wikipedia

  • Partially observable Markov decision process — A Partially Observable Markov Decision Process (POMDP) is a generalization of a Markov Decision Process. A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot… …   Wikipedia

  • Нейроуправление — (англ. Neurocontrol)  частный случай интеллектуального управления, использующий искусственные нейронные сети для решения задач управления динамическими объектами. Нейроуправление находится на стыке таких дисциплин, как искусственный… …   Википедия

  • Ronald A. Howard — has been a professor at Stanford University since 1965. In 1964 he defined the profession of decision analysis, and since then has been developing the field as professor in the Department of Engineering Economic Systems (now the Department of… …   Wikipedia

  • MPI — Millî Piyango Idaresi Genel MüDüRlüGü (International » Turkish) ** Medical & Pharmaceutical Information (Medical) * Message Passing Interface (Computing » Networking) * Meeting Professionals International (Business » Firms) * Master Patient Index …   Abbreviations dictionary

  • Land use forecasting — undertakes to project the distribution and intensity of trip generating activities in the urban area. In practice, land use models are demand driven, using as inputs the aggregate information on growth produced by an aggregate economic… …   Wikipedia

  • ancient Greek civilization — ▪ historical region, Eurasia Introduction       the period following Mycenaean civilization, which ended in about 1200 BC, to the death of Alexander the Great, in 323 BC. It was a period of political, philosophical, artistic, and scientific… …   Universalium

  • Delphi method — The Delphi method (  /ˈdɛl …   Wikipedia

  • Committee on the Present Danger — Logo of the Committee on the Present Danger. The Committee on the Present Danger (CPD) is an American foreign policy interest group. Its current stated single goal is to stiffen American resolve to confront the challenge presented by terrorism… …   Wikipedia

  • Intelligence cycle management — This article is at the top level of a series of articles about Intelligence Cycle Management.Within the context of government, military and business affairs, intelligence (the gathering and analysis of accurate, reliable information) is intended… …   Wikipedia

  • ancient Rome — ▪ ancient state, Europe, Africa, and Asia Introduction       the state centred on the city of Rome. This article discusses the period from the founding of the city and the regal period, which began in 753 BC, through the events leading to the… …   Universalium


Поделиться ссылкой на выделенное

Прямая ссылка:
Нажмите правой клавишей мыши и выберите «Копировать ссылку»