reinforcement_learning [Neurosurgery Wiki]

This page is read only. You can view the source, but not change it. Ask your administrator if you think this is wrong.

Using this [[algorithm]], the [[machine]] is trained to make specific [[decision]]s. It works this way: the machine is exposed to an [[environment]] where it trains itself continually using [[trial]] and [[error]]. This machine learns from past [[experience]] and tries to capture the best possible [[knowledge]] to make accurate business decisions. Example of Reinforcement Learning: Markov Decision Process.
----


Machine learning [[algorithm]]s can be divided into 3 broad categories — [[supervised learning]], [[unsupervised learning]], and [[reinforcement learning]]. Supervised learning is useful in cases where a property (label) is available for a certain [[dataset]] (training set), but is missing and needs to be predicted for other instances. Unsupervised learning is useful in cases where the challenge is to discover implicit relationships in a given unlabeled dataset (items are not pre-assigned). Reinforcement learning falls between these 2 extremes — there is some form of feedback available for each predictive step or action, but no precise label or error message.