m8ta
You are not authenticated, login.
text: sort by
tags: modified
type: chronology
{1085}
hide / / print
ref: Parush-2011.01 tags: basal ganglia reinforcement learning hypothesis frontiers israel date: 01-24-2012 04:05 gmt revision:2 [1] [0] [head]

PMID-21603228[0] Dopaminergic Balance between Reward Maximization and Policy Complexity.

  • model complexity discounting is an implicit thing.
    • the basal ganglia aim at optimization of independent gain and cost functions. Unlike previously suggested single-variable maximization processes, this multi-dimensional optimization process leads naturally to a softmax-like behavioral policy
  • In order for this to work:
    • dopamine directly affects striatal excitability and thus provides a pseudo-temperature signal that modulates the tradeoff between gain and cost.

____References____

[0] Parush N, Tishby N, Bergman H, Dopaminergic Balance between Reward Maximization and Policy Complexity.Front Syst Neurosci 5no Issue 22 (2011)