Appendix C: Notation

Uppercase are used for sets, and Greek letters represent parameters of the algorithms.

Set of states.

Individual states. Full views.

Number of states.

$FD=\{$ fd $_i \: \vert \: i=1..n_f\}$ Set of feature detectors.

fd $_{i_1}, \ldots,$ fd $_{i_k})$ Partial view of order .

Set of actions of the robot.

Number of actions.

$EA=\{ea_i \: \vert \: i=1..n_e\}$ Set of elementary actions.

Number of motors of the robot.

$ea_i=(m_i \leftarrow k)$ Elementary action that assigns value to motor .

$c(ea_{i_1}, \ldots, ea_{i_k})$ Partial command of order .

$a=(ea_{1},\ldots,ea_{n_m})$ Action. Combination of elementary actions. Full command.

Partial rule composed by partial view and partial command .

$w_{\emptyset}$ The empty partial rule.

$w_{1} \oplus w_{2}$ Composition of two partial rules.

$C=\{w_i \: \vert \: i=1..n_r\}$ Controller or set of partial rules.

$\mu$ Maximum number of elements of .

$C',C'_{ant}$ Subset of rules active at a given time step and at the previous one.

Active rules with a partial command in accordance with .

Expected value of the partial rule .

Expected error in the value estimation of the partial rule .

$\overline{e}$ Average error in the value prediction.

Confidence index.

Confidence on the statistics of the partial rule .

$\beta$ Top value of the confidence.

$\eta$ Index where the confidence function reaches the value $\beta$ .

$\epsilon_w = e_w \: c_w + \overline{e} \: (1-c_w)$ Error in the return prediction of the partial rule .

$\rho_w=1/(1+\epsilon_w)$ Relevance of rule .

$I_w=[q_w \pm 2 \epsilon_w]$ Value interval of the partial rule .

Updating ratio for the statistics of the partial rule .

$\alpha$ Learning rate. Top value of .

Number of times rule has been used.

Most relevant active partial rule w.r.t. action .

Most reliable value estimation for action .

Reward received after the execution of .

$\gamma$ Discount factor.

Goodness of a given situation.

$q=r_a+\gamma v$ Value of executing action in given situation.

$\tau$ Number of new partial rules created at a time.

$\lambda$ Redundancy threshold used for partial-rule elimination.

Josep M Porta 2005-02-17

	Set of states.
	Individual states. Full views.
	Number of states.
$FD=\{$ fd $_i \: \vert \: i=1..n_f\}$	Set of feature detectors.
fd $_{i_1}, \ldots,$ fd $_{i_k})$	Partial view of order .
	Set of actions of the robot.
	Number of actions.
$EA=\{ea_i \: \vert \: i=1..n_e\}$	Set of elementary actions.
	Number of motors of the robot.
$ea_i=(m_i \leftarrow k)$	Elementary action that assigns value to motor .
$c(ea_{i_1}, \ldots, ea_{i_k})$	Partial command of order .
$a=(ea_{1},\ldots,ea_{n_m})$	Action. Combination of elementary actions. Full command.
	Partial rule composed by partial view and partial command .
$w_{\emptyset}$	The empty partial rule.
$w_{1} \oplus w_{2}$	Composition of two partial rules.
$C=\{w_i \: \vert \: i=1..n_r\}$	Controller or set of partial rules.
$\mu$	Maximum number of elements of .
$C',C'_{ant}$	Subset of rules active at a given time step and at the previous one.
	Active rules with a partial command in accordance with .
	Expected value of the partial rule .
	Expected error in the value estimation of the partial rule .
$\overline{e}$	Average error in the value prediction.
	Confidence index.
	Confidence on the statistics of the partial rule .
$\beta$	Top value of the confidence.
$\eta$	Index where the confidence function reaches the value $\beta$ .
$\epsilon_w = e_w \: c_w + \overline{e} \: (1-c_w)$	Error in the return prediction of the partial rule .
$\rho_w=1/(1+\epsilon_w)$	Relevance of rule .
$I_w=[q_w \pm 2 \epsilon_w]$	Value interval of the partial rule .
	Updating ratio for the statistics of the partial rule .
$\alpha$	Learning rate. Top value of .
	Number of times rule has been used.
	Most relevant active partial rule w.r.t. action .
	Most reliable value estimation for action .
	Reward received after the execution of .
$\gamma$	Discount factor.
	Goodness of a given situation.
$q=r_a+\gamma v$	Value of executing action in given situation.
$\tau$	Number of new partial rules created at a time.
$\lambda$	Redundancy threshold used for partial-rule elimination.