What is wrong about discounting reward?
Below explains some properties of RL. What is incorrectly connected?
What is wrong about optimal value functions?
We stop the value iteration when:
What is the advantage of policy iteration over value iteration?
Match the algorithms with appropriate target.
| Stavka koja se može prevući | arrow_right_alt | Odgovarajuća stavka |
|---|---|---|
Q-learning | arrow_right_alt | |
TD(0) | arrow_right_alt | |
MC | arrow_right_alt |
|
SARSA | arrow_right_alt |
Categorize algorithms.
Value iteration
MC
TD
DQN
Model-free
Model-based
Categorize algorithms.
DQN
Q-learning
SARSA
TD
On-policy
Off-policy
MC
TD
DP
SARSA
Bootstrapping
Sampling
What is wrong about DQN? Choose all.
What is wrong about the below explanations about the representation of search problems?
What is true about search problem and algorithms? Choose all.
What is the data structure of frontier by the algorithm?
Tree search
Graph search
DFS
UCS
A*
Best first search
Closed list
Queue
Priority queue
m is finite
(Row) expands the (Column) unexplored node in the frontier. Choose the right
Deepest | Shallowest | Cheapest | |
|---|---|---|---|
BFS | |||
DFS | |||
DLS | |||
UCS | |||
IDS |
When is inappropriate to use the bi-directional search?
What is wrong about DLS? Choose all.
What is true about UCS? Choose all.
What is true about A* search?
What is wrong about relaxed problems?
What is wrong about approaches for obtaining heuristics?
Categorize local search methods.
Hill-climbing
Stochastic hill-climbing
Simulated annealing
Local beam search
Select highest valued neighbor
Select neighbor that produces an improvement
Accept bad moves
Match the explanation of approaches of ordering variable in CSP.
| Stavka koja se može prevući | arrow_right_alt | Odgovarajuća stavka |
|---|---|---|
Degree heuristic | arrow_right_alt | The variable with the fewest remaining legal values in its domain |
Most constrained variable | arrow_right_alt | The variable involved in the largest number of constraints on other unassigned variables. |
Match the explanation of approaches of filtering variable in CSP.
| Stavka koja se može prevući | arrow_right_alt | Odgovarajuća stavka |
|---|---|---|
Arc consistency | arrow_right_alt | Propagation from assigned variable to unassigned variable |
Both | arrow_right_alt | Examines further implications, not only 1-step |
Forward checking | arrow_right_alt | Detects failures earlier |
What can be inferred from below?
Match the explanation of well-known problems of planning.
| Stavka koja se može prevući | arrow_right_alt | Odgovarajuća stavka |
|---|---|---|
Ramification problem | arrow_right_alt | Representing all things that stay the same |
Frame problem | arrow_right_alt | Defining conditions for an action to succeed |
Qualification problem | arrow_right_alt | Representing implicit consequences of actions |
What is true about linear planning?
Consider PDDL. What is included in what?
Actions
Initial state
Goal
Types
Predicates
Objects
Constants
Domain definition
Problem definition
What is wrong about Bayes Rule?
Match the type of causal chain with the definition of joint distribution.
| Stavka koja se može prevući | arrow_right_alt | Odgovarajuća stavka |
|---|---|---|
Common cause | arrow_right_alt | |
Common effect | arrow_right_alt | |
Casual chain | arrow_right_alt |