Unterschiede
Hier werden die Unterschiede zwischen zwei Versionen der Seite angezeigt.
Beide Seiten, vorherige Überarbeitung Vorherige Überarbeitung Nächste Überarbeitung | Vorherige Überarbeitung | ||
ef_informatik:machine-learning [2022-02-07 10:51] – [Syllabus] hof | ef_informatik:machine-learning [2022-03-29 09:58] (aktuell) – [Syllabus] hof | ||
---|---|---|---|
Zeile 16: | Zeile 16: | ||
* Hidden Markov Models (a tiny bit of theory) | * Hidden Markov Models (a tiny bit of theory) | ||
* -> what if we cannot observe all the internal state? | * -> what if we cannot observe all the internal state? | ||
- | * -> infer probability of hidden (latent) state from observations | + | * -> infer probability of hidden (latent) state from observations |
- | * Reinforcement Learning | + | * Deep Reinforcement Learning |
* Q-Function is not a table, but a neural network, which is trained by repeatedly running the game and updating the neuron states. | * Q-Function is not a table, but a neural network, which is trained by repeatedly running the game and updating the neuron states. | ||
* builds on top of [[neuronales_netzwerk_handschrift: | * builds on top of [[neuronales_netzwerk_handschrift: | ||
+ | * Prereqs: | ||
+ | * OpenAI Gym environment: | ||
+ | * keras / tensorflow? |