Differences

This page shows the differences between two versions of the page.

--- ef_informatik:machine-learning [2022-02-07 10:51] – [Syllabus] hof
+++ ef_informatik:machine-learning [2022-03-29 09:58] (current) – [Syllabus] hof
@@ Line 16: / Line 16: @@
   * Hidden Markov Models (a tiny bit of theory)
     * -> what if we cannot observe all the internal state?
     * -> infer probability of hidden (latent) state from observations
-  * Reinforcement Learning
+  * Deep Reinforcement Learning
     * The Q-function is not a table but a neural network, which is trained by repeatedly running the game and updating the network weights.
     * builds on top of [[neuronales_netzwerk_handschrift:|sca's NN program]]
+    * Prereqs:
+      * OpenAI Gym environment: https://gym.openai.com/docs/
+      * Keras / TensorFlow?
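
To make the Deep Reinforcement Learning item concrete, here is a minimal sketch of a Q-function represented by a small neural network instead of a table. Everything in it is an assumption made for illustration: the "game" is a hypothetical five-cell corridor (`step`), the network has one tanh hidden layer written in plain Python, and the names `init_params`, `q_values`, `td_update` and `train` are invented here. It does not use OpenAI Gym or Keras, and it leaves out refinements such as replay buffers and target networks that practical deep Q-learning usually adds.

```python
import math
import random

N_STATES, N_ACTIONS, N_HIDDEN = 5, 2, 8  # corridor cells, {left, right}, hidden units
GAMMA, LR, EPSILON = 0.9, 0.1, 0.2       # discount, learning rate, exploration rate


def step(state, action):
    """Hypothetical toy game: action 0 moves left, 1 moves right; the last cell pays 1."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def init_params(rng):
    """Small random weights for a network with one hidden layer."""
    w1 = [[rng.uniform(-0.1, 0.1) for _ in range(N_HIDDEN)] for _ in range(N_STATES)]
    b1 = [0.0] * N_HIDDEN
    w2 = [[rng.uniform(-0.1, 0.1) for _ in range(N_ACTIONS)] for _ in range(N_HIDDEN)]
    b2 = [0.0] * N_ACTIONS
    return w1, b1, w2, b2


def q_values(params, state):
    """Forward pass: one-hot state -> tanh hidden layer -> one Q-value per action."""
    w1, b1, w2, b2 = params
    hidden = [math.tanh(w1[state][j] + b1[j]) for j in range(N_HIDDEN)]
    q = [b2[a] + sum(hidden[j] * w2[j][a] for j in range(N_HIDDEN))
         for a in range(N_ACTIONS)]
    return q, hidden


def td_update(params, state, action, reward, nxt, done):
    """One semi-gradient Q-learning step; updates the weights in place, returns the TD error."""
    w1, b1, w2, b2 = params
    q, hidden = q_values(params, state)
    target = reward if done else reward + GAMMA * max(q_values(params, nxt)[0])
    error = target - q[action]
    for j in range(N_HIDDEN):
        # Backpropagate through tanh using the weight value from before this update.
        grad_hidden = w2[j][action] * (1.0 - hidden[j] ** 2)
        w2[j][action] += LR * error * hidden[j]
        w1[state][j] += LR * error * grad_hidden
        b1[j] += LR * error * grad_hidden
    b2[action] += LR * error
    return error


def train(episodes=300, seed=0):
    """Repeatedly play the game with an epsilon-greedy policy, updating after every move."""
    rng = random.Random(seed)
    params = init_params(rng)
    for _ in range(episodes):
        state = 0
        for _ in range(50):  # cap the episode length
            q, _ = q_values(params, state)
            if rng.random() < EPSILON:
                action = rng.randrange(N_ACTIONS)
            else:
                action = q.index(max(q))
            nxt, reward, done = step(state, action)
            td_update(params, state, action, reward, nxt, done)
            state = nxt
            if done:
                break
    return params


if __name__ == "__main__":
    params = train()
    for s in range(N_STATES - 1):
        print(s, [round(v, 3) for v in q_values(params, s)[0]])
```

The quantities that change during training are the weights and biases; the hidden activations are recomputed on every forward pass. How quickly the greedy policy settles depends on the seed and the hyperparameters, so the printed values should be read as an experiment rather than a reference result.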