Machine Learning

  • Git & GitHub: proficiency assumed
  • Jupyter Notebook [1w]
    • crash course: Edit, run, publish
    • Login to shared notebook
    • setup your own server
  • Q-Learning Intro [3w]
    • Along the lines of Microsoft ML Chapter 8
    • Assumes we can observe the entire state.
    • Reward / action only depends on the current state of the game.
    • Immediate reward given by game score (points)
    • Q-Function is a table (Q-Table) dynamically updated by Bellman-formula
  • Hidden Markov Models (a tiny bit of theory)
    • → what if we cannot observe all the internal state?
    • → infer probability of hidden (latent) state from observations
  • Reinforcement Learning
    • Q-Function is not a table, but a neural network, which is trained by repeatedly running the game and updating the neuron states.
    • builds on top of sca's NN program
  • ef_informatik/machine-learning.1644231103.txt.gz
  • Zuletzt geändert: 2022-02-07 10:51
  • von hof