Reinforcement Learning
What is the optimal action that you should take in a specific situation? This question is not only raised in our everyday life, it is also relevant to every control system. In this research, I address this question with a machine learning methodology called reinforcement learning. Furthermore, I work on several real world use cases such as:
* Identifying the optimal reorder policy for a set of warehouses to minimize the overall logistics costs under the assumption of stochastic demand and stochastic lead times.
* Finding the elevator group control policy that minimises the travel time of each passenger in Switzerlands highest building.
* Finding the optimal building automation control to minimize energy consumption and maximize the comfort.
If you are interested in diving deeper into this topic, I would recommend you to (1) have a coffee chat with me and (2) go through the following literature/videos: https://www.youtube.com/watch?v=VMp6pq6_QjI and https://www.youtube.com/watch?v=WXuK6gekU1Y provide you with an intuitive understanding of what reinforcement learning “is doing”. https://www.youtube.com/watch?v=93M1l_nrhpQ goes deeper and is also highlighting the underlying mathematics. Furthermore, https://www.youtube.com/watch?v=W_9kcQmaWjo covers the topic multi-agent reinforcement learning, which plays an essential part in this research.
Email: patric.hammler@roche.com
Related Literature:
[1] Supply chain optimization: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3302881
[2] Elevator group control: https://ieeexplore.ieee.org/abstract/document/8998335
[3] Building automation: https://dl.acm.org/doi/abs/10.1145/3360322.3360857
Contact
- Name / Titel
- Patric Hammler
- Funktion
- Research Assistant
- patric.hammler@unibe.ch
- Phone
- +41 31 684 66 90