Reinforcement Learning for Power Grid Control with Stability Guarantees