Reinforcement Learning for Channel Coding: Learned Bit-Flipping Decoding