Designing a generalised reward for building energy management reinforcement learning agents