What if the inputs of the neural network were not just an enemy character in the game, but inputs for how the screen scrolls and zooms, enemies are spawned, enemies react, die, what happens to the player when he touches different things... basicly every single activity that happens during the game.
I was thinking you could train it all with backpropagation, you actually hand feed the neural network the motions of the game as if it was playing, then after it is trained it should be able to run the game for you.
It would be like a little man placing the game for you as you went through it.