scored:

died:

Learning Rate

Discount Factor

Action Randomization
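
The three settings above are not defined in the text. As a rough sketch, assuming they are the usual parameters of a tabular Q-learning agent, they might enter the algorithm as shown below. The names alpha (learning rate), gamma (discount factor), epsilon (action randomization), choose_action, and q_update are hypothetical and chosen for illustration; the original does not say which algorithm or parameter names it uses.

```python
import random


def choose_action(q_row, epsilon, rng=random):
    """Epsilon-greedy selection (assumed meaning of "Action Randomization").

    With probability epsilon a uniformly random action is returned;
    otherwise the action with the highest current Q-value is returned.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return max(range(len(q_row)), key=lambda a: q_row[a])


def q_update(q, state, action, reward, next_state, alpha, gamma, done=False):
    """One tabular Q-learning update (assumed algorithm, not confirmed by the source).

    alpha: learning rate, how far the estimate moves toward the new target.
    gamma: discount factor, how much future reward counts relative to immediate reward.
    done:  True when the episode ended on this step, so no future value is added.
    """
    best_next = 0.0 if done else max(q[next_state])
    target = reward + gamma * best_next
    q[state][action] += alpha * (target - q[state][action])
    return q[state][action]


# Illustrative table: 2 states, 2 actions each.
q_table = [[0.0, 0.0], [1.0, 2.0]]
new_value = q_update(q_table, state=0, action=1, reward=1.0,
                     next_state=1, alpha=0.5, gamma=0.9)
```

In this sketch a higher learning rate makes each update overwrite more of the old estimate, a discount factor closer to 1 weights long-term reward more heavily, and a larger randomization value makes the agent explore more often instead of taking its current best action.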