OnBoard Knowledge Journal 2025, 2, 4
14 of 15
competitive environment but also provides a coherent and engaging experience for human observers. Beyond
its technical success, the project stands out for its educational and environmental impact. By integrating a
narrative centered around polar ice melting, the game serves as an engaging tool to raise awareness about
climate change, using interactive media to promote sustainability and informed decision-making. However,
some limitations were observed. Despite high performance, the agent occasionally exhibited suboptimal
behaviors, such as unnecessary movements, delayed reactions, or getting stuck at field edges. These issues
suggest potential improvements in observation encoding, exploration strategies, or the network architecture.
In summary, the project demonstrates that deep reinforcement learning, when properly guided and evaluated,
can produce not only effective and adaptive AI agents but also contribute meaningfully to education and
social awareness through serious games.
Author Contributions: Flavio Arregoces: Conceptualization, Methodology, Software, Visualization, Investigation,
Resources, Writing – original draft. Cristian Gonzalez: Software, Visualization, Validation, Formal analysis, Data
curation, Writing – review & editing. Bella Mejia: Investigation, Resources, Data curation, Writing – original draft,
Writing – review & editing. Jorge Sanchez: Conceptualization, Methodology, Writing – review & editing, Supervision,
Project administration. Yovany Zhu: Validation, Formal analysis, Writing – review & editing, Funding acquisition.
All authors have read and agreed to the published version of the manuscript. Refer to the taxonomía CRediT for term
explanations. Authorship should be limited to those who have contributed substantially to the work reported.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable, since the present study does not involvehuman personnel or
animals.
Informed Consent Statement: This study is limited to the use of technological resources, so nohuman personnel or
animals are involved.
Conflicts of Interest: Under the authorship of this research, it is declared that there is no conflict of interest with the
present research.
References
1. Bansal, T., Pachocki, J., Sidor, S., Sutskever, I., and Mordatch, I. (2018). Emergent complexity via multi-agent
competition. arXiv preprint arXiv:1710.03748.
2. Brown, C. and Green, D. (2021). Serious games for environmental education. In Proceedings of the International
Conference on Game-Based Learning, pages 112–125.
3. Chuck, C., Qi, C., Munje, M. J., Li, S., Rudolph, M., Shi, C., Agarwal, S., Sikchi, H., Peri, A., Dayal, S., Kuo, E., Mehta,
K., Wang, A., Stone, P., Zhang, A., and Niekum, S. (2024). Robot air hockey: A manipulation testbed for robot learning
with reinforcement learning. arXiv preprint arXiv:2405.03113.
4. Heinrich, J. and Silver, D. (2016). Deep reinforcement learning from self-play in imperfect-information games. In
Advances in Neural Information Processing Systems.
5. Lee, J. and Kim, S. (2023). Generalization of ppo in complex game environments. Journal of Artificial Intelligence
Research.
6. Orsula, M. (2024). Learning to play air hockey with model-based deep reinforcement learning. Robotics and Autonomous
Systems.
7. Schott, G. (2024). Game over for climate change? communicating and visualising global warming in digital games.
Games and Culture.
8. Schulman, J., Wolski, F., Dhariwal, P., Radford, A., and Klimov, O. (2017a). Proximal policy optimization algorithms.
arXiv preprint arXiv:1707.06347.
9. Schulman, J., Wolski, F., Dhariwal, P., Radford, A., and Klimov, O. (2017b). Proximal policy optimization algorithms.
arXiv preprint arXiv:1707.06347. Version cited as conference submission.
10. Silver, D., Hubert, T., Schrittwieser, J., Antonoglou, I., Lai, M., Guez, A., Lanctot, M., Sifre, L., Kumaran, D., Graepel, T.,
Lillicrap, T., Simonyan, K., and Hassabis, D. (2017). Mastering chess and shogi by self-play with a general reinforcement
learning algorithm. arXiv preprint arXiv:1712.01815.
11. Smith, J. and Doe, M. (2022). Artificial Intelligence in Modern Video Games. Game AI Press.
12. Taitler, S. and Shimkin, N. (2017). Learning control for air hockey striking using deep reinforcement learning. In
International Conference on Control, Artificial Intelligence, Robotics & Optimization, pages 22–27.