CHOWDHURY, Tanay. Reinforcement Learning from Human and AI Feedback for Large Language Model Alignment: A Review. International Journal on Smart & Sustainable Intelligent Computing, [S. l.], v. 3, n. 1, p. 11–24, 2026. DOI: 10.63503/j.ijssic.2026.234. Available at: https://submissions.adroidjournals.com/index.php/ijssic/article/view/234. Accessed: 24 Apr. 2026.