ResearchAWS Machine Learning·April 30, 2026

Reinforcement fine-tuning with LLM-as-a-judge

AWS just introduced reinforcement fine-tuning using LLMs as judges. This approach enhances model training by leveraging feedback from large language models, improving overall performance and adaptability in various tasks.

Read the full article on AWS Machine Learning

More in Research

ResearchMIT Technology Review1d

What Anthropic’s latest AI discovery does—and doesn’t—show

Anthropic just revealed new insights about AI alignment and safety. Their findings could lead to better understanding and control of AI behaviors in future models.

ResearchWired2d

Scientists’ Side Hustle? Using AI and Quantum Computing to Generate New Peptides

Scientists are using AI and quantum computing to generate new peptides. This approach could accelerate drug discovery and lead to more effective treatments.

ResearchMIT Technology Review4d

The Download: Claude’s inner workings and OpenAI’s “super app”

Anthropic is revealing the inner workings of Claude, detailing its architecture and capabilities. This transparency aims to enhance user trust and understanding of how Claude operates in various applications.

ResearchTechCrunch4d

Can AI answer the $3 trillion question?

TechCrunch explores how AI could tackle the $3 trillion question of global economic challenges. By leveraging advanced models, AI aims to provide insights that could reshape economic strategies and decision-making.