B, an open-weight multimodal vision AI model designed to deliver strong math, science, document and UI reasoning with far ...
Berlin Coyotiv and OpenServ Labs published a research paper introducing BRAID (Bounded Reasoning for Autonomous ...
DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and ...
The company mainly trained Phi-4-reasoning-vision-15B on open-source data. The data included images and text-based descriptions of the objects depicted in those images. Before it started training the ...
Scientists warn that current AI tests reward polite responses rather than real moral reasoning in large language models.
The Naglieri Nonverbal Ability Test (NNAT) is a nonverbal assessment designed to measure general reasoning ability in K-12 students, helping schools identify students with strong problem-solving ...
Microsoft releases Phi-4 Reasoning Vision 15B, a multimodal AI model that activates its own thinking mode and handles ...
OpenAI launches GPT-5.4 across ChatGPT, API, and Codex with stronger reasoning, coding, and computer use capabilities.
Metilience unveils a hybrid AI reasoning engine for high-stakes exams, leveraging structured cognitive error analysis ...
OpenAI has launched its new ChatGPT 5.4 with Extreme Reasoning mode for long-duration task focus. As well as a 1M-token context window ...
OpenAI’s next GPT model is coming—and soon, according to a person with knowledge of it.Among the highlights, the new model, ...