22
submitted 10 hours ago by Bee@mander.xyz to c/science@mander.xyz

Test-time inference has emerged as a powerful paradigm for enabling language models to ``think'' longer and more carefully about complex challenges, much like skilled human experts. While reinforcement learning (RL) can drive self-improvement in language models on verifiable tasks, some models exhibit substantial gains while others quickly plateau. For instance, we find that Qwen-2.5-3B far exceeds Llama-3.2-3B under identical RL training for the game of Countdown. This discrepancy raises a critical question: what intrinsic properties enable effective self-improvement? We introduce a framework to investigate this question by analyzing four key cognitive behaviors -- verification, backtracking, subgoal setting, and backward chaining -- that both expert human problem solvers and successful language models employ. Our study reveals that Qwen naturally exhibits these reasoning behaviors, whereas Llama initially lacks them. In systematic experimentation with controlled behavioral datasets, we find that priming Llama with examples containing these reasoning behaviors enables substantial improvements during RL, matching or exceeding Qwen's performance. Importantly, the presence of reasoning behaviors, rather than correctness of answers, proves to be the critical factor -- models primed with incorrect solutions containing proper reasoning patterns achieve comparable performance to those trained on correct solutions. Finally, leveraging continued pretraining with OpenWebMath data, filtered to amplify reasoning behaviors, enables the Llama model to match Qwen's self-improvement trajectory. Our findings establish a fundamental relationship between initial reasoning behaviors and the capacity for improvement, explaining why some language models effectively utilize additional computation while others plateau.

144
submitted 12 hours ago by Bee@mander.xyz to c/news@lemmy.world

Scientists turned out at rallies from coast to coast and in Europe to “Stand Up for Science,” revitalizing a movement to defend scientific integrity that started during Trump’s first term.

8
submitted 1 day ago by Bee@mander.xyz to c/biology@mander.xyz
14
Does Ambition Breed Dishonesty? (greatergood.berkeley.edu)
submitted 1 day ago by Bee@mander.xyz to c/science@mander.xyz

Ambition is a predictor of success. But according to a new study, the motives behind it can also lead to lying and cheating.

8
submitted 1 day ago by Bee@mander.xyz to c/science@mander.xyz

Lifestyle improvements like adopting a healthy diet or quitting smoking can slow biological aging processes.

10
submitted 2 days ago* (last edited 2 days ago) by Bee@mander.xyz to c/health@lemmy.world

A landmark study published in PNAS has unveiled that brain aging follows a distinct yet nonlinear trajectory with critical transition points. The research, conducted by an international team of scientists led by Lilianne R. Mujica-Parodi, PhD, of Stony Brook University, offers new insights into when interventions to prevent cognitive decline might be most effective.

7
submitted 2 days ago* (last edited 2 days ago) by Bee@mander.xyz to c/biology@mander.xyz

The brain plays a vital role in controlling reproductive functions. It helps to maintain a delicate balance of hormones, all of which can be affected by the metabolism. Investigating the impact of the metabolism on reproductive development and function is critical to a better understanding of health and diseases. Professor Carol Fuzeti Elias and Dr Cristina Sáenz de Miera Patín from the University of Michigan in the USA, carry out groundbreaking research in neuroscience, exploring the molecular and neural mechanisms at play.

3
Mapping infant brains. (elifesciences.org)
submitted 2 days ago by Bee@mander.xyz to c/neuroscience@mander.xyz

Movie watching may provide scientists a window into infant brain activity and organization.

7
submitted 2 days ago by Bee@mander.xyz to c/neuroscience@mander.xyz

Cross-sectional age-skill profiles suggest that cognitive skills start declining by age 30 if not earlier. If accurate, such age-driven skill losses pose a major threat to the human capital of societies with rapidly aging populations. We estimate actual age-skill profiles from individual changes in literacy and numeracy skills at different ages. We use the unique German longitudinal component of the Programme of the International Assessment of Adult Competencies (PIAAC-L) that retested a large representative sample of adults after 3.5 years. Our empirical approach separates age from cohort effects and corrects for measurement error from reversion to the mean. Two main results emerge. First, average skills increase strongly into the forties before decreasing slightly in literacy and more strongly in numeracy. Second, skills decline at older ages only for those with below-average skill usage. White-collar and higher-educated workers with above-average usage show increasing skills even beyond their forties. Women have larger skill losses at older age, particularly in numeracy.

12
submitted 2 days ago by Bee@mander.xyz to c/science@mander.xyz

Large language models (LLMs) have shown remarkable advancements in chemistry and biomedical research, acting as versatile foundation models for various tasks. We introduce AMP-Designer, an LLM-based approach, for swiftly designing antimicrobial peptides (AMPs) with desired properties. Within 11 days, AMP-Designer achieved the de novo design of 18 AMPs with broad-spectrum activity against Gram-negative bacteria. In vitro validation revealed a 94.4% success rate, with two candidates demonstrating exceptional antibacterial efficacy, minimal hemotoxicity, stability in human plasma, and low potential to induce resistance, as evidenced by significant bacterial load reduction in murine lung infection experiments. The entire process, from design to validation, concluded in 48 days. AMP-Designer excels in creating AMPs targeting specific strains despite limited data availability, with a top candidate displaying a minimum inhibitory concentration of 2.0 micrograms per milliliter against Propionibacterium acnes. Integrating advanced machine learning techniques, AMP-Designer demonstrates remarkable efficiency, paving the way for innovative solutions to antibiotic resistance.

42
submitted 3 days ago by Bee@mander.xyz to c/medicine@mander.xyz
136
submitted 3 days ago by Bee@mander.xyz to c/science@mander.xyz
view more: next ›

Bee

joined 1 week ago