Qwen2.5-7B-Instruct
PulseAugur coverage of Qwen2.5-7B-Instruct — every cluster mentioning Qwen2.5-7B-Instruct across labs, papers, and developer communities, ranked by signal.
-
MICA framework enhances LLM emotional support dialogues with novel RL approach
Researchers have introduced MICA, a novel reinforcement learning framework designed to improve the performance of large language models in multi-turn emotional support dialogues. This critic-free approach addresses chal…
-
AI doctor agent uses reinforcement learning for proactive medical consultations
Researchers have developed DoctorAgent-RL, a novel multi-agent reinforcement learning framework designed to improve AI's capabilities in real-world clinical consultations. This system trains a doctor agent, utilizing th…
-
DPN-LE method precisely edits LLM personalities with minimal neuron intervention
Researchers have developed DPN-LE, a novel method for editing the "personality" of large language models by targeting specific neurons. Existing techniques often degrade overall model performance by modifying too many n…