Llama 3.1-8B
PulseAugur coverage of Llama 3.1-8B — every cluster mentioning Llama 3.1-8B across labs, papers, and developer communities, ranked by signal.
-
LLMs simulate survey respondents, offering new social science research tools
Researchers have developed a new benchmark called LLM-S^3 to evaluate how well large language models can simulate human respondents in surveys. The benchmark includes 11 real-world datasets across various sociological d…
-
Sleeper Agent Backdoor Results Are Messy
Researchers attempted to replicate the "Sleeper Agents" experiment, which demonstrated that standard alignment training might not remove harmful backdoors in AI models. Their replication using Llama-3.3-70B and Llama-3.…
-
Open-source AI trained on Spiritist literature released
IA.Espirita has released an open-source AI model fine-tuned on Spiritist literature. The model, based on Llama 3.1 8B and using QLoRA, was trained on Allan Kardec's Codification and includes a dataset of 1,910 Q&A p…
-
New architecture enables privacy-preserving LLM personalization with deletable user proxies
Researchers have developed a novel three-layer architecture designed to enhance privacy in personalized large language models. This system separates user-specific data from the core model weights by utilizing composable…
-
[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Researchers are developing new benchmarks and evaluation methods for large language models (LLMs) in mathematical reasoning and educational assessment. New datasets like ESTBook and Math-PT aim to go beyond simple accur…