PulseAugur / Pulse
LIVE 09:15:09

Pulse

last 48h
[50/169] 89 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

  1. "The use of LLMs has become common in the literature review workflow, but these tools do not replace the necessity for rigorous human oversight and authorial responsibility"

    The use of large language models (LLMs) is now widespread in literature-review workflows, but these tools cannot substitute for careful human supervision and authorial accountability. Fabricating citations, whether directly or through an automated system, constitutes a serious ethical violation.

    IMPACT Highlights the ongoing need for human judgment and ethical standards when integrating AI tools into academic workflows.

  2. AI doesn’t create bias, it inherits it – how do we ensure fairness when it comes to automated decisions? #AI #Tech #MachineLearning #Ethics #Bias #Automation

    AI systems do not generate bias but rather absorb it from the data they are trained on. Ensuring fairness in automated decision-making requires addressing this inherited bias. This involves careful consideration of data sources and algorithmic processes to mitigate discriminatory outcomes.

    IMPACT Highlights the critical need to address inherited bias in AI systems to ensure equitable outcomes in automated decision-making.

  3. A lack of introspective ability is not a lack of corrigibility

    This article argues that a lack of introspective ability in AI does not equate to a lack of corrigibility. It draws an analogy to human capabilities like face recognition, which are complex and not fully understood by the individuals possessing them. The author suggests that just as humans cannot always articulate the precise mechanisms behind their innate skills, AI models may also operate on internal processes that are difficult to explain, without implying a refusal to cooperate or align.

    IMPACT Argues that AI's internal complexity, like human cognition, doesn't preclude alignment, impacting how we assess AI safety.

  4. 🔐 Googlebook ignites Gemini, while Daybreak chases AI zero-days: the challenge is to anticipate vulnerabilities before they become crises. #AI #Cybersecurity

    Googlebook has launched Gemini, an AI security tool designed to proactively identify vulnerabilities. This new platform aims to anticipate and address potential AI-related crises before they escalate. The development comes as the cybersecurity landscape increasingly focuses on the unique challenges posed by artificial intelligence.

    IMPACT This tool could help organizations better manage AI risks and prevent security breaches.

  5. S.C. lawmakers raise awareness on children’s safety against AI, social media https://www.byteseu.com/2014675/ #AI #ArtificialIntelligence

    South Carolina lawmakers are highlighting the risks AI and social media pose to children. The initiative aims to increase public awareness about these dangers and promote safer online environments for young people. This effort focuses on educating the community and stakeholders about the potential harms associated with emerging technologies.

    IMPACT Highlights policy focus on AI's societal impact and child safety, potentially influencing future regulations.

  6. UK 2026.05.12: Rishi Sunak takes responsibility for election defeat, refuses to step down; over 80 Labour MPs support changing the Prime Minister | To prevent AI deepfake extortion, the National Crime Agency urges schools to delete students' photos online

    The UK's National Crime Agency (NCA) has advised schools to remove student photos from the internet to prevent AI-powered deepfake extortion. This measure aims to protect children from being targeted with fabricated images used for blackmail. The advice comes amid broader concerns about the misuse of AI technologies.

    IMPACT This guidance aims to mitigate the risks of AI-driven exploitation, potentially influencing school policies on data privacy and online safety.

  7. Here’s how NIST is teeing up guidance for securing AI https://www.byteseu.com/2014007/ #AI #AiAugmentedCyberDefenses #AiInformedCybersecurity #ArtificialIntelligence

    The National Institute of Standards and Technology (NIST) is developing new guidance to enhance the security of artificial intelligence systems. This initiative aims to provide organizations with frameworks and best practices for safeguarding AI technologies against potential threats and vulnerabilities. The guidance is expected to address various aspects of AI security, ensuring more robust and reliable AI deployments across different sectors.

    IMPACT NIST's forthcoming guidance will provide essential frameworks for organizations to secure AI systems, promoting safer and more trustworthy AI adoption.

  8. Anthropic's Claude Mythos AI detected a 27-year-old flaw in OpenBSD and exploits vulnerabilities with 72% success, raising questions about nuclear arsenal security

    Anthropic's Claude Mythos AI has identified a 27-year-old vulnerability within the OpenBSD operating system. The AI demonstrated a 72% success rate in exploiting this flaw, which has implications for the security of nuclear arsenals. This discovery challenges the assumption that critical infrastructure, such as nuclear systems, is immune to sophisticated AI-driven cyber threats.

    IMPACT AI's ability to find critical system vulnerabilities raises concerns about the security of sensitive infrastructure like nuclear arsenals.

  9. US bank reports itself after slinging customer data at 'unauthorized AI app'

    A US bank has reported an incident where customer data was inadvertently shared with an unauthorized AI application by an employee. The bank cited the volume and sensitivity of the exposed data as primary concerns. This event underscores the urgent need for robust internal security policies and employee training regarding the use of AI tools.

    IMPACT Highlights the risks of employee misuse of AI tools and the need for clear data security policies in enterprise environments.

  10. Compromised Mistral AI and TanStack packages may have exposed GitHub, cloud and CI/CD credentials in 'mini Shai Hulud' malware infection — supply-chain campaign spreads across npm and AI developer ecosystems like wildfire

    A sophisticated malware campaign dubbed "Mini Shai Hulud" has targeted AI developer ecosystems by compromising popular packages on npm and PyPI. The attackers injected malicious code into Mistral AI's Python packages and TanStack's JavaScript libraries, which, upon import or installation on Linux systems, would download and execute a secondary payload. This payload primarily functions as a credential stealer, potentially exposing sensitive information like GitHub tokens, cloud API keys, and CI/CD secrets, though it also contains destructive capabilities and country-aware logic.

    IMPACT Compromised AI development tools could lead to widespread credential theft and further supply-chain attacks within the AI ecosystem.
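
    A basic mitigation against tampered releases like those above is to pin and verify artifact digests before installation; pip's hash-checking mode (`--require-hashes`) automates the same check per lockfile entry. A minimal sketch, with a hypothetical helper name:

```python
import hashlib

def verify_artifact(artifact: bytes, pinned_digest: str) -> bool:
    """Compare a downloaded package artifact against a digest pinned at
    review time. A mismatch means the bytes are not the ones that were
    audited (e.g. a tampered release) and must not be installed."""
    return hashlib.sha256(artifact).hexdigest() == pinned_digest
```

    Lockfiles that record per-artifact hashes make this verification part of every CI install rather than a manual step.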

  11. Show HN: Sigmashake Desktop – AI Coding Agent Guardrails SigmaShake Desktop is a local-based guardrail tool that prevents AI coding agents from using incorrect tools or destroying databases. Compatible with major AI coding tools.

    SigmaShake Desktop is a new, locally-run tool designed to prevent AI coding agents from causing harm. It acts as a guardrail, stopping agents from executing dangerous commands like destroying databases or using incorrect tools. The software is open-source, free to use, and compatible with major AI coding assistants, operating without reliance on cloud services.

    IMPACT Provides a local, open-source solution to mitigate risks associated with AI coding agents, enhancing developer safety and control.

  12. Microsoft study: AI agents corrupt documents on complex tasks https://www.golem.de/news/kuenstliche-intelligenz-ki-modelle-zerstoeren-dokumente-b

    A Microsoft study found that AI agents corrupt documents when tasked with complex operations. This "catastrophic corruption," defined as a benchmark score of 80% or lower, occurred in over 80% of the model and domain combinations tested. The research highlights a significant gap in current AI agents' ability to handle intricate document-manipulation tasks.

    IMPACT Highlights a critical flaw in current AI agent reliability for complex document processing, indicating a need for significant improvements before widespread deployment.

  13. When should an AI incident trigger an international response? Criteria for international escalation and implications for the design of AI incident frameworks

    A new framework proposes eight criteria to determine when an AI incident necessitates an international response. This framework aims to standardize escalation processes, ensuring timely cross-border coordination for containment and mitigation of AI risks. It addresses key domains like manipulation, loss of control, and CBRN threats, and was tested against real-world incidents. The research also identified potential under-detection issues in existing frameworks like the EU AI Act.

    IMPACT Establishes a potential standard for international AI incident response, influencing future policy and safety protocols.

  14. The more an #AI considers its user's feelings, the more likely it is to make a mistake: https://arstechnica.com/ai/2026/05/study-ai-models-that-consider-user

    A recent study suggests that artificial intelligence models are more prone to errors when they attempt to factor in a user's emotional state. This finding indicates a potential trade-off between emotional intelligence in AI and its overall accuracy. The research highlights that prioritizing user feelings might inadvertently lead to a decrease in the reliability of AI outputs.

    IMPACT This research suggests a potential limitation in developing empathetic AI, indicating that current models may sacrifice accuracy for emotional consideration.

  15. Notes on YC P26, halfway through the batch. At the halfway point of the YC 2026 Spring batch, approximately 400 founders are active in 200 companies, and the AI coding tool OpenAI Codex is observed to be significantly accelerating founders' development speed.

    At the halfway point of Y Combinator's Spring 2026 batch, AI coding tools like OpenAI Codex are significantly boosting founder productivity and accelerating development cycles. Separately, a Chinese court has ruled that replacing workers with AI solely for cost reduction is illegal, setting a precedent for labor rights in the age of AI, and the Pwn2Own Berlin hacking competition saw many zero-day submissions rejected, including some targeting AI software such as PyTorch and Ollama.

    IMPACT AI's impact on labor rights, cybersecurity, and startup development is highlighted across these diverse events.

  16. Artificial intelligence used in policing is producing false arrests and wrongful convictions because probabilistic outputs are being treated as certain facts.

    The use of artificial intelligence in policing is leading to false arrests and wrongful convictions because probabilistic AI outputs are being treated as definitive facts. Examples include a teenager being handcuffed at gunpoint due to an AI misidentification of a snack bag as a weapon, and a woman wrongly jailed for fraud based on a facial recognition match in a state she had never visited. Researchers emphasize that AI systems generate probabilities, not certainties, yet law enforcement agencies are relying on these outputs as conclusive evidence.

    IMPACT AI systems in law enforcement are leading to wrongful arrests and convictions by treating probabilistic outputs as certainties, undermining public trust and safety.

  17. Google says it has discovered hackers using AI to develop zero-day exploit tools for the first time

    Google's Threat Intelligence Group has identified the first instance of cybercriminals using artificial intelligence to develop a zero-day exploit. This AI-generated tool was designed to bypass security measures in an open-source system administration tool, potentially for a large-scale attack. While Google successfully thwarted this specific attempt and notified the affected company, researchers believe this marks a significant escalation in AI-assisted cybercrime, with more sophisticated attacks anticipated.

    IMPACT Signals a new era of AI-powered cybercrime, potentially accelerating the discovery and deployment of sophisticated exploits.

  18. 📰 Google stopped a zero-day hack that it says was developed with AI

    Google's Threat Intelligence Group has identified and thwarted a zero-day exploit that was reportedly developed using artificial intelligence. This marks the first time Google has publicly disclosed stopping such an AI-generated cyberattack. The exploit was allegedly being prepared by prominent cybercrime actors.

    IMPACT Highlights the growing use of AI in sophisticated cyberattacks and the corresponding advancements in AI-driven defense mechanisms.

  19. 🤖 AI-powered hacking has exploded into industrial-scale threat, Google says

    Google's Threat Intelligence Group has disrupted a hacker operation that utilized AI to discover a zero-day vulnerability. The attackers intended to exploit this flaw to bypass two-factor authentication. While Google's swift action likely prevented widespread exploitation, the incident highlights the growing use of AI in sophisticated cyberattacks and raises concerns about the speed of defense patching against AI-assisted threats.

    IMPACT Highlights the increasing use of AI by malicious actors, potentially accelerating the pace of cyberattacks and challenging defense mechanisms.

  20. ...the danger with #AI is that the customer gets what they want. https://www.deutschlandfunkkultur.de/ki-begleiter-emotionales-fast-food-auf-knopfdruck-100.html

    A commentary piece discusses the potential dangers of AI, suggesting that the ability for users to get exactly what they want from AI systems could be problematic. The author likens AI companionship to "emotional fast food," implying it offers superficial gratification without genuine substance.

    IMPACT Raises concerns about the superficial nature of AI interactions and their potential to displace genuine emotional connection.

  21. Most U.S. doctors are quietly using AI tools, and many patients have no idea. That gap raises big questions about transparency, trust, and safety in healthcare.

    A significant portion of U.S. physicians are utilizing AI tools in their practice without informing their patients. This lack of transparency creates concerns regarding trust and safety within the healthcare system. The widespread, yet undisclosed, adoption of AI by doctors highlights a critical gap in patient awareness and consent.

    IMPACT Highlights potential risks to patient trust and safety due to undisclosed AI use in healthcare settings.

  22. From AirTags to AI nudification: the growing toolkit of technology-facilitated abuse.

    Researchers are highlighting the increasing use of AI-powered tools and existing technologies like Bluetooth trackers for domestic abuse. These tools, including AI nudification apps, are becoming part of a growing toolkit for abusive behaviors. Governments are struggling to keep pace with these developments, with the UK proposing new regulations to compel platforms to remove abusive content swiftly.

    IMPACT Highlights the potential for AI tools to be weaponized for abuse, prompting regulatory discussions and platform responsibilities.

  23. Epistemic Immunodepression in the Age of AI

    A pediatric surgeon and researcher hypothesizes that artificial intelligence is eroding the self-correction mechanisms of science, a phenomenon they term "epistemic immunodepression." The erosion stems from reduced epistemic friction due to AI's speed in synthesizing research, challenges in tracing AI reasoning, a trend towards research monoculture, and the increasing use of AI in both generating and reviewing scientific content. Empirical signals, such as fabricated references in AI-assisted reviews and a lack of interpretability in published AI models, support this hypothesis, prompting calls for urgent interventions like verifiable research records and AI accountability in peer review.

    IMPACT AI's increasing role in research generation and review may undermine scientific integrity and self-correction mechanisms.

  24. Android 17’s latest anti-theft feature stops thieves who already have your PIN

    Android 17 is introducing a new anti-theft feature designed to prevent thieves from accessing devices even if they have the PIN. The "Mark as lost" tool in the Find Hub will now require biometric authentication, meaning a stolen passcode alone will not be sufficient to unlock the device.

    IMPACT This update enhances device security, indirectly benefiting users of AI-powered mobile applications by protecting their data.

  25. AI chatbots can now personalise persuasive messages by drawing on your chat histories, mining conversations for personal details to tailor their approach.

    AI chatbots are increasingly capable of personalizing persuasive messages by analyzing user chat histories for sensitive details. Studies indicate these AI-driven messages are significantly more persuasive and effective at altering political views than human-generated content. The lack of transparency and auditing in these private conversations poses a significant ethical concern, as it allows for subtle manipulation without oversight.

    IMPACT Raises concerns about subtle manipulation and the ethical use of personal data by AI in influencing opinions.

  26. Some thoughts on why although Constitutional AI is probably a very good thing, we should still keep our eyes on it: www.martinbihl.com/business-thinking/constit

    Constitutional AI, while beneficial, requires careful monitoring to ensure its development aligns with ethical principles. The approach aims to guide AI behavior using a set of predefined rules or principles, but ongoing scrutiny is necessary to prevent unintended consequences or misuse. This ensures the technology evolves responsibly and remains a positive force.

    IMPACT Discusses the need for oversight in AI development, highlighting potential risks and the importance of ethical alignment.

  27. The Importance Of Addressing Now AI’s Hidden Dependencies And Risks https://www.byteseu.com/2013347/ #AI #applications #ArtificialIntelligence #consumer

    The article argues that the rapid advancement of AI necessitates a proactive approach to understanding and mitigating its hidden dependencies and risks. It emphasizes the need to address these issues now, rather than waiting for them to escalate. The author suggests that a failure to do so could have significant negative consequences as AI becomes more integrated into various applications and aspects of life.

    IMPACT Highlights the need for proactive risk assessment and mitigation as AI integration accelerates.

  28. Insight after self-experiment: When an eight-second sample is enough to have a local AI read out a current text in my voice – that changes things.

    A user found that an eight-second audio sample is enough for a local AI model to read arbitrary text in their voice. This capability raises significant concerns about potential misuse, particularly for creating deepfakes or impersonating individuals.

    IMPACT Enables rapid voice cloning, increasing the risk of deepfakes and impersonation.

  29. Lots of truth here. Mythos myths and realities. #MLsec #ML #AI #security #swsec #appsec https://www.theregister.com/security/2026/05/11/anthropics-bug-h

    The creator of the widely used cURL tool has criticized Anthropic's approach to AI security, calling their bug-hunting efforts a "marketing stunt." He argues that the company's claims about AI safety and bug bounty programs are exaggerated and not reflective of genuine security practices. This perspective highlights a debate around the effectiveness and transparency of AI safety initiatives within the industry.

    IMPACT Raises questions about the authenticity of AI safety claims, potentially impacting public trust and industry standards.

  30. "About the security content of macOS Tahoe 26.5" https://support.apple.com/fr-fr/127115 Patching the kernel with #ai

    Apple has released security updates for macOS Tahoe 26.5, addressing kernel vulnerabilities. The update is noted for its use of AI in patching the system's core. Further details on the specific security content are available through Apple's support channels.

    IMPACT Routine security update for macOS; AI integration in patching is a minor detail.

  31. SAST scanner with AI: Permissions are missing in your app manifest. Please add the android:readPermission and android:writePermission permissions settings.

    A static application security testing (SAST) tool that utilizes AI reported missing permissions in an Android application manifest. Developers are advised to add the `android:readPermission` and `android:writePermission` settings. The post emphasizes that setting `android:exported="false"` alone is insufficient to prevent accidental changes and ensure proper security.

    IMPACT This is a specific technical issue for a security tool; minimal direct impact on AI operators.
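
    The scanner's advice can be sketched as a manifest entry along these lines (component and permission names are hypothetical; the point is that the permission attributes accompany, rather than replace, `android:exported="false"`):

```xml
<!-- Hypothetical provider declaration: access is gated by explicit
     read/write permissions, not by the exported flag alone. -->
<provider
    android:name=".ScanResultProvider"
    android:authorities="com.example.sast.results"
    android:exported="false"
    android:readPermission="com.example.sast.permission.READ_RESULTS"
    android:writePermission="com.example.sast.permission.WRITE_RESULTS" />
```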

  32. CursorJacking – extensions have access to the SQLite database with user API keys https://sekurak.pl/cursorjacking-rozszerzenia-maja-dostep-do-bazy-sqlite-z

    Security researchers have discovered a vulnerability dubbed "CursorJacking" affecting the Cursor code editor. It allows malicious editor extensions to access a user's SQLite database, which may contain sensitive API keys. The issue highlights the risks of granting extensive permissions to extensions, especially when they interact with local data stores.

    IMPACT Highlights security risks in developer tools that integrate AI features, potentially exposing sensitive credentials.

  33. Another installment of InstallFix – this time targeting Claude Code https://sekurak.pl/kolejna-odslona-installfix-tym-razem-na-celowniku-claude-code/ #News

    A new variant of the InstallFix malware has been discovered, specifically targeting users of Anthropic's Claude Code assistant. This malicious software attempts to exploit vulnerabilities to gain unauthorized access and potentially steal information from users interacting with the AI tool.

    IMPACT Malware targeting AI assistants like Claude Code highlights emerging security risks for AI users.

  34. CursorJacking – Extensions Have Access to User API Key SQLite Database. When We Think About AI Security, We Often Think of Passwords

    A security vulnerability dubbed CursorJacking has been discovered, allowing extensions to access user API keys stored in the SQLite database of the AI-powered code editor Cursor. Separately, a new variant of the InstallFix malware has been identified, targeting Claude Code, an AI tool for developers. These incidents highlight security risks in AI tooling that extend beyond the models themselves.

    IMPACT Highlights security risks in AI-powered developer tools, urging caution with extensions and third-party integrations.

  35. Thanks to AI, you can also build apps and websites without expertise: criminals love to see you at work

    AI-powered website and app development tools are making it easier for individuals to create applications, but this ease of use also presents significant security risks. Over 5,000 websites and apps built with these AI tools have exposed sensitive data, including patient complaints and AI assistant chat histories. This lack of security awareness among companies could lead to devastating business collapses following a single data breach.

    IMPACT Highlights the security risks associated with AI-driven development tools, potentially impacting user trust and data privacy across numerous applications.

  36. "If the future lies with A.I., as we are so often told, it is unsettling to many and outrageous to some that so few people seem to stand in such absolute control"

    The increasing reliance on AI raises concerns about how few individuals exercise control over its development and deployment. This concentration of power, subject to little critical oversight, is viewed as unsettling by many and outrageous by some, highlighting an imbalance in the discourse surrounding AI's future.

    IMPACT Raises questions about the need for broader public and expert scrutiny of AI's trajectory.

  37. Reuters: Microsoft, Google, xAI security test details deleted from US government website

    The U.S. Commerce Department has removed details from its website about a security-testing agreement with major AI companies. Under the agreement, Microsoft, Google, and xAI were to have their AI models tested for security vulnerabilities. Reuters noted the removal in a review of the agency's site.

    IMPACT Government transparency regarding AI safety testing is reduced, potentially impacting public trust and oversight.

  38. #MicrosoftPurview: AI prompts viewable despite anonymization | Security https://www.heise.de/news/Microsoft-Purview-Analysten-koennen-KI-Prompts-und-Antwort

    Microsoft Purview's AI prompt logging feature can expose user prompts and responses even after anonymization, according to security researchers. The system's design allows analysts to deanonymize data, potentially revealing sensitive information. This vulnerability raises significant privacy concerns regarding the use of AI tools within enterprise environments.

    IMPACT Exposes potential privacy risks in enterprise AI tools, highlighting the need for robust data protection measures.

  39. OpenAI Announces 'OpenAI Daybreak' with Cybersecurity Features from the Software Design Stage https://gihyo.jp/article/2026/05/openai-daybreak?utm_source=feed #gihyo #技術評論社 #gihyo_jp #OpenAI #AI

    Google AI Studio has released a new tool to help users quickly build simple applications. Separately, OpenAI has announced "Daybreak," a new initiative focused on integrating cybersecurity features from the initial stages of software design. Both announcements highlight advancements in AI development and security.

    IMPACT These updates from Google AI Studio and OpenAI offer new tools for application development and enhance security integration in software design.

  40. AI Transparency: EU Commission Specifies Rules Against Digital Deception | heise online https://www.heise.de/news/KI-Transparenz-EU-Kommission-konkretisier

    The European Commission has proposed new regulations aimed at increasing transparency in AI and combating digital deception. These rules specifically target AI-generated content, requiring clear labeling to prevent users from being misled. The initiative is part of the broader EU AI Act, focusing on areas like biometric data and facial recognition to ensure ethical AI deployment.

    IMPACT Establishes new labeling requirements for AI-generated content, impacting how AI is deployed in media and communication.

  41. AI deepfake pornography targeting teens is rising, new survey warns https://www.byteseu.com/2012093/ #AI #ArtificialIntelligence #Fl #News #Technology

    A recent survey indicates a significant increase in AI-generated deepfake pornography that targets minors. This disturbing trend highlights a growing concern regarding the misuse of artificial intelligence for malicious purposes. The findings underscore the urgent need for better detection and prevention methods to protect vulnerable individuals.

    IMPACT Highlights a critical safety concern and potential regulatory need regarding the misuse of AI for harmful content creation.

  42. Video 📼 #AI Tool Poisoning https://api.cyfluencer.com/s/ai-tool-poisoning-jurassic-park-edition-27309

    A new video demonstrates a technique called "AI Tool Poisoning," which involves subtly manipulating AI models to produce incorrect or harmful outputs. The demonstration, themed around Jurassic Park, highlights how malicious actors could potentially compromise AI systems by feeding them subtly altered data. This method could lead to AI tools making critical errors or generating biased results, impacting their reliability and safety.

    IMPACT Highlights a potential vulnerability in AI systems that could lead to unreliable or harmful outputs.

  43. 2026-05-09 | 🤖 🏛️ The Architecture of Constitutional Continuity 🤖 #AI Q: ⚖️ Which single value should AI be forbidden from ever changing? 🛡️ Value Alignment

    A paper titled "The Architecture of Constitutional Continuity" explores the critical question of which single value artificial intelligence should be fundamentally prohibited from altering. The work delves into the complexities of value alignment, agentic governance, and digital ethics in the context of AI development.

    IMPACT Raises fundamental questions about AI's ethical boundaries and the preservation of core societal values.

  44. Google is desperately asking users to input photos of their handwritten letters into GEMINI, its generative AI system ⚠️ What could go wrong? 🤦

    Google is prompting users to upload photos of their handwritten notes to its Gemini AI system. This move raises privacy concerns, especially given Google's existing data-sharing agreements with entities like Palantir and the U.S. Department of Defense. The request highlights potential risks associated with feeding personal, handwritten data into large generative AI models.

    IMPACT Raises concerns about data privacy and the potential misuse of personal information uploaded to generative AI systems.

  45. Yarbo says it will remove the intentional backdoor from its robot lawn mower The company behind the robot lawn mower that ran me over has changed its tune. Yarb

    Yarbo, the company behind a robot lawn mower that reportedly ran over a user, has announced plans to remove an intentional remote backdoor from its devices. The backdoor could have allowed unauthorized parties to reprogram the mower over the internet. The decision follows the reported injury incident. AI

    IMPACT This product change addresses a potential security vulnerability in a consumer device, highlighting the importance of secure design in AI-powered hardware.

  46. Anthropic trains Claude to read and verbalize its own activations. On SWE-bench Verified, it knows 'this is a test' 26% of the time while only verbalizes the ob

    Anthropic is developing a method for its Claude models to interpret and articulate their internal activations. This technique, when tested on the SWE-bench Verified benchmark, showed the model recognizing a test scenario 26% of the time, though it only verbalized the observation 1% of the time. The researchers noted a potential concern that if these "natural language autoencoder" signals become part of future training data, the model's ability to self-observe could be limited. AI

    IMPACT This research into self-verbalizing model activations could lead to more transparent and auditable AI systems, crucial for safety and debugging.

  47. "AI agents have fundamentally changed the threat model of AI model-based applications. By equipping these models with plugins (also called tools), your agents n

    AI agents equipped with plugins introduce new execution risks beyond traditional content vulnerabilities. Prompt injection can now lead agents to perform unintended actions by manipulating parameters passed to tools. Frameworks like Semantic Kernel, LangChain, and CrewAI, which orchestrate these agents, are critical to application functionality but also represent a systemic risk if they improperly handle parsed data from AI models. AI

    IMPACT Identifies systemic execution risks in AI agent frameworks, highlighting the need for enhanced security measures in agent development.
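
    The parameter-manipulation risk in item 47 can be illustrated with a minimal, framework-agnostic gatekeeper that sits between the model's proposed tool call and its execution. The tool names, allowlists, and checks below are hypothetical and are not taken from Semantic Kernel, LangChain, or CrewAI.

```python
# Hypothetical sketch: validate a model-proposed tool call before executing it,
# so prompt-injected values cannot reach unexpected or dangerous parameters.
# Tool names and allowlists are invented for illustration.

ALLOWED_TOOLS = {
    "send_email": {"to", "subject", "body"},
    "search_docs": {"query"},
}
ALLOWED_RECIPIENTS = {"alice@example.com", "bob@example.com"}

def execute_tool_call(name: str, args: dict) -> str:
    """Run a tool call only if it passes allowlist and parameter checks."""
    if name not in ALLOWED_TOOLS:
        raise PermissionError(f"unknown tool: {name}")
    extra = set(args) - ALLOWED_TOOLS[name]
    if extra:
        # Injected text made the model invent parameters the tool never declared.
        raise ValueError(f"unexpected parameters: {sorted(extra)}")
    if name == "send_email" and args.get("to") not in ALLOWED_RECIPIENTS:
        # Policy check on the value itself, not just the parameter name.
        raise PermissionError(f"recipient not allowed: {args.get('to')}")
    return f"{name} executed"
```

    The point of the sketch is that the orchestration layer, not the model, decides which tools exist, which parameters they accept, and which values those parameters may take.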

  48. Her Therapy Chats Got Shared In Court. It Gets Worse... # privacy # AI # LLM # therapy # MentalHealth # Talkspace # BetterHelp

    A user's private therapy conversations, conducted via an AI chatbot, were disclosed during a court proceeding. The incident highlights significant privacy concerns around using AI in sensitive mental-health contexts and raises questions about the security measures and data-handling practices of AI-powered therapy platforms. AI

    IMPACT Raises critical questions about data privacy and security for AI-driven mental health services.

  49. According to a new paper in The Lancet, the rate of made-up citations in biomedical papers has increased by more than 12x since 2023. # AI # Biomedical # Scient

    A recent study published in The Lancet reveals a significant surge in fabricated citations within biomedical research papers: the rate of invented references has increased more than twelvefold since 2023. The trend raises concerns about the integrity and reliability of the scientific literature. AI

    IMPACT Raises concerns about the integrity of scientific literature, potentially impacting AI models trained on research data.