PulseAugur
commentary · [1 source] ·

LLM spiders strain version control systems, prompting access restrictions

Large language model (LLM) web crawlers are reportedly putting significant strain on version control back ends, particularly those exposed through web-based front ends. One Mastodon user noted that these spiders scrape every commit, which generates massive load. The same user foresees non-commercial repository owners disabling public web access and leaving direct cloning as the way to read their code.

Summary written by gemini-2.5-flash-lite from 1 source.

IMPACT LLM data scraping may necessitate changes in how code repositories are accessed and secured.

RANK_REASON User commentary on the impact of LLM crawlers on version control systems.

Read on Mastodon — fosstodon.org →

COVERAGE [1]

  1. Mastodon — fosstodon.org TIER_1 · [email protected] ·


    @mgorny I've heard before that a WWW front-end to version control is one of the worst. The LLM spiders scrape every commit and the load on the back end is massive. I can see a future where non-commercial people with vc repositories disable all of the mechanism for strangers read…