Wikimedia Foundation signs paid licensing deals for Wikipedia content used in AI training

The Wikimedia Foundation announced new licensing deals with major AI firms to charge for high-volume access to Wikipedia content used to train AI models, citing rising infrastructure costs from large-scale scraping.
What happened
The Wikimedia Foundation announced licensing deals with Microsoft, Meta, Amazon, Perplexity, and Mistral AI for access to Wikipedia content used to train AI models. The deals expand participation in Wikimedia Enterprise, which sells API access to Wikipedia’s 65 million articles at higher speed and volume than the free public APIs provide. The foundation reported rising infrastructure costs and bot-driven bandwidth growth, and it disclosed an approximately 8 percent year-over-year decline in human traffic after improved bot detection. Separately, Wikipedia paused a pilot of AI-generated article summaries after volunteer editors objected. Jimmy Wales said he welcomes AI training on Wikipedia data but that companies should pay their share of the costs.

Key insights

  1. Wikimedia Enterprise is positioned as a paid alternative to scraping and free APIs: it sells higher-speed, higher-volume API access than the free public APIs, and the foundation is using licensing deals to move major AI developers onto the commercial platform.

  2. The foundation cites bot activity as a major driver of infrastructure strain and of changes in measured traffic: it reported a 50 percent increase in multimedia bandwidth since January 2024 and said bots generated 65 percent of its most expensive infrastructure requests. It also disclosed an 8 percent year-over-year drop in human traffic after improved bot detection.

  3. Wales supports AI training on Wikipedia but rejects uncompensated use at scale: Jimmy Wales said he welcomes AI models training on Wikipedia data but that companies should pay their fair share of the costs they create.

Takeaways

The Wikimedia Foundation is expanding paid licensing for Wikipedia content used in AI training while reporting significant bot-driven infrastructure costs and changes in measured human traffic.

Topics

Technology & Innovation, Artificial Intelligence, Big Tech
