Telegram Group & Telegram Channel
维基基金会称 AI 爬虫导致带宽消耗增加五成
#维基百科

维基媒体基金会周二表示,无休止抓取内容的 AI 爬虫给其服务器造成了巨大压力。自 2024 年 1 月以来,下载多媒体内容的带宽增长了 50%。维基媒体基金会托管了维基百科以及维基共享资源,其中维基共享资源提供了 1.44 亿份采用开放许可证的媒体文件,自 2024 年初以来,为获取模型训练数据 AI 公司通过直接抓取、API 和批量下载大幅增加了自动抓取量。非人类流量的指数级增长带来了高昂的技术和财务成本。维基百科的缓存系统是为可预测的人类浏览行为而设计的,但 AI 爬虫会不加区分的抓取内容,会抓取通常人类访问量很少的页面,导致缓存系统失效。维基媒体基金会发现,尽管机器人程序只占总页面浏览量的 35%,但它在其核心基础设施最昂贵的请求中占了 65%。原因是机器人请求的成本远高于人类请求,而且在快速增加。更糟糕的是 AI 爬虫通常不遵守规则,无视 robots.txt,会使用欺骗性的浏览器用户代理伪装成人类访客,甚至会轮流使用家庭 IP 地址以避免被屏蔽。AI 爬虫对网站可靠性团队造成了持续的干扰,团队必须一直阻止爬虫,以免影响人类访客的页面访问速度。
Ars:AI bots strain Wikimedia as bandwidth surges 50%



tg-me.com/SolidotR/1990
Create:
Last Update:

维基基金会称 AI 爬虫导致带宽消耗增加五成
#维基百科

维基媒体基金会周二表示,无休止抓取内容的 AI 爬虫给其服务器造成了巨大压力。自 2024 年 1 月以来,下载多媒体内容的带宽增长了 50%。维基媒体基金会托管了维基百科以及维基共享资源,其中维基共享资源提供了 1.44 亿份采用开放许可证的媒体文件,自 2024 年初以来,为获取模型训练数据 AI 公司通过直接抓取、API 和批量下载大幅增加了自动抓取量。非人类流量的指数级增长带来了高昂的技术和财务成本。维基百科的缓存系统是为可预测的人类浏览行为而设计的,但 AI 爬虫会不加区分的抓取内容,会抓取通常人类访问量很少的页面,导致缓存系统失效。维基媒体基金会发现,尽管机器人程序只占总页面浏览量的 35%,但它在其核心基础设施最昂贵的请求中占了 65%。原因是机器人请求的成本远高于人类请求,而且在快速增加。更糟糕的是 AI 爬虫通常不遵守规则,无视 robots.txt,会使用欺骗性的浏览器用户代理伪装成人类访客,甚至会轮流使用家庭 IP 地址以避免被屏蔽。AI 爬虫对网站可靠性团队造成了持续的干扰,团队必须一直阻止爬虫,以免影响人类访客的页面访问速度。
Ars:AI bots strain Wikimedia as bandwidth surges 50%

BY Solidot 纯净版


Warning: Undefined variable $i in /var/www/tg-me/post.php on line 283

Share with your friend now:
tg-me.com/SolidotR/1990

View MORE
Open in Telegram


telegram Telegram | DID YOU KNOW?

Date: |

The STAR Market, as is implied by the name, is heavily geared toward smaller innovative tech companies, in particular those engaged in strategically important fields, such as biopharmaceuticals, 5G technology, semiconductors, and new energy. The STAR Market currently has 340 listed securities. The STAR Market is seen as important for China’s high-tech and emerging industries, providing a space for smaller companies to raise capital in China. This is especially significant for technology companies that may be viewed with suspicion on overseas stock exchanges.

Export WhatsApp stickers to Telegram on iPhone

You can’t. What you can do, though, is use WhatsApp’s and Telegram’s web platforms to transfer stickers. It’s easy, but might take a while.Open WhatsApp in your browser, find a sticker you like in a chat, and right-click on it to save it as an image. The file won’t be a picture, though—it’s a webpage and will have a .webp extension. Don’t be scared, this is the way. Repeat this step to save as many stickers as you want.Then, open Telegram in your browser and go into your Saved messages chat. Just as you’d share a file with a friend, click the Share file button on the bottom left of the chat window (it looks like a dog-eared paper), and select the .webp files you downloaded. Click Open and you’ll see your stickers in your Saved messages chat. This is now your sticker depository. To use them, forward them as you would a message from one chat to the other: by clicking or long-pressing on the sticker, and then choosing Forward.

telegram from us


Telegram Solidot 纯净版
FROM USA