News

Switzerland launched an open-source model called Apertus on Monday as an alternative to proprietary models like OpenAI’s ChatGPT or Anthropic’s Claude, SWI reports, as spotted by Engadget. The model’s ...
In the letter, Noyb noted that Meta only recently notified EU users on its platforms that they had until May 27 to opt their public posts out of Meta's AI training data sets.
In China, that resource is now powering an explosive new market—real-world AI training data sets—and investors are beginning to take notice.
A major AI training data set contains millions of examples of personal data. Millions of images of passports, credit cards, birth certificates, and other documents containing personally ...
Here’s a full rundown of what data poisoning means, the risks and how to prevent it in your organization. What Is Data Poisoning? Jennifer Glenn, research director for IDC’s security and trust group, ...
In this TechRepublic interview, researcher Amy Chang details the decomposition method and shares how organizations can ...
By injecting malicious or misleading data into training data sets, adversaries manipulate AI models to produce biased, inaccurate or even harmful results.
Research outfit Epoch AI tried to quantify this problem in a paper earlier this year, measuring the rate of increase in LLM training data sets against the "estimated stock of human-generated ...
A massive volunteer-led effort to collect training data in more languages, from people of more ages and genders, could help make the next generation of voice AI more inclusive and less exploitative.
According to the outlet, subtitles from approximately 53,000 movies and 85,000 TV episodes were found in a large AI-training data set used by Apple, Anthropic, Meta, Nvidia, Salesforce, Bloomberg ...
California passed a law that'll require AI companies to say which data sets they used to train their models. But few are saying whether they'll comply.
When established technologies take up the most space in training data sets, what’s to make LLMs recommend new technologies (even if they’re better)?