
Massive AI Dataset Breach: DataComp CommonPool Reveals Widespread Personal Data Exposure
Researchers have uncovered a troubling amount of personal information lurking in one of the largest open-source datasets used to train AI models. The dataset, known as DataComp CommonPool, pulls together






