Datasets for Data Mining, Data Science, and Machine Learning
HitCompanies Datasets, comprehensive data on random 10,000 UK companies sampled from HitCompanies, updated automatically using AI/Machine Learning. ICWSM-2009 dataset contains 44 million blog posts made between August 1st and October 1st, 2008. Infochimps, an open catalog and marketplace for data.