Find a Question:
Yahoo gives dataset for machine learning free
Yahoo has a dataset based on anonymous interactions between users and different feeds from Yahoo released for use in research into artificial intelligence or machine learning.
The total set consists of about 110 billion events and covers a total 13,5TB of data. Yahoo collected the interactions of approximately 20 million users between February 2015 and May the same year. Yahoo calls the dataset Yahoo Newsfeed data set. The set consists of user interactions on the Yahoo homepage, News, Sports, Finance, Movies, and Real Estate.
The set is available as part of Yahoo Labs Webscope data part program. Webscope is a library of anonymous data for scientific research. The anonymous data is categorized by age, gender and geographic data. On the other hand there are the items themselves that title, summary and key phrases from the news articles are included. It is partially visible on what the items are viewed on device.
Yahoo Labs hopes to release the sets that the data is utilized by the machine learning community and data scientists to validate models’ data sets from the real world. ” Labs hopes to set a benchmark can be for large systems.
Answer this Question
You must be Logged In to post an Answer.
Not a member yet? Sign Up Now »
Star Points Scale
Earn points for Asking and Answering Questions!