Yahoo gives dataset for machine learning free




Yahoo has a dataset based on anonymous interactions between users and different feeds from Yahoo released for use in research into artificial intelligence or machine learning.

The total set consists of about 110 billion events and covers a total 13,5TB of data. Yahoo collected the interactions of approximately 20 million users between February 2015 and May the same year. Yahoo calls the dataset Yahoo Newsfeed data set. The set consists of user interactions on the Yahoo homepage, News, Sports, Finance, Movies, and Real Estate.

The set is available as part of Yahoo Labs Webscope data part program. Webscope is a library of anonymous data for scientific research. The anonymous data is categorized by age, gender and geographic data. On the other hand there are the items themselves that title, summary and key phrases from the news articles are included. It is partially visible on what the items are viewed on device.

Yahoo Labs hopes to release the sets that the data is utilized by the machine learning community and data scientists to validate models’ data sets from the real world. ” Labs hopes to set a benchmark can be for large systems.

yahoo labs


In: A Technology & Gadgets Asked By: [21995 Red Star Level]

Answer this Question

You must be Logged In to post an Answer.

Not a member yet? Sign Up Now »