Difference between revisions of "MovieLens Dataset"

m (Remove links to pages that are actually redirects to this page.)
(Redirected page to MovieLens Benchmark)
(Tag: New redirect)
A [[MovieLens Dataset]] is a [[benchmark dataset|benchmark]] [[item recommendations dataset]] from the [[MovieLens website]].
#REDIRECT [[MovieLens Benchmark ]]
* <B>Context:</B>
** It can (typically) have [[Movie Record]]s, such as <code><BR>1|Toy Story (1995)|01-Jan-1995||http://us.imdb.com/M/title-exact?Toy%20Story%20(1995) |0|0|0|1|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0<BR>2|GoldenEye (1995)|01-Jan-1995||http://us.imdb.com/M/title-exact?GoldenEye%20(1995) |0|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0<BR>3|Four Rooms (1995)|01-Jan-1995||http://us.imdb.com/M/title-exact?Four%20Rooms%20(1995) |0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0<BR>...</code>
** It can be based on [[Movie Rating]]s given on a 1-to-5 scale.
** It can be associated with a [[MovieLens Data Usage License]] [http://files.grouplens.org/datasets/movielens/ml-10m-README.html].
* <B>Example(s):</B>
** [[MovieLens 100K]] [http://grouplens.org/datasets/movielens/100k/], which contains 100,000 ratings from 943 users on 1,682 items (a density of 6.3%).
** [[MovieLens 1M]] [http://grouplens.org/datasets/movielens/1m/], which contains 1,000,209 ratings from 6,040 users on 3,952 items (a density of about 4.2%).
** [[MovieLens 10M]].
** [[MovieLens 20M]] [http://files.grouplens.org/datasets/movielens/ml-20m-README.html], which contains 20,000,263 ratings and 465,564 tag applications across 27,278 movies. These data were created by 138,493 users between 1995-01-09 and 2015-03-31. This dataset was generated on October 17, 2016.
* <B>Counter-Example(s):</B>
** [[Netflix Prize Dataset]].
* <B>See:</B> [[Data-Driven Item Recommendations]].
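
The movie-record layout and the density figure quoted for [[MovieLens 100K]] can be illustrated with a minimal Python sketch. It assumes the pipe-delimited layout shown in the sample records (id, title, release date, an empty video release date, IMDb URL, then 19 binary genre flags). The genre names and their order are an assumption taken from the ML-100K distribution and are not stated in this entry. The function names <code>parse_movie_record</code> and <code>rating_density</code> are hypothetical helpers, and density is taken here to mean ratings divided by (users × items).

```python
# Assumption: genre order as listed in the ML-100K distribution (u.genre).
GENRES = [
    "unknown", "Action", "Adventure", "Animation", "Children's", "Comedy",
    "Crime", "Documentary", "Drama", "Fantasy", "Film-Noir", "Horror",
    "Musical", "Mystery", "Romance", "Sci-Fi", "Thriller", "War", "Western",
]

# First sample record from the entry, written without stray whitespace.
SAMPLE = (
    "1|Toy Story (1995)|01-Jan-1995||"
    "http://us.imdb.com/M/title-exact?Toy%20Story%20(1995)"
    "|0|0|0|1|1|1|0|0|0|0|0|0|0|0|0|0|0|0|0"
)


def parse_movie_record(line):
    """Split one pipe-delimited movie record into a dictionary."""
    fields = [field.strip() for field in line.rstrip("\n").split("|")]
    expected = 5 + len(GENRES)
    if len(fields) != expected:
        raise ValueError(f"expected {expected} fields, got {len(fields)}")
    movie_id, title, release_date, video_release_date, imdb_url = fields[:5]
    flags = fields[5:]
    # Keep the names of the genres whose flag is set to "1".
    genres = [name for name, flag in zip(GENRES, flags) if flag == "1"]
    return {
        "movie_id": int(movie_id),
        "title": title,
        "release_date": release_date,
        "video_release_date": video_release_date,
        "imdb_url": imdb_url,
        "genres": genres,
    }


def rating_density(n_ratings, n_users, n_items):
    """Fraction of the user-by-item matrix that holds a rating."""
    return n_ratings / (n_users * n_items)


if __name__ == "__main__":
    record = parse_movie_record(SAMPLE)
    print(record["title"], record["genres"])
    # MovieLens 100K figures as quoted in the entry.
    print(f"{rating_density(100_000, 943, 1_682):.1%}")
```

Stripping each field tolerates the stray space that can appear after the URL when records are copied from a rendered page.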
== References ==
=== 2017 ===
* http://grouplens.org/datasets/movielens/
** QUOTE: GroupLens Research has collected and made available rating data sets from the [[MovieLens web site]] (http://movielens.org). The data sets were collected over various periods of time, depending on the size of the set. Before using these data sets, please review their README files for the usage licenses and other details.
=== 2016 ===
* ([[Harper & Konstan, 2016]]) ⇒ [[F. Maxwell Harper]], and [[Joseph A. Konstan]]. ([[2016]]). “The MovieLens Datasets: History and Context.” ACM Transactions on Interactive Intelligent Systems (TiiS) 5, no. 4.
** ABSTRACT: The [[MovieLens Dataset|MovieLens dataset]]s are widely used in education, research, and industry. They are downloaded hundreds of thousands of times each year, reflecting their use in popular press programming books, traditional and online courses, and software. These datasets are a product of member activity in the MovieLens movie recommendation system, an active research platform that has hosted many experiments since its launch in 1997. This article documents the history of MovieLens and the MovieLens datasets. We include a discussion of lessons learned from running a long-standing, live research platform from the perspective of a research organization. We document best practices and limitations of using the MovieLens datasets in new research.

Revision as of 18:50, 12 February 2020