OpenAI News Dataset
An OpenAI News Dataset is a news article dataset by OpenAI, Inc. that contains aggregated news content for NLP tasks.
- AKA: OpenAI News, OpenAI News Articles, OpenAI News Corpus, OpenAI News Data, OpenAI News Collection.
- Context:
- It can typically contain about 5 GB of news articles with headlines, body text, and metadata.
- It can typically include news category labels through topic classification and section tags.
- It can typically provide publication timestamps with article dates and update times.
- It can often support summarization models through article-summary pairs and abstractive examples.
- It can often enable topic modeling through category distributions and keyword extractions.
- It can often facilitate event detection through temporal patterns and entity mentions.
- It can range from being a Small News Sample to being a Large News Archive, depending on its article count.
- It can range from being a Single-Source News Dataset to being a Multi-Source News Dataset, depending on its outlet diversity.
- It can range from being a Local News Dataset to being a Global News Dataset, depending on its geographic scope.
- It can range from being a General News Dataset to being a Specialized News Dataset, depending on its topic focus.
- ...
- Examples:
- Counter-Examples:
- Social Media Feed, which contains user posts rather than professional journalism.
- Blog RSS Feed, which has personal opinions rather than news reporting.
- Press Release Archive, which contains promotional content rather than news articles.
- See: News Article Dataset, Text Corpus, OpenAI Platform Dataset Collection, CNN-Daily Mail Dataset, Text Summarization Task, 20-Newsgroups Corpus, News Aggregation, Natural Language Processing Dataset.
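The Context items above (headline, body text, metadata, category labels, timestamps, outlet diversity) can be illustrated with a minimal Python sketch. The record layout, the field names, and the helper functions below are assumptions made for illustration only; the source does not specify the dataset's actual schema or any loading API.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class NewsArticle:
    """Hypothetical record layout; field names are illustrative assumptions."""
    headline: str
    body: str
    source: str                      # publishing outlet
    category: str                    # topic or section label
    published_at: datetime           # article date
    updated_at: Optional[datetime] = None  # last update time, if any


def category_distribution(articles):
    """Return the share of articles per category (empty dict for no articles)."""
    counts = Counter(a.category for a in articles)
    total = sum(counts.values())
    return {cat: n / total for cat, n in counts.items()} if total else {}


def is_multi_source(articles):
    """True when the articles come from more than one distinct outlet."""
    return len({a.source for a in articles}) > 1


# Made-up sample records, not drawn from any real dataset.
sample = [
    NewsArticle("Chip sales rise", "Body text A.", "Outlet A", "technology",
                datetime(2023, 1, 5)),
    NewsArticle("New phone released", "Body text B.", "Outlet B", "technology",
                datetime(2023, 1, 6), datetime(2023, 1, 7)),
    NewsArticle("Cup final tonight", "Body text C.", "Outlet A", "sports",
                datetime(2023, 1, 6)),
    NewsArticle("Budget vote delayed", "Body text D.", "Outlet C", "politics",
                datetime(2023, 1, 8)),
]

distribution = category_distribution(sample)
multi_source = is_multi_source(sample)
```

Under these assumptions, `category_distribution` gives the kind of category distribution a topic-modeling step might start from, and `is_multi_source` corresponds to the Single-Source versus Multi-Source distinction listed in the Context.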