Material Contracts Corpus
Jump to navigation
Jump to search
A Material Contracts Corpus is a legal document corpus that contains over one million material contracts filed with the SEC from 2000 to 2023.
- AKA: MCC, Material Contracts Dataset, SEC Material Contracts Collection.
- Context:
- It can typically support SEC Contract Analysis Tasks with machine-generated contract metadata.
- It can typically enable Legal Technology Research through contract provision extraction.
- It can often facilitate Contract Compliance Reviews by providing regulatory filing context.
- It can often serve Academic Legal Research with annotated contract examples.
- It can range from being a Raw Material Contracts Corpus to being an Annotated Material Contracts Corpus, depending on its metadata enrichment level.
- It can range from being a Small Material Contracts Corpus to being a Large Material Contracts Corpus, depending on its document count.
- It can range from being a Single-Year Material Contracts Corpus to being a Multi-Decade Material Contracts Corpus, depending on its temporal coverage.
- It can range from being a Industry-Specific Material Contracts Corpus to being a Cross-Industry Material Contracts Corpus, depending on its sector coverage.
- It can integrate with SEC EDGAR Database for source document verification.
- It can utilize Contract Metadata Extraction Systems for automated annotation.
- ...
- Examples:
- SEC-Filed Contract Corpuses, such as:
- Material Contracts Corpus (Stanford Law) with 1,038,766 contracts and ML-generated metadata.
- EDGAR Contract Archive with historical filings from 1996 onward.
- Temporal Instances, such as:
- Material Contracts Corpus (2000-2010) covering the early digital filing era.
- Material Contracts Corpus (2020-2023) with modern filing formats.
- ...
- SEC-Filed Contract Corpuses, such as:
- Counter-Examples:
- LEDGAR Dataset, which focuses on provision-level classification rather than whole contracts.
- ContractNLI Dataset, which contains NDA contracts for natural language inference rather than material contracts.
- Private Contract Repository, which lacks SEC filing requirements and public access.
- ...
- See: SEC EDGAR Database, Legal Document Corpus, Contract Metadata Extraction System, SEC Contract Analysis Task, LEDGAR Dataset, Contract Provision Classification, Legal Technology Research.