510 commercial contracts with 13,101 expert-labeled clauses across 41 legal categories.
Browse commercial Legal → Visit original source ↗CUAD is The Atticus Project's legal contract review benchmark. 510 commercial contracts — M&A, licensing, supply, consulting, etc. — manually labeled by law students supervised by attorneys. 13,101 clause-level annotations across 41 legal categories (e.g., Governing Law, Change of Control, Non-Compete, IP Assignment). Widely used for training legal NLP models.
LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →
Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.
Common tasks and benchmarks where CUAD — Contract Understanding Atticus Dataset is the default or competitive choice.
What's actually in the dataset — from the maintainer's published stats.
CUAD — Contract Understanding Atticus Dataset is distributed under CC BY 4.0. This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.
LabelSets sellers offer paid legal datasets with what public datasets often can't give you:
Other entries in the Legal catalog.