About FactChecks and Data Commons.org

What is Data Commons?

What is ClaimReview?

What data is being released?

Which publishers’ fact checks are being released?

How often will the data be refreshed?

What can I do with the data?

Under what license is this data being released?

How can we add our Claim Reviews to the published dataset ?

What if I disagree with the fact check?

What if I see some error in the ClaimReview structured data?


Q: What is Data Commons?

Datacommons.org is a collaborative, community initiative with a mission to maintain and provide access to useful structured data on the Internet in an easy to use manner. This collective effort is driven by people from participating companies (Google, Microsoft) and Universities (CMU, Duke University).

Q: What is ClaimReview?

Schema.org's ClaimReview is an open standard schema that was designed by the fact checking community to encode essential information about a fact check in a structured format. It has been adopted by many fact checkers around the world, including those verified by IFCN. This wide adoption of ClaimReview enabled major platforms to power fact check features inside their products, such as Google News, Google Search, Bing Search.

Q: What data is being released?

The data being released is structured markup of fact checking articles, that the publishers have associated with their articles. Each line is an individual markup that follows the ClaimReview schema and includes the url field that points back to the original fact checking article. The content of the fact checking article is not part of the release.

The released dataset of ClaimReview markups is a subset of ClaimReview markup available on the Web and is not representative of the entire schema.org markups ClaimReview available on the web. As the fact checking community grows, we expect the size of the released dataset will grow in future versions.

Q: Which publishers’ fact checks are being released?

The initial release of data include selected fact check markups from the following publishers: FactCheck.org, Snopes, PolitiFact, The Washington Post, The Weekly Standard. We intend to expand the list of publishers in future data releases. Please note that each fact checker has their own editorial standards and rating systems that are used to populate the ClaimReview markup.

Q: How often will the data be refreshed?

We intend to refresh the dataset to reflect the evolving underlying data and as more publishers are included. However, we don't have a set schedule for periodic update.

Q: What can I do with the data?

Be creative! This release is a step towards making it easier for researchers around the world to conduct their study in this important research area. We hope this can help the academia community come up with innovative ways to combat misinformation. We hope that studies using the dataset will help the public understand better how fact checkers are fighting the misinformation and provide useful insights.

Q: Under what license is this data being released?

The compilation of the dataset is licensed under CC BY. The license on the structured data of each ClaimReview markup is specified in the field sdLicense.

Additionally, each publisher may have their own license terms for content on their website. Please refer to the website of the publisher.

Q: How can we add our Claim Reviews to the published dataset?

The initial release includes a small set of selected fact check markups. Please express your interest on including your datasets by writing to us at support@datacommons.org.

Q: What if I disagree with the fact check?

Please follow the fact checking article URL as specified in the structured data and reach out to the publisher of the fact check.

Q: What if I see some error in the ClaimReview structured data?

If you believe the publisher has made the error and you can identify the publisher in the structured data, please reach out to the publisher. If you believe the error was introduced by the data compilation process, please reach out to support@datacommons.org.