FAQ
Q: What is Data Commons?
DataCommons.org is a collaborative, community initiative with a mission to maintain and provide access to useful structured data on the Internet in an easy to use manner.
Q: What is ClaimReview?
Schema.org's ClaimReview is an open standard schema that was designed by the fact checking community to encode essential information about a fact check in a structured format. It has been adopted by many fact checkers around the world, including those verified by IFCN. This wide adoption of ClaimReview enabled major platforms to power fact check features inside their products, such as Google News, Google Search, Bing Search, Facebook.
Q: What data is included in the research dataset ?
The data in the research data set includes structured markup of fact checking articles, that the publishers have associated with their articles. Each line is an individual markup that follows the ClaimReview schema and includes the url field that points back to the original fact checking articles. The content of the fact checking article is not part of the release.
The released dataset of ClaimReview markups is a subset of ClaimReview markup available on the Web and is not representative of the entire schema.org markups ClaimReview available on the web. As the fact checking community grows, we expect the size of the released dataset will grow in future versions.
Q: What data is included in the Fact Check Markup Tool data feed?
This is a regularly updated feed of ClaimReview markup created via the Google Fact Check Markup Tool. The data in the feed also follows the schema.org ClaimReview markup. The feed itself is in DataFeed format.
Q: Which publishers’ fact checks are being released in the research dataset?
The initial release of data include selected fact check markups from the following publishers: FactCheck.org, Snopes, PolitiFact, The Washington Post, The Weekly Standard. We intend to expand the list of publishers in future data releases. Please note that each fact checker has their own editorial standards and rating systems that are used to populate the ClaimReview markup.
Q: How often will the data be refreshed?
We intend to refresh the research dataset to reflect the evolving underlying data and as more publishers are included. However, we don't have a set schedule for periodic updates.
The Fact Check Markup Tool data feed is refreshed daily.
Q: What can I do with the data?
Be creative! This release of fact checks is a step towards making it easier for researchers around the world to conduct their study in this important research area. We hope this can help the academic community come up with innovative ways to combat misinformation. We hope that studies using the dataset will help the public understand better how fact checkers are fighting the misinformation and provide useful insights.
Q: Under what license is this data being released?
The compilation of the dataset is licensed under CC BY. The license on the structured data of each ClaimReview markup is specified in the field sdLicense.
Additionally, each publisher may have their own license terms for content on their website. Please refer to the website of the publisher.
Q: How can we add our ClaimReviews to the published datasets?
The initial release includes a small set of selected fact check markups. Please express your interest on including your datasets by writing to us at factcheck-support@datacommons.org.
Q: What if I disagree with the fact check?
Please follow the fact checking article URL as specified in the structured data and reach out to the publisher of the fact check.
Q: What if I see some errors in the ClaimReview structured data?
If you believe the publisher has made the error and you can identify the publisher in the structured data, please reach out to the publisher. If you believe the error was introduced by the data compilation process, please reach out to Factcheck Support.