What AI does
- Scans large documents with many claim-reference pairs rapidly
- Identifies numerical mismatches (wrong p-values, incorrect percentages, swapped comparator arms)
- Flags claims that use stronger language than the source supports (e.g., “significantly improved” where the source reports a trend)
- Structures the verification output into a reviewable format
What AI cannot do
AI checks whether a claim is technically supported by the cited reference. It does not check whether the claim is used in an appropriate context, whether important information from the reference has been omitted, or whether the reference cited is the best available evidence. These require human judgement.Before you start
- Ensure every verifiable claim in the document has a clear reference citation. Number or label claims if the document does not already do so.
- Obtain the full text of every cited reference. Partial verification against abstracts only creates a false sense of security — do not rely on abstracts alone.
- Confirm that references are finalised. Running this workflow on a document with placeholder references wastes effort.
Steps
Prepare the document
Ensure every verifiable claim has a clear reference citation. If the document does not already map claims to references, label them before starting.
Gather all reference materials
Obtain the full text of every cited reference. Use publisher PDFs wherever possible — scanned PDFs may have OCR errors in results tables.
Run automated verification with RefCheckr
Submit the document and reference materials to RefCheckr for a first-pass claim-to-reference comparison.
Review every flagged item
For each flag, open the cited reference and manually assess whether the claim is supported, partially supported, or not supported. Do not dismiss flags without checking.
Spot-check unflagged items
Automated tools do not catch everything. Manually spot-check 20–30% of unflagged claims, prioritising those with specific numerical data, comparative language, or endpoint results.
Assess context for supported claims
A claim may be technically supported by a reference but used in a context that changes its meaning. Review each claim in the context of the full document narrative, not just as an isolated claim-reference pair.
Correct and annotate
Fix inaccurate claims, update references, and document the verification outcome for each item reviewed.
Prompt pattern
Use this pattern when running manual verification or using a general-purpose LLM to support the process.Interpreting verification results
- SUPPORTED
- PARTIALLY SUPPORTED
- NOT SUPPORTED
- CANNOT VERIFY
The reference directly supports the claim as written. Spot-check the context — a supported claim used in a misleading context is still a problem.
Human review checklist
- All flagged claims have been manually reviewed against the cited reference
- A sample of unflagged claims has been spot-checked for accuracy
- Numerical data (p-values, CIs, percentages, sample sizes) verified against source
- Claims use language consistent with the strength of evidence (no overstatement)
- Qualifiers in the source are preserved in the claims (subgroup, post-hoc, exploratory)
- Safety claims are verified with the same rigour as efficacy claims
- All NOT SUPPORTED or PARTIALLY SUPPORTED flags have been resolved
- Changes made during verification are documented
- The document is ready for formal review with reference accuracy confirmed
Common failure modes
| Risk | What to look for |
|---|---|
| False negatives | Automated tools miss numerical errors when the reference and claim discuss the same metric — manually verify all specific data points in key claims |
| Context blindness | A claim is technically supported but used in a comparative context the reference does not support — assess claims in the context of the full document |
| Reference text quality | OCR errors in scanned PDFs cause tools to misread numbers — use publisher PDFs where possible |
| Scope limitation | Verification confirms that claims match cited references; it does not assess whether those references are the best available evidence |
Relevant tools
RefCheckr
Primary tool for systematic claim-to-reference comparison.
Next steps
Check Compliance
Pre-screen for compliance signals after verifying references.
Final Review
Complete the QC process before submission or publication.