Hallucination Red-Team
Canonical path: skills/legal-methodology/hallucination-red-team/SKILL.md
Agent Trigger Description
Use when stress-testing draft legal work product for hallucination risk by separating factual claims, legal claims, cited authorities, user-provided information, model-generated information, unsupported claims, uncertainty, and safer revisions.
What this produces: Hallucination risk report; Safer revised version or not-ready finding
What you give it: The complete draft legal output to red-team; The user-provided facts, source documents, and cited authorities; The intended use and audience of the output
When to use it: A draft contains legal claims, factual claims, citations, quotations, or confident conclusions that may be unsupported.
At a glance
| Practice area | Legal Methodology |
|---|---|
| Category | verification |
| Risk level | high |
| Recommended quality checks | attorney-review-gate citation-integrity-check source-validation-check assumption-audit hallucination-red-team jurisdiction-deadline-gates privilege-confidentiality-check output-format-compliance-check |
| Eval coverage | Manual eval ready |
| Compatible platforms | chatgpt, claude, cursor, codex, gemini, generic-md |
| Related skills | red team verifier, citation integrity check, source validation |
Example output not yet available.
Purpose
Force a draft legal output to account for every factual claim, legal claim, cited authority, user-provided fact, model-generated statement, unsupported claim, and uncertainty marker. The skill produces either a safer revised version or a "not ready for use" finding.
This focused hallucination pass complements the broader Red-Team Verifier. It does not independently verify the law and does not certify that a draft is correct.
Use When
- A draft contains legal claims, factual claims, citations, quotations, or confident conclusions that may be unsupported.
- A user asks for a hallucination check, unsupported-claim audit, or safer revised version.
- A draft will be sent externally, filed, used in a client communication, or relied on for a high-risk decision.
- A model produced the draft without a clear claim/source table.
Required Inputs
- The complete draft to red-team.
- All user-provided facts and source documents available for comparison.
- Any cited authorities, research notes, or source lists.
- The intended use and recipient.
If the draft is missing, stop and request it. If sources are missing, continue only as a limitation-heavy hallucination risk screen.
Do Not Use When
- The user wants new legal analysis instead of a verification pass.
- The output has no claims to audit.
- The user asks for a final certification that no hallucinations exist.
Legal Safety Rules
- Produce draft legal work product for attorney review. This is not legal advice.
- Do not claim the absence of hallucinations. State only what was checked and what remains unresolved.
- Do not invent sources, facts, citations, or safer substitute law.
- Do not assert that an authority exists or is fabricated unless verified by source material available in the session.
- Preserve privilege and confidentiality.
Workflow
- Segment the draft. Break the draft into discrete factual claims, legal claims, citations, quotations, assumptions, recommendations, and conclusions.
- Classify information origin. For each item, mark whether it is user-provided, source-provided, model-generated, inferred, or unclear.
- Inventory authorities. List every authority, citation, quotation, source URL, statute, rule, case, regulation, or guidance reference.
- Flag unsupported claims. Mark claims with no source, insufficient source support, contradicted source support, unclear origin, or overbroad phrasing.
- Flag authority risks. Mark invented-looking or unverifiable authorities, missing pin cites, jurisdiction mismatch, outdated authority risk, and unsupported legal propositions.
- Mark uncertainty. Add or recommend
[CONFIRM: ...],[VERIFY: ...],[citation needed],[pin cite needed],[verify jurisdiction], and[deadline verification required]markers where needed. - Decide readiness. If material unsupported claims, authority risks, or missing gates remain, mark "not ready for use." Otherwise mark "ready for attorney review," not final use.
- Revise safely. Produce a safer revised version only where the revision removes overclaiming, labels uncertainty, or narrows to supported facts. Do not supply missing law.
Output Format
Deliver:
- Draft Label — "Draft legal work product for attorney review. Not legal advice."
- Readiness Finding — Ready for attorney review / Not ready for use, with reasons.
- Claim-Origin Table
| # | Claim | Type | Origin | Support status | Risk | Required action |
|---|
- Authority Risk Table
| Authority | Use in draft | Verification status | Hallucination risk | Required action |
|---|
- Unsupported Claims List
- Uncertainty Markers To Add
- Safer Revised Version — Or "No safe revision without additional sources/attorney judgment."
- Attorney Verification Checklist
Attorney Verification Checklist
- [ ] Every factual claim is traced to user-provided or source-provided material, or is marked as unsupported.
- [ ] Every legal claim is supported by verified authority or flagged for attorney research.
- [ ] Every cited authority and quotation has been independently checked.
- [ ] Model-generated claims have been removed, sourced, or marked for verification.
- [ ] Jurisdiction and deadline gaps are resolved or visibly flagged.
- [ ] The revised draft does not present unsupported analysis as settled law.
- [ ] The output is ready for attorney review only; it has not been treated as final.
Full raw SKILL.md
--- name: Hallucination Red-Team description: "Use when stress-testing draft legal work product for hallucination risk by separating factual claims, legal claims, cited authorities, user-provided information, model-generated information, unsupported claims, uncertainty, and safer revisions." practice_area: legal-methodology task_type: verification jurisdictions: [] risk_level: high requires_attorney_review: true inputs: - "The complete draft legal output to red-team" - "The user-provided facts, source documents, and cited authorities" - "The intended use and audience of the output" outputs: - "Hallucination risk report" - "Safer revised version or not-ready finding" related_skills: - skills/legal-methodology/red-team-verifier/SKILL.md - skills/legal-methodology/citation-integrity-check/SKILL.md - skills/legal-methodology/source-validation/SKILL.md tags: - legal-methodology - hallucination-check - red-team - unsupported-claims - verification --- # Hallucination Red-Team ## Purpose Force a draft legal output to account for every factual claim, legal claim, cited authority, user-provided fact, model-generated statement, unsupported claim, and uncertainty marker. The skill produces either a safer revised version or a "not ready for use" finding. This focused hallucination pass complements the broader Red-Team Verifier. It does not independently verify the law and does not certify that a draft is correct. ## Use When - A draft contains legal claims, factual claims, citations, quotations, or confident conclusions that may be unsupported. - A user asks for a hallucination check, unsupported-claim audit, or safer revised version. - A draft will be sent externally, filed, used in a client communication, or relied on for a high-risk decision. - A model produced the draft without a clear claim/source table. ## Required Inputs - The complete draft to red-team. - All user-provided facts and source documents available for comparison. - Any cited authorities, research notes, or source lists. - The intended use and recipient. If the draft is missing, stop and request it. If sources are missing, continue only as a limitation-heavy hallucination risk screen. ## Do Not Use When - The user wants new legal analysis instead of a verification pass. - The output has no claims to audit. - The user asks for a final certification that no hallucinations exist. ## Legal Safety Rules - Produce draft legal work product for attorney review. This is not legal advice. - Do not claim the absence of hallucinations. State only what was checked and what remains unresolved. - Do not invent sources, facts, citations, or safer substitute law. - Do not assert that an authority exists or is fabricated unless verified by source material available in the session. - Preserve privilege and confidentiality. ## Workflow 1. **Segment the draft.** Break the draft into discrete factual claims, legal claims, citations, quotations, assumptions, recommendations, and conclusions. 2. **Classify information origin.** For each item, mark whether it is user-provided, source-provided, model-generated, inferred, or unclear. 3. **Inventory authorities.** List every authority, citation, quotation, source URL, statute, rule, case, regulation, or guidance reference. 4. **Flag unsupported claims.** Mark claims with no source, insufficient source support, contradicted source support, unclear origin, or overbroad phrasing. 5. **Flag authority risks.** Mark invented-looking or unverifiable authorities, missing pin cites, jurisdiction mismatch, outdated authority risk, and unsupported legal propositions. 6. **Mark uncertainty.** Add or recommend `[CONFIRM: ...]`, `[VERIFY: ...]`, `[citation needed]`, `[pin cite needed]`, `[verify jurisdiction]`, and `[deadline verification required]` markers where needed. 7. **Decide readiness.** If material unsupported claims, authority risks, or missing gates remain, mark "not ready for use." Otherwise mark "ready for attorney review," not final use. 8. **Revise safely.** Produce a safer revised version only where the revision removes overclaiming, labels uncertainty, or narrows to supported facts. Do not supply missing law. ## Output Format Deliver: 1. **Draft Label** — "Draft legal work product for attorney review. Not legal advice." 2. **Readiness Finding** — Ready for attorney review / Not ready for use, with reasons. 3. **Claim-Origin Table** | # | Claim | Type | Origin | Support status | Risk | Required action | |---|---|---|---|---|---|---| 4. **Authority Risk Table** | Authority | Use in draft | Verification status | Hallucination risk | Required action | |---|---|---|---|---| 5. **Unsupported Claims List** 6. **Uncertainty Markers To Add** 7. **Safer Revised Version** — Or "No safe revision without additional sources/attorney judgment." 8. **Attorney Verification Checklist** ## Attorney Verification Checklist - [ ] Every factual claim is traced to user-provided or source-provided material, or is marked as unsupported. - [ ] Every legal claim is supported by verified authority or flagged for attorney research. - [ ] Every cited authority and quotation has been independently checked. - [ ] Model-generated claims have been removed, sourced, or marked for verification. - [ ] Jurisdiction and deadline gaps are resolved or visibly flagged. - [ ] The revised draft does not present unsupported analysis as settled law. - [ ] The output is ready for attorney review only; it has not been treated as final.
You are assisting with a legal task using AgentCounsel, a platform-agnostic legal skills library. Use the skill provided below and follow it exactly. Operating rules (these always apply): - Produce draft legal work product for review by a licensed attorney. This is not legal advice and not a final answer. - Never invent legal authority, citations, quotations, facts, or deadlines. Mark every gap with a visible placeholder such as [CONFIRM: ...] or [VERIFY: ...]. - Identify jurisdiction, governing law, posture, and the relevant date — or flag them as unknown. Never compute a deadline. - Keep facts, assumptions, analysis, strategy, and verification items visibly separate. - Follow the skill's Workflow and Output Format. Complete its Attorney Verification Checklist. - If a Required Input is missing, stop and ask for it. Do not guess. === BEGIN SKILL: Hallucination Red-Team === --- name: Hallucination Red-Team description: "Use when stress-testing draft legal work product for hallucination risk by separating factual claims, legal claims, cited authorities, user-provided information, model-generated information, unsupported claims, uncertainty, and safer revisions." practice_area: legal-methodology task_type: verification jurisdictions: [] risk_level: high requires_attorney_review: true inputs: - "The complete draft legal output to red-team" - "The user-provided facts, source documents, and cited authorities" - "The intended use and audience of the output" outputs: - "Hallucination risk report" - "Safer revised version or not-ready finding" related_skills: - skills/legal-methodology/red-team-verifier/SKILL.md - skills/legal-methodology/citation-integrity-check/SKILL.md - skills/legal-methodology/source-validation/SKILL.md tags: - legal-methodology - hallucination-check - red-team - unsupported-claims - verification --- # Hallucination Red-Team ## Purpose Force a draft legal output to account for every factual claim, legal claim, cited authority, user-provided fact, model-generated statement, unsupported claim, and uncertainty marker. The skill produces either a safer revised version or a "not ready for use" finding. This focused hallucination pass complements the broader Red-Team Verifier. It does not independently verify the law and does not certify that a draft is correct. ## Use When - A draft contains legal claims, factual claims, citations, quotations, or confident conclusions that may be unsupported. - A user asks for a hallucination check, unsupported-claim audit, or safer revised version. - A draft will be sent externally, filed, used in a client communication, or relied on for a high-risk decision. - A model produced the draft without a clear claim/source table. ## Required Inputs - The complete draft to red-team. - All user-provided facts and source documents available for comparison. - Any cited authorities, research notes, or source lists. - The intended use and recipient. If the draft is missing, stop and request it. If sources are missing, continue only as a limitation-heavy hallucination risk screen. ## Do Not Use When - The user wants new legal analysis instead of a verification pass. - The output has no claims to audit. - The user asks for a final certification that no hallucinations exist. ## Legal Safety Rules - Produce draft legal work product for attorney review. This is not legal advice. - Do not claim the absence of hallucinations. State only what was checked and what remains unresolved. - Do not invent sources, facts, citations, or safer substitute law. - Do not assert that an authority exists or is fabricated unless verified by source material available in the session. - Preserve privilege and confidentiality. ## Workflow 1. **Segment the draft.** Break the draft into discrete factual claims, legal claims, citations, quotations, assumptions, recommendations, and conclusions. 2. **Classify information origin.** For each item, mark whether it is user-provided, source-provided, model-generated, inferred, or unclear. 3. **Inventory authorities.** List every authority, citation, quotation, source URL, statute, rule, case, regulation, or guidance reference. 4. **Flag unsupported claims.** Mark claims with no source, insufficient source support, contradicted source support, unclear origin, or overbroad phrasing. 5. **Flag authority risks.** Mark invented-looking or unverifiable authorities, missing pin cites, jurisdiction mismatch, outdated authority risk, and unsupported legal propositions. 6. **Mark uncertainty.** Add or recommend `[CONFIRM: ...]`, `[VERIFY: ...]`, `[citation needed]`, `[pin cite needed]`, `[verify jurisdiction]`, and `[deadline verification required]` markers where needed. 7. **Decide readiness.** If material unsupported claims, authority risks, or missing gates remain, mark "not ready for use." Otherwise mark "ready for attorney review," not final use. 8. **Revise safely.** Produce a safer revised version only where the revision removes overclaiming, labels uncertainty, or narrows to supported facts. Do not supply missing law. ## Output Format Deliver: 1. **Draft Label** — "Draft legal work product for attorney review. Not legal advice." 2. **Readiness Finding** — Ready for attorney review / Not ready for use, with reasons. 3. **Claim-Origin Table** | # | Claim | Type | Origin | Support status | Risk | Required action | |---|---|---|---|---|---|---| 4. **Authority Risk Table** | Authority | Use in draft | Verification status | Hallucination risk | Required action | |---|---|---|---|---| 5. **Unsupported Claims List** 6. **Uncertainty Markers To Add** 7. **Safer Revised Version** — Or "No safe revision without additional sources/attorney judgment." 8. **Attorney Verification Checklist** ## Attorney Verification Checklist - [ ] Every factual claim is traced to user-provided or source-provided material, or is marked as unsupported. - [ ] Every legal claim is supported by verified authority or flagged for attorney research. - [ ] Every cited authority and quotation has been independently checked. - [ ] Model-generated claims have been removed, sourced, or marked for verification. - [ ] Jurisdiction and deadline gaps are resolved or visibly flagged. - [ ] The revised draft does not present unsupported analysis as settled law. - [ ] The output is ready for attorney review only; it has not been treated as final. === END SKILL === First, confirm which Required Inputs you have and ask me for any that are missing. Then proceed with the Workflow.