FinchContext
Run with

Build TAR Validation Report

Skill: Convert TAR/predictive-coding metrics into a defensibility validation report

Region: United States Category: Legal / eDiscovery Does: Takes the outputs and metrics of a Technology-Assisted Review (TAR / predictive coding) effort and assembles a validation report — seed/training detail, recall and precision estimates, elusion testing, and a defensibility statement — for disclosure to opposing counsel or the court and to satisfy ESI-protocol obligations. Authority: EDRM Technology-Assisted Review · The Sedona Conference TAR Case Law Primer / TAR perspectives · Da Silva Moore v. Publicis Groupe (SDNY 2012) · Rio Tinto plc v. Vale S.A. (SDNY 2016)

Courts accept TAR where the process is reasonable and validated, not where a particular tool is used — Rio Tinto framed the question as the reasonableness and transparency of the process. This report documents that the review met a defensible recall target via statistically sound sampling. Metric targets and disclosure scope are case- and protocol-specific (this is a Medium-confidence skill) — confirm against the ESI protocol and current case law before relying on it.


When this applies


Input data required

Group Fields
Workflow TAR method (TAR 1.0 SAL/SPL vs TAR 2.0 CAL), tool/version, reviewer roles
Population total documents, documents excluded (non-text, etc.), date range
Training seed-set size & selection (judgmental/random), number of training rounds, documents reviewed for training
Control set control-set size, prevalence (richness) estimate, confidence level & margin of error
Results documents classified responsive/non-responsive, cutoff/threshold, documents produced for review
Validation recall estimate + CI, precision, elusion test (responsive rate in the discard/null set)
QC reviewer-consistency checks, resolution of disagreements

Report structure

1. Scope & methodology — population, TAR method, tool, workflow narrative
2. Prevalence (richness) — estimated responsive rate with confidence level & margin of error
3. Training — seed selection, rounds, stabilization criteria
4. Cutoff determination — how the responsiveness threshold was set
5. Validation results:
     - Recall estimate (+ confidence interval)
     - Precision
     - Elusion test — responsive rate in the null/discard set (key completeness check)
     - Control-set performance
6. QC — reviewer-consistency / disagreement resolution
7. Defensibility statement — why the process was reasonable & proportional; what is disclosed
8. Appendices — sampling math, statistical assumptions

Build rules


Worked example (results summary)

Population: 480,000 docs (text-bearing). Method: TAR 2.0 (CAL), Tool vX.
Prevalence: 4.2% responsive (95% confidence, ±0.6%).
Training: judgmental seeds + CAL; 11 batches; stabilized when batch yield < 2 responsive/200.
Cutoff: rank score ≥ 0.55 → 27,400 docs produced for review.
Validation (95% confidence):
   Recall   = 82% (CI 78–86%)
   Precision= 61%
   Elusion test on the 452,600 discarded docs: responsive rate 0.7% (CI 0.4–1.0%)
Conclusion: recall target (≥80%) met; low elusion rate supports completeness. Methodology and
   recall disclosed to opposing counsel per ESI Protocol §5; seed coding withheld as work product.

Validation checklist


Last updated: 2026-05-31 — TAR validation metrics, recall targets, and disclosure obligations are case- and protocol-specific; confirm the statistical approach and what must be disclosed against the case ESI protocol, current FRCP, and TAR case law (Da Silva Moore, Rio Tinto) and Sedona Conference guidance before relying on the report.