Evidence Management

Last Updated: March 27, 2026 (NQU-401 review)

Overview

Evidence is the foundation of any inquiry. Nquiry provides comprehensive tools for uploading, organizing, and analyzing evidence items while maintaining the metadata needed for professional documentation.

Evidence Types

Nquiry supports seven evidence types, each with appropriate handling:

Type	Icon	Description	Best For
Document	📄	PDFs, Word docs, spreadsheets, text files	Policies, records, correspondence
Interview	🎤	Transcripts, recordings, statements	Witness accounts, depositions
Website	🌐	URLs, screenshots, web content	Online sources, public records
Observation	👁️	Field notes, site visit documentation	Physical inspections, site visits
Dataset	📊	Structured data, exports, logs	System data, statistical evidence
Note	📝	General notes, summaries, memos	Internal documentation
Standard	📏	Policies, regulations, compliance criteria	Normative/compliance documents, IRAC pattern

Creating Evidence

Required Fields

Title: Descriptive name for the evidence
Type: One of the seven evidence types
Investigation: Auto-assigned to current project

Optional Fields

Description: Summary of what the evidence contains
Content Text: Full text content or transcript
Source: Where the evidence came from
Source Date: When the evidence was created/obtained
Investigator Notes: Private notes about the evidence
To Be Collected: Flag for planned but not-yet-obtained evidence

File Attachments

Each evidence item can have multiple file attachments:

PDFs, images, Word documents, Excel files
Files are stored securely in encrypted S3 storage
Text is extracted from PDFs and Office documents for search

Evidence Collection Tracking

"To Be Collected" Feature

Mark evidence items as "to be collected" when planning your investigation:

Create placeholder evidence items during planning
Flag them as "to be collected"
Add notes about how/when to obtain
System tracks collection progress

When evidence content is added (text or attachments), the "to be collected" flag automatically clears.

Collection Status Dashboard

The Overview page shows:

Total evidence items
Items with attachments
Items pending collection
Evidence linked to questions

Linking Evidence to Questions

Why Link Evidence?

Linking evidence to questions:

Ensures all questions have supporting evidence
Identifies evidence gaps
Provides context for AI analysis
Enables traceability in reports

How to Link

From the evidence detail dialog:

Click "Link to Questions"
Select relevant questions
Add optional relevance notes
Save links

From the Questions page:

Open a question
View linked evidence
Add new links

Link Quality Indicators

Evidence cards show:

Number of linked questions (badge)
Unlinked evidence highlighted in filters
Questions without evidence flagged in analysis

Evidence for AI Analysis

How Evidence is Used

When generating AI analysis:

Semantic Search: System finds evidence most relevant to the question
Context Building: Relevant chunks are assembled for AI review
File Processing: PDFs and documents are read by the AI
Citation Tracking: AI cites specific evidence in its analysis

Processing for Search

Evidence is automatically processed for semantic search:

Text content is chunked into smaller segments
Chunks are converted to vector embeddings
Embeddings enable similarity-based retrieval

Processing status is shown in analysis generation feedback.

File Type Support

The AI can directly analyze:

PDFs: Full document analysis with page references
Images: Visual analysis (JPEG, PNG, GIF, WebP)
Word Documents: Text extraction and analysis
Excel Files: Sheet-by-sheet text extraction

Evidence Organization

Filtering

Filter evidence by:

Type: Document, interview, website, etc.
Link Status: All, linked to questions, not linked
Collection Status: Collected, to be collected

Sorting

Sort evidence by:

Newest first (default)
Oldest first
Source date
Type
Title (A-Z)

Bulk Upload

Upload multiple evidence files at once via the bulk upload dialog. The dialog supports drag-and-drop and shows file details in a wider layout for comfortable review.

In-App File Viewer

View evidence attachments and background documents directly in the app without downloading:

Supported formats: PDFs (rendered in browser PDF viewer), images (PNG, JPEG, GIF, WebP, SVG), text files (plain text, markdown, CSV, HTML)
Where it appears: "View" link next to "Download" on evidence detail pages; eye icon button on background documents in Settings
Unsupported formats: .docx, .xlsx, and other binary formats show only the Download option
Files are served securely through the app with 5-minute caching

Evidence Notes

Investigator Notes

Private notes attached directly to evidence:

Not included in reports by default
Useful for tracking follow-up items
Visible only to team members

Scoping AI Analysis with Evidence Notes (Evidence Windowing)

Investigator notes serve a second purpose beyond annotation — they can be used to scope which portion of an evidence document gets sent to the AI during analysis. This is called evidence windowing.

The full document text is always preserved in the database and used for search and retrieval. Windowing only affects what gets loaded into the AI prompt at analysis time, keeping prompts focused and reducing noise.

Spreadsheet files (.xlsx, .xls, .csv)

Add range hints to the evidence notes field to scope which rows, columns, or sheets are sent to the AI:

Hint	Example	Effect
Cell range	`B1:D20`	Sends columns B–D, rows 1–20
Sheet + range	`Sheet1!B1:D20`	Scopes to a specific sheet and range
Row range	`rows 5-20`	Sends only rows 5 through 20 (header always included)
Column range	`columns A-F`	Sends only columns A through F
Sheet filter	`Sheet2 only`	Sends only the named sheet

PDF files

Add page hints to scope which pages are sent to the AI:

Hint	Example	Effect
Page range	`pages 12-15`	Sends pages 12 through 15
Single page	`page 5`	Sends only page 5
Abbreviated range	`pp. 5-10`	Sends pages 5 through 10
Short form	`p. 7`	Sends only page 7

Example workflow — procedure compliance: Upload a multi-page procedure document as evidence. In the investigator notes, write page 5. When analysis runs, only page 5 is sent to the AI in the prompt. In the Direction for AI field on the Generate Analysis dialog, add the analytical instruction: "Determine whether the subject followed the sterilization procedure on page 5." The windowed content and the investigator direction work together for a focused, precise analysis.

Key behaviors:

Windowing hints are detected automatically — no special syntax beyond the natural language patterns above
Multiple hints are supported (e.g., Sheet2 only, rows 1-20)
If a requested range exceeds the document size, available content is returned without error
Spreadsheet hints on PDF files are ignored (and vice versa)
A note is prepended to the prompt when windowing is applied: "[Note: Content windowed per investigator guidance. Full data preserved for search.]"
Documents without windowing hints pass through unchanged

Implemented: NQU-298 (spreadsheets), NQU-314 (PDFs)

Evidence Notes (Annotations)

Structured annotations on evidence:

Observation: What the evidence shows
Analysis: Interpretation of significance
Follow-up: Additional work needed
Concern: Flags or red flags
General: Other notes

Security and Compliance

Storage Security

Files encrypted at rest (AES-256)
Files encrypted in transit (TLS 1.2+)
Stored in organization-isolated S3 paths
Access requires authentication + org authorization

Audit Logging

All evidence actions are logged:

Upload, view, download, delete
Link/unlink to questions
User, timestamp, IP address
Success/failure status

HIPAA Considerations

PHI detection available via Bedrock Guardrails
Evidence access restricted to org members
Audit trail maintained for compliance

Evidence Evaluation

The 10 CIGIE/GAO quality criteria below shape what the AI looks for when assessing evidence. They're used as a conceptual framework, not as individually-scored dimensions — per NQU-100, the active output is a simpler 2-field assessment (confidence + reasoning) rolled up across all 10.

Relevance - Logical connection to the question
Reliability - Trustworthiness of source
Sufficiency - Adequate quantity and scope
Validity - Accurately represents what it claims
Competence - Quality appropriate to form
Completeness - No critical gaps
Timeliness - Current and applicable
Objectivity - Fact-based vs. opinion
Authenticity - Genuine and verifiable
Consistency - Aligns with other evidence

See Evidence Evaluation Framework for the historical reference.

Best Practices

Collection

Document source and date for every item
Use consistent naming conventions
Attach original files when possible
Record chain of custody for physical evidence

Organization

Link evidence to questions as you collect
Use investigator notes for follow-up items
Filter regularly to find unlinked items
Review "to be collected" items weekly

Quality

Verify authenticity before relying on evidence
Seek corroboration for critical claims
Note any concerns about reliability
Document chain of custody

Three-Stage Hybrid Retrieval Pipeline

Added 2026-03-06 (NQU-379). Major update 2026-03-27 (NQU-462).

Nquiry uses a three-stage hybrid retrieval approach:

Stage 1 — Keyword Search. PostgreSQL tsvector/tsquery finds evidence containing the same terms as the question. Catches exact-match evidence (case numbers, names, policy codes, identifiers) that semantic search misses. Canonical example: NQU-380 DOC-198 found exclusively by keyword search.

Stage 2 — Semantic Search (Vector Embeddings). Amazon Titan Text Embeddings V2 via pgvector converts questions and evidence into meaning vectors. Catches evidence that's relevant even when it uses different words.

Stage 3 — Reranking. Cohere Rerank 3.5 re-evaluates merged results from Stages 1 and 2 by reading question and evidence together. Promotes keyword-surfaced results that semantic search missed; demotes false positives.

Team-draft interleaving (NQU-462): Results from keyword and vector search are combined using alternating picks (team-draft), replacing the old ad-hoc "guaranteed keyword top-20" slot reservation. This scales naturally without hardcoded slot counts.

Query-adaptive weighting (NQU-462): The system automatically classifies queries as entity-heavy (proper nouns, dates, IDs → 2.5x keyword boost), conceptual (abstract → 1.0x), or hybrid (moderate → 1.5x). No LLM needed — uses heuristic detection of capitalized multi-word sequences and specific identifiers.

Continuous feedback (NQU-462): After each analysis generation, the system computes retrieval quality by comparing retrieved evidence vs. AI citations, storing results in the retrieval_benchmark_result table for trend monitoring.

Baseline metrics (Davenheim, 5 queries): R@5=0.166, R@10=0.283, R@20=0.323, MRR=0.400, NDCG@20=0.362.

Storage and Quotas

Plan	Storage Limit
Trial	2 GB
Professional	50 GB
Team	200 GB

Storage used is shown in Settings → Billing.

Large files (>25MB) may require splitting or compression.

AI Quality Metrics - How evidence retrieval affects analysis
Investigation Workflow - Evidence in the collection phase
Evidence Evaluation Framework - Quality criteria

Overview​

Evidence Types​

Creating Evidence​

Required Fields​

Optional Fields​

File Attachments​

Evidence Collection Tracking​

"To Be Collected" Feature​

Collection Status Dashboard​

Linking Evidence to Questions​

Why Link Evidence?​

How to Link​

Link Quality Indicators​

Evidence for AI Analysis​

How Evidence is Used​

Processing for Search​

File Type Support​

Evidence Organization​

Filtering​

Sorting​

Bulk Upload​

In-App File Viewer​

Evidence Notes​

Investigator Notes​

Scoping AI Analysis with Evidence Notes (Evidence Windowing)​

Evidence Notes (Annotations)​

Security and Compliance​

Storage Security​

Audit Logging​

HIPAA Considerations​

Evidence Evaluation​

Best Practices​

Collection​

Organization​

Quality​

Three-Stage Hybrid Retrieval Pipeline​

Storage and Quotas​

Related Documentation​