docs: Add preprints on prompt-level incentive restructuring and hallucination mitigation by mahashu · Pull Request #766 · dair-ai/Prompt-Engineering-Guide

mahashu · 2026-05-20T16:49:08Z

Description

This pull request updates the literature catalog by indexing two companion open-science preprints evaluating prompt-level constraint architectures on frontier models (ChatGPT and Gemini).

Papers Added & Core Findings

Standing on a Trapdoor: AI Hallucination and Prompt-Level Cost Restructuring (Kowalski et al., 2026; DOI: 10.5281/zenodo.20019087)
- Focus: Introduces the baseline IDK+COMP constraint framework across 410 trials. Documents how default corporate alignment layers trade factual precision for conversational fluency, and demonstrates how to force model generation to terminate cleanly at the factual boundary.
A Puma in a Teacup: Signal Quality and Hallucination Suppression Through Prompt-Level Incentive Restructuring (Kowalski et al., 2026; DOI: 10.5281/zenodo.19502460)
- Focus: Analyzes context-saturation behaviors, unhedged refusal bounds, and creative signal optimization across 362 trials using test strings with no ground truth. Documents the "Brake-and-Slide" failure mode where isolation of the compression mandate (COMP alone) paradoxically forces Gemini's fabrication rates up to 70%, while removing refusal permissions (IDK) spikes it to 100%.

This contribution ensures that developers tracking the Applications index have direct access to both the empirical performance metrics and the underlying behavioral failure modes of inference-time safety layers.

vercel · 2026-05-20T16:49:13Z

@mahashu is attempting to deploy a commit to the DAIR-AI Team on Vercel.

A member of the Team first needs to authorize it.

mahashu · 2026-05-20T17:06:23Z

Clarification on methodology: The 410 trials represent the cumulative project total. The initial 362 trials are established in "A Puma in a Teacup" to map the core evaluation matrix across multiple governance conditions (including baseline, OGS, OGS-IDK, and COMP). The newest tracking block (trials 363-410) is introduced in "Standing on a Trapdoor" to isolate the specific IDK+COMP constraint configuration, requiring direct cross-comparison against those pre-established governance frameworks to map structural failure and suppression mechanics.

updated placeholders to Zenodo papers' URLs

corrected papers' URLs - again. This is terrible!

Update papers.en.mdx

e3c1ed9

mahashu added 3 commits May 20, 2026 21:00

Update papers.en.mdx

5b2bb98

updated placeholders to Zenodo papers' URLs

Update papers.en.mdx

42ba21b

Update papers.en.mdx

f0f45ea

corrected papers' URLs - again. This is terrible!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Add preprints on prompt-level incentive restructuring and hallucination mitigation#766

docs: Add preprints on prompt-level incentive restructuring and hallucination mitigation#766
mahashu wants to merge 4 commits into
dair-ai:mainfrom
mahashu:patch-1

mahashu commented May 20, 2026

Uh oh!

vercel Bot commented May 20, 2026

Uh oh!

mahashu commented May 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mahashu commented May 20, 2026

Description

Papers Added & Core Findings

Uh oh!

vercel Bot commented May 20, 2026

Uh oh!

mahashu commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mahashu commented May 20, 2026 •

edited

Loading