🛰 AI Brief — 17 June 2026

🥇 PreAct: Computer-Using Agents that Get Faster on Repeated Tasks · prio 13

This research addresses a core bottleneck in agentic workflows by providing a mechanism for persistent, task-specific memory and significant runtime optimization, directly relevant to the community’s interest in building efficient, reliable AI agents. arxiv.org · Agents Agent Memory Tool Use

🥈 RAG from A to Z: Architect's Cheat Sheet (Vector Databases, Chunking, Reranking, and 8 Production Pitfalls) · prio 13

For the builder community, transitioning from simple RAG to production-grade systems is a critical challenge, and this guide provides structured architectural patterns to address common scaling, latency, and quality pitfalls. habr.com · RAG Vector Database Reranking Chunking LangChain ChromaDB OpenAI

🥉 ProvenanceGuard: Source-Aware Factuality Verification for MCP-Based LLM Agents · prio 12

As builders move to complex multi-source MCP-based agents, preventing cross-source conflation—where correct info is attributed to the wrong source—is critical for reliability, especially in high-stakes domains like medicine. arxiv.org · Agents MCP RAG RAG Evaluation arXiv

4️⃣ Analyzing Agent Trajectories to Close the Intent-Execution Gap · prio 11

Aggregate benchmarks currently mask significant differences in how frontier models solve complex tasks. By shifting focus to trajectory analysis and system-harness alignment, AI builders can better diagnose agent failures and optimize agentic workflows beyond surface-level metrics. arxiv.org · 8 sources · Agents LLM Evals Code Agents Anthropic Google OpenAI xAI Qwen

5️⃣ Software Delegation Contracts: Measuring Reviewability in AI Coding-Agent Work · prio 11

For AI-builder communities, this research demonstrates a measurable trade-off when using delegation contracts: they do not improve code correctness in this study but provide significant gains in work reviewability, which is essential for scaling agentic coding workflows. arxiv.org · Code Agents LLM Evals

⚠️ Knowledge Gaps