Tag: rag

34 articles available

Do Higher LLM Scores Really Signal Approaching AGI

agi

Community · Jul 11, 2026

Public research suggests rising LLM scores reflect tools, memory, and planning systems, not a simple march toward AGI.

Control AI Data Risks by Storage Path

agi

Source · Jul 8, 2026

How to separate session, RAG, and model parameter paths in generative AI to design confidentiality, deletion, and audit controls.

Designing Organizational Memory for Agentic Process Execution

agi

Source · Jul 7, 2026

Agent bottlenecks are not just reasoning. Separate organizational knowledge into memory layers for reliability and control.

AI Search Speed Gains and Verification Tradeoffs

llm

Community · Jul 6, 2026

AI search can speed up answers, but citations, data, and technical details still require direct source verification.

ReContext Makes Long Context Actually Usable in Reasoning

llm

Source · Jul 4, 2026

ReContext highlights that long-context value depends on reusing evidence already in the prompt, not just larger windows.

Context Governance for Verifiable AI Agent Knowledge

llm

Source · Jul 3, 2026

How ContextNest frames context governance with a verifiable knowledge vault layer for auditable AI agents beyond retrieval quality.

Interpreting RAG Retrieval With Sparse Autoencoder Features

llm

Source · Jul 2, 2026

Explores using sparse autoencoders to disentangle dense RAG embeddings for interpretable retrieval analysis and steering.

LLM Data Fusion for Single and Multi Truth

llm

Source · Jun 29, 2026

A look at using LLMs for single- and multi-truth data fusion, with implications for RAG, memory, and data quality.

KARLA Rethinks Retrieval During Token Generation for LLMs

hardware

Source · Jun 26, 2026

KARLA explores retrieving facts during token generation, reframing RAG tradeoffs around noise, latency, cost, and attribution.

Temporal Validity Challenges in RAG and Evolving Knowledge

agi

Source · Jun 26, 2026

How RAG mixes past and current facts, causing stale-fact errors, and why temporal validity matters in retrieval.

Beyond RAG for Domain-Specific LLM Decision Tasks

llm

Community · Jun 25, 2026

RAGBench and LegalBench show why enterprise LLM evaluation must separate retrieval quality from domain-specific judgment.

Structure-Aware Retrieval Matters for Enterprise Document RAG

hardware

Source · Jun 4, 2026

In enterprise document RAG, retrieval granularity often matters more than reasoning. Why structure-aware search helps.

DistractionIF Exposes Hidden Instruction Risks In RAG Systems

llm

Source · May 30, 2026

DistractionIF shows how RAG systems misread instruction-like noise in documents and why pipeline design matters.

RAG Security Risks From Combined Injection And Poisoning

hardware

Source · Mar 27, 2026

Examines security risks in RAG when prompt injection and database poisoning combine across retrieval and indexing.

Epistemic Stability For Industrial LLM Hallucination Control

hardware

Source · Mar 12, 2026

Industrial LLM hallucinations framed as a reproducibility problem, comparing five prompt strategies to reduce output variance across repeated runs.

Grounding Self-Driving Explanations With Retrieval-Augmented Demonstrations

hardware

Source · Mar 10, 2026

RAG-Driver grounds driving explanations with retrieved expert demonstrations via RA-ICL, but evaluation still relies on BLEU, METEOR, and CIDEr.

When Long-Term Memory Hurts New Task Learning

agi

Community · Mar 9, 2026

Long-term memory can boost performance yet cause negative forward transfer as tasks evolve. Design deletion, summarization, and replacement policies.

Combustion Knowledgebase And QA Benchmark For LLM Pipelines

hardware

Source · Mar 7, 2026

A 3.5B-token combustion knowledgebase and CombustionQA benchmark unify knowledge injection and evaluation into one pipeline.

Designing Long-Form LLM Workflows Beyond Large Context Windows

llm

Community · Mar 6, 2026

For long policy reports, context and upload limits push chunked workflows that separate evidence retrieval from drafting, improving traceability and quality.

Guide-Driven Conversational Learning Workflow With Micro-Quizzes

hardware

Community · Mar 4, 2026

A guide-driven dialogue study loop: paste fragments, then run understanding checks, structured explanations, and tailored quizzes.

Designing Memory, Continual Learning, And Recursive Improvement Systems

llm

Community · Feb 16, 2026

Compare RAG vs parameter updates for long-term memory, then outline validation and gating needed for recursive self-improvement loops.

Reranking in RAG Pipelines: Benefits, Costs, and Evaluation

hardware

Guide · Feb 15, 2026

Learn how reranking after top-K retrieval improves ranking quality in RAG, and how to evaluate gains against added latency and cost.

Prevent Citation Hallucinations Across The Five-Step RAG Pipeline

hardware

Guide · Feb 14, 2026

Practical checklist to reduce citation hallucinations in long-form RAG by auditing chunking, retrieval/reranking, and refusal when evidence is thin.

Cloudflare Converts HTML Pages Into Markdown For Agents

llm

Trusted · Feb 12, 2026

Cloudflare’s “Markdown for Agents” converts requested HTML pages to Markdown, easing RAG inputs while raising citation, control, and injection risks.