Tag: explainer

232 articles · Page 2 / 10

Bug Reproduction Tests as Signals for Code Agents

How code agents can use bug reproduction tests as diagnostic signals during patch generation, not just post-hoc checks.

Source · Jul 2, 2026

Interpreting RAG Retrieval With Sparse Autoencoder Features

Explores using sparse autoencoders to disentangle dense RAG embeddings for interpretable retrieval analysis and steering.

agi

Source · Jul 2, 2026

Latent Space Control for Trustworthy LLM Behavior

From steering vectors to model calibrators, this paper frames latent-space intervention as a path to better LLM control and trust.

hardware

Source · Jul 2, 2026

On-Device AI Security Across App, Model, and OS

A look at the main security risks in mobile on-device AI, focusing on attack surfaces across apps, models, and OS.

hardware

Community · Jul 2, 2026

Public AI Infrastructure: Distributed Access or Concentrated Scale

Examines distributed vs. concentrated public AI compute strategies and what they mean for sovereign AI capacity.

llm

Source · Jul 1, 2026

Why Generator Evaluator Consistency Matters In LLM Self-Review

Why LLM self-review should be judged by generator-evaluator consistency, not accuracy alone, in agent workflows.

hardware

Community · Jun 29, 2026

Model Distillation, API Control, and Sovereign AI Risks

How model distillation expands from efficiency to API cost, competitive training, and control over data and compute.

hardware

Source · Jun 29, 2026

Translating Medical AI Explanations Into Clinical Workflow

How a speech-based cognitive impairment framework turns SHAP and linguistic features into clinical explanations for usability.

hardware

Source · Jun 29, 2026

What Should LLM Unlearning Actually Remove Precisely

A position paper argues LLM unlearning should mean dataset-defined deletion, not output suppression or behavior editing.

hardware

Source · Jun 28, 2026

Enforcing Agent Policies Beyond Prompt-Based Safety Guards

How formalized policies can deterministically govern agent tool calls beyond probabilistic prompt steering and filters.

llm

Source · Jun 28, 2026

Why Benchmarks Miss Much of LLM Performance

How single-run LLM benchmarks can miss usable performance, and why model choice, retries, and cost matter.

llm

Source · Jun 27, 2026

Why Agent Configs Need Deterministic Control Planes

Why reused coding agent config files can become an unmanaged control layer with security and operational risks.

hardware

Source · Jun 26, 2026

Agent-Driven Iteration Loops for Industrial Recommender Systems

A look at AgentX and the shift from model changes to automating hypothesis, code, experiment, and analysis loops.

hardware

Source · Jun 26, 2026

Emotion Vectors in Open LLMs and Behavior Control

Examines whether emotion vectors in open-weight LLMs are internal representations or merely correlated signals for behavior.

hardware

Source · Jun 26, 2026

HiLSVA Reframes Scientific Visualization Agent Control and Oversight

HiLSVA emphasizes plan-first workflows, human oversight, and provenance over full autonomy in scientific visualization agents.

llm

Community · Jun 26, 2026

How Generative AI Makes Money And Why Profitability Debates Persist

A look at how generative AI earns revenue, why infrastructure costs loom large, and how investment and cloud deals shape profitability.

hardware

Source · Jun 26, 2026

KARLA Rethinks Retrieval During Token Generation for LLMs

KARLA explores retrieving facts during token generation, reframing RAG tradeoffs around noise, latency, cost, and attribution.

agi

Source · Jun 26, 2026

Temporal Validity Challenges in RAG and Evolving Knowledge

How RAG mixes past and current facts, causing stale-fact errors, and why temporal validity matters in retrieval.

agi

Community · Jun 25, 2026

Balancing AI Benefits and Existential Risks Economically

Why AI's growth benefits and existential risks should be compared within one economic framework, not separate debates.

llm

Source · Jun 25, 2026

Evaluating VLM Visual Search Beyond Accuracy and Tokens

A framework for evaluating VLM visual search with classic human tasks, using token length and search cost beyond accuracy.

hardware

Source · Jun 25, 2026

Grounded LLM Workflows for Inherited Disease Diagnosis Ranking

DeepBD highlights grounded LLM workflows for inherited disease diagnosis, emphasizing traceable evidence and recall gains.

llm

Source · Jun 25, 2026

How LLMs Fail Plausibly on Research Math Problems

A look at four plausible LLM failure modes in research-level math and why verification design matters beyond accuracy.

hardware

Source · Jun 25, 2026

Modeling LLM Verifier Loops With Convergence Guarantees

A framework modeling LLM-verifier loops as a four-stage absorbing Markov chain to analyze convergence and failure points.

hardware

Source · Jun 25, 2026

Rethinking Agent Safety Beyond Model Internal Guardrails

Why agent safety must shift from internal prompts and filters to external runtime permission enforcement.

Aionda

Bug Reproduction Tests as Signals for Code Agents
