Tag: explainer

232 articles available

ConceptSMILE Audits Concept Explanations Under Input Perturbations

agi

Source · Jul 13, 2026

ConceptSMILE audits concept-based explanations for stability, faithfulness, and consistency under input perturbations.

Latent Confounding Can Bias Bayesian Causal Discovery Posterior

agi

Source · Jul 13, 2026

Shows how latent confounding can skew the posterior in Bayesian causal discovery toward spurious edges rather than merely increasing uncertainty.

Clinical-Reasoning LLM Advances HCC Risk And Treatment Guidance

agi

Source · Jul 12, 2026

HCC-STAR reads EMR narratives to rank HCC risk and treatment priorities and to give evidence-backed explanations.

MetaNCA Learns Rules Beyond Fixed Network Architectures

agi

Source · Jul 12, 2026

MetaNCA explores self-organizing neural weights with local rules and tests generalization to unseen architectures.

Rethinking Medical LLM Evaluation for Clinical Reasoning

hardware

Source · Jul 12, 2026

A survey argues medical LLMs should be judged by clinical reasoning capacity, not just benchmark accuracy.

Tracing Jailbreaks Through Internal Attribution Graph Path Rerouting

hardware

Source · Jul 12, 2026

A look at interpreting LLM jailbreaks as internal path rerouting, with key findings, limits, and safety implications.

XAI Under the EU AI Act for Certification

hardware

Source · Jul 12, 2026

Under the EU AI Act, XAI appears closer to supporting evidence for high-risk AI assurance than a substitute for certification.

Three Axes for Comparing Korean LLM Performance

hardware

Community · Jul 10, 2026

Korean LLMs are better judged by naturalness, pragmatic understanding, and instruction following than by one rank.

Interpreting VLM Adversarial Risk via Spectral Subspaces

hardware

Source · Jul 10, 2026

A look at interpreting transformer-based VLM adversarial vulnerability through intermediate spectral subspaces.

Governing Agentic AI Beyond Outputs and Into Actions

hardware

Source · Jul 9, 2026

Why agentic AI governance must cover autonomy, tool use, external actions, audit logs, and human oversight.

Interpreting Transformer Circuits Beyond Reversible Modular Arithmetic

llm

Source · Jul 9, 2026

A look at transformer circuit analysis for composite modular multiplication, extending interpretation beyond reversible operations.

VASP Agent for Reliable Scientific Computation Workflows

agi

Source · Jul 9, 2026

VASP Agent targets reliable scientific automation by combining input consistency, long-run supervision, and output validation.

Interpreting Individual Parameters In Sparse Transformer Models

hardware

Source · Jul 8, 2026

Examines whether individual parameters in sparse transformers carry stable meanings amid polysemantic behavior.

Attention Limits in RLHF Preference Learning and Reward Models

hardware

Source · Jul 7, 2026

Examines how attention-limited pairwise labels in RLHF can distort reward learning and be mistaken for true preference.

Measuring LLM Emotion Interpretation Under Semantic Stress

hardware

Source · Jul 7, 2026

A study examines how LLMs' emotion interpretation consistency can weaken under semantic stress in affective dialogue.

How Question AIs Shift Search Toward Accuracy

hardware

Community · Jul 7, 2026

Question-based AI speeds research, but answer accuracy and source verification remain critical for reliable work.

How AI Shifts Skills, Tasks, and Learning

hardware

Community · Jul 6, 2026

Drawing on OECD and ILO reports, this explains how AI reshapes tasks before jobs and shifts learning toward understanding and verification.

How AI Changes Reading Without Replacing Understanding

hardware

Community · Jul 4, 2026

AI-assisted reading can lower comprehension barriers, but heavy reliance on summaries may weaken deep thinking.

Training-Free Attribution for Long Document Multimodal QA

agi

Source · Jul 4, 2026

A look at MultAttnAttrib for long-document multimodal QA, covering attribution benefits, limits, and evaluation criteria.

Conditional Co-Ablation Reveals Hidden Backup Transformer Circuits

agi

Source · Jul 3, 2026

How CoAx exposes backup circuits that single ablation can miss due to self-repair in transformers.

Context Governance for Verifiable AI Agent Knowledge

llm

Source · Jul 3, 2026

How ContextNest frames context governance with a verifiable knowledge vault layer for auditable AI agents beyond retrieval quality.

Counterfactual Coaching From Latent Space in StarCraft II

agi

Source · Jul 3, 2026

A look at RL research using latent space to generate counterfactual feedback in StarCraft II and its coaching potential.

Where AI Meets Quantum Information in Practice

agi

Source · Jul 3, 2026

Reviewing where AI and quantum information already deliver practical gains, and why quantum ML advantage still needs caution.

Why Foundational Learning Still Matters in the AI Era

hardware

Community · Jul 3, 2026

AI can boost productivity but also amplify errors, making foundational learning essential for problem framing, verification, and judgment.