Tag: explainer

232 articles · Page 4 / 10

A look at why linear recurrent memory can work in partially observable RL through an HMM belief filtering view.

hardware

SourceMay 30, 20262026-05-30

Expert-Guided LLMs for Marine Lead Data Extraction

How expert-guided LLM agents structure marine lead and isotope data hidden in scientific literature.

llm

CommunityMay 29, 20262026-05-29

Coding Models Differ in Execution and Planning Styles

Coding model differences appear not in prose quality but in planning, tool use, and context handling scope.

llm

SourceMay 29, 20262026-05-29

Measuring Neural Networks' Preference for Simpler Solutions

A look at a proposed metric that approximates neural simplicity bias with data-dependent polynomials and its limits.

agi

SourceMay 29, 20262026-05-29

Q-Guided Alignment for Return-Conditioned Offline RL Control

Examines limits of RTG-only conditioning and how Q-guided alignment aims to improve controllability and reliability in offline RL.

agi

CommunityMay 29, 20262026-05-29

Reading AI Pricing Through Limits and Infrastructure Costs

AI pricing is better understood through usage caps, fallback rules, and inference infrastructure efficiency, not subscription fees alone.

hardware

SourceMay 28, 20262026-05-28

From Black-Box Grading to Rubric-Based Explainable Scoring

A look at rubric- and concept-based grading that makes open-ended scoring more reviewable, editable, and accountable.

llm

SourceMay 28, 20262026-05-28

Evaluating AI Agents for E-Commerce Dispute Resolution Tasks

CyberJurors evaluates agent systems on multi-round, multimodal evidence handling and platform rule adaptation in e-commerce disputes.

hardware

CommunityMay 28, 20262026-05-28

How Far Can Multimodal AI Be Trusted

Why multimodal AI still struggles with charts and scientific figures, and how to verify image-based conclusions in practice.

hardware

CommunityMay 28, 20262026-05-28

Vertical Integration Matters More Than Model Speed

AI vertical integration is less about chips than controlling the training stack, latency, throughput, utilization, and recovery.

llm

SourceMay 23, 20262026-05-23

Structuring Table QA With Navigation And Progressive Inference

A look at structuring table QA with guided cell navigation and staged inference to improve accuracy and verify evidence paths.

llm

SourceMay 21, 20262026-05-21

MOCHA Reframes Agent Skills Beyond Prompt Tuning Alone

MOCHA treats agent skills as multi-field artifacts and argues they must be optimized with platform constraints in mind.

hardware

SourceMay 20, 20262026-05-20

Neurosymbolic Ternary Claim Verification With Explainable Argumentation Framework

A study on claim verification that proposes ternary decisions and explainable argumentation under incomplete or conflicting evidence.

llm

SourceApr 3, 20262026-04-03

Prompt-Guided Image Compression for VLM Efficiency Gains

How prompt-guided image compression for VLMs shifts focus from human visual quality to preserving clues needed for tasks.

hardware

SourceMar 27, 20262026-03-27

How Mathematics Should Govern AI Use Now

Why mathematics must address AI through values, practice, teaching, technology, and ethics to protect autonomy.

agi

SourceMar 27, 20262026-03-27

Memory and Randomness Bottlenecks in Probabilistic Trustworthy AI

A unified view of probabilistic trustworthy AI: performance bottlenecks may lie in memory and random data movement, not just compute.

llm

SourceMar 27, 20262026-03-27

What Infant Vision Learning Suggests for AI Systems

How infant low-data visual learning links concepts, causality, and prediction to reshape AI vision and robotics design.

agi

SourceMar 27, 20262026-03-27

Wireless World Models for AI-Native 6G Networks

How wireless world models combine 3D geometry and wave propagation to improve real-world generalization in AI-native 6G.

hardware

SourceMar 26, 20262026-03-26

Rethinking LLM Agents as Adaptive Computation Graphs

View LLM agents as runtime-adaptive computation graphs to optimize accuracy, cost, latency, debugging, and control.

agi

SourceMar 20, 20262026-03-20

Judicial AI Depends on Human Algorithm Interaction Design

In courts, AI outcomes hinge less on model accuracy than on judge uptake, override patterns, accountability, and TEVV.

llm

SourceMar 20, 20262026-03-20

Medical AI Robotics Needs Governance Before Performance Claims

In medical AI robotics, governance, validation, and monitoring matter more than performance demos alone.

llm

SourceMar 20, 20262026-03-20

Tracing Long-Running Reasoning in Binary Analysis Agents

Examines why structured exploration and verifiable workflows may matter more than longer reasoning in LLM binary analysis.

agi

SourceMar 18, 20262026-03-18

Why Prediction-Equivalent Models Disagree on Feature Attribution

Models with identical predictions can still produce different feature attributions, challenging XAI reliability, audits, and governance.

hardware

SourceMar 18, 20262026-03-18

Reasoning With AI, Not Letting It Decide Alone

How combining LLMs with computational argumentation could shift AI from making decisions for us to reasoning with us.

Aionda

Tag: explainer

Why Linear Recurrent Memory Works in POMDP RL

Expert-Guided LLMs for Marine Lead Data Extraction

Coding Models Differ in Execution and Planning Styles

Measuring Neural Networks' Preference for Simpler Solutions

Q-Guided Alignment for Return-Conditioned Offline RL Control

Reading AI Pricing Through Limits and Infrastructure Costs

From Black-Box Grading to Rubric-Based Explainable Scoring

Evaluating AI Agents for E-Commerce Dispute Resolution Tasks

How Far Can Multimodal AI Be Trusted

Vertical Integration Matters More Than Model Speed

Structuring Table QA With Navigation And Progressive Inference

MOCHA Reframes Agent Skills Beyond Prompt Tuning Alone

Neurosymbolic Ternary Claim Verification With Explainable Argumentation Framework

Prompt-Guided Image Compression for VLM Efficiency Gains

How Mathematics Should Govern AI Use Now

Memory and Randomness Bottlenecks in Probabilistic Trustworthy AI

What Infant Vision Learning Suggests for AI Systems

Wireless World Models for AI-Native 6G Networks

Rethinking LLM Agents as Adaptive Computation Graphs

Judicial AI Depends on Human Algorithm Interaction Design

Medical AI Robotics Needs Governance Before Performance Claims

Tracing Long-Running Reasoning in Binary Analysis Agents

Why Prediction-Equivalent Models Disagree on Feature Attribution

Reasoning With AI, Not Letting It Decide Alone