Tag: explainer

232 articles · Page 3 / 10

CineCap targets cinematic video captioning, focusing on camera motion, shot size, angle, and structured scene reasoning.

llm

SourceJun 24, 20262026-06-24

HOLMES Challenges LLMs With Higher-Order Logic Reasoning

HOLMES probes higher-order logic reasoning beyond final answers, exposing limits in LLM rule, predicate, and constraint handling.

agi

SourceJun 23, 20262026-06-23

The AI Evaluability Gap in Risk Governance

Why AI deployment decisions depend not just on performance, but on sufficient evaluation evidence and governance links.

agi

SourceJun 23, 20262026-06-23

RLHF Alignment Through the Lens of Social Choice

A look at recent research framing RLHF as preference aggregation, with implications for fairness and safety.

agi

SourceJun 23, 20262026-06-23

Role-Based Agentic AI for Intent-Driven Network Operations

Examines role-based agentic AI for intent-driven telecom operations, with focus on autonomy, orchestration, and safety.

hardware

CommunityJun 23, 20262026-06-23

UK Backs Open AI on Everyday Hardware

The UK funds open AI and general-purpose hardware research to expand access, efficiency, and tech autonomy.

agi

CommunityJun 22, 20262026-06-22

AI, Fermi Paradox, and the Meaning of L

A look at the Fermi Paradox through Drake equation variable L, observation limits, and AI risk claims.

agi

CommunityJun 20, 20262026-06-20

AI Coding Needs Review More Than Speed Gains

AI coding can boost output, but not quality or accountability. The real bottleneck is review, validation, and approval.

agi

SourceJun 20, 20262026-06-20

How Linear Transformer FFN Blocks Really Are

Examines per-block linear recoverability of transformer FFNs and what R^2_lin may imply for compression and interpretability.

llm

SourceJun 20, 20262026-06-20

How LLMs Encode Essay Quality for Scoring

Examines how LLMs encode essay quality in hidden representations and whether those signals persist across prompt changes.

hardware

SourceJun 20, 20262026-06-20

Interpreting Style-Caption TTS With Cross-Attention Attribution

A look at why cross-attention attribution matters for interpreting word-level style control in caption-based TTS.

agi

SourceJun 20, 20262026-06-20

JustDiag for Auditable Root Cause Analysis in LLM Workflows

Why JustDiag reframes LLM root cause analysis around evidence, alternatives, contradictions, and uncertainty.

hardware

SourceJun 20, 20262026-06-20

Measuring False Intervention in DeFi Supervisory AI Agents

Why DeFi supervisory AI should measure false intervention separately from accuracy, with practical checks for evaluation.

hardware

SourceJun 20, 20262026-06-20

Query Placement Matters in Diffusion LLM In-Context Learning

Why query placement may affect diffusion LLM in-context learning, and what prior position-bias results imply.

llm

SourceJun 20, 20262026-06-20

When Learner-Based Drift Detection Works Better in Streaming

Explains when learner-based drift detection outperforms statistical tests in streaming ML and what matters operationally.

hardware

SourceJun 19, 20262026-06-19

Can General Models Extract Legal Networks Reliably

Using FineREX, this examines why legal-record extraction for smuggling knowledge graphs needs domain-specific schemas and review.

hardware

SourceJun 19, 20262026-06-19

Self-Review Alignment for Safer LLM Reasoning Outputs

Explores combining a conscience step with DPO so LLMs review reasoning during inference while balancing safety and performance.

hardware

SourceJun 12, 20262026-06-12

Reframing Shielded RL as Design-Time Structure Analysis

A concise look at shielded RL reinterpreted as a design-time tool for structural safety analysis, not runtime blocking.

hardware

CommunityJun 12, 20262026-06-12

Rethinking AI Job Impacts Beyond Mass Unemployment Fears

Official reports suggest AI is reshaping tasks and productivity before causing broad job losses.

agi

SourceJun 12, 20262026-06-12

Rethinking AI Loss of Control Through Operational Definitions

Examines vague AI loss-of-control language and reframes it around goals, audits, interruption, and rollback.

agi

SourceJun 2, 20262026-06-02

Why Mechanistic Interpretability Needs Auditable Validation Rules

Mechanistic interpretability matters, but auditable, reproducible validation rules are what safety-critical AI needs.

hardware

SourceJun 2, 20262026-06-02

TriLens Tracks Hallucination Signals Across LLM Internal Layers

TriLens explores white-box hallucination detection by tracking layer-wise entropy signals before incorrect answers emerge.

agi

SourceJun 1, 20262026-06-01

Do Warm Personalized AI Replies Persuade Users More?

Examines how contextual personalization and warmth affect trust, persuasion, and reliance in conversational AI.

agi

CommunityJun 1, 20262026-06-01

Why AI Stops Reproducing Lyrics and Long Texts

Why AI services often block long copyrighted text reproduction but allow transforms of user-provided text.

Aionda

Tag: explainer

CineCap And The Challenge Of Cinematic Video Captioning

HOLMES Challenges LLMs With Higher-Order Logic Reasoning

The AI Evaluability Gap in Risk Governance

RLHF Alignment Through the Lens of Social Choice

Role-Based Agentic AI for Intent-Driven Network Operations

UK Backs Open AI on Everyday Hardware

AI, Fermi Paradox, and the Meaning of L

AI Coding Needs Review More Than Speed Gains

How Linear Transformer FFN Blocks Really Are

How LLMs Encode Essay Quality for Scoring

Interpreting Style-Caption TTS With Cross-Attention Attribution

JustDiag for Auditable Root Cause Analysis in LLM Workflows

Measuring False Intervention in DeFi Supervisory AI Agents

Query Placement Matters in Diffusion LLM In-Context Learning

When Learner-Based Drift Detection Works Better in Streaming

Can General Models Extract Legal Networks Reliably

Self-Review Alignment for Safer LLM Reasoning Outputs

Reframing Shielded RL as Design-Time Structure Analysis

Rethinking AI Job Impacts Beyond Mass Unemployment Fears

Rethinking AI Loss of Control Through Operational Definitions

Why Mechanistic Interpretability Needs Auditable Validation Rules

TriLens Tracks Hallucination Signals Across LLM Internal Layers

Do Warm Personalized AI Replies Persuade Users More?

Why AI Stops Reproducing Lyrics and Long Texts