Tag: deep-dive

394 articles · Page 2 / 17

Using LLMs as semantic injectors, this approach adapts time series models with process documents and metadata.

hardware

SourceJul 9, 20262026-07-09

Measuring How Hallucinations Distort Downstream Vision-Language Reasoning

HIVE evaluates how vision-language hallucinations propagate into later reasoning and distort downstream predictions.

hardware

SourceJul 9, 20262026-07-09

Reusable Skills for Better AI Data Science Workflows

Examines whether reusable skill files improve quality, auditability, and operations in repetitive AI data science tasks.

hardware

CommunityJul 9, 20262026-07-09

What Really Matters in Backend Code Evaluation

Why backend evaluation should prioritize SSOT consistency and catching critical PR-stage defects over raw code generation.

agi

CommunityJul 8, 20262026-07-08

AI Conversation and Gaming Compete for User Time

Examines how conversational AI and games compete for attention, highlighting different user needs and social dynamics.

agi

SourceJul 8, 20262026-07-08

Can Model Merging Beat Averaging in DiLoCo Aggregation

Examines whether model merging can outperform averaging in DiLoCo aggregation while balancing communication costs and final performance.

agi

SourceJul 8, 20262026-07-08

When Coding Agents Speed Up but Learning Slows

AI coding agents may raise productivity while reducing developer understanding, retention, and long-term problem-solving capacity.

agi

SourceJul 8, 20262026-07-08

Control AI Data Risks by Storage Path

How to separate session, RAG, and model parameter paths in generative AI to design confidentiality, deletion, and audit controls.

agi

SourceJul 8, 20262026-07-08

FreqDepthKV for Robust KV Cache Compression in Long Contexts

A concise look at FreqDepthKV, a method targeting KV cache bottlenecks in long-context LLM inference.

agi

SourceJul 8, 20262026-07-08

How Frontier AI Exposure Diverges Across National Economies

Using 141-country employment data, this piece explains why frontier AI exposure varies by job mix, productivity potential, and labor risk.

agi

SourceJul 8, 20262026-07-08

LLMs for SSH Research With Multilingual Knowledge Graphs

Applying LLMs to SSH research requires checking multilingual corpora, knowledge graphs, evaluation, bias, and governance together.

agi

SourceJul 8, 20262026-07-08

Radiology AI for Draft Reporting in Clinical Workflow

Examines Harrison.Rad 1.5 as a radiology draft-reporting model, focusing on workflow value, supervision, and deployment risks.

agi

SourceJul 8, 20262026-07-08

Why Tool-Calling Agent Security Is a Structural Problem

Why text-driven tool calls make AI agent delegation a structural security issue, backed by refusal-rate evidence.

agi

SourceJul 7, 20262026-07-07

Designing Organizational Memory for Agentic Process Execution

Agent bottlenecks are not just reasoning. Separate organizational knowledge into memory layers for reliability and control.

llm

SourceJul 7, 20262026-07-07

Finding First Errors in Small Model Physics Reasoning

A look at training small models to find first reasoning errors, use structured feedback, and revise answers in physics tasks.

agi

SourceJul 7, 20262026-07-07

Hierarchical Memory and Agentic Reasoning for Long Videos

Why long-video AI struggles with narrative and causal links, and how hierarchical memory and agentic reasoning help.

hardware

CommunityJul 7, 20262026-07-07

Why LLM Automation Does Not Lower Real-World Costs

Explains why better LLM performance and office automation do not directly reduce electricity, rent, or food costs.

llm

SourceJul 7, 20262026-07-07

Rethinking Agent Memory as Executable World State

Why agent memory may need to shift from text logs to object-centric executable environment models for long tasks.

hardware

SourceJul 7, 20262026-07-07

When Safety Alignment Over-Refuses Cyber Defense Requests

Examines how LLM safety alignment can over-refuse legitimate cyber defense requests and reduce utility.

agi

SourceJul 7, 20262026-07-07

Unified Diffusion Faces Label Conflicts in Medical Segmentation

A look at SNR-adaptive unified diffusion for medical segmentation, focusing on label conflicts over headline gains.

agi

SourceJul 7, 20262026-07-07

Utility Design for Stable Cooperation in Social Dilemmas

A MARL study on stabilizing cooperation in sequential social dilemmas through a utility function combining altruism and fairness.

hardware

SourceJul 6, 20262026-07-06

AI Data Centers Expand Into Power And Cooling

AI data center competition is expanding beyond chips to power reliability, cooling design, and water use.

hardware

SourceJul 6, 20262026-07-06

AI Reliability Talent Becomes the Real Deployment Bottleneck

Beyond GPUs, the urgent task is building AI reliability talent and TEVV-based operational governance.

llm

CommunityJul 6, 20262026-07-06

AI Search Speed Gains and Verification Tradeoffs

AI search can speed up answers, but citations, data, and technical details still require direct source verification.

Aionda

Tag: deep-dive

Injecting Process Semantics Into Time Series Forecasting

Measuring How Hallucinations Distort Downstream Vision-Language Reasoning

Reusable Skills for Better AI Data Science Workflows

What Really Matters in Backend Code Evaluation

AI Conversation and Gaming Compete for User Time

Can Model Merging Beat Averaging in DiLoCo Aggregation

When Coding Agents Speed Up but Learning Slows

Control AI Data Risks by Storage Path

FreqDepthKV for Robust KV Cache Compression in Long Contexts

How Frontier AI Exposure Diverges Across National Economies

LLMs for SSH Research With Multilingual Knowledge Graphs

Radiology AI for Draft Reporting in Clinical Workflow

Why Tool-Calling Agent Security Is a Structural Problem

Designing Organizational Memory for Agentic Process Execution

Finding First Errors in Small Model Physics Reasoning

Hierarchical Memory and Agentic Reasoning for Long Videos

Why LLM Automation Does Not Lower Real-World Costs

Rethinking Agent Memory as Executable World State

When Safety Alignment Over-Refuses Cyber Defense Requests

Unified Diffusion Faces Label Conflicts in Medical Segmentation

Utility Design for Stable Cooperation in Social Dilemmas

AI Data Centers Expand Into Power And Cooling

AI Reliability Talent Becomes the Real Deployment Bottleneck

AI Search Speed Gains and Verification Tradeoffs