Tag: llm

999 articles · Page 2 / 42

A curated link roundup from recently collected official updates and tech news.

hardware

CommunityJul 10, 20262026-07-10

Controlling Drift In Long-Running Coding Agents

Long-running coding agents need drift control, fixed specs, and review gates more than stronger reasoning alone.

agi

SourceJul 10, 20262026-07-10

Crossmodal Speech Emotion Analysis With Audio And Generated Transcripts

Why combining audio with generated multilingual transcripts matters for speech emotion analysis, and where errors and cost tradeoffs remain.

hardware

SourceJul 10, 20262026-07-10

MiniMax's 2.7 Trillion Model Rumor and Open Weights

Key issues in the MiniMax report: a rumored 2.7 trillion-parameter LLM, possible open weights, licensing, and inference costs.

agi

SourceJul 10, 20262026-07-10

RAID Finds Six Goalie AI Exploits in NHL 26

RAID found six scoring exploits in NHL 26 goalie AI in one run, highlighting automated QA and reusable red-team testing.

hardware

CommunityJul 10, 20262026-07-10

Three Axes for Comparing Korean LLM Performance

Korean LLMs are better judged by naturalness, pragmatic understanding, and instruction following than by one rank.

hardware

SourceJul 10, 20262026-07-10

Interpreting VLM Adversarial Risk via Spectral Subspaces

A look at interpreting transformer-based VLM adversarial vulnerability through intermediate spectral subspaces.

k-ai-pulse

RoundupJul 9, 20262026-07-09

AI Resource Roundup (24h) - 2026-07-09

A curated link roundup from recently collected official updates and tech news.

hardware

SourceJul 9, 20262026-07-09

Continual Learning for Adaptive Modular Soft Robot Control

A look at an arXiv paper proposing continual learning for adaptive control of modular soft robots under morphology changes.

agi

SourceJul 9, 20262026-07-09

How Deployment Rules Shift Multi-Agent AI Safety

A study showing that deployment rules, not just models, can causally reshape multi-agent behavior and safety outcomes.

hardware

SourceJul 9, 20262026-07-09

Gimitest Framework for Testing RL Policy Failures

Gimitest is an open-source framework for testing RL policies under changing conditions to uncover failures and vulnerabilities.

hardware

SourceJul 9, 20262026-07-09

Governing Agentic AI Beyond Outputs and Into Actions

Why agentic AI governance must cover autonomy, tool use, external actions, audit logs, and human oversight.

llm

SourceJul 9, 20262026-07-09

Injecting Process Semantics Into Time Series Forecasting

Using LLMs as semantic injectors, this approach adapts time series models with process documents and metadata.

llm

SourceJul 9, 20262026-07-09

Interpreting Transformer Circuits Beyond Reversible Modular Arithmetic

A look at transformer circuit analysis for composite modular multiplication, extending interpretation beyond reversible operations.

hardware

SourceJul 9, 20262026-07-09

Measuring How Hallucinations Distort Downstream Vision-Language Reasoning

HIVE evaluates how vision-language hallucinations propagate into later reasoning and distort downstream predictions.

agi

SourceJul 9, 20262026-07-09

PCBWorld Redefines Evaluation for Engine-Grounded PCB Routing AI

An overview of PCBWorld, a KiCad-based environment for evaluating PCB routing AI with native actions and DRC feedback.

hardware

SourceJul 9, 20262026-07-09

Reusable Skills for Better AI Data Science Workflows

Examines whether reusable skill files improve quality, auditability, and operations in repetitive AI data science tasks.

agi

SourceJul 9, 20262026-07-09

VASP Agent for Reliable Scientific Computation Workflows

VASP Agent targets reliable scientific automation by combining input consistency, long-run supervision, and output validation.

hardware

CommunityJul 9, 20262026-07-09

What Really Matters in Backend Code Evaluation

Why backend evaluation should prioritize SSOT consistency and catching critical PR-stage defects over raw code generation.

agi

CommunityJul 8, 20262026-07-08

AI Conversation and Gaming Compete for User Time

Examines how conversational AI and games compete for attention, highlighting different user needs and social dynamics.

k-ai-pulse

RoundupJul 8, 20262026-07-08

AI Resource Roundup (24h) - 2026-07-08

A curated link roundup from recently collected official updates and tech news.

agi

SourceJul 8, 20262026-07-08

Can Model Merging Beat Averaging in DiLoCo Aggregation

Examines whether model merging can outperform averaging in DiLoCo aggregation while balancing communication costs and final performance.

agi

SourceJul 8, 20262026-07-08

When Coding Agents Speed Up but Learning Slows

AI coding agents may raise productivity while reducing developer understanding, retention, and long-term problem-solving capacity.

agi

SourceJul 8, 20262026-07-08

Control AI Data Risks by Storage Path

How to separate session, RAG, and model parameter paths in generative AI to design confidentiality, deletion, and audit controls.

Aionda

Tag: llm

AI Resource Roundup (24h) - 2026-07-10

Controlling Drift In Long-Running Coding Agents

Crossmodal Speech Emotion Analysis With Audio And Generated Transcripts

MiniMax's 2.7 Trillion Model Rumor and Open Weights

RAID Finds Six Goalie AI Exploits in NHL 26

Three Axes for Comparing Korean LLM Performance

Interpreting VLM Adversarial Risk via Spectral Subspaces

AI Resource Roundup (24h) - 2026-07-09

Continual Learning for Adaptive Modular Soft Robot Control

How Deployment Rules Shift Multi-Agent AI Safety

Gimitest Framework for Testing RL Policy Failures

Governing Agentic AI Beyond Outputs and Into Actions

Injecting Process Semantics Into Time Series Forecasting

Interpreting Transformer Circuits Beyond Reversible Modular Arithmetic

Measuring How Hallucinations Distort Downstream Vision-Language Reasoning

PCBWorld Redefines Evaluation for Engine-Grounded PCB Routing AI

Reusable Skills for Better AI Data Science Workflows

VASP Agent for Reliable Scientific Computation Workflows

What Really Matters in Backend Code Evaluation

AI Conversation and Gaming Compete for User Time

AI Resource Roundup (24h) - 2026-07-08

Can Model Merging Beat Averaging in DiLoCo Aggregation

When Coding Agents Speed Up but Learning Slows

Control AI Data Risks by Storage Path