Tag: llm

999 articles · Page 7 / 42

How ontology constraints reduce noisy paths in multi-hop KGQA and improve reasoning for complex queries.

SourceJun 29, 20262026-06-29

Translating Medical AI Explanations Into Clinical Workflow

How a speech-based cognitive impairment framework turns SHAP and linguistic features into clinical explanations for usability.

hardware

SourceJun 29, 20262026-06-29

What Should LLM Unlearning Actually Remove Precisely

A position paper argues LLM unlearning should mean dataset-defined deletion, not output suppression or behavior editing.

hardware

SourceJun 28, 20262026-06-28

Enforcing Agent Policies Beyond Prompt-Based Safety Guards

How formalized policies can deterministically govern agent tool calls beyond probabilistic prompt steering and filters.

llm

SourceJun 28, 20262026-06-28

Why Benchmarks Miss Much of LLM Performance

How single-run LLM benchmarks can miss usable performance, and why model choice, retries, and cost matter.

llm

SourceJun 27, 20262026-06-27

Why Agent Configs Need Deterministic Control Planes

Why reused coding agent config files can become an unmanaged control layer with security and operational risks.

hardware

SourceJun 27, 20262026-06-27

Financial Recommendations Need Explainability Before Cross-Channel Linking

In financial recommendations, linking anonymous web sessions and logged-in app behavior requires explainability and privacy checks before performance gains.

hardware

SourceJun 27, 20262026-06-27

Learning Motion Feasibility Before Costly Planning in Clutter

A study on filtering infeasible motion attempts in cluttered scenes using point-cloud predictors before sampling-based planning.

agi

SourceJun 27, 20262026-06-27

NVC Constraints Shift LLM Safety Toward De-Escalation Quality

How prompt-level NVC constraints shift LLM safety from toxicity blocking to de-escalation quality, with key tradeoffs.

agi

SourceJun 27, 20262026-06-27

OpenFinGym Reframes How Financial AI Systems Are Evaluated

OpenFinGym shifts financial AI evaluation from single-task accuracy to workflow-level testing across prediction, trading, and risk.

llm

SourceJun 27, 20262026-06-27

SBI Versus MCMC for Rapid Epidemiological Bayesian Inference

A comparison of SBI and MCMC in SECIR epidemiological models, focusing on posterior agreement, speed, and repeated use.

hardware

SourceJun 26, 20262026-06-26

Agent-Driven Iteration Loops for Industrial Recommender Systems

A look at AgentX and the shift from model changes to automating hypothesis, code, experiment, and analysis loops.

hardware

CommunityJun 26, 20262026-06-26

How Agentic AI Redefines Enterprise Coding Metrics Today

Enterprise AI value is shifting from single-response quality to long-running workflow execution and review gates.

k-ai-pulse

RoundupJun 26, 20262026-06-26

AI Resource Roundup (24h) - 2026-06-26

A curated link roundup from recently collected official updates and tech news.

hardware

SourceJun 26, 20262026-06-26

Emotion Vectors in Open LLMs and Behavior Control

Examines whether emotion vectors in open-weight LLMs are internal representations or merely correlated signals for behavior.

hardware

SourceJun 26, 20262026-06-26

HiLSVA Reframes Scientific Visualization Agent Control and Oversight

HiLSVA emphasizes plan-first workflows, human oversight, and provenance over full autonomy in scientific visualization agents.

llm

CommunityJun 26, 20262026-06-26

How Generative AI Makes Money And Why Profitability Debates Persist

A look at how generative AI earns revenue, why infrastructure costs loom large, and how investment and cloud deals shape profitability.

hardware

SourceJun 26, 20262026-06-26

KARLA Rethinks Retrieval During Token Generation for LLMs

KARLA explores retrieving facts during token generation, reframing RAG tradeoffs around noise, latency, cost, and attribution.

llm

CommunityJun 26, 20262026-06-26

Model Release Control or De Facto Permit System

Examines whether government early access and company-gated previews are turning AI model launches into a de facto permit system.

hardware

SourceJun 26, 20262026-06-26

Privacy Risks Shift From Models to Agent Operations

Why LLM agent privacy risks arise from data flows, memory, tools, logs, and delegated permissions in operation.

agi

CommunityJun 26, 20262026-06-26

Read AI Investment News by the Verbs First

AI investment news should be read through official verbs and numbers, not AGI narratives. Build, explore, and assess matter.

agi

SourceJun 26, 20262026-06-26

Rethinking Trust in Video Reasoning Under Visual Corruption

Examines the Blind Trust Problem in video reasoning and a reliability-based strategy for frame and tool selection.

llm

SourceJun 26, 20262026-06-26

Small Text Changes, Big Risks for NLP Guardrails

How meaning-preserving text substitutions can mislead classifiers and LLM guardrails, and what teams should measure first.

agi

SourceJun 26, 20262026-06-26

Temporal Validity Challenges in RAG and Evolving Knowledge

How RAG mixes past and current facts, causing stale-fact errors, and why temporal validity matters in retrieval.

Aionda

Tag: llm

Ontology-Guided KGQA Cuts Noisy Multi-Hop Reasoning Paths

Translating Medical AI Explanations Into Clinical Workflow

What Should LLM Unlearning Actually Remove Precisely

Enforcing Agent Policies Beyond Prompt-Based Safety Guards

Why Benchmarks Miss Much of LLM Performance

Why Agent Configs Need Deterministic Control Planes

Financial Recommendations Need Explainability Before Cross-Channel Linking

Learning Motion Feasibility Before Costly Planning in Clutter

NVC Constraints Shift LLM Safety Toward De-Escalation Quality

OpenFinGym Reframes How Financial AI Systems Are Evaluated

SBI Versus MCMC for Rapid Epidemiological Bayesian Inference

Agent-Driven Iteration Loops for Industrial Recommender Systems

How Agentic AI Redefines Enterprise Coding Metrics Today

AI Resource Roundup (24h) - 2026-06-26

Emotion Vectors in Open LLMs and Behavior Control

HiLSVA Reframes Scientific Visualization Agent Control and Oversight

How Generative AI Makes Money And Why Profitability Debates Persist

KARLA Rethinks Retrieval During Token Generation for LLMs

Model Release Control or De Facto Permit System

Privacy Risks Shift From Models to Agent Operations

Read AI Investment News by the Verbs First

Rethinking Trust in Video Reasoning Under Visual Corruption

Small Text Changes, Big Risks for NLP Guardrails

Temporal Validity Challenges in RAG and Evolving Knowledge