Topic

Tool Use

Function calling, API integration, and tool-augmented model behavior.

4 papers · latest 2026-04-13

Most active fields for this topic

Reasoning & Agents · 4

EE-MCP: Self-Evolving MCP-GUI Agents via Automated Environment Generation and Experience Learning

Tiantian He, Yihang Chen, Keyue Jiang et al.

significant · 🔴 Advanced · Reasoning & Agents · Tool Use · AI Agents

cs.AI

EE-MCP shows how MCP-plus-GUI agents can self-improve by generating environments, synthesizing gap tasks, and accumulating reusable experience, with clear gains across desktop apps.

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Shilin Yan, Jintao Tong, Hongwei Xue et al.

breakthrough · 🔴 Advanced · Reasoning & Agents · AI Agents · Multimodal Understanding

cs.CV · cs.AI

Act Wisely separates task accuracy from tool-efficiency rewards so multimodal agents learn when not to call tools, cutting unnecessary invocations by orders of magnitude while improving accuracy, latency, and cost.
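As a rough illustration of the idea in the summary above, and not the paper's actual formulation, the sketch below keeps a task-accuracy reward and a tool-efficiency reward as two separate terms instead of one blended score. Every name here (`Episode`, `accuracy_reward`, `efficiency_reward`, `needed_tool_calls`) and the particular penalty shape are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Episode:
    """One agent rollout (hypothetical structure, for illustration only)."""
    correct: bool            # did the final answer match the reference?
    tool_calls: int          # tool invocations the agent actually made
    needed_tool_calls: int   # assumed oracle count of calls truly required


def accuracy_reward(ep: Episode) -> float:
    """Task term: depends only on answer correctness."""
    return 1.0 if ep.correct else 0.0


def efficiency_reward(ep: Episode) -> float:
    """Tool term: 1.0 with no surplus calls, shrinking as surplus grows."""
    surplus = max(0, ep.tool_calls - ep.needed_tool_calls)
    return 1.0 / (1.0 + surplus)


def rewards(ep: Episode) -> Tuple[float, float]:
    """Return the two terms separately rather than as one blended scalar."""
    return accuracy_reward(ep), efficiency_reward(ep)


# Two correct rollouts that differ only in how many tools they called.
frugal = Episode(correct=True, tool_calls=0, needed_tool_calls=0)
wasteful = Episode(correct=True, tool_calls=3, needed_tool_calls=0)
print(rewards(frugal))    # (1.0, 1.0)
print(rewards(wasteful))  # (1.0, 0.25)
```

Keeping the terms apart lets a training loop weight or normalize them independently, so a correct but tool-heavy rollout stays distinguishable from a correct and frugal one.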

A Formal Security Framework for MCP-Based AI Agents: Threat Taxonomy, Verification Models, and Defense Mechanisms

Nirajan Acharya, Gaurav Kumar Gupta

breakthrough · 🔴 Advanced · Reasoning & Agents · AI Agents · Tool Use

cs.CR · cs.AI

This paper presents the first formal security framework for MCP-based AI agents, defining a threat taxonomy, verification models, and verifiable defense mechanisms. It is essential reading for builders deploying LLM agents with external tool access in production environments.

Full-Duplex-Bench-v3: Benchmarking Tool Use for Full-Duplex Voice Agents Under Real-World Disfluency

Guan-Ting Lin, Chen Chen, Zhehuai Chen et al.

significant · 🟡 Intermediate · Reasoning & Agents · Tool Use · AI Agents

cs.CL

Voice agents often fail when users stutter, pause, or interrupt, leading to broken API calls and frustrated users. This benchmark uses real human speech to reveal how top models handle such disfluency. It lets developers test whether their voice systems can execute tasks reliably in natural conversation.
