Question 1

Why is tool use the highest-risk AI agent capability?

Accepted Answer

Tool use converts what would be a harmful text response into a harmful real-world action. Without tools, a misdirected agent produces bad output that a human reads and can reject. With tools, a misdirected agent can delete files, send emails, exfiltrate data, query databases, and execute code before any human review. The attack surface scales with the number of tools: each tool is a capability an attacker can try to redirect. An agent with ten tools has ten times the attack surface of an agent with one tool, and the interactions between tools multiply the possible harmful actions further.

Question 2

What is tool poisoning in AI agents?

Accepted Answer

Tool poisoning is an attack where the tool's description or metadata is modified to change how the agent behaves when it uses that tool. An agent selects which tool to call and how to call it based largely on tool descriptions. If an attacker can modify a tool description, they can cause the agent to misuse the tool or to call it in ways the operator did not intend. In the context of MCP servers, a malicious MCP server can advertise tools with descriptions that contain hidden instructions, causing any agent that loads that server to behave differently.

Question 3

What is tool shadowing?

Accepted Answer

Tool shadowing is an attack where a malicious tool is registered with a name or description similar to a legitimate tool. When the agent needs to call the legitimate tool, it calls the malicious shadow instead. The malicious tool may perform the same action as the legitimate tool (to avoid detection) while also performing an additional malicious action, or it may silently substitute a different action entirely. Tool shadowing is most dangerous in MCP environments where multiple servers can be loaded simultaneously and tool names can collide or overlap.

Question 4

What is a confused deputy attack in AI agents?

Accepted Answer

A confused deputy attack is when an entity with high authority (the agent) is manipulated into acting on behalf of a less trusted party. In AI agents, the agent typically runs with the authority of the deployment context, which may include access to databases, file systems, and APIs that the user would not have direct access to. When an attacker manipulates the agent through prompt injection or malicious tool output, the agent uses its elevated authority to perform actions the attacker could not perform directly. The agent becomes a confused deputy: it acts with authority it legitimately has, but in the service of an attacker rather than its operator.

Question 5

What is the MCP attack surface?

Accepted Answer

The Model Context Protocol (MCP) is an open standard for connecting AI agents to tools and data sources. Its attack surface includes: server-side tool poisoning where a malicious MCP server advertises tools with descriptions containing hidden instructions, cross-context tool invocation where one MCP server can trigger actions on another server's tools, supply chain attacks on MCP server packages distributed through registries, and tool name collisions where multiple loaded servers register tools with identical or similar names and the agent calls the wrong one. MCP increases tool-use capability significantly but also increases the attack surface proportionally.

Question 6

What is SSRF in the context of AI agents?

Accepted Answer

Server-Side Request Forgery (SSRF) via AI agent HTTP tools occurs when an attacker redirects an agent's HTTP tool to make requests to internal network addresses that the attacker could not reach directly. The agent typically runs inside a private network with access to internal services. If an attacker can control what URL the agent fetches, they can read internal service responses, probe internal network topology, and in some cases interact with internal services that have no public authentication. AgentIQ's network_security policy defends against this by blocking requests to localhost, 127.0.0.1, 192.168.*, and 10.0.* address ranges.

Question 7

How do tool_call policies work in the AgentIQ Policy DSL?

Accepted Answer

Tool call policies use the tool_call resource in the Mirror Policy DSL. A deny rule blocks when the condition is true: deny tool_call where function.name == 'dangerous_function' blocks any call to that function. Rules can combine function name and argument checks: deny tool_call where function.name == 'http_request' && contains(function.arguments, 'localhost') blocks HTTP requests to localhost. Field access uses dot notation: function.name, function.arguments, function.arguments.url. The allow rule creates exceptions to a preceding deny: deny tool_call where true followed by allow rules creates an allowlist. Use C-style operators (&&, ||, !) not Python-style operators.

Question 8

What does the file_security policy protect against?

Accepted Answer

The file_security policy blocks read_file tool calls that target sensitive paths: /etc/ (system configuration), .ssh/ (SSH keys and config), and .env (environment files containing secrets and API keys). It adds an explicit allow for /tmp/ to demonstrate the allowlist pattern. This policy prevents a redirected agent from reading system credentials, SSH private keys, or application secrets even if it has legitimate file-reading capability for other purposes. The policy uses the starts_with and contains built-in functions from the Mirror Policy DSL.

Question 9

What does the sql_security policy protect against?

Accepted Answer

The sql_security policy blocks execute_sql tool calls whose arguments contain classic SQL injection patterns: OR 1=1 (always-true condition for bypassing WHERE clauses), UNION SELECT (data extraction via union injection), DROP TABLE (destructive DDL), DELETE FROM (mass deletion), and -- (SQL comment for truncating queries). The icontains function makes checks case-insensitive. This defends against both direct SQL injection attacks and indirect injection where an attacker has embedded SQL fragments in content the agent retrieves and passes to a database tool.

Question 10

What does the network_security policy protect against?

Accepted Answer

The network_security policy prevents SSRF (Server-Side Request Forgery) by blocking http_request tool calls to internal network addresses: localhost, 127.0.0.1 (loopback), 192.168.* (private Class C), and 10.0.* (private Class A). It adds an allow rule for URLs starting with https:// to demonstrate safe-by-default network access. This protects internal services that are accessible to the agent's network context but should not be reachable via user-controlled or attacker-injected URLs.

Question 11

How do tool output checks work in AgentIQ?

Accepted Answer

Tool output checks use the tool_output resource in the Mirror Policy DSL. The rule deny tool_output where detect_pii(tool_output.content) == true blocks tool results that contain PII before they enter the agent's context window. This prevents a compromised tool from leaking personal data into the agent's context, which could then be included in agent responses or used in subsequent tool calls. Tool output checks run after the tool executes but before the result becomes visible to the model.

Question 12

What is the allowlist pattern for tool call policies?

Accepted Answer

The allowlist pattern starts with a blanket deny then adds specific allow rules: deny tool_call where true blocks all tool calls, then allow tool_call where function.name == 'safe_function' permits specific tools. This is the most restrictive and most secure pattern: the agent can only call tools explicitly approved. The denylist pattern is the inverse: specific deny rules block known-bad tool calls while everything else is allowed. For production systems handling sensitive data, the allowlist pattern is preferred because unknown tools are blocked by default.

Question 13

How does policy composition work for tool security?

Accepted Answer

The chain construct in the Mirror Policy DSL groups multiple policies that evaluate in sequence. A complete tool security chain might include an input layer (blocking injection in user messages), a tool layer (blocking dangerous tool calls and argument patterns), and an output layer (blocking PII in tool results). Each policy in the chain evaluates independently: a message blocked by the input layer does not proceed to the tool layer. The production_security pre-built policy uses this chain pattern to provide comprehensive coverage across all three layers.

Concept	Syntax	Example
Block tool call	`deny tool_call where [condition];`	`deny tool_call where function.name == "exec";`
Allow exception	`allow tool_call where [condition];`	`allow tool_call where function.name == "read_log";`
Block tool output	`deny tool_output where [condition];`	`deny tool_output where detect_pii(tool_output.content) == true;`
Function name	`function.name`	`function.name == "execute_sql"`
Arguments (flat)	`function.arguments`	`contains(function.arguments, "localhost")`
Arguments (nested)	`function.arguments.field`	`function.arguments.url`
Contains (case-sensitive)	`contains(text, substring)`	`contains(function.arguments, ".ssh/")`
Contains (case-insensitive)	`icontains(text, substring)`	`icontains(function.arguments, "DROP TABLE")`
Starts with	`starts_with(text, prefix)`	`starts_with(function.arguments, "/etc/")`
Ends with	ends_with(text, suffix)	`ends_with(function.arguments.url, ".company.com")`
AND	`&&` (NOT "and")	`function.name == "read_file" && contains(...)`
OR	`\|\|` (NOT "or")	`icontains(..., "DROP") \|\| icontains(..., "DELETE")`
NOT	`!` (NOT "not")	`!ends_with(function.arguments.url, ".company.com")`

Tool Use & MCP Security

Why tool use is the highest-risk agent capability

Tool attack taxonomy

Tool poisoning and tool shadowing

Confused deputy attacks

MCP attack surface

SSRF via HTTP tools

Writing tool_call policies

The three pre-built tool policies

Production tool security checklist

Runtime tool call policies for production AI agents