An agentic AI that can send email, edit a CRM, move money, or call a shell is a security surface, not a chatbot. The blast radius of a jailbreak or prompt injection is multiplied by whatever tools you wired up. This is the four-layer pattern we apply on every agent build.
Layer 1 · Capability scoping
The agent gets the minimum set of tools it needs. Not 'send_email' in general · 'send_email_to_customer', where 'customer' is scoped to the current user's account. Design each tool as narrowly as possible; any tool that could 'do anything' is a design error.
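A minimal sketch of a narrowly scoped tool, assuming a hypothetical `UserCtx` that carries the customers belonging to the current user's account · the model supplies a customer ID, never a raw email address:

```python
from dataclasses import dataclass

@dataclass
class UserCtx:
    user_id: str
    account_customers: dict[str, str]  # customer_id -> email, from the user's own account

def send_email_to_customer(customer_id: str, body: str, ctx: UserCtx) -> str:
    """Narrow tool: the recipient is resolved server-side from the
    caller's account, so the model cannot target arbitrary addresses."""
    if customer_id not in ctx.account_customers:
        raise PermissionError(f"customer {customer_id} not in this account")
    recipient = ctx.account_customers[customer_id]
    # deliver(...) would be the real mail transport; stubbed here
    return f"queued mail to {recipient}"
```

The scoping lives in the lookup: even a fully hijacked prompt can only reach recipients already attached to the account.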
Layer 2 · User-backed authorization
Every tool wrapper re-authorizes the current user. The LLM can call the tool; the tool checks 'is this user allowed to do X to Y'. Treat the LLM as an untrusted caller · always.
def safe_send_email(to: str, body: str, ctx: UserCtx):
    if to not in ctx.allowed_recipients:
        raise PermissionError(f"recipient {to} not authorized")
    if len(body) > MAX_BODY:
        raise ValueError("body too long")
    audit_log(ctx.user_id, "send_email", to, hash(body))
    # 'from' is a reserved word in Python, so the sender goes in as 'sender'
    return email.send(to=to, body=body, sender=ctx.user_email)

Layer 3 · Per-call audit logging
Log every tool call · who, what, when, why (the prompt that triggered it), how much (tokens, dollars). Keep at least 90-day retention for compliance and debugging. You will need this log during your first incident; discovering then that it doesn't exist is the worst-case day.
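A sketch of what one audit record could look like · the field names and the `audit_log` signature are assumptions, not a fixed schema. Hashing the prompt instead of storing it raw is one option when prompts may contain PII:

```python
import hashlib
import json
import time

def audit_log(user_id: str, tool: str, target: str, prompt: str,
              tokens: int, cost_usd: float) -> dict:
    """One record per tool call: who, what, when, why, how much.
    The prompt is stored as a SHA-256 digest in this sketch."""
    entry = {
        "ts": time.time(),                    # when
        "user_id": user_id,                   # who
        "tool": tool,                         # what
        "target": target,                     # what it acted on
        "prompt_sha256": hashlib.sha256(prompt.encode()).hexdigest(),  # why
        "tokens": tokens,                     # how much (compute)
        "cost_usd": cost_usd,                 # how much (money)
    }
    # a real system would ship this to an append-only, durable sink
    print(json.dumps(entry))
    return entry
```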
Layer 4 · Circuit breakers
Rate-limit per user. Rate-limit per tool. Add a global kill switch for misbehaviour. When the MTTR (mean time to revoke) of a compromised agent is under 10 minutes, the damage budget is bounded. When it's an hour, it isn't.
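One way to sketch both pieces together · a sliding-window rate limit keyed by (user, tool), plus a kill flag that denies everything at once. The class and its fields are illustrative, not a prescribed API:

```python
import time
from collections import defaultdict, deque

class CircuitBreaker:
    """Per-user, per-tool sliding-window rate limit plus a global kill switch."""

    def __init__(self, max_calls: int, window_s: float):
        self.max_calls = max_calls
        self.window_s = window_s
        self.calls = defaultdict(deque)  # (user_id, tool) -> call timestamps
        self.killed = False              # flip True to revoke every agent at once

    def allow(self, user_id: str, tool: str) -> bool:
        if self.killed:
            return False
        now = time.monotonic()
        q = self.calls[(user_id, tool)]
        # drop timestamps that have aged out of the window
        while q and now - q[0] > self.window_s:
            q.popleft()
        if len(q) >= self.max_calls:
            return False
        q.append(now)
        return True
```

Every tool wrapper calls `allow(...)` before doing anything; the kill switch is a single boolean, which is what keeps time-to-revoke in minutes rather than hours.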
If you can't answer 'what can the agent do that would materially hurt the business?' in under 60 seconds, you haven't scoped the tools enough. Start by reducing the surface, not by adding guardrails to a broad one.

By
Dezső Mező
Founder, DField Solutions
I've shipped production products from fintech to creator-tooling · for startups and enterprises, from Budapest to San Francisco.