Original research · Logistics

DHL Group.

Last measured · 22 May 2026
Wave · Q2-2026-PILOT
Tier · proprietary
Confidence · C
Brand
Deutsche Post AG
Agent success

The core shipping flow is agent-ready.

Bottleneck · No dominant gap
Reference-grade; the open question is breadth.
6.5 / 10
Agent Success Score
AI Visibility 46 / 100

Found & recommended by AI agents

AI Usability 68 / 100

Search-class agents reached the close state (order-ready) in some runs; the full agent-fleet access profile lands in a later wave.

Coverage · 1 of 6 lanes measured
Commerce lane · Wave Q2-2026-PILOT
Commerce 6.5
Talent
After-sales
Procurement
Investor
Press

This page measures the commerce lane: can an agent find DHL Group and, once it arrives, transact? The talent, after-sales, procurement, investor and press lanes run on different surfaces and are not yet measured.

Your brand isn't measured yet

What does an AI agent do with your website?

The same measurement as DHL Group, free for your domain. Five agent classes, one real task, your score in 48 hours.

Free · no card · 48h

The test

Every agent type. Every run. 0% error.

We asked five kinds of AI agent to price a 5 kg parcel on dhl.de. The simplest one, a plain fetch with no browser, had the answer on the first request: €7.69, sitting in machine-readable structured data. The browser agents ran the franking wizard end to end with no login, and returned €7.69 on all three runs, zero deviation. The only door no agent opened was the final checkout, behind a login.
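To illustrate what "machine-readable structured data" can mean for a plain reader, here is a minimal sketch that pulls a price out of a JSON-LD block without running a browser. The page fragment, the product name and the `extract_offer` helper are hypothetical; the actual dhl.de markup is not reproduced in this report, and the agents tested may read the price differently.

```python
import json
from html.parser import HTMLParser

# Hypothetical page fragment: the real dhl.de markup is not shown in this report.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Product",
 "name": "Example 5 kg parcel",
 "offers": {"@type": "Offer", "price": "7.69", "priceCurrency": "EUR"}}
</script>
</head><body>Price rendered later by JavaScript</body></html>
"""


class JsonLdCollector(HTMLParser):
    """Collects the parsed contents of every application/ld+json script tag."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.blocks.append(json.loads("".join(self._buffer)))


def extract_offer(html):
    """Return (price, currency) from the first JSON-LD block with an offer, else None."""
    collector = JsonLdCollector()
    collector.feed(html)
    for block in collector.blocks:
        offer = block.get("offers") if isinstance(block, dict) else None
        if isinstance(offer, dict) and "price" in offer:
            return offer["price"], offer.get("priceCurrency")
    return None


print(extract_offer(SAMPLE_HTML))  # ('7.69', 'EUR')
```

The point of the sketch is that the price is available in the first HTTP response, so an agent that cannot execute JavaScript or click through a wizard still gets an answer.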

One label, many breeds. From a plain reader to an autonomous operator, the kinds behind ChatGPT, Perplexity, and Claude Code:

Plain reader · reads your raw page text, no browser · Succeeded
Search assistant · finds you through search · Not yet run
Coding agent · a script hitting your site · Succeeded
Computer-use agent · clicks and types like a person · Succeeded
Autonomous operator · runs the whole task unattended · Succeeded

Scope. This is one service. DHL also offers express, freight, international shipping, returns, and business logistics, each with its own pricing structure and booking flow.

Commerce lane

Found, and able to transact?

Two questions, measured separately. A brand can be recommended and still un-buyable, or perfectly buyable and never found.

AI Visibility

When someone asks an agent to ship a parcel, does it route to DHL Group?

46 / 100

DHL Group comes up about four times in ten. The rest of the time, an agent recommends an alternative first.

Discoverability · 18-datapoint audit

AI Usability

Once an agent is on DHL Group's site, can it book the shipment?

68 / 100

Search-class agents surface order-ready in some runs; the full agent-fleet access profile lands in a later wave.

What this does not yet cover

The agent reaches order-ready: product selected, price visible, no login. The actual checkout, cart and payment, sits behind a login wall and was not part of the test. The agentic purchase itself is unproven.

Evidence · 85 / 100

A measure of how provable and consistent the result is. It is grounded in cross-method ground-truth agreement (the methods that ran returned the same price), not in a separate measured run. It is a confidence layer on the two scores above, not a third sales axis.
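A cross-method agreement check of this kind can be sketched as follows. This is an illustration only: it uses the prices reported on this page (one plain fetch and three browser runs, all at €7.69), the method names are assumed labels, and it does not reproduce how the 85 / 100 Evidence value itself is calculated.

```python
from decimal import Decimal

# Prices reported on this page. The two method labels are illustrative assumptions.
observations = {
    "plain_fetch": [Decimal("7.69")],
    "browser": [Decimal("7.69"), Decimal("7.69"), Decimal("7.69")],
}


def agreement(obs):
    """Summarise whether every run of every method returned the same price."""
    prices = [price for runs in obs.values() for price in runs]
    distinct = set(prices)
    return {
        "runs": len(prices),
        "distinct_prices": sorted(distinct),
        "max_deviation": max(prices) - min(prices),
        "all_agree": len(distinct) == 1,
    }


summary = agreement(observations)
print(summary["runs"], summary["max_deviation"], summary["all_agree"])  # 4 0.00 True
```

`Decimal` is used instead of `float` so that currency values compare exactly.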

What's next

What this means for DHL Group.

Diagnosis

Where the task is direct, DHL is already reference-grade on product discovery: agents find the surface, read the price logic, and reach order-ready. The gap isn't completion; it's breadth. Measured: one consumer parcel flow (standard domestic). Open: whether the same quality holds across express, returns, international, and B2B freight.

What changes the outcome

Scale, not repair. Extend the proven surface across express, returns, international, and B2B freight, then re-run the fleet to prove the lead holds across the full logistics category.

What proof looks like

Proof is not that agents can use the site once they reach it. It is that agents select DHL directly across more logistics clusters than the one measured, re-tested the same way each wave.

Audit · €1,900

Commission an audit.

Where the BrandScore opens the question, an audit closes it. An interpretive engagement on your full surface, scored under the same methodology.

Get the audit

Founding · €4,500

Found with us.

Strategic partnership for brands building agent success as a long-term capability, not a one-off engagement.

Apply

Snapshot

Audit an adjacent property.

The BrandScore covers the primary domain. Get the same methodology applied to an adjacent property: a country site, a sub-brand, a category beyond the DAX 40 slate.

Get a Snapshot

This is DHL Group. What about your brand?

You just saw how an AI agent treats a DAX 40 brand.

The same measurement runs free against your domain: five agent classes, one real buying task, your Agent Success Score in 48 hours, in the exact format of this page.

Measured, not guessed. Real agents against your real site, no questionnaire.
Publicly comparable. Your score in the same grid as the measured DAX 40 brands.
Next wave closing. Start now to be in the next index round.


Channel position

No intermediary stands between agents and DHL Group. The gap is being found, not the channel.

Consumer logistics intent reaches dhl.de directly. The remaining intermediary capture sits in B2B freight comparison, a smaller, niche category, and does not materially displace the brand in agent recommendations. Reference architecture for the v3 pilot sample.

85% direct
15% via intermediary

Frozen task slate

Hyperize-selected tasks.

One task from the public sector grid. Task list is frozen before each wave runs.

DHL Paket Inland Standard

Close state
order-ready
Bottleneck
B2B freight-comparison intent retained by intermediaries; consumer tracking is direct.

Fairness note

Q2 2026 audit complete on a single task (Paket Inland Standard), scored under the public Task Selection Doctrine. AI Visibility was re-audited on 23 May 2026 on an unbranded informational probe (46.04, refining the first re-baseline of 48.75 from 22 May 2026), so DHL sits on the same unbranded discovery basis as the rest of the index. Fairness Review pending the sector fairness grid.

Methodology

How the score was produced.

Discoverability is audit-pipeline-derived. 3-provider sample (openai, perplexity, anthropic), 3 query variants per task, 2 runs per variant · 18 valid datapoints scored against a five-state handoff cascade. [S1]
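The 18 datapoints follow from the sampling grid: 3 providers × 3 query variants × 2 runs. A minimal sketch of that enumeration, assuming placeholder variant labels (the actual query wording is not published here):

```python
from itertools import product

providers = ["openai", "perplexity", "anthropic"]   # named in the methodology
variants = ["variant_1", "variant_2", "variant_3"]  # placeholder labels, assumed
runs = [1, 2]                                       # two runs per variant

# One datapoint per (provider, variant, run) combination.
datapoints = list(product(providers, variants, runs))
print(len(datapoints))  # 18
```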

AI Usability is derived from the access-profile above (usability-derivation/v1): how far the best agent reached (close state) modulated by how many agent classes succeeded. The per-class profile is the truth; the score is a reproducible summary of it, not a separate rating. Fleet phases (HTTP / Coding / Browser / ACT) produce the profile. [S2]

Formula

Agent Success Score = (AI Visibility × 0.20) + (AI Usability × 0.70) + (Evidence × 0.10)

On a 0–100 scale, displayed 0–10. AI Usability bundles the agent's reach + completion; AI Visibility is audit-derived discoverability. Weighting is public; the per-prompt derivation is not.
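As a worked check, the published weights can be applied to the values on this page. The sketch below uses AI Usability 68 and Evidence 85 with both AI Visibility readings mentioned in the text (46.04 and the earlier 48.75). The function names are illustrative, and rounding the displayed score to one decimal is an assumption.

```python
WEIGHTS = {"visibility": 0.20, "usability": 0.70, "evidence": 0.10}


def agent_success_score(visibility, usability, evidence):
    """Weighted composite on the 0-100 scale, using the published weights."""
    return (visibility * WEIGHTS["visibility"]
            + usability * WEIGHTS["usability"]
            + evidence * WEIGHTS["evidence"])


def displayed(score_100):
    """Rescale 0-100 to 0-10; one-decimal rounding is an assumption."""
    return round(score_100 / 10, 1)


current = agent_success_score(46.04, 68, 85)  # 9.208 + 47.6 + 8.5
earlier = agent_success_score(48.75, 68, 85)  # 9.75 + 47.6 + 8.5

print(round(current, 2), displayed(current))  # 65.31 6.5
print(round(earlier, 2))                      # 65.85
```

The first result matches the 6.5 / 10 shown at the top of the page; the second matches the composite of 65.85 recorded in the 22 May 2026 history entry.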

Measurement scope

Confidence C · one measured task on a 3-provider track (openai, perplexity, anthropic). Confidence promotes to B with a second task plus a fourth provider on the next wave.

History

Measurement timeline.

Each wave appends; nothing overwrites. Frozen Wave Rule.

  1. Entry · 01

    15 May 2026

    Wave · a fleet wave
    Protocol · v1-hand-assessment

    First-pass hand assessment from the fleet-wave era. Superseded by the v3 audit-derived measurement below.

  2. Entry · 02

    18 May 2026

    Wave · WAVE-Q2-2026-PILOT
    Protocol · ars-methodology/v1.1

    v3 audit-derived AI Visibility (56.39, 18/18 valid datapoints across 3 providers). AI Usability from the Gate-2 phases of a fleet wave: HTTP/Coding success, Browser 3/3 at €7.69, ACT order-ready.

  3. Entry · 03

    22 May 2026

    Wave · WAVE-Q2-2026-PILOT
    Protocol · ars-methodology/v1.1

    AI Visibility re-audited on an unbranded informational probe (48.75, down from 56.39, which was measured with a branded informational probe that inflated discovery). This brings DHL onto the same unbranded basis as the rest of the index. AI Usability unchanged (core franking flow agent-ready). Composite recomputed two-axis to 65.85; the retired hand-C/A (100/100) and the 4-dimension composite (89.78) were removed. Confidence C, single task.

Sources

Evidence and provenance.

Public methodology references and internal evidence pointers behind every claim above.

  1. [S1]

    Gate-1 audit run · DHL Wave Q2 2026 (unbranded re-audit)

    Accessed · 22 May 2026

    Internal · Hyperize evidence

    • AI Visibility score (audit-derived, 18/18 valid datapoints, unbranded informational probe)
    • AI platforms queried (openai/perplexity/anthropic, 3-provider track)
    • Close state reached (order_ready)
  2. [S2]

    Hyperize fleet · a fleet wave Gate-2 phases (HTTP/Coding/Browser 3/3 + ACT; source for C/A/E)

    Accessed · 16 May 2026

    Internal · Hyperize evidence

    • AI Usability and Evidence inputs
    • Fleet test outcomes (legacy)
  3. [S3]

    a fleet wave act phase (Giorgio repo, legacy reference)

    Accessed · 18 May 2026

    Internal · Hyperize evidence

    • Act phase verification (status=success, 3/3 steps)
  4. Accessed · 18 May 2026

    Public · hyperize.ai

    • Fairness declaration
    • Third-Party Interception framing

Last updated · 22 May 2026
Next review · 30 Sept 2026
Wave · Q2-2026-PILOT
Tier · proprietary
Confidence · C
Index score · 6.5/10
Machine-readable record

Editorial coverage

The DAX 40 Agent Success Index is a point-in-time snapshot of the agent-success of public digital touchpoints. Results are not statements about product quality, company performance, service quality, or the legal obligations of the brands named. Brand names and logos remain the property of their respective owners and are used solely for identification and reporting purposes in the context of editorial coverage (§ 23 MarkenG, Art. 5 GG).

Brands wishing to respond, engage, or correct a factual error may contact hello@hyperize.ai. Responses received are published in full alongside the findings. Full methodology and editorial-coverage notice: coverage statement.