DHL Group.
The core shipping flow is agent-ready.
Found & recommended by AI agents
Search-class agents reached the close state in some runs; the full agent-fleet access profile lands in a later wave.
This page measures the commerce lane: can an agent find DHL Group and, once it arrives, transact? The talent, after-sales, procurement, investor and press lanes run on different surfaces and are not yet measured.
Your brand isn't measured yet
What does an AI agent do with your website?
The same measurement as DHL Group, free for your domain. Five agent classes, one real task, your score in 48 hours.
Every agent type. Every run. 0% error.
We asked five kinds of AI agent to price a 5 kg parcel on dhl.de. The simplest one, a plain fetch with no browser, had the answer on the first request: €7.69, sitting in machine-readable structured data. The browser agents ran the franking wizard end to end with no login, and returned €7.69 on all three runs, zero deviation. The only door no agent opened was the final checkout, behind a login.
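To make "machine-readable structured data" concrete, here is a minimal sketch of how a plain-fetch agent might read a price from a page without a browser. It assumes the price is published as a schema.org `Offer` inside a JSON-LD script tag. The HTML fragment, the product name and the helper names below are hypothetical stand-ins, not a copy of the dhl.de markup, and the sketch parses a local string instead of making a network request.

```python
import json
from html.parser import HTMLParser

# Hypothetical page fragment; the real dhl.de markup is not reproduced here.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Product",
 "name": "Parcel up to 5 kg",
 "offers": {"@type": "Offer", "price": "7.69", "priceCurrency": "EUR"}}
</script>
</head><body></body></html>
"""


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def extract_offer(html):
    """Return (price, currency) from the first JSON-LD block carrying an offer, else None."""
    collector = JsonLdCollector()
    collector.feed(html)
    for raw in collector.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed blocks instead of failing the whole page
        offer = data.get("offers") if isinstance(data, dict) else None
        if isinstance(offer, dict) and "price" in offer:
            return offer["price"], offer.get("priceCurrency")
    return None


print(extract_offer(SAMPLE_HTML))
```

The point of the sketch is that no rendering, clicking or login is involved: one response body and a JSON parse are enough when the price is exposed this way.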
One label, many breeds. From a plain reader to an autonomous operator, the kinds behind ChatGPT, Perplexity, and Claude Code:
Scope. This is one service. DHL offers express, freight, international shipping, returns, and business logistics. Each has its own pricing structure and booking flow.
Found, and able to transact?
Two questions, measured separately. A brand can be recommended and still un-buyable, or perfectly buyable and never found.
When someone asks an agent to ship a parcel, does it route to DHL Group?
DHL Group comes up about four times in ten. The rest of the time, an agent recommends an alternative first.
Discoverability · 18-datapoint audit
Once an agent is on DHL Group's site, can it book the shipment?
Search-class agents reach the order-ready state in some runs; the full agent-fleet access profile lands in a later wave.
What this does not yet cover
The agent reaches order-ready: product selected, price visible, no login. The actual checkout, cart and payment, sits behind a login wall and was not part of the test. The agentic purchase itself is unproven.
Evidence · 85 / 100 A measure of how provable and consistent the result is, grounded in cross-method ground-truth agreement (the methods that ran returned the same price), not a separate measured run. A confidence layer on the two scores above, not a third sales axis.
What this means for DHL Group.
On product discovery, DHL is already reference-grade where the task is direct: agents find the surface, read the price logic, and reach order-ready. The gap isn't completion; it's breadth. Measured: one consumer parcel flow (standard domestic). Open: whether the same quality holds across express, returns, international, and B2B freight.
Scale, not repair. Extend the proven surface across express, returns, international, and B2B freight, then re-run the fleet to prove the lead holds across the full logistics category.
The proof isn't usage once reached. It's DHL selected directly across more logistics clusters than the one measured, re-tested the same way each wave.
Commission an audit.
Where the BrandScore opens the question, an audit closes it. An interpretive engagement on your full surface, scored under the same methodology.
Get the audit
Found with us.
Strategic partnership for brands building agent success as a long-term capability, not a one-off engagement.
Apply
Audit an adjacent property.
The BrandScore covers the primary domain. Get the same methodology applied to an adjacent property: a country site, a sub-brand, a category beyond the DAX-40 slate.
Get a Snapshot
This is DHL Group. What about your brand?
You just saw how an AI agent treats a DAX 40 brand.
The same measurement runs free against your domain: five agent classes, one real buying task, your Agent Success Score in 48 hours, in the exact format of this page.
No intermediary stands between agents and DHL Group. The gap is being found, not the channel.
Consumer logistics intent reaches dhl.de directly. The remaining intermediary capture sits in B2B freight comparison, a smaller, niche category, and does not materially displace the brand in agent recommendations. Reference architecture for the v3 pilot sample.
Hyperize-selected tasks.
One task from the public sector grid. Task list is frozen before each wave runs.
DHL Paket Inland Standard
- Close state: order-ready
- Bottleneck: B2B freight-comparison intent retained by intermediaries; consumer tracking is direct.
Fairness note
Q2 2026 audit complete on a single task (Paket Inland Standard), scored under the public Task Selection Doctrine. AI Visibility re-audited 2026-05-23 on an unbranded informational probe (46.04, refining the 2026-05-22 first re-baseline of 48.75) so DHL sits on the same unbranded discovery basis as the rest of the index. Fairness Review pending the sector fairness grid.
How the score was produced.
Discoverability is audit-pipeline-derived. 3-provider sample (openai, perplexity, anthropic), 3 query variants per task, 2 runs per variant · 18 valid datapoints scored against a five-state handoff cascade. [S1]
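The 18 datapoints follow from the sampling grid described above: 3 providers × 3 query variants × 2 runs. A minimal sketch of that enumeration is shown here; the variant labels are placeholders, since the actual query wording is not published on this page, and the five-state cascade used for scoring is not modelled.

```python
from itertools import product

PROVIDERS = ["openai", "perplexity", "anthropic"]
# Placeholder labels: the real query variants are not listed in the source.
QUERY_VARIANTS = ["variant-1", "variant-2", "variant-3"]
RUNS_PER_VARIANT = 2


def datapoint_grid():
    """Enumerate one record per (provider, query variant, run) combination."""
    runs = range(1, RUNS_PER_VARIANT + 1)
    return [
        {"provider": provider, "variant": variant, "run": run}
        for provider, variant, run in product(PROVIDERS, QUERY_VARIANTS, runs)
    ]


grid = datapoint_grid()
print(len(grid))  # 3 * 3 * 2 = 18
```

A result of "18/18 valid datapoints" then means every cell of this grid returned a usable answer.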
AI Usability is derived from the access-profile above (usability-derivation/v1): how far the best agent reached (close state) modulated by how many agent classes succeeded. The per-class profile is the truth; the score is a reproducible summary of it, not a separate rating. Fleet phases (HTTP / Coding / Browser / ACT) produce the profile. [S2]
Agent Success Score = (AI Visibility × 0.20) + (AI Usability × 0.70) + (Evidence × 0.10)
On a 0–100 scale, displayed 0–10. AI Usability bundles the agent's reach + completion; AI Visibility is audit-derived discoverability. Weighting is public; the per-prompt derivation is not.
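The weighting above can be written out as a small function. This is a sketch under stated assumptions: the AI Visibility (48.75) and Evidence (85) inputs are taken from this page, but the AI Usability value is not published in this section, so the one used below is purely illustrative and the output is not expected to reproduce the published composite. The 0–10 display is assumed to be a plain division by 10; the page does not specify rounding.

```python
# Weights as published in the formula above.
WEIGHTS = {"visibility": 0.20, "usability": 0.70, "evidence": 0.10}


def agent_success_score(visibility: float, usability: float, evidence: float) -> float:
    """Weighted sum of the three inputs, each expected on a 0-100 scale."""
    inputs = {"visibility": visibility, "usability": usability, "evidence": evidence}
    for name, value in inputs.items():
        if not 0 <= value <= 100:
            raise ValueError(f"{name} must be on the 0-100 scale, got {value}")
    return sum(WEIGHTS[name] * value for name, value in inputs.items())


def displayed(score: float) -> float:
    """Assumption: the 0-10 display is a linear rescale of the 0-100 score."""
    return score / 10


# Hypothetical usability input, chosen only to show the arithmetic.
ILLUSTRATIVE_USABILITY = 60.0

score = agent_success_score(
    visibility=48.75, usability=ILLUSTRATIVE_USABILITY, evidence=85.0
)
print(round(score, 2), round(displayed(score), 2))
```

Because AI Usability carries 70% of the weight, a ten-point change in it moves the composite by seven points, while the same change in AI Visibility moves it by two.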
Measurement scope
Confidence C · one measured task on a 3-provider track (openai, perplexity, anthropic). Confidence promotes to B with a second task plus a fourth provider on the next wave.
Measurement timeline.
Each wave appends; nothing overwrites. Frozen Wave Rule.
- Entry · 01
15 May 2026
Wave · Protocol: a fleet wave · v1-hand-assessment
First-pass hand assessment from the fleet-wave era. Superseded by the v3 audit-derived measurement below.
- Entry · 02
18 May 2026
Wave · Protocol: WAVE-Q2-2026-PILOT · ars-methodology/v1.1
v3 audit-derived AI Visibility (56.39, 18/18 valid datapoints across 3 providers). AI Usability from a fleet wave Gate-2 phases: HTTP/Coding success, Browser 3/3 at €7.69, ACT order-ready.
- Entry · 03
22 May 2026
Wave · Protocol: WAVE-Q2-2026-PILOT · ars-methodology/v1.1
AI Visibility re-audited on an unbranded informational probe (48.75; previously 56.39 with a branded informational probe that inflated discovery). This brings DHL onto the same unbranded basis as the rest of the index. AI Usability unchanged (core franking flow agent-ready). Composite recomputed two-axis to 65.85; the retired hand-C/A (100/100) and the 4-dimension composite (89.78) were removed. Confidence C, single task.
Evidence and provenance.
Public methodology references and internal evidence pointers behind every claim above.
- [S1] Accessed · 22 May 2026
Gate-1 audit run · DHL Wave Q2 2026 (unbranded re-audit)
Internal · Hyperize evidence
  - AI Visibility score (audit-derived, 18/18 valid datapoints, unbranded informational probe)
  - AI platforms queried (openai/perplexity/anthropic, 3-provider track)
  - the close state reached (order_ready)
- [S2] Accessed · 16 May 2026
Hyperize fleet · a fleet wave Gate-2 phases (HTTP/Coding/Browser 3/3 + ACT; source for C/A/E)
Internal · Hyperize evidence
  - AI Usability and Evidence inputs
  - fleet test outcomes (legacy)
- [S3] Accessed · 18 May 2026
a fleet wave act phase (Giorgio repo, legacy reference)
Internal · Hyperize evidence
  - act phase verification (status=success, 3/3 steps)
- Accessed · 18 May 2026
Public · hyperize.ai
  - fairness declaration
  - Third-Party Interception framing
Read the doctrine. Challenge the score. Extend the slate.
Task Selection.
The fairness doctrine behind the slate above. Five failure modes, six criteria, public before each wave.
Read
Challenge
Disagree with this score.
Send evidence under public Fairness Review. Failed reviews are documented with the named failure mode.
Challenge
Extend
Submit your own task.
Open Surface Run · additive measurement. The Hyperize-selected slate stays frozen; your task gets the same methodology.
Submit
Editorial coverage
The DAX 40 Agent Success Index is a point-in-time snapshot of the agent-success of public digital touchpoints. Results are not statements about product quality, company performance, service quality, or the legal obligations of the brands named. Brand names and logos remain the property of their respective owners and are used solely for identification and reporting purposes in the context of editorial coverage (§ 23 MarkenG, Art. 5 GG).
Brands wishing to respond, engage, or correct a factual error may contact hello@hyperize.ai. Responses received are published in full alongside the findings. Full methodology and editorial-coverage notice: coverage statement.