Retail Vendor Scorecard

What this is

The vendor performance scorecard: the dimensions, weighting model, and period-based measurement framework that translates vendor compliance data into a composite score used for purchase allocation, renegotiation, and rationalization decisions.

Purpose

The scorecard converts raw compliance events — a short shipment here, a late ASN there — into a structured performance record that buyers can act on. Without a scorecard, vendor relationship decisions are made on relationships and gut feel. With one, they're made on evidence. The scorecard also defines the chargeback trigger thresholds: poor scorecard performance in a given dimension automatically generates a chargeback at the established administrative fee rate.

Structure

Measurement period — Scorecard runs by month, season, and year. Purchases must be linked to receipts that may cross period boundaries; the scoring methodology must account for this lag.

Core performance dimensions:

Dimension	What it measures
Fill rate	Units shipped ÷ units ordered, by PO
On-time delivery	% of deliveries within agreed delivery window
ASN accuracy	% of ASNs that match the physical receipt (quantity, item, UPC)
EDI compliance	% of EDI transactions (850, 855, 856, 810) transmitted correctly and on time
Invoice accuracy	% of invoices matching PO cost and terms without adjustment
Product quality	% of units received meeting acceptance criteria (defect rate below threshold)
Carton/marking compliance	% of cartons meeting packing, marking, and unit-load specifications
RTV compliance	% of RTV requests resolved within agreed credit timeline

Weighting factors — Each dimension is assigned a relative weight that sums to 100. Weights must be established before any evaluation period begins, agreed with vendor leadership. A retailer with a fragile supply chain might weight fill rate and on-time delivery heavily; one with quality issues might weight acceptance criteria.

Composite score — Weighted average of dimension scores produces a single vendor score per period. Composite score drives: (1) purchase allocation — high-scoring vendors receive preferred buying; (2) negotiation posture — score becomes the buyer's opening data point; (3) rationalization candidate list — vendors below a defined threshold for N consecutive periods are flagged for rationalization review.

Chargeback triggers — Each compliance dimension has a defined threshold below which automatic chargebacks are generated. The chargeback matrix specifies the administrative fee rate and occurrence escalation logic. The scorecard is the source data; the chargeback matrix is the enforcement mechanism.

Consumers

Buyers use the scorecard in negotiation sessions and purchase allocation decisions. The Operations Agent monitors scorecard trends and alerts when a vendor approaches a rationalization threshold or a chargeback-trigger threshold. The smart contract layer, for smart-contract-native vendors, encodes scorecard thresholds as on-chain conditions: falling below threshold automatically triggers the contractual consequence without human intervention.

Invariants

Scorecard dimensions and weights must be agreed with the vendor before the measurement period begins. Retroactive reweighting is not permitted.
The scorecard must be run on a consistent schedule — monthly at minimum, with seasonal and annual aggregates. Irregular scoring destroys its credibility in negotiation.
Purchases linked to cross-period receipts must use a defined methodology (receipt date or PO close date) applied consistently. The methodology must be documented.

Platform (2030)

Agent mandate: Operations Agent computes scorecard dimensions continuously — not at period close. Business Agent reads the current scorecard composite in vendor negotiation preparation. Neither agent makes chargeback decisions based on the scorecard; the chargeback matrix and the smart contract engine execute those automatically.

Live scorecard from live contract state. Traditional scorecards are period-close computations: query transaction history at month-end, produce a report. In the Canary Go model, the vendor's smart contract IS the scorecard. Each compliance event submitted to the contract updates the relevant dimension counter in real time. Operations Agent reads contract state continuously — the scorecard composite is always current, not a snapshot. The vendor can query their own contract state at any time to see their current performance, eliminating the "we didn't know we were below threshold" dispute category.

Exception-first alerting. Operations Agent does not generate periodic scorecard reports. It monitors the trend on each dimension and alerts only when: (1) a dimension composite crosses below the chargeback trigger threshold; (2) the rate of decline suggests threshold breach within N periods; (3) the overall composite drops below the rationalization candidate threshold. Alerts surface to the merchant dashboard with one-click context. Normal performance generates no noise.

MCP surface. scorecard(vendor_id) returns current composite score and all dimension scores in a single call. scorecard_trend(vendor_id, n_periods) returns composite and dimension trend data — the input for negotiation preparation. rationalization_candidates() returns vendors below composite threshold with dimension breakdown. Low-bandwidth, agent-optimized.

RaaS. Scorecard inputs — fill rate, on-time delivery, ASN accuracy — are derived from receipt events (receiving confirmations, PO completions, ASN matches). The scorecard is a computed view over an append-only event log, not a mutable record. Sequence matters: a fill rate calculation that uses out-of-order receipt events produces a wrong numerator. scorecard(vendor_id) should be a pre-aggregated view, not a real-time join over raw events — at scale (many vendors × many periods × many line items) real-time join cost is prohibitive. Scorecard history exportable by period for vendor negotiation and rationalization analysis.

retail-vendor-compliance-standards — the clause matrix that defines what is being measured
retail-vendor-lifecycle — the lifecycle stage that triggers scorecard evaluation
retail-chargeback-matrix — translates scorecard dimension failures into financial consequences
retail-three-way-match — the receipt-side data source for fill rate and ASN accuracy