Accuracy, published
These are the real numbers from Renzy's scanner against an open dataset. No marketing — just MAE, MAPE and RMSE refreshed weekly.
MAPE on kcal
97.8%
Mean absolute percentage error on total calories.
MAE on kcal
136
Mean absolute error. Lower is better.
MAPE on grams
3.9%
Portion estimation is the bottleneck.
How it's computed
Each dataset entry is a real-plate photo with weight measured on a kitchen scale and calories reconstructed from USDA. We run the production pipeline exactly like a user scan — no shortcuts, no extra context, no cache.
- Open dataset: each photo + ground-truth is documented at /api/admin/accuracy-dataset.
- Identical pipeline to production: vision → garnish filter → sauce + brand override → scale anchor → critique → USDA → confirm drink.
- Standard metrics: MAPE, MAE, RMSE. No outlier filtering — the worst cases also appear on this page.
- Re-run weekly automatically. If a model iteration regresses, you'll see it here before we do.
Error distribution
From the dataset's plates, this is the distribution of percent error on kcal. The tighter the bars cluster on the left, the better.
The 5 worst cases
What we got most wrong in this batch. Publishing them forces us to fix them in the next iteration.
Coca-Cola lata 330ml
Truth: 139 kcal · Predicted: 275 kcal
Error: 97.8%
Last run: 5/11/2026, 6:33:37 AM
Sample size: 1 platos
Model: anthropic/claude-sonnet-4.6
Batch: cron-2026-05-11-mp0tokls
Want to replicate the benchmark with your own dataset? Email us at hola@renzy.app.