### Checklist (starter)
- Slice by *time / sensor / geography / site / protocol*
- Track dataset versions, not just model versions
- Keep a “hard set” aligned with deployment reality
- Maintain a failure-case gallery
Replace this placeholder with your notes.
← Back to writing
Evaluation slices: how to catch domain shift early
February 12, 2025