On-Device AI in 2026: NPU Inference Design, Model Routing, and Drift Tests | Pulse Latellu