Running a model locally on Apple Silicon hardware using the MLX framework, which is optimized for efficient inference on Mac devices.