On-device LLM Inference