Athul Ramkumar. “Enabling On-Device Inference of Large Language Models : Challenges, Techniques, and Applications”. International Journal of Scientific Research in Computer Science, Engineering and Information Technology 10, no. 6 (November 18, 2024): 595–604. Accessed August 2, 2025. https://www.ijsrcseit.com/index.php/home/article/view/CSEIT241061100.