The idea of local LLMs is fascinating. You can run an AI model on your laptop or your own server and get effectively unlimited access without worrying about usage limits, but that idea starts to break ...