You don’t need to run the LLM on the device. It supports 13 different protocols for both text completion and chat completion.
There’s also RisuAI which has some nicer features like better integration with function calling. Sillytavern is working on Function calling more but it’s not quite there yet.
The nice thing about Sillytavern is that it also has plugins for Alltalk for TTS and ComfyUi/A1111 for image generation directly from the roleplay interface.
It also has support for RAG through upload of documents and web scraping and a shitload of other features it would take awhile to list here.
All LLMs are text completion engines, no matter what fancy bells they tack on.
If your task is some kind of text completion or repetition of text provided in the prompt context LLMs perform wonderfully.
For everything else you are wading through territory you could probably do easier using other methods.