Wallaroo.AI (Version 2025.1)
2025.1 (Current Version)
2024.4
2024.3
2023.2
LLM Operations
LLM Tutorials
LLM Deploy
Deploy with QAIC AI Acceleration
Deploy with QAIC AI Acceleration
The following tutorials demonstrate deploying different LLMs with
Qualcomm QIAC
AI acceleration.
Deploy Llama with Continuous Batching Using Native vLLM Framework and QAIC AI Acceleration
Deploy RAG Llama with QAIC
Deploy Llama with Continuous Batching Using Native vLLM Framework with QAIC using OpenAI Inference
Deploy RAG Llama with OpenAI compatibility on QAIC