Wallaroo.AI (Version 2025.1)

Deploy with QAIC AI Acceleration


The following tutorials demonstrate how to deploy different LLMs with Qualcomm QAIC AI acceleration.


Deploy Llama with Continuous Batching Using Native vLLM Framework and QAIC AI Acceleration

Deploy RAG Llama with QAIC

Deploy Llama with Continuous Batching Using Native vLLM Framework with QAIC using OpenAI Inference

Deploy RAG Llama with OpenAI compatibility on QAIC

© 2025 Wallaroo Labs, Inc.