Wallaroo.AI (Version 2024.2)
2024.2
2024.3 (Current Version)
LLM Operations
LLM Tutorials
LLM Performance Optimizations
LLM Performance Optimizations
The following tutorials demonstrate optimizing LLM performance through Wallaroo.
Llama 3 8B Instruct with vLLM
Quantized Llava 34B with Llama.cpp