Introduction
What's NEW!
Retrieval Augmented Generation (RAG) support is live! - KAITO RagEngine uses LlamaIndex and FAISS, learn about from here! Latest Release: July 18th, 2025. KAITO v0.5.1.
First Release: Nov 15th, 2023. KAITO v0.1.0.
KAITO is an operator that automates the AI/ML model inference or tuning workload in a Kubernetes cluster. The target models are popular open-sourced large models such as falcon and phi-3.