Skip to main content

Title

Add official support for Microsoft's Phi-4 models in KAITO.

Summary

  • Model Description: The phi-4 model from Microsoft, released in December 2024, is a state-of-the-art language model designed for advanced reasoning and high-quality text generation. It is built on a blend of synthetic datasets, filtered public domain websites, and academic books, ensuring robust performance in various tasks. With 14.7B parameters, phi-4 excels in memory and compute-constrained environments, offering precise instruction adherence and strong safety measures through supervised fine-tuning and direct preference optimization. This model is particularly suited for general-purpose AI applications requiring reasoning and logic. The knowledge cut-off date for phi-4 is set to June 2024.
  • Model Usage Statistics: As of January 2025, phi-4 has garnered over 88k downloads.
  • Model License: MIT License

Requirements

The following table describes the basic model characteristics and the resource requirements of running it.

FieldNotes
Family namePhi-4
Typeconversational
Download sitehttps://huggingface.co/microsoft/phi-4
Versionf957856cd926f9d681b14153374d755dd97e45ed
Storage size30 GB
GPU count01 GPU
Total GPU memory30 GB
Per GPU memory80 GB

Runtimes

This section describes how to configure the runtime framework to support the inference calls.

OptionsNotes
RuntimeHuggingface Transformer & VLLM
Distributed InferenceFalse
Custom configurationsPrecision: BF16. Can run on one machine with 30+ GB of GPU memory.

History

  • 01/16/2025: Open proposal PR.
  • xx/xx/2025: Phi-4 Merged TBD