Aphrodite supports a large variety of generative Transformer models in Hugging Face Transformers. The following is the list of model architectures that we currently support.
Decoder-only Language Models
Architecture | Example HF Model |
---|
AquilaForCausalLM | BAAI/AquilaChat-7B |
ArcticForCausalLM | Snowflake/snowflake-arctic-instruct |
BaiChuanForCausalLM | baichuan-inc/Baichuan2-13B-Chat |
BloomForCausalLM | bigscience/bloomz |
ChatGLMModel | THUDM/chatglm3-6b |
CohereForCausalLM | CohereForAI/c4ai-command-r-v01 |
DbrxForCausalLM | databricks/dbrx-instruct |
DeciLMForCausalLM | DeciLM/DeciLM-7B |
DeepseekForCausalLM | deepseek-ai/deepseek-moe-16b-base |
DeepseekV2ForCausalLM | deepseek-ai/DeepSeek-V2.5 |
ExaoneForCausalLM | LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct |
FalconForCausalLM | tiiuae/falcon-7b |
GPT2LMHeadModel | gpt2 |
GPTBigCodeForCausalLM | bigcode/starcoder |
GPTJForCausalLM | pygmalionai/pygmalion-6b |
GPTNeoXForCausalLM | EleutherAI/pythia-12b |
GemmaForCausalLM | google/gemma-7b |
Gemma2ForCausalLM | google/gemma-2-9b |
GraniteForCausalLM | ibm-research/PowerLM-3b |
GraniteMoeForCausalLM | ibm-research/PowerMoE-3b |
InternLMForCausalLM | internlm/internlm-7b |
InternLM2ForCausalLM | internlm/internlm2-7b |
JAISLMHeadModel | core42/jais-13b |
JambaForCausalLM | ai21labs/Jamba-v0.1 |
LlamaForCausalLM | meta-llama/Meta-Llama-3.1-8B |
MPTForCausalLM | mosaicml/mpt-7b |
MambaForCausalLM | state-spaces/mamba-2.8b-hf |
MiniCPMForCausalLM | openbmb/MiniCPM-2B-dpo-bf16 |
MiniCPM3ForCausalLM | openbmb/MiniCPM3-4B |
MistralForCausalLM | mistralai/Mistral-7B-v0.1 |
MixtralForCausalLM | mistralai/Mixtral-8x7B-v0.1 |
NemotronForCausalLM | nvidia/Minitron-8B-Base |
NVLM_D | nvidia/NVLM-D-72B |
OPTForCausalLM | facebook/opt-66b |
OLMoForCausalLM | allenai/OLMo-7B-hf |
OlmoeForCausalLM | allenai/OLMoE-1B-7B-0125 |
OrionForCausalLM | OrionStarAI/Orion-14B-Chat |
PersimmonForCausalLM | adept/persimmon-8b-chat |
PhiForCausalLM | microsoft/phi-2 |
Phi3ForCausalLM | microsoft/Phi-3-medium-128k-instruct |
Phi3SmallForCausalLM | microsoft/Phi-3-small-128k-instruct |
PhiMoEForCausalLM | microsoft/Phi-3.5-MoE-instruct |
QwenLMHeadModel | Qwen/Qwen-7B |
Qwen2ForCausalLM | Qwen/Qwen2-72B |
Qwen2MoeForCausalLM | Qwen/Qwen1.5-MoE-A2.7B |
Qwen2VLForConditionalGeneration | Qwen/Qwen2-VL-7B-Instruct |
SolarForCausalLM | upstage/solar-pro-preview-instruct |
StableLmforCausalLM | stabilityai/stablelm-3b-4e1t |
Starcoder2ForCausalLM | bigcode/starcoder2-3b |
XverseForCausalLM | xverse/XVERSE-65B-Chat |
Encoder-Decoder Language Models
Architecture | Example Model |
---|
BartForConditionalGeneration | facebook/bart-large-cnn |
Embedding Models
Architecture | Example Model |
---|
MistralModel | intfloat/e5-mistral-7b-instruct |
Qwen2ForRewardModel | Qwen/Qwen2.5-Math-RM-72B |
Gemma2Model | BAAI/bge-multilingual-gemma2 |
Multimodal Language Models
Architecture | Supported Modalities | Example Model |
---|
Blip2ForConditionalGeneration | Image | Salesforce/blip2-opt-6.7b |
ChameleonForConditionalGeneration | Image | facebook/chameleon-7b |
ChatGLMModel | Image | THUDM/chatglm3-6b |
FuyuForCausalLM | Image | adept/fuyu-8b |
InternVLChatModel | Image | OpenGVLab/InternVL2-8B |
LlavaForConditionalGeneration | Image | llava-hf/llava-v1.5-7b-hf |
LlavaNextForConditionalGeneration | Image | llava-hf/llava-v1.6-mistral-7b-hf |
LlavaNextVideoForConditionalGeneration | Video | llava-hf/LLaVA-NeXT-Video-7B-hf |
LlavaOnevisionForConditionalGeneration | Image, Video | llava-hf/llava-onevision-qwen2-7b-ov-hf |
MiniCPMV | Image | openbmb/MiniCPM-V-2_6 |
MllamaForConditionalGeneration | Image | meta-llama/Llama-3.2-11B-Vision-Instruct |
MolmoForCausalLM | Image | allenai/Molmo-7B-D-0924 |
PaliGemmaForConditionalGeneration | Image | google/paligemma-3b-pt-224 |
Phi3VForCausalLM | Image | microsoft/Phi-3.5-vision-instruct |
PixtralForConditionalGeneration | Image | mistralai/Pixtral-12B-2409 |
QWenLMHeadModel | Image | Qwen/Qwen-VL |
Qwen2VLForConditionalGeneration | Image | Qwen/Qwen2-VL-7B-Instruct |
UltravoxModel | Audio | fixie-ai/ultravox-v0_3 |
Speculative Models
Architecture | Example Model |
---|
EAGLEModel | abhigoyal/vllm-eagle-llama-68m-random |
MedusaModel | abhigoyal/vllm-medusa-llama-68m-random |
MLPSpeculatorPreTrainedModel | ibm-fms/llama-160m-accelerator |
If your model uses any of the architectures above, you can seamlessly run your model with Aphrodite.