all-mpnet-base-v2-5B

Version: 5B GPU I64

You can quickly deploy all-mpnet-base-v2-5B to an endpoint using our MAX container. It includes the latest version of MAX with GPU support and our Python-based inference server called MAX Serve.

With the following Docker command, you’ll get an OpenAI-compatible endpoint running all-mpnet-base-v2-5B:

docker run --gpus 1 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_HUB_ENABLE_HF_TRANSFER=1" \
    --env "HF_TOKEN=" \
    -p 8000:8000 \
    docker.modular.com/modular/max-openai-api:nightly \
    --huggingface-repo-id sentence-transformers/all-mpnet-base-v2

To download the model from Hugging Face, set the HF_TOKEN value to your access token. No token is needed if the model is from https://huggingface.co/modularai.
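Once the container is running, the endpoint can be queried like any other OpenAI-compatible embeddings service. The following is a minimal sketch, not an official client. It assumes the server is reachable at http://localhost:8000 (from the `-p 8000:8000` mapping above), that it exposes the OpenAI-style `/v1/embeddings` route, and that the model name matches the Hugging Face repo ID. The helper names (`build_embedding_request`, `parse_embeddings`, `embed`, `cosine_similarity`) are hypothetical and exist only for illustration.

```python
import json
import math
import urllib.request

# Assumptions (not confirmed by this page): the container started above is
# listening on localhost:8000 and serves the OpenAI-style /v1/embeddings route.
BASE_URL = "http://localhost:8000/v1"
MODEL = "sentence-transformers/all-mpnet-base-v2"


def build_embedding_request(texts, base_url=BASE_URL, model=MODEL):
    """Build (without sending) a POST request in the OpenAI embeddings format."""
    body = json.dumps({"model": model, "input": list(texts)}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/embeddings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embeddings(response_json):
    """Extract vectors from an OpenAI-style response, ordered by 'index'."""
    items = sorted(response_json["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def embed(texts):
    """Send the request to the running container and return the vectors."""
    request = build_embedding_request(texts)
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_embeddings(json.load(response))


if __name__ == "__main__":
    # Requires the container to be up; compares two related sentences.
    first, second = embed(["How do I deploy a model?", "Steps for serving a model"])
    print(f"cosine similarity: {cosine_similarity(first, second):.3f}")
```

Building the request separately from sending it keeps the payload easy to inspect before any network call is made. If your deployment uses a different host, port, or model name, adjust `BASE_URL` and `MODEL` accordingly.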

Learn more

For more information about the container image, see the MAX container documentation.

To learn more about how to deploy MAX to the cloud, check out our MAX Serve tutorials.

DETAILS

Embedding

MODEL CLASS: MAX Model

MAX Models are popular open-source models converted to MAX’s native graph format. Anything with this label is either state of the art (SOTA) or under active development. Learn more about MAX Models.

Browse all MAX Models

HARDWARE: GPU
QUANTIZATION: I64
ARCHITECTURE: MAX Model

MAX GITHUB

Modular / MAX

MODEL

sentence-transformers

sentence-transformers/all-mpnet-base-v2

TAGS

sentence-transformers / pytorch / onnx / safetensors / openvino / mpnet / fill-mask / feature-extraction / sentence-similarity / transformers / en

dataset:s2orc / dataset:flax-sentence-embeddings/stackexchange_xml / dataset:ms_marco / dataset:gooaq / dataset:yahoo_answers_topics / dataset:code_search_net / dataset:search_qa / dataset:eli5 / dataset:snli / dataset:multi_nli / dataset:wikihow / dataset:natural_questions / dataset:trivia_qa / dataset:embedding-data/sentence-compression / dataset:embedding-data/flickr30k-captions / dataset:embedding-data/altlex / dataset:embedding-data/simple-wiki / dataset:embedding-data/QQP / dataset:embedding-data/SPECTER / dataset:embedding-data/PAQ_pairs / dataset:embedding-data/WikiAnswers

arxiv:1904.06472 / arxiv:2102.07033 / arxiv:2104.08727 / arxiv:1704.05179 / arxiv:1810.09305

license:apache-2.0 / autotrain_compatible / endpoints_compatible / region:us

© Copyright - Modular Inc - 2025