LLaVA is an advanced open-source chatbot designed for multimodal instruction research and language processing.
Version:
7B GPU:
This version is not quantized and a GPU is recommended.
Install our magic
package manager:
curl -ssL https://magic.modular.com/ | bash
Then run the source
command that's printed in your terminal.
Install Max Pipelines in order to run this model.
magic global install max-pipelines && magic global update
Start a local endpoint for llava-v1.5/7B:
max-pipelines serve --huggingface-repo-id=liuhaotian/llava-v1.5-7b
The endpoint is ready when you see the URI printed in your terminal:
Server ready on http://0.0.0.0:8000 (Press CTRL+C to quit)
Now open another terminal to send a request using curl
:
curl -N http://0.0.0.0:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
"model": "liuhaotian/llava-v1.5-7b",
"stream": true,
"messages": [
{
"role": "user",
"content": [
{
"type": "text",
"text": "What is in this image?"
},
{
"type": "image_url",
"image_url": {
"url": "http://images-assets.nasa.gov/image/iss072e571418/iss072e571418~orig.jpg"
}
}
]
}
]
}' | grep -o '"content":"[^"]*"' | sed 's/"content":"//g' | sed 's/"//g' | tr -d '
' | sed 's/\n//g'
🎉 Hooray! You’re running Generative AI. Our goal is to make this as easy as possible.
DETAILS
MAX GITHUB
Modular / MAX
MODEL
liuhaotian
liuhaotian/llava-v1.5-7b
QUESTIONS ABOUT THIS MODEL?
Leave a comment
PROBLEMS WITH THE CODE?
File an Issue
TAGS
ENTERPRISES
@ Copyright - Modular Inc - 2025