OpenAI-compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s.
Install the package:

pip install vllm-mlx
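Once the server is running, any OpenAI client can talk to it. A minimal sketch using the official openai Python package; the base URL, port, and model id below are assumptions, so adjust them to your local setup:

```python
# Minimal chat completion against a local OpenAI-compatible endpoint.
# Assumptions: the server listens on http://localhost:8000/v1 and a
# Llama-family model is loaded; both values are placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # assumed local endpoint
    api_key="not-needed",  # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="mlx-community/Llama-3.2-3B-Instruct-4bit",  # placeholder model id
    messages=[
        {"role": "user", "content": "Explain continuous batching in one sentence."}
    ],
)
print(response.choices[0].message.content)
```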
To use this server with Claude Desktop over MCP, add this configuration to your claude_desktop_config.json (the uvx entry point here is assumed to match the package name):

{
  "mcpServers": {
    "waybarrios-vllm-mlx-github": {
      "command": "uvx",
      "args": ["vllm-mlx"]
    }
  }
}

No other setup, such as API keys, is needed; the server works out of the box. Restart Claude Desktop, then ask:
"What tools do you have available from vllm mlx?"
Claude will query the server and return a list of the tools it exposes.
"What resources are available in vllm mlx?"
Claude will query available resources and return a list of what you can access.
"Show me details about [specific item] in vllm mlx"
Claude will fetch and display detailed information about the requested item.
"Create a new [item] in vllm mlx with [details]"
Claude will use the appropriate tool to create the resource and confirm success.
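Because the server advertises multimodal support for vision-language models such as Qwen-VL and LLaVA, image inputs should go through the standard OpenAI vision message format. A minimal sketch, again with a placeholder endpoint, model id, and image URL:

```python
# Send an image to a vision-language model via the OpenAI chat format.
# The endpoint, model id, and image URL are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="mlx-community/Qwen2-VL-2B-Instruct-4bit",  # placeholder model id
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/sample.png"},
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)
```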
We build custom MCP integrations for B2B companies, from simple connections to complex multi-tool setups.