workflowstacks

The marketplace for AI skills that launch offers, rank in AI search, and automate operations. No coding required.

π•βš‘πŸ’¬

Marketplace

  • Browse Skills
  • AI Agents
  • Claude Skills
  • MCP Servers
  • Prompts

Solutions

  • For Founders
  • For Agencies
  • For Ecommerce
  • Agent Builder
  • Starter Packs
  • Playbooks

Learn

  • How It Works
  • What Are Skills
  • What Are Agents
  • What Is MCP
  • For Creators
  • Submit a Tool
  • Security

Company

  • Become a Creator
  • About
  • Enterprise
  • API Docs
  • Terms
  • Privacy
  • Support
Compatible with
πŸ€–ChatGPT
✨Claude
πŸ’ŽGemini
πŸ›οΈShopify
πŸ”Ahrefs
πŸ“ŠSheets
πŸ’¬WhatsApp
πŸ“±Meta Ads
+50 moreCreator program β†’

Β© 2026 WorkflowStacks. All rights reserved.

TermsPrivacySupport
mcp-server

vMLX: Fast MLX Models

Get super fast MLX models with vMLX, ideal for startup founders using MCP servers, built with Python.
642 stars70 forksPythonQuality 8/10Updated 6/11/2026100% free Β· open source
What it does

vMLX is a Python tool that provides an optimized and compressed model serving solution with L2 disk cache, L1 paged cache, and hybrid scheduler for fast and efficient model deployment

Install / run
git clone https://github.com/jjang-ai/vmlx && cd vmlx
When to use it
  • β€’When you need to deploy large machine learning models and want to reduce memory usage
  • β€’When you require fast model serving and inference with low latency
  • β€’When you want to survive model serving interruptions, such as restarts, with a disk cache
Quick start
  1. 1Clone the vMLX repository and navigate to the project directory
  2. 2Install the required dependencies with `pip install -r requirements.txt`
  3. 3Configure the model serving settings in the `config.json` file
  4. 4Start the vMLX server with `python app.py`
  5. 5Test the model serving with a sample request using `curl` or a tool like Postman
Ready-to-paste prompt
curl -X POST -H 'Content-Type: application/json' -d '@input.json' http://localhost:8000/predict
Heads up: Make sure you have the correct Python version installed, as vMLX has specific dependencies and may not be compatible with all Python versions
Saves to your device

Topics

anthropic-api
kvcache-compression
kvcache-optimization
kvcache-reuse
llm
lmstudio
macbook
mcp-server
mlx
mlxllm
mlxstudio
omlx
omlx-alternative
openai-api
openclaw
openclaw-agent
persistent-memory
prefix-cache
vmlx
Quick Actions
Details
Creator
jjang-ai
Language
Python
Category
mcp-server
Published
2/18/2026

Are you the creator of this tool? Claim your listing β†’ and earn 85% of every sale.

Related skills

More mcp-server tools founders pair with this one.

mcp-serverβ˜… 207,966
ECC: Optimize Performance
Get optimized performance with ECC, a research-first agent harness system for founders working with AI agents and developer tools like Claude Code and Codex, with 203k+ GitHub stars.
mcp-serverβ˜… 144,909
Dify: Streamline Workflow
Get a production-ready platform with Dify. For founders needing agentic workflow development.
mcp-serverβ˜… 33,656
AstrBot: AI Agent Assistant
Get an AI-powered chatbot framework for startup founders. 34k+ GitHub stars
mcp-serverβ˜… 28,578
Composio: Build AI Agents
Get a sandboxed workbench for building AI agents with Composio, used by developers with 29k+ GitHub stars
mcp-serverβ˜… 23,507
headroom: Reduce Token Count
Get same answers with 60-95% fewer tokens for your LLM, ideal for founders using Python and LLM technology, backed by 11k+ GitHub stars
mcp-serverβ˜… 21,133
MaxKB: Build Enterprise Agents
Get enterprise-grade agents with MaxKB, used by founders, with 21k+ GitHub stars. For startup founders building AI-powered tools.