Pure Rust + CUDA LLM inference engine โ no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2
Read the entire source before you build โ unlike paid marketplaces that hide it behind a buy button.
Are you the creator of this tool? Claim your listing โ and earn 85% of every sale.
More ai-agent tools founders pair with this one.