vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast TTFT) + Hybrid SSM Scheduler + Continuous Batching + more!
Read the entire source before you build, unlike paid marketplaces that hide it behind a buy button.