workflowstacks

The marketplace for AI skills that launch offers, rank in AI search, and automate operations. No coding required.

๐•โšก๐Ÿ’ฌ

Marketplace

  • Browse Skills
  • AI Agents
  • Claude Skills
  • MCP Servers
  • Prompts

Solutions

  • For Founders
  • For Agencies
  • For Ecommerce
  • Agent Builder
  • Starter Packs
  • Playbooks

Learn

  • How It Works
  • What Are Skills
  • What Are Agents
  • What Is MCP
  • For Creators
  • Submit a Tool
  • Security

Company

  • Become a Creator
  • About
  • Enterprise
  • API Docs
  • Terms
  • Privacy
  • Support
Compatible with
๐Ÿค–ChatGPT
โœจClaude
๐Ÿ’ŽGemini
๐Ÿ›๏ธShopify
๐Ÿ”Ahrefs
๐Ÿ“ŠSheets
๐Ÿ’ฌWhatsApp
๐Ÿ“ฑMeta Ads
+50 moreCreator program โ†’

ยฉ 2026 WorkflowStacks. All rights reserved.

TermsPrivacySupport
ai-agent

TensorRT-LLM: Fast LLM Inference

Get efficient Large Language Model inference with TensorRT-LLM, a Python API for founders, backed by 14k+ GitHub stars.
intermediateโฑ 30 minutes๐Ÿ’ต Free + LLM API costs
13,993 stars2,500 forksPythonQuality 8/10Updated 6/29/2026100% free ยท open source
What it is

Use Python to define and run Large Language Models.

What you can make with it

Agents that perform tasks like answering customer queries, generating product descriptions.

How it helps

TensorRT LLM helps users perform inference efficiently on NVIDIA GPUs, saving time and costs.

Real use case example

"A founder uses TensorRT LLM to create a customer support chatbot. They write a Python script to define the model, train it on customer data, and deploy it on their GPU. The chatbot answers common questions, freeing up human support staff to focus on complex issues."

If you're new

Picking up this skill takes some prior programming experience and familiarity with Python and AI concepts.

If you're senior

Senior engineers and professionals use TensorRT LLM for demanding language model applications requiring high performance and efficient inference.

Common confusion cleared up

Don't confuse TensorRT LLM with other AI engines; it's specifically designed for large language models and NVIDIA GPU acceleration.

Best inside these AI tools
Claude DesktopCodex CLIContinue
Pairs with
Stripe webhookNotion databaseGemini
Why we list it on WorkflowStacks: A marketplace of AI tools includes this for access to state-of-the-art optimizations.

About this skill

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

Topics

blackwell
cuda
llm-serving
moe
pytorch
What's inside โ€” free to inspect
No purchase needed

Read the entire source before you build โ€” unlike paid marketplaces that hide it behind a buy button.

33
top-level files
18
folders
1900.2M
repo size
Other
license
Key files
.editorconfig
.pre-commit-config.yaml
AGENTS.md
README.md
requirements-dev.txt
requirements.txt
File tree
.claude/
.codex/
.devcontainer/
.github/
3rdparty/
benchmarks/
cpp/
docker/
docs/
enroot/
examples/
jenkins/
scripts/
security_scanning/
tensorrt_llm/
tests/
triton_backend/
triton_kernels/
.clang-format
.clang-tidy
.clangd
.cmake-format.json
.coderabbit.yaml
.cursorignore
Quick Actions
Details
Creator
NVIDIA
Language
Python
Category
ai-agent
Published
8/16/2023

Are you the creator of this tool? Claim your listing โ†’ and earn 85% of every sale.

Related skills

More ai-agent tools founders pair with this one.

ai-agentโ˜… 512,463
build-your-own-x: Learn by Recreating
Improve programming skills by rebuilding tech from scratch. For startup founders and programmers.
ai-agentโ˜… 450,722
freeCodeCamp
FreeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.
ai-agentโ˜… 439,771
Public-apis: Free API Access
Get free APIs for development. For startup founders and developers.
ai-agentโ˜… 390,947
Free Programming Books
:books: Freely available programming books
ai-agentโ˜… 350,624
coding-interview-university: Learn to Code
Get a complete study plan to become a software engineer, for startup founders, with 351k+ GitHub stars
ai-agentโ˜… 297,550
Selfhosted Control with awesome-selfhosted
Host your own network services and web apps. For founders seeking privacy and control.