Platforms for deploying and serving ML/AI models at scale
TensorRT-LLM: NVIDIA's open-source library for optimizing and running large language model (LLM) inference on NVIDIA GPUs.