Hardware-aware graph transforms, fusions, and optimal quantization
in
PyTorch.
Click a card to compare the original model with the Embedl quantized version
Drop your .pt2 file here
or
Stored in your browser only. Never sent to our server.
Connect your own Claude to TorchDeploy tools via MCP:
/mcp
claude mcp add --transport http torchdeploy https://<your-domain>/mcp