Hardware-aware graph transforms, fusions, and optimal quantization in PyTorch.
Click a card to compare a model with the deployment-ready model for TensorRT
Drop your .pt2 file here
.pt2
or browse files