Hardware-aware graph transforms, fusions, and optimal quantization in PyTorch.
Click a card to compare the original model with the Embedl-quantized version.
Drop your .pt2 file here, or browse files.
Sign in with Google to download transformed models.