PyTorch Eager Mode Quantization TensorRT Acceleration 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models Deep Learning, Python, Inference, Quantization, Accelerated Computing, NVIDIA, TensorRT, PyTorch, GPU Read More
TensorRT Python Inference 05-18-2024 05-18-2024 blog 12 minutes read (About 1843 words)TensorRT Python Inference Example Deep Learning, Python, Inference, NVIDIA, TensorRT, GPU Read More
Reparameterization Trick 05-08-2024 05-08-2024 blog 12 minutes read (About 1749 words)How Distribution of a Random Variable Changes when the Variable is Transformed in a Deterministic Way Deep Learning, Statistics, Probability, Mathematics, Jacobian, Variational Autoencoder Read More
Quantization Unit Test 03-25-2024 03-25-2024 blog 5 minutes read (About 748 words)How To Unit Test Quantization Implementation Deep Learning, Mathematics, Quantization, Software Engineering, Unit Test Read More
PyTorch Custom ONNX Operator Export 02-11-2024 02-11-2024 blog 10 minutes read (About 1573 words)Easier PyTorch to TensorRT Custom Plugin Integration Deep Learning, Software Engineering, TensorRT, PyTorch, ONNX Read More
How To Debug Deep Learning Inference Applications 01-01-2024 01-01-2024 article 23 minutes read (About 3511 words)First Principles of Evaluating Deep Learning Inference Deep Learning, Software Engineering, Deep Learning Inference, Numerical Errors Read More
Numerical Errors In HPC and Deep Learning 12-26-2023 12-26-2023 blog 4 minutes read (About 575 words)Numerical Errors are Evil Deep Learning, Numerical Errors, High Performance Computing Read More
Deformable Attention 12-16-2023 12-16-2023 blog 34 minutes read (About 5063 words)Attention with Learned Spatial Feature Sampling Deep Learning, Computer Vision, Transformer, Attention Read More
Deformable Convolution 12-08-2023 12-08-2023 blog 6 minutes read (About 889 words)Convolution with Learned Spatial Feature Sampling Deep Learning, Computer Vision, Convolution Read More
Function Approximation Using Lookup Table and Interpolation 09-22-2023 09-22-2023 blog 7 minutes read (About 1001 words)Using Motorola CPU32 as an Example Deep Learning, Quantization, Computer Architecture Read More
ONNX Opset Operator Counts 07-10-2023 07-10-2023 blog 4 minutes read (About 549 words)How ONNX Expands the Support for Deep Learning Inference Over the Years Deep Learning, JavaScript, ONNX Read More
PyTorch Leaf Tensor 06-19-2023 06-19-2023 blog 10 minutes read (About 1503 words)Understanding PyTorch Leaf Tensor Deep Learning, PyTorch, Graph Algorithm Read More