Optimal Brain Surgeon 05-14-2025 05-14-2025 blog 40 minutes read (About 6031 words)Derivation and Extension of The Classical Optimal Brain Surgeon Algorithm Deep Learning, Mathematics, Neural Networks, Calculus, Neural Network Pruning Read More
Tensor Calculus Layout Conventions 05-08-2025 05-08-2025 blog 39 minutes read (About 5799 words)Numerator Layout VS Denominator Layout Mathematics, Calculus Read More
TensorRT Implicit Weight Quantization 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)TensorRT Implicit Weight Quantization Caveats and Tricks Deep Learning, Mathematics, Quantization, TensorRT Read More
Automatic Differentiation Revisited 04-12-2025 04-12-2025 blog 15 minutes read (About 2198 words)Jacobian Matrix, De Novo Chain Rule Expression, Jacobian-Vector Product, Vector-Jacobian Product Deep Learning, Mathematics, Linear Algebra, Calculus, Automatic Differentiation Read More
Chain Rule 04-06-2025 04-06-2025 blog 14 minutes read (About 2153 words)Chain Rule Using Jacobian Matrix Multiplications Mathematics, Linear Algebra, Calculus, Jacobian, Chain Rule Read More
Derivatives 03-31-2025 03-31-2025 blog 17 minutes read (About 2585 words)Derivatives, Partial Derivatives, Directional Derivatives, and Total Derivatives Mathematics, Linear Algebra, Calculus, Jacobian Read More
Field of View 03-03-2025 03-03-2025 blog 4 minutes read (About 627 words)Mathematical Derivation of Field of View Equation Mathematics, Physics, Photography, Camera Read More
Depth of Field 02-23-2025 02-23-2025 blog 13 minutes read (About 2015 words)Mathematical Derivation of Depth of Field Equation Mathematics, Physics, Photography, Camera Read More
AWQ: Activation-Aware Weight Quantization 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy Deep Learning, Mathematics, Quantization, Accelerated Computing, CUDA Read More
CuTe Swizzle 12-01-2024 03-04-2025 blog 19 minutes read (About 2808 words)CuTe Shared Memory Swizzling Abstractions Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe Read More
CuTe Matrix Transpose 11-20-2024 12-26-2024 article an hour read (About 10825 words)Matrix Transpose CUDA Kernel Implementation Using CuTe Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe Read More
CuTe Layout Algebra 10-20-2024 10-20-2024 article 2 hours read (About 16932 words)Mathematical Fundamentals to CUTLASS Computing Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe, Category Theory Read More