Automatic Differentiation 02-21-2022 03-17-2022 article 35 minutes read (About 5232 words) Mathematical Foundations to Neural Network Optimization Deep Learning, Machine Learning, Mathematics, Mathematical Optimization
ONNX IO Stream 01-03-2022 01-03-2022 blog 3 minutes read (About 480 words) Hard Drive Free ONNX IO Deep Learning, Machine Learning, ONNX
ResNet CIFAR Classification Using LibTorch C++ API 07-01-2021 07-01-2021 blog 6 minutes read (About 837 words) User Experience on LibTorch C++ API Deep Learning, Machine Learning, CPP, PyTorch, LibTorch
Non-Autoregressive Model and Non-Autoregressive Decoding for Sequence to Sequence Tasks 04-27-2021 04-27-2021 blog 19 minutes read (About 2780 words) Enable Parallelized Non-Autoregressive Decoding Process for Sequence to Sequence Tasks Deep Learning, Machine Learning
Autoregressive Model and Autoregressive Decoding for Sequence to Sequence Tasks 04-26-2021 04-26-2021 blog 17 minutes read (About 2523 words) Understand Autoregressive Models for Sequence to Sequence Tasks Deep Learning, Machine Learning
Pruning for Neural Networks 03-01-2021 03-01-2021 article 14 minutes read (About 2172 words) Mathematical Foundations to Neural Network Pruning Deep Learning, Machine Learning, Mathematics
Bootstrap Methods 01-31-2021 01-31-2021 blog 8 minutes read (About 1183 words) Using Bootstrap Distribution to Approximate Sampling Distribution Machine Learning, Statistics
Gibbs Sampling 01-25-2021 01-25-2021 blog 21 minutes read (About 3168 words) Sampling Method for Multivariate Joint Distributions Machine Learning, Statistics
Motivations for Sampling in Statistical Inference 01-18-2021 01-30-2021 blog 8 minutes read (About 1153 words) Statistical Inference Requires Sampling Machine Learning, Statistics
Simple Inference Server 12-30-2020 12-30-2020 project 7 minutes read (About 975 words) Running Machine Learning Inference as Service from Scratch Deep Learning, Machine Learning, Python
Cross Entropy Loss VS Log Loss VS Sum of Log Loss 07-18-2020 07-18-2020 blog 23 minutes read (About 3404 words) Conventional Classification Loss Functions Elucidated Machine Learning, Optimization, Mathematics
Population Based Training 06-28-2020 06-28-2020 blog 14 minutes read (About 2059 words) Model Optimization with Genetically Evolved Hyperparameters Machine Learning, Optimization, Genetics Algorithm
Quantization for Neural Networks 05-17-2020 03-17-2022 article 37 minutes read (About 5616 words) Mathematical Foundations to Neural Network Quantization Deep Learning, Machine Learning, Mathematics
Gated Linear Units (GLU) and Gated CNN 03-09-2020 03-09-2020 blog 6 minutes read (About 826 words) Gated Linear Units and Gated CNN Explained Deep Learning, Machine Learning, Natural Language Processing
Evolved Transformer Explained 03-07-2020 03-07-2020 blog 5 minutes read (About 774 words) Evolution Search on Transformer Neural Network Architectures Deep Learning, Machine Learning, Natural Language Processing, Neural Architecture Search, Evolution Algorithm
Group Lasso 02-13-2020 02-13-2020 blog 5 minutes read (About 742 words) Understand the Sparsity Introduced by Group Lasso Machine Learning, Statistics
Gradient Boosting Machine 01-26-2020 01-26-2020 article 12 minutes read (About 1778 words) Fundamental Theories of Gradient Boosting Machine Machine Learning, Data Science
Save, Load and Inference From TensorFlow 2.x Frozen Graph 01-09-2020 01-09-2020 blog 6 minutes read (About 826 words) Continuing to Support Frozen Graph for TensorFlow 2.x Deep Learning, Machine Learning, TensorFlow
Word2Vec Models Revisited 08-23-2019 08-23-2019 article 23 minutes read (About 3524 words) A Review on the Classic Word2Vec Models Machine Learning, Natural Language Processing, Optimization
Hierarchical Softmax 08-17-2019 08-17-2019 article 8 minutes read (About 1191 words) Orchestrated Softmax for Fast Language Model Training Machine Learning, Natural Language Processing, Optimization