Word2Vec Models Revisited 08-23-2019 08-23-2019 article 23 minutes read (About 3524 words)A Review on the Classic Word2Vec Models Machine Learning, Natural Language Processing, Optimization Read more
Hierarchical Softmax 08-17-2019 08-17-2019 article 8 minutes read (About 1191 words)Orchestrated Softmax for Fast Language Model Training Machine Learning, Natural Language Processing, Optimization Read more
Label Smoothing 08-11-2019 12-26-2020 blog 6 minutes read (About 956 words)Regularization for Classification Models Machine Learning, Optimization, Regularization Read more
Cross Entropy, KL Divergence, and Maximum Likelihood Estimation 08-10-2019 08-10-2019 blog 6 minutes read (About 942 words)Some Theories for Machine Learning Optimization Machine Learning, Information Theory, Cross Entropy, KL Divergence, Maximum Likelihood Estimation Read more
Noise Contrastive Estimation 07-26-2019 07-26-2019 article 30 minutes read (About 4509 words)Accelerate the Training of Neural Language Models Machine Learning, Natural Language Processing, Optimization Read more
Dropout Explained 06-04-2019 06-04-2019 blog 8 minutes read (About 1153 words)The Math of Dropout You Know and Don't Know Machine Learning, Deep Learning Read more
Introduction to Variational Inference 05-25-2019 05-25-2019 article an hour read (About 7329 words)Setting Up Foundations to Bayesian Learning Machine Learning, Statistics, Variational Inference Read more
Introduction to Bayesian Filter 03-21-2019 03-21-2019 article 9 minutes read (About 1329 words)Restate Some Basic Concepts and Math in Bayesian Filter Machine Learning, Statistics, Bayesian Filter Read more
CycleGAN Voice Converter 06-13-2018 06-13-2018 project 8 minutes read (About 1179 words)CycleGAN-Based Voice Conversions Machine Learning, Deep Learning, Generative Adversarial Networks, Computer Vision Read more
CycleGAN Image Converter 06-10-2018 06-10-2018 project 9 minutes read (About 1286 words)CycleGAN-Based Image-to-Image Translations Machine Learning, Deep Learning, Generative Adversarial Networks, Computer Vision Read more