2025-02-20

[2025-2026, Planning…]
- Language Model Learning & Practice (Student-Oriented Series) (TBD)
- Current Trends in Goal-Guided Conversational AI Models (TBD)
- Using Language Models in Specific Domains (TBD)
[2024-2025] Use AI to Detect AI-Generated Text
[2023-2024] Using Language Models in Specific Domains
- 1 Introduction
- 2 Domain-specific Training Data
  - [Any Domain] Use Unlabelled Text to Improve Instruction Following Language Models (Notes 1, 2, 3, 4, 5)
  - [Medical/Health] ChatDoctor (Notes 1, 2, 3; Slides 1, 2, 3)
  - [Medical/Health] MedicalGPT-zh (Notes, Slides)
  - [Medical/Health] MING (Notes, Slides)
  - [Medical/Health] SoulChat (Notes, Slides)
  - [Mobile Interaction] Tiny Models, Mighty Powers - ReALM (1, 2, 3, 4)
- 3 Automatic Model Evaluation
  - [Any Domain] Evaluating Language Models with Language Models (1 Introduction)
  - [Any Domain] Evaluating Language Models with Language Models (2 PandaLM)
  - [Any Domain] Evaluating Language Models with Language Models (3 Shepherd, 1,2,3,4)
  - [Medical/Health] Comparing ChatDoctor and ChatGPT3.5 using BERT-Score
[2024] Tiny Models, Mighty Powers - ReALM (1, 2, 3, 4)
[2023-2024] Use Unlabelled Text to Improve Instruction Following Language Models
[2023] Evaluating Language Models with Language Models
- 1 Introduction
- 2 PandaLM
- 3 Shepherd (1, 2, 3, 4)
[2023-2025] Past of Goal-Guided Conversational AI Models
[2022-2024] Conference “Interesting”s
[2023] Chinese Natural Language Understanding, NLU, in Dialogue Systems
[2021] Fantastic Trees (Decision Trees, Random Forest, Adaboost, Gradient Boosting DT, XGBoost)
[2020] Improving Your English Communication Skills (Writing Emails, Speaking English and Building ePortfolio)
[2017] CRF Layer on the Top of BiLSTM
- CRF Layer on the Top of BiLSTM - 1 Outline and Introduction
- CRF Layer on the Top of BiLSTM - 2 CRF Layer (Emission and Transition Score)
- CRF Layer on the Top of BiLSTM - 3 CRF Loss Function
- CRF Layer on the Top of BiLSTM - 4 Real Path Score
- CRF Layer on the Top of BiLSTM - 5 The Total Score of All the Paths
- CRF Layer on the Top of BiLSTM - 6 Infer the Labels for a New Sentence
- CRF Layer on the Top of BiLSTM - 7 Chainer Implementation Warm Up
- CRF Layer on the Top of BiLSTM - 8 Demo Code

Notes (Single Post)

Paper Explained

Detailed Links:
* Fantastic Trees (Decision Trees, Random Forest, Adaboost, Gradient Boosting DT, XGBoost)
Droot (Dog Tree)
* Probabilistic Graphical Models Revision Notes

* Super Machine Learning Revision Notes

Reviewing...

[2017] CRF Layer on the Top of BiLSTM (BiLSTM-CRF)

CRF Layer on the Top of BiLSTM - 1 Outline and Introduction
CRF Layer on the Top of BiLSTM - 2 CRF Layer (Emission and Transition Score)
CRF Layer on the Top of BiLSTM - 3 CRF Loss Function
CRF Layer on the Top of BiLSTM - 4 Real Path Score
CRF Layer on the Top of BiLSTM - 5 The Total Score of All the Paths
CRF Layer on the Top of BiLSTM - 6 Infer the Labels for a New Sentence
CRF Layer on the Top of BiLSTM - 7 Chainer Implementation Warm Up
CRF Layer on the Top of BiLSTM - 8 Demo Code

The dog needs to find the best path to get his favorite bone toy and return home following the way he came

2021-04-05

Fantastic Trees

This note summarises the Youtube Videos published by Josh Starmer (Youtube Account: StatQuest with Josh Starmer). I would like to say a big thank you to him and his super useful videos!

Droot (Dog Tree)

2020-04-06

Main Points of Interesting Papers

This page lists the notes of interesting papers at different research topics of Natural Language Processing (NLP). Each note briefly describes the main points of each paper. Hope this would be helpful for you to quickly get the ideas of them. (Please be free to correct me if you found mistakes.)

2019-11-01

Improving Your English Communication Skills

This is the note of this online course, Improving Your English Communication Skills on Coursera.

2019-01-07

Probabilistic Graphical Models Revision Notes

[Last Updated: 2020.02.23]

This note summarises the online course, Probabilistic Graphical Models Specialization on Coursera.
Any comments and suggestions are most welcome!

2018-01-23

Super Machine Learning Revision Notes

[Last Updated: 06/01/2019]

This article aims to summarise:

basic concepts in machine learning (e.g. gradient descent, back propagation etc.)
different algorithms and various popular models
some practical tips and examples were learned from my own practice and some online courses such as Deep Learning AI.

If you a student who is studying machine learning, hope this article could help you to shorten your revision time and bring you useful inspiration. If you are not a student, hope this article would be helpful when you cannot recall some models or algorithms.

Moreover, you can also treat it as a “Quick Check Guide”. Please be free to use Ctrl+F to search any key words interested you.

Any comments and suggestions are most welcome!

2018-01-17

My Life

Reading for Learning English
- [2017.07.25 -] Successful Writing at Work, Ninth Edition
- [2017.09.27 -] The Curse of the Cheese Pyramid (Geronimo Stilton)
- [2016.12.17 - 2017.09.26] Lost Treasure of the Emerald Eye (Geronimo Stilton)
Books
- [2016.05.01 -] Love in the Time of Cholera
- [2015.02.24 - 2016.12.24] 1Q84
- [2013.12.07 - 2015.04.10] The Three-Body Problem Series
TV Series & Movies
- [2018.03.02 - ] The Big Bang Theory
- [2018.01.27 - 2018.03.01] Sherlock
- [2016.04.13 - 2018.01.26] Downton Abbey
- [2015.04.01 - 2017.05.26] the Vampire Diaries
Piano
- [2016.09.01 - 2017.02.05] OST : Just One Time is Enough (Aska Yang)
Sports (First Time)
- [2017] Skiing
- [2014] Ice Skating
- [2007] Swimming

2017-12-07

CRF Layer on the Top of BiLSTM - 8

3.4 Demo

In this section, we will make two fake sentences which only have 2 words and 1 word respectively. Moreover, we will also randomly generate their true answers. Finally, we will show how to train the CRF Layer by using Chainer v2.0. All the codes including the CRF layer are avaialbe from GitHub.

2017-12-06

CRF Layer on the Top of BiLSTM - 7

3 Chainer Implementation

In this section, the structure of code will be explained. In addition, an important tip of implementing the CRF loss layer will also be given. Finally, the Chainer (version 2.0) implementation source code will be released in the next article.

2017-11-24

CRF Layer on the Top of BiLSTM - 6

2.6 Infer the labels for a new sentence

In the previous sections, we learned the structure of BiLSTM-CRF model and the details of CRF loss function. You can implement your own BiLSTM-CRF model by various opensource frameworks (Keras, Chainer, TensorFlow etc.). One of the greatest things is the backpropagation of on your model is automatically computed on these frameworks, therefore you do not need to implement the backpropagation by yourself to train your model (i.e. compute the gradients and to update parameters). Moreover, some frameworks have already implemented the CRF layer, so combining a CRF layer with your own model would be very easy by only adding about one line code.

In this section, we will explore how to infer the labels for a sentence during the test when our model is ready.

CreateMoMo

Table of Contents

Fantastic Trees

Main Points of Interesting Papers

Improving Your English Communication Skills

Probabilistic Graphical Models Revision Notes

[Last Updated: 2020.02.23]

Super Machine Learning Revision Notes

[Last Updated: 06/01/2019]

My Life

CRF Layer on the Top of BiLSTM - 8

3.4 Demo

CRF Layer on the Top of BiLSTM - 7

3 Chainer Implementation

CRF Layer on the Top of BiLSTM - 6

2.6 Infer the labels for a new sentence