layer_embeeding
)I want a glass of orange juice to go alone with my cereal.
Context:
Come up with a few context to target pairs to create our supervised learning problem
Rule: randomly pick a word as context word; randomly pick another word within some window (±3) as target word
I want a glass of orange juice to go along with my cereal.
Context | Target |
---|---|
orange | juice |
orange | glass |
orange | go |
… | … |
Context | ⟶ | Target |
---|---|---|
c (“orange”) | ⟶ | t (“juice”) |
Oc→E→ec→softmax→ˆy
Softmax:p(t|c)=eθTtecΣ10,000j=1eθTjec
L(ˆy,y)=−Σ10,000i=1yilogˆyi
Bengio et. al., 2003, A neural probabilistic language model↩
Mikolov et. al., 2013. Efficient estimation of word representations in vector space↩
Space, Right Arrow or swipe left to move to next slide, click help below for more details