or: Efficient Estimation of Word Representations in Vector Space

This is the original word2vec paper, which introduces the skip-gram and CBOW (continuous bag-of-words) models for learning word vectors. Skip-gram works much better on some tasks, notably the semantic analogy questions, but takes about 3X as long to train as CBOW.
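To make the difference between the two models concrete, here is a minimal sketch of how each one turns a sentence into training examples. CBOW predicts the center word from its surrounding context, while skip-gram predicts each context word from the center word. The function names, the fixed `window` size, and the toy sentence are illustrative assumptions, not taken from the paper (which, among other things, varies the window size during training).

```python
def context_of(tokens, i, window):
    """Return the words within `window` positions of index i, excluding tokens[i]."""
    left = tokens[max(0, i - window):i]
    right = tokens[i + 1:i + 1 + window]
    return left + right


def cbow_examples(tokens, window=2):
    """CBOW: one example per position, (context words) -> center word."""
    examples = []
    for i, center in enumerate(tokens):
        context = context_of(tokens, i, window)
        if context:
            examples.append((context, center))
    return examples


def skipgram_examples(tokens, window=2):
    """Skip-gram: one example per (center, context word) pair."""
    examples = []
    for i, center in enumerate(tokens):
        for ctx in context_of(tokens, i, window):
            examples.append((center, ctx))
    return examples


# Hypothetical toy sentence, for illustration only.
tokens = "the quick brown fox jumps".split()
cbow = cbow_examples(tokens)
skipgram = skipgram_examples(tokens)

print(len(cbow), "CBOW examples;", len(skipgram), "skip-gram examples")
print(cbow[2])      # context around "brown" -> "brown"
print(skipgram[:2])  # ("the" -> "quick"), ("the" -> "brown")
```

In this sketch skip-gram produces several prediction targets per position where CBOW produces one, which gives some intuition for the extra training cost, though the exact ratio depends on the window size and implementation.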

This paper should be treated as a two-parter with word2vec 2.