Words that occur in similar contexts tend to have similar meanings. This link between similarity in how words are distributed and similarity in what they mean is called the distributional hypothesis.
- words which are synonyms tended to occur in the same environment
- with the amount of meaning difference between two words “corresponding roughly to the amount of difference in their environments”
vector semantics instantiates this linguistic hypothesis by learning representations of the meaning of words directly from their distributions in texts.
