Explicit semantic analysis with R

Explicit semantic analysis (ESA) was proposed by Gabrilovich and Markovitch (2007) to compute a document's position in a high-dimensional concept space. At its core, the technique compares the terms of the input document with the terms of the documents that describe each concept, and so estimates how related the input document is to each concept. In spatial terms, if I know the relative distance of the input document from meaningful concepts (e.g. ‘car’, ‘Leonardo da Vinci’, ‘poverty’, ‘electricity’), I can infer the meaning of the document from its position in the concept space, relative to those explicitly defined concepts.
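
To make the comparison concrete, here is a minimal sketch in base R. It is not the original implementation: as far as I know, ESA as published weights terms with TF-IDF over Wikipedia articles, whereas this toy version uses raw term counts and cosine similarity. The concept texts, the example document, and the function names (`tokenize`, `term_vector`, `cosine`, `esa_position`) are all made up for illustration.

```r
# Toy concept descriptions (hypothetical; a real concept space would be
# built from a large corpus with one document per concept).
concepts <- c(
  car         = "car engine wheels road driving vehicle fuel",
  electricity = "electricity power grid voltage current energy",
  poverty     = "poverty income poor inequality welfare"
)

# Lower-case the text and split it on anything that is not a letter.
tokenize <- function(text) {
  tokens <- strsplit(tolower(text), "[^a-z]+")[[1]]
  tokens[nzchar(tokens)]
}

# Count how often each vocabulary term occurs in the text.
# Terms outside the vocabulary are silently dropped.
term_vector <- function(text, vocabulary) {
  as.numeric(table(factor(tokenize(text), levels = vocabulary)))
}

# Cosine similarity, returning 0 when either vector is all zeros.
cosine <- function(a, b) {
  denom <- sqrt(sum(a^2)) * sqrt(sum(b^2))
  if (denom == 0) return(0)
  sum(a * b) / denom
}

# The vocabulary is every term used to describe at least one concept.
vocabulary <- sort(unique(unlist(lapply(concepts, tokenize))))

# Term-by-concept matrix: one row per term, one column per concept.
concept_matrix <- sapply(concepts, term_vector, vocabulary = vocabulary)

# Position of a document in concept space: one similarity per concept.
esa_position <- function(document) {
  doc_vec <- term_vector(document, vocabulary)
  apply(concept_matrix, 2, cosine, b = doc_vec)
}

document <- "The power grid supplies energy to charge the car"
position <- esa_position(document)
print(round(position, 3))
```

The result is a named vector with one coordinate per concept. For this example sentence the ‘electricity’ coordinate is the largest, ‘car’ is smaller but non-zero, and ‘poverty’ is zero because the document shares no terms with that concept's description.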

Tuesday, 26 April 2016
