Explicit semantic analysis with R

Explicit semantic analysis (ESA) was proposed by Gabrilovich and Markovitch (2007) to compute a document’s position in a high-dimensional concept space. At its core, the technique compares the terms of the input document with the terms of documents that describe the concepts, estimating the relatedness of the input document to each concept. In spatial terms, if I know the distance of the input document from meaningful concepts (e.g. ‘car’, ‘Leonardo da Vinci’, ‘poverty’, ‘electricity’), I can infer its meaning relative to those explicitly defined concepts from its position in the concept space.
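To make the idea concrete, here is a minimal base-R sketch of the comparison step. It is an illustration under stated assumptions, not the original authors' implementation: the three concept descriptions and the input document are invented toy strings standing in for real concept articles, the helper names (`tokenize`, `tf`, `cosine`) are hypothetical, and the weighting (TF-IDF with cosine similarity) is one common choice for measuring relatedness.

```r
# Toy concept descriptions (hypothetical stand-ins for real concept articles)
concepts <- c(
  car         = "engine wheel road driver vehicle engine",
  electricity = "current voltage power grid energy wire",
  poverty     = "income inequality poor welfare hunger"
)

# Toy input document (also invented for illustration)
doc <- "the vehicle has an electric engine powered by grid energy"

# Lower-case the text and split it on anything that is not a letter
tokenize <- function(x) strsplit(tolower(x), "[^a-z]+")[[1]]

# The vocabulary is built from the concept descriptions only
vocab <- sort(unique(unlist(lapply(concepts, tokenize))))

# Term-frequency vector over the fixed vocabulary;
# tokens outside the vocabulary are silently dropped
tf <- function(text) {
  as.numeric(table(factor(tokenize(text), levels = vocab)))
}

# Concept-by-term frequency matrix
concept_tf <- t(sapply(concepts, tf))
colnames(concept_tf) <- vocab

# Inverse document frequency computed over the concept descriptions
idf <- log(nrow(concept_tf) / colSums(concept_tf > 0))

# TF-IDF weights for the concepts and for the input document
concept_tfidf <- sweep(concept_tf, 2, idf, `*`)
doc_tfidf <- tf(doc) * idf

# Cosine similarity between two numeric vectors
cosine <- function(a, b) sum(a * b) / (sqrt(sum(a^2)) * sqrt(sum(b^2)))

# Position of the document in concept space: one score per concept
esa <- apply(concept_tfidf, 1, cosine, b = doc_tfidf)
print(round(esa, 3))
```

The resulting named vector `esa` is the document's coordinates in this three-dimensional concept space. With these toy strings the document lands closest to ‘car’, somewhat further from ‘electricity’, and shares no terms with ‘poverty’. A real application would use many thousands of concepts and a sparse matrix representation.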


Tuesday, 26 April 2016


Twitter: frbailo


