Explicit semantic analysis with R

Explicit semantic analysis (ESA) was proposed by Gabrilovich and Markovitch (2007) to compute a document position in a high-dimensional concept space. At the core, the technique compares the terms of the input document with the terms of documents describing the concepts estimating the relatedness of the document to each concept. In spatial terms if I know the relative distance of the input document from meaningful concepts (e.g. ‘car’, ‘Leonardo da Vinci’, ‘poverty’, ‘electricity’), I can infer the meaning of the document relatively to explicitly defined concepts because of the document’s position in the concept space.

(more…)

Tuesday, 26 April 2016

tweets


Twitter: frbailo

links


blogroll


RSS r-bloggers.com

  • Model-Based Causal Forests for Heterogeneous Treatment Effects
    A new arXiv paper investigates which building blocks of random forests, especially causal forests and model-based forests, make them work for heterogeneous treatment effect estimation, both in randomized trials and observational studies. ... Continue reading: Model-Based Causal Forests for Heterogeneous Treatment Effects
  • A Major Contribution to Learning R
    Prominent statistician Frank Harrell has come out with a radically new R tutorial, rflow. The name is short for “R workflow,” but I call it “R in a box” –everything one needs for beginning serious usage of R, starting from little or no background. By serious usage I mean real ... Continue reading: A Major […]
  • Evaluating GitHub Activity for Contributors
    Say you have a bug report or feature request to make to a package. How can you use information on GitHub to manage your expectations (will there be a quick fix) and actions (should you go ahead and fork the repository)? In this post, we shall go over ... Continue reading: Evaluating GitHub Activity for […]
  • Developing React Applications in RStudio Workbench
    Introduction RStudio Workbench provides a development environment for R, Python, and many other languages. When developing a performant web application you may progress from Shiny towards tools li... Continue reading: Developing React Applications in RStudio Workbench
  • Food Crisis Analysis and, Forecasting with Neural Network Autoregression
    The war between Russia and Ukraine has affected the global food supply other than many vital things. Primarily cereal crop products have been affected the most because the imports have been provided to the world mainly through Ukraine and Russia. Let’s check the situation we’ve mentioned for G20 ... Continue reading: Food Crisis Analysis and, […]

RSS Simply Statistics

RSS Statistical Modeling, Causal Inference, and Social Science