The two alternatives to the monasterisation of the World Wide Web

Saint Michael’s Abbey, in the Susa Valley, Piedmont. Source: Wikipedia.

In medieval Europe, information was physically concentrated in a few secluded libraries and archives. Powerful institutions managed them and regulated who could access what. The library of the fictional abbey described in Umberto Eco’s The Name of the Rose is located in a fortified tower, and only the librarian knows how to navigate its mysteries. Monasteries played an essential role in preserving written information and in creating new intelligence from that knowledge. But because written information was a scarce resource, the keys to the libraries also conferred authority and power. Similarly, Internet companies are amassing information within their fortified walls. In doing so, they provide services that we now see as essential, but they also contravene the two core principles of the Internet: openness and decentralisation.


Monday, 7 May 2018

