2018 Italian general election: Details on my simulation

This article describes the simulation behind the accompanying app.

This simulation of the results of the 2018 general election is based on the results of the two most recent nationwide elections (the 2013 Italian parliamentary election and the 2014 European Parliament election) and on national polls conducted up to 16 February 2018. The simulation rests on one assumption, which is reasonable but not necessarily realistic: the relative territorial strength of parties is stable. It follows from this assumption that if the national support for a party (as measured by national voting intention polls) varies, it varies consistently and proportionally everywhere. A rising tide lifts all boats, and vice versa. The assumption has some empirical justification. If we compare, for each district, a party's difference from its national support (in percentage) in 2013 and in 2014, we see a significant correlation, especially for the major parties.
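The proportional assumption can be illustrated with a minimal sketch. This is not the code behind the simulation: the function name, the district names, and all the numbers below are hypothetical, chosen only to show what "varies proportionally everywhere" means for a single party.

```python
def project_district_share(district_past, national_past, national_poll):
    """Scale a party's past district share by its national change ratio.

    Assumes stable relative territorial strength: the ratio between the
    district share and the national share is the same as in the past election.
    """
    if national_past <= 0:
        raise ValueError("national_past must be positive")
    return district_past * (national_poll / national_past)


# Hypothetical inputs (percent of votes), not real election data.
past_districts = {"District A": 30.0, "District B": 20.0, "District C": 10.0}
national_past = 20.0   # party's national share in the past election
national_poll = 25.0   # party's current national polling average

projected = {
    name: project_district_share(share, national_past, national_poll)
    for name, share in past_districts.items()
}

for name, share in projected.items():
    print(f"{name}: {share:.1f}%")
```

In this sketch the party gains a quarter of its national support, so every district share grows by the same factor and the ratios between districts are unchanged. Applied to every party at once, the projected shares in a district would not necessarily sum to 100, so a real implementation would probably need a rescaling step, which is left out here.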

Votes to party in the 2018 Chamber districts

Tuesday, 27 February 2018

Twitter: frbailo
