Il successo della Lega, i media e le “crisi” migratorie

La crescita di Salvini e della Lega è forse per la politica italiana l’evento più significativo del 2018. Nel gennaio 2018, prima delle elezioni di marzo, la Lega di Salvini era intorno al 12-13%. Alla fine del 2018 la Lega era stimata sopra al 30%. Un guadagno di quasi 20 punti percentuali in 12 mesi.

Fig 1. La crescita della Lega (media mobile dei sondaggi, 30 giorni)

(more…)

Sunday, 8 December 2019

Il voto per le europee a Milano

La geografia socio-politica delle grandi città italiane del centro-nord è radicalmente cambiata negli ultimi 25 anni. Se osserviamo la distribuzione dei voti a Milano tra i partiti alle elezioni europee del 1994, tenutesi pochi mesi dopo la straordinaria vittoria elettorale di Silvio Berlusconi nel Marzo dello stesso anno in cui Forza Italia ottenne il 21% e il Polo delle Libertà più quasi il 43%), vediamo un chiaro spostameno da destra verso il centro-sinistra e il PD.

Voti assegnati ai partiti nelle elezioni europee del 1994 e 2019.

(more…)

Wednesday, 12 June 2019

How to sync your Zotero library (and files) with WebDAV

In this post, I explain how to use an online file storing and sharing service like AARNet’s CloudStor (but any WebDAV service will do) to access and update your Zotero library from different computers.

(more…)

Sunday, 10 March 2019

recogeo: A new R package to reconcile changing geographic boundaries (and corresponding variables)

Demographics information is usually reported in relation to precise boundaries: administrative, electoral, statistical, etc. Comparing demographics information reported at different point in time is often problematic because boundaries keep changing. The recogeo package faciliates reconciling boundaries and their data by a spatial analysis of the boundaries of two different periods. In this post, I explain how to install the package, reconcile two spatial objects and check the results.

(more…)

Friday, 1 February 2019

Are you parallelizing your raster operations? You should!

If you plan to do anything with the raster package you should definitely consider parallelize all your processes, especially if you are working with very large image files. I couldn’t find any blog post describing how to parallelize with the raster package (it is well documented in the package documentation, though). So here my notes.
(more…)

Thursday, 17 January 2019

How to (quickly) enrich a map with natural and anthropic details


In this post I show how to enrich a ggplot map with data obtained from the Open Street Map (OSM) API. After adding elevation details to the map, I add water bodies and elements identifying human activity. To highlight the areas more densely inhabitated, I propose to use a density-based clustering algorithm of OSM features.

(more…)

Thursday, 9 August 2018

The two alternatives to the monasterisation of the World wide web

Saint Michael’s Abbey, in the Susa Valley, Piedmont. Source: Wikipedia.

In Medieval Europe, information was physically concentrated in very few secluded libraries and archives. Powerful institutions managed them and regulated who could access what. The library of the fictional abbey that is described in Umberto Eco’s The Name of the Rose is located in a fortified tower and only the librarian knows how to navigate its mysteries. Monasteries played an essential role in preserving written information and creating new intelligence from that knowledge. But being written information a scarce resource, with the keys to libraries came also authority and power. Similarly, Internet companies are amassing information within their fortified walls. In so doing, they provide services that we now see as essential but they also contravene the two core principles of the Internet: openness and decentralisation.

(more…)

Monday, 7 May 2018

Local participation and not unemployment explains the M5S result in the South

The abundance of economic data and the scarcity of social data with a comparable level of granularity is a problem for the quantitative analysis of social phenomena. I argue that this fundamental problem has misguided the analysis of the electoral results of the Five Star Movement (M5S) and its interpretation. In this article, I provide statistical evidence suggesting that — in the South — unemployment is not associated with the exceptional increase in the M5S support and that local participation is a stronger predictor of support than most of the demographics.

What happened

The 2018 Italian general elections (elections, since both the Chamber of Deputies and the Senate, were renewed) saw

  1. a significant increase in the number of votes for two parties, the Five Start Movement (M5S) and the League (formerly Northern League),

and

  1. an increase in the importance geography as an explanatory dimension for the distribution of votes.

The following two maps show where the M5S and the League have increased electoral support from 2013 to 2018. (Electoral data are always data for the election of the Chamber of Deputies).

Vote difference: 2018-2013 (a few communes have not reported all the results, notably Rome)

 

The geographic pattern is quite simple. The M5S has increased its support in the South and maintained its votes in the North, the League has significantly strengthened its support in the North but has also collected votes in the South, where it had virtually no support. The third and the fourth most voted parties, the Democratic Party (PD) and Berlusconi’s Forza Italia (FI), have lost votes almost everywhere. If we map the results of the four parties side-by-side with the same scale, the PD and FI almost faded into the background.

Votes in the 2018 General elections

Yet, major metropolitan areas do not always follow the national trend. If Naples unambiguously voted M5S, Turin, Milan and Rome did saw the Democratic Party as the most voted party in the wealthiest districts.

Votes in the 2018 General elections (Clock-wise from top-left: Turin, Milan, Naples, Rome)

The density of the distribution of results at the commune and sub-commune level in the macro regions indicates that if the M5S electorally dominates in the South and in the two major islands, the League is the most popular party in the North.

Distribution of votes at commune or sub-commune level

The territoriality of the results, especially along the North-South dimension, makes the analysis especially complicated. This because the strong result of the League in the North and of the M5S in the South might simplistically suggest that immigration (which is much stronger in the North) explains the League’s result in the North and unemployment and poverty (stronger in the South) explain the M5S’s result in the South. This reading is especially attractive since immigration and the M5S proposal to introduce a guaranteed minim income have dominated the campaign.

(more…)

Tuesday, 20 March 2018

2018 Italian general election: Details on my simulation

This article describes the simulation behind the app that you find here

This simulation of the results for the 2018 general election is based on the results from the last two national elections (the Italian parliament election in 2013 and the European Parliament election 2014) and national polls conducted until 16 February 2018. The simulation is based on one assumption, which is reasonable but not necessarily realistic: the relative territorial strength of parties is stable. From this assumption derives that if the national support for a party (as measured by national voting intention polls) varies, it varies consistently and proportionally everywhere. A rising tide lifts all boats and vice versa. The assumption has some empirical justification. If we compare the difference from the national support (in percentage) for each district in 2013 and 2014 we see a significant correlation, especially in the major parties.

Votes to party in the 2018 Chamber districts

(more…)

Tuesday, 27 February 2018

Quick analysis of the Italian referendum results

The 2016 Italian referendum torpedoed the constitutional reform presented by the government presided by Matteo Renzi (41). According to the final count, which includes 1.2 million votes cast overseas, the reform was rejected by almost 60% of the voters.

Three parties played a predominant role during the electoral campaign: the ruling Democraric Party (PD), leaded by the chief of government Renzi, the Five Star Movement (M5S), founded and leaded by Beppe Grillo (68), and the Lega Nord (LN), leaded by Matteo Salvini (43). The fourth Italian party, Forza Italia, for different reasons – including the health of Silvio Berlusconi (80) – played a minor role.

(more…)

Monday, 5 December 2016

tweets


Twitter: frbailo

links


blogroll


RSS r-bloggers.com

  • Working with Notion API from R
    When searching for a solution where I could store some flat files as a database, Notion came up. The nice thing about it is that it offers an API to most of its functionality. At the time of this writing this is still in beta, but hopefully it will bec... The post Working with Notion […]
  • rbind in r-Combine Vectors, Matrix or Data Frames by Rows
    rbind in r, In this article, will describe the uses and applications of rbind(), rbind.fill() and bind_rows() functions in R programming. rbind() in R... The post rbind in r-Combine Vectors, Matrix or Data Frames by Rows appeared first on finnstats. The post rbind in r-Combine Vectors, Matrix or Data Frames by Rows first appeared on […]
  • Which Religious Groups Have the Most Sex?
    There has been plenty of discussion about declining fertility rates and patterns of marriage among people in the United States following the news that the US birth rate declined to its lowest since the Great Depression. There are a lot of debates a... The post Which Religious Groups Have the Most Sex? first appeared on […]
  • Class imbalance and classification metrics with aircraft wildlife strikes
    This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just starting out to tuning more complex models with many hyperparameters. I recently participated in SLICED, a competitive data science prediction... The post Class imbalance and classification metrics with aircraft wildlife strikes first appeared on R-bloggers.
  • rOpenSci News Digest, June 2021
    Dear rOpenSci friends, it’s time for our monthly news roundup! You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci! 🔗 rOpenSci HQ 🔗 R-universe Video and resources from our pas... The post rOpenSci News Digest, June 2021 first appeared on R-bloggers.

RSS Simply Statistics

  • Streamline - tidy data as a service
    Tldr: We started a company called Streamline Data Science https://streamlinedatascience.io/ that offers tidy data as a service. We are looking for customers, partnerships and employees as we scale up after closing our funding round! Most of my career, I have worked in the muck of data cleaning. In the world of genomics, a lot of […]
  • The Four Jobs of the Data Scientist
    In 2019 I wrote a post about The Tentpoles of Data Science that tried to distill the key skills of the data scientist. In the post I wrote: When I ask myself the question “What is data science?” I tend to think of the following five components. Data science is (1) the application of design […]
  • Palantir Shows Its Cards
    File this under long-term followup, but just about four years ago I wrote about Palantir, the previously secretive but now soon to be public data science company, and how its valuation was a commentary on the value of data science more generally. Well, just recently Palantir filed to go public and therefore submitted a registration […]

RSS Statistical Modeling, Causal Inference, and Social Science

  • Pittsburgh by Frank Santoro
    Last year we discussed a silly study, and that lead us to this interesting blog by Chris Gavaler, which pointed me to a recent picture storybook, Pittsburgh, by Frank Santoro. The book was excellent. I don’t have any insights to share here; I just wanted to thank Santoro for writing the book and Gavaler for […]
  • Meta-meta-science studies
    August Wartin asks: Are you are familiar with any (economic) literature that attempts to model academia or the labor market for researchers (or similar), incorporating stuff like e.g. publication bias, researcher degrees of freedom, the garden of forking paths etcetera (and that perhaps also discusses possible proposals/mechanisms to mitigate these problems)? And perhaps you might […]
  • She’s thinking of buying a house, but it has a high radon measurement. What should she do?
    Someone wrote in with a question: My Mom, who has health issues, is about to close on a new house in **, NJ. We just saw that ** generally is listed as an area with high radon. If the house has a radon measurement over 4 and the seller puts vents to bring it into […]