The Dirt Floats

The Kosovo Liberation Army (Albanian acronym UÇK) supposedly run, during the conflict of 1999, torture camps in northern Albania. According to an investigation conducted by Altin Raxhimi, Michael Montgomery and Vladimir Karaj and published (here) by the Balkan Investigative Journalism Network at least 18 people were killed in one of those, a factory compound in Kukës, Albania. Eyewitnesses say prisoner were mainly alleged Kosovo Albanian collaborationist. But as well Serbs and Roma were held in the camp.  And women.

Kosovo’s Prime Minister, Hashim Thaçi, who was then the political director of the KLA, and Agim Çeku, former Prime Minister and former chief of the KLA headquarters, told the BBC they were not aware of any KLA prisons where captives were abused or where civilians were held.

The same sources that witnessed the base in Kukës, told us that the interrogators in Kukës were KLA officers who had been involved in the capture of suspected collaborators.
Both our sources concerning the base, identified several KLA officers involved in the abuses at
One of them is currently in a top position in the judicial system in Kosovo.

After ten years, the history of the ex-Yugoslavia conflicts (so far mainly written by journalists) is still incomplete. Because the people who fought those wars are now ruling that very same land (nationalism is still an effective language to speak). And because the Balkans are the very same mirror and unconscious of Europe (Rada Iveković, 1999). The 1990s wars tell Europe where its own states are coming from: murders and  deportations. And Dorian does not like portraits.

Monday, 11 May 2009


Twitter: frbailo




  • Lecture slides: Real-World Data Science (Fraud Detection, Customer Churn & Predictive Maintenance)
    These are slides from a lecture I gave at the School of Applied Sciences in Münster. In this lecture, I talked about Real-World Data Science and showed examples on Fraud Detection, Customer Churn & Predictive Maintenance. Real-World Data Scie...
  • Use foreach with HPC schedulers thanks to the future package
    The future package is a powerful and elegant cross-platform framework for orchestrating asynchronous computations in R. It's ideal for working with computations that take a long time to complete; that would benefit from using distributed, parallel frameworks to make them complete faster; and that you'd rather not have locking up your interactive R session. You […]
  • Feature Selection using Genetic Algorithms in R
    From a gentle introduction to a practical solution, this is a post about feature selection using genetic algorithms in R.
  • Using clusterlab to benchmark clustering algorithms
    Clusterlab is a CRAN package ( for the routine testing of clustering algorithms. It can simulate positive (data-sets with __1 clusters) and negative controls (data-sets with 1 cluster). Why test clustering algorithms? Because they often fail in identifying the true K in practice, published algorithms are not always well tested, and we need to know […]
  • Selecting ‘special’ photos on your phone
    At the beginning of the new year I always want to clean up my photos on my phone. It just never happens. So now (like so many others I think) I have a lot of photos on my phone from … Continue reading →

RSS Simply Statistics

  • How Data Scientists Think - A Mini Case Study
    In episode 71 of Not So Standard Deviations, Hilary Parker and I inaugurated our first “Data Science Design Challenge” segment where we discussed how we would solve a given problem using data science. The idea with calling it a “design challenge” was to contrast it with common “hackathon” type models where you are presented with […]
  • The Netflix Data War
    A recent article in the Wall Street Journal, “At Netflix, Who Wins When It’s Hollywood vs. the Algorithm?” by Shalini Ramachandran and Joe Flint details some of the internal debates within Netflix between the Los Angeles-based content team, which is in charge of developing and marketing new content for the streaming service, and the data […]
  • The Role of Theory in Data Analysis
    In data analysis, we make use of a lot of theory, whether we like to admit it or not. In a traditional statistical training, things like the central limit theorem and the law of large numbers (and their many variations) are deeply baked into our heads. I probably use the central limit theorem everyday in […]

RSS Statistical Modeling, Causal Inference, and Social Science