Tuesday, December 01, 2015

Tiny Bits of Land People Fight Over #1 Rockall

People will fight over any bit of land. "Rockall is about 25 metres (80 ft) wide and 31 metres (100 ft) long at its base[24] and rises sheer to a height of 17.15 m (56.27 ft)" from wikipedia.

A probably fake photo from 1974 of HMS Tartar's trip there. 'A sentry-box was constructed on Hall's Ledge, with two marines in full ceremonial uniform posted alongside, and the Union Flag was hoisted above.'

Every now and again Britain lands some people on this lump and takes a photo to prove it is theres. 'Former SAS member and survival expert Tom McClean lived on the island from 26 May 1985 to 4 July 1985 to affirm the UK's claim to the island'. Waves roll over the island so he had to hide in a bolted down giant coffin for the duration.


They do this partly because owning the Falklands isn't grim enough for them. And partly for all the oil and gas and such that might be between Rockall and Ireland.

Friday, November 20, 2015

Bombing Back to the Stone Age

There is a common meme that is easy to find with a twitter search

If you read Jared Diamond or Stephen Pinker they talk about the really high levels of violence in the stone age.
Or to describe it with statistics

"By many estimates, 10 to 20 percent of all Stone Age humans died at the hands of other people.
This puts the past 100 years in perspective. Since 1914, we have endured world wars, genocides and government-sponsored famines, not to mention civil strife, riots and murders. Altogether, we have killed a staggering 100 million to 200 million of our own kind. But over the century, about 10 billion lives were lived — which means that just 1 to 2 percent of the world’s population died violently. Those lucky enough to be born in the 20th century were on average 10 times less likely to come to a grisly end than those born in the Stone Age. And since 2000, the United Nations tells us, the risk of violent death has fallen even further, to 0.7 percent."


To reduce violence don't send people back to the stone age.

Tuesday, November 17, 2015

How Good Will the Upcoming Stephen King Adaptations be?

This is the third and final post in a series on Stephen King adaptations. (The first is Are Stephen King films better than the books?, and the second Do Stephen King's Better Books Make Better Films?)
According to IMDB the Stephen King Novels below are currently being adapted
Titlegood doubled
The Talisman8.2
Rose Madder7.22
Lisey's Story7.22
Mr. Mercedes7.74
Gerald's Game6.86
Cell7.22
11.22.638.52
The Dark Tower8
Three of these books have exactly the same rating on Goodreads. The correlation between the ratings and his books and his films previously discussed I will use to predict future movies ratings. The analysis gives a 95% confidence prediction intervals of
Titleestimatelower boundupper bound
The Talisman7.56.48.6
Rose Madder5.954.87.0
Lisey's Story5.954.87.0
Mr. Mercedes6.795.78.8
Gerald's Game5.374.26.4
Cell5.954.87.0
11.22.638.056.99.15
The Dark Tower7.26.18.2
The upcoming films with their predicted IMDB ratings are in red in the graph below. Three are predicted to have a rating of 5.95.

There is an interesting 538 podcast here about a company that predicts film earnings. He mentions the correlation between film quality and earnings and film quality. This whole topic is an interesting challenge for prediction.

Stephen King is unique among authors in the number and variety of adaptations his works have gone through. So he is possibly the only author this could be even tried with. I am really looking forward to 11.22.63 and the Talisman now. And if Gerald's Game beats the predicted IMDB score that is a bonus.




Code


mydata = read.csv("King.csv")  
library(ggplot2)
attach(mydata)     # attach the data frame 
king.lm = lm(imdb ~ good.doubled)

Call:
lm(formula = imdb ~ good.doubled)

Coefficients:
 (Intercept)  good.doubled  
      -5.821         1.628  

upcoming = read.csv("upcoming.csv")  
predict(king.lm, upcoming, interval="predict")

> predict(king.lm, upcoming, interval="predict")

Title,Publication date,Pages,imdb,goodreads,good doubled,clr
Carrie,05/04/1974,199,7.4,3.89,7.78,1
Salem's Lot,17/10/1975,439,6.8,3.97,7.94,1
The Shining,28/01/1977,447,8.4,4.12,8.24,1
The Stand,Sep-78,823,7.3,4.32,8.64,1
The Dead Zone,Aug-79,428,7.3,3.88,7.76,1
Firestarter,29/09/1980,426,6,3.8,7.6,1
Cujo,08/09/1981,319,6,3.61,7.22,1
The Running Man,May-82,219,6.6,3.74,7.48,1
Christine,29/04/1983,526,6.6,3.69,7.38,1
Pet Sematary,14/11/1983,374,6.6,3.86,7.72,1
Thinner,19/11/1984,309,5.7,3.6,7.2,1
It,15/09/1986,1138,6.9,4.12,8.24,1
Misery,08/06/1987,310,7.8,4.06,8.12,1
The Tommyknockers,10/11/1987,558,5.4,3.42,6.84,1
The Dark Half,20/10/1989,431,5.9,3.71,7.42,1
Needful Things,Oct-91,690,6.2,3.84,7.68,1
Dolores Claiborne,Nov-92,305,7.4,3.76,7.52,1
The Green Mile,March–August 1996,400,8.5,4.39,8.78,1
Bag of Bones,22/09/1998,529,5.8,3.84,7.68,1
Dreamcatcher,20/03/2001,620,5.5,3.53,7.06,1
Under the Dome,10/11/2009,1074,6.8,3.89,7.78,1
Shawshank Redemption,10/11/2009,181,9.3,4.51,9.02,1
Stand by me,10/11/2009,80,8.1,4.25,8.5,1
The Mist,10/11/2009,230,7.2,3.88,7.76,1
The Langoliers,10/11/2009,230,6.1,3.71,7.42,1
Apt Pupil,1983,179,6.7,3.8,7.7,1
Hearts in Atlantis,2000,640,6.9,3.77,7.54,1
The Talisman,na,na,7.5,4.1, 8.2,2
Rose Madder,na,na,5.95,3.61, 7.22,2
Lisey's Story,na,na,5.95,3.61, 7.22,2
Mr. Mercedes,na,na,6.79,3.87, 7.74,2
Gerald's Game,na,na,5.37,3.43, 6.86,2
Cell,na,na,5.95,3.61, 7.22,2
11.22.63,na,na,8.05,4.26, 8.52,2
The Dark Tower,na,na,7.2,4,8,2


mydata = read.csv("King.csv")
attach(mydata)
p1 <- ggplot(mydata, aes(x=good.doubled, y=imdb)) +
    geom_point(colour = factor(clr),shape=1,size=2) +    # Use hollow circles
    geom_smooth(method=lm,se=FALSE)           
            
p1 <- p1 + ylab("IMDB Ratings")  
p1 <- p1 + xlab("GoodReads Ratings")  
p1 <- p1 + ggtitle("Upcoming Stephen King Adaptations")
p1 <- p1 + annotate("text", x = 6.97, y = 5.26, label = "The Tommyknockers", size=3, colour="blue3")  
p1 <- p1 + annotate("text", x = 7.85, y = 7.42, label = "Carrie", size=3, colour="blue3") 
#Salem's Lot,17/10/1975,439,6.8,3.97,7.94,1
p1 <- p1 + annotate("text", x =8.0 , y =6.9, label = "Salem's Lot", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 7.8, y = 6.5, label = "Pet Sematary", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.34, y = 8.51, label = "The Shining", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.62, y = 8.2, label = "Stand By Me", size=3, colour="blue")
p1 <- p1 + annotate("text", x = 8.9, y = 8.4, label = "The Green Mile", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 9.0, y = 9.09, label = "Shawshank\nRedemption" , size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.75, y = 7.3, label = "The Stand", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.27, y = 6.85 , label = "It", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.19, y = 7.74, label = "Misery", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.05, y = 6.7, label = "Under the Dome", size=3, colour="blue3") 
#Under the Dome,10/11/2009,1074,6.8,3.89,7.78,1
p1 <- p1 + annotate("segment", x = 7.9, xend = 7.79, y = 6.7, yend = 6.8, colour = "blue3") 
#Dolores Claiborne,Nov-92,305,7.4,3.76,7.52,1
p1 <- p1 + annotate("text", x = 7.5, y = 7.3, label = "Dolores Claiborne", size=3, colour="blue3") 
#The Dark Half,20/10/1989,431,5.9,3.71,7.42,1
p1 <- p1 + annotate("text", x = 7.5, y = 5.81, label = "The Dark Half", size=3, colour="blue3")
#Bag of Bones,22/09/1998,529,5.8,3.84,7.68,1
p1 <- p1 + annotate("text", x = 7.78, y = 5.7, label = "Bag of Bones", size=3, colour="blue3")
#The Dead Zone,Aug-79,428,7.3,3.88,7.76,1
p1 <- p1 + annotate("text", x = 7.5, y = 7.72, label = "The Dead Zone", size=3, colour="blue3")
p1 <- p1 + annotate("segment", x = 7.76, xend = 7.5, y = 7.33, yend = 7.66, colour = "blue3") 
#The Mist,10/11/2009,230,7.2,3.88,7.76,1
p1 <- p1 + annotate("text", x = 7.76, y = 7.1, label = "The Mist", size=3, colour="blue3")
#Firestarter,29/09/1980,426,6,3.8,7.6,1
p1 <- p1 + annotate("text", x = 7.71, y = 6, label = "Firestarter", size=3, colour="blue3")
#The Langoliers,10/11/2009,230,6.1,3.71,7.42,1
p1 <- p1 + annotate("text", x = 7.56, y = 6.11, label = "The Langoliers", size=3, colour="blue3")
#Cujo,08/09/1981,319,6,3.61,7.22,1
p1 <- p1 + annotate("text", x = 7.23, y = 6.1, label = "Cujo", size=3, colour="blue3")
#The Running Man,May-82,219,6.6,3.74,7.48,1
p1 <- p1 + annotate("text", x = 7.48, y = 6.7, label = "The Running Man", size=3, colour="blue3")
#Christine,29/04/1983,526,6.6,3.69,7.38,1
p1 <- p1 + annotate("text", x = 7.38, y = 6.5, label = "Christine", size=3, colour="blue3")
#Thinner,19/11/1984,309,5.7,3.6,7.2,1
p1 <- p1 + annotate("text", x = 7.25, y = 5.6, label = "Thinner", size=3, colour="blue3")
#Needful Things,Oct-91,690,6.2,3.84,7.68,1
p1 <- p1 + annotate("text", x = 7.83, y = 6.2, label = "Needful Things", size=3, colour="blue3")
#Dreamcatcher,20/03/2001,620,5.5,3.53,7.06,1
p1 <- p1 + annotate("text", x = 7.2, y = 5.5, label = "Dreamcatcher", size=3, colour="blue3")
#The Talisman,na,na,7.52,4.1, 8.2,2
p1 <- p1 + annotate("text", x = 8.36, y = 7.52, label = "The Talisman", size=3, colour="red3")
#Rose Madder,na,na,5.93,3.61, 7.22,2
p1 <- p1 + annotate("text", x = 7.07, y = 5.93, label = "Rose Madder", size=3, colour="red3")
#Lisey's Story,na,na,5.93,3.61, 7.22,2
p1 <- p1 + annotate("text", x = 7.07, y = 6.05, label = "Lisey's Story", size=3, colour="red3")
#Mr. Mercedes,na,na,6.78,3.87, 7.74,2
p1 <- p1 + annotate("text", x = 7.60, y = 6.81, label = "Mr. Mercedes", size=3, colour="red3")
#Gerald's Game,na,na,5.34,3.43, 6.86,2
p1 <- p1 + annotate("text", x = 7.03, y = 5.36, label = "Gerald's Game", size=3, colour="red3")
#Cell,na,na,5.93,3.61, 7.22,2
p1 <- p1 + annotate("text", x = 7.28, y = 5.92, label = "Cell", size=3, colour="red3")
#11.22.63,na,na,8.05,4.26, 8.52,2
p1 <- p1 + annotate("text", x = 8.63, y = 8.05, label = "11.22.63", size=3, colour="red3")
#The Dark Tower,na,na,7.2,4,8,2
p1 <- p1 + annotate("text", x = 8.16, y = 7.2, label = "The Dark Tower", size=3, colour="red3")
p1 <- p1 + ylab("IMDB Ratings")  
p1 <- p1 + xlab("GoodReads Ratings")
p1
ggsave("plot.png", width=10, height=10, dpi=100)
detach(mydata)

Friday, November 13, 2015

Do Stephen King's Better Books Make Better Films?

My last post (and data) got a bit popular on reddit. Some people noticed that Stephen King films ratings and the books ratings seemed highly correlated.

I think Max is right on this. So I made a graph showing how movie and book ratings are correlated

Now the actual correlation figure is


 cor(good.doubled, imdb, use="complete")
[1] [1] 0.8766 so it looks like highly rated King books make highly rated films

It could be that a good film makes people read and rate highly a book. But my basic conclusion is Stephen King's highly rated books make higher rated films















Appendix: Code for the Graph


mydata = read.csv("King.csv")
attach(mydata)
cor(good.doubled, imdb, use="complete")
p1 <- ggplot(mydata, aes(x=good.doubled, y=imdb)) +
    geom_point(shape=1) +    # Use hollow circles
    geom_smooth(method=lm,   # Add linear regression line
                se=FALSE)    # Don't add shaded confidence region
p1 <- p1 + ylab("IMDB Ratings")  
p1 <- p1 + xlab("GoodReads Ratings *2")  
p1 <- p1 + ggtitle("Stephen King: Books vs. Movies Correlation")
p1 <- p1 + geom_text(aes(label=ifelse(good.doubled>0,as.character(Title),'')),hjust=0,just=0,size=2, position = "jitter")
p1
ggsave("correlate.png")





Other Correlations I changed the publication data column into a column of Years since 1974.


 cor(good.doubled, Years, use="complete")
[1] -0.1879052 Not a strong correlation about Kings adapted books get better or worse since he started
> cor(good.doubled, Pages, use="complete")
[1] -0.02715659 no relationship between the length of a book and it being rated highly or lowly.
> cor(Years, Pages, use="complete")
[1] 0.482048 King's adapted books may be getting a bit longer over time

Wednesday, November 11, 2015

Are Stephen King films better than the books?

Rejoice, Snobs: The Book IS Better Than The Movie
The literary originals have higher ratings than the film adaptations 74 percent of the time
. This recent piece claims books are usually better than their films versions.

This article reminded me of a claim that Stephen King films are better than his books. So I extracted the ratings from IMDB and Goodreads. I doubled the Goodreads ratings to make them out of 10.


I included miniseries. Carrie was made twice but I only included the original. The Langoliers, Maximum Overdrive, Lawnmower man and Secret garden seem not to have independent English language printed book versions.

 

The Shining is one of two films with a higher rating than the book. King famously hates the film

"Are you mystified by the cult that's grown around Kubrick's Shining?
I don't get it. But there are a lot of things that I don't get. But obviously people absolutely love it, and they don't understand why I don't. The book is hot, and the movie is cold; the book ends in fire, and the movie in ice."

 

To make this a fair test films and book would have to be graded on a curve. By what the average rating of each is. There is a good discussion on how to normalise the original comparison here. But I think this graph is enough to show Stephen King books are better than his films.

Less than 8% of King adaptations are rated higher compared to 26% usually for book adaptations. To get King to have the average quality of adaptations, sixish Stephen King Films being rated better than is books. This would require Dolores Claiborne, The Green Mile, Stand By Me, Misery and maybe Carrie to be much worse books or these already highly rated films to be much better. I am going to call this Myth Busted*, Stephen King adaptations are less successful than the average adaptation.

 

TitlePublication datePagesimdbgoodreadsgood doubled
CarrieApril 5, 19741997.43.897.78
'Salem's LotOctober 17, 19754396.83.977.94
The ShiningJanuary 28, 19774478.44.128.24
The StandSep-788237.34.328.64
The Dead ZoneAug-794287.33.887.76
FirestarterSeptember 29, 198042663.87.6
CujoSeptember 8, 198131963.617.22
The Running ManMay-822196.63.747.48
ChristineApril 29, 19835266.63.697.38
Pet SemataryNovember 14, 19833746.63.867.72
ThinnerNovember 19, 19843095.73.67.2
ItSeptember 15, 198611386.94.128.24
MiseryJune 8, 19873107.84.068.12
The TommyknockersNovember 10, 19875585.43.426.84
The Dark HalfOctober 20, 19894315.93.717.42
Needful ThingsOct-916906.23.847.68
Dolores ClaiborneNov-923057.43.767.52
The Green MileMarch–August 19964008.54.398.78
Bag of BonesSeptember 22, 19985295.83.847.68
DreamcatcherMarch 20, 20016205.53.537.06
Under the DomeNovember 10, 200910746.83.897.78
Shawshank Redemption19821819.34.519.02
Stand by me1982808.14.258.5
The Mist19832307.23.887.76
Apt Pupil19831796.73.87.7
Hearts in Atlantis20006406.93.777.54

   
 mydata = read.csv("King.csv")  
 library(ggplot2)  
 p1 <- ggplot(mydata, aes(x = good.doubled, y = imdb))  
 p1 <- p1 + geom_abline(intercept=0, slope=1)  
 p1 <- p1 + geom_point(shape=1)  
 p1 <- p1 + ylim(5, 10)  
 p1 <- p1 + xlim(5, 10)  
 
 p1 <- p1 + ylab("IMDB Ratings")  
 p1 <- p1 + xlab("GoodReads Ratings *2")  
 p1 <- p1 + ggtitle("Stephen King: Books vs. Movies")  

p1 <- p1 + annotate("text", x = 7.3, y = 5.3, label = "The Tommyknockers", size=3, colour="blue3")  
p1 <- p1 + annotate("text", x = 7.95, y = 7.37, label = "Carrie", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8, y = 6.5, label = "Pet Sematary", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 7.9, y = 8.54, label = "The Shining", size=3, colour="red3") 
p1 <- p1 + annotate("text", x = 8.75, y = 9.4, label = "Shawshank Redemption", size=3, colour="red3")
p1 <- p1 + annotate("text", x = 9.18, y = 8.4, label = "The Green Mile", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.72, y = 8, label = "Stand By Me", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.92, y = 7.3, label = "The Stand", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.3, y = 6.85 , label = "It", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.32, y = 7.7, label = "Misery", size=3, colour="blue3") 
p1 <- p1 + annotate("text", x = 8.18, y = 6.7, label = "Under the Dome", size=3, colour="blue3") 

p1 <- p1 + annotate("text", x = 9.28, y = 5.0, label = "Books Higher Rated", size=5, colour="blue3") 
p1 <- p1 + annotate("text", x = 5.7, y = 9.90, label = "Films Higher Rated", size=5, colour="red3") 



p1 <- p1 + annotate("text", x = 6.9, y = 6, label = "Cujo", size=3, colour="blue3") 
p1 <- p1 + annotate("segment", x = 7.05, xend = 7.2, y = 6.0, yend = 6.0, colour = "blue3") 


p1 <- p1 + annotate("text", x = 6.45, y = 5.52, label = "Dreamcatcher", size=3, colour="blue3") 
p1 <- p1 + annotate("segment", x = 6.8, xend = 7.05, y = 5.5, yend = 5.5, colour = "blue3") 


p1 <- p1 + annotate("text", x = 6.80, y = 7.4, label = "Dolores Claiborne", size=3, colour="blue3")
p1 <- p1 + annotate("segment", x = 7.28, xend = 7.5, y = 7.4, yend = 7.4, colour = "blue3") 

p1 <- p1 + annotate("text", x = 7.92, y = 5.72, label = "The Dark Half", size=3, colour="blue3")
p1
ggsave("plot.png")
*This needs proper statistical significance testing so not really busted.

Tuesday, November 03, 2015

Odd Mathematical Patents #1: Gömböc

The Gömböc is a self righting shape. A bit like a turtle that will always rotate back to standing up if placed on its back. This shape was patented under US D614077 S1 and a cease and destist has been send to someone giving plans to 3D print one.
Prof. Domokos has politely requested that I remove this file. I have respectfully declined to do so, as I do not believe it violates any of their established legal rights, and I believe it may have actual value for researchers interested in the Gömböc and mono-monostatic bodies in general.

Gömböc's are cool. And the inventors were very ingenious. You can buy one from the patent holders here. Patenting a shape though strikes me as weird.

Monday, October 12, 2015

Stacked Area Chart In Javascript

A stacked area chart of transatlantic slave trace made with c3.js and the transatlantic slave trade database

The graph shows the counts for each region the slaves were disembarked by year. The html and javascript for this graph is on a gist here

The data used can be gotten from the slavevoyages.org website with the filters beloew

An actual graph you can play with is here and in the result tab below

Saturday, October 10, 2015

Danny Boyle films

Danny Boyle films revolve around a group of friends who find X. And how X affects their relationship.

Where for each film X is

Shallow Grave Bag of Money
Trainspotting Bag of heroin
The Beach A map
Millions Bag of money
Sunshine A spaceship
28 Days Later... An outpost
Slumdog Millionaire A gameshow

I can't haven't seen so do not know what X is in A Life Less Ordinary, 127 Hours, Steve Jobs or Trance

Friday, October 09, 2015

The Man in the White Suit

The man in the white suit is about a scientist who invents clothes that never wear out and never need washing. It is from 1951 and deals Alec Guinness finds himself hated by all. Both the capital owning class who realise they will have no clothes to sell anymore. And the poor labourers who won't have work making cloth, clothes or washing them.

All these 1950s cloth making jobs have left Britain in the decades since. Hans Rosling pointed out that the washing machine in common use since then is one of the best invention ever for freeing us from drudgery.

Worry about how computers will get rid of all the jobs are common now. Race against the machine is one good book on the topic. But the washer women in The Man in the White Suit are not in Britain anymore. Though much of the clothes making has just moved country without making the people doing it much richer.

We now in Europe live the terrifying world imagined in the Man in the White Suit. Where you can clothe yourself in pennies for very little. Wash your clothes for little effort of money. And it is not that bad. Technological progress is not bad. As Stephen Hawking points out it is not the technology but who the benefits go to

If machines produce everything we need, the outcome will depend on how things are distributed. Everyone can enjoy a life of luxurious leisure if the machine-produced wealth is shared, or most people can end up miserably poor if the machine-owners successfully lobby against wealth redistribution. So far, the trend seems to be toward the second option, with technology driving ever-increasing inequality.

Tuesday, October 06, 2015

Golf Courses Versus Book Shops

"Between 1990 and 2010, the number of 18-hole golf courses in Ireland exploded from 213 to 352."

Searching Golden Pages gives

300 results for Booksellers

397 results for Golf Course

It looks to me that there are more golf courses than book shops in Ireland.

Friday, October 02, 2015

Pouring one out for your Ancient Greek Homies

Hecuba mother of Hector and Paris in the Iliad pours one out for her homies

Stay, till I bring the cup with Bacchus crown'd,

In Jove's high name, to sprinkle on the ground,

And pay due vows to all the gods around

Pouring some of your alcoholic beverage on the ground is an ancient and common across many cultures. It is a practice called Libation.

The other word for today is Adoxography "fine writing on a trivial or base subject". The best adoxography books ive read recently are Red: The history of red hair, Paper an elegy and The Phone Book.

Thursday, September 24, 2015

Penrose's Law

Penrose's Law[3][4] states that the population size of prisons and psychiatric hospitals are inversely related, although this is generally viewed as something of an oversimplification
That was thought up by from Lionel Penrose in 1939 (he was Roger Penrose's father). And it still seems broadly accurate
During 1960–2004, there was a 74% population-adjusted decrease in mental institution beds and a 52% increase in the prison population. The same period saw a 500% increase in overall crime and a 900% increase in violent crimes, with a concurrent 94% increase in the size of the country's police force. Penrose's law proved remarkably robust in the longitudinal perspective

Sunday, September 20, 2015

Irish Suicide Statistics

There is an interesting article in 20th Sept issue of Irish Daily Mail on Sunday by Alison O'Reilly on the emotional difficulties priests in Ireland have.

The figures given claim priests are at a much greater risk of suicide than the general public

However if looking at who priests are I do not think this is the case. According to the CSO here

"The age-standardised death rate from suicide was 12.1 deaths per 100,000 in 2011" not the 5.12 figure in the article.

Also Males are at a higher risk anyway. "male suicide rates were five times higher at 20.5 deaths per 100,000"

And Older males at a higher risk still. "male suicide rates were highest in the 45-64 age-group (28 per 100,000)". The average age of an Irish priest is around 65.

Just by their gender and age profile priests 35ish to the general population 28 per 100,000 do not seem that different.

Finally priests are unmarried which is a well known correlate of suicide. To take one paper Suicide and marital status in Northern Ireland "Never marrying increased male suicide risk and its effect increased with age IRR among over 55 year-olds = 2.33". 2.33 * 20.5 base =47 per 100k

I had not realised quite how bad it was for those bachelor farmers in terms of suicide. This (greater than 28) rate compares with 1.2 for murder in Ireland.

The story here is not how bad suicide is for priests but how bad the problem is for all single older men in Ireland.

Sunday, September 13, 2015

Japan Loves Faxes

In Japan, faxes are still used extensively for cultural and graphemic reasons and are available for sending to both domestic and international recipients from over 81% of all convenience stores nationwide.

Coupland explains that he thinks the Japanese addressing system is the cause of its love of faxes.

Graphemic reasons are that typing was never really popular in Japan. Partly because of the three alphabets and partly just because of an appreciation of calligraphy.

Another reason is the use of forms that do not really fit electronic formats

But I love the explanation that the Japanese addressing system has lead to an 80s technology getting popular and it staying alive

Monday, June 15, 2015

When did we get too much stuff?

Kevin Kelly pointed out that we have more types of things (species of technology) than even the most wealthy had in the past. "count the number of species of technology in our household. And it came up with 6,000 different species of products. I did some research and found out that the King of England, Henry VIII, had only about 7,000 items in his household. And he was the King of England, and that was the entire wealth of England at the time." Kevin Kelly

How much more stuff can we get over time? The Argos catalog seems to get bigger each year and it seemed to me an ideal way to measure the amount of stuff we get to decide we don't want.

Retromash is a great site for old Argos catalogs. I Counted the total page numbers in each years catalog. I don't have the slaves undergraduates needed to count everything in the actual catalog. Retromash don't have much after 1990 so I went to ebay and they list the number of pages. This produces this data (with links to the ebay sources). One weirdness is there seemed to have been an instore catalog and superstore (to be delivered) catalog for a few years.

I have tried to give the number of pages of the Autumn/Winter Argos catalog you get in the shop in any given year.

The graph is just a simple ggplot2 scatterplot

mydata = read.table("argos.tsv", header=TRUE)

ggplot(mydata, aes(x=Year, y=pages)) + geom_point(shape=1) + geom_smooth(method=lm, se=FALSE)

ggsave(file="argos.png")

It looks like there are nine times the number of things we can choose now from Argos as there was in 1975. To put a prediction on this, the amount of things you can buy nearby doubles ever decade.

Tuesday, May 26, 2015

Sports in decline

I read this interesting article in the diminishing interest in surfing. Can Anyone Save the Surf Industry?. The author uses

All the outdoor individual sports seem to have declining interest as measured by google trends. This could be to do with how people search google. But it does look like they are getting less popular.

The good news is Golf also seems to be in the decline

Tuesday, May 19, 2015

Uhaul Map of the US

You rent a Uhaul truck in New York to move to a new job in Texas. UHaul will be left with a truck in Texas. If people really want to go New York -> Texas but less so Texas-> New York they will reduce the price of the Texas -> New York. If you get all the prices to move one way between all the cities in the US you end up with a good idea of where people are moving. And as people usually move for jobs where the jobs are.

The idea I saw first in Marginal Revolution. This blogpost seems to be where the whole UHaul weighted graph idea came from. Dan Armstrong and Páll Hilmarsson
I took a list of the 34 most populous US cities (all over 500,000) from wikipedia. This is 1122 links in total. The 294 cities is 86142 total links. You only seem to be able to get containers not trucks from Honolulu, Hawaii.
This is a Complete graph where each edge has a length and a weight/capacity (price). some cities are really cheap to leave because enough people are moving (sinking) there that UHaul want to get the trucks back to the cities people are leaving (source) The extreme costing trips are

Source Destination Price
San Jose, CA Washington, DC 4237
San Francisco, CA Baltimore, MD 4188
San Jose, CA Baltimore, MD 4181
San Jose, CA Washington, DC 4132
Baltimore, MD Washington, DC 74
Washington, DC Baltimore, MD 79
You can get the spreadsheet with all 1122 trips here

The trips with the biggest difference between one way and another are by price

Source Destination Round dif Round ratio
San Jose, CA Washington, DC 2404 2.3
San Francisco, CA Washington, DC 2345 2.3
Philadelphia, PA Portland, OR 2213 3

and by ratio

San Francisco, CA Las Vegas, NV 608 4.1
Source Destination Round dif Round ratio
San Jose, CA Las Vegas, NV 580 4.1
San Francisco, CA Las Vegas, NV 608 4.1
Philadelphia, PA Jacksonville, FL 1301 3.9

The spreadsheet with these calculation is here. the code to work all this out is pretty raw but it is here.
I will come back to this later and work out Eigenvalue Centrality and maybe how distance relates to prices. Also it would be interesting to see if some places are summer sinks and some winter sinks in a few months time.

Sunday, April 19, 2015

England's Temperature in 2014

2014 was UK's hottest year on record, says Met Office. What does that mean on a day to day basis? England has a collected data on daily temperature from 1772 in the Hadley Centre Central England Temperature (HadCET) dataset.

According to wikipedia, last year was the hottest year in this dataset with an average temperature of 10.94 °C.

I downloaded this Hadley Centre dataset. And I followed this tutorial. Based on an original graphic by Tufte. The picture is similar to the one from my post on Dublin's 2014 Weather.


Looking at the black line that represents last years temperatures it was the winter and autumn that were far above average. Instead of a scorching hot summer the three record hot days were in October and December. No day in 2014 was the coldest for that date in the recorded time.

The wikipedia page on this dataset shows the average temperature of this dataset rising over time

Saturday, April 18, 2015

Peak Loom Band

I was talking to the owner of a craft shop today. She told me she knew of many people who were stuck with stock from the loom band craze of last year. Waterford Whispers reported it as "Parents Urged To Hack Off Any Child’s Arm That Comes In Contact With Loom Bands".


A Google Trends shows the craze's lifespan



By the end of summer all that was left was a bin of tiny plastic hoops at bargain prices. Are crazes lasting less time now?


The Age Of Earthquakes points out




Crazes end. This is not like the Tulip or the South Seas Bubble. I don't remember hearing people would get rich selling loom band to each other. Unlike the way Beanie babies were sold as an investment. The sort of thing you could split up during a divorce



BTW she thinks Parachute Cord will be the next craze

Sunday, April 12, 2015

Photographing your food and everything else

Who was the first person to photograph all their meals? It seems a reasonably common thing now "26 out of 45 customers spend an average of 3 minutes taking photos of the food."
First Camera, Then Fork discussed the taking photos of your food phenomena but doesn't describe who initiated it.
But who was the first person to start photographing all their meals? Or indeed all their activities. Obviously its much cheaper for us to do now with smartphones but was there an eccentric Victorian lord who photographed everything? Life loggers have been around since the 1990s. I'd be surprised if none of them deliberately photographed their food at the time.
“Tell me what you eat, and I will tell you what you are.” Jean Anthelme Brillat-Savarin in 1825.

If food photos are much more popular how about all photos? How many photos are taken a day and how does that compare with in the past?

Facebook Photo uploads total 300 million per day
Instagram averages 70 million photos a day
Flickr 1 million photo shares a day
Photobucket says "More than 2.25 million images are shared daily"
There is a buzzfeed page on photo stats How Many Photos Have Been Taken Ever?
It’s estimated only a few million pictures were taken in the 80 years before the first commercial camera was introduced.
"By 1930, about a billion photos were taken a year." So in three days more photos are being uploaded onto facebook/instagram than then taken in the world in the 1930s.
"By 1970, about 10 billion photos were taken a year." Or a month of photos being put up onto facebook/instagram.
This article "This is What the History of Camera Sales Looks Like with Smartphones Included" has a great graph that explains why this huge increase in photo taking.


Friday, April 03, 2015

Smoky cooking is worse than war

Over 4 million people die prematurely from illness attributable to the household air pollution from cooking with solid fuels.
that is according the WHO.

Compare that to other things we worry more about
Road traffic injuries 1,274845
Suicide 844 460
War 70,000
Murder 430,000

'the number of war deaths has also plummeted. In the 1950s, there were almost 250 deaths caused by war per million people. Now, there are less than 10 per million' With 7 billion people in the world that would be 70,000 per year. '437,000 people murdered worldwide in 2012'
The war statistic is a bit misleading as things go be very quiet until a big one kicks off.

Still the idea that instead of sitting around camp fires singing peace protest songs we should sing campfire protest songs was surprising to me.

Thursday, April 02, 2015

Farage's Claims about HIV in the UK

Labout leader Ed Miliband called out UKIP leader Nigel Farage here on a hiv claim


The claim is reported here as
Nigel Farage has lashed out at NHS treatment for foreign HIV patients – branding them “health tourists”. The UK Independence Party leader made the claims in ITV’s leadership debate this evening. He said: “Here’s a fact, and I’m sure other people will be mortified that I dare to talk about it. “There are 7000 diagnoses in this country every year for people who are HIV positive, but 60 percent of them are not for British nationals.

To break this down "In 2012, there were 6,360 new diagnoses of HIV". "There were 6,000 people (4,480 men and 1,520 women) newly diagnosed with HIV in the UK in 2013". The 7000 figure sounds high. Lets see what data is produced to back this up.

"60 percent of them are not for British nationals."
Thanks to Roma in the comments who pointed out 'new diagnoses reported among people born in the UK ... to 46% (2,220/4,980))' from the 2014 report. This is not a nationality figure. But it is pretty close to one. Some born in the UK won't be British Nationals and a fair few born abroad will be. A graph from here shows




Implying from ethnicity to nationality does not really work I think. But it does make the 60% claim look like it needs more evidence. The figure for Asians and Black Caribbean's is about 500 or under 10%.
Another vaguely nationality source of data I can find is from HIV in the United Kingdom: 2013 Report Appendix 1 total men+women African born 11,100+20,700 out of 98,400 which is about a third of HIV carries in the UK seem to be African born heterosexuals. Again with the assumption that all African born people are not British nationals.

To me these claims, of 7000 new cases and 60% of them not British Nationals, look dubious but I am open to evidence correcting me.

Monday, March 23, 2015

Areas of Ireland that have only ever had white male TDs

There is a great map on the mirror website in the article 'The UK map of white male power' of constituencies in the UK that have only ever had white male MPs

I thought it would be interesting to see what the same map would look for Ireland. The wikipedia page for Irish Women TD's is here, from it I took the list of constituencies that have been represented by Female TDs.

Ireland has not had many non white TDs. Moosajee Bhamjee is the first non white TD as far as I know. Leo Varadkar's father is Indian. And I am probably missing many more but historical demographics suggest non white Irish politicians were rare. We had a low immigrant population until recently. To keep the graph comparable to the UK one I keep the same not 'white male' filter. Please correct me with anyone I am missing.

Taking distinct constituencies from the Female TD's wikipedia page. And a map of Irish constituencies now from here. I coloured in the constituencies I could find female TDs for in green.


For some old constituencies, like Cork Mid, I could not work out where they were. One ex-constituency Limerick City–Lmk East had a female deputy, Kathleen O'Callaghan in 1921. I took the map of where this constituency used to cover when she represented it and turned that area in the east part of Limerick-West green.
Map of Limerick City–Lmk East in 1921

Another Dublin North has completely changed its location. When Margaret Collins-O'Driscoll represented it in 1923 it covered what is now Dublin North-East. So this area is also coloured green.


The constituencies I could not find a female TD for were: Cork South-West, Limerick West and Louth. The constituencies wikipedia listed as having had a female TD are Carlow–Kilkenny, Cavan–Monaghan, Clare, Clare–Galway South,  Cork East, Cork Mid, Cork North–Central, Cork North–East, Cork North–West, Cork South–Central, Donegal North–East,   Donegal South–West,   Dublin Ballyfermot, Dublin Central, Dublin County Mid, Dublin Mid, Dublin North, Dublin North–Central,   Dublin North–West,   Dublin South, Dublin South–Central,   Dublin South–East,   Dublin South–West, Dublin St Patrick's, Dublin West, Dún Laoghaire, Galway West,   Kildare North,   Laois–Offaly,   Limerick City–Limerick East,   Limerick East, Longford–Westmeath,   Meath East,   Monaghan, Roscommon, Roscommon–Leitrim,   Sligo–Leitrim,   Tipperary North,   Tipperary South, Waterford,   Waterford County,   Westmeath,   Wexford, Wicklow .

This map looks more diverse than the UK one to me. But the proportional representation system Ireland has means we have larger constituencies with a few Members of Parliament in each. This means the same proportion of non white male Members of Parliament should cover a bigger area.
I will update this map with any corrections people give me in the comments.

Wednesday, March 04, 2015

Pandemics and the Internet

My last point pointed out how many people died in the flu pandemic in Ireland in 1918.

To take the example of Japan according to Gapminder 1918 had a huge drop. The other big drop is the is the second world war with all the bomb dropping and shooting that involved.

David Eagleman has an interesting point here about how the internet can help prevent and reduce epidemics.

"The internet can be our key to survival because the ability to work telepresently can inhibit microbial transmission by reducing human-to-human contact. In the face of an otherwise devastating epidemic, businesses can keep supply chains running with the maximum number of employees working from home. This can reduce host density below the tipping point required for an epidemic. If we are well prepared when an epidemic arrives, we can fluidly shift into a self-quarantined society in which microbes fail due to host scarcity."

Eagleman has a good short video on his thesis

The long term effects of a sudden switch to everyone avoiding each other for a month or two could be huge. These would include

Education: How schools help spread influenza has been studied. 'School closures during the 2009 influenza pandemic: national and local experiences'. If all the schools were closed for a few months and people would move to Khan Academy and other online education sites. After this period a switch back to a fully non online world won't happen

Telecommuting: In a similar way online telecommuting would become much more popular. After a quarantine lite period the use of online project management and other telecommuting tools would become mainstream.

Shopping: If you don't meet people in school or at work you meet them in the shops. Deliveries of shopping would be strongly encouraged in the event of a pandemic. They should probably even be sponsored. Shops would not get as popular again once everyone got used to online shopping.

Banking: No one likes queuing in the banks at the best of times. Even ATMs would become horrible grubby in a pandemic world. Everything including social welfare payments would try and avoid using the fomite that is cash.

Telemedecine: People with the influenza need to be kept away from people who are sick. People with other illnesses will have to be dealt with remotely to avoid them coming into contact with people with influenza.

Public Events: Public events parades, cinemas, bars and museums would be closed. By their nature these involve people. If public events are made cheaper to attend virtually that will reduce the need for people to meet up. By this I mean if Sky Sports is made free for a few months people will be less annoyed no fans are allowed attend the football game.

There are many people without access to the internet that would not be helped by the use of digital technologies. Hopefully the use of digital technologies will help focus more of the traditional public health effort on them.

When the next pandemic happens the internet will reduce the consequences. Many industries will also change but the main thing is to avoid the 50 to 100 million the last pandemic killed.

Tuesday, March 03, 2015

Tree Rings and Life Expectancy

Andy Kirk here has an interesting blog post on dendrochronology and visualisation literacy.
Here is an example of a tree ring visualisation showing how over time the tree grows and leaves down rings.







I am going to visualise another time series expected lifespan.
Gapminder uses a line graph to visualise life expectancy over time. I downloaded the life expectancy data from gapminder.































The interesting points here are the famine where the life expectancy dropped from an estimated 38.3 to 14.1. Also the 1918 flu epidemic causes an obvious drop from 55.3 in 1917 to 49.68 and back to 55.8 in 1919.
I use this data to create a graph using the code below. The idea is like tree rings except that instead of each line laid down in a particular year each line represents the life expectancy in that year.









































The size of each ring should be a good representation on the number of years people could expect to live in that year. However I just multiplied the years given by Gapminder *6 to give the number of pixels each circles radius should be. A proper visualisation has to be more careful not to distort the number than this. Roughly, living twice as long should look like a tree that is twice as big.

The code to create this graph in a canvas element of a webpage if here. So what do you think, does this visualisation show increase in lifespan in the last 200 years well?

Saturday, February 28, 2015

What Colour are Books?

What colour are famous books?

Colours Used I counted up the occurrence of the
colours = ["red","orange","yellow","green","blue","purple","pink","brown","black","gray","white", "grey"]
in Ulysses by James Joyce. I'll post the word count code soon

red 113, orange 12, yellow 50, green 98, blue 82, purple 17, pink 21, brown 59, black 146, gray 2, white 163, grey 68

Turned this count into a barchart with r package ggplot2 graphing package

library(ggplot2)
df <- data.frame(colours = factor(c("pink","red","orange","yellow","green","blue","purple", "brown", "black", "white", "grey"), levels=c("pink","red","orange","yellow","green","blue","purple","brown", "black", "white", "grey")),
                 total_counts = c(21.0, 113.0,12.0, 50.0, 98.0, 82.0, 17.0, 59.0, 146.0,163.0,70.0))
colrs = factor(c("pink","red","orange","yellow","green","blue","purple", "brown", "black", "white", "grey"))

bp <- ggplot(data=df, aes(x=colours, y=total_counts)) + geom_bar(stat="identity",fill=colrs)+guides(fill=FALSE)
bp + theme(axis.title.x = element_blank(), axis.title.y = element_blank())+ ggtitle("Ulysses Color Counts")
bp 

There is a huge element of unweaving the rainbow in just counting the times a colour is mentioned in a book. The program distills “The sea, the snotgreen sea, the scrotumtightening sea.” into a single number. Still I think the ability to quickly look at the colour palette of a book is interesting.

The same picture made from the colours in Anna Karenina by Leo Tolstoy, Translated by Constance Garnett


Translations
Translations produce really funny graphs with this method. According to Jenks@GreekMythComix the ancient Greeks did not really use colours in the same abstract way we did. Things were not 'orange' so much as 'the colour of an orange'. The counts in the Alexander Pope translation of the Iliad are
red 36, yellow 11, green 16, blue 9, purple 43, brown 4, black 69, gray 1, white 25, grey 6

Because colours are not really mentioned in the original Iliad these sorts of graphs could be a quick way to compare translations. Google book trends does not seem to show increased use of these colours overtime.

Sunday, February 22, 2015

2014 Weather Visualizations

There is a great tutorial by Brad Boehmke here on how to build a visualization of temperature in one year compared to a dataset. The infographic is based on one by Tufte

Met Eireann have datasets going back to 1985 on their website here. Some basic data munging on the Met Eireann set for Dublin Airport and I followed the rstats code from the tutorial above to build the graphs below. Wexford would be more interesting for Sun and Kerry for Rain and Wind but those datasets would not download for me.

The first is a comparison of the temperature in 2014 compared to the same date in other years.

Next I looked at average wind speed

And finally the number of hours of sun

These visualizations doesn't look like 2014 was a particularly unusual year for Irish weather. With 30 years of past data if weather was random (which it isn't) at random around 12 days would break the high and low mark for most of these measures. Only the number of sunny days beat this metric. The data met.ie gives contains every day since 1985

maxtp: - Maximum Air Temperature (C)

mintp: - Minimum Air Temperature (C)

rain: - Precipitation Amount (mm)

wdsp: - Mean Wind Speed (knot)

hm: - Highest ten minute mean wind speed (knot)

ddhm: - Mean Wind Direction over 10 minutes at time of highest 10 minute mean (degree)

hg: - Highest Gust (knot)

sun: - Sunshine duration (hours)

dos: - Dept of Snow (cm)

g_rad - Global Radiation (j/cm sq.)

i: - Indicator

Gust might be an interesting one given the storms we had winter 2014. I put big versions of these pictures here, here and here.

Wednesday, February 18, 2015

When were Wodehouse's stories set?

They seem to be sometime before the first world war. But I have never figured out when Wodehouse's books take place. From Something New by P.G. Wodehouse "Whoever carries this job through gets one thousand pounds.” Ashe started. “One thousand pounds–five thousand dollars!” “Five thousand.” Looking at historical exchange rates at www.measuringworth.com the rate stayed close to 1901s $4.87 up until the book was published in 1915. Because exchange rates did not change much at the time they do not help work out when a book was set.

Monday, February 09, 2015

Ancient Death Counts from Poems

What killed you in an ancient battle? Could we look at ancient epics for clues as to what killed people in fights at the time?

Pinker's better Angels of our Nature talks about how archeologists look at bones to see evidence of violent injuries that lead to death. The book talks about examinations of ancient bones unearthed in peat bogs and on long-forgotten battlefields. This bone examination will not tell us about injuries to people that do not cut bones.

The epic poems include the Iliad, Beowulf and the Táin. They were passed down from Bards who memorised them and travelled from place to place reciting them. Some recent research suggests that these epics may have some basis in history. The social network described for the characters usually resembles one real people would have. The social network between characters in Homer’s Odyssey is remarkably similar to real social networks today. That suggests the story is based, at least in part, on real events, say researchers. 'They discovered that while the networks associated with Beowulf and the Iliad had many of the properties of real social networks, the network associated with Tain was less realistic. That led them to conclude that the societies described in the Iliad and Beowulf are probably based on real ones, whereas the Tain appears more artificial.'

There is a site that examines and lists the deaths in the Iliad here. I extracted from there counts for each mentioned body part killed or wounded someone*.

head 21 
jaw 2 
cheek 1 
ear 1 
eye 1 
mouth 1 
nose 1 
skull 1  

neck 12 
throat 3  
  
collar 1 
chest 17 
shoulder 7 
collar bone 2 
nipple 1 
ribs 1 1 of these wound
 
arm 4 3 of these wound
hand 1 1 of these wound
  
back 11 
buttock 2 
  
gut 10 1 of these wound
stomach 5 
liver 3 
 
side 6 
  
thigh 2 1 of these wound
hip 1 
knee 1 
leg 1 
foot 1 1 of these wound
 
groin 2 
testicles 1

I totalled these by body region

Head  29
Neck  15
Upper Body 29
Arm  5
Back  13
Lower Body 18
Side  6
Leg  6
Groin  3 
Using Color brewer to pick out colours I made bins of 5
25-30 RGB 153,0,13
20-25 RGB 203,24,29
15-20 RGB 239,59,44
10-15 RGB 251,106,74
5-10  RGB 252,146,114
1-5   RGB 252,187,161
0     RGB 0,0,0
And I made this into this weird picture. I got the drawing from here. And the idea from Greek myth comix.

Any translation will have disagreements so the original source or as close as we can get to it should be used. Ian Johnson's is the basis for these counts.

Upper body counts for 73 of the deaths: arm, back, legs and lower body count for only 51. But gut, liver and stomach (and maybe buttock) do account for 18 deaths which seems like modern archeology could miss. For example many bog bodies seem to have been ritually killed which may have involved more beheading then the standard violent death.

It would be interesting to do a similar count with the other epic poems to see if liver injury is as common in them or whether that relates to Greek culture.

Anyway please comment what you think about this sort of quantitative analysis of stories that are meant to be entertainment. Can they tell us anything about the ancient world?

*Alcmaon's death I left out as no specific part is named.

Wednesday, February 04, 2015

Irish Alcohol Consumption in 2020

Drink blitz sees bottle of wine rise to €9 minimum 'Irish people still drink an annual 11.6 litres of pure alcohol per capita, 20pc lower than at the turn of the last decade. The aim is to bring down Ireland's consumption of alcohol to the OECD average of 9.1 litres in five years' time.'

What would Irish alcohol consumption be if current trends continue? Knowing this the effectiveness of new measures can be estimated.

The OECD figures are here. I put them in a .csv here.The WHO figures for alcohol consumption are here I loaded the data in R Package

datavar <- read.csv("OECDAlco.csv")

attach(datavar)

plot(Date,Value,

     main="Ireland Alcohol Consumption")
Which looks like this

Looking at that graph alcohol consumption rose from the first year we have data for 1960 until about 2000 and then started dropping. So if the trend since 2000 continued what would alcohol consumption be in 2020?

'Irish people still drink an annual 11.6 litres' I would like to see the source for this figure. We drank 11.6 litres in 2012 according to the OECD. I cannot find OECD figures for 2014. In 2004 we drank 13.6L the claimed 20pc reduction of this is 10.9L, not 11.6L. Whereas the 14.3L we drank in 2002 with a 20pc reduction would now be 11.4. This means it really looks to me like the Independent were measuring alcohol usage up to 2012.

Taking the data since 2000 until 2012.

newdata <- datavar[ which(datavar$Date > 1999), ]

detach(datavar)

attach(newdata)

plot(Date,Value,

     main="Ireland Alcohol Consumption")

cor(Date,Value)

The correlation between year and alcohol consumption since 2000 is [1] -0.9274126. It look like there is a close relationship between the year and the amount of alcohol consumed in that time. Picking 2000, near the peak of alcohol consumption, as the starting date for analysis is arguable. But 2002 was the start of this visible trend in reduced alcohol consumption.

Now I ran a linear regression to predict based on this data alcohol consumption in 2015 and 2020.

> linearModelVar <- lm(Value ~ Date, newdata)
> linearModelVar$coefficients[[2]]*2015+linearModelVar$coefficients[[1]]
[1] 10.42143
> linearModelVar$coefficients[[2]]*2020+linearModelVar$coefficients[[1]]
[1] 9.023077
> 
This means based on data from 2000-2012 we would expect people to drink 10.4 litres this year. Reducing to drinking 9 litres in 2020. So with current trends Irish alcohol consumption will be lower than 'the aim is to bring down Ireland's consumption of alcohol to the OECD average of 9.1 litres in five years'.

There could be something else that is going to alter the trend. One obvious one would be a glut of young adults. People in their 20 drink more than older people. If there are a higher proportion of youths about then the alcohol consumption will rise all else being equal. So will there be a higher proportion of people in their 20s in 5 years time?

The population pyramids projections for Ireland are here. Looking at these there seems to have been a higher proportion of young adults in 2010 than there will be in 2020 which would imply lower alcohol consumption

it would be interesting to see the data and the model that the prediction of Irish alcohol consumption are based on. And to see how minimum alcohol pricing changes the results of these models. But without seeing those models it looks like the Government strategy is promising current trends to continue in response to a new law.