Panama Papers as you have never seen it
As global markets expand and become more interconnected, businesses are increasingly looking for resources to help identify competitive and profitable opportunities. Several data leakages in the last years have shown that a common approach to this is the creation of offshore companies, i.e. companies created in low-tax, offshore jurisdictions. Our goal is to analyze motivating factors for creating such offshore entities. We believe that a better understanding of the reasons can help find ways to deal with those tendencies. This has an impact on the social good, because fiscal prudence and opennesss in international trade can have a powerful effect on improving society.
Our analysis is based on data provided from the Offshore Leaks Database [1]. It contains information about 500,000 offshore companies, foundations and trusts including links to people and companies in more than 200 countries and territories. The information comes from the Panama Papers, the Offshore Leaks and the Bahamas Leaks investigations cunducted in the years 2013 to 2016. While those data leaks contain diverse sorts of documents from emails to bank documents, the database provides only structured overview information excluding the raw files themselves. The latest investigation on the Paradise Papers, which was released in November 2017 is not included, in our analysis.
In order to better understand the underlying structures of the offshore businesses, we analyze the available data on a country level. We identify the most involved countries and try to find factors that characterize them. To this aim, we enrich the dataset with information about the economical and social background of countries from the Index of Economic Freedom[2]. Furthermore, we investigate how the different countries are connected and how their presence in the offshore business evolved in the last 35 years. In particular, we want to see whether the publication of the leaks influenced investment behavior.
Before we present our findings, we want to clarify some of the used terminology:
Above are two plots describing the number of entities opened throughout the years in both the tax haven " jurisdictions " and the origin/destination countries. The first thing that caught our attention is the amount of missing data, in particular, the entities without a registered origin in the Bahamas. Despite the loss of information, this shows that the Bahamas is a country of special interest and worth being investigated. Looking at the tax havens we can see a strong presence of British overseas territories and Crown Dependencies such as British Virgin Islands, Cayman Islands, and British Anguilla, alongside British Commonwealth territories like the Bahamas and Cook Islands. Most of the countries heavily involved in the scheme have some sort of financial secrecy. A detailed look into countries according to their secrecy and the scale of their offshore financial activities can be found in the Financial Secrecy Index [2]. Now looking at the number of entities opened from origin countries, we can see the prescence of a significant number of tax havens. Those were interpreted as entities that were terminated and then re-established again at a certain period of time in the same country, or a movement of entities from one haven to another.
As the number of offshore entities increase, the number of tax havens involved increases. Of course this is only a small portion of the data available in the real world but it can be seen that the countries mostly invest in Panama and British Virgin Islands. It would have been truly interesting to see the true distribution of the entities registered in Bahamas. Although Paradise papers
Trying to investigate if origin countries with high entity count are economically similar, we applied principal component analysis on the Index of Economic Freedom using only the data of the countries that are involved in the leak. This data is divided into 5 main categories ( Rule of law, Government size, Regulatory efficiency, Open markets, and Monetary measures) each of which is sub-divided into more detailed features. We can clearly see that most of the top 12 origin countries ( with respect to entity count ) ( colored blue ) are on the left-side of the x-axis and close to each other. This indicates that those countries indeed have similar economical factors. It is necessary to note that not ONLY the countries with great economic standing are the ones that invest the most, but also the ones with mediocre overall economy have a great contribution. However, this may be due to the fact that this data only points to a fraction of what really is out there.
Let us now take a closer look at how the different countries are connected. We measure the connectedness of two countries by the number of offshore entities coming from one country and founded in the other.
To begin with, we want to see if there is a pattern in the way players in origin countries select special countries for their offshore accounts. Therefore, we cluster the origin countries into groups with similar selection information using k-means clustering.
The selection patterns of the four resulting clusters are visualized in the matrices below. Each row corresponds to an origin country and each column to a goal country. The color of a cell indicates for the corresponding origin country the relative frequency of offshore entities that where founded in the corresponding goal country.
And indeed it is easy to see a pattern that characterizes the countries which are in the same cluster: Cluster 0 contains those countries where the majority of offshore entities are founded in the British Virgin Islands. Cluster 1 contains the countries with the largest number of offshore entities in Panama, cluster 2 those countries with a majority of entities in the Seychelles. The countries in cluster 3 show more diverse distributions of destination countries. However there are still interesting patterns. For example, for many countries in this cluster the main destination of entities is the country itself, see for example the Cook Islands or Samoa.
Now that we know that the countries can be categorized by the way jurisdictions for offshores are selected, an obvious question to ask is what causes those different structures. In other words, we want to know how the countries that are in the same cluster are similar to each other and different to the coutries in other clusters. This in turn could help us to find the underlying factors that motivate the selection of destination countries.
Countries belonging to the same cluster show similar trends on the size of their movements, both for incorporations and inactivations. As shown in Figure 3.A, all the clusters share a similar trend regarding the creation of new offshore accounts. More precisely, the majority of countries experienced a peak of movements in the years preceding 2007, with some particular differences. For both clusters 2 and 4, the countries within them tend to share the same trend regarding the incorporation of offshores and slightly different trends regarding the inactivation of them, as shown in Figure 3.B. Cluster 1, which is also the cluster with the highest number of registered accounts is composed of countries that are acting almost independently between them:
We (Federico Pucci, Sarah Sallinger, Mazen Fouad A-wali Mahdi) are three Master's students at EPFL (École polytechnique fédérale de Lausanne). This data story was created as outcome of a project for the fall 2017 edition of the Applied Data Analysis course at EPFL.
[1] The International Consortium of Investigative Journalists. Offshore Leaks Database. Retrieved November 1, 2017.
[2] The Heritage Foundation. Index of Economic Freedom. Retrieved December 18, 2017.
[3] Florez, F. REST Countries. Retrieved December 18, 2017.
Contact us and we'll get back to you within 24 hours.