Inside the Leak!

Panama Papers as you have never seen it

A brief introduction

As global markets expand and become more interconnected, businesses are increasingly looking for resources to help identify competitive and profitable opportunities. Several data leakages in the last years have shown that a common approach to this is the creation of offshore companies, i.e. companies created in low-tax, offshore jurisdictions. Our goal is to analyze motivating factors for creating such offshore entities. We believe that a better understanding of the reasons can help find ways to deal with those tendencies. This has an impact on the social good, because fiscal prudence and opennesss in international trade can have a powerful effect on improving society.

Our analysis is based on data provided from the Offshore Leaks Database [1]. It contains information about 500,000 offshore companies, foundations and trusts including links to people and companies in more than 200 countries and territories. The information comes from the Panama Papers, the Offshore Leaks and the Bahamas Leaks investigations cunducted in the years 2013 to 2016. While those data leaks contain diverse sorts of documents from emails to bank documents, the database provides only structured overview information excluding the raw files themselves. The latest investigation on the Paradise Papers, which was released in November 2017 is not included, in our analysis.

In order to better understand the underlying structures of the offshore businesses, we analyze the available data on a country level. We identify the most involved countries and try to find factors that characterize them. To this aim, we enrich the dataset with information about the economical and social background of countries from the Index of Economic Freedom[2]. Furthermore, we investigate how the different countries are connected and how their presence in the offshore business evolved in the last 35 years. In particular, we want to see whether the publication of the leaks influenced investment behavior.


Terminology

Before we present our findings, we want to clarify some of the used terminology:

  • Offshore entity: A company, trust or fund created in a low-tax, offshore location.
  • Jurisdiction: By jurisdiction we mean a territory over which authority is exercised. In the context of offshore entities it refers to the country in which the offshore is founded, i.e. the tax haven.
  • Incorporation and Inactivation: Those two terms refer to the process of founding and closing an offshore entity, respectively.
  • Net amount: The difference between the number of incorporations and the number of inactivations is referred to as net amount.


Most involved Countries and jurisdictions


Number of Entities by Jurisdiction

Number of Entities by Origin Country

Interesting insights

Above are two plots describing the number of entities opened throughout the years in both the tax haven " jurisdictions " and the origin/destination countries. The first thing that caught our attention is the amount of missing data, in particular, the entities without a registered origin in the Bahamas. Despite the loss of information, this shows that the Bahamas is a country of special interest and worth being investigated. Looking at the tax havens we can see a strong presence of British overseas territories and Crown Dependencies such as British Virgin Islands, Cayman Islands, and British Anguilla, alongside British Commonwealth territories like the Bahamas and Cook Islands. Most of the countries heavily involved in the scheme have some sort of financial secrecy. A detailed look into countries according to their secrecy and the scale of their offshore financial activities can be found in the Financial Secrecy Index [2]. Now looking at the number of entities opened from origin countries, we can see the prescence of a significant number of tax havens. Those were interpreted as entities that were terminated and then re-established again at a certain period of time in the same country, or a movement of entities from one haven to another.





Where do the most involved countries invest at?

As the number of offshore entities increase, the number of tax havens involved increases. Of course this is only a small portion of the data available in the real world but it can be seen that the countries mostly invest in Panama and British Virgin Islands. It would have been truly interesting to see the true distribution of the entities registered in Bahamas. Although Paradise papers were not analyzed, it is worth to note that more than 70 percent of the new records belong to entities incorporated in Bermuda and the Cayman Islands.







Let's look at their Economical Factors

Trying to investigate if origin countries with high entity count are economically similar, we applied principal component analysis on the Index of Economic Freedom using only the data of the countries that are involved in the leak. This data is divided into 5 main categories ( Rule of law, Government size, Regulatory efficiency, Open markets, and Monetary measures) each of which is sub-divided into more detailed features. We can clearly see that most of the top 12 origin countries ( with respect to entity count ) ( colored blue ) are on the left-side of the x-axis and close to each other. This indicates that those countries indeed have similar economical factors. It is necessary to note that not ONLY the countries with great economic standing are the ones that invest the most, but also the ones with mediocre overall economy have a great contribution. However, this may be due to the fact that this data only points to a fraction of what really is out there.





Extricating the connections

Let us now take a closer look at how the different countries are connected. We measure the connectedness of two countries by the number of offshore entities coming from one country and founded in the other.
To begin with, we want to see if there is a pattern in the way players in origin countries select special countries for their offshore accounts. Therefore, we cluster the origin countries into groups with similar selection information using k-means clustering.
The selection patterns of the four resulting clusters are visualized in the matrices below. Each row corresponds to an origin country and each column to a goal country. The color of a cell indicates for the corresponding origin country the relative frequency of offshore entities that where founded in the corresponding goal country.


And indeed it is easy to see a pattern that characterizes the countries which are in the same cluster: Cluster 0 contains those countries where the majority of offshore entities are founded in the British Virgin Islands. Cluster 1 contains the countries with the largest number of offshore entities in Panama, cluster 2 those countries with a majority of entities in the Seychelles. The countries in cluster 3 show more diverse distributions of destination countries. However there are still interesting patterns. For example, for many countries in this cluster the main destination of entities is the country itself, see for example the Cook Islands or Samoa.


Now that we know that the countries can be categorized by the way jurisdictions for offshores are selected, an obvious question to ask is what causes those different structures. In other words, we want to know how the countries that are in the same cluster are similar to each other and different to the coutries in other clusters. This in turn could help us to find the underlying factors that motivate the selection of destination countries.


What's their geographical location?

Countries ISO 3166-1 numeric code is obtained through Restcountries API.

The first thing we consider as a possible factor is geographical closeness. To this means we draw a map where the color of each country represents its cluster. Countries colored in white do not occur in the database and therefore do not belong to any cluster. We observe that most South American countries (Brasil being the most apparent exception) have the majority of offshore entities in Panama (Cluster 1). In Northern America and the UK, most offshore entities are founded on the British Virgin Islands. So indeed geographical closeness might be a factor. Other interesting observations include that most of the countries in Cluster 2 are African. The only exceptions are Bulgaria and the Seychelles (which is also the most popular tax haven in these countries).




Clusters are not sharing economical factors

We investigate which influence the economic factors specified in the Index of Eonomic freedom data set have on the assignment to the clusters. There are several ways to tackle this question. One approach we took was to fit a multinomial logit model which predicts the cluster from the economical factors. As economical factors we considered the gross domestic product (GDP), the GDP growth rate, government expenditures, the infalation, foreign direct investment (FDI) inflow and the public debt. We then interpreted the coefficients of the classifier. It turned out that none of the mentioned factors have a significant influence. Another approach is to do a principal component analysis (PCA). A PCA considering the three main factors can be visulized as follows, where every point corresponds to a country and the color indicates its cluster.
Similarly to the previous analysis, it is difficult to see a pattern in this. Therefore, we conclude that the economic factors we analyzed have no influence.




So what?

Note that in general this kind of reasoning only enables us to restrict the set of possible factors, while it does not enable us to identify the relevant factors with certainty. This would require more background research by experts. We still think that the map is a good basis to detact interesting tendencies. For example, starting from there one could try to see whether countries belonging to the Commonwealth of nations belong to the same cluster. We observe that the UK, India and Australia are in the same cluster. However, Canada is not. One can think of many questions like this considereing the historical and political background of countries.


Network of countries

The network below describes if there is an established connection between two countries. As expected the countries present at the inner core of the network are the countries with the most number of connections which naturally are the tax havens. As we move from the inner core of the network to the outer core we can notice that the number of connections is decreasing but yet significant, and those are the origin countries that are most involved in the data. At the shell, the network represents the countries with the least amount of connections and can also be interpreted as the countries that are least involved. Keep in mind that a connection here only resembles that there is a link between those two countries and doesn't tell us anything about the number of entities entrenched. It is still valid that a country with one connection may have a large number of entities but that is not the general trend. As mentioned previously, we have seen cases where there is local activity within a country and those were terminated from the graph due to them being self loops, hence the presence of some nodes without any connections.


About this project

Check us on Github

We (Federico Pucci, Sarah Sallinger, Mazen Fouad A-wali Mahdi) are three Master's students at EPFL (École polytechnique fédérale de Lausanne). This data story was created as outcome of a project for the fall 2017 edition of the Applied Data Analysis course at EPFL.


References

[1] The International Consortium of Investigative Journalists. Offshore Leaks Database. Retrieved November 1, 2017.
[2] The Heritage Foundation. Index of Economic Freedom. Retrieved December 18, 2017.
[3] Florez, F. REST Countries. Retrieved December 18, 2017.

Visit our website

qr_code

Contact us and we'll get back to you within 24 hours.