Centroid
The centroid of a country (or area) can de defined in multiple ways. On the earth surface it is more useful to speak of the geographical center. It is possible to find lists of these centers for area's on earth.
One quickly realises that these centers are not (always) good enough. Either they are in a location far away from population centers, or they fall in between a set of islands, etc. So we need to come up with a better estimate.
Any better estimate must reflect what we intend to achieve in the first place. We are talking about transportation and its impact. So we talk about distances, mode of transportation, the amount transported and the impact of it all. Any centroid should take this into account.
If we know only the country a product is bought in, we can either assumes the worst or the best. The worst is the maximum impact and the best the minimum impact. The minimum impact will imply transportation from the most centered location in a country.
What is now the most centered location? The distances to all other locations should be small as possible. And the usage of these distances should be as small as possible. This will take into account the population sizes of all the locations.
This seems to be a Weber problem. This problem is not solvable exactly, but must be solved iteratively. The locations are then the towns and cities in a country. And the weights are defined by the population sizes of each towns. And the distances are defined by the distances by road.
In practice it will be quite difficult to solve this for all towns in an area. But can we do it for the largest towns? And how many towns should we incorporate?
Recipe
Using the above considerations we can setup a recipe:
- Largest cities - find the largest cities (by population size) for a country. Star with the 5 largest cities;
- Population - list the population size for each of the largest cities, and calculate the fraction of the total population of all largest cities;
- distances - calculate the distances between all the largest cities. We can use the directions of OpenStreetMap for this;
- weighted distance - calculate the weighted distances, i.e. the distance times the city fraction;
- weighted distance sum - add all the weighted fractions for a specific city (the centroid city);
- minimum weighted distance sum - find the weighted distance sum that is the lowest. The corresponding city is the centroid city;
- add cities - more cities can be added to see whether the found centroid city is correct. For this recommence on step 1;
Result
This recipe has been tried on the following countries:
Country | Centroid city | spreadsheet |
---|---|---|
Belgium | Brussels | link |
France | Paris | link |
Norway | Skien | link |
This approach of finding a centroid seems to give good results. For Belgium there is not a large difference with the geographic centroid, which is expected for a densely populated country. For Norway it is much closer to the population area's.