International trade data: why doesn't it add up?

There are dozens of official sources of data on international trade. We write this post because, if you compare these different sources, you will find that they do not agree with one another. Even if you focus on what seems to be the same indicator for the same year in the same country, discrepancies are large.

For example, for China in 2010, the estimated total value of goods exports was $1.48 trillion according to World Bank Data, but it was $1.58 trillion according to WTO Data. That's a difference of about 7%, or a hundred billion US dollars.

Such differences between sources can also be found for rich countries, where statistical agencies tend to follow international reporting guidelines more closely. For Italy, for example, Eurostat figures for the value of exported goods in 2015 are 10% higher than the merchandise trade figures published by the OECD.

And there are also large bilateral discrepancies within sources. According to IMF data, for example, the value of goods that Canada reports exporting to the US is almost $20 billion more than the value of goods that the US reports importing from Canada.

In this blog post we explain how international trade data is collected and processed, and why there are such large discrepancies.

What data is available?

The data hubs from several large international organizations publish and maintain extensive cross-country datasets on international trade. Here's a list of the most important ones:

In addition to these sources, there are also many other academic projects that publish data on international trade. These projects tend to rely on data from one or more of the sources above; and they typically process and merge series in order to improve coverage and consistency. Three important sources are:

How large are discrepancies between sources?

In the visualization below we compare the data published by several of the sources listed above, country by country, from 1955 until today.

For each country, we exclude trade in services, and we focus only on estimates of the total value of exported goods, expressed as shares of GDP.4

As we can clearly see in this chart, different data sources often tell very different stories. And this is true, to varying degrees, across all countries and years. You can use the option labeled 'change country', at the bottom of the chart, to focus on any country.

Constructing this chart was demanding. It required downloading trade data from many different sources, collecting the relevant series, and then standardising them so that the units of measure and the geographical territories were consistent.

All series, except the two long-run series from CEPII and NBER-UN, were produced from data published by the sources in current US dollars, and then converted to GDP shares using a single source of GDP data (World Bank).5
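The conversion to GDP shares can be sketched in a few lines. This is a minimal illustration and not the code used to build the chart: the function name is hypothetical, the export values are the two China 2010 figures quoted earlier in this post, and the GDP figure is a placeholder rather than an official estimate.

```python
def exports_share_of_gdp(exports_usd: float, gdp_usd: float) -> float:
    """Goods exports as a percentage of GDP, both in current US dollars."""
    if gdp_usd <= 0:
        raise ValueError("GDP must be positive")
    return 100.0 * exports_usd / gdp_usd


# Export values for China in 2010 as quoted earlier in this post.
exports_usd_by_source = {
    "World Bank": 1.48e12,
    "WTO": 1.58e12,
}

# Placeholder GDP figure for illustration only (NOT an official estimate).
gdp_usd_placeholder = 6.0e12

# The same GDP denominator is applied to every source.
shares = {
    source: exports_share_of_gdp(value, gdp_usd_placeholder)
    for source, value in exports_usd_by_source.items()
}

for source, share in shares.items():
    print(f"{source}: {share:.1f}% of GDP")
```

Because every series is divided by the same GDP figure, any gap that remains between the resulting shares comes from the export values themselves and not from the denominator.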

So, if all series are in the same units (share of national GDP), and they all measure the same thing (value of goods exported from one country to the rest of the world), what explains the differences?

Let's dig deeper to understand what's going on.

Why doesn't the data add up?

Differences in guidelines used by countries to record and report trade data

Broadly speaking, there are two main approaches used to estimate international merchandise trade:

Under these two approaches, it is common to distinguish between 'traded merchandise' and 'traded goods'. The distinction is often made because goods simply being transported through a country (i.e. goods in transit) are not considered to change the stock of material resources of a country, and are hence often excluded from the narrower concept of 'merchandise trade'.

Adding to the complexity, countries often rely on their own measurement protocols. These are developed alongside the approaches and concepts described above, which are not perfectly compatible to begin with. In Europe, for example, countries use the 'Compilers guide on European statistics on international trade in goods'.

Measurement error and other inconsistencies

Even when two sources rely on the same broad accounting approach, discrepancies arise because countries fail to adhere perfectly to the protocols.

In theory, for example, the exports of country A to country B should mirror the imports of country B from country A. But in practice this is rarely the case because of differences in valuation. According to the BPM6, imports and exports should be recorded in the balance of payments accounts on a 'free on board (FOB) basis', which means using prices that include all charges up to placing the goods on board a ship at the port of departure. Yet many countries stick to FOB values only for exports, and use CIF values for imports (CIF stands for 'Cost, Insurance and Freight', and includes the costs of transportation).7

The chart below gives you an idea of how large import-export asymmetries are. It shows, for each partner country, the value of goods that the US reports importing from that country minus the value of goods that the country reports exporting to the US. For China, for example, the figure in the chart corresponds to the “Value of merchandise imports in the US from China” minus “Value of merchandise exports from China to the US”.

The differences in the chart below, which are both positive and negative, suggest that there is more going on than differences in FOB vs CIF values. If all asymmetries were coming from CIF-FOB differences, then we should only see positive values in the chart (recall that, unlike FOB values, CIF values include the cost of transportation, so CIF values are larger).
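The sign argument can be made concrete with a small sketch. Everything here is hypothetical: the function name, the two partners and their flows are invented for illustration, and treating the CIF-FOB margin as a flat factor of 1.06 is an assumption (it follows the spirit of the 6 percent rule mentioned in the footnotes, but the exact way that rule is applied is not specified in this post).

```python
def mirror_gap(imports_reported: float, exports_reported: float,
               cif_fob_factor: float = 1.0) -> float:
    """Importer-reported imports minus exporter-reported exports.

    With cif_fob_factor > 1, imports are first converted from CIF to an
    approximate FOB value by dividing by the factor.
    """
    if cif_fob_factor <= 0:
        raise ValueError("cif_fob_factor must be positive")
    return imports_reported / cif_fob_factor - exports_reported


# Assumption: CIF values are about 6% above FOB values.
ASSUMED_CIF_FOB_FACTOR = 1.06

# Hypothetical flows in billions of US dollars:
# (imports reported by the importer, exports reported by the partner).
hypothetical_flows = {
    "Partner X": (106.0, 100.0),  # gap fully explained by the CIF margin
    "Partner Y": (90.0, 100.0),   # gap has the "wrong" sign for CIF-FOB
}

gaps = {
    partner: {
        "raw": mirror_gap(imports, exports),
        "adjusted": mirror_gap(imports, exports, ASSUMED_CIF_FOB_FACTOR),
    }
    for partner, (imports, exports) in hypothetical_flows.items()
}

for partner, gap in gaps.items():
    print(f"{partner}: raw {gap['raw']:+.1f}, adjusted {gap['adjusted']:+.1f}")
```

For the first partner the gap disappears once imports are converted to FOB. For the second partner the raw gap is already negative, so no transport-cost adjustment can explain it. This is the pattern that points to other sources of error.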

What else is going on here?

Another common source of measurement error relates to the inconsistent attribution of trade partners. An example is failure to follow the guidelines on how to treat goods passing through intermediary countries for processing or merchanting purposes. As global production chains become more complex, countries find it increasingly difficult to unambiguously establish the origin and final destination of merchandise, even when rules are established in the manuals.8

And there are still more potential sources of discrepancies. Examples include differences in customs and tax regimes, and differences between "general" and "special" trade systems (i.e. differences between statistical territories and actual country borders, which often do not coincide because of things like 'customs-free zones').9

Even when two sources have identical trade estimates, inconsistencies in published data can arise from differences in exchange rates. If a dataset reports cross-country trade data in US dollars, estimates will vary depending on the exchange rates used. Different exchange rates will lead to conflicting estimates, even if figures in local currency units are consistent.
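A short sketch shows how this happens. The local-currency value and both exchange rates below are hypothetical, as is the function name; the two rates stand in for any two conventions that data providers might use, for instance an annual average and an end-of-year rate.

```python
def to_usd(value_lcu: float, lcu_per_usd: float) -> float:
    """Convert a value in local currency units (LCU) to US dollars."""
    if lcu_per_usd <= 0:
        raise ValueError("exchange rate must be positive")
    return value_lcu / lcu_per_usd


# Hypothetical export value, identical in both sources, in local currency.
exports_lcu = 1_000e9

# Hypothetical exchange rates (local currency units per US dollar).
rate_source_a = 6.5
rate_source_b = 6.8

usd_source_a = to_usd(exports_lcu, rate_source_a)
usd_source_b = to_usd(exports_lcu, rate_source_b)

# Relative gap between the two dollar figures, driven only by the rates.
relative_gap = usd_source_a / usd_source_b - 1.0

print(f"Source A: {usd_source_a / 1e9:.1f} billion USD")
print(f"Source B: {usd_source_b / 1e9:.1f} billion USD")
print(f"Relative gap: {relative_gap:.1%}")
```

The relative gap depends only on the ratio of the two exchange rates, so it appears regardless of how well the underlying local-currency figures agree.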

Wrapping up

Asymmetries in international trade statistics are large and they arise for a variety of reasons. These include conceptual inconsistencies across measurement standards, as well as inconsistencies in the way countries apply agreed protocols. Here's a checklist of issues to keep in mind when comparing sources.

These factors have long been recognized by many organizations producing trade data. Indeed, international organizations often incorporate corrections, in an attempt to improve data quality along these lines.

The OECD's Balanced International Merchandise Trade Statistics, for example, uses its own approach to correct and reconcile international merchandise trade statistics.10

The corrections applied in the OECD's 'balanced' series make this the best source for cross-country comparisons. However, this dataset has low coverage across countries, and it only goes back to 2011. This is an important obstacle, since the complex adjustments introduced by the OECD imply we can't easily improve coverage by appending data from other sources. At Our World in Data we have chosen to rely on CEPII as the main source for exploring long-run changes in international trade; but we also rely on World Bank and OECD data for up-to-date cross-country comparisons.

There are two key lessons from all of this. The first lesson is that, for most users of trade data out there, there is no obvious way of choosing between sources. And the second lesson is that, because of statistical glitches, researchers and policymakers should always take analysis of trade data with a pinch of salt. For example, in a recent high-profile report, researchers attributed mismatches in bilateral trade data to illicit financial flows through trade misinvoicing (or trade-based money laundering). As we show here, this interpretation of the data is not appropriate, since mismatches in the data can, and often do, arise from measurement inconsistencies rather than malfeasance.11

Hopefully the discussion and checklist above can help researchers better interpret and choose between conflicting data sources.


  1. For more information on how the COW trade datasets were constructed see: (i) Barbieri, Katherine and Omar M. G. Keshk. 2016. Correlates of War Project Trade Data Set Codebook, Version 4.0. Available at and (ii) Barbieri, Katherine, Omar M. G. Keshk, and Brian Pollins. 2009. “TRADING DATA: Evaluating our Assumptions and Coding Rules.” Conflict Management and Peace Science, 26(5): 471–491. Available at:

  2. The NBER-UN trade data and documentation is available at

  3. Further information on CEPII's methodology can be found at

  4. The chart includes series labeled by the sources as 'merchandise trade' and 'goods trade'. As we explain below, part of the asymmetries in trade data come from the fact that, although 'merchandise' and 'goods' are equivalent in the dictionary, these two terms often measure related but different things.

  5. In the 'Sources' tab in the chart you find a full explanation of how we constructed all series, as well as links to the original raw data.

  6. For example, if there is no change in ownership (e.g. a firm exports goods to its factory in another country for processing, and then re-imports the processed goods) the manual says that statistical agencies should only record the net difference in value. You can find more details about this in this OECD Statistics Briefing.

  7. This issue is actually also a source of disagreement between National Accounts data and customs data. You can read more about it in this report: Harrison, Anne (2013) FOB/CIF Issue in Merchandise Trade/Transport of Goods in BPM6 and the 2008 SNA, Twenty-Fifth Meeting of the IMF Committee on Balance of Payments Statistics, Washington, D.C.

  8. Precisely because of the difficulty that arises when trying to establish the origin and final destination of merchandise, some sources distinguish between national and dyadic (i.e. 'directed') trade estimates.

  9. For more details about general and special trade see:

  10. The OECD approach consists of four steps, which they describe as follows: "First, data are collected and organized, and imports are converted to FOB prices to match the valuation of exports. Secondly, data are adjusted for several specific large problems known to drive asymmetries. Presently these include “modular” adjustments for unallocated and confidential trade; for exports by Hong Kong, China; for Swiss non-monetary gold; and for clear-cut cases of product misclassifications. The list of modules is expected to grow over time. In the third step, adjusted data are balanced using a “Symmetry Index” that weights exports and imports. As the final step, the data are also converted to Classification of Products by Activity (CPA) products to better align with National Accounts statistics, such as in national Supply-Use tables." You can read more about it here: In addition to the OECD, other sources also use corrections. The IMF's DOTS dataset, for example, uses a 6 percent rule for converting import valuations (in CIF) into export values (in FOB). More information can be found at the IMF's (2018) working paper on 'New Estimates for Direction of Trade Statistics'.

  11. For more details on this see Forstater, M. (2018) Illicit Financial Flows, Trade Misinvoicing, and Multinational Tax Avoidance: The Same or Different?, CGD Policy Paper 123, available online at:

Cite this work

Our articles and data visualizations rely on work from many different people and organizations. When citing this article, please also cite the underlying data sources. This article can be cited as:

Esteban Ortiz-Ospina and Diana Beltekian (2018) - “International trade data: why doesn't it add up?” Published online at Retrieved from: '' [Online Resource]

BibTeX citation

    author = {Esteban Ortiz-Ospina and Diana Beltekian},
    title = {International trade data: why doesn't it add up?},
    journal = {Our World in Data},
    year = {2018},
    note = {}

Reuse this work freely

All visualizations, data, and code produced by Our World in Data are completely open access under the Creative Commons BY license. You have the permission to use, distribute, and reproduce these in any medium, provided the source and authors are credited.

The data produced by third parties and made available by Our World in Data is subject to the license terms from the original third-party authors. We will always indicate the original source of the data in our documentation, so you should always check the license of any such third-party data before use and redistribution.
