Abstract
Combining unique, annual, bilateral data on labor flows of highly skilled immigrants for 10 OECD destinations between 2000 and 2012, with new databases comprising both unilateral and bilateral policy instruments, we present the first judicious cross-country assessment of policies aimed to attract and select high-skilled workers. Points-based systems are much more effective in attracting and selecting high-skilled migrants than requiring a job offer, labor market tests, and shortage lists. Offers of permanent residency, while attracting the highly skilled, overall reduce the human capital content of labor flows because they prove more attractive to non-high-skilled workers. Bilateral recognition of diploma and social security agreements foster greater flows of high-skilled workers and improve the skill selectivity of immigrant flows. Conversely, double taxation agreements deter high-skilled migrants, although they do not alter overall skill selectivity. Our results are robust to a variety of empirical specifications that account for destination-specific amenities, multilateral resistance to migration, and the endogeneity of immigration policies.
Introduction
If Europe really wants to have a knowledge based economy, if it wants to play a leading role in innovation and research, if it wants to be competitive in the global economy, it needs to do much more to attract the smartest and the brightest.
Cecilia Malmström, European Union Commissioner (2012)1
Policymakers worldwide, cognizant of the pivotal role that human capital plays in the economic development of receiving nations, increasingly vie to attract “the best and brightest” (Kapur and McHale 2005) in the “global competition to attract high-skilled migrants” (Boeri et al. 2012). At the center of this contest are the countries of the Organisation for Economic Co-operation and Development (OECD), which have historically attracted the largest proportion of high-skilled migrants (Artuç et al. 2014), arguably in part because of the domestic supply of skills falling short of domestic demand (Papademetriou and Sumption 2013). Because high-skilled migrants are motivated to move internationally by myriad factors, however, the efficacy of nation states’ policies regarding high-skilled migrants remains highly contested. The lack of existing evidence is largely due to the paucity of adequate data. To the best of our knowledge, this article provides the first test of the efficacy of policies targeting high-skilled migrants in a comparative cross-country setting.
As shown in Fig. 1, ever more countries are engaging in the intense global competition to attract internationally mobile human capital. By 2015, approximately 44 % of the 172 United Nations member states declared an explicit interest in increasing their numbers of high-skilled migrants. Highly developed destinations are at the vanguard of this global trend, with two-thirds of OECD nations having implemented, or being in the process of implementing, policies specifically aiming to attract high-skilled migrants. The desirability of high-skilled immigrant workers, and thus the reason for the proliferation of policies aimed at attracting the highly skilled, has been well documented. Examples include spurring technological progress (Kerr and Lincoln 2010), raising productivity (Peri et al. 2015), and fostering economic growth (Boubtane et al. 2014). These changing policy objectives have been accompanied by a large rise in high-skilled migration. Between the last two census rounds, in 2000/2001 and 2010/2011, OECD countries witnessed an unprecedented 70 % rise in the number of tertiary-educated migrants to 35 million (Arslan et al. 2014).
Despite the concurrent rise in the number of high-skilled immigrants worldwide and the proliferation of policies targeting high-skilled immigrants, the effectiveness of such policies remains contested (Bhagwati and Hanson 2009). Doomernik et al. (2009) argued that attracting high-skilled migrants will likely depend on broader economic and social factors. Papadimitriou et al. (2008) coined the term “immigration package” to describe the overall basket of factors that high-skilled migrants’ consider when deciding where to move.
This study examines the effectiveness of skill-selective migration policies in increasing the inflow and selection of high-skilled-labor immigrants, having accounted for a raft of economic and noneconomic factors. The empirical (pseudo-gravity) model is derived from, and consistent with, an underlying micro-founded random utility model (RUM) and is arguably the richest to date in terms of the model being well specified. It also accounts for recent innovations in the empirical literature: namely, a high proportion of zeroes in the dependent variable and multilateral resistance to migration.
The current study contributes to the literature on the determinants of international migration by being the first to examine the efficacy of policies targeting high-skilled migrants in a comparative cross-country setting. Prior cross-country studies have evaluated the effects of entire immigration regimes on aggregate bilateral migration flows (Czaika and de Haas 2016; Mayda 2010; Ortega and Peri 2013) or have focused on particular migration categories, such as asylum seekers (Hatton 2005, 2009; Holzer et al. 2000; Thielemann 2006; Vogler and Rotte 2000) or irregular migrants (Czaika and Hobolth 2016).
Our analysis combines three new data collections: (1) a unique data set of bilateral migration flows harmonized by skill level, (2) a novel database of unilateral policies targeting high-skilled immigrants, and (3) a third data set pertaining to other elements of the “immigration package” that includes bilateral migration policies.
The key result of this study is that supply-side policies—for example, points-based systems (PBS)—are much more effective in attracting and selecting high-skilled migrants than are demand-led policies, including job offer systems, labor market tests, and shortage lists. The provision of post-entry rights, as captured in the model by the offer of permanent residency, is effective in attracting high-skilled migrants, but overall, it reduces the human capital content of labor flows.
Theoretical Framework
Sjaastad’s (1962) canonical paper laid the foundation for the modern approach to theorize migration in economics, casting potential migrants as rational maximizers of human capital investments who assess the attractiveness of potential destinations by comparing their associated costs and benefits. The micro-founded pseudo-gravity model of international migration has since become the theoretical workhorse for understanding migrants’ location decisions. The theoretical foundations of this article, derived from a RUM, are therefore largely “off the shelf” and have been detailed elsewhere (see Beine and Parsons 2015; Beine and Salomone 2013; Beine et al. 2011, 2013, 2014; Bertoli and Moraga 2013, 2015; Bertoli et al. 2013; Boeri et al. 2012; Grogger and Hanson 2011; Ortega and Peri 2013). In particular, the analysis denotes scale and selection equations (see, e.g., Beine et al. 2011; Boeri et al. 2012; Grogger and Hanson 2011; Ortega and Peri 2013).
Empirical Considerations
Empirical research has operationalized migration policies in two ways. The first approach constructs policy indices that measure the restrictiveness of immigration systems (Boeri et al. 2012; Mayda 2010; Ortega and Peri 2013). Typically, a value of 0 is assigned to the index for a particular country in period 0. This value is increased or decreased by 1 should a policy in a particular year be deemed to be more or less restrictive. This approach can be used to assess how within variation in the intensity of migration policies affects migration to a specific country—what may be termed the intensive margin.2
The second approach, which we adopt in this article, is to use a binary variable that equals unity should a particular policy be in force in a specific year, or 0 if the policy is absent. This approach is advantageous in that variations both within and across countries can potentially be exploited, while policies can also be examined in combination. We therefore measure the average effect of a policy being in place (or not)—what may be termed the extensive margin.
Given that policies can be implemented to varying degrees (along the intensive margin), we code countries as having a particular policy in place (or not) based on the same criteria, in each case coded for the high-skilled migration channel that is most relevant in terms of the volume of high-skilled migrants the country admits. Given our research design, our approach necessarily glosses over some details in countries’ policies, which is the inevitable trade-off between sacrificing some specificity and external validity.3
Although in theory we can exploit both the between and within variation in our data, we rely most heavily on the between variation given that we observe few policy changes in our data over the period, thereby precluding a more detailed examination of the intensive margin. In terms of statistical identification, we assume that migration policies targeting high-skilled migrants are essentially random, given that we have controlled for a panoply of other potential determinants as suggested by the existing literature, in addition to batteries of fixed effects along a number of dimensions. We believe that the existence of any remaining omitted variables is unlikely. This approach is inevitable given the cross-country nature of the study because no instruments will likely be available as in single-country cases. Nor is it feasible to exploit quasi-experimental designs (see, e.g., Kato and Sparber 2013; Shih 2016). To this end, we pay particular attention to obtaining data of the highest quality for our analysis while going to the greatest lengths possible to ensure that the data are comparable across countries and over time, and this approach constitutes a particular strength of the current work.
Given our empirical approach, the estimation of Eq. (1) evokes a number of empirical considerations. In their seminal paper, Santos Silva and Tenreyro (2006) showed that in log-linear models, the variance of the error term is a function of the independent variables such that the expected value of the error term will also depend on the value of the regressors. We follow Santos Silva and Tenreyro (2006), who proposed the use of the pseudo-Poisson maximum likelihood (PPML) estimator that instead results in consistent and unbiased estimates in the presence of heteroskedasticity. Additionally, the use of this estimator allows for the inclusion of zeroes in the dependent variable, which is important because our data set contains 8,168 zero-migration observations of a maximum of 23,920 observations.
Bertoli and Moraga (2015) argued that the derivation of Eq. (1) is dependent on two assumptions: (1) the utility derived from each destination varies neither across origins nor across individuals, and (2) the stochastic component of utility is independent and identically distributed (i.i.d.) and conforms to an extreme value type I (EVT-1) distribution. Two key implications result. First, the scale of migration from country o to country d crucially depends on the utility associated with all other possible destinations. Bertoli and Moraga (2013) coined the term multilateral resistance to migration (MRM). A failure to account for MRM constitutes an omitted variable bias. A number of alternatives have been proposed to deal with this potential omission (Anderson and van Wincoop 2003; Bertoli and Moraga 2013; Feenstra 2004; Head et al. 2010). We adopt an alternative approach as suggested by Baier and Bergstrand (2009) to approximate the MRM terms, as first applied to the migration literature by Gröschl (2012).4
Second, for the model to be consistent with the underlying RUM—one that does not violate the independence of irrelevant alternatives (IIA) assumption—a set of origin-time dummy variables must be used to control for the population at origin; this requirement in turn implies that the expected value of the gross migration flow conditional on the independent variables (as well as the dummy variables) is independent across all individuals in the data set. It is important to note that the imposition of these fixed effects also controls for credit constraints, the omission of which will likely lead to alternative results (Belot and Hatton 2012).
Data
A comparison of policies targeting high-skilled migrants across countries requires new data on bilateral migration flows disaggregated by skill as well as measures of migration policies specifically targeting high-skilled migrants. Given the contested nature of the efficacy of these policies, a full battery of other potential determinants must also be considered.
Migration Flows of the Highly Skilled
Migration flow data disaggregated by skill level are some of the most seldom-recorded and most difficult to collate of any migration statistics on a comparable cross-country basis.5 Given the heterogeneity in destination countries’ recording mechanisms, the migration flow data disaggregated by skill are derived from a variety of sources, including administrative data files (Australia, Canada, Israel, New Zealand, and the United States), work or residence permits (Switzerland and the United Kingdom), population and employment registers (Norway and Sweden), and employment visas (the Republic of Korea). Given our focus on comparing migration policies across countries, significant efforts were required to collate data at the greatest level of detail possible and subsequently to harmonize the data to the greatest extent possible in order to facilitate meaningful comparisons across countries and over time. We record immigration flows by occupation pertaining to incoming economic migrants entering destination-country labor markets. Any unemployed individuals are dropped. High-skilled migrants’ dependents will not be captured in our data unless they, too, are recorded as entering destination countries’ labor markets.6 This focus on migrants entering destination countries for employment purposes is important because these are the individuals whom the relevant policies are attempting to attract.
The raw data obtained were harmonized to the greatest degree possible so that comparable cross-country estimates could be obtained. First, the flow data pertain to labor migrants arriving from abroad as opposed to those individuals that change their status in the destination country. Second, with the exception of Israel,7 all the data refer to immigrants’ nationality as opposed to their country of birth or country of last previous residence; this distinction is important because migration costs are determined (at least in part) by nationality (Beine et al. 2014). Third, the data refer to long-term or permanent migrants—that is, those staying for 12 months or more.8 Finally, because countries use differing nomenclatures when recording individuals’ occupations (Parsons et al. 2014), these data were collected at the lowest possible level of disaggregation to ensure that they could be suitably harmonized to a broad notion of human capital.9
Importantly, we adopt two measures of skill. The first, which we refer to as high-skilled, includes all those in the first three major groups of the International Standard Classification of Occupations (ISCO) 2008: (1) managers, senior officials, and legislators; (2) professionals; and (3) technicians and associate professionals. The second includes those of all other skill levels—which for the sake of simplicity, we refer to as low-skilled—and comprises those in all other ISCO 2008 categories. Students, entrepreneurs, and investors are dropped from the data set because our focus is on destination-country labor markets. In addition, business visitors are omitted from the data set because the period of stay and precise occupations of those individuals remain unclear. Given the great variation in detail in the data obtained, meaningfully aggregating the raw data into alternative skill classifications was not possible (see Czaika and Parsons 2016). Our data capture, on average, more than 700,000 skilled migrants per year from 185 origins who reside in 10 OECD destinations (see Table 4 in the appendix), according to our harmonized definition; the greatest number migrated in 2007, when more than 830,000 were recorded in total.10
Migration Policies Targeting the Highly Skilled
Labor immigration systems may be broadly termed as demand- or supply-driven (Chaloff and Lemaitre 2009). Demand-driven systems address rather short-term fluctuations on labor markets and aim for immediate employability of incoming migrant labor. They often require labor migrants to have obtained a job offer before gaining entry to the domestic labor market, and employers typically take a leading role in the recruitment process. Most European systems as well as the U.S. labor immigration system are at least partly employer driven. The job offer requirement is, in effect, a general test about a foreign worker’s employability in the domestic labor market. Such requirements are effective in selecting migrant workers who are immediately employable, but skilled migrants that do not fill an immediate shortage in the domestic labor market might be deterred.
Supply-driven systems are designed to address more structural and prospective labor market imbalances by fostering human capital accumulation. In these systems, highly qualified migrants can typically apply for work permits in the absence of job offers, although a job offer may still grant preferential access. Qualifications, age, work experience, language skills, and previous wages are usually assessed on an individual basis through a PBS, whereby applicants are selected independently of prevailing labor market conditions. Canada and Australia pioneered these skill-selective immigration systems, which aim to attract high-skilled migrants in large numbers. Despite any potential downside regarding the immediate employability of workers admitted through a PBS, supply-driven systems are often seen as relatively effective in attracting high-skilled migrants (Facchini and Lodigiani 2014). In fact, Boeri et al. (2012) argued that it is only such supply-driven systems that can meaningfully attract and capitalize on human capital over the longer term.11
Whether a country has implemented an employer-driven (demand) system, an immigrant-driven (supply) system, or a mixture of the two depends on policymakers’ priorities when addressing long-term deficiencies in human capital compared with short-term labor market shortages (Papademetriou and Sumption 2013). In practice, despite countries leaning toward a demand- or supply-side orientation, immigration policies tend to comprise a mixture of elements, both demand and supply (Parsons et al. 2014), termed “hybrid systems” (Papademetriou et al. 2008). For example, Australia and Canada have recently begun to combine their PBS with shortage lists that constitute demand elements (Koslowski 2014).
To capture immigration policy systems, we choose nine policy elements that collectively capture many of the key differences between destination countries’ policy stances. These instruments reflect three broader policy categories comprising skill-selective admission policies (shortage lists, job offer requirements, labor market tests, PBS), post-entry policy instruments (permanency rights, financial incentive schemes), and bilateral labor agreements specifying double taxation, diploma recognition, and social security, including pension portability.12 It is unlikely that a single policy instrument makes a particular destination country more or less attractive for high-skilled migrants, but rather combinations of immigration policies that establish cross-country variation of immigration systems (Cerna 2009).
Besides a job offer requirement, most European immigration systems require work permit requests to be approved after a labor market test, which is a case-by-case assessment of whether equivalent domestic workers are currently available to fill advertised positions. Labor market tests avoid the recruitment of unemployable migrants and those who might reduce the employability of native workers. To lower the bureaucratic burden of labor market tests—particularly if it is obvious that entire occupations cannot be filled locally—countries have developed shortage lists of occupations that are exempt from labor market tests. Shortage lists encapsulate assessments of labor market shortages on an occupation-by-occupation basis, in contrast with the individual approach of a labor market test. Depending on the number and types of shortage-listed occupations, such lists can facilitate the entire recruitment process but only for those occupations for which shortages have been identified.13
High-skilled migrants are also hypothesized to be strongly attracted by prospects of permanent residency. Even though many high-skilled migrants begin employment on (renewable) temporary work permits, most OECD destinations currently offer a road to permanency after a migrant lives and works in the country for a number of years. Migrants who are not granted these permanency rights on arrival—those who begin on a temporary visa—are often constrained in changing the employer that originally sponsored their work permit. Financial incentives, including tax exemptions and other economic incentives, also predominantly target high-skilled migrants.14
Our analysis also includes bilateral policies and agreements that relate to social security, double taxation (and tax evasion), and the recognition of diplomas, all of which aim to facilitate the admission and transition of high-skilled employees. Social security agreements regulate the equality in treatment between signatories regarding payments abroad, including old age pension, pension portability, disability support, parenting payment for widowed persons, and unemployment benefits. Double-taxation agreements prevent the double taxation of income, capital, and inheritances that are increasingly important for attracting highly mobile skilled workers who may hold multiple residences, including in their home country. These agreements also seek to reduce fiscal evasion. Bilateral agreements that recognize the credentials of migrants’ overseas aim to foster migrants’ integration into host-country labor markets by ensuring that qualifications of migrants are of equivalence to domestic degree programs. Many countries have implemented procedures for institutionalizing assessments and for recognizing foreign diplomas and credentials.
To isolate the effect of immigration policies, we must control for treaties that facilitate the freedom of movement of people. We construct a single variable that is both bilateral and time-varying, thereby capturing whether the two members of a country pair in a particular year are signatories to a freedom-of-movement agreement. The agreements captured include the Schengen agreement; the freedom of movement afforded to member states of the European Union and the European Free Trade Association; the de facto right of abode between Australia and New Zealand; and the Common Travel Area comprising Ireland, the United Kingdom, the Isle of Man, Jersey, and Guernsey. Each unilateral (bilateral) policy variable is coded as 1 should a particular policy instrument be in place for a particular country (pair) in a given year.
Amenities and “Gravity” Variables
Total unemployment and high-skilled wage data are taken from the OECD.15,16 High-skilled wages are calculated as average annual wages multiplied by the ratio of the 9th decile to the 5th decile. Total population data are taken from the International Database of the U.S. Census Bureau.17 Our dyadic control for immigrant networks is taken from the OECD Database on Immigrants in OECD and non-OECD countries (DIOC),18 which provides statistics for the numbers of immigrants residing in each of the OECD countries in 2000, 2005, and 2010. Flows from 2000 to 2004 are equated with the 2000 migration network, as captured by the bilateral stock of migrants in 2000, flows from 2005 to 2009 with the 2005 stock, and flows from 2010 to 2012 with the 2010 stock. Contiguity, common language, distance, and the sharing of a colonial heritage are all taken from the Centre d'Études Prospectives et d'Informations Internationales (CEPII) database (see Head et al. 2010).
A number of amenity variables are included that aim to capture the relative attractiveness of the 10 OECD destinations. Our Net-of-tax measure captures differences in tax rates across countries. This measure is calculated by subjecting a fixed annual salary of $150,000 (in purchasing power parity terms) to the differing tax schedules as provided by the OECD.19 Ceteris paribus, lower taxes are thought to increase the relative attractiveness of particular destinations for high-income earners.20 The appeal of global cities, in which high-skilled migrants no doubt agglomerate, is proxied by the prevailing UN salary country multipliers in each year.21 These multipliers are calculated based on the cost of living in major cities in each of the OECD destinations and reflect (among other things) the variety of goods that high-skilled migrants are able to consume as well as the urban amenities available to them. A quality-of-education variable is included by way of the Programme for International Student Assessment (PISA) scores, as provided by the OECD.22 We hypothesize that high-skilled workers value the provision of education for their children. Finally, the level of technological development that high-skilled migrants are hypothesized to favor is proxied by the density of mobile phone use (Information and Communications Technology (ICT) coverage), measured as the number of mobile or cellular phone subscriptions per 100 inhabitants. These data are taken from the United Nations.23 Descriptive statistics on all variables are reported in the appendix (Table 6).
Baseline Results
Table 1 reports the baseline results from estimating Eq. (1). Model 1 reports estimates of the economic and standard gravity in addition to the freedom-of-movement dummy variables. Models 2 and 3 additionally consider bilateral and unilateral policies, respectively, while Model 4 estimates all our core variables. All regressions reported in Table 1 include a full set of origin-time fixed effects to ensure the theoretical consistency of the empirical estimates.
Across the first four models, our estimates are remarkably stable. Even though all 10 destination countries are highly developed, an increase in high-skilled wages of 10 % is associated with an increase in high-skilled immigration flows of between 7 % and 11 %. High-skilled migrants include in their calculus prevailing unemployment rates and are deterred from moving to areas with fewer job opportunities. Migration networks foster high-skilled migration flows: a 10 % increase in the size of the bilateral migrant community is associated with an increase in high-skilled flows of more than 1 % along the same migrant corridor. Other migration cost-reducing factors captured by cultural, linguistic, geographical, and political proximity are all statistically significant and in the expected direction. Shared common border, language, colonial heritage, and freedom of movement between origin and destination all have a positive influence on high-skilled flows. Increasing geographical distance, however—a proxy for migration costs—naturally reduces high-skilled worker flows.
Models 2 and 4 include three major types of bilateral agreements that we assume are shaping the dynamic of high-skilled migration. Bilateral agreements aimed at recognizing foreign qualifications are associated with an increase in the number of high-skilled migrants by 30 % to 60 %. Evidence on bilateral agreements that regulate social security concerns are less robust, although we find mild evidence of a positive association in our full model. The net effect of the two countervailing forces underpinning the expected sign of the double taxation agreements variable interestingly is negative, suggesting that high-skilled individuals care more about avoiding tax than they care about being taxed only once as provided for in such agreements.
Turning to our main policy variables of interest, two of the three demand-driven instruments—the need to obtain a job offer, and shortage lists—significantly deter the absolute inflow of high-skilled migrants. Countries requiring a job offer recruit almost one-half as many high-skilled migrants. Labor market tests, however, have no influence on high-skilled migration flows in the baseline models. Shortage lists, which are even more rigid in preselecting high-skilled migrants, seem to represent an additional barrier for recruiting high-skilled migrants in large numbers.
Our main result is that PBSs appear to represent the most effective policy for attracting high-skilled migrants. Major PBS countries (Australia, Canada, New Zealand, the United Kingdom) attract, on average, 1.5 times the number of high-skilled migrants when compared with countries that adopt alternative policy tools. As opposed to the other policy measures, PBS across the destination countries might well operate differently. To address this concern, Model 5 includes separate dummy variables for PBS country—the results from which suggest that PBSs in Australia and New Zealand are the most effective.
The provision of permanency rights is also an important incentive for high-skilled migrants. Countries providing a road to permanency attract, on average, double the number of high-skilled migrants in comparison with those that do not. Permanency rights—even if permanent settlement is not the prime intention of the migrant at entry—increase the option value of staying longer in the host country and expand future opportunities. Financial incentives, such as tax breaks, are another attempt to attract international talent. The results from our baseline model suggest that such schemes have no effect, however.
Robustness Checks
Table 2 reports a series of robustness tests of the core model specification. Model 1 in Table 2 includes two MRM terms that albeit both significant do not alter the other estimates significantly; the exception is for financial incentive schemes, which become statistically significant at the 10 % level. The estimates for these terms (negative for the distance measure and positive for the adjacency measure) are omitted for the sake of brevity. Model 2 includes dyad fixed effects to address concerns that an omitted variable—for example, cultural distance—might be driving the results. They adequately control for such an omission because, as shown by the pioneering work of Geert Hofstede (Hofstede 1980; Hofstede et al. 2010), cultural distances change extremely slowly over time. Although the addition of dyad fixed effects improves the goodness of fit, several of the other estimates become smaller in size; the social security variable is no longer significant, and the labor market test variable becomes negative and significant. Nevertheless, the key findings remain intact.
Another particular concern (as shown in Fig. 2 in the appendix) is that the policy variables fail to capture many policy changes over the period 2002–2012, meaning that the estimation needs to rely quite heavily on the between variation across countries. We cannot impose a set of destination fixed effects, therefore, which might lead to fears of an omitted (destination country) variable bias. To address such concerns, Model 3 in Table 2 is equivalent to the core model that includes both origin-time and destination-time fixed effects. The difference between the R2 in Model 3 in Table 2 and the R2 from the core Model 4 in Table 1 is only 0.1 %, however, which provides confidence that an omitted variable is not responsible for driving the results.
Model 4 in Table 2 extends the core model with the inclusion of five additional variables that proxy for the role of economic and social amenities that have traditionally been viewed as determining the relative attractiveness of potential destinations (Gosnell and Abrams 2011; Tiebout 1956). All the variables are included simultaneously, without causing any significant changes in the other variables of interest. The coefficients on the amenity variables are largely as expected. The ICT coverage variable is used to capture the degree to which a location is culturally and technologically avant-garde because it has been argued that a rising creative class is attracted to such places (Florida 2002). A 10 % increase in ICT coverage is associated with a nearly 9 % increase in the inflow of high-skilled workers.
The net-of-tax variable proxies the attractiveness of national tax schemes and is shown, rather unsurprisingly, to significantly attract large numbers of foreign high-skilled workers. Whereas the importance of global cities for attracting international talent is well established (e.g., Sassen 2011), rising living costs—including property prices and rents—are major disincentives to moving to those cities. The coefficient on the global city living-cost variable is insignificant and negative, however, which suggests that the cornucopia of urban amenities and available product varieties compensate for relatively high living costs.
The estimates of the coefficient on the educational sector, as measured by global PISA scores, are significantly negative. In other words, the results suggest that high-skilled workers locate to those destinations that fare relatively poorly in education. This result can be attributed to the fact that Korea, which performs best overall on the PISA, plays host to the fewest high-skilled migrants in the sample; comparatively, the United States, which performs worst overall on the PISA, plays host to the greatest number of migrants. It is also likely that high-skilled migrant workers are able to place their children in private schools, so concerns about average PISA scores across the country might not be taken into consideration when they are deciding where to move.
Finally, a measure of life expectancy is included in the estimation as a proxy for the overall quality of living conditions (including health services provision) in addition to other factors that affect longevity. The coefficient on this measure is insignificant, which might suggest that high-skilled migrants care more about the provision of good health care (for example) privately as opposed to average health outcomes across the country. The imposition of the amenity measures does not alter any of the results that concern economic or policy variables. Moreover, the R2 of Model 4 in Table 2 is identical to that of the core Model 4 in Table 1, suggesting that in the empirical framework, amenities—that is, noneconomic factors—seem to play little role in determining the destination choices of high-skilled migrants.
Model 5 in Table 2 is estimated to address concerns that the results might be driven at least partly by the fact that the overarching data set is not perfectly balanced. Model 5 is therefore estimated on a reduced, although balanced, panel of dyad-year observations for the period 2003–2008. Again, the main results remain unchanged. Finally, Model 6 in Table 2 is estimated with an Arrelano-Bond dynamic panel estimator (Roodman 2009) to insulate our results from potentially endogeneity in the policy variables otherwise not accounted for, especially reverse causality. In addition to internal lags and first-difference instruments, unionization in the destination country’s labor force is included as another external instrument.24 Unfortunately, given the large number of variables included in the system–Generalized Method of Moments (GMM) estimation, it is not possible to include a full set of origin-time fixed effects, but separate origin and time fixed effects are included—a modification might drive some of the differences in the results. Nevertheless, the major policy results based on the estimates on permanency, PBS, and shortage lists remain intact, although the coefficients on these variables become significantly smaller.25
The Skill Composition of International Migration Flows
Even if particular skill-selecting and skill-attracting policies are associated with larger inflows of high-skilled migrants, the overall effect on the composition of total labor migration flows—operationalized as the share of high-skilled in the total labor inflow—remains uncertain.
This overall effect might in part result from our definition of high-skilled migration (ISCO classification categories 1–3). At least some skill-selective policies do not solely target these occupations and may similarly apply and encourage workers of lower skill levels. Shortage lists, for example, often include occupations that are not highly skilled according to this definition. Labor market tests and job offer requirements are policy instruments that may be argued to be a priori skill-neutral, although their application largely depends on underlying labor market demand and labor shortages. Given that such shortages are generally more prevalent in high-skilled occupations, however, we may still expect that even these demand-driven policies somehow have stronger effects on high-skilled labor inflows.
Whether these skill-selective policies are effective in altering the composition of labor inflows in favor of the highly skilled remains an empirical question. Table 3 reports estimates of the proportion of high-skilled migrants in total labor migration flows (see Eq. (2)). Because this share is bounded between 0 and 1, the effects of the explanatory variables tend to be nonlinear, while the variances tend to decrease when the mean approaches the limits. We therefore estimate a zero-one inflated beta-fit model with slightly modified zero-one boundaries (Smithson and Verkuilen 2006).
Model 1 in Table 3 reports the baseline, Model 2 includes the set of bilateral policy variables, and Model 3 adds the unilateral policy variables. Model 4 simultaneously runs seemingly unrelated regressions (SUR) on high-skilled migration (Model 4a) and on non-high-skilled migration (Model 4b) to control for the cross-correlation in the error terms between the two groups of workers to ensure that the greatest number of observations are maintained in the data.26
A rising skill premium, as captured by the difference between the measure of high-skilled wages and the prevailing median salary in a particular year, significantly alters the composition of labor flows in favor of high-skilled immigrants. Although the wage gap between the 90th percentile and the mean wage was approximately 45 % in 2000, it increased to more than 63 % in 2012 across these 10 OECD destination countries. Thus, a rising skill premium shifts the skill composition of labor inflows toward higher-skilled workers, as predicted by the Roy model (Borjas 1987).
Models 1–3 in Table 3 provide evidence that high-skilled foreign workers are more sensitive to business cycle fluctuations, such that higher unemployment at destination reduces the skill selectivity of incoming migrants. It is interesting, however, that the SUR estimates show that although high-skilled migrants are significantly deterred by high unemployment, migration flows of their lesser-skilled counterparts are negatively correlated with destination unemployment levels. Migrant networks play a more important role in facilitating migration for lower-skilled workers—a result consistent with Beine et al. (2011). This finding is unsurprising given that migrant networks are purported to reduce migration costs that are relatively higher for lower-skilled workers. The existence of migrant networks may therefore alter the selection of migrants over time (see McKenzie and Rapoport 2010).
The beta-fit regressions show that flows between contiguous country pairs tend to encourage fewer high-skilled workers because low-skilled workers are more sensitive to migration costs and may take advantage of migrating to neighboring countries. Country pairs with freedom of movement also encourage larger shares of non-high-skilled workers, thereby leading to a more negative selection on skills. The results from the beta-fit regressions also suggest that longer distances between two countries increase the skill selectivity of the migration flow, which again would suggest (as the SUR estimates show) that non-high-skilled workers are more sensitive to increases in migration costs. Somewhat surprisingly, the regressions show that migration between countries that share a colonial heritage tends to be more skill-selective, and the SUR regressions show that this effect might be driven by a large deterrent effect for non-high-skilled workers. Similarly, the estimated coefficient on language in the beta-fit regressions indicates that common language reduces the selection on skills, and the SUR regressions suggest that a common language spurs the movement of non-high-skilled more than their high-skilled counterparts.27
With regards to migration policies, PBSs again prove the most effective in improving the incoming distribution of skills at destination. PBSs assess skill profiles and filter labor migrants according to perceived long-term skill requirements and therefore are effective instruments not only for recruiting relatively large numbers of high-skilled migrants but also for shifting the skill composition in favor of the highly skilled.
The beta-fit regression in Model 3 suggests (albeit weakly) that requiring a job offer at entry increases the skill selectivity of incoming workers. The corresponding SUR results, based on the full sample, suggest the opposite: job-offer systems deter both sets of workers—the highly skilled, worst of all—the overall effect of which would be to reduce the incoming selectivity on skills. Labor market tests are shown to increase the share of high-skilled relative to lower-skilled migrants. The SUR regressions suggest this result is due to a positive effect exerted on the high-skilled migrant flow. This finding may indicate that countries implementing labor market tests might be more successful at filling lower-skilled positions domestically, meaning that the overall skill composition of the incoming flow increases. The imposition of shortage lists, however, significantly reduces the overall selection on skills because such lists deter high-skilled migrants more than low-skilled ones. This instrument is therefore not effective in attracting highly qualified migrants because the lists often comprise occupations that are not classified as highly skilled.
The beta-fit regressions show that permanency rights reduce the overall skill selectivity of immigrant flows. The SUR results indicate that although permanency rights provide positive incentives for both high- and otherwise-skilled workers, the effect on the latter is larger, such that the overall effect is negative. Both skill groups somewhat counterintuitively seem to be deterred by financial incentive schemes. Although tax breaks and allowances are expected to be relevant in the decision of high-income earners to migrate, robust empirical support for this presumption is not found. Finally, turning to the measures of bilateral agreements, recognition of diplomas, and social security agreements, both seem to be effective in increasing the skill composition of migrant flows, while no overall effect of international double-taxation agreements is seen.
Conclusion
High-skilled migration policies are en vogue, in large part because of increasing demand from various businesses that lobby governments for political support in filling labor market shortages with foreign workers. The phenomenon of business-driven labor migration policies is not new, as demonstrated by the guest worker programs of the 1950s and 1960s. The main difference, though, is that employers increasingly demand skill sets that often require tertiary education or other highly qualified expertise—skills that cannot be fully met by domestic labor. Governments have decided to respond to these demands by implementing various types of skill-specific and skill-selective immigration regimes that facilitate the international recruitment of desired workers.
Recent policy developments demonstrate, however, an increasing hybridization of skill-selective immigration systems. Demand-driven policies coexist with supply-driven elements in an attempt to balance the conflicting aims of numbers versus employability. The main result of this article is that supply-led systems—that is, points-based systems—increase both the absolute numbers of high-skilled migrants and the skill composition of international labor flows. Conversely, demand-driven systems, which are usually based on the principle of job contingency and are often supplemented by case-by-case (labor market test) or occupation-by-occupation (shortage lists) assessments of labor market needs, are shown to have little effect—and potentially even a negative one. This general conclusion needs to be clarified, however, because the aims of these policy tools differ. Points-based systems, such as those pioneered by Canada and Australia, were initially introduced with the idea that “there can never be enough of a good thing” and implemented as population policies with the aim of achieving the large-scale immigration of skilled workers. Other countries’ immigration policies—for example, those largely used across Europe—were founded on the notion of integrating migrants both economically and socially, and prioritize labor market outcomes over the numbers of migrants actually recruited.
Acknowledgments
The research presented in this article is part of the Drivers and Dynamics of High-Skilled Migration (DDHSM) project, which received generous funding from the Alfred P. Sloan Foundation (Grant 2011-10-22) and KNOMAD. The authors would like to thank Laurin Janes, Sebastien Rojon, Farhan Samanani, and Lena Wettach for nothing short of exemplary research assistance. We are grateful to attendees of the DEMIG Conference, Wolfson College, Oxford, September 2014; the Drivers and Dynamics of High-Skilled Migration Workshop, Oxford Martin School, October 2014, in particular Rey Koslowski, Çağlar Özden, and Martin Ruhs; the IRES Research Seminar, Université Catholique de Louvain, February 2015, especially Frédéric Docquier and David de la Croix; and finally to the participants of the 8th International Conference on Migration and Development, World Bank, June 2015. This article also reflects comments by three anonymous reviewers received through the KNOMAD peer review process and benefited from editing by Sherrie Brown as well as two anonymous referees during the Demography peer review process.
Appendix
Notes
We thank an anonymous referee for this suggestion.
An important ongoing innovation in this regard is the IMPALA project (Beine et al. 2015), which uses lawyers—experts on the texts of migration legislation—to code dummy variables for destination countries.
The data collection underpinning the current analysis took more than two years to complete and proved particularly expensive. For the sake of brevity, interested readers are referred to Czaika and Parsons (2016) for a more complete overview of the data collection.
This, of course, means that for family members who enter countries’ labor markets through family reunification channels, at which point their occupations are not recorded, our estimates will represent lower bounds.
The majority of immigrants arriving in Israel during the period (74 %) comprised individuals from the countries of the former Soviet Union, which is recorded as a single entity in the data set. This no doubt reduces any discrepancies between the two series.
The single exception to this is our inclusion of H1-B visa data for the United States.
Therefore, our estimates for countries adopting immigration policies that admit greater numbers of high-skilled migrants through on-shore channels will represent lower bounds.
Although this number is somewhat artificially inflated because of the inclusion of H1-B visa data for the United States, which are based on I-94 admissions data (Czaika and Parsons 2016), the results remain robust to their inclusion and exclusion.
For a fuller discussion as to how countries conceptualize the time horizon of their migration policies, interested readers are directed to Parsons et al. (2014).
See Czaika and Parsons (2016) for a more detailed description on the evolution and diffusion of these policy instruments across Western immigration destinations since 2000.
The accuracy of the underlying shortage analysis in identifying and assessing labor market needs has often been criticized (e.g., Sumption 2013). Therefore, the effect of a shortage list on the overall number of high-skilled immigrants is rather ambiguous, even more so when shortage lists also include occupations that require lower skill levels.
Given the heterogeneity of high-skilled migration policies across countries, methodologically we adopt a set of statements against which a 0 or 1 can be assigned to ensure consistency when coding our policy variables. Our data are always coded for the most attractive and most relevant HSM policies (in terms of volumes). Thus, while glossing over some detail, we primarily aim to capture the existence of some major policy instruments of skill-selective immigration systems. For example, for a labor market test, the guiding statement is simply, “Is there a mechanism in place to attempt to ensure the position cannot be filled by domestic workers?” The remaining statements can be found in Table 5 in the appendix. Nevertheless, because destination countries typically implement numerous policies that often relate to more than one class of migrant (Czaika and de Haas 2013), we adhere to a series of coding assumptions in order to ensure that the data are comparable both across countries and over time. These assumptions can also be found in Table 5 in the appendix. The aim is thus to imagine how a migrant with full knowledge of policies views the incentives presented within policies at the point of deciding to move. In many instances, such incentives all or predominantly fall under a single visa, which offer an array of specific provisions for high-skilled migrants. In the United States for example, we select the conjunction of H1B and EB visas because few applications for EB visas are made by new arrivals. With a processing and wait time of several years for the EB2/3 categories (and only a very narrow band of applicability within EB1), such policies are unlikely to allow migrants to respond to immediate opportunities. Therefore, the EB visas alone cannot be used to characterize immediately available incentives. They might, however, factor into long-term plans, making them attractive in terms of providing possibilities for permanent residence, spousal work rights, greater scope for family reunification, and so forth. In other words, since visa switching is clearly a strategy used by skilled migrants, looking at the provisions of these two visas in conjunction makes most sense in terms of capturing the incentives available to most skilled migrants.
OECD Data Unemployment: https://data.oecd.org/unemp/unemployment-rate.htm
OECD Data Earnings and Wages: https://data.oecd.org/earnwage/average-wages.htm
U.S. Census Bureau, International Database: http://www.census.gov/population/international/data/idb/informationGateway.php
OECD Database on Immigrants in OECD and non-OECD Countries: http://www.oecd.org/els/mig/dioc.htm
OECD Tax Database: http://www.oecd.org/tax/tax-policy/tax-database.htm
The results do not change when alternative annual salaries of $150,000, $200,000, and $250,000 are considered.
These were calculated from data available from United Nations International Civil Service Commission, Post Adjustment Reports: http://icsc.un.org/secretariat/cold.asp?include=par
OECD, PISA: http://www.oecd.org/pisa/
UN Data: http://data.un.org/Default.aspx
Trade union density corresponds to the ratio of wage and salary earners that are trade union members, divided by the total number of wage and salary earners. See Trade Union Density - OECD.Stat: https://stats.oecd.org/Index.aspx?DataSetCode=UN_DEN
As a test of reverse causality, we ran additional logit panel regressions of respective policy changes on high-skilled migration flows. Based on these tests (available from the authors on request), we can conclude that we find little to no evidence for a reverse causality of high-skilled migration inflows on the likelihood of a policy change in the expected direction. These results, in combination with the respective GMM result, give confidence that our policy estimates are consistent and do not suffer from endogeneity bias.
When calculating the shares of the highly skilled in the total (Models 1–3), regressions cannot be run if the total number highly skilled is equal to 0 because these observations are dropped from the estimation.
However, the SUR regressions show no significant difference of the effect of common language on skill selectivity.