I know this has been the topic of discussion in several threads but I've got very ropey wifi at the moment and am too lazy to search for the most appropriate one so figured I'd start a new thread.
The paper gives a couple of examples where ethnic labels group people together erroneously, including this one,
It explains why these groupings are so important in genetic research,Another example in frequent use is “Bantu”, which effectively refers to a very broad linguistic grouping comprising hundreds of millions of people in Africa, speaking over 400 distinct languages or dialects. There is some overlap between genetic clusters and Bantu-speaking dialects or languages, but not across the whole group. Furthermore, the word Bantu was used as a catch-all term in apartheid-era South Africa for many different Black African peoples, including groups that were not Bantu-speaking, and had widespread derogatory use in that society. The continued use of these and other similar terms is particularly a problem for longitudinal studies that stretch back decades into the past when such terminologies were current. But even contemporary public health and governmental datasets use census terms which are often arbitrary, outmoded and inconsistent.
In other words, if you group people one way you'll get different results to if you group them another way. And if you're grouping them based on racist catch-all terms for "not like us" then don't be that surprised if you get racist results....human genetics is an inherently statistical science, one that describes correlations between genomic and phenotypic variation, and attempts to distinguish genetic and environmental effects on phenotypes. The way we group people plays a central role in these analyses, and in many cases, categories enable us to compare and contrast phenotypes and genotypes, and use simpler and more interpretable statistical models, which add to our power of discovery.
They make it clear that while these groupings may be spurious, they still matter,
This is a key point,These categories [of race and ethnicity], although socially constructed, can have profound biological effects. For example, by influencing a person’s geographical surroundings, their levels of chronic stress, their access to resources, and other aspects of their life history, they may have a major impact on prenatal and childhood development and the expression of human traits.
Thus, the social categories and other groupings that individuals belong to are inescapable components of genetics research. However, within the human genetics community, some aspects of the academic language used to describe groups and subsets of people may foster erroneous beliefs beyond academia about human biology and the nature of these categories. Such descriptions frequently invoke concepts of ancestry and population structure, for reasons we will discuss below. But ancestry itself is often a poorly understood concept, and its relationship to genetic data is not straightforward.
...it is sometimes assumed, incorrectly, that these labels, self-identified or otherwise assigned, can be used straightforwardly as a proxy for genetic ancestry. This reinforces the commonly-held but simplistic assumption that differences between ethnic groups are substantially due to genetic differences, and are in some sense innate.
They then provide a list of terms they recommend against using and provide explanations for this conclusion and alternative suggestions while emphasising that they expect people to disagree with them and are writing to stimulate discussion.framing social environments in purely genetic terms may seem to imply a genetic explanation, rather than societal, cultural and historical explanations, for the existence of these categories and the differences between them. Moreover, some of the labels used are simply archaic and discredited terms from earlier eras of anthropology and human genetics, and therefore inappropriate as descriptors of human groupings today.