ethnic

Romani

Also known as: Roma, Romany, Gypsies (pejorative)

The Romani people trace their origins to populations in northwestern India, with linguistic and genetic evidence pointing to a departure from the subcontinent sometime between the sixth and eleventh centuries CE. Their language, Romani, belongs to the Indo-Aryan branch of the Indo-European family and retains core vocabulary and grammatical structures closely related to languages such as Hindi and Punjabi. Historical records first mention groups matching Romani descriptions in the Byzantine Empire by the eleventh century, after which communities moved westward through Anatolia and the Balkans, reaching western Europe by the fourteenth and fifteenth centuries. This timeline remains approximate because written sources from the period often conflated Romani with other mobile groups.

Genetic studies have strengthened the case for an Indian source while revealing subsequent admixture. Analyses of autosomal, mitochondrial, and Y-chromosomal markers consistently show that Romani populations carry a distinctive South Asian ancestry component that is absent or rare in surrounding European groups. Research led by teams including Martínez-Cruz and colleagues in 2016, as well as earlier work by Kalaydjieva and colleagues, identified founder lineages and estimated that the proto-Romani population experienced a relatively small effective size during migration, followed by regional gene flow with local Europeans. Ancient DNA directly from early Romani burials is still limited, so current conclusions rely heavily on modern sampling calibrated against broader Eurasian datasets.

Archaeological evidence is sparse because Romani groups were historically mobile and used perishable materials, yet material culture and burial practices documented at medieval sites in the Balkans occasionally align with ethnographic descriptions of early Romani communities. Debates persist over whether a single migration wave or multiple dispersals occurred, and over the precise geographic corridor—whether through the Caucasus, Iran, or Armenia—taken out of India. Some researchers argue that linguistic stratification within Romani dialects supports at least two distinct streams, while others maintain that observed variation can be explained by later fragmentation after arrival in Europe.

These findings matter for understanding how small migrating groups maintain distinct identities while contributing to the genetic and cultural diversity of larger host societies. The Romani case illustrates repeated episodes of admixture, marginalization, and resilience that parallel other diasporic histories across Eurasia. Ongoing work combining ancient DNA from potential migration routes with refined linguistic phylogenies continues to narrow uncertainties about timing and demography, offering a clearer picture of one thread in the larger tapestry of human movement out of South Asia.

Geographic distribution: Europe, Americas

Biological ancestry and ethnic identity are related in some cases but are not equivalent. Individuals within one ethnicity may have different ancestral backgrounds. See our methodology.

Related