Authors
Maurer, T., Purucker, L., Hutter, F., Pfaffelhuber, P., Heinzel, C. S.
Abstract
While classifiers such as TabPFN and SNIPPER achieve strong intercontinental performance, their accuracy in classifying individuals within Europe remains low. One major factor contributing to this limitation is the set of genetic markers used for classification. Marker panels such as the VISAGE Enhanced Tool are commonly employed in forensic genetics because they contain ancestry informative markers (AIMs) that distinguish very well between major continental populations. However, these panels are often not optimized for fine scale differentiation within continents, where genetic variation is more subtle and population structure is rather continuous. We apply machine learning to select informative markers for intra European classification, using data from. Compared with the VISAGE Enhanced Tool and allele frequency based approaches, our marker sets achieve substantially higher accuracy within Europe: For four European populations, accuracy improves from 68.2% (VISAGE, 104 markers) to 73.7% (100 new markers) and 82.3% (200 new markers). For five populations, accuracy rises from 56.1% (VISAGE) to 64.5% (100 new markers). Our results show that tailored marker selection markedly improves intra continental classification. While optimized here for Europe, the method can be applied to any region with sufficient training data.
Preprint server:
bioRxiv
The authors list and abstract were imported from bioRxiv on 11 Nov 2025.
Advertisement
Stats
- Recommendations n/a n/a positive of 0 vote(s)
- Views 33
- Comments 0