
Population genomic analysis identifies the complex structural variation at the fibromelanosis (fm) locus in chicken
- Select a language for the TTS:
- UK English Female
- UK English Male
- US English Female
- US English Male
- Australian Female
- Australian Male
- Language selected: (auto detect) - EN
Play all audios:
ABSTRACT Phenotypic diversity and its genetic basis are central questions in biology, with domesticated animals offering valuable insights due to their rapid evolution the last 10,000 years.
In chickens, fibromelanosis (FM) is a striking pigmentation phenotype characterized by hyperpigmentation. A previous study identified a complex structural variant involving both two large
duplications (127.4 and 170.5 kb in size) and inversions associated with upregulated expression of the _Endothelin 3_ (_EDN3_) gene. However, the detailed organization of the structural
arrangements have remained unclear. In this study, we conducted a comprehensive genomic survey of 517 FM chickens representing 44 different populations. Our results elucidate the complex
arrangement of the duplications and inversions at the _FM_ locus based on the large-scale genomic survey, population level genotyping, and linkage disequilibrium analysis, providing
conclusive support for one specific configuration of the two large duplications, resolving a controversy that has been unresolved for more than a decade. Our results show that the birth of
this complex structural variant must have involved an interchromosomal rearrangement creating fixed heterozygosity due to sequence differences between the two copies of the 127.4 kb
duplication. This study shows how population genomics can be used to understand complex structural variations that underlie phenotypic variation. SIMILAR CONTENT BEING VIEWED BY OTHERS THREE
CHROMOSOME-LEVEL DUCK GENOME ASSEMBLIES PROVIDE INSIGHTS INTO GENOMIC VARIATION DURING DOMESTICATION Article Open access 11 October 2021 A CHROMOSOME-LEVEL GENOME ASSEMBLY FOR THE SILKIE
CHICKEN RESOLVES COMPLETE SEQUENCES FOR KEY CHICKEN METABOLIC, REPRODUCTIVE, AND IMMUNITY GENES Article Open access 06 December 2023 EVOLUTIONARY ORIGIN OF GENOMIC STRUCTURAL VARIATIONS IN
DOMESTIC YAKS Article Open access 19 September 2023 INTRODUCTION How phenotypic diversity evolves and its genetic basis is a fundamental question in biology. Domesticated animals constitute
a valuable resource to explore this topic due to their rapid phenotypic evolution within the last 10,000 years1. A characteristic feature of domestic animals is altered pigmentation pattern,
a phenotypic change that occurred early during domestications of animals as documented by ancient illustrations and documents. A striking pigmentation phenotype in domestic chicken is
fibromelanosis (FM) that is widespread among Asian chicken breeds, like Chinese Silkie and Ayam Cemani from Indonesia. FM is characterized by a massive expansion of melanocytes resulting in
excessive skin and tissue pigmentation2. The FM trait is highly valued culturally and commercially in Asia. For instance, Silkie chickens are prized in ornamental breeding for their
distinctive appearance, while Ayam Cemani chickens hold cultural significance and command high market prices3. FM chickens are valued for their deep eumelanin deposition, used in food such
as Chinese black-bone soup and in Chinese traditional medicine4,5. A previous study demonstrated that the dominant FM trait in several breeds of chicken is caused by a complex structural
rearrangement involving two duplications, 127.4 and 170.5 kb in size2. One of the duplications encompass the _EDN3_ gene, which has a critical role for melanoblast differentiation and
expansion6. The fact that _EDN3_ shows a highly significant upregulation at the mRNA level in skin from FM chickens strongly suggested that this is the causal mechanism for the massive
expansion of pigment cells in FM chicken2. It was not possible to resolve the organization of the complex rearrangement with the short-read whole genome sequence data available a decade ago,
even with the structural variation detection tools widely used today7, and three possible configurations of the complex structural rearrangement (Fig. 1) were established based on PCR
analysis of breakpoint regions2. A single recombinant found in a pedigree segregating for the _FM_ mutation was only consistent with one of the possible configurations denoted FM-22.
However, more recent data based on long read sequencing assemblies have given conflicting results, in which some studies2,8,9 have supported the FM-2 configuration whereas other studies came
to the conclusion that their assembly of this region from Yeonsan Ogye10 and Silkie11 chicken supported the FM-1 configuration. In order to resolve the structural arrangement of the _FM_
locus, here we have analyzed whole genome sequence data from 517 FM chickens representing 44 chicken breeds worldwide, all with the FM phenotype (Fig. 2A). Our comprehensive population
genomic analysis showed that all these breeds have the same structural variations (SVs) at the _FM_ locus as previously reported2. Our population genomic analysis provides overwhelming
support for the FM-2 model based on the pattern of linkage disequilibrium across the complex rearrangement reflecting where recombination events can occur. This study highlights how
population genomics data can be used to resolve complex structural variations that are challenging to resolve even using long-read sequence data. RESULTS COMPREHENSIVE GENOMIC SURVEY OF THE
_FM_ LOCUS We analysed whole genome sequencing data of 517 FM chickens representing 44 breeds of chicken (Fig. 2A). Our results showed that all these breeds have the same structural variants
(SVs) at the _FM_ locus (Fig. 2B and C). The SVs involve duplications of two genomic regions on chromosome 20:10,766,772–10,894,151 bp (DUP1) and chr20:11,306,686–11,477,501 bp (DUP2) (Fig.
2C), consistent with previous findings2,12. GENETIC PROFILE OF THE _FM_ ALLELE We performed a comprehensive analysis of genomic data to characterize the arrangement of the complex
chromosomal structural variation of the _FM_ allele (Figs. 3 and 4). Firstly, we analyzed the sequence depth at the two duplications associated with the FM phenotype and confirmed that DUP1
and DUP2 occur as two copies in the _FM_ allele (Fig. 3A and B). This analysis allowed us to identify _FM_ homozygotes that were used in the further analysis. We next determined the genetic
differentiation between wild-type (_FM*N_/_N_) and _FM_ homozygous chickens (_FM*FM_/_FM_) (Fig. 3C). _F_ST results showed that there were significant genetic differences in the two SV
regions, especially in DUP1 and its flanks. Notably, there is an obvious reduction of genetic differentiation at the front segment of the DUP1 region (Fig. 3C). Nucleotide diversity analysis
showed a marked reduction of heterozygosity in _FM_ homozygotes but interestingly only in the DUP1 region and its flanks (Fig. 3D). We next analyzed the frequency of the reference allele
for all SNPs across the region harboring the two duplications. This revealed a remarkable bubble for the DUP1 region in _FM_ homozygotes not present in wild-type chicken (Fig. 4A and B). In
the bubble region, _FM_ homozygotes tended to be fixed for the reference allele or the non-reference allele, or the two alleles occur at exactly the same frequency (50%) (Fig. 4B). A weak
tendency for a similar pattern was noted for the DUP2 region where two regions (marked with red arrows) had an excess of SNPs showing an allele frequency of 50% (Fig. 4B). The non-duplicated
412.54 kb region between DUP1 and DUP2 showed a very similar distribution of allele frequencies as present in wild-type chickens (Fig. 4A and B). These data are consistent with strong
suppression of recombination between FM and wild-type chromosomes but only for the DUP1 region. CHARACTERIZATION OF THE STRUCTURAL ARRANGEMENT OF _FM_ ALLELE The above results refute FM-1 as
a possible organization of the _FM_ allele. This is because the FM-1 configuration (Fig. 1A) implies an inversion involving one copy of DUP1, the connecting region (412.54 kb) and one copy
of DUP2, and the bubble should extend across the entire region which it does not (Fig. 4B). We therefore conclude that the FM-2 configuration must be the correct organization because it is
the only one consistent with a single previously reported recombination event2 and none of the recent reports based on PacBio long reads found support for the FM-3 order10,11. In addition,
if the configuration of FM-3 is correct, we should observe a bubble in the DUP2 region, not in the DUP1 region. Phylogenetic trees constructed for the DUP1 and DUP2 regions revealed
differences in branch lengths between two SV regions which support the hypothesis of an inversion near the DUP1 region (Supplementary Fig. 1), consistent with our proposed structural
configuration of the _FM_ locus. The large number of SNPs showing an allele frequency of 50% in the DUP1 region implies that the complex rearrangement creating the _FM_ allele must have
involved an inter-chromosomal event involving two different haplotypes creating fixed heterozygosity for the positions in regions of severely suppressed recombination (after the
rearrangement) where the two haplotypes differed (Fig. 4C). In conclusion, FM-2 (Fig. 1A) is now the confirmed organization of the _FM_ allele and there is strong suppression of
recombination for most of the DUP1-inverted DUP2-DUP1 region whereas there is no strong suppression of recombination from the end of the second copy of the DUP1 region to the second copy of
DUP2 including the single copy 412.54 kb intervening region (Fig. 4C). This explains the bubble pattern across the DUP1 region (Fig. 4B) in _FM_ homozygotes as well as the corresponding
pattern of genetic differentiation and nucleotide diversity (Fig. 3C and D). TRACING THE FORMATION OF THE _FM_ ALLELE There is a sharp disruption of the region of fixed heterozygosity within
DUP1 in the _FM_ allele, marked by a red arrow in Fig. 4B. This could either reflect a recombination with wild-type chromosomes have occurred or a shift from a large region where the two
donor haplotypes forming the _FM_ duplication showed many sequence differences to a region where they happened to be identical. Similarly, two reference-consistent regions in DUP2 may
reflect regions in which one of the copies of DUP2 not undergoing recombination is identical to the genome reference and thus the frequency of the reference allele among chickens homozygous
for the _FM_ allele will always be 50% or higher (Fig. 4B), because we estimate the allele frequency as the average of sequences from the two copies of DUP2. Linkage disequilibrium (LD)
analysis at the _FM_ locus also showed a clear LD block boundary between two segments within DUP1 which is in perfect agreement with the break of fixed heterozygosity (Figs. 4B and 5). While
LD decay patterns are influenced by recombination rates, population structure, and historical selection events, the significantly elevated LD at the _FM_ locus (Fig. 5) are consistent with
the presence of an inversion. DISCUSSION RESOLVING THE STRUCTURAL CONFIGURATION OF THE _FM_ ALLELE The _FM_ allele constitutes a truly complex structural variant involving two large
duplications (127.4 kb and 170.5 kb), an inversion and translocation of the 170.5 duplication and with a 412.5 kb non-duplicated intervening region (Fig. 1). Previous PCR studies established
three possible configurations of this complex rearrangement (Fig. 1) that could not be resolved using whole-genome sequencing based on short read data2,12. However, one recombinant found in
a pedigree analysis provided strong support for the FM-2 configuration5. Subsequent studies based on long-read sequencing gave conflicting results, some supporting the FM-1
configuration10,11 and others supporting FM-28,9,13. The current study based on extensive whole genome sequence data and the analysis of linkage disequilibrium patterns across more than 40
breeds carrying the _FM_ allele now provides conclusive evidence for the FM-2 configuration and that the _FM_ allele arose only one time. The unique feature with an inverted copy of DUP2
inserted between two tandem copies of DUP1 results in strong suppression of recombination explaining the allele frequency bubble at DUP1 when reads from the _FM_ allele are aligned to the
reference genome (Fig. 4). Furthermore, the data shows that the _FM_ mutation must have involved an inter-chromosomal exchange resulting in fixed heterozygosity in the DUP1 region at the
sites with sequence differences between the two copies (Fig. 4B). In contrast, fixed heterozygosity and an allele frequency bubble is not noted for DUP2 because one of the DUP2 copies is
located outside the region of suppressed recombination (see FM-2 in Fig. 1A). This result is also in complete agreement with the previously reported recombination event between a wild-type
and an _FM_ allele2. Given that the _FM_ allele likely originated thousands of years ago5, sufficient time has passed for sequence variation in the second copy of DUP2 and the corresponding
region of wild-type chromosomes to become randomized due to recombination. This is illustrated by the fact that the nucleotide diversity is clearly reduced only at the DUP1 region and its
flanking region (Fig. 3D) consistent with the observed allele frequency bubble (Fig. 4). EVOLUTION OF THE _FM_ LOCUS Our study did not reveal any genetic heterogeneity at the _FM_ locus
since exactly the same structural rearrangements were detected across all 44 FM chicken breeds analyzed. The _FM_ allele must involve multiple structural rearrangements and it is challenging
to establish the exact mechanism causing these rearrangements with confidence. However, one possible scenario is that the first event involved a duplication of the entire region from DUP1
to DUP2 (710.4 kb) by unequal cross-over, followed by an inversion involving DUP2 and the intervening sequence (412.5 kb) in which the latter was lost. This scenario is consistent with the
fixed heterozygosity for the DUP1 sequence, because the two copies originate from different chromosome homologs. The fixed heterozygosity at DUP1 has since then been maintained due to
suppressed recombination while the fixed heterozygosity for DUP2 has been eroded due to recombination affecting the second copy of DUP2 (Fig. 4). It is still an open question whether the two
events (the duplication and the inversion) occurred in a single meiosis or in a stepwise fashion over multiple generations. LIMITED CAPABILITY OF LONG-READ DATA IN COMPLEX STRUCTURAL
VARIATION For complex structural variations, such as the _FM_ locus, which involve copy number variation, rearrangements, inversion, and translocations, long-read sequencing faces
limitations for instance when the size of a duplication exceeds the read length. In such cases, the inter-sequence assembly strategy plays a critical role and can significantly influence the
accuracy of the assembly14. As more and more research focus on large-scale structural variation15, such as pan-genome16, we suggest that the assembly of regions showing complex structural
variation can be validated using population genomics data. Despite previous conflicting reports2,10,11,12, the population genomic approach employed here successfully clarified the structural
arrangement, demonstrating the value of population-scale sequencing in deciphering complex genomic rearrangements. STRUCTURAL VARIANTS AS DRIVERS OF PHENOTYPIC DIVERSITY The _FM_ locus
exemplifies how structural variants contribute to phenotypic diversity in domesticated animals17,18,19,20,21,22. These are often gain-of-function mutations with a dominant inheritance,
resulting in altered gene expression. Other prominent examples include Dominant white color in pigs21, Greying with age in horses17,18, and all three comb phenotypes in chicken, Pea-comb23,
Rose-comb24, and Duplex comb22. These cases collectively highlight the evolutionary significance of structural variants, particularly in shaping key morphological traits under artificial
selection in domestic animals. MATERIALS AND METHODS SAMPLES COLLECTION In this study, we analyzed 817 chicken samples, comprising 517 FM chicken samples from 44 breeds from various
geographic locations worldwide, and an additional 300 wild-type samples (Supplementary Table 1). All samples included in this analysis were previously reported in
studies9,10,11,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50. DATA PROCESSING Sequencing reads underwent quality control using FastQC (version 0.11.8) to
assess read quality metrics. Index adaptors and raw reads with more than 50% bases in low quality (Q ≤ 5) or 10% “N” content were filtered out using Btrim (version 0.3.0)51 software. Cleaned
reads were then mapped to the reference genome of the red junglefowl (GRCg6a) using the BWA-MEM (version 0.7.17-r1188)52 algorithm with default settings. Bam files were sorted using
SAMtools (version 1.9)53 and PCR duplicates were removed using Picard (version 2.18.6) tools. Sequencing depth relative to the reference genome was calculated using SAMtools (version 1.9)53.
The “RealignerTargetCreator” and “IndelRealigner” tools in Genome Analysis Toolkit (GATK, version 3.7)54 were used to reduce mismatches around INDELs, with base quality score recalibration
(BQSR) performed to reduce mapping errors. Joint calling for SNPs and small INDELs was performed using the “HaplotypeCaller” and “GenotypeGVCFs” tools from GATK (version 3.7)54. Low quality
SNPs were filtered, with parameters following previous research7,55: “QUAL < 30.0 || QD < 2.0 || MQ < 40.0 || FS > 60.0 || MQRankSum < − 12.5 || ReadPosRankSum < − 8.0 ||
SOR > 3.0”. Loci with max-missing rate above 0.10 and minor allele frequency less than 0.05 (–maf 0.05; –max-missing 0.9) were filtered using VCFtools (version 0.1.16)56. Only biallelic
loci were retained. SVS IDENTIFICATION SVs were detected using relative sequence depth analysis. Sequence depth was calculated using SAMtools (version 1.9)53 to identify the genomic regions
of interest, specifically focusing on the DUP1 (chr20:10,766,772–10,894,151) and DUP2 (chr20:11,306,686–11,477,501) regions. The relative sequencing depth is the ratio of the sequencing
depth of the candidate region to the whole genome background. To differentiate between wild type (_FM*N_/_N_), homozygous (_FM*FM_/_FM_), and heterozygous (_FM*FM_/_N_) samples, relative
sequence depth analysis was performed using the average sequence depth of the target region divided by the average sequence depth of whole genome or chromosome 20. NUCLEOTIDE DIVERSITY AND
GENETIC DIFFERENTIATION Nucleotide diversity (π) and genetic differentiation (_F_ST) was calculated using VCFtools (version 0.1.16)56 using a 20 kb window length with 10 kb sliding window,
focusing on the SV regions and their flanking sequences (chr20:10,600,000–11,700,000). To evaluate the statistical significance of _F_ST estimates across different genomic regions, we
performed a _Z_-score transformation relative to the genome-wide distribution. _P_-values were derived from the standard normal distribution, and Benjamini–Hochberg false discovery rate
(FDR) correction was applied to control for multiple testing. _F_ST windows with FDR-adjusted _P_-values < 0.05 were considered significantly different from the genome-wide expectation.
Allele frequency distributions were analyzed using VCFtools (version 0.1.16)56, with particular attention to differences between wild type, heterozygous, and homozygous FM chickens. Regions
showing significant deviations in allele frequency were identified and further examined for recombination breakpoints manually. PHYLOGENETIC ANALYSIS Phylogenetic relationships among FM
chickens were inferred using the identified SV regions. Neighbor-joining (NJ) trees were constructed using and RapidNJ software and visualized using iTOL57. Separate trees were constructed
for the DUP1 and DUP2 regions. LINKAGE DISEQUILIBRIUM ANALYSIS To assess LD decay patterns across the FM locus, we computed pairwise linkage disequilibrium (LD) using r2 between SNPs using
LDBlockShow (version 1.40)58 with default parameters. LD decay patterns were used to infer historical recombination rates, assuming a constant recombination landscape over time. The LD block
structure was analyzed to identify boundaries within the DUP1 and DUP2 regions, indicating possible recombination breakpoints. To determine whether LD decay in the _FM_ locus was
significantly different from the genome-wide background, we performed a genome-wide _Z_-score transformation within 10 kb windows. We then converted _Z_-scores to _P_-values using the
standard normal distribution and applied FDR correction to control for multiple testing. Regions with FDR-adjusted _P_-values < 0.05 were considered significantly different from
genome-wide LD expectations. This analysis assumes that LD patterns reflect both historical recombination events and population demographic history. Specifically, LD tends to persist in
regions under strong selection, low recombination, or with recent population bottlenecks. DATA AVAILABILITY All of the whole genome sequencing (WGS) datasets used in this study have been
previously reported (Supplementary Table 1). The variant call format (VCF) data was deposited in the Genome Variation Map (GVM) under accession project number PRJCA032154. CODE AVAILABILITY
All analyses were performed using previously published or developed tools, as indicated in Methods. No custom code was developed or used. REFERENCES * Andersson, L. & Purugganan, M.
Molecular genetic variation of animals and plants under domestication. _Proc. Natl. Acad. Sci. USA_ 119, e2122150119. https://doi.org/10.1073/pnas.2122150119 (2022). Article PubMed PubMed
Central CAS Google Scholar * Dorshorst, B. et al. A complex genomic rearrangement involving the endothelin 3 locus causes dermal hyperpigmentation in the chicken. _PLoS Genet._ 7,
e1002412. https://doi.org/10.1371/journal.pgen.1002412 (2011). Article PubMed PubMed Central CAS Google Scholar * Siddiqui, S. A., Toppi, V. & Syiffah, L. A comparative review on
Ayam Cemani chicken - A comparison with the most common chicken species in terms of nutritional values, LCA, price and consumer acceptance. _Trop. Anim. Health Prod._ 56, 161.
https://doi.org/10.1007/s11250-024-03980-6 (2024). Article PubMed PubMed Central CAS Google Scholar * Shoenfeld, N., Amital, H. & Shoenfeld, Y. The effect of melanism and vitamin D
synthesis on the incidence of autoimmune disease. _Nat. Clin. Pract. Rheumatol._ 5, 99–105. https://doi.org/10.1038/ncprheum0989 (2009). Article PubMed MATH CAS Google Scholar * Li, S.
_Compendium of materia medica_ 1st edn, 255 (Overseas Chinese Publishing House, 2017). MATH Google Scholar * Kim, I. S. et al. Microenvironment-derived factors driving metastatic
plasticity in melanoma. _Nat. Commun._ 8, 14343. https://doi.org/10.1038/ncomms14343 (2017). Article ADS PubMed PubMed Central CAS Google Scholar * Ma, C., Shi, X., Li, X., Zhang, Y.
P. & Peng, M. S. Comprehensive evaluation and guidance of structural variation detection tools in chicken whole genome sequence data. _BMC Genomics_ 25, 970.
https://doi.org/10.1186/s12864-024-10875-1 (2024). Article PubMed PubMed Central MATH CAS Google Scholar * Dharmayanthi, A. B. et al. The origin and evolution of fibromelanosis in
domesticated chickens: Genomic comparison of Indonesian Cemani and Chinese Silkie breeds. _PLoS One_ 12, e0173147. https://doi.org/10.1371/journal.pone.0173147 (2017). Article PubMed
PubMed Central CAS Google Scholar * Shinde, S. S., Sharma, A. & Vijay, N. Decoding the fibromelanosis locus complex chromosomal rearrangement of black-bone chicken: Genetic
differentiation, selective sweeps and protein-coding changes in Kadaknath chicken. _Front. Genet._ 14, 1180658. https://doi.org/10.3389/fgene.2023.1180658 (2023). Article PubMed PubMed
Central CAS Google Scholar * Sohn, J. I. et al. Whole genome and transcriptome maps of the entirely black native Korean chicken breed Yeonsan Ogye. _GigaScience_
https://doi.org/10.1093/gigascience/giy086 (2018). Article PubMed PubMed Central MATH Google Scholar * Zhu, F. et al. A chromosome-level genome assembly for the Silkie chicken resolves
complete sequences for key chicken metabolic, reproductive, and immunity genes. _Commun. Biol._ 6, 1233. https://doi.org/10.1038/s42003-023-05619-y (2023). Article PubMed PubMed Central
MATH CAS Google Scholar * Shinomiya, A. et al. Gene duplication of endothelin 3 is closely correlated with the hyperpigmentation of the internal organs (Fibromelanosis) in silky chickens.
_Genetics_ 190, 627–638. https://doi.org/10.1534/genetics.111.136705 (2012). Article PubMed PubMed Central MATH CAS Google Scholar * Sharma, A. & Vijay, N. What is the correct
genomic structure of the complex chromosomal rearrangement at the Fm locus in Silkie chicken?. _bioRxiv_ https://doi.org/10.1101/2024.02.05.578760 (2024). Article PubMed PubMed Central
MATH Google Scholar * Ahsan, M. U., Liu, Q., Perdomo, J. E., Fang, L. & Wang, K. A survey of algorithms for the detection of genomic structural variants from long-read sequencing data.
_Nat. Methods_ 20, 1143–1158. https://doi.org/10.1038/s41592-023-01932-w (2023). Article PubMed PubMed Central MATH CAS Google Scholar * Ho, S. S., Urban, A. E. & Mills, R. E.
Structural variation in the sequencing era. _Nat. Rev. Genet._ 21, 171–189. https://doi.org/10.1038/s41576-019-0180-9 (2020). Article PubMed MATH CAS Google Scholar * Liao, W. W. et al.
A draft human pangenome reference. _Nature_ 617, 312–324. https://doi.org/10.1038/s41586-023-05896-x (2023). Article ADS PubMed PubMed Central MATH CAS Google Scholar * Rubin, C. J.
et al. An intronic copy number variation in Syntaxin 17 determines speed of greying and melanoma incidence in Grey horses. _Nat. Commun._ 15, 7510. https://doi.org/10.1038/s41467-024-51898-2
(2024). Article PubMed PubMed Central MATH CAS Google Scholar * Rosengren Pielberg, G. et al. A cis-acting regulatory mutation causes premature hair graying and susceptibility to
melanoma in the horse. _Nat. Genet._ 40, 1004–1009. https://doi.org/10.1038/ng.185 (2008). Article PubMed CAS Google Scholar * Liu, X. et al. Evolutionary origin of genomic structural
variations in domestic yaks. _Nat. Commun._ 14, 5617. https://doi.org/10.1038/s41467-023-41220-x (2023). Article ADS PubMed PubMed Central MATH CAS Google Scholar * Serres-Armero, A.
et al. Copy number variation underlies complex phenotypes in domestic dog breeds and other canids. _Genome Res._ 31, 762–774. https://doi.org/10.1101/gr.266049.120 (2021). Article PubMed
PubMed Central CAS Google Scholar * Giuffra, E. et al. A large duplication associated with dominant white color in pigs originated by homologous recombination between LINE elements
flanking KIT. _Mamm. Genome_ 13, 569–577. https://doi.org/10.1007/s00335-002-2184-5 (2002). Article PubMed CAS Google Scholar * Dorshorst, B. et al. A genomic duplication is associated
with ectopic eomesodermin expression in the embryonic chicken comb and two duplex-comb phenotypes. _PLoS Genet._ 11, e1004947. https://doi.org/10.1371/journal.pgen.1004947 (2015). Article
PubMed PubMed Central CAS Google Scholar * Wright, D. et al. Copy number variation in intron 1 of SOX5 causes the Pea-comb phenotype in chickens. _PLoS Genet._ 5, e1000512.
https://doi.org/10.1371/journal.pgen.1000512 (2009). Article PubMed PubMed Central CAS Google Scholar * Imsland, F. et al. The Rose-comb mutation in chickens constitutes a structural
rearrangement causing both altered comb morphology and defective sperm motility. _PLoS Genet._ 8, e1002775. https://doi.org/10.1371/journal.pgen.1002775 (2012). Article PubMed PubMed
Central MATH CAS Google Scholar * Ulfah, M. et al. Genetic features of red and green junglefowls and relationship with Indonesian native chickens Sumatera and Kedu Hitam. _BMC Genomics_
17, 320. https://doi.org/10.1186/s12864-016-2652-z (2016). Article PubMed PubMed Central CAS Google Scholar * Wang, M. S. et al. 863 genomes reveal the origin and domestication of
chicken. _Cell Res._ 30, 693–701. https://doi.org/10.1038/s41422-020-0349-y (2020). Article PubMed PubMed Central MATH CAS Google Scholar * Youm, D. J. et al. The idiosyncratic genome
of Korean long-tailed chicken as a valuable genetic resource. _iScience_ 26, 106236. https://doi.org/10.1016/j.isci.2023.106236 (2023). Article ADS PubMed PubMed Central MATH CAS
Google Scholar * Chen, X. et al. Population genomic sequencing delineates global landscape of copy number variations that drive domestication and breed formation of in chicken. _Front.
Genet._ 13, 830393. https://doi.org/10.3389/fgene.2022.830393 (2022). Article PubMed PubMed Central CAS Google Scholar * Wang, Y. M. et al. Integrating genomic and transcriptomic data
to reveal genetic mechanisms underlying piao chicken rumpless trait. _Genomics Proteomics Bioinform._ 19, 787–799. https://doi.org/10.1016/j.gpb.2020.06.019 (2021). Article MATH Google
Scholar * Zhi, Y. et al. Genome-wide genetic structure of Henan indigenous chicken breeds. _Animals_ https://doi.org/10.3390/ani13040753 (2023). Article PubMed PubMed Central MATH
Google Scholar * Beauclair, L. et al. Sequence properties of certain GC rich avian genes, their origins and absence from genome assemblies: case studies. _BMC Genomics_ 20, 734.
https://doi.org/10.1186/s12864-019-6131-1 (2019). Article PubMed PubMed Central MATH CAS Google Scholar * Bortoluzzi, C. et al. Parallel genetic origin of foot feathering in birds.
_Mol. Biol. Evolut._ 37, 2465–2476. https://doi.org/10.1093/molbev/msaa092 (2020). Article MATH CAS Google Scholar * Fan, W. L. et al. Genome-wide patterns of genetic variation in two
domestic chickens. _Genome Biol. Evolut._ 5, 1376–1392. https://doi.org/10.1093/gbe/evt097 (2013). Article MATH Google Scholar * Yi, G. et al. Genome-wide patterns of copy number
variation in the diversified chicken genomes using next-generation sequencing. _BMC Genomics_ 15, 962. https://doi.org/10.1186/1471-2164-15-962 (2014). Article PubMed PubMed Central MATH
Google Scholar * Noorai, R. E., Shankar, V., Freese, N. H., Gregorski, C. M. & Chapman, S. C. Discovery of genomic variations by whole-genome resequencing of the North American
Araucana chicken. _PLoS One_ 14, e0225834. https://doi.org/10.1371/journal.pone.0225834 (2019). Article PubMed PubMed Central CAS Google Scholar * Chen, C. et al. Whole-genome
resequencing reveals melanin deposition candidate genes of Luning chicken. _BMC Genomics_ 25, 858. https://doi.org/10.1186/s12864-024-10774-5 (2024). Article PubMed PubMed Central MATH
CAS Google Scholar * Gu, J., Liang, Q., Liu, C. & Li, S. Genomic analyses reveal adaptation to hot arid and harsh environments in native chickens of China. _Front. Genet._ 11, 582355.
https://doi.org/10.3389/fgene.2020.582355 (2020). Article PubMed PubMed Central CAS Google Scholar * Lawal, R. A. et al. The wild species genome ancestry of domestic chickens. _BMC
Biol._ 18, 13. https://doi.org/10.1186/s12915-020-0738-1 (2020). Article PubMed PubMed Central MATH CAS Google Scholar * Tian, S. et al. Genomic analyses reveal genetic adaptations to
tropical climates in chickens. _iScience_ 23, 101644. https://doi.org/10.1016/j.isci.2020.101644 (2020). Article ADS PubMed PubMed Central CAS Google Scholar * Zhao, X. et al.
Significant genomic introgression from grey junglefowl (_Gallus sonneratii_) to domestic chickens (_Gallus gallus domesticus_). _J. Anim. Sci. Biotechnol._ 15, 45.
https://doi.org/10.1186/s40104-024-01006-7 (2024). Article PubMed PubMed Central CAS Google Scholar * Chu, J. et al. The genomic characteristics affect phenotypic diversity from the
perspective of genetic improvement of economic traits. _iScience_ 26, 106426. https://doi.org/10.1016/j.isci.2023.106426 (2023). Article ADS PubMed PubMed Central MATH CAS Google
Scholar * Li, M. et al. De novo assembly of 20 chicken genomes reveals the undetectable phenomenon for thousands of core genes on microchromosomes and subtelomeric regions. _Mol. Biol.
Evolut._ https://doi.org/10.1093/molbev/msac066 (2022). Article Google Scholar * Guo, Y. et al. Researching on the fine structure and admixture of the worldwide chicken population reveal
connections between populations and important events in breeding history. _Evolut. Appl._ 15, 553–564. https://doi.org/10.1111/eva.13241 (2022). Article MATH Google Scholar * Wu, M. Y. et
al. Historic and modern genomes unveil a domestic introgression gradient in a wild red junglefowl population. _Evolut. Appl._ 13, 2300–2315. https://doi.org/10.1111/eva.13023 (2020).
Article MATH Google Scholar * Li, D. et al. Breeding history and candidate genes responsible for black skin of Xichuan black-bone chicken. _BMC Genomics_ 21, 511.
https://doi.org/10.1186/s12864-020-06900-8 (2020). Article PubMed PubMed Central MATH CAS Google Scholar * Zeng, T. et al. Analysis of genome and methylation changes in Chinese
indigenous chickens over time provides insight into species conservation. _Commun. Biol._ 5, 952. https://doi.org/10.1038/s42003-022-03907-7 (2022). Article PubMed PubMed Central MATH
CAS Google Scholar * Zhong, H. A. et al. Microevolutionary mechanism of high-altitude adaptation in Tibetan chicken populations from an elevation gradient. _Evolut. Appl._ 15, 2100–2112.
https://doi.org/10.1111/eva.13503 (2022). Article MATH CAS Google Scholar * Tan, X. et al. Whole-genome variants dataset of 209 local chickens from China. _Sci. Data_ 11, 169.
https://doi.org/10.1038/s41597-024-02995-w (2024). Article PubMed PubMed Central MATH CAS Google Scholar * Xu, D. et al. Whole-genome sequencing revealed genetic diversity, structure
and patterns of selection in Guizhou indigenous chickens. _BMC Genomics_ 24, 570. https://doi.org/10.1186/s12864-023-09621-w (2023). Article PubMed PubMed Central MATH CAS Google
Scholar * Bendesky, A. et al. The main genetic locus associated with the evolution of gamecocks is centered on ISPD. _G3-Genes Genomes Genet._ https://doi.org/10.1093/g3journal/jkad267
(2024). Article Google Scholar * Kong, Y. Btrim: A fast, lightweight adapter and quality trimming program for next-generation sequencing technologies. _Genomics_ 98, 152–153.
https://doi.org/10.1016/j.ygeno.2011.05.009 (2011). Article PubMed MATH CAS Google Scholar * Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler
transform. _Bioinformatics_ 25, 1754–1760. https://doi.org/10.1093/bioinformatics/btp324 (2009). Article PubMed PubMed Central MATH CAS Google Scholar * Li, H. et al. The sequence
alignment/map format and SAMtools. _Bioinformatics_ 25, 2078–2079. https://doi.org/10.1093/bioinformatics/btp352 (2009). Article PubMed PubMed Central MATH CAS Google Scholar *
McKenna, A. et al. The genome analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. _Genome Res._ 20, 1297–1303. https://doi.org/10.1101/gr.107524.110
(2010). Article PubMed PubMed Central MATH CAS Google Scholar * Gu, L. H. et al. Genomic insights into local adaptation and phenotypic diversity of Wenchang chickens. _Poultry Sci._
103, 103376. https://doi.org/10.1016/j.psj.2023.103376 (2024). Article MATH CAS Google Scholar * Danecek, P. et al. The variant call format and VCFtools. _Bioinformatics_ 27, 2156–2158.
https://doi.org/10.1093/bioinformatics/btr330 (2011). Article PubMed PubMed Central MATH CAS Google Scholar * Letunic, I. & Bork, P. Interactive Tree of Life (iTOL) v6: Recent
updates to the phylogenetic tree display and annotation tool. _Nucleic Acids Res._ 52, W78–W82. https://doi.org/10.1093/nar/gkae268 (2024). Article PubMed PubMed Central Google Scholar *
Dong, S. S. et al. LDBlockShow: A fast and convenient tool for visualizing linkage disequilibrium and haplotype blocks based on variant call format files. _Brief. Bioinform._
https://doi.org/10.1093/bib/bbaa227 (2021). Article PubMed PubMed Central Google Scholar Download references ACKNOWLEDGEMENTS We thank Min-Sheng Peng (Kunming Institute of Zoology,
Chinese Academy of Sciences, Kunming, China) and Xun-He Huang (School of Life Science, Jiaying University, Meizhou, China) for their support and discussion. FUNDING Open access funding
provided by Uppsala University. The project was financially supported by Vetenskapsrådet (2017-02907), Knut and Alice Wallenberg Foundation (KAW 2023.0160), and Erik Philip-Sörensen
Foundation (G2024-033). AUTHOR INFORMATION AUTHORS AND AFFILIATIONS * Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden Cheng Ma & Leif Andersson *
Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, USA Leif Andersson Authors * Cheng Ma View author publications You can also search for this
author inPubMed Google Scholar * Leif Andersson View author publications You can also search for this author inPubMed Google Scholar CONTRIBUTIONS C.M. and L.A conceived and designed the
project. C.M. performed bioinformatic analyses. C.M. and L.A. wrote the manuscript. CORRESPONDING AUTHORS Correspondence to Cheng Ma or Leif Andersson. ETHICS DECLARATIONS COMPETING
INTERESTS The authors declare no competing interests. ETHICAL APPROVAL Data were directly downloaded from published studies and no additional ethics approval was needed. Each study is
referenced and details on ethics approval are available in each manuscript. ADDITIONAL INFORMATION PUBLISHER’S NOTE Springer Nature remains neutral with regard to jurisdictional claims in
published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION 1. SUPPLEMENTARY INFORMATION 2. RIGHTS AND PERMISSIONS OPEN ACCESS This article is licensed
under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give
appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in
this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative
Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a
copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Ma, C., Andersson, L. Population genomic analysis
identifies the complex structural variation at the fibromelanosis (_FM_) locus in chicken. _Sci Rep_ 15, 9239 (2025). https://doi.org/10.1038/s41598-025-94250-4 Download citation * Received:
04 December 2024 * Accepted: 12 March 2025 * Published: 18 March 2025 * DOI: https://doi.org/10.1038/s41598-025-94250-4 SHARE THIS ARTICLE Anyone you share the following link with will be
able to read this content: Get shareable link Sorry, a shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing
initiative