Czech J. Genet. Plant Breed., X:X | DOI: 10.17221/116/2025-CJGPB

Whole genome sequencing analysis of two sinensis tea (Camellia sinensis var. sinensis) clones: Assessment of molecular variations to search for breeding markersOriginal Paper

Dwinita Wikan Utami1*, Adhityo Wicaksono2, M. Khais Prayoga3, Heri Syahrian3, Vitria P. Rahadi3, Erdiansyah Rezamela3, Budi Martono4, Tri Joko Santoso1, Nur Kholilatul Izzah4, Haris Maulana1, Ady Daryanto1, Rerenstradika Tizar Terryana5, Imas Rita Saadah1, David Virya Chen2
1 Research Center for Horticulture, Research Organization for Agriculture and Food, National Research and Innovation Agency (BRIN), Bogor, West Java Province, Indonesia
2 Scientific Department, Genomik Solidaritas Indonesia (GSI Lab), Jakarta, Indonesia
3 Research Institute for Tea and Cinchona, Bandung, Indonesia
4 Research Center for Estate Crops, Research Organization for Agriculture and Food, National Research and Innovation Agency (BRIN), Bogor, West Java Province, Indonesia
5 Research Center for Applied Botany, Research Organization for Life Sciences and Environment, National Research and Innovation Agency (BRIN), Bogor, West Java Province, Indonesia

Tea (Camellia sinensis (L.) Kuntze) is a globally important crop valued for its flavour diversity and health benefits. Whole-genome sequencing (WGS) was performed to compare genomic variation and functional potential between clone Yabukita and locally adapted clone I.1.93. Using next-generation sequencing, approximately 10× genome coverage was achieved for both clones, with high mapping efficiency (98.24% for Yabukita and 97.88% for clone I.1.93), ensuring reliable downstream analyses. Single nucleotide polymorphism (SNP) analysis revealed distinct genomic patterns, with Yabukita showing a more uniform chromosomal SNP distribution, while clone I.1.93 exhibited higher SNP densities on specific chromosomes, particularly chromosomes 5 and 13. Silent mutations predominated in Yabukita (48.21%), whereas missense mutations were more frequent in clone I.1.93 (57.97%), suggesting greater functional divergence. Most SNPs occurred in non-coding regions, indicating potential regulatory roles. GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses revealed highly similar shared pathways, including photosynthesis and protein interactions, alongside clone-specific enrichment related to photosynthesis in Yabukita and stress responses in clone I.1.93. miRNA profiling identified distinct regulatory patterns, including the clone-specific miR530 in clone I.1.93. Biosynthetic gene cluster analysis further predicted secondary metabolite pathways associated with terpenoid, polyketide, and saccharide biosynthesis. These findings provide valuable genomic insights for tea improvement and breeding programs.

Keywords: comparative genomics; next-generation sequencing; single-nucleotide polymorphism; tea breeding

Received: November 24, 2025; Revised: January 20, 2026; Accepted: February 6, 2026; Prepublished online: February 23, 2026 

Download citation

References

  1. An Y., Zhang X., Jiang S., Zhao J., Zhang F. (2022): TeaPVs: A comprehensive genomic variation database for tea plant (Camellia sinensis). BMC Plant Biology, 22: 513. Go to original source... Go to PubMed...
  2. Aziz M.A., Masmoudi K. (2025): Molecular breakthroughs in modern plant breeding techniques. Horticultural Plant Journal, 11: 15-41. Go to original source...
  3. Buchfink B., Xie C., Huson D.H. (2014): Fast and sensitive protein alignment using DIAMOND. Nature Methods, 12: 59-60. Go to original source... Go to PubMed...
  4. Christinet L., Burdet F.X., Zaiko M., Hinz U., Zrÿd J.P. (2004): Characterization and functional identification of a novel plant 4,5-extradiol dioxygenase involved in betalain pigment biosynthesis in Portulaca grandiflora. Plant Physiology, 134: 265-274. Go to original source... Go to PubMed...
  5. Cingolani P., Platts A., Wang L.L., Coon M., Nguyen T., Wang L., Land S.J., Lu X., Ruden D.M. (2012): A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly, 6: 80-92. Go to original source... Go to PubMed...
  6. Doyle J., Doyle J. (1987): A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin, 19: 11-15.
  7. Galaxy Community (2024): The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update. Nucleic Acids Research, 52: W83-W94. Go to original source... Go to PubMed...
  8. Gorelova V., Ambach L., Rébeillé F., Stove C., Van Der Straeten D. (2017): Folates in plants: Research advances and progress in crop biofortification. Frontiers in Chemistry, 5: 1-20. Go to original source... Go to PubMed...
  9. Gupta P., O'Neill H., Wolvetang E.J., Chatterjee A., Gupta I. (2024): Advances in single-cell long-read sequencing technologies. NAR Genomics and Bioinformatics, 6: 1-11. Go to original source... Go to PubMed...
  10. Hanson A.D., Gregory J.F. (2011): Folate biosynthesis, turnover, and transport in plants. Annual Review of Plant Biology, 62: 105-125. Go to original source... Go to PubMed...
  11. Harvey W.T., Ebert P., Ebler J., Audano P.A., Munson K.M., Hoekzema K., Porubsky D., Beck C.R., Marschall T., Garimella K., Eichler E.E. (2023): Whole-genome long-read sequencing down sampling and its effect on variant-calling precision and recall. Genome Research, 33: 2029-2040. Go to original source... Go to PubMed...
  12. Hossain R., Quispe C., Saikat A.S.M., Jain D., Habib A., Janmeda P., Islam M.T., Radha, Daştan S.D., Kumar M., Butnariu M., Cho W.C., Sharifi-Rad J., Kipchakbayeva A., Calina D. (2022): Biosynthesis of secondary metabolites based on the regulation of microRNAs. BioMed Research International, 2022: 1-15. Go to original source... Go to PubMed...
  13. Huang D.W., Sherman B.T., Lempicki R.A. (2009): Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols, 4: 44-57. Go to original source... Go to PubMed...
  14. Iacopino S., Licausi F. (2020): The contribution of plant dioxygenases to hypoxia signaling. Frontiers in Plant Science, 11: 1-8. Go to original source... Go to PubMed...
  15. Itoh T., Tokunaga K., Matsuda Y., Fujii I., Abe I., Ebizuka Y., Kushiro T. (2010): Reconstitution of a fungal meroterpenoid biosynthesis reveals the involvement of a novel family of terpene cyclases. Nature Chemistry, 2: 858-864. Go to original source... Go to PubMed...
  16. Kautsar S.A., Suarez Duran H.G., Blin K., Osbourn A., Medema M.H. (2017): PlantiSMASH: Automated identification, annotation and expression analysis of plant biosynthetic gene clusters. Nucleic Acids Research, 45: W55-W63. Go to original source... Go to PubMed...
  17. Li H. (2018): Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics, 34: 3094-3100. Go to original source... Go to PubMed...
  18. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. (2009): The Sequence Alignment/Map format and SAMtools. Bioinformatics, 25: 2078-2079. Go to original source... Go to PubMed...
  19. Li H., Lin Q., Yan M., Wang M., Wang P., Zhao H., Wang Y., Ni D., Guo F. (2021): Relationship between secondary metabolism and miRNA for important flavor compounds in different tissues of tea plant (Camellia sinensis) as revealed by genome-wide miRNA analysis. Journal of Agricultural and Food Chemistry, 69: 2001-2012. Go to original source... Go to PubMed...
  20. Liu D., Zhang C., Ye Y., Mei P., Gong Y., Liu Z., Sun C., Zhao X., Ding S., Chen J., Chen L., Ma C. (2025): TEA5K: A high-resolution and liquid-phase multiple-SNP array for molecular breeding in tea plant. Journal of Nanobiotechnology, 23: 481. Go to original source... Go to PubMed...
  21. Lozada D.N., Bosland P.W., Barchenger D.W., Haghshenas-Jaryani M., Sanogo S., Walker S. (2022): Chile pepper (Capsicum) breeding and improvement in the multi-omics era. Frontiers in Plant Science, 13: 1-10. Go to original source... Go to PubMed...
  22. Martín J.F., Liras P. (2022): Comparative molecular mechanisms of biosynthesis of naringenin and related chalcones in actinobacteria and plants. Antibiotics, 11: 1-21. Go to original source... Go to PubMed...
  23. Martono B., Syafaruddin. (2018): Genetic variability of 21 tea genotypes (Camellia sinensis) based on RAPD markers. Journal of Industrial and Beverage Crops, 5: 77-86. Go to original source...
  24. Nawrocki E.P., Eddy S.R. (2013): Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics, 29: 2933-2935. Go to original source... Go to PubMed...
  25. Nelson C.E., Hersh B.M., Carroll S.B. (2004): The regulatory content of intergenic DNA shapes genome architecture. Genome Biology, 5: R25. Go to original source... Go to PubMed...
  26. Olena A.F., Patton J.G. (2010): Genomic organization of microRNAs. Journal of Cellular Physiology, 222: 540-545. Go to original source... Go to PubMed...
  27. Pichersky E., Raguso R.A. (2018): Why do plants produce so many terpenoid compounds? New Phytologist, 220: 692-702. Go to original source... Go to PubMed...
  28. Prayoga M.K., Syahrian H., Rahadi V.P., Atmaja M.I.P., Maulana H., Anas. (2022): Quality diversity of 35 tea clones (Camellia sinensis var. sinensis) processed for green tea. Biodiversitas, 23: 810-816. Go to original source...
  29. Singh J., Kumar D., Chauhan S., Dhillon H.K., Kumar S., Kumar V., Kapoor R. (2023): Role of omics approaches in vegetable breeding for insect pest resistance. SN Applied Sciences, 5: 1-10. Go to original source...
  30. Srivastava S., Singh R. (2018): Comparative study of withanolide biosynthesis-related miRNAs in root and leaf tissues of Withania somnifera. Applied Biochemistry and Biotechnology, 185: 1145-1159. Go to original source... Go to PubMed...
  31. Stanke M., Morgenstern B. (2005): AUGUSTUS: A web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Research, 33: 465-467. Go to original source... Go to PubMed...
  32. Valentic T.R., Jackson D.R., Brady S.F., Tsai S.C. (2016): Comprehensive analysis of a novel ketoreductase for pentangular polyphenol biosynthesis. ACS Chemical Biology, 11: 3421-3430. Go to original source... Go to PubMed...
  33. Wei S., Zhang W., Fu R., Zhang Y. (2021): Genome-wide characterization of 2-oxoglutarate and Fe (II)-dependent dioxygenase family genes in tomato during growth cycle and their roles in metabolism. BMC Genomics, 22: 1-14. Go to original source... Go to PubMed...
  34. Winterhalter P., Rouseff R. (2001): Carotenoid-derived aroma compounds: An introduction. ACS Symposium Series, 7: 1-17. Go to original source...
  35. Xu X., Bai G. (2015): Whole-genome resequencing: Changing the paradigms of SNP detection, molecular mapping and gene discovery. Molecular Breeding, 35: 1-10. Go to original source...
  36. Yagi C., Ikeda N., Sato D. (2010): Characteristics of eight Japanese tea cultivars. College of Tropical Agriculture and Human Resources, 1167-1168.
  37. Zhang H., Li Z., Zhou S., Li S.M., Ran H., Song Z., Yu T., Yin W.B. (2022): A fungal NRPS-PKS enzyme catalyses the formation of the flavonoid naringenin. Nature Communications, 13: 1-11. Go to original source... Go to PubMed...
  38. Zhu Q.W., Luo Y.P. (2013): Identification of miRNAs and their targets in tea (Camellia sinensis). Journal of Zhejiang University - Science B, 14: 916-923. Go to original source... Go to PubMed...
  39. Zverinova S., Guryev V. (2022): Variant calling: Considerations, practices, and developments. Human Mutation, 43: 976-985. Go to original source... Go to PubMed...

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.