AI-Based Sequence Similarity Analysis as Digital Genetic Evidence: A Pilot Study on Growth-Related Genes

dc.citation.epage22
dc.citation.issue1
dc.citation.spage13
dc.citation.volume1
dc.contributor.authorRosyid R. Al-Hakim
dc.contributor.authorGalih Samodra
dc.contributor.authorYanuar Zulardiansyah Arief
dc.contributor.authorReina Melani
dc.contributor.authorLukman Hakim
dc.contributor.authorFiqih Nurkholis
dc.contributor.departmentFaculty of Engineering
dc.date.accessioned2026-03-19T02:34:53Z
dc.date.issued2026-02-28
dc.description.abstractIntroduction — Stunting remains a major public health challenge, particularly in low- and middle-income countries, where growth impairment is influenced by complex interactions between environmental and biological factors. While nutritional and socioeconomic determinants have been extensively studied, the potential role of genetic susceptibility related to growth regulation remains underexplored from a bio-digital and forensic informatics perspective. This study investigates whether sequence-level similarity patterns among growth-related genes can be represented as digital genetic evidence using artificial intelligence–based computational analysis. Methods — This pilot exploratory study analyzed protein and coding DNA sequences of six candidate growth-related genes (IGF1, IGF1R, GH1, GHR, LEP, SLC39A8) obtained from curated RefSeq Homo sapiens databases. An alignment-free analytical framework was implemented using k-mer term frequency–inverse document frequency (TF-IDF) feature extraction combined with principal component analysis for dimensionality reduction. Pairwise similarity assessment and embedding-based visualization were employed to explore latent sequence relationships. Results — The analysis revealed distinct similarity patterns among growth-related genes, with hormonally associated genes and receptor proteins forming coherent clusters, while nutrient transporter–related genes exhibited clear separation in the embedding space. These patterns were biologically plausible and consistent with known functional characteristics, despite the absence of explicit functional annotation during feature extraction. Conclusion — The findings demonstrate that AI-based alignment-free sequence analysis can generate reproducible similarity representations that function as digital genetic evidence. As a pilot exploratory study, this work highlights the feasibility of sequence-level similarity profiling for investigating growth-related genetic susceptibility, while providing a methodological foundation for future large-scale and population specific studies.
dc.description.referencesUncontrolled Keywords: digital genetic evidence; bioinformatics; artificial intelligence; sequence similarity; stunting susceptibility
dc.description.statusPublished
dc.identifier.emailayzulardiansyah@unimas.my
dc.identifier.urihttps://ejournal.uhb.ac.id/index.php/jbdfi/article/view/2189
dc.identifier.urihttps://scholarhub.unimas.my/handle/123456789/259
dc.publisherLPPM Universitas Harapan Bangsa, Indonesia
dc.relation.ispartofJournal of Bio-Digital and Forensic Informatics (JBDFI)
dc.titleAI-Based Sequence Similarity Analysis as Digital Genetic Evidence: A Pilot Study on Growth-Related Genes
dc.typeArticles
dc.type.statusYes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Al-Hakim+et+al_JBDFI_1-1_13-22.pdf
Size:
3.53 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections