| 年份 | 2019 |
| 學(xué)科 | 計算生物與生物信息學(xué)?COMPUTATIONAL BIOLOGY AND BIOINFORMATICS |
| 國家/州 | NJ,United States of America |
Enabling Personalized Medicine: A Novel Deep Learning Tool for Classifying Genetic Mutations Using Text from Clinical Evidence
The understanding of genetic mutations and their effects is the foundation of personalized medicine. Currently, this interpretation is time-consuming, costly, and susceptible to bias, involving the manual reviewing of thousands of scientific texts on individual mutations. To address these issues, a deep-learning natural language processing tool was developed to automatically classify genetic variants and their effects. Opensource data on genetic variants and related clinical literature was utilized to engineer features that represent the relationship between variations and their impacts. Text-mining algorithms such as term frequency-inverse document frequency, coupled with high dimensional vector representations, were performed on the text corpus to embed the relationships between terms. Additionally, physicochemical properties of the substituted amino acids and their respective Grantham scores were used to map the severity of the changes and the amino acid evolutionary distances. The machine-learning system is the concatenation of a Multi-Layer Perceptron and a bidirectional Long Short-Term Memory Network that incorporates dimensionality reduction to capture principle features and mitigate noise. After training, the predictor achieved a high accuracy of 92.3% and an F1-score of 85.5. The tool was validated based on feature prioritization and previously annotated mutations through cross-validation. The deep learning predictor was then applied to currently unclassified genetic variations and identified 13 as novel oncogenic mutations. Ultimately, this study not only helps solve one of precision medicine’s primary limitations, but also presents industry viability, significantly streamlining the research process and potentially leading to the development of new therapies.
高中生科研 英特爾 Intel ISEF
資訊 · 課程 · 全程指導(dǎo)
請掃碼添加微信好友

[vc_btn title="聯(lián)系電話:(021) 63526628" color="black" align="center" css_animation="fadeIn" link="url:tel%3A02163526628||target:%20_blank|rel:nofollow"]
英特爾國際科學(xué)與工程大獎賽,簡稱 "ISEF",由美國 Society for Science and the Public(科學(xué)和公共服務(wù)協(xié)會)主辦,英特爾公司冠名贊助,是全球規(guī)模最大、等級最高的中學(xué)生的科研科創(chuàng)賽事。ISEF 的學(xué)術(shù)活動學(xué)科包括了所有數(shù)學(xué)、自然科學(xué)、工程的全部領(lǐng)域和部分社會科學(xué)。ISEF 素有全球青少年科學(xué)學(xué)術(shù)活動的“世界杯”之美譽,旨在鼓勵學(xué)生團隊協(xié)作,開拓創(chuàng)新,長期專一深入地研究自己感興趣的課題。
>>> 實用鏈接匯總 <<<
· 數(shù)學(xué) · 物理 · 化學(xué) · 生物 · 計算機 · 工程 ·
Studies that primarily focus on the discipline and techniques of computer science and mathematics as they relate to biological systems. This includes the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavior, and social systems.
Computational Biomodeling?(MOD):?Studies that involve computer simulations of biological systems most commonly with a goal of understanding how cells or organism develop, work collectively and survive.
Computational Epidemiology (EPD):?The study of disease frequency and distribution, and risk factors and socioeconomic determinants of health within populations. Such studies may include gathering information to confirm existence of disease outbreaks, developing case definitions and analyzing epidemic data, establishing disease surveillance, and implementing methods of disease prevention and control.
Computational Evolutionary Biology?(EVO):?A study that applies the discipline and techniques of computer science and mathematics to explore the processes of change in populations of organisms, especially taxonomy, paleontology, ethology, population genetics and ecology.
Computational Neuroscience?(NEU):?A study that applies the discipline and techniques of computer science and mathematics to understand brain function in terms of the information processing properties of the structures that make up the nervous system.
Computational Pharmacology?(PHA):?A study that applies the discipline and techniques of computer science and mathematics to predict and analyze the responses to drugs.
Genomics?(GEN):?The study of the function and structure of genomes using recombinant DNA, sequencing, and bioinformatics.
Other?(OTH):?Studies that cannot be assigned to one of the above subcategories. If the project involves multiple subcategories, the principal subcategory should be chosen instead of Other.

? 2025. All Rights Reserved. 滬ICP備2023009024號-1