Objective: To develop and validate a computed tomography(CT)-based radiomics nomogram for predicting human epidermal growth factor receptor 2(HER2) status in patients with gastric cancer.Methods: This retrospective st...Objective: To develop and validate a computed tomography(CT)-based radiomics nomogram for predicting human epidermal growth factor receptor 2(HER2) status in patients with gastric cancer.Methods: This retrospective study included 134 patients with gastric cancer(HER2-negative: n=87;HER2-positive: n=47) from April 2013 to March 2018, who were then randomly divided into training(n=94) and validation(n=40) cohorts. Radiomics features were obtained from the CT images showing gastric cancer. Least absolute shrinkage and selection operator(LASSO) regression analysis was utilized for building the radiomics signature. A multivariable logistic regression method was applied to develop a prediction model incorporating the radiomics signature and independent clinicopathologic risk predictors, which were then visualized as a radiomics nomogram. The predictive performance of the nomogram was assessed in the training and validation cohorts.Results: The radiomics signature was significantly associated with HER2 status in both training(P<0.001) and validation(P=0.023) cohorts. The prediction model that incorporated the radiomics signature and carcinoembryonic antigen(CEA) level demonstrated good discriminative performance for HER2 status prediction,with an area under the curve(AUC) of 0.799 [95% confidence interval(95% CI): 0.704-0.894] in the training cohort and 0.771(95% CI: 0.607-0.934) in the validation cohort. The calibration curve of the radiomics nomogram also showed good calibration. Decision curve analysis showed that the radiomics nomogram was useful.Conclusions: We built and validated a radiomics nomogram with good performance for HER2 status prediction in gastric cancer. This radiomics nomogram could serve as a non-invasive tool to predict HER2 status and guide clinical treatment.展开更多
The fields of regenerative medicine and tissue engineering offer new therapeutic options to restore,maintain or improve tissue function following disease or injury.To maximize the biological function of a tissue-engin...The fields of regenerative medicine and tissue engineering offer new therapeutic options to restore,maintain or improve tissue function following disease or injury.To maximize the biological function of a tissue-engineered clinical product,specific conditions must be maintained within a bioreactor to allow the maturation of the product in preparation for implantation.Specifically,the bioreactor should be designed to mimic the mechanical,electrochemical and biochemical environment that the product will be exposed to in vivo.Real-time monitoring of the functional capacity of tissue-engineered products during manufacturing is a critical component of the quality management process.The present review provides a brief overview of bioreactor engineering considerations.In addition,strategies for bioreactor automation,in-line product monitoring and quality assurance are discussed.展开更多
A switching variability index (SVl) constant false alarm rate (CFAR) detector is proposed for improving the detection performance of VI-CFAR detectors in multiple targets backgrounds. When the presence of non-homo...A switching variability index (SVl) constant false alarm rate (CFAR) detector is proposed for improving the detection performance of VI-CFAR detectors in multiple targets backgrounds. When the presence of non-homogeneity in CFAR reference windows is indicated by a VI-CFAR detector, a switching- CFAR detector is introduced to optimize the performance of the VI-CFAR detector in homogeneous, multiple targets and clutter edge backgrounds. The structure and parameters selection method of the SVI-CFAR detector is presented. Comparisons with classic CFAR detectors and recently proposed detectors are also given. Theoretical analysis and simulation results show that SVICFAR detector maintains the good performance of the VI-CFAR detector in homogeneous and clutter edge backgrounds, while greatly improving the capacity of anti-multi targets.展开更多
Osteosarcoma is the most common primary bone sarcoma that mostly occurs in young adults. The causes of osteosarcoma are heterogeneous and still not fully understood. Identification of novel, important oncogenic factor...Osteosarcoma is the most common primary bone sarcoma that mostly occurs in young adults. The causes of osteosarcoma are heterogeneous and still not fully understood. Identification of novel, important oncogenic factors in osteosarcoma and development of better, effective therapeutic approaches are in urgent need for better treatment of osteosarcoma patients. In this study, we uncovered that the oncogene MYC is significantly upregulated in metastastic osteosarcoma samples. In addition, high MYC expression is associated with poor survival of osteosarcoma patients. Analysis of MYC targets in osteosarcoma revealed that most of the osteosarcoma super enhancer genes are bound by MYC. Treatment of osteosarcoma cells with super enhancer inhibitors THZ1 and JQ1 effectively suppresses the proliferation, migration, and invasion of osteosarcoma cells. Mechanistically,THZ1 treatment suppresses a large group of super enhancer containing MYC target genes including CDK6 and TGFB2. These findings revealed that the MYC-driven super enhancer signaling is crucial for the osteosarcoma tumorigenesis and targeting the MYC/super enhancer axis represents as a promising therapeutic strategy for treatment of osteosarcoma patients.展开更多
非常高兴有机会来到协和,这里有很多老朋友。我在美国国立肿瘤研究所(National Cancer Institute,NCI)工作了5年,专攻生物信息学。今天的演讲分为三个部分:第一部分,从NCI角度简要介绍生物信息领域出现的新动态与新需求;第二部分,如...非常高兴有机会来到协和,这里有很多老朋友。我在美国国立肿瘤研究所(National Cancer Institute,NCI)工作了5年,专攻生物信息学。今天的演讲分为三个部分:第一部分,从NCI角度简要介绍生物信息领域出现的新动态与新需求;第二部分,如何把生物信息技术转化到一些平台的发展中;展开更多
This study aims to investigate the short-term effects of ambient air pollutants on outpatient visits for childhood allergic diseases.Daily data on ambient air pollutants(NO2,SO2,CO and PM2.5)and outpatient visits for ...This study aims to investigate the short-term effects of ambient air pollutants on outpatient visits for childhood allergic diseases.Daily data on ambient air pollutants(NO2,SO2,CO and PM2.5)and outpatient visits for childhood allergic diseases(asthma,atopic dermatitis and allergic rhinitis)were obtained in Shanghai,China from 2013 to 2014.The effects of ambient air pollutants were estimated for total outpatient visits for childhood allergic diseases,gender and age stratification and disease classification by using distributed lag non-linear model(DLNM).We found positive associations between short-term exposure to air pollutants and childhood allergic diseases.Girls and children aged 7 years old were more likely to be sensitive to ambient air pollutants.NO2 and SO2 showed stronger effects on asthma and atopic dermatitis,respectively.This study provides evidence that short-term exposure to ambient air pollutants can increase the risk of childhood allergic diseases.展开更多
Transcriptional regulators(TRs)participate in essential processes in cancer pathogenesis and are critical therapeutic targets.Identification of drug response-related TRs from cell line-based compound screening data is...Transcriptional regulators(TRs)participate in essential processes in cancer pathogenesis and are critical therapeutic targets.Identification of drug response-related TRs from cell line-based compound screening data is often challenging due to low m RNA abundance of TRs,protein modifications,and other confounders(CFs).In this study,we developed a regression-based pharmacogenomic and Ch IP-seq data integration method(Re Phine)to infer the impact of TRs on drug response through integrative analyses of pharmacogenomic and Ch IP-seq data.Re Phine was evaluated in simulation and pharmacogenomic data and was applied to pan-cancer datasets with the goal of biological discovery.In simulation data with added noises or CFs and in pharmacogenomic data,Re Phine demonstrated an improved performance in comparison with three commonly used methods(including Pearson correlation analysis,logistic regression model,and gene set enrichment analysis).Utilizing Re Phine and Cancer Cell Line Encyclopedia data,we observed that Re Phinederived TR signatures could effectively cluster drugs with different mechanisms of action.Re Phine predicted that loss-offunction of EZH2/PRC2 reduces cancer cell sensitivity toward the BRAF inhibitor PLX4720.Experimental validation confirmed that pharmacological EZH2 inhibition increases the resistance of cancer cells to PLX4720 treatment.Our results support that Re Phine is a useful tool for inferring drug response-related TRs and for potential therapeutic applications.The source code for Re Phine is freely available at https://github.com/coexps/Re Phine.展开更多
With the advances in artificial intelligence(AI),data‐driven algorithms are becoming increasingly popular in the medical domain.However,due to the nonlinear and complex behavior of many of these algorithms,decision‐...With the advances in artificial intelligence(AI),data‐driven algorithms are becoming increasingly popular in the medical domain.However,due to the nonlinear and complex behavior of many of these algorithms,decision‐making by such algorithms is not trustworthy for clinicians and is considered a blackbox process.Hence,the scientific community has introduced explainable artificial intelligence(XAI)to remedy the problem.This systematic scoping review investigates the application of XAI in breast cancer detection and risk prediction.We conducted a comprehensive search on Scopus,IEEE Explore,PubMed,and Google Scholar(first 50 citations)using a systematic search strategy.The search spanned from January 2017 to July 2023,focusing on peer‐reviewed studies implementing XAI methods in breast cancer datasets.Thirty studies met our inclusion criteria and were included in the analysis.The results revealed that SHapley Additive exPlanations(SHAP)is the top model‐agnostic XAI technique in breast cancer research in terms of usage,explaining the model prediction results,diagnosis and classification of biomarkers,and prognosis and survival analysis.Additionally,the SHAP model primarily explained tree‐based ensemble machine learning models.The most common reason is that SHAP is model agnostic,which makes it both popular and useful for explaining any model prediction.Additionally,it is relatively easy to implement effectively and completely suits performant models,such as tree‐based models.Explainable AI improves the transparency,interpretability,fairness,and trustworthiness of AI‐enabled health systems and medical devices and,ultimately,the quality of care and outcomes.展开更多
Computational modeling has emerged as a time-saving and cost-effective alternative to traditional animal testing for assessing chemicals for their potential hazards.However,few computational modeling studies for immun...Computational modeling has emerged as a time-saving and cost-effective alternative to traditional animal testing for assessing chemicals for their potential hazards.However,few computational modeling studies for immunotoxicity were reported,with few models available for predicting toxicants due to the lack of training data and the complex mechanisms of immunotoxicity.In this study,we employed a data-driven quantitative structure–activity relationship(QSAR)modeling workflow to extensively enlarge the limited training data by revealing multiple targets involved in immunotoxicity.To this end,a probe data set of 6,341 chemicals was obtained from a high-throughput screening(HTS)assay testing for the activation of the aryl hydrocarbon receptor(AhR)signaling pathway,a key event leading to immunotoxicity.Searching this probe data set against PubChem yielded 3,183 assays with testing results for varying proportions of these 6,341 compounds.100 assays were selected to develop QSAR models based on their correlations to AhR agonism.Twelve individual QSAR models were built for each assay using combinations of four machine-learning algorithms and three molecular fingerprints.5-fold cross-validation of the resulting models showed good predictivity(average CCR=0.73).A total of 20 assays were further selected based on QSAR model performance,and their resulting QSAR models showed good predictivity of potential immunotoxicants from external chemicals.This study provides a computational modeling strategy that can utilize large public toxicity data sets for modeling immunotoxicity and other toxicity endpoints,which have limited training data and complicated toxicity mechanisms.展开更多
Background Asthma has been a global problem,especially in children.We aim to evaluate the contemporary prevalence and influencing factors of asthma among children aged 3–7 years in Shanghai,China.Methods A random sam...Background Asthma has been a global problem,especially in children.We aim to evaluate the contemporary prevalence and influencing factors of asthma among children aged 3–7 years in Shanghai,China.Methods A random sample of preschool children was included in this study.The International Study of Asthma and Allergies in Childhood questionnaire was adopted to assess the childhood asthma.Multivariable logistic regression models were used to evaluate the associations between independent variables and childhood asthma.Results Of 6389 preschool children who were invited to take part in this study,6163(response rate:96.5%)completed the questionnaire and were included in the analysis.The overall prevalence of asthma was 14.6%which increased more than six folds from 2.1%in 1990.Being male,younger age,preterm delivery,being born in spring or autumn,being delivered by elective cesarean section without indication,miscarriage,high socioeconomic status,having allergy history,and exposure to passive smoking,latex paint,and dust were potential risk factors for childhood asthma.Spending more time outdoors(>30 min/day),having indoor plants,and cleaning rooms more frequently were potential protective factors.Conclusions The prevalence of childhood asthma in Shanghai has increased dramatically during the past three decades.The findings about risk and protective factors of childhood asthma could be used to develop appropriate strategies to prevent and control childhood asthma in Shanghai and in other similar metropolitan cities.展开更多
The FAIR principles articulate the behaviors expected from digital artifacts that are Findable,Accessible,Interoperable and Reusable by machines and by people.Although by now widely accepted,the FAIR Principles by des...The FAIR principles articulate the behaviors expected from digital artifacts that are Findable,Accessible,Interoperable and Reusable by machines and by people.Although by now widely accepted,the FAIR Principles by design do not explicitly consider actual implementation choices enabling FAIR behaviors.As different communities have their own,often well-established implementation preferences and priorities for data reuse,coordinating a broadly accepted,widely used FAIR implementation approach remains a global challenge.In an effort to accelerate broad community convergence on FAIR implementation options,the GO FAIR community has launched the development of the FAIR Convergence Matrix.The Matrix is a platform that compiles for any community of practice,an inventory of their self-declared FAIR implementation choices and challenges.The Convergence Matrix is itself a FAIR resource,openly available,and encourages voluntary participation by any self-identified community of practice(not only the GO FAIR Implementation Networks).Based on patterns of use and reuse of existing resources,the Convergence Matrix supports the transparent derivation of strategies that optimally coordinate convergence on standards and technologies in the emerging Internet of FAIR Data and Services.展开更多
Comprehensive characterization of spatial and temporal gene expression patterns in humans is critical for uncovering the regulatory codes of the human genome and understanding the molecular mechanisms of human disease...Comprehensive characterization of spatial and temporal gene expression patterns in humans is critical for uncovering the regulatory codes of the human genome and understanding the molecular mechanisms of human diseases.Ubiquitously expressed genes(UEGs)refer to the genes expressed across a majority of,if not all,phenotypic and physiological conditions of an organism.It is known that many human genes are broadly expressed across tissues.However,most previous UEG studies have only focused on providing a list of UEGs without capturing their global expression patterns,thus limiting the potential use of UEG information.In this study,we proposed a novel data-driven framework to leverage the extensive collection of40,000 human transcriptomes to derive a list of UEGs and their corresponding global expression patterns,which offers a valuable resource to further characterize human transcriptome.Our results suggest that about half(12,234;49.01%)of the human genes are expressed in at least 80%of human transcriptomes,and the median size of the human transcriptome is 16,342 genes(65.44%).Through gene clustering,we identified a set of UEGs,named LoVarUEGs,which have stable expression across human transcriptomes and can be used as internal reference genes for expression measurement.To further demonstrate the usefulness of this resource,we evaluated the global expression patterns for 16 previously predicted disallowed genes in islet beta cells and found that seven of these genes showed relatively more varied expression patterns,suggesting that the repression of these genes may not be unique to islet beta cells.展开更多
Tuberculosis drug resistance continues to threaten global health but the underline molecular mechanisms are not clear.Ethambutol(EMB),one of the well-known first-line drugs in tuberculosis treatment is,unfortunately,n...Tuberculosis drug resistance continues to threaten global health but the underline molecular mechanisms are not clear.Ethambutol(EMB),one of the well-known first-line drugs in tuberculosis treatment is,unfortunately,not free from drug resistance problems.Genomic studies have shown that some genetic mutations in Mycobacterium tuberculosis(Mtb)EmbR,and EmbC/A/B genes cause EMB resistance.EmbR-PknH pair controls embC/A/B operon,which encodes EmbC/A/B genes,and EMB interacts with EmbA/B proteins.However,the EmbR binding site on PknH was unknown.We conducted molecular simulation on the EmbR-peptides binding structures and discovered phosphorylated PknH 273-280(N′-HEALS^(P)DPD-C′)makesβstrand with the EmbR FHA domain,asβ-MoRF(MoRF;molecular recognition feature)does at its binding site.Hydrogen bond number analysis also supported the peptides’β-MoRF forming activity at the EmbR FHA domain.Also,we discovered that previously known phosphorylation residues might have their chronological order according to the phosphorylation status.The discovery validated that Mtb PknH 273-280(N′-HEALSDPD-C′)has reliable EmbR binding affinity.This approach is revolutionary in the computer-aided drug discovery field,because it is the first trial to discover the protein-protein interaction site,and find binding partner in nature from this site.展开更多
基金supported by the National Key Research and Development Program of China (No. 2017YFC1309100)National Natural Scientific Foundation of China (No. 81771912, 81601469, and 81701782)+1 种基金the Science and Technology Planning Project of Guangdong Province (No. 2017B020227012)the Science and Technology Planning Project of Guangzhou (No. 20191A011002).
文摘Objective: To develop and validate a computed tomography(CT)-based radiomics nomogram for predicting human epidermal growth factor receptor 2(HER2) status in patients with gastric cancer.Methods: This retrospective study included 134 patients with gastric cancer(HER2-negative: n=87;HER2-positive: n=47) from April 2013 to March 2018, who were then randomly divided into training(n=94) and validation(n=40) cohorts. Radiomics features were obtained from the CT images showing gastric cancer. Least absolute shrinkage and selection operator(LASSO) regression analysis was utilized for building the radiomics signature. A multivariable logistic regression method was applied to develop a prediction model incorporating the radiomics signature and independent clinicopathologic risk predictors, which were then visualized as a radiomics nomogram. The predictive performance of the nomogram was assessed in the training and validation cohorts.Results: The radiomics signature was significantly associated with HER2 status in both training(P<0.001) and validation(P=0.023) cohorts. The prediction model that incorporated the radiomics signature and carcinoembryonic antigen(CEA) level demonstrated good discriminative performance for HER2 status prediction,with an area under the curve(AUC) of 0.799 [95% confidence interval(95% CI): 0.704-0.894] in the training cohort and 0.771(95% CI: 0.607-0.934) in the validation cohort. The calibration curve of the radiomics nomogram also showed good calibration. Decision curve analysis showed that the radiomics nomogram was useful.Conclusions: We built and validated a radiomics nomogram with good performance for HER2 status prediction in gastric cancer. This radiomics nomogram could serve as a non-invasive tool to predict HER2 status and guide clinical treatment.
基金US Army Medical Research and Development Command through the Medical Technology Enterprise Consortium under Contract#W81XWH-15-9-0001.
文摘The fields of regenerative medicine and tissue engineering offer new therapeutic options to restore,maintain or improve tissue function following disease or injury.To maximize the biological function of a tissue-engineered clinical product,specific conditions must be maintained within a bioreactor to allow the maturation of the product in preparation for implantation.Specifically,the bioreactor should be designed to mimic the mechanical,electrochemical and biochemical environment that the product will be exposed to in vivo.Real-time monitoring of the functional capacity of tissue-engineered products during manufacturing is a critical component of the quality management process.The present review provides a brief overview of bioreactor engineering considerations.In addition,strategies for bioreactor automation,in-line product monitoring and quality assurance are discussed.
基金supported by the National Natural Science Foundation of China(61102158)the China Postdoctoral Science Foundation(2011M500667)
文摘A switching variability index (SVl) constant false alarm rate (CFAR) detector is proposed for improving the detection performance of VI-CFAR detectors in multiple targets backgrounds. When the presence of non-homogeneity in CFAR reference windows is indicated by a VI-CFAR detector, a switching- CFAR detector is introduced to optimize the performance of the VI-CFAR detector in homogeneous, multiple targets and clutter edge backgrounds. The structure and parameters selection method of the SVI-CFAR detector is presented. Comparisons with classic CFAR detectors and recently proposed detectors are also given. Theoretical analysis and simulation results show that SVICFAR detector maintains the good performance of the VI-CFAR detector in homogeneous and clutter edge backgrounds, while greatly improving the capacity of anti-multi targets.
基金supported by National Natural Science Foundation of China (81670874, 81500354, and 81772999)Shenzhen Science Foundation (JCYJ20160308104109234)
文摘Osteosarcoma is the most common primary bone sarcoma that mostly occurs in young adults. The causes of osteosarcoma are heterogeneous and still not fully understood. Identification of novel, important oncogenic factors in osteosarcoma and development of better, effective therapeutic approaches are in urgent need for better treatment of osteosarcoma patients. In this study, we uncovered that the oncogene MYC is significantly upregulated in metastastic osteosarcoma samples. In addition, high MYC expression is associated with poor survival of osteosarcoma patients. Analysis of MYC targets in osteosarcoma revealed that most of the osteosarcoma super enhancer genes are bound by MYC. Treatment of osteosarcoma cells with super enhancer inhibitors THZ1 and JQ1 effectively suppresses the proliferation, migration, and invasion of osteosarcoma cells. Mechanistically,THZ1 treatment suppresses a large group of super enhancer containing MYC target genes including CDK6 and TGFB2. These findings revealed that the MYC-driven super enhancer signaling is crucial for the osteosarcoma tumorigenesis and targeting the MYC/super enhancer axis represents as a promising therapeutic strategy for treatment of osteosarcoma patients.
基金the Interdisciplinary Program of Shanghai Jiao Tong University(No.ZH2018QNA30)the Three Year Action Plan of Shanghai Public Health System Construction(No.GWV-10.1-XK05)。
文摘This study aims to investigate the short-term effects of ambient air pollutants on outpatient visits for childhood allergic diseases.Daily data on ambient air pollutants(NO2,SO2,CO and PM2.5)and outpatient visits for childhood allergic diseases(asthma,atopic dermatitis and allergic rhinitis)were obtained in Shanghai,China from 2013 to 2014.The effects of ambient air pollutants were estimated for total outpatient visits for childhood allergic diseases,gender and age stratification and disease classification by using distributed lag non-linear model(DLNM).We found positive associations between short-term exposure to air pollutants and childhood allergic diseases.Girls and children aged 7 years old were more likely to be sensitive to ambient air pollutants.NO2 and SO2 showed stronger effects on asthma and atopic dermatitis,respectively.This study provides evidence that short-term exposure to ambient air pollutants can increase the risk of childhood allergic diseases.
基金supported by the National Key R&D Program of China(2018YFC0910500)the Neil Shen’s SJTU Medical Research Fund+6 种基金the SJTU-Yale Collaborative Research Seed Fundthe National Natural Science Foundation of China(Grant Nos.31370751 and 31728012)the Shanghai Municipal Commission of Health and Family Planning(Grant No.20144Y0179)the Science and Technology Commission of Shanghai Municipality(STCSM)(Grant No.17DZ 22512000)the Shanghai Municipal Science and Technology Major Project(Grant No.2018SHZDZX01)the Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence(LCNBI)ZJLab。
文摘Transcriptional regulators(TRs)participate in essential processes in cancer pathogenesis and are critical therapeutic targets.Identification of drug response-related TRs from cell line-based compound screening data is often challenging due to low m RNA abundance of TRs,protein modifications,and other confounders(CFs).In this study,we developed a regression-based pharmacogenomic and Ch IP-seq data integration method(Re Phine)to infer the impact of TRs on drug response through integrative analyses of pharmacogenomic and Ch IP-seq data.Re Phine was evaluated in simulation and pharmacogenomic data and was applied to pan-cancer datasets with the goal of biological discovery.In simulation data with added noises or CFs and in pharmacogenomic data,Re Phine demonstrated an improved performance in comparison with three commonly used methods(including Pearson correlation analysis,logistic regression model,and gene set enrichment analysis).Utilizing Re Phine and Cancer Cell Line Encyclopedia data,we observed that Re Phinederived TR signatures could effectively cluster drugs with different mechanisms of action.Re Phine predicted that loss-offunction of EZH2/PRC2 reduces cancer cell sensitivity toward the BRAF inhibitor PLX4720.Experimental validation confirmed that pharmacological EZH2 inhibition increases the resistance of cancer cells to PLX4720 treatment.Our results support that Re Phine is a useful tool for inferring drug response-related TRs and for potential therapeutic applications.The source code for Re Phine is freely available at https://github.com/coexps/Re Phine.
文摘With the advances in artificial intelligence(AI),data‐driven algorithms are becoming increasingly popular in the medical domain.However,due to the nonlinear and complex behavior of many of these algorithms,decision‐making by such algorithms is not trustworthy for clinicians and is considered a blackbox process.Hence,the scientific community has introduced explainable artificial intelligence(XAI)to remedy the problem.This systematic scoping review investigates the application of XAI in breast cancer detection and risk prediction.We conducted a comprehensive search on Scopus,IEEE Explore,PubMed,and Google Scholar(first 50 citations)using a systematic search strategy.The search spanned from January 2017 to July 2023,focusing on peer‐reviewed studies implementing XAI methods in breast cancer datasets.Thirty studies met our inclusion criteria and were included in the analysis.The results revealed that SHapley Additive exPlanations(SHAP)is the top model‐agnostic XAI technique in breast cancer research in terms of usage,explaining the model prediction results,diagnosis and classification of biomarkers,and prognosis and survival analysis.Additionally,the SHAP model primarily explained tree‐based ensemble machine learning models.The most common reason is that SHAP is model agnostic,which makes it both popular and useful for explaining any model prediction.Additionally,it is relatively easy to implement effectively and completely suits performant models,such as tree‐based models.Explainable AI improves the transparency,interpretability,fairness,and trustworthiness of AI‐enabled health systems and medical devices and,ultimately,the quality of care and outcomes.
基金National Institute of General Medical Sciences(Grant R01GM148743)National Institute of Child Health and Human Development(Grant UHD113039)+1 种基金National Science Foundation(Grant 2402311)National Institute of Environmental Health Sciences(Grants R01ES031080 and R35ES031709).
文摘Computational modeling has emerged as a time-saving and cost-effective alternative to traditional animal testing for assessing chemicals for their potential hazards.However,few computational modeling studies for immunotoxicity were reported,with few models available for predicting toxicants due to the lack of training data and the complex mechanisms of immunotoxicity.In this study,we employed a data-driven quantitative structure–activity relationship(QSAR)modeling workflow to extensively enlarge the limited training data by revealing multiple targets involved in immunotoxicity.To this end,a probe data set of 6,341 chemicals was obtained from a high-throughput screening(HTS)assay testing for the activation of the aryl hydrocarbon receptor(AhR)signaling pathway,a key event leading to immunotoxicity.Searching this probe data set against PubChem yielded 3,183 assays with testing results for varying proportions of these 6,341 compounds.100 assays were selected to develop QSAR models based on their correlations to AhR agonism.Twelve individual QSAR models were built for each assay using combinations of four machine-learning algorithms and three molecular fingerprints.5-fold cross-validation of the resulting models showed good predictivity(average CCR=0.73).A total of 20 assays were further selected based on QSAR model performance,and their resulting QSAR models showed good predictivity of potential immunotoxicants from external chemicals.This study provides a computational modeling strategy that can utilize large public toxicity data sets for modeling immunotoxicity and other toxicity endpoints,which have limited training data and complicated toxicity mechanisms.
基金The study was funded by special grant for Preschool Children’s Health Management from Shanghai Municipal Education Commission,grants from National Natural Science Foundation of China(81874266,81673183)key project from Shanghai Municipal Science and Technology Commission(18411951600).
文摘Background Asthma has been a global problem,especially in children.We aim to evaluate the contemporary prevalence and influencing factors of asthma among children aged 3–7 years in Shanghai,China.Methods A random sample of preschool children was included in this study.The International Study of Asthma and Allergies in Childhood questionnaire was adopted to assess the childhood asthma.Multivariable logistic regression models were used to evaluate the associations between independent variables and childhood asthma.Results Of 6389 preschool children who were invited to take part in this study,6163(response rate:96.5%)completed the questionnaire and were included in the analysis.The overall prevalence of asthma was 14.6%which increased more than six folds from 2.1%in 1990.Being male,younger age,preterm delivery,being born in spring or autumn,being delivered by elective cesarean section without indication,miscarriage,high socioeconomic status,having allergy history,and exposure to passive smoking,latex paint,and dust were potential risk factors for childhood asthma.Spending more time outdoors(>30 min/day),having indoor plants,and cleaning rooms more frequently were potential protective factors.Conclusions The prevalence of childhood asthma in Shanghai has increased dramatically during the past three decades.The findings about risk and protective factors of childhood asthma could be used to develop appropriate strategies to prevent and control childhood asthma in Shanghai and in other similar metropolitan cities.
基金FAIRsharing is funded by grants awarded to SAS that include elements of this workspecifically,grants from the UK BBSRC and Research Councils(BB/L024101/1,BB/L005069/1)+1 种基金European Union(H2020-EU.3.1,634107,H2020-EU.1.4.1.3,654241,H2020-EU.1.4.1.1,676559),IMI(116060)and NIH(U54 AI117925,1U24AI117966-01,1OT3OD025459-01,1OT3OD025467-01,1OT3OD025462-01)the new FAIRsharing award from the Wellcome Trust(212930/Z/18/Z),as well as a related award(208381/A/17/Z).
文摘The FAIR principles articulate the behaviors expected from digital artifacts that are Findable,Accessible,Interoperable and Reusable by machines and by people.Although by now widely accepted,the FAIR Principles by design do not explicitly consider actual implementation choices enabling FAIR behaviors.As different communities have their own,often well-established implementation preferences and priorities for data reuse,coordinating a broadly accepted,widely used FAIR implementation approach remains a global challenge.In an effort to accelerate broad community convergence on FAIR implementation options,the GO FAIR community has launched the development of the FAIR Convergence Matrix.The Matrix is a platform that compiles for any community of practice,an inventory of their self-declared FAIR implementation choices and challenges.The Convergence Matrix is itself a FAIR resource,openly available,and encourages voluntary participation by any self-identified community of practice(not only the GO FAIR Implementation Networks).Based on patterns of use and reuse of existing resources,the Convergence Matrix supports the transparent derivation of strategies that optimally coordinate convergence on standards and technologies in the emerging Internet of FAIR Data and Services.
基金We thank Dr.Yongkun Wang from the Network and Information Center at Shanghai Jiao Tong University(SJTU)for his support in high-performance computing.We thank Ph.D.Candidate Wei Liu from Yale University for her support in the acquisition of physiological trait-related genes.HL is supported by the National Key R&D Program of China(Grant No.2018YFC0910500)JG and JD are supported by the SJTU-Yale Collaborative Research Seed Fund and Neil Shen’s SJTU Medical Research Fund,China.JG and HL are partially supported by the Shanghai Municipal Commission of Health and Family Planning,China(Grant No.2018ZHYL0223)the Science and Technology Commission of Shanghai Municipality(STCSM),China(Grant No.17DZ2251200).
文摘Comprehensive characterization of spatial and temporal gene expression patterns in humans is critical for uncovering the regulatory codes of the human genome and understanding the molecular mechanisms of human diseases.Ubiquitously expressed genes(UEGs)refer to the genes expressed across a majority of,if not all,phenotypic and physiological conditions of an organism.It is known that many human genes are broadly expressed across tissues.However,most previous UEG studies have only focused on providing a list of UEGs without capturing their global expression patterns,thus limiting the potential use of UEG information.In this study,we proposed a novel data-driven framework to leverage the extensive collection of40,000 human transcriptomes to derive a list of UEGs and their corresponding global expression patterns,which offers a valuable resource to further characterize human transcriptome.Our results suggest that about half(12,234;49.01%)of the human genes are expressed in at least 80%of human transcriptomes,and the median size of the human transcriptome is 16,342 genes(65.44%).Through gene clustering,we identified a set of UEGs,named LoVarUEGs,which have stable expression across human transcriptomes and can be used as internal reference genes for expression measurement.To further demonstrate the usefulness of this resource,we evaluated the global expression patterns for 16 previously predicted disallowed genes in islet beta cells and found that seven of these genes showed relatively more varied expression patterns,suggesting that the repression of these genes may not be unique to islet beta cells.
基金This work was supported by the National Institutes of Health Grant No.7R01GM118467-05the National Natural Science Foundation of China(31720103901).
文摘Tuberculosis drug resistance continues to threaten global health but the underline molecular mechanisms are not clear.Ethambutol(EMB),one of the well-known first-line drugs in tuberculosis treatment is,unfortunately,not free from drug resistance problems.Genomic studies have shown that some genetic mutations in Mycobacterium tuberculosis(Mtb)EmbR,and EmbC/A/B genes cause EMB resistance.EmbR-PknH pair controls embC/A/B operon,which encodes EmbC/A/B genes,and EMB interacts with EmbA/B proteins.However,the EmbR binding site on PknH was unknown.We conducted molecular simulation on the EmbR-peptides binding structures and discovered phosphorylated PknH 273-280(N′-HEALS^(P)DPD-C′)makesβstrand with the EmbR FHA domain,asβ-MoRF(MoRF;molecular recognition feature)does at its binding site.Hydrogen bond number analysis also supported the peptides’β-MoRF forming activity at the EmbR FHA domain.Also,we discovered that previously known phosphorylation residues might have their chronological order according to the phosphorylation status.The discovery validated that Mtb PknH 273-280(N′-HEALSDPD-C′)has reliable EmbR binding affinity.This approach is revolutionary in the computer-aided drug discovery field,because it is the first trial to discover the protein-protein interaction site,and find binding partner in nature from this site.