Prediction of chromosomal abnormalities in the screening of the first trimester of pregnancy using machine learning methods: a study protocol

Shaban, Mahla; Mollazadeh, Sanaz; Eslami, Saeid; Tara, Fatemeh; Sharif, Samaneh; Arghavanian, Fatemeh Erfanian

doi:10.1186/s12978-024-01839-5

Study Protocol
Open access
Published: 03 July 2024

Reproductive Health volume 21, Article number: 101 (2024)

Abstract

Background

After first-trimester screening, women classified as high-risk are recommended to undergo amniocentesis or chorionic villus sampling. Machine learning has become more accurate over time and has numerous applications in enhancing decision-making, patient care, and service quality in nursing and midwifery. This study aims to develop an optimal model using machine learning techniques, particularly neural networks, to predict chromosomal abnormalities and to evaluate its predictive performance.

Methods/design

This cross-sectional study will be conducted in midwifery clinics in Mashhad, Iran, in 2024. Data will be collected from 350 pregnant women in the high-risk group who underwent first-trimester screening (between 11 and 14 weeks of pregnancy). The information collected will include maternal age, BMI, smoking habits, history of trisomy 21 and other chromosomal disorders, CRL and NT measurements, PAPP-A and β-hCG levels, presence of insulin-dependent diabetes, and whether the pregnancy resulted from IVF. The women will be followed up during their clinic visits, and their amniocentesis results will be recorded. Participants will be recruited by convenience sampling, and data will be gathered using a checklist of characteristics and screening/amniocentesis results. After preprocessing, feature selection will be conducted to identify the relevant predictive features. The model will be trained and evaluated using K-fold cross-validation.

Discussion

There is a growing interest in utilizing artificial intelligence methods, like machine learning and deep learning, in nursing and midwifery. This underscores the critical necessity for nurses and midwives to be well-versed in artificial intelligence methods and their healthcare applications. It can be beneficial to develop a machine learning model, specifically focusing on neural networks, for predicting chromosomal abnormalities.

Ethical code

IR.MUMS.NURSE.REC.1402.134

Plain English Summary

Approximately 3% of newborns are affected by congenital abnormalities and genetic diseases, leading to disability and death. Among live births, around 3000 cases of Down syndrome (trisomy 21) can be expected based on the country's birth rate. Pregnant women carrying fetuses with Down syndrome face an increased risk of pregnancy complications. Artificial intelligence methods, such as machine learning and deep learning, are being used in nursing and midwifery to improve decision-making, patient care, and research. Nurses need to actively participate in the development and implementation of AI-based decision support systems. Additionally, nurses and midwives should play a key role in evaluating the effectiveness of artificial intelligence-based technologies in professional practice.

Background

Congenital abnormalities and genetic diseases lead to disability and death in approximately 3% of newborns [1]. Chromosomal disorders, including trisomy 21, trisomy 18, trisomy 13, and sex chromosome disorders, affect about 1 in 150 live births [2]. These disorders can lead to physical and psychological challenges in affected children and an increased risk of pregnancy complications for their mothers [3,4,5,6]. Screening tests for aneuploidy involve assessing certain hormone levels and using ultrasound to measure nuchal translucency [7,8,9]. First-trimester screening combines two biochemical markers, free β-human chorionic gonadotropin (free β-hCG) and pregnancy-associated plasma protein A (PAPP-A), with ultrasound measurement of nuchal translucency, performed between the 11th and 14th weeks of pregnancy [3]. High-risk individuals may undergo invasive procedures such as amniocentesis or chorionic villus sampling. However, these procedures can have complications and may increase stress and anxiety among mothers [10,11,12]. Additionally, studies have shown that only a small percentage of cases identified as high-risk actually have aneuploidy [1, 13]. Midwives play a crucial role in providing advice and care to mothers during pregnancy, delivery, and the postpartum period, offering emotional support to reduce anxiety and stress [14].

Recently, interest in artificial intelligence (AI) methods such as machine learning and deep learning has surged worldwide. These methods are being integrated into nursing and midwifery to enhance decision-making, patient care, service delivery, and research. It is essential for nurses and midwives to be actively engaged in the development and implementation of AI-based decision support systems, particularly when these systems affect their direct patient care. They should also play a more active role in conducting detailed, interdisciplinary research to assess the clinical, ethical, and legal implications of AI-based technologies in professional practice [15,16,17].

Machine learning, a subfield of computer science and AI, uses data and algorithms to imitate human learning and steadily improve accuracy. It involves developing algorithms that learn from experience to enhance system performance, using data as the source of experience to build predictive models [18,19,20]. Machine learning aims to create systems that can learn and make decisions without being explicitly programmed, and it may help predict chromosomal abnormalities, potentially informing decisions about procedures for pregnant mothers. The aim of the present study is to create a machine learning model, focusing on neural networks, to predict chromosomal abnormalities.

Main goal

Predicting chromosomal abnormalities during the first trimester of pregnancy through machine learning techniques.

Specific objectives:

1. Assessing the sensitivity of the optimized neural network in predicting chromosomal abnormalities during first-trimester screening.
2. Identifying the key characteristics of the optimized neural network for predicting chromosomal abnormalities during first-trimester screening.
3. Comparing the performance of the optimized neural network with that of decision trees in predicting chromosomal abnormalities during first-trimester screening.
4. Comparing the performance of the optimized neural network with that of random forest in predicting chromosomal abnormalities during first-trimester screening.

Research inquiries:

1. How sensitive is the optimized neural network in predicting chromosomal abnormalities during first-trimester screening?
2. What are the distinguishing features of the optimized neural network in predicting chromosomal abnormalities during first-trimester screening?
3. Is there a significant difference between the results of the optimized neural network and those of the decision tree in predicting chromosomal abnormalities during first-trimester screening?
4. Do the results of the optimized neural network differ from those of random forest in predicting chromosomal abnormalities during first-trimester screening?
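
The protocol does not state how a "significant difference" between two models would be tested. One common option for two classifiers evaluated on the same cases is McNemar's exact test, sketched below purely as an assumption; the function names and the toy predictions are hypothetical and are not study data.

```python
from math import comb

def discordant_counts(y_true, pred_a, pred_b):
    """Count cases where exactly one of the two models is correct."""
    only_a = sum(1 for t, a, b in zip(y_true, pred_a, pred_b) if a == t and b != t)
    only_b = sum(1 for t, a, b in zip(y_true, pred_a, pred_b) if b == t and a != t)
    return only_a, only_b

def mcnemar_exact_p(only_a: int, only_b: int) -> float:
    """Two-sided exact McNemar p-value.

    Under the null hypothesis the discordant cases split 50/50, so the
    smaller count follows a Binomial(only_a + only_b, 0.5) distribution.
    """
    n = only_a + only_b
    if n == 0:
        return 1.0  # the models never disagree on correctness
    tail = sum(comb(n, i) for i in range(min(only_a, only_b) + 1)) / 2 ** n
    return min(1.0, 2.0 * tail)

# Hypothetical labels and predictions (1 = abnormal karyotype).
y_true    = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0]
pred_nn   = [1, 0, 0, 1, 0, 1, 0, 1, 1, 0]  # neural network (made up)
pred_tree = [1, 0, 1, 0, 0, 1, 0, 1, 0, 0]  # decision tree (made up)

only_nn, only_tree = discordant_counts(y_true, pred_nn, pred_tree)
p_value = mcnemar_exact_p(only_nn, only_tree)
print(f"NN-only correct: {only_nn}, tree-only correct: {only_tree}, p = {p_value:.3f}")
```

Only the cases on which the two models disagree enter the test, so with a sample of a few hundred women the number of discordant cases, not the total sample size, determines how much evidence the comparison can provide.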

Methods/design

Study design

This study will use a cross-sectional design. It involves data from 350 pregnant women who underwent first-trimester screening at 11 to 14 weeks of pregnancy at midwifery clinics in Mashhad and were classified as high-risk. After receiving approval from the Medical Ethics Committee of Mashhad University of Medical Sciences and a letter of recommendation from the Faculty of Nursing and Midwifery of Mashhad, the researcher contacted the research centers, obtained the necessary permissions, and began sampling at the midwifery clinics each day. Data collection covers maternal age, BMI, maternal smoking, history of trisomy 21, CRL, NT, PAPP-A, β-hCG, presence of insulin-dependent diabetes, and IVF pregnancy status. These data are gathered during visits to obstetric clinics for first-trimester screening results, with follow-up of amniocentesis results for women deemed high-risk. The study aims to predict chromosomal abnormalities accurately from first-trimester screening parameters in order to reduce the stress associated with unnecessary amniocentesis. First, the required data are collected for the designated sample size and pre-processed, which involves handling missing values, removing anomalous records, and standardizing the data. In the first phase, the researcher conducts statistical analyses on all input characteristics to identify significant relationships with the response variable, namely the presence of trisomy 13, 18, or 21. In the second phase, a predictive model is constructed using machine learning techniques. Two model variants are developed: the first uses all features collected from the pregnant women, while the second uses filter-based feature selection to identify the most influential features for building the prediction model.
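
The pre-processing and filter-based selection steps described above might look like the following dependency-free sketch. The protocol does not name an imputation rule or a filter criterion, so median imputation and the absolute correlation with the binary outcome are assumptions made here for illustration; the feature names and toy values are invented and are not study data.

```python
import statistics

def impute_median(column):
    """Replace missing entries (None) with the median of the observed values."""
    observed = [v for v in column if v is not None]
    med = statistics.median(observed)
    return [med if v is None else v for v in column]

def standardize(column):
    """Z-score a column: subtract the mean and divide by the standard deviation."""
    mean = statistics.fmean(column)
    std = statistics.pstdev(column)
    return [(v - mean) / std if std > 0 else 0.0 for v in column]

def abs_correlation(x, y):
    """Absolute Pearson correlation (point-biserial when y is a 0/1 label)."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return abs(cov / (sx * sy)) if sx > 0 and sy > 0 else 0.0

def rank_features(features, labels):
    """Filter-style ranking: each feature is scored without training a model."""
    scores = {}
    for name, column in features.items():
        clean = standardize(impute_median(column))
        scores[name] = abs_correlation(clean, labels)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Invented toy values for illustration only.
toy_features = {
    "maternal_age": [29, 41, 35, None, 38, 27, 43, 31],
    "nt":           [1.8, 3.9, 2.1, 2.0, 3.6, 1.7, 4.2, 1.9],
    "papp_a":       [1.1, 0.4, 0.9, 1.0, None, 1.2, 0.3, 1.0],
}
toy_labels = [0, 1, 0, 0, 1, 0, 1, 0]  # 1 = abnormal karyotype (hypothetical)

ranking = rank_features(toy_features, toy_labels)
for name, score in ranking:
    print(f"{name}: |r| = {score:.2f}")
```

In a cross-validated workflow, the imputation, scaling, and ranking would normally be fitted on the training folds only, so that information from held-out cases does not leak into feature selection.
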
Model development comprises training and evaluation stages, with K-fold cross-validation used to estimate model performance. The model's predictions are compared with the actual outcomes recorded in the dataset to compute the model error, which training aims to minimize. The study will focus on constructing a model based on artificial neural networks (ANN) and on optimizing the network's structure and parameters. Because structure and hyperparameters strongly affect network performance and prediction accuracy, an optimization approach such as Particle Swarm Optimization (PSO) will be used to find suitable hyperparameter values and network structure. This optimization is expected to improve the accuracy of the model in identifying chromosomal abnormalities after the initial screening. Additionally, other machine learning techniques, such as decision trees, will be used for comparative analysis of the results.
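
To make the PSO step concrete, the sketch below runs a basic particle swarm over two hypothetical hyperparameters (number of hidden units and log10 of the learning rate). The objective is a toy stand-in: in the study it would be the cross-validated error of a network trained with those settings. The swarm size, coefficients, and search bounds are illustrative assumptions, not values from the protocol.

```python
import random

def toy_validation_error(position):
    """Stand-in for the cross-validated error of a trained network."""
    hidden_units, log10_lr = position
    return (hidden_units - 16.0) ** 2 / 100.0 + (log10_lr + 2.0) ** 2

def pso(objective, bounds, n_particles=15, n_iters=60,
        inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize `objective` inside box `bounds` with a basic particle swarm."""
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]              # each particle's best position
    best_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: best_val[i])
    gbest_pos, gbest_val = best_pos[g][:], best_val[g]  # swarm-wide best

    for _ in range(n_iters):
        for i in range(n_particles):
            for k in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity blends momentum, pull to own best, pull to swarm best.
                vel[i][k] = (inertia * vel[i][k]
                             + c1 * r1 * (best_pos[i][k] - pos[i][k])
                             + c2 * r2 * (gbest_pos[k] - pos[i][k]))
                lo, hi = bounds[k]
                pos[i][k] = min(max(pos[i][k] + vel[i][k], lo), hi)
            val = objective(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest_pos, gbest_val = pos[i][:], val
    return gbest_pos, gbest_val

# Hypothetical search space: hidden units in [2, 64], log10(lr) in [-4, -1].
bounds = [(2.0, 64.0), (-4.0, -1.0)]
best, err = pso(toy_validation_error, bounds)
print(f"hidden units ~ {best[0]:.1f}, learning rate ~ {10 ** best[1]:.4f}, error = {err:.4f}")
```

Because every objective evaluation would require training and cross-validating a network, the number of particles and iterations directly sets the computational cost; integer-valued settings such as the number of hidden units would also need rounding before use.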

Sample size and sampling method

In machine learning, the sample size is typically not fixed in advance; in general, the more data available, the better the model can be trained and evaluated. Using an assumed proportion of p = 0.07, a confidence level of 99% (i.e., z = 2.58), and a precision of d = 0.05, the formula below gives a minimum of 173 individuals. Allowing for a ten percent dropout rate, the final sample size was set at 190 individuals. While this calculation is customary in statistical methods, for machine learning models a larger dataset of at least 350 individuals is considered necessary for more precise model design and comprehensive evaluation.

$$\begin{aligned} z&=2.58,\quad p=0.07,\quad d=0.05\\ N&=\frac{z^2\times p\times\left(1-p\right)}{d^2} \end{aligned}$$
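
As a quick arithmetic check, the formula can be evaluated with the stated values. The function and variable names below are illustrative; the inputs are the ones given in the text.

```python
import math

def proportion_sample_size(z: float, p: float, d: float) -> float:
    """Return N = z^2 * p * (1 - p) / d^2 without rounding."""
    return (z ** 2) * p * (1 - p) / (d ** 2)

z, p, d = 2.58, 0.07, 0.05                 # values stated in the protocol
n_raw = proportion_sample_size(z, p, d)    # unrounded result of the formula
n_reported = math.floor(n_raw)             # the protocol reports 173
n_with_dropout = round(n_reported * 1.10)  # ten percent dropout allowance

print(f"formula result: {n_raw:.2f}")
print(f"reported minimum: {n_reported}")
print(f"with 10% dropout: {n_with_dropout}")
```

The unrounded result is slightly above 173, so rounding up rather than down would give 174; the difference is immaterial here because the planned sample of 350 is well above either figure.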

Inclusion criteria

First-trimester screening and NT ultrasound between 11-14 weeks of pregnancy, along with amniocentesis.

Exclusion criteria

Mother's unwillingness to participate, twin or multiple pregnancy, and failure to undergo amniocentesis despite a high-risk screening result.

Study implementation platform and data collection locations

Midwifery clinics in Mashhad hospitals served as the research setting.

Recruitment approach

Researchers recruited participants at the selected centers by convenience sampling and collected the necessary data after obtaining participants' consent.

Data analysis

To ensure a reliable and standardized assessment of the prediction model, we employ K-fold cross-validation. This approach gauges the model's ability to generalize to new data by partitioning the dataset into k subsets (folds). Each fold is used once for evaluation while the model is trained on the remaining k - 1 folds, so that performance is assessed across different partitions of the data rather than on a single split. Accuracy, precision, sensitivity, and specificity are then reported to assess the model's predictive capacity.
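
The evaluation procedure can be sketched without any external library as follows. The fold construction, the seed, and the toy labels are illustrative assumptions; the four metrics are computed from the confusion-matrix counts in the usual way.

```python
import random

def k_fold_indices(n_samples, k, seed=0):
    """Shuffle the indices once and yield (train, test) index lists per fold."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k nearly equal-sized folds
    for i in range(k):
        test = folds[i]
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, test

def classification_metrics(y_true, y_pred):
    """Accuracy, precision, sensitivity, and specificity for 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    def ratio(num, den):
        return num / den if den else float("nan")

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "precision": ratio(tp, tp + fp),      # positive predictive value
        "sensitivity": ratio(tp, tp + fn),    # true positive rate
        "specificity": ratio(tn, tn + fp),    # true negative rate
    }

# Every sample appears in exactly one test fold.
splits = list(k_fold_indices(n_samples=10, k=5))

# Hypothetical labels and predictions (1 = abnormal karyotype).
y_true = [1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 1, 1, 0, 0, 0, 0, 0, 0]
metrics = classification_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Since only a minority of high-risk cases are expected to have a confirmed abnormality, some folds may contain very few positive cases; stratifying the folds by outcome is one way to keep sensitivity estimates stable, although the protocol does not specify this.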

Discussion

Aneuploidy screening tests are divided into three categories: first-trimester screening, second-trimester screening, and combined first- and second-trimester screening. First-trimester screening involves measuring free β-human chorionic gonadotropin (free β-hCG) and pregnancy-associated plasma protein A (PAPP-A) and measuring nuchal translucency by ultrasound between the 11th and 14th weeks of pregnancy [7,8,9]. After first-trimester screening, high-risk individuals are recommended to undergo amniocentesis or chorionic villus sampling. However, these invasive procedures are time-consuming and expensive. Studies report that complications of amniocentesis include fetal death, bleeding, amniotic fluid leakage, premature rupture of membranes, amnionitis, and spontaneous abortion [7, 21, 22]. Research also suggests that amniocentesis can increase stress and anxiety among mothers [14]. According to a study by Hasanzadeh et al., only 10% of high-risk cases identified through first-trimester screening were confirmed as aneuploidy by amniocentesis [1]. Additionally, a study by Delkhosh et al. found that 5.2% of cases suspected of trisomy 21 on first- and second-trimester screening were found to have aneuploidy on amniocentesis [13]. Midwives play a crucial role in advising and caring for mothers during pregnancy, delivery, and the postpartum period. They provide emotional support to reduce mothers' anxiety and stress, helping to safeguard the health of both mother and fetus and to make pregnancy safe. Artificial intelligence methods, such as machine learning and deep learning, can play a significant role in nursing and midwifery by improving decision-making, patient care, service delivery, and research. It is imperative that nurses and midwives actively engage in the development and implementation of AI-based decision support systems. Machine learning aims to create systems that can learn and make decisions without being explicitly programmed, and it has the potential to predict chromosomal abnormalities accurately, thereby informing decisions about procedures for pregnant mothers.

Availability of data and materials

No datasets were generated or analysed during the current study.

References

1. Hasanzadeh R, Naghizadeh S, Azari S, Ebrahimpour Mirza Rezaei M. Diagnosis of Aneuploidies by amniocentesis in high risk cases of first trimester screening test. Iranian J Obstetr Gynecol Infertil. 2014;17(119):18–26.
2. Turnpenny PD, Ellard S, Cleaver R. Emery's Elements of Medical Genetics E-Book: Elsevier Health Sciences; 2020.
3. Cunningham FG, Leveno KJ, Dashe JS, Hoffman BL, Spong CY, Casey BM. Williams Obstetrics 26e: McGraw Hill LLC; 2022.
4. Brock JK, Walsh JD, Allen VM. The Effect of Fetal Trisomy 21 on Adverse Perinatal Obstetrical Outcomes in Nova Scotia, 2000–2019. J Obstet Gynaecol Can. 2021;43(5):583–8.
5. Channell MM, Hahn LJ, Rosser TC, Hamilton D, Frank-Crawford MA, Capone GT, et al. Characteristics Associated with Autism Spectrum Disorder Risk in Individuals with Down Syndrome. J Autism Dev Disord. 2019;49(9):3543–56.
6. Nguyen TL, Duchon A, Manousopoulou A, Loaëc N, Villiers B, Pani G, et al. Correction of cognitive deficits in mouse models of Down syndrome by a pharmacological inhibitor of DYRK1A. Dis Model Mech. 2018;11(9):dmm035634.
7. Akolekar R, Beta J, Picciarelli G, Ogilvie C, D’Antonio F. Procedure-related risk of miscarriage following amniocentesis and chorionic villus sampling: a systematic review and meta-analysis. Ultrasound Obstet Gynecol. 2015;45(1):16–26.
8. Bakker M, Birnie E, de Robles Medina P, Sollie KM, Pajkrt E, Bilardo CM. Total pregnancy loss after chorionic villus sampling and amniocentesis: a cohort study. Ultrasound Obstet Gynecol. 2017;49(5):599–606.
9. Ghasemi G, Tara F, Mirteimouri M, Nikdoust S, Deldar K, Alerasool. The outcomes of pregnancy and delivery in the infants with positive karyotype test results for Down syndrome manifestation. Iran J Obstet Gynecol Infertil. 2022;25(1).
10. Ali A, Abdelhaleem Z, editors. Impact of Structured Prenatal Counseling on Anxiety Level among Women Undergoing Amniocentesis. 2014.
11. Çaliskan EÖS, Çakiroglu Y, Yalçinkaya Ö, Polat A, Çorakçi A. The effects of maternal anxiety prior to amniocentesis on uterine and fetal umbilical blood flow. Turk Ger Gynecol Assoc. 2009;10(3):162–7.
12. Mojahed S, Reyhanizadeh F, Tabatabaei RS, Dehghani A. Evaluation of the effect of education on perceived stress of mother candidates for amniocentesis. J Educ Health Promot. 2021;10:267.
13. Delkhosh F, Navinezhad M, Sharifzadeh M, Naghibinasab MS, Eftekhar Yazdi M. Prevalence of aneuploidy in pregnant women with high risk fetal screening in Sabzevar perinatal clinic, 2016–2019. Iranian J Obstetr Gynecol Infertil. 2022;25(10):31–8.
14. Andaroon N, Kordi M, Kimiaee SA, Esmaily H. Effect of Individual Counseling Program by a Midwife on Anxiety during Pregnancy in Nulliparous Women. Iranian J Obstetr Gynecol Infertil. 2018;20(12):86–95.
15. Bajwa J, Munir U, Nori A, Williams B. Artificial intelligence in healthcare: transforming the practice of medicine. Future Healthc J. 2021;8(2):e188–94.
16. O’Connor S, Yan Y, Thilo FJS, Felzmann H, Dowding D, Lee JJ. Artificial intelligence in nursing and midwifery: A systematic review. J Clin Nurs. 2023;32(13–14):2951–68.
17. Zhang H, Mo J, Jiang H, Li Z, Hu W, Zhang C, et al. Deep Learning Model for the Automated Detection and Histopathological Prediction of Meningioma. Neuroinformatics. 2021;19(3):393–402.
18. Mahesh B. Machine Learning Algorithms -A Review. 2019.
19. Zhou Z-H. Machine Learning. 2021.
20. Leng J, Sharrock WW. Handbook of Research on Computational Science and Engineering: Theory and Practice: Engineering Science Reference; 2012.
21. Fatemeh Tara ML. Somayeh Evaluation of Early and Late Complications of Amniocentesis in Ommolbanin Hospital in Mashhad, Iran during 2014-2016. Iran J Obstet Gynecol Infertil. 2018;21.
22. Shirgholami NA, Mazaheri F, Tabatabaee M. Amniocentesis Complications in Yazd Baghaeipour Polyclinic: A Cross-Sectional Study. World J Peri Neonatol. 2020;3(1).

Acknowledgments

We thank the volunteer participants for sharing their experiences and giving their time and help to make this study possible.

Funding

This study is funded by Mashhad University of Medical Sciences.

Author information

Authors and Affiliations

Department of Midwifery, Research Student Committee, Mashhad University of Medical Sciences, Mashhad, Iran
Mahla Shaban
Nursing and Midwifery Care Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
Sanaz Mollazadeh & Fatemeh Erfanian Arghavanian
Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
Saeid Eslami
Department of Obstetrics and Gynecology, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
Fatemeh Tara
Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
Samaneh Sharif

Contributions

M. Sh., S. M., S. E., S. Sh., F. T., and F. E. A. contributed to the design of the protocol. M. Sh., S. E., S. Sh., and F. E. A. contributed to the implementation and analysis plan. M. Sh., S. M., S. E., S. Sh., and F. E. A. wrote the first draft of this protocol article; all authors critically read the text, contributed inputs and revisions, and read and approved the final manuscript.

Corresponding author

Correspondence to Fatemeh Erfanian Arghavanian.

Ethics declarations

Ethics approval and consent to participate

Written informed consent will be obtained from each participant. This protocol has been approved by the Ethics Committee of the Mashhad University of Medical Sciences, Mashhad, Iran (code number: IR.MUMS.REC.1402.134).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Shaban, M., Mollazadeh, S., Eslami, S. et al. Prediction of chromosomal abnormalities in the screening of the first trimester of pregnancy using machine learning methods: a study protocol. Reprod Health 21, 101 (2024). https://doi.org/10.1186/s12978-024-01839-5

Received: 11 June 2024
Accepted: 26 June 2024
Published: 03 July 2024
DOI: https://doi.org/10.1186/s12978-024-01839-5
