A recent large-scale retrospective study conducted by researchers at the Fleury Group in Brazil has shed light on the potential of using complete blood count (CBC) as a risk stratification tool for breast cancer using machine learning. The study, approved by the Fleury Group’s Research Ethics Committee and conducted in compliance with Brazilian legislation and data protection laws, analyzed CBC test results from nearly 400,000 women aged 40-70 who were screened for breast cancer between 2004 and 2022.
Here are some key findings from the study:
– Researchers collected CBC test results from 396,848 women across eight Brazilian states.
– The case group included women diagnosed with breast cancer or highly suspected of having it, while the control group consisted of women with negative imaging results.
– Various histologic subtypes of breast cancer were identified, with ductal carcinoma being the most common among invasive cases.
– The study utilized two machine learning models, ridge regression, and LightGBM, to analyze CBC biomarkers and derived ratios for predicting breast cancer risk.
– The models were trained and evaluated using the Area Under the Curve (AUC) metric, with feature selection and hyperparameter tuning conducted to optimize performance.
The study also employed the SHapley Additive exPlanations (SHAP) approach for explainability analysis and categorized the population into four risk groups based on the models’ probability outputs.
Overall, this groundbreaking study provides valuable insights into the potential of utilizing CBC as a risk stratification tool for breast cancer, highlighting the importance of leveraging machine learning algorithms for predictive modeling in healthcare.
Source: [Scientific Reports](insert link)
Frequently Asked Questions (FAQs) Related to the Above News
What was the purpose of the study conducted by researchers at the Fleury Group in Brazil?
The purpose of the study was to investigate the potential of using complete blood count (CBC) as a risk stratification tool for breast cancer using machine learning.
How many women were included in the study?
The study included CBC test results from 396,848 women aged 40-70 who were screened for breast cancer between 2004 and 2022.
What were the key findings of the study?
The study identified various histologic subtypes of breast cancer, utilized machine learning models to analyze CBC biomarkers for predicting breast cancer risk, and categorized the population into four risk groups based on probability outputs.
What machine learning models were used in the study?
The study utilized ridge regression and LightGBM machine learning models to analyze CBC biomarkers and derived ratios for predicting breast cancer risk.
How were the models trained and evaluated in the study?
The models were trained and evaluated using the Area Under the Curve (AUC) metric, with feature selection and hyperparameter tuning conducted to optimize performance.
What approach was used for explainability analysis in the study?
The study employed the SHapley Additive exPlanations (SHAP) approach for explainability analysis.
What are the implications of this study for breast cancer detection?
This study provides valuable insights into the potential of utilizing CBC as a risk stratification tool for breast cancer, highlighting the importance of leveraging machine learning algorithms for predictive modeling in healthcare.
Please note that the FAQs provided on this page are based on the news article published. While we strive to provide accurate and up-to-date information, it is always recommended to consult relevant authorities or professionals before making any decisions or taking action based on the FAQs or the news article.