EHA Library - The official digital education library of European Hematology Association (EHA)

INTEGRATION OF A MACHINE LEARNING-BASED MODEL INTO ROUTINE CYTOMORPHOLOGICAL DIAGNOSTICS: FIRST EXPERIENCES FROM THE BELUGA TRIAL
Author(s): ,
Christian Pohlkamp
Affiliations:
MLL Munich Leukemia Laboratory,Munich,Germany
,
Niroshan Nadarajah
Affiliations:
MLL Munich Leukemia Laboratory,Munich,Germany
,
Adam Wahida
Affiliations:
Torsten Haferlach Leukämiediagnostik-Stiftung,Munich,Germany
,
Inseok Heo
Affiliations:
Amazon Web Services, Envision Engineering,Herndon,United States
,
William Wetton
Affiliations:
Amazon Web Services, Envision Engineering,Herndon,United States
,
Rudolf Drescher
Affiliations:
MetaSystems,Altlussheim,Germany
,
Siegfried Hänselmann
Affiliations:
MetaSystems,Altlussheim,Germany
,
Thomas Lörch
Affiliations:
MetaSystems,Altlussheim,Germany
,
Claudia Haferlach
Affiliations:
MLL Munich Leukemia Laboratory,Munich,Germany
,
Wolfgang Kern
Affiliations:
MLL Munich Leukemia Laboratory,Munich,Germany
Torsten Haferlach
Affiliations:
MLL Munich Leukemia Laboratory,Munich,Germany
EHA Library. Pohlkamp C. 06/09/21; 324718; S310
Christian Pohlkamp
Christian Pohlkamp
Contributions
Abstract
Presentation during EHA2021: All Oral presentations will be made available as of Friday, June 11, 2021 (09:00 CEST) and will be accessible for on-demand viewing until August 15, 2021 on the Virtual Congress platform.

Abstract: S310

Type: Oral Presentation

Session title: Patient's voice and health outcomes in hematology

Background
Cytomorphology is essential to quickly assess hematological neoplasms. Experienced personnel and short TAT of <24 hours are mandatory. Machine learning (ML) has proven its power in image classification. Following our pilot study (ASH 2020, Blood 136, # 195) we report on our first experiences with an ML-based blood cell differentiation tool in daily routine.

Aims
To evaluate feasibility in a prospective blinded trial in parallel to daily routine workflow, we registered the BELUGA study (ClinicalTrials.gov Identifier: NCT04466059) which includes a head-to-head-comparison between human investigators and the ML-based classifier regarding accuracy over all malignant/non-malignant cell types in peripheral blood.

Methods
We established an automated routine workflow including the capture of 500 digital cell images per blood smear in 40x oil magnification after focusing on regions of interest (Metafer Scanning System, MetaSystems, Altlussheim; Zeiss Axio Imager.Z2 microscope). Cells are classified by the ML-based deep neural network (DNN). Resulting data are compared to the independent microscopic cell differentiation done by hem specialists. With regard to future use in routine, misclassifications can be corrected by humans. Corrected images can be manually flagged to join a dataset for continuous retraining of the DNN.

Results
Since Jan 1, 2021 the BELUGA trial included 1,586 blood samples. The most common malignant diagnoses were: CLL, MPN, mature B-cell neoplasm, MDS, MBL, CML, AML and CMML. In all cases human investigators reach out to count 100-200 cells according to WHO guidelines. The addressed cell classes were PMN, lymphocytes, monocytes, eosinophils, basophils, bands, metamyelocytes, myelocytes, promyelocytes, myeloblasts, atypical/bilobulated promyelocytes in APL/APLv, hairy cells, atypical lymphocytes (potentially neoplastic), plasma cells, normoblasts, smudge cells, plus one garbage class for damaged  cells. 155,159 cells were differentiated by humans (12 individuals, median experience in the field: 8 years), 769,347 cells (median 500 per sample) by the DNN model as described (ASH 2020, Blood 136, # 195).  While human analysis took 5-10 minutes per sample, the DNN processes ~500 cells/images within 15 seconds after automated scanning time of 7 min. Probability of prediction was >0.95 for 46% of images, >0.90 was achieved for 55%.  During the pilot study, we noticed misclassifications in cells representing neighbouring stages of maturation or in reactive vs. neoplastic lymphocytes. For refining the DNN, we reduced the number of images used for training, keeping only 10% (8,425 cells) of the original dataset. Our new version (V.2) tested now in the BELUGA study achieves an overall accuracy of 96.6% (vs. 89.6% in V.1) on this separate prospective test set. First evidence of clinical relevance could be seen in two cases with low count APL. The DNN (V.1) was able to detect atypical promyelocytes at a probability of ≥0.98.

Conclusion
We successfully integrated a fully automated process (cell scanning and classification by DNN) parallel to our diagnostic routine workflow without critical delay in TAT. The DNN outperformed human experts (investigation time, number of cells investigated) in a real-life setting and seems to also provide a higher sensitivity and specificity for critical cell types, thus being a helpful support for cytomorphologists. Such infrastructure being cloud-based and allowing access for uploading/classifying digital cell images could serve experts worldwide 24/7.

Keyword(s): Diagnosis, Leukemia, Phenotype

Presentation during EHA2021: All Oral presentations will be made available as of Friday, June 11, 2021 (09:00 CEST) and will be accessible for on-demand viewing until August 15, 2021 on the Virtual Congress platform.

Abstract: S310

Type: Oral Presentation

Session title: Patient's voice and health outcomes in hematology

Background
Cytomorphology is essential to quickly assess hematological neoplasms. Experienced personnel and short TAT of <24 hours are mandatory. Machine learning (ML) has proven its power in image classification. Following our pilot study (ASH 2020, Blood 136, # 195) we report on our first experiences with an ML-based blood cell differentiation tool in daily routine.

Aims
To evaluate feasibility in a prospective blinded trial in parallel to daily routine workflow, we registered the BELUGA study (ClinicalTrials.gov Identifier: NCT04466059) which includes a head-to-head-comparison between human investigators and the ML-based classifier regarding accuracy over all malignant/non-malignant cell types in peripheral blood.

Methods
We established an automated routine workflow including the capture of 500 digital cell images per blood smear in 40x oil magnification after focusing on regions of interest (Metafer Scanning System, MetaSystems, Altlussheim; Zeiss Axio Imager.Z2 microscope). Cells are classified by the ML-based deep neural network (DNN). Resulting data are compared to the independent microscopic cell differentiation done by hem specialists. With regard to future use in routine, misclassifications can be corrected by humans. Corrected images can be manually flagged to join a dataset for continuous retraining of the DNN.

Results
Since Jan 1, 2021 the BELUGA trial included 1,586 blood samples. The most common malignant diagnoses were: CLL, MPN, mature B-cell neoplasm, MDS, MBL, CML, AML and CMML. In all cases human investigators reach out to count 100-200 cells according to WHO guidelines. The addressed cell classes were PMN, lymphocytes, monocytes, eosinophils, basophils, bands, metamyelocytes, myelocytes, promyelocytes, myeloblasts, atypical/bilobulated promyelocytes in APL/APLv, hairy cells, atypical lymphocytes (potentially neoplastic), plasma cells, normoblasts, smudge cells, plus one garbage class for damaged  cells. 155,159 cells were differentiated by humans (12 individuals, median experience in the field: 8 years), 769,347 cells (median 500 per sample) by the DNN model as described (ASH 2020, Blood 136, # 195).  While human analysis took 5-10 minutes per sample, the DNN processes ~500 cells/images within 15 seconds after automated scanning time of 7 min. Probability of prediction was >0.95 for 46% of images, >0.90 was achieved for 55%.  During the pilot study, we noticed misclassifications in cells representing neighbouring stages of maturation or in reactive vs. neoplastic lymphocytes. For refining the DNN, we reduced the number of images used for training, keeping only 10% (8,425 cells) of the original dataset. Our new version (V.2) tested now in the BELUGA study achieves an overall accuracy of 96.6% (vs. 89.6% in V.1) on this separate prospective test set. First evidence of clinical relevance could be seen in two cases with low count APL. The DNN (V.1) was able to detect atypical promyelocytes at a probability of ≥0.98.

Conclusion
We successfully integrated a fully automated process (cell scanning and classification by DNN) parallel to our diagnostic routine workflow without critical delay in TAT. The DNN outperformed human experts (investigation time, number of cells investigated) in a real-life setting and seems to also provide a higher sensitivity and specificity for critical cell types, thus being a helpful support for cytomorphologists. Such infrastructure being cloud-based and allowing access for uploading/classifying digital cell images could serve experts worldwide 24/7.

Keyword(s): Diagnosis, Leukemia, Phenotype

By clicking “Accept Terms & all Cookies” or by continuing to browse, you agree to the storing of third-party cookies on your device to enhance your user experience and agree to the user terms and conditions of this learning management system (LMS).

Cookie Settings
Accept Terms & all Cookies