Bettina Baeßler

@usz.ch

University Hospital Zurich
Institute of Diagnostic and Interventional Radiology



                    

https://researchid.co/bettina.baessler
109

Scopus Publications

6731

Scholar Citations

38

Scholar h-index

76

Scholar i10-index

Scopus Publications

  • Joint image reconstruction and segmentation of real-time cardiovascular magnetic resonance imaging in free-breathing using a model based on disentangled representation learning
    Tobias Wech, Oliver Schad, Simon Sauer, Jonas Kleineisel, Nils Petri, Peter Nordbeck, Thorsten A. Bley, Bettina Baeßler, Bernhard Petritsch, and Julius F. Heidenreich

    Elsevier BV

  • Repeatability of radiomic features in myocardial T1 and T2 mapping
    Mathias Manzke, Fabian C. Laqua, Benjamin Böttcher, Ann-Christin Klemenz, Marc-André Weber, Bettina Baeßler, and Felix G. Meinel

    Springer Science and Business Media LLC
    Abstract Purpose To investigate the test–retest repeatability of radiomic features in myocardial native T1 and T2 mapping. Methods In this prospective study, 50 healthy volunteers (29 women and 21 men, mean age 39.4 ± 13.7 years) underwent two identical cardiac magnetic resonance imaging (MRI) examinations at 1.5 T. The protocol included native T1 and T2 mapping in both short-axis and long-axis orientation. For T1 mapping, we investigated standard (1.9 × 1.9 mm) and high (1.4 × 1.4 mm) spatial resolution. After manual segmentation of the left ventricular myocardium, 100 radiomic features from seven feature classes were extracted and analyzed. Test–retest repeatability of radiomic features was assessed using the intraclass correlation coefficient (ICC) and classified as poor (ICC < 0.50), moderate (0.50–0.75), good (0.75–0.90), and excellent (> 0.90). Results For T1 maps acquired in short-axis orientation at standard resolution, repeatability was excellent for 6 features, good for 29 features, moderate for 19 features, and poor for 46 features. We identified 15 features from 6 classes which showed good to excellent reproducibility for T1 mapping in all resolutions and all orientations. For short-axis T2 maps, repeatability was excellent for 6 features, good for 25 features, moderate for 23 features, and poor for 46 features. 12 features from 5 classes were found to have good to excellent repeatability in T2 mapping independent of slice orientation. Conclusion We have identified a subset of features with good to excellent repeatability independent of slice orientation and spatial resolution. We recommend using these features for further radiomics research in myocardial T1 and T2 mapping. Key Points Question The study addresses the need for reliable radiomic features for quantitative analysis of the myocardium to ensure diagnostic consistency in cardiac MRI. Findings We have identified a subset of radiomic features demonstrating good to excellent repeatability in native T1 and T2 mapping independent of slice orientation and resolution. Clinical relevance Radiomic features have been proposed as diagnostic and prognostic biomarkers in various heart diseases. By identifying a subset of particularly reproducible radiomic features our study serves to inform the selection of radiomic features in future research and clinical applications. Graphical Abstract

  • Color-map recommendation for MR relaxometry maps
    Miha Fuderer, Barbara Wichtmann, Fabio Crameri, Nandita M. de Souza, Bettina Baeßler, Vikas Gulani, Meiyun Wang, Dirk Poot, Ruud de Boer, Matt Cashmore,et al.

    Wiley
    AbstractPurposeTo harmonize the use of color for MR relaxometry maps and therefore recommend the use of specific color‐maps for representing , , and maps and their inverses.MethodsPerceptually linearized color‐maps were chosen to have similar color settings as those proposed by Griswold et al. in 2018. A Delphi process, polling the opinion of a panel of 81 experts, was used to generate consensus on the suitability of these maps.ResultsConsensus was reached on the suitability of the logarithm‐processed Lipari color‐map for and the logarithm‐processed Navia color‐map for and . There was consensus on color bars being mandatory and on the use of a specific value indicating “invalidity.” There was no consensus on whether the ranges should be fixed per anatomy.ConclusionThe authors recommend the use of the logarithm‐processed Lipari color‐map for displaying quantitative maps and maps; likewise, the authors recommend the logarithm‐processed Navia color‐map for displaying , , , and maps.This work originated with the Quantitative MR Study Group of the International Society of Magnetic Resonance in Medicine (ISMRM); it has the approval of the Publication Committee and of the Board of the ISMRM.

  • Radiomics workflow definition & challenges - German priority program 2177 consensus statement on clinically applied radiomics
    Ralf Floca, Jonas Bohn, Christian Haux, Benedikt Wiestler, Frank G. Zöllner, Annika Reinke, Jakob Weiß, Marco Nolden, Steffen Albert, Thorsten Persigehl,et al.

    Springer Science and Business Media LLC
    Abstract Objectives Achieving a consensus on a definition for different aspects of radiomics workflows to support their translation into clinical usage. Furthermore, to assess the perspective of experts on important challenges for a successful clinical workflow implementation. Materials and methods The consensus was achieved by a multi-stage process. Stage 1 comprised a definition screening, a retrospective analysis with semantic mapping of terms found in 22 workflow definitions, and the compilation of an initial baseline definition. Stages 2 and 3 consisted of a Delphi process with over 45 experts hailing from sites participating in the German Research Foundation (DFG) Priority Program 2177. Stage 2 aimed to achieve a broad consensus for a definition proposal, while stage 3 identified the importance of translational challenges. Results Workflow definitions from 22 publications (published 2012–2020) were analyzed. Sixty-nine definition terms were extracted, mapped, and semantic ambiguities (e.g., homonymous and synonymous terms) were identified and resolved. The consensus definition was developed via a Delphi process. The final definition comprising seven phases and 37 aspects reached a high overall consensus (> 89% of experts “agree” or “strongly agree”). Two aspects reached no strong consensus. In addition, the Delphi process identified and characterized from the participating experts’ perspective the ten most important challenges in radiomics workflows. Conclusion To overcome semantic inconsistencies between existing definitions and offer a well-defined, broad, referenceable terminology, a consensus workflow definition for radiomics-based setups and a terms mapping to existing literature was compiled. Moreover, the most relevant challenges towards clinical application were characterized. Critical relevance statement Lack of standardization represents one major obstacle to successful clinical translation of radiomics. Here, we report a consensus workflow definition on different aspects of radiomics studies and highlight important challenges to advance the clinical adoption of radiomics. Key Points Published radiomics workflow terminologies are inconsistent, hindering standardization and translation. A consensus radiomics workflow definition proposal with high agreement was developed. Publicly available result resources for further exploitation by the scientific community. Graphical Abstract

  • Deep Learning k-Space-to-Image Reconstruction Facilitates High Spatial Resolution and Scan Time Reduction in Diffusion-Weighted Imaging Breast MRI
    Stephanie Tina Sauer, Sara Aniki Christner, Anna‐Maria Lois, Piotr Woznicki, Carolin Curtaz, Andreas Steven Kunz, Elisabeth Weiland, Thomas Benkert, Thorsten Alexander Bley, Bettina Baeßler,et al.

    Wiley
    BackgroundFor time‐consuming diffusion‐weighted imaging (DWI) of the breast, deep learning‐based imaging acceleration appears particularly promising.PurposeTo investigate a combined k‐space‐to‐image reconstruction approach for scan time reduction and improved spatial resolution in breast DWI.Study TypeRetrospective.Population133 women (age 49.7 ± 12.1 years) underwent multiparametric breast MRI.Field Strength/Sequence3.0T/T2 turbo spin echo, T1 3D gradient echo, DWI (800 and 1600 sec/mm2).AssessmentDWI data were retrospectively processed using deep learning‐based k‐space‐to‐image reconstruction (DL‐DWI) and an additional super‐resolution algorithm (SRDL‐DWI). In addition to signal‐to‐noise ratio and apparent diffusion coefficient (ADC) comparisons among standard, DL‐ and SRDL‐DWI, a range of quantitative similarity (e.g., structural similarity index [SSIM]) and error metrics (e.g., normalized root mean square error [NRMSE], symmetric mean absolute percent error [SMAPE], log accuracy error [LOGAC]) was calculated to analyze structural variations. Subjective image evaluation was performed independently by three radiologists on a seven‐point rating scale.Statistical TestsFriedman's rank‐based analysis of variance with Bonferroni‐corrected pairwise post‐hoc tests. P < 0.05 was considered significant.ResultsBoth DL‐ and SRDL‐DWI allowed for a 39% reduction in simulated scan time over standard DWI (5 vs. 3 minutes). The highest image quality ratings were assigned to SRDL‐DWI with good interreader agreement (ICC 0.834; 95% confidence interval 0.818–0.848). Irrespective of b‐value, both standard and DL‐DWI produced superior SNR compared to SRDL‐DWI. ADC values were slightly higher in SRDL‐DWI (+0.5%) and DL‐DWI (+3.4%) than in standard DWI. Structural similarity was excellent between DL‐/SRDL‐DWI and standard DWI for either b value (SSIM ≥ 0.86). Calculation of error metrics (NRMSE ≤ 0.05, SMAPE ≤ 0.02, and LOGAC ≤ 0.04) supported the assumption of low voxel‐wise error.Data ConclusionDeep learning‐based k‐space‐to‐image reconstruction reduces simulated scan time of breast DWI by 39% without influencing structural similarity. Additionally, super‐resolution interpolation allows for substantial improvement of subjective image quality.Evidence Level4Technical EfficacyStage 1

  • Cooperative AI training for cardiothoracic segmentation in computed tomography: An iterative multi-center annotation approach
    Bianca Lassen-Schmidt, Bettina Baessler, Matthias Gutberlet, Josephine Berger, Jan M. Brendel, Andreas M. Bucher, Tilman Emrich, Philipp Fervers, Jonathan Kottlors, Philipp Kuhl,et al.

    Elsevier BV

  • Perfect Match: Radiomics and Artificial Intelligence in Cardiac Imaging
    Bettina Baeßler, Sandy Engelhardt, Amar Hekalo, Anja Hennemuth, Markus Hüllebrand, Ann Laube, Clemens Scherer, Malte Tölle, and Tobias Wech

    Ovid Technologies (Wolters Kluwer Health)
    Cardiovascular diseases remain a significant health burden, with imaging modalities like echocardiography, cardiac computed tomography, and cardiac magnetic resonance imaging playing a crucial role in diagnosis and prognosis. However, the inherent heterogeneity of these diseases poses challenges, necessitating advanced analytical methods like radiomics and artificial intelligence. Radiomics extracts quantitative features from medical images, capturing intricate patterns and subtle variations that may elude visual inspection. Artificial intelligence techniques, including deep learning, can analyze these features to generate knowledge, define novel imaging biomarkers, and support diagnostic decision-making and outcome prediction. Radiomics and artificial intelligence thus hold promise for significantly enhancing diagnostic and prognostic capabilities in cardiac imaging, paving the way for more personalized and effective patient care. This review explores the synergies between radiomics and artificial intelligence in cardiac imaging, following the radiomics workflow and introducing concepts from both domains. Potential clinical applications, challenges, and limitations are discussed, along with solutions to overcome them.

  • Robustness of radiomic features in healthy abdominal parenchyma of patients with repeated examinations on dual-layer dual-energy CT
    Mirjam Schöneck, Simon Lennartz, David Zopfs, Kristina Sonnabend, Robert Wawer Matos Reimer, Miriam Rinneburger, Josefine Graffe, Thorsten Persigehl, Clemens Hentschke, Bettina Baeßler,et al.

    Elsevier BV

  • The challenges of research data management in cardiovascular science: a DGK and DZHK position paper—executive summary
    Sabine Steffens, Katrin Schröder, Martina Krüger, Christoph Maack, Katrin Streckfuss-Bömeke, Johannes Backs, Rolf Backofen, Bettina Baeßler, Yvan Devaux, Ralf Gilsbach,et al.

    Springer Science and Business Media LLC
    AbstractThe sharing and documentation of cardiovascular research data are essential for efficient use and reuse of data, thereby aiding scientific transparency, accelerating the progress of cardiovascular research and healthcare, and contributing to the reproducibility of research results. However, challenges remain. This position paper, written on behalf of and approved by the German Cardiac Society and German Centre for Cardiovascular Research, summarizes our current understanding of the challenges in cardiovascular research data management (RDM). These challenges include lack of time, awareness, incentives, and funding for implementing effective RDM; lack of standardization in RDM processes; a need to better identify meaningful and actionable data among the increasing volume and complexity of data being acquired; and a lack of understanding of the legal aspects of data sharing. While several tools exist to increase the degree to which data are findable, accessible, interoperable, and reusable (FAIR), more work is needed to lower the threshold for effective RDM not just in cardiovascular research but in all biomedical research, with data sharing and reuse being factored in at every stage of the scientific process. A culture of open science with FAIR research data should be fostered through education and training of early-career and established research professionals. Ultimately, FAIR RDM requires permanent, long-term effort at all levels. If outcomes can be shown to be superior and to promote better (and better value) science, modern RDM will make a positive difference to cardiovascular science and practice. The full position paper is available in the supplementary materials.


  • Native myocardial T1 mapping: influence of spatial resolution on quantitative results and reproducibility
    Antonia Dalmer, Felix G. Meinel, Benjamin Böttcher, Mathias Manzke, Roberto Lorbeer, Marc-André Weber, Bettina Baeßler, and Ann-Christin Klemenz

    AME Publishing Company
    Background Myocardial mapping techniques can be used to quantitatively assess alterations in myocardial tissue properties. This study aims to evaluate the influence of spatial resolution on quantitative results and reproducibility of native myocardial T1 mapping in cardiac magnetic resonance imaging (MRI). Methods In this cross-sectional study with prospective data collection between October 2019 and February 2020, 50 healthy adults underwent two identical cardiac MRI examinations in the radiology department on the same day. T1 mapping was performed using a MOLLI 5(3)3 sequence with higher (1.4 mm × 1.4 mm) and lower (1.9 mm × 1.9 mm) in-plane spatial resolution. Global quantitative results of T1 mapping were compared between high-resolution and low-resolution acquisitions using paired t-test. Intra-class correlation coefficient (ICC) and Bland-Altman statistics (absolute and percentage differences as means ± SD) were used for assessing test-retest reproducibility. Results There was no significant difference between global quantitative results acquired with high vs. low-resolution T1 mapping. The reproducibility of global T1 values was good for high-resolution (ICC: 0.88) and excellent for low-resolution T1 mapping (ICC: 0.95, P=0.003). In subgroup analyses, inferior test-retest reproducibility was observed for high spatial resolution in women compared to low spatial resolution (ICC: 0.71 vs. 0.91, P=0.001) and heart rates >77 bpm (ICC: 0.53 vs. 0.88, P=0.004). Apical segments had higher T1 values and variability compared to other segments. Regional T1 values for basal (ICC: 0.81 vs. 0.89, P=0.023) and apical slices (ICC: 0.86 vs. 0.92, P=0.024) showed significantly higher reproducibility in low-resolution compared to high-resolution acquisitions but without differences for midventricular slice (ICC: 0.91 vs. 0.92, P=0.402). Conclusions Based on our data, we recommend a spatial resolution on the order of 1.9 mm × 1.9 mm for native myocardial T1 mapping using a MOLLI 5(3)3 sequence at 1.5 T particularly in individuals with higher heart rates and women.

  • METhodological RadiomICs Score (METRICS): a quality scoring tool for radiomics research endorsed by EuSoMII
    Burak Kocak, Tugba Akinci D’Antonoli, Nathaniel Mercaldo, Angel Alberich-Bayarri, Bettina Baessler, Ilaria Ambrosini, Anna E. Andreychenko, Spyridon Bakas, Regina G. H. Beets-Tan, Keno Bressem,et al.

    Springer Science and Business Media LLC
    Abstract Purpose To propose a new quality scoring tool, METhodological RadiomICs Score (METRICS), to assess and improve research quality of radiomics studies. Methods We conducted an online modified Delphi study with a group of international experts. It was performed in three consecutive stages: Stage#1, item preparation; Stage#2, panel discussion among EuSoMII Auditing Group members to identify the items to be voted; and Stage#3, four rounds of the modified Delphi exercise by panelists to determine the items eligible for the METRICS and their weights. The consensus threshold was 75%. Based on the median ranks derived from expert panel opinion and their rank-sum based conversion to importance scores, the category and item weights were calculated. Result In total, 59 panelists from 19 countries participated in selection and ranking of the items and categories. Final METRICS tool included 30 items within 9 categories. According to their weights, the categories were in descending order of importance: study design, imaging data, image processing and feature extraction, metrics and comparison, testing, feature processing, preparation for modeling, segmentation, and open science. A web application and a repository were developed to streamline the calculation of the METRICS score and to collect feedback from the radiomics community. Conclusion In this work, we developed a scoring tool for assessing the methodological quality of the radiomics research, with a large international panel and a modified Delphi protocol. With its conditional format to cover methodological variations, it provides a well-constructed framework for the key methodological concepts to assess the quality of radiomic research papers. Critical relevance statement A quality assessment tool, METhodological RadiomICs Score (METRICS), is made available by a large group of international domain experts, with transparent methodology, aiming at evaluating and improving research quality in radiomics and machine learning. Key points • A methodological scoring tool, METRICS, was developed for assessing the quality of radiomics research, with a large international expert panel and a modified Delphi protocol. • The proposed scoring tool presents expert opinion-based importance weights of categories and items with a transparent methodology for the first time. • METRICS accounts for varying use cases, from handcrafted radiomics to entirely deep learning-based pipelines. • A web application has been developed to help with the calculation of the METRICS score (https://metricsscore.github.io/metrics/METRICS.html) and a repository created to collect feedback from the radiomics community (https://github.com/metricsscore/metrics). Graphical Abstract

  • Automatic structuring of radiology reports with on-premise open-source large language models
    Piotr Woźnicki, Caroline Laqua, Ina Fiku, Amar Hekalo, Daniel Truhn, Sandy Engelhardt, Jakob Kather, Sebastian Foersch, Tugba Akinci D’Antonoli, Daniel Pinto dos Santos,et al.

    Springer Science and Business Media LLC
    Abstract Objectives Structured reporting enhances comparability, readability, and content detail. Large language models (LLMs) could convert free text into structured data without disrupting radiologists’ reporting workflow. This study evaluated an on-premise, privacy-preserving LLM for automatically structuring free-text radiology reports. Materials and methods We developed an approach to controlling the LLM output, ensuring the validity and completeness of structured reports produced by a locally hosted Llama-2-70B-chat model. A dataset with de-identified narrative chest radiograph (CXR) reports was compiled retrospectively. It included 202 English reports from a publicly available MIMIC-CXR dataset and 197 German reports from our university hospital. Senior radiologist prepared a detailed, fully structured reporting template with 48 question-answer pairs. All reports were independently structured by the LLM and two human readers. Bayesian inference (Markov chain Monte Carlo sampling) was used to estimate the distributions of Matthews correlation coefficient (MCC), with [−0.05, 0.05] as the region of practical equivalence (ROPE). Results The LLM generated valid structured reports in all cases, achieving an average MCC of 0.75 (94% HDI: 0.70–0.80) and F1 score of 0.70 (0.70–0.80) for English, and 0.66 (0.62–0.70) and 0.68 (0.64–0.72) for German reports, respectively. The MCC differences between LLM and humans were within ROPE for both languages: 0.01 (−0.05 to 0.07), 0.01 (−0.05 to 0.07) for English, and −0.01 (−0.07 to 0.05), 0.00 (−0.06 to 0.06) for German, indicating approximately comparable performance. Conclusion Locally hosted, open-source LLMs can automatically structure free-text radiology reports with approximately human accuracy. However, the understanding of semantics varied across languages and imaging findings. Key Points QuestionWhy has structured reporting not been widely adopted in radiology despite clear benefits and how can we improve this? FindingsA locally hosted large language model successfully structured narrative reports, showing variation between languages and findings. Critical relevanceStructured reporting provides many benefits, but its integration into the clinical routine is limited. Automating the extraction of structured information from radiology reports enables the capture of structured data while allowing the radiologist to maintain their reporting workflow.

  • Color Maps: Facilitating the Clinical Impact of Quantitative MRI
    Nico Sollmann, Miha Fuderer, Fabio Crameri, Sebastian Weingärtner, Bettina Baeßler, Vikas Gulani, Kathryn E. Keenan, Stefano Mandija, Xavier Golay, and Nandita M. deSouza

    Wiley
    Presenting quantitative data using non‐standardized color maps potentially results in unrecognized misinterpretation of data. Clinically meaningful color maps should intuitively and inclusively represent data without misleading interpretation. Uniformity of the color gradient for color maps is critically important. Maximal color and lightness contrast, readability for color vision‐impaired individuals, and recognizability of the color scheme are highly desirable features. This article describes the use of color maps in five key quantitative MRI techniques: relaxometry, diffusion‐weighted imaging (DWI), dynamic contrast‐enhanced (DCE)‐MRI, MR elastography (MRE), and water‐fat MRI. Current display practice of color maps is reviewed and shortcomings against desirable features are highlighted.Evidence Level5Technical EfficacyStage 2

  • Towards reproducible radiomics research: introduction of a database for radiomics studies
    Tugba Akinci D’Antonoli, Renato Cuocolo, Bettina Baessler, and Daniel Pinto dos Santos

    Springer Science and Business Media LLC
    Abstract Objectives To investigate the model-, code-, and data-sharing practices in the current radiomics research landscape and to introduce a radiomics research database. Methods A total of 1254 articles published between January 1, 2021, and December 31, 2022, in leading radiology journals (European Radiology, European Journal of Radiology, Radiology, Radiology: Artificial Intelligence, Radiology: Cardiothoracic Imaging, Radiology: Imaging Cancer) were retrospectively screened, and 257 original research articles were included in this study. The categorical variables were compared using Fisher’s exact tests or chi-square test and numerical variables using Student’s t test with relation to the year of publication. Results Half of the articles (128 of 257) shared the model by either including the final model formula or reporting the coefficients of selected radiomics features. A total of 73 (28%) models were validated on an external independent dataset. Only 16 (6%) articles shared the data or used publicly available open datasets. Similarly, only 20 (7%) of the articles shared the code. A total of 7 (3%) articles both shared code and data. All collected data in this study is presented in a radiomics research database (RadBase) and could be accessed at https://github.com/EuSoMII/RadBase. Conclusion According to the results of this study, the majority of published radiomics models were not technically reproducible since they shared neither model nor code and data. There is still room for improvement in carrying out reproducible and open research in the field of radiomics. Clinical relevance statement To date, the reproducibility of radiomics research and open science practices within the radiomics research community are still very low. Ensuring reproducible radiomics research with model-, code-, and data-sharing practices will facilitate faster clinical translation. Key Points • There is a discrepancy between the number of published radiomics papers and the clinical implementation of these published radiomics models. • The main obstacle to clinical implementation is the lack of model-, code-, and data-sharing practices. • In order to translate radiomics research into clinical practice, the radiomics research community should adopt open science practices.

  • Addressing challenges in radiomics research: systematic review and repository of open-access cancer imaging datasets
    Piotr Woznicki, Fabian Christopher Laqua, Adam Al-Haj, Thorsten Bley, and Bettina Baeßler

    Springer Science and Business Media LLC
    Abstract Objectives Open-access cancer imaging datasets have become integral for evaluating novel AI approaches in radiology. However, their use in quantitative analysis with radiomics features presents unique challenges, such as incomplete documentation, low visibility, non-uniform data formats, data inhomogeneity, and complex preprocessing. These issues may cause problems with reproducibility and standardization in radiomics studies. Methods We systematically reviewed imaging datasets with public copyright licenses, published up to March 2023 across four large online cancer imaging archives. We included only datasets with tomographic images (CT, MRI, or PET), segmentations, and clinical annotations, specifically identifying those suitable for radiomics research. Reproducible preprocessing and feature extraction were performed for each dataset to enable their easy reuse. Results We discovered 29 datasets with corresponding segmentations and labels in the form of health outcomes, tumor pathology, staging, imaging-based scores, genetic markers, or repeated imaging. We compiled a repository encompassing 10,354 patients and 49,515 scans. Of the 29 datasets, 15 were licensed under Creative Commons licenses, allowing both non-commercial and commercial usage and redistribution, while others featured custom or restricted licenses. Studies spanned from the early 1990s to 2021, with the majority concluding after 2013. Seven different formats were used for the imaging data. Preprocessing and feature extraction were successfully performed for each dataset. Conclusion RadiomicsHub is a comprehensive public repository with radiomics features derived from a systematic review of public cancer imaging datasets. By converting all datasets to a standardized format and ensuring reproducible and traceable processing, RadiomicsHub addresses key reproducibility and standardization challenges in radiomics. Critical relevance statement This study critically addresses the challenges associated with locating, preprocessing, and extracting quantitative features from open-access datasets, to facilitate more robust and reliable evaluations of radiomics models. Key points - Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research. - A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled. - Most datasets can be shared, used, and built upon freely under a Creative Commons license. - All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction. Graphical Abstract

  • Automated localization and segmentation of cervical lymph nodes on contrast-enhanced CT using a 3D foveal fully convolutional neural network
    Miriam Rinneburger, Heike Carolus, Andra-Iza Iuga, Mathilda Weisthoff, Simon Lennartz, Nils Große Hokamp, Liliana Caldeira, Rahil Shahzad, David Maintz, Fabian Christopher Laqua,et al.

    Springer Science and Business Media LLC
    Abstract Background In the management of cancer patients, determination of TNM status is essential for treatment decision-making and therefore closely linked to clinical outcome and survival. Here, we developed a tool for automatic three-dimensional (3D) localization and segmentation of cervical lymph nodes (LNs) on contrast-enhanced computed tomography (CECT) examinations. Methods In this IRB-approved retrospective single-center study, 187 CECT examinations of the head and neck region from patients with various primary diseases were collected from our local database, and 3656 LNs (19.5 ± 14.9 LNs/CECT, mean ± standard deviation) with a short-axis diameter (SAD) ≥ 5 mm were segmented manually by expert physicians. With these data, we trained an independent fully convolutional neural network based on 3D foveal patches. Testing was performed on 30 independent CECTs with 925 segmented LNs with an SAD ≥ 5 mm. Results In total, 4,581 LNs were segmented in 217 CECTs. The model achieved an average localization rate (LR), i.e., percentage of localized LNs/CECT, of 78.0% in the validation dataset. In the test dataset, average LR was 81.1% with a mean Dice coefficient of 0.71. For enlarged LNs with a SAD ≥ 10 mm, LR was 96.2%. In the test dataset, the false-positive rate was 2.4 LNs/CECT. Conclusions Our trained AI model demonstrated a good overall performance in the consistent automatic localization and 3D segmentation of physiological and metastatic cervical LNs with a SAD ≥ 5 mm on CECTs. This could aid clinical localization and automatic 3D segmentation, which can benefit clinical care and radiomics research. Relevance statement Our AI model is a time-saving tool for 3D segmentation of cervical lymph nodes on contrast-enhanced CT scans and serves as a solid base for N staging in clinical practice and further radiomics research. Key points • Determination of N status in TNM staging is essential for therapy planning in oncology. • Segmenting cervical lymph nodes manually is highly time-consuming in clinical practice. • Our model provides a robust, automated 3D segmentation of cervical lymph nodes. • It achieves a high accuracy for localization especially of enlarged lymph nodes. • These segmentations should assist clinical care and radiomics research. Graphical Abstract

  • Denoising diffusion probabilistic models for 3D medical image generation
    Firas Khader, Gustav Müller-Franzes, Soroosh Tayebi Arasteh, Tianyu Han, Christoph Haarburger, Maximilian Schulze-Hagen, Philipp Schad, Sandy Engelhardt, Bettina Baeßler, Sebastian Foersch,et al.

    Springer Science and Business Media LLC
    AbstractRecent advances in computer vision have shown promising results in image generation. Diffusion probabilistic models have generated realistic images from textual input, as demonstrated by DALL-E 2, Imagen, and Stable Diffusion. However, their use in medicine, where imaging data typically comprises three-dimensional volumes, has not been systematically evaluated. Synthetic images may play a crucial role in privacy-preserving artificial intelligence and can also be used to augment small datasets. We show that diffusion probabilistic models can synthesize high-quality medical data for magnetic resonance imaging (MRI) and computed tomography (CT). For quantitative evaluation, two radiologists rated the quality of the synthesized images regarding "realistic image appearance", "anatomical correctness", and "consistency between slices". Furthermore, we demonstrate that synthetic images can be used in self-supervised pre-training and improve the performance of breast segmentation models when data is scarce (Dice scores, 0.91 [without synthetic data], 0.95 [with synthetic data]).

  • CheckList for EvaluAtion of Radiomics research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII
    Burak Kocak, Bettina Baessler, Spyridon Bakas, Renato Cuocolo, Andrey Fedorov, Lena Maier-Hein, Nathaniel Mercaldo, Henning Müller, Fanny Orlhac, Daniel Pinto dos Santos,et al.

    Springer Science and Business Media LLC
    AbstractEven though radiomics can hold great potential for supporting clinical decision-making, its current use is mostly limited to academic research, without applications in routine clinical practice. The workflow of radiomics is complex due to several methodological steps and nuances, which often leads to inadequate reporting and evaluation, and poor reproducibility. Available reporting guidelines and checklists for artificial intelligence and predictive modeling include relevant good practices, but they are not tailored to radiomic research. There is a clear need for a complete radiomics checklist for study planning, manuscript writing, and evaluation during the review process to facilitate the repeatability and reproducibility of studies. We here present a documentation standard for radiomic research that can guide authors and reviewers. Our motivation is to improve the quality and reliability and, in turn, the reproducibility of radiomic research. We name the checklist CLEAR (CheckList for EvaluAtion of Radiomics research), to convey the idea of being more transparent. With its 58 items, the CLEAR checklist should be considered a standardization tool providing the minimum requirements for presenting clinical radiomics research. In addition to a dynamic online version of the checklist, a public repository has also been set up to allow the radiomics community to comment on the checklist items and adapt the checklist for future versions. Prepared and revised by an international group of experts using a modified Delphi method, we hope the CLEAR checklist will serve well as a single and complete scientific documentation tool for authors and reviewers to improve the radiomics literature.

  • Clinical audit in European radiology: current status and recommendations for improvement endorsed by the European Society of Radiology (ESR)
    David C. Howlett, Paulette Kumi, Roman Kloeckner, Nuria Bargallo, Bettina Baessler, Minerva Becker, Steve Ebdon-Jackson, Alexandra Karoussou-Schreiner, Christian Loewe, Marta Sans Merce,et al.

    Springer Science and Business Media LLC
    AbstractClinical audit is an important quality improvement activity and has significant benefits for patients in terms of enhanced care, safety, experience and outcomes. Clinical audit in support of radiation protection is mandated within the European Council Basic Safety Standards Directive (BSSD), 2013/59/Euratom. The European Society of Radiology (ESR) has recognised clinical audit as an area of particular importance in the delivery of safe and effective health care. The ESR, alongside other European organisations and professional bodies, has developed a range of clinical audit-related initiatives to support European radiology departments in developing a clinical audit infrastructure and fulfilling their legal obligations. However, work by the European Commission, the ESR and other agencies has demonstrated a persisting variability in clinical audit uptake and implementation across Europe and a lack of awareness of the BSSD clinical audit requirements. In recognition of these findings, the European Commission supported the QuADRANT project, led by the ESR and in partnership with ESTRO (European Association of Radiotherapy and Oncology) and EANM (European Association of Nuclear Medicine). QuADRANT was a 30-month project which completed in Summer 2022, aiming to provide an overview of the status of European clinical audit and identifying barriers and challenges to clinical audit uptake and implementation. This paper summarises the current position of European radiological clinical audit and considers the barriers and challenges that exist. Reference is made to the QuADRANT project, and a range of potential solutions are suggested to enhance radiological clinical audit across Europe.

  • Trends and statistics of artificial intelligence and radiomics research in Radiology, Nuclear Medicine, and Medical Imaging: bibliometric analysis
    Burak Kocak, Bettina Baessler, Renato Cuocolo, Nathaniel Mercaldo, and Daniel Pinto dos Santos

    Springer Science and Business Media LLC

  • An overview and a roadmap for artificial intelligence in hematology and oncology
    Wiebke Rösler, Michael Altenbuchinger, Bettina Baeßler, Tim Beissbarth, Gernot Beutel, Robert Bock, Nikolas von Bubnoff, Jan-Niklas Eckardt, Sebastian Foersch, Chiara M. L. Loeffler,et al.

    Springer Science and Business Media LLC
    Abstract Background Artificial intelligence (AI) is influencing our society on many levels and has broad implications for the future practice of hematology and oncology. However, for many medical professionals and researchers, it often remains unclear what AI can and cannot do, and what are promising areas for a sensible application of AI in hematology and oncology. Finally, the limits and perils of using AI in oncology are not obvious to many healthcare professionals. Methods In this article, we provide an expert-based consensus statement by the joint Working Group on “Artificial Intelligence in Hematology and Oncology” by the German Society of Hematology and Oncology (DGHO), the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), and the Special Interest Group Digital Health of the German Informatics Society (GI). We provide a conceptual framework for AI in hematology and oncology. Results First, we propose a technological definition, which we deliberately set in a narrow frame to mainly include the technical developments of the last ten years. Second, we present a taxonomy of clinically relevant AI systems, structured according to the type of clinical data they are used to analyze. Third, we show an overview of potential applications, including clinical, research, and educational environments with a focus on hematology and oncology. Conclusion Thus, this article provides a point of reference for hematologists and oncologists, and at the same time sets forth a framework for the further development and clinical deployment of AI in hematology and oncology in the future.

  • Transfer-Learning Deep Radiomics and Hand-Crafted Radiomics for Classifying Lymph Nodes from Contrast-Enhanced Computed Tomography in Lung Cancer
    Fabian Christopher Laqua, Piotr Woznicki, Thorsten A. Bley, Mirjam Schöneck, Miriam Rinneburger, Mathilda Weisthoff, Matthias Schmidt, Thorsten Persigehl, Andra-Iza Iuga, and Bettina Baeßler

    MDPI AG
    Objectives: Positron emission tomography (PET) is currently considered the non-invasive reference standard for lymph node (N-)staging in lung cancer. However, not all patients can undergo this diagnostic procedure due to high costs, limited availability, and additional radiation exposure. The purpose of this study was to predict the PET result from traditional contrast-enhanced computed tomography (CT) and to test different feature extraction strategies. Methods: In this study, 100 lung cancer patients underwent a contrast-enhanced 18F-fluorodeoxyglucose (FDG) PET/CT scan between August 2012 and December 2019. We trained machine learning models to predict FDG uptake in the subsequent PET scan. Model inputs were composed of (i) traditional “hand-crafted” radiomics features from the segmented lymph nodes, (ii) deep features derived from a pretrained EfficientNet-CNN, and (iii) a hybrid approach combining (i) and (ii). Results: In total, 2734 lymph nodes [555 (20.3%) PET-positive] from 100 patients [49% female; mean age 65, SD: 14] with lung cancer (60% adenocarcinoma, 21% plate epithelial carcinoma, 8% small-cell lung cancer) were included in this study. The area under the receiver operating characteristic curve (AUC) ranged from 0.79 to 0.87, and the scaled Brier score (SBS) ranged from 16 to 36%. The random forest model (iii) yielded the best results [AUC 0.871 (0.865–0.878), SBS 35.8 (34.2–37.2)] and had significantly higher model performance than both approaches alone (AUC: p < 0.001, z = 8.8 and z = 22.4; SBS: p < 0.001, z = 11.4 and z = 26.6, against (i) and (ii), respectively). Conclusion: Both traditional radiomics features and transfer-learning deep radiomics features provide relevant and complementary information for non-invasive N-staging in lung cancer.

  • Automation Bias in Mammography: The Impact of Artificial Intelligence BI-RADS Suggestions on Reader Performance
    Thomas Dratsch, Xue Chen, Mohammad Rezazade Mehrizi, Roman Kloeckner, Aline Mähringer-Kunz, Michael Püsken, Bettina Baeßler, Stephanie Sauer, David Maintz, and Daniel Pinto dos Santos

    Radiological Society of North America (RSNA)
    Background Automation bias (the propensity for humans to favor suggestions from automated decision-making systems) is a known source of error in human-machine interactions, but its implications regarding artificial intelligence (AI)-aided mammography reading are unknown. Purpose To determine how automation bias can affect inexperienced, moderately experienced, and very experienced radiologists when reading mammograms with the aid of an artificial intelligence (AI) system. Materials and Methods In this prospective experiment, 27 radiologists read 50 mammograms and provided their Breast Imaging Reporting and Data System (BI-RADS) assessment assisted by a purported AI system. Mammograms were obtained between January 2017 and December 2019 and were presented in two randomized sets. The first was a training set of 10 mammograms, with the correct BI-RADS category suggested by the AI system. The second was a set of 40 mammograms in which an incorrect BI-RADS category was suggested for 12 mammograms. Reader performance, degree of bias in BI-RADS scoring, perceived accuracy of the AI system, and reader confidence in their own BI-RADS ratings were assessed using analysis of variance (ANOVA) and repeated-measures ANOVA followed by post hoc tests and Kruskal-Wallis tests followed by the Dunn post hoc test. Results The percentage of correctly rated mammograms by inexperienced (mean, 79.7% ± 11.7 [SD] vs 19.8% ± 14.0; P < .001; r = 0.93), moderately experienced (mean, 81.3% ± 10.1 vs 24.8% ± 11.6; P < .001; r = 0.96), and very experienced (mean, 82.3% ± 4.2 vs 45.5% ± 9.1; P = .003; r = 0.97) radiologists was significantly impacted by the correctness of the AI prediction of BI-RADS category. Inexperienced radiologists were significantly more likely to follow the suggestions of the purported AI when it incorrectly suggested a higher BI-RADS category than the actual ground truth compared with both moderately (mean degree of bias, 4.0 ± 1.8 vs 2.4 ± 1.5; P = .044; r = 0.46) and very (mean degree of bias, 4.0 ± 1.8 vs 1.2 ± 0.8; P = .009; r = 0.65) experienced readers. Conclusion The results show that inexperienced, moderately experienced, and very experienced radiologists reading mammograms are prone to automation bias when being supported by an AI-based system. This and other effects of human and machine interaction must be considered to ensure safe deployment and accurate diagnostic performance when combining human readers and AI. © RSNA, 2023 Supplemental material is available for this article. See also the editorial by Baltzer in this issue.

  • European Association of Nuclear Medicine (EANM) Focus 4 consensus recommendations: molecular imaging and therapy in haematological tumours
    Cristina Nanni, Carsten Kobe, Bettina Baeßler, Christian Baues, Ronald Boellaard, Peter Borchmann, Andreas Buck, Irène Buvat, Björn Chapuy, Bruce D Cheson,et al.

    Elsevier BV

RECENT SCHOLAR PUBLICATIONS

  • Color Maps: Facilitating the Clinical Impact of Quantitative MRI
    N Sollmann, M Fuderer, F Crameri, S Weingrtner, B Baeler, V Gulani, ...
    Journal of Magnetic Resonance Imaging 61 (4), 1572-1579 2025

  • Color‐map recommendation for MR relaxometry maps
    M Fuderer, B Wichtmann, F Crameri, NM de Souza, B Baeler, V Gulani, ...
    Magnetic Resonance in Medicine 93 (2), 490-506 2025

  • Joint image reconstruction and segmentation of real-time cardiovascular magnetic resonance imaging in free-breathing using a model based on disentangled representation learning
    T Wech, O Schad, S Sauer, J Kleineisel, N Petri, P Nordbeck, TA Bley, ...
    Journal of Cardiovascular Magnetic Resonance 27 (1), 101844 2025

  • Joint image reconstruction and segmentation of real-time cardiac MRI in free-breathing using a model based on disentangled representation learning
    T Wech, O Schad, S Sauer, J Kleineisel, N Petri, P Nordbeck, TA Bley, ...
    Journal of Cardiovascular Magnetic Resonance, 101844 2025

  • Repeatability of radiomic features in myocardial T1 and T2 mapping
    M Manzke, FC Laqua, B Bttcher, AC Klemenz, MA Weber, B Baeler, ...
    European Radiology, 1-13 2025

  • Stability of Dynamic Radiomics Features in Cardiac MRI
    MD Klaus, F Laqua, B Baessler, MJ Ankenbrand
    medRxiv, 2025.04. 18.25326051 2025

  • Referenz Radiologie-Herz
    DC Maintz, B Baeler, J Drner, C Houbois, MC Langenbach, CP Nhle, ...
    Georg Thieme Verlag 2024

  • 46 Diabetische Kardiomyopathie
    B Baeler
    Referenz Radiologie-Herz, 130 2024

  • 17.3 Pathognomonische Befunde
    B Baeler
    Referenz Radiologie-Herz, 59 2024

  • 33 Hypertrophe nicht obstruktive Kardiomyopathie
    B Baeler
    Referenz Radiologie-Herz, 98 2024

  • Automatic structuring of radiology reports with on-premise open-source large language models
    P Woźnicki, C Laqua, I Fiku, A Hekalo, D Truhn, S Engelhardt, J Kather, ...
    European Radiology, 1-12 2024

  • Enhancing histopathological research with privacy-preserving swarm learning and advanced image generation techniques
    J Zhu, OL Saldanha, S Schulz, P Wienholt, SUH Dar, S Engelhardt, ...
    VIRCHOWS ARCHIV 485, S18-S19 2024

  • LernRad: New Course on Cardiac CT Diagnosis offers practice-oriented Training
    B Baessler
    ROFO-FORTSCHRITTE AUF DEM GEBIET DER RONTGENSTRAHLEN UND DER BILDGEBENDEN 2024

  • Deep learning k‐space‐to‐image reconstruction facilitates high spatial resolution and scan time reduction in diffusion‐weighted imaging breast MRI
    ST Sauer, SA Christner, AM Lois, P Woznicki, C Curtaz, AS Kunz, ...
    Journal of Magnetic Resonance Imaging 60 (3), 1190-1200 2024

  • Color Maps: Facilitating the Clinical Impact of Quantitative Magnetic Resonance Imaging
    N Sollmann, M Fuderer, F Crameri, S Weingrtner, B Baessler, V Gulani, ...
    Nico Sollmann, Miha Fuderer, Fabio Crameri, Sebastian Weingrtner, Bettina 2024

  • Cooperative AI training for cardiothoracic segmentation in computed tomography: An iterative multi-center annotation approach
    B Lassen-Schmidt, B Baessler, M Gutberlet, J Berger, JM Brendel, ...
    European Journal of Radiology 176, 111534 2024

  • Radiomics workflow definition & challenges-German priority program 2177 consensus statement on clinically applied radiomics
    R Floca, J Bohn, C Haux, B Wiestler, FG Zllner, A Reinke, J Wei, ...
    Insights into imaging 15 (1), 124 2024

  • Perfect match: radiomics and artificial intelligence in cardiac imaging
    B Baeler, S Engelhardt, A Hekalo, A Hennemuth, M Hllebrand, A Laube, ...
    Circulation: Cardiovascular Imaging 17 (6), e015490 2024

  • Robustness of radiomic features in healthy abdominal parenchyma of patients with repeated examinations on dual-layer dual-energy CT
    M Schneck, S Lennartz, D Zopfs, K Sonnabend, RWM Reimer, ...
    European Journal of Radiology 175, 111447 2024

  • The challenges of research data management in cardiovascular science: a DGK and DZHK position paper—executive summary
    S Steffens, K Schrder, M Krger, C Maack, K Streckfuss-Bmeke, ...
    Clinical Research in Cardiology 113 (5), 672-679 2024

MOST CITED SCHOLAR PUBLICATIONS

  • Radiomics in medical imaging—“how-to” guide and critical reflection
    JE Van Timmeren, D Cester, S Tanadini-Lang, H Alkadhi, B Baessler
    Insights into imaging 11 (1), 91 2020
    Citations: 1186

  • Medical students' attitude towards artificial intelligence: a multicentre survey
    D Pinto dos Santos, D Giese, S Brodehl, SH Chon, W Staab, R Kleinert, ...
    European radiology 29, 1640-1646 2019
    Citations: 655

  • Denoising diffusion probabilistic models for 3D medical image generation
    F Khader, G Mller-Franzes, S Tayebi Arasteh, T Han, C Haarburger, ...
    Scientific Reports 13 (1), 7303 2023
    Citations: 264

  • Machine learning in cardiovascular magnetic resonance: basic concepts and applications
    T Leiner, D Rueckert, A Suinesiaputra, B Baeler, R Nezafat, I Išgum, ...
    Journal of Cardiovascular Magnetic Resonance 21 (1), 61 2019
    Citations: 256

  • Robustness and reproducibility of radiomics in magnetic resonance imaging: a phantom study
    B Baeler, K Weiss, DP Dos Santos
    Investigative radiology 54 (4), 221-228 2019
    Citations: 248

  • CheckList for EvaluAtion of Radiomics research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII
    B Kocak, B Baessler, S Bakas, R Cuocolo, A Fedorov, L Maier-Hein, ...
    Insights into imaging 14 (1), 75 2023
    Citations: 241

  • Image-based cardiac diagnosis with machine learning: a review
    C Martin-Isla, VM Campello, C Izquierdo, Z Raisi-Estabragh, B Baeler, ...
    Frontiers in cardiovascular medicine 7, 1 2020
    Citations: 223

  • Subacute and chronic left ventricular myocardial scar: accuracy of texture analysis on nonenhanced cine MR images
    B Baessler, M Mannil, S Oebel, D Maintz, H Alkadhi, R Manka
    Radiology 286 (1), 103-112 2018
    Citations: 193

  • How COVID-19 kick-started online learning in medical education—The DigiMed study
    F Stoehr, L Mller, A Brady, A Trilla, A Mhringer-Kunz, F Hahn, C Dber, ...
    PLoS One 16 (9), e0257394 2021
    Citations: 166

  • Automation bias in mammography: the impact of artificial intelligence BI-RADS suggestions on reader performance
    T Dratsch, X Chen, M Rezazade Mehrizi, R Kloeckner, A Mhringer-Kunz, ...
    Radiology 307 (4), e222176 2023
    Citations: 151

  • Cardiac MRI texture analysis of T1 and T2 maps in patients with infarctlike acute myocarditis
    B Baessler, C Luecke, J Lurz, K Klingel, M Von Roeder, S De Waha, ...
    Radiology 289 (2), 357-365 2018
    Citations: 147

  • Texture analysis and machine learning of non-contrast T1-weighted MR images in patients with hypertrophic cardiomyopathy—Preliminary results
    B Baeler, M Mannil, D Maintz, H Alkadhi, R Manka
    European journal of radiology 102, 61-67 2018
    Citations: 135

  • A decade of radiomics research: are images really data or just patterns in the noise?
    D Pinto dos Santos, M Dietzel, B Baessler
    European radiology 31, 1-4 2021
    Citations: 131

  • METhodological RadiomICs Score (METRICS): a quality scoring tool for radiomics research endorsed by EuSoMII
    B Kocak, T Akinci D’Antonoli, N Mercaldo, A Alberich-Bayarri, B Baessler, ...
    Insights into imaging 15 (1), 8 2024
    Citations: 118

  • A systematic evaluation of three different cardiac T2-mapping sequences at 1.5 and 3T in healthy volunteers
    B Baeler, F Schaarschmidt, C Stehning, B Schnackenburg, D Maintz, ...
    European Journal of Radiology 84 (11), 2161-2170 2015
    Citations: 113

  • Cardiac MRI and texture analysis of myocardial T1 and T2 maps in myocarditis with acute versus chronic symptoms of heart failure
    B Baessler, C Luecke, J Lurz, K Klingel, A Das, M Von Roeder, ...
    Radiology 292 (3), 608-617 2019
    Citations: 112

  • Biventricular myocardial strain analysis in patients with arrhythmogenic right ventricular cardiomyopathy (ARVC) using cardiovascular magnetic resonance feature tracking
    P Heermann, DM Hedderich, M Paul, C Schlke, JR Kroeger, B Baeler, ...
    Journal of Cardiovascular Magnetic Resonance 16 (1), 75 2014
    Citations: 111

  • Monoenergetic reconstructions for imaging of coronary artery stents using spectral detector CT: In-vitro experience and comparison to conventional images
    T Hickethier, B Baeler, JR Kroeger, J Doerner, G Pahn, D Maintz, ...
    Journal of cardiovascular computed tomography 11 (1), 33-39 2017
    Citations: 95

  • Radiomics allows for detection of benign and malignant histopathology in patients with metastatic testicular germ cell tumors prior to post-chemotherapy retroperitoneal lymph
    B Baessler, T Nestler, D Pinto dos Santos, P Paffenholz, V Zeuch, ...
    European Radiology 30 (4), 2334-2345 2020
    Citations: 93

  • Big data, artificial intelligence, and structured reporting
    D Pinto dos Santos, B Baeler
    European radiology experimental 2 (1), 42 2018
    Citations: 88