Validity of ChatGPT-generated musculoskeletal images
Published Date: 04th March 2024
Publication Authors: Iyengar KP
Objective
In the evolving landscape of medical research and radiology, effective communication of intricate ideas is imperative, with visualizations playing a crucial role. This study explores the transformative potential of ChatGPT4, a powerful Large Language Model (LLM), in automating the creation of schematics and figures for radiology research papers, specifically focusing on its implications for musculoskeletal studies.
Materials and methods
Deploying ChatGPT4, the study aimed to assess the model’s ability to generate anatomical images of six large joints—shoulder, elbow, wrist, hip, knee, and ankle. Four variations of a text prompt were utilized, to generate a coronal illustration with annotations for each joint. Evaluation parameters included anatomical correctness, correctness of annotations, aesthetic nature of illustrations, usability of figures in research papers, and cost-effectiveness. Four panellists performed the assessment using a 5-point Likert Scale.
Results
Overall analysis of the 24 illustrations encompassing the six joints of interest (4 of each) revealed significant limitations in ChatGPT4’s performance. The anatomical design ranged from poor to good, all of the illustrations received a below-average rating for annotation, with the majority assessed as poor. All of them ranked below average for usability in research papers. There was good agreement between raters across all domains (ICC = 0.61).
Conclusion
While LLMs like ChatGPT4 present promising prospects for rapid figure generation, their current capabilities fall short of meeting the rigorous standards demanded by musculoskeletal radiology research. Future developments should focus on iterative refinement processes to enhance the realism of LLM-generated musculoskeletal schematics.
Ajmera, P.; Iyengar, KP. et al. (2024). Validity of ChatGPT-generated musculoskeletal images. Skeletal Radiology. [Online]. Available at: https://doi.org/10.1007/s00256-024-04638-y [Accessed 2 April 2024].
« Back