Inductive thematic analysis of healthcare qualitative interviews using open-source large language models: how does it compare to traditional methods?

Walter S. Mathis*, Sophia Zhao, Nicholas Pratt, Jeremy Weleff, Stefano De Paoli

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    25 Citations (Scopus)
    69 Downloads (Pure)

    Abstract

    Background
    Large language models (LLMs) are generative artificial intelligence that have ignited much interest and discussion about their utility in clinical and research settings. Despite this interest there is sparse analysis of their use in qualitative thematic analysis comparing their current ability to that of human coding and analysis. In addition, there has been no published analysis of their use in real-world, protected health information.

    Objective
    Here we fill that gap in the literature by comparing an LLM to standard human thematic analysis in real-world, semi-structured interviews of both patients and clinicians within a psychiatric setting.

    Methods
    Using a 70 billion parameter open-source LLM running on local hardware and advanced prompt engineering techniques, we produced themes that summarized a full corpus of interviews in minutes. Subsequently we used three different evaluation methods for quantifying similarity between themes produced by the LLM and those produced by humans.

    Results
    These revealed similarities ranging from moderate to substantial (Jaccard similarity coefficients 0.44-0.69), which are promising preliminary results.

    Conclusion
    Our study demonstrates that open-source LLMs can effectively generate robust themes from qualitative data, achieving substantial similarity to human-generated themes. The validation of LLMs in thematic analysis, coupled with evaluation methodologies, highlights their potential to enhance and democratize qualitative research across diverse fields.
    Original languageEnglish
    Article number108356
    Number of pages11
    JournalComputer Methods and Programs in Biomedicine
    Volume255
    Early online date24 Jul 2024
    DOIs
    Publication statusPublished - 1 Oct 2024

    Keywords

    • Artificial intelligence
    • Large language models
    • Qualitative methods
    • Thematic analysis
    • Mental health

    Fingerprint

    Dive into the research topics of 'Inductive thematic analysis of healthcare qualitative interviews using open-source large language models: how does it compare to traditional methods?'. Together they form a unique fingerprint.

    Cite this