• Text mining with Python

    In my last post, I introduced the method I am employing for the qualitative discourse analysis of my interviews. Here, I want to introduce some of the tools I have used to build my script. Reading multiple .txt files All interview transcriptions were saved as .txt files and read into Python using the os and pandas packages. Saving the main directory (containing all interview files) address as a variable, I could easily extract all files from the directory with the correct format name like this: Removing punctuation and irrelevant breaks To remove excess spacing and irrelevant punctuation, each interview content is converted into a giant string and I simply used…

  • Thesis: Analysing interviews using network methodology?

    A part of my thesis work is to investigate the reliability of the Force Concept Inventory through interviews with the students. I have spent quite some time transcribing, and my next step is therefore to analyse these ones. For this, I choose to employ a recently suggested network approach for qualitative discourse analysis as described in Bruun et al (2018). The method intrigues me as it could relate well to my network approach for analysing the Force Concept Inventory. Of course, the author of both methods is also my supervisor for this project, so a natural introduction to both approaches became a part of our discussions. Potentially, if the method…

  • Transcribing interviews

    This was never my dream, but it has almost been my entire thesis up until recently. In this post, I will describe what my interviews have been about, and how I have structured my transcription. What did my data consist of? The raw interviews consists of 22 audio files, on average about 30 minutes long. The longest interview was 46 minutes, and the shortest one was only 19 minutes – in total 11 hours of recording. Each interview consisted of two parts of content: A qualitative first part assessing general characteristics of the participant and also preparing the participant for the second part of the interview which was directly related…