So, today, I have been coding and do not have much to talk about. Originally, I had planned to use the third corpora that I mentioned in yesterday’s post, but I did not read the page correctly and the data has not been released yet. So, instead, I looked at this paper and found this corpus, the CIC-FCE dataset. The majority of the day has been spent just examining this corpus, and trying to massage it into a way that I can easily feed into the Stanford CoreNLP parser. No hard results yet, but I hope that changes tomorrow.