- Cornell Movie--Dialogs Corpus: Contains fictional conversations between pairs of movie characters
- Thenmap: Documents historical borders, with an open-source API and mapping tool that let users show how borders have changed over time
- Refugees, where they are from and where they go: Data from the Department of State that can be manipulated in various ways, such as identifying the country of origin and the state of relocation
- Side Effect Resource (SIDER): Contains 139,756 side effects of 1,430 medical drugs
Thursday, February 11, 2016
Thought you might be interested Thursday: Interesting and useful data
Jeremy Singer-Vine's Data is Plural is a most useful website. Consider, for example, the data sets he has shared recently: