Abstract
Social media platforms such as Twitter provide convenient ways to share and consume important information during disasters and emergencies. Reports from bystanders and eyewitnesses can give law enforcement agencies and humanitarian organizations firsthand, credible information about an ongoing situation, supporting situational awareness among other uses. However, identifying eyewitness reports on Twitter is challenging for many reasons. This work investigates the sources of tweets and classifies them into three types: (i) direct eyewitnesses, (ii) indirect eyewitnesses, and (iii) vulnerable accounts. Moreover, we investigate various characteristics associated with each kind of eyewitness account. We observe that words related to perceptual senses (feeling, seeing, hearing) tend to be present in direct eyewitness messages, whereas emotions, thoughts, and prayers are more common in messages from indirect eyewitnesses. We believe these characteristics can inform the development of more efficient computational methods and systems for the automatic identification of eyewitness accounts.