TEXT: Traffic Entity eXtraction from Twitter

Lim Cheng Yang, Ian K. T. Tan, Bhawani Selvaretnam, Ewe Kok Howg, Lau Heng Kar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)


Traffic congestion is a significant problem for many developing countries. Among a variety of methods proposed to address this problem, extraction of road traffic conditions from microblogging platforms for dissemination to road users has gained popularity. Coupled with the use of Internet of Things (IoT) devices, up-to-date traffic condition reports from road users and road traffic authorities, can be effectively reconciled and communicated to the intended audience. However, the noisy and unstructured nature of Twitter texts causes a decline in the performance of conventional Named Entity Recognition (NER) techniques. NER techniques have traditionally been used to extract location information. In order to extract the corresponding traffic state information for each location, contextual information become important which are not implemented in conventional NER approaches. In this paper, a rule-based NER technique, which considers contextual clues, is proposed for the extraction of location and traffic state entities. The proposed approach has achieved significant improvement to the F1 score, 88.96 and 81.32, for location entity extraction from formal and informal sources, respectively.

Original languageEnglish
Title of host publicationProceedings of the 2019 5th International Conference on Computing and Data Engineering
PublisherAssociation for Computing Machinery
Number of pages7
ISBN (Electronic)9781450361248
Publication statusPublished - 4 May 2019
Event5th International Conference on Computing and Data Engineering 2019 - Shanghai, China
Duration: 4 May 20196 May 2019


Conference5th International Conference on Computing and Data Engineering 2019
Abbreviated titleICCDE 2019


  • Internet of things
  • Named entity recognition
  • Natural language processing
  • Rule-based
  • Twitter

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Computer Science Applications
  • Health Informatics


Dive into the research topics of 'TEXT: Traffic Entity eXtraction from Twitter'. Together they form a unique fingerprint.

Cite this