AIDA: Accurate Online Disambiguation of Named Entities in Text and Tables
News
Stay up to date with AIDA news and releases, send a mail to: aida-news-subscribe@lists.mpi-inf.mpg.de
Overview
AIDA is a framework and online tool for entity detection and disambiguation. Given a natural-language text or a Web table, it maps mentions of ambiguous names onto canonical entities (e.g., individual people or places) registered in the YAGO2 knowledge base.
You can try AIDA on any text you like in the online demo.
To experimentally verify the quality of AIDA, we annotated nearly 1,400 newswire articles with the entities mentioned in each article. This collection is available for download (see Downloads).
Further Information
If you need any further information, please contact us via mail: NAME_OF_PROJECT@mpi-inf.mpg.de
Discuss on our Mailing List
Discuss AIDA with us and other users by joining our mailing list. Send a mail to aida-users-subscribe@lists.mpi-inf.mpg.de or sign up at https://lists.mpi-inf.mpg.de/listinfo/aida-users.
Downloads and Datasets
Find all datasets related to AIDA in our downloads area.
AIDA JSON Web Service
We provide a HTTP JSON web service for AIDA so that you can try it out without any hassle of setting it up. It's as easy as:
curl --data text="Dylan was born in Duluth." https://gate.d5.mpi-inf.mpg.de/aida/service/disambiguate
More information is available in our web service description.
Please do not use it for comparison in scientific papers or for running time experiments, as the service changes continuously. If you want to compare AIDA for research, please download it and set it up on your own machines.
Publications
- U-AIDA: a Customizable System for Named Entity Recognition, Classification, and Disambiguation PDF
Mohamed Amir Yosef
Dissertation, 2015 - Discovering and Disambiguating Named Entities in Text PDF
Johannes Hoffart
Dissertation, 2015 - EDRAK: Entity-Centric Data Resource for Arabic Knowledge
Mohamed H Gad-Elrab, Mohamed Amir Yosef, Gerhard Weikum
In: ANLP Workshop 2015 at ACL-IJCNLP 2015, Beijing, China, 2015 - Named Entity Disambiguation for Resource-Poor Languages
Mohamed H Gad-Elrab, Mohamed Amir Yosef, Gerhard Weikum
In: Proceedings of the Eighth Workshop on Exploiting Semantic Annotations in Information Retrieval, p. 29-34, ESAIR 2015, Melbourne, Australia, 2015 - AIDA-Social: Entity Linking on the Social Stream
Yusra Ibrahim, Mohamed Amir Yosef, Gerhard Weikum
In: Proceedings of the 7th International Workshop on Exploiting Semantic Annotations in Information Retrieval, p. 17-19. ESAIR 2014, Shanghai, China, 2014 - AIDArabic A Named-Entity Disambiguation Framework for Arabic Text
Mohamed Amir Yosef, Marc Spaniol, Gerhard Weikum
In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, p. 187-195. ANLP 2014, Doha, Qatar, 2014 - Discovering Emerging Entities with Ambiguous Names
Johannes Hoffart, Yasemin Altun, Gerhard Weikum
In: Proceedings of the 23rd International World Wide Web Conference, p. 385–395. WWW 2014, Seoul, South Korea, 2014 - AIDA-light: High-Throughput Named-Entity Disambiguation
Dat Ba Nguyen, Johannes Hoffart, Martin Theobald, Gerhard Weikum
In: Linked Data on the Web, WWW 2014, Seoul, South Korea, 2014 - KORE: Keyphrase Overlap Relatedness for Entity Disambiguation
Johannes Hoffart, Stephan Seufert, Dat Ba Nguyen, Martin Theobald, and Gerhard Weikum
In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, p. 545-554, CIKM 2012, Maui, USA, 2012 - AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables
Mohamed Amir Yosef, Johannes Hoffart, Ilaria Bordino, Marc Spaniol, Gerhard Weikum
In: Proceedings of the 37th International Conference on Very Large Databases, VLDB 2011, p. 1450–1453, Seattle, WA, 2011 - Robust Disambiguation of Named Entities in Text PDF
Johannes Hoffart, Mohamed Amir Yosef, Ilaria Bordino, Hagen Fürstenau, Manfred Pinkal, Marc Spaniol, Bilyana Taneva, Stefan Thater, Gerhard Weikum
In: Conference on Empirical Methods in Natural Language Processing, p. 782–792, Edinburgh, Scotland, 2011
For scientific works, please cite this paper
Demo
AIDA can be tested online, with different methods and configurations for entity disambiguation: AIDA Demo
Please use either Firefox or Chrome to view the Demo.
Results
In the EMNLP 2011 paper, the results are given on the subset of 228 CoNLL testb documents wich could be processed by all the competitor methods (1270testb, 1308testb, and 1349testb are missing). We did this for the sake of comparability. The results of our AIDA methods on all 231 CoNLL testb documents are given below. The short names are the same as in the paper.
Measure | sim-k | r-prior sim-k | r-prior sim-k coh | r-prior sim-k r-coh | prior |
---|---|---|---|---|---|
Macro Precision@1.0 | 76.65 | 80.81 | 80.86 | 82.02 | 71.36 |
Micro Precision@1.0 | 76.65 | 80.06 | 82.24 | 82.29 | 66.55 |
Measure | sim-k | r-prior sim-k | r-prior sim-k coh | r-prior sim-k r-coh | prior |
---|---|---|---|---|---|
Macro Precision@1.0 | 76.00 | 81.03 | 80.67 | 81.66 | 75.16 |
Micro Precision@1.0 | 76.61 | 80.56 | 82.05 | 82.54 | 70.46 |