TY - GEN
T1 - AKEA
T2 - 2nd International Conference on Advanced Intelligent Systems and Informatics, AISI 2016
AU - Amer, Eslam
AU - Foad, Khaled
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2016/10/18
Y1 - 2016/10/18
N2 - Keyphrase extraction is a critical step in many natural language processing and Information retrieval applications. In this paper, we introduce AKEA, a keyphrase extraction algorithm for single Arabic documents. AKEA is an unsupervised algorithm as it does not need any type of training in order to achieve its task. We rely on heuristics that collaborate linguistic patterns based on Part-Of-Speech (POS) tags, statistical knowledge, and the internal structural pattern of terms (i.e. word-occurrence). We employ the usage of Arabic Wikipedia to improve the ranking (or significance) of candidate keyphrases by adding a confidence score if the candidate exist as an indexed Wikipedia concept. Experimental results show that on average AKEA has the highest precision value, the highest F-measure value which indicates it presents more accurate results compared to its other algorithms.
AB - Keyphrase extraction is a critical step in many natural language processing and Information retrieval applications. In this paper, we introduce AKEA, a keyphrase extraction algorithm for single Arabic documents. AKEA is an unsupervised algorithm as it does not need any type of training in order to achieve its task. We rely on heuristics that collaborate linguistic patterns based on Part-Of-Speech (POS) tags, statistical knowledge, and the internal structural pattern of terms (i.e. word-occurrence). We employ the usage of Arabic Wikipedia to improve the ranking (or significance) of candidate keyphrases by adding a confidence score if the candidate exist as an indexed Wikipedia concept. Experimental results show that on average AKEA has the highest precision value, the highest F-measure value which indicates it presents more accurate results compared to its other algorithms.
KW - Keyphrase extraction
KW - Natural language processing
UR - http://www.scopus.com/inward/record.url?scp=84994531647&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-48308-5_14
DO - 10.1007/978-3-319-48308-5_14
M3 - Conference contribution
AN - SCOPUS:84994531647
SN - 9783319483078
T3 - Advances in Intelligent Systems and Computing
SP - 137
EP - 146
BT - Proceedings of the International Conference on Advanced Intelligent Systems and Informatics, 2016
A2 - Hassanien, Aboul Ella
A2 - Shaalan, Khaled
A2 - Azar, Ahmad Taher
A2 - Gaber, Tarek
A2 - Tolba, Mohamed F.
PB - Springer
Y2 - 24 October 2016 through 26 October 2016
ER -