Contents
Publications
Downloads
Wraetlic tools
Instances in WordNet 1.7
Similarity and Relatedness in WordSim353

Publications


Lexical semantics

Enrique Alfonseca, Keith Hall, Silvana Hartmann, Large-scale Computation of Distributional Similarities for Queries, In Proceedings of NAACL-HLT 2009.

Eneko Agirre, Enrique Alfonseca, Keith Hall, Jana Kravalova, Marius Pasca, Aitor Soroa, A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches, In Proceedings of NAACL-HLT 2009 [pdf].

Ruiz-Casado, M. and Alfonseca, E. and Castells, P., Automatic Acquisition of Semantics from Text for Semantic Work Environments, In Rech, Decker and Ras (eds.), Emerging Technologies for Semantic Work Environments: Techniques, Methods, and Applications. Information Science Reference, 2008.

M. Ruiz-Casado, E. Alfonseca, M. Okumura and P. Castells, Information Extraction and Semantic Annotation from Wikipedia. In P. Buitelaar and P. Cimiano (eds.), Ontology Learning and Population: Bridging the gap between text and knowledge, IOS Press, 2008.

E. Alfonseca, P. Castells and M. Ruiz-Casado, Automatising the Learning of Lexical Patterns: an Application to the Enrichment of WordNet by Extracting Semantic Relationships from Wikipedia. In the journal Data and Knowledge Engineering, vol. 61 (3), pp. 484-499, Elsevier [abstract and full paper]

M. Ruiz-Casado, E. Alfonseca, M. Okumura and P. Castells, Towards Large-scale Non-taxonomic Relation Extraction: Estimating the Precision of Rote Extractors. In the second workshop on Ontology Learning and Population, co-located with ACL-2006.

E. Alfonseca, P. Castells, M. Okumura and M. Ruiz-Casado, A Rote Extractor with Edit Distance-based Generalisation and Multi-corpora Precision Calculation. In the poster session of ACL-2006.

M. Ruiz-Casado, E. Alfonseca and P. Castells, From Wikipedia to Semantic Relationships: a Semi-automated Annotation Approach. In the First Workshop SemWiki2006--From Wiki to Semantics, co-located with the European Semantic Web Conference, ESWC-2006.

M. Ruiz-Casado, E. Alfonseca and P. Castells, Using context-window overlapping in Synonym Discovery and Ontology Extension. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP-2005, Borovets, Bulgaria, 2005 [pdf]

M. Ruiz-Casado, E. Alfonseca and P. Castells, Automatic extraction of semantic relationships for WordNet by means of pattern learning from Wikipedia. Proceedings of NLDB-2005. In Natural Language Processing and Information Systems, Lecture Notes in Computer Science 3513, pp. 67-79, Springer, 2005 [Abstract and full paper] [pdf]

M. Ruiz-Casado, E. Alfonseca and P. Castells, Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets, Proceedings of the Atlantic Web Intelligence Conference, AWIC-2005. Lecture Notes in Computer Science 3528, pp. 380-386, Springer, 2005 [pdf]

E. Agirre, E. Alfonseca and O. López de Lacalle, Approximating hierarchy-based similarity for WordNet nominal synsets using Topic Signatures, Second International Global WordNet Conference, 2004 [pdf] [local pdf].

E. Alfonseca and S. Manandhar, Extending a Lexical Ontology by a Combination of Distributional Semantics Signatures, EKAW-2002, Sigüenza, Spain, October 2002 (35% acceptance rate). Published in Lecture Notes in Artificial Intelligence 2473 (Springer Verlag), pp. 1-7 [abstract and full paper] [local pdf].

E. Alfonseca and S. Manandhar, Proposal for Evaluating Ontology Refinement Methods. Language Resources and Evaluation (LREC-2002), Las Palmas, Spain, May 2002 [pdf].

E. Alfonseca and S. Manandhar, Improving an Ontology Refinement Method with Hyponymy Patterns. Language Resources and Evaluation (LREC-2002), Las Palmas, Spain, May 2002 [pdf].

E. Alfonseca and S. Manandhar, An unsupervised method for general named entity recognition and automated concept discovery. In Proceedings of the 1st International Conference on Global WordNet, Mysore, India, January 2002 [pdf].

E. Alfonseca and S. Manandhar, Distinguishing concepts and instances in WordNet. In Proceedings of the 1st International Conference on Global WordNet, Mysore, India, January 2002 [pdf] [Download the data files]

Morphology

E. Alfonseca, S. Bilac and S. Pharies, Decompounding query keywords from compounding languages. In Proceedings of ACL-2008

E. Alfonseca, S. Bilac and S. Pharies, German decompounding in a difficult corpus, In Proceedings of CICLING-2008

Text summarisation

M. Fuentes, E. Alfonseca and H. Rodríguez, Support Vector Machines for Query-focused Summarization trained and evaluated on Pyramid data. Proceedings of ACL-2007.

E. Alfonseca, M. Okumura, A. Moreno-Sandoval and J. M. Guirao, Googling answer's models in question-focused summarization. In the Proceedings of the Document Understanding Conference 2006.

R. Torralbo, E. Alfonseca, A. Moreno-Sandoval and J. M. Guirao, Description of the UAM system for DUC-2005. In the International Workshop of the Document Understanding Conference (DUC-2005), in conjunction with NAACL, Vancouver, Canada, 2005.

R. Torralbo, E. Alfonseca, A. Moreno-Sandoval and J. M. Guirao, Automatic Generation of Term Definitions using Multidocument Summarisation from the Web. In Proceedings of the International Workshop Crossing Barriers in Text Summarization Research, in conjunction with RANLP-2005, Borovets, Bulgaria, 2005 [pdf]

E. Alfonseca, J. M. Guirao and A. Moreno-Sandoval, A study of chunk-based and keyword-based approaches for generating headlines. Advances in Natural Language Processing. Lecture Notes in Computer Science 3230, pp. 395-406, 2004 [abstract and full paper] [local pdf].

E. Alfonseca, J. M. Guirao and A. Moreno-Sandoval, Description of the UAM system for generating very short summaries at DUC-2004. NAACL Text Summarization Workshop and Document Understanding Conference 2004 [pdf].

E. Alfonseca and P. Rodríguez, Description of the UAM system for generating very short summaries at DUC-2003. HLT-NAACL Text Summarization Workshop and Document Understanding Conference 2003 [ps] [local pdf].

E. Alfonseca and P. Rodríguez, Generating extracts with genetic algorithms, The 2003 European Conference on Information Retrieval (ECIR'2003), Pisa, Italy. Lecture Notes in Computer Science 2633 (Springer-Verlag), pp. 511-519 [abstract and full paper] [local pdf].

Question-Answering

E. Alfonseca, M. DeBoni, J. L. Jara-Valencia and S. Manandhar, A Prototype Question-Answering System using Syntactic and Semantic Information for Answer Retrieval. In Text REtrieval Conference, TREC-2001, Question-Answering Track, 2001

Language Resources and Evaluation

D. Samy, A. Moreno-Sandoval, J. M. Guirao and E. Alfonseca, Building and Assessment of a Parallel Multilingual Corpus (Arabic-Spanish-English). In Proceedings of the Language Resources and Evaluation Conference, LREC-2006.

E. Alfonseca, A. Moreno-Sandoval, J. M. Guirao and M. Ruiz-Casado, The wraetlic NLP suite. In Proceedings of the Language Resources and Evaluation Conference, LREC-2006.

E. Alfonseca and M. Ruiz-Casado, Learning Sure-fire Rules for Named Entity Recognition. In Proceedings of the International Workshop on Text Mining Research, in conjunction with RANLP-2005, Borovets, Bulgaria, 2005 [pdf]

E. Alfonseca, A WordNet Interface to APL2, Proceedings of the APL-2002 Conference, Madrid, July 2002. Published as E. Alfonseca, A WordNet Interface to APL2, APL Quote Quad (ACM SIGAPL), Vol. 32:4, p. 7-16, 2002 [doc].

E. Alfonseca and S. Manandhar, A Framework for Constructing Temporal Models from Texts, FLAIRS-2002, Pensacola, Florida, USA, 2002 [pdf].

E. Alfonseca, Knowledge Extraction using Information Extraction, First Technical Workshop of the Computer Engineering Department. Universidad Autónoma de Madrid, March 2000.

Syntax

E. Alfonseca and S. Manandhar, Noun Phrase chunking with APL2. In Proceedings of the APL-Berlin-2000 Conference, Berlin, July 2000. Published as E. Alfonseca and S. Manandhar, Noun Phrase chunking with APL2, APL Quote Quad (ACM SIGAPL), Vol. 30:4, p. 135-143, Jul. 2000.

Textual entailment

D. Perez and E. Alfonseca, Using Bleu-like Algorithms for the Automatic Recognition of Entailment. In Machine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification and Recognising Textual Entailment. Lecture Notes in Computer Science, volume 3944, Springer Berlin / Heidelberg, 2006, pages 191-204.

D. Pérez and E. Alfonseca, Application of the Bleu algorithm for recognising textual entailments. Recognising Textual Entailment Challenge, Pascal Challenges Workshop, Southampton, U.K., 2005 [pdf].

Adaptive hypermedia

E. Alfonseca, P. Rodríguez and D. Pérez, An approach for automatic generation of adaptive hypermedia in education with multilingual knowledge discovery techniques. In Computers and Education vol. 49 (2), pp. 495-513, 2007 [Abstract and full paper]

E. Alfonseca, R. Carro, E. Martin, A. Ortigosa and P. Paredes, The Impact of Learning Styles on Student Grouping for Collaborative Learning: A Case Study. In User Modelling and User-Adapted Interaction, Vol 16, numbers 3-4, pages 377-401, Kluwer Academic Publishers [Abstract and full paper].

E. Alfonseca, P. Rodríguez and D. Pérez, Welkin: automatic generation of hypermedia sites with NLP techniques. Lecture Notes in Computer Science 3140 (Springer-Verlag), pp. 617-618, 2004. [Abstract and full paper] [local pdf]

E. Alfonseca, D. Pérez and P. Rodríguez, Automatic Multilingual Generation of on-line Information Sites. In Proceedings of the Second International Conference on Multimedia and ICTs in Education, Badajoz, Spain, December 2003 [pdf].

E. Alfonseca and P. Rodríguez, Extending an on-line information site with accurate domain-dependent extracts from the World Wide Web, Semantic Web for Web Learning workshop, CAiSE'2003, Velden, Austria [pdf]

E. Alfonseca and P. Rodríguez, Modelling users' interests and needs for an adaptive on-line information system, User Modelling Conference (UM'2003), Pittsburgh, U.S.A. Lecture Notes in Artificial Intelligence 2702, pp. 76-80 (Springer-Verlag) [abstract and full paper] [local pdf].

E. Alfonseca and P. Rodríguez, Automatically Generating Hypermedia Documents depending on User Goals, Workshop on Document Compression and Synthesis in Adaptive Hypermedia Systems, AH-2002, Málaga, Spain, 2002.

Free-text Computer-Assisted Assessment

D. Perez, I. Pascual-Nieto, P. Rodríguez, E. Anguiano and E. Alfonseca, Validating Automatically Generated Students' Conceptual Models from Free-text Answers at the Level of Concepts, International Journal of Computer Science 1060, pp. 90-93, 2008.

D. Perez, I. Pascual, P. Rodriguez, E. Anguiano and E. Alfonseca, Validating Automatically Generated Students' Conceptual Models from Free-text Answers at the Level of Concepts. In proceedings of the International e-conference on Computer Science (IECCS), December, 2007.

D. Perez, I. Pascual, E. Alfonseca, E. Anguiano and P. Rodriguez, A study on the impact of the use of an automatic and adaptive free-text assessment system during a university course. Chapter of the book "Blended Learning", presented at the International Conference on Web-based Learning (ICWL), August 2007, Edinburgh, United Kingdom and published by Prentice Hall, Pearson Education, ISBN 978-981-06-7903-3, pages 186-195.

D. Perez, E. Alfonseca, P. Rodriguez and I. Pascual, A study on the possibility of automatically estimating the confidence values of students' knowledge in generated conceptual models. Journal of Computers, Academy Publishers, Finland. ISSN 1796-203X, volume 2, number 5, July 2007, pages 17-26.

D. Perez, I. Pascual, E. Alfonseca and P. Rodriguez, Automatically generated inspectable learning models for students. Presented at the International conference Artificial Intelligence for Education (AIED) and published in Frontiers in Artificial Intelligence and Education, IOS Press, The Netherlands. ISSN 0922-6389, volume 158, July 2007, pages 632-634.

E. Alfonseca, P. Rodriguez and D. Perez, An approach for automatic generation of adaptive hypermedia in education with multilingual knowledge discovery techniques. Computers & Education, Elsevier, United Kingdom ISSN 0360-1315, volume 49, pages 495-513, impact factor 1.085 according to the ISI Web of Knowledge 2006.

D. Perez, E. Alfonseca, P. Rodriguez and I. Pascual, Automatic generation of students' conceptual models from answers in plain text. Proceedings of the international conference User Modeling (UM), and published in Lecture Notes in Artificial Intelligence, Springer-Verlag, Germany. ISSN 0302-9743, volume 4511, pages 329-333.

D. Perez, E. Alfonseca and P. Rodriguez, ¿Pueden los ordenadores evaluar automáticamente preguntas abiertas? [Can computers automatically assess open-ended questions?] In Novatica, September-October 2006. ISSN 0211-2124, number 183, pages 50-53.

D. Perez, E. Alfonseca and P. Rodriguez, A free-text scoring system that generates conceptual models of the students knowledge with the aid of clarifying questions. International Workshop Semantic Web for E-learning (SWEL), Dublin (Ireland), June 2006, and published in Lecture Notes in Learning and Teaching. ISSN 1649-8623, pages 113-114.

D. Perez, I. Pascual, E. Alfonseca and P. Rodriguez, Automatic Identification of Terms for the Generation of Students Concept Maps. Proceedings of the international conference in Multimedia and Information technologies for the education (MICTE), Sevilla (Spain), November 2006. Published by Formatex, ISBN 978-84-690-2469-8, volume 3, pages 2007-2011.

D. Perez, E. Alfonseca, P. Rodriguez and I. Pascual, Willow: Automatic and adaptive assessment of students free-text answers. In proceedings of the 22nd International Conference of the Spanish Society of Natural Language Processing (SEPLN), Zaragoza (Spain), September 2006. Published in the SEPLN journal, ISSN 1135-5948, number 37, pages 367-368.

D. Perez, E. Alfonseca, M. Freire, P. Rodriguez, J. M. Guirao and A. Moreno-Sandoval, Automatic Generation of Students' Conceptual Models underpinned by Free-Text Adaptive Computer Assisted Assessment. In proceedings of the international conference on Advanced Learning Technologies (ICALT), IEEE Computer Society, Kerkrade (The Netherlands), July 2006, pages 280-284.

D. Perez, E. Alfonseca and P. Rodriguez, On the dynamic adaptation of Computer Assisted Assessment of free-text answers. Proceedings of the International Conference on Adaptive Hypermedia in Dublin (Ireland), June 2006. Published in LNCS 4018, Springer-Verlag, ISSN 0302-9743, pages 374-377.

D. Perez, A. Gliozzo, E. Alfonseca, C. Strapparava, B. Magnini and P. Rodríguez, About the effects of combining Latent Semantic Analysis with other Natural Language Processing techniques for free-text assessment. SIGNOS 38(59), pp. 325-343, 2005.

D. Pérez, O. Postolache, E. Alfonseca, D. Cristea and P. Rodríguez, About the effects of using Anaphora Resolution in Assessing Free-Text Student Answers. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP-2005, Borovets, Bulgaria, 2005 [pdf]

D. Pérez, E. Alfonseca and P. Rodríguez. Adapting the automatic assessment of free-text answers to the students. In the Proceedings of the 9th International Computer-Assisted Assessment (CAA) Conference.

E. Alfonseca, R. Carro, M. Freire, A. Ortigosa, D. Pérez and P. Rodríguez. Authoring of Adaptive Computer-Assisted Assessment of Free-text Answers. ETS journal, Special Issue on Authoring of Adaptable and Adaptive Educational Adaptive Hypermedia. 2005.

D. Pérez, A. Gliozzo, C. Strapparava, E. Alfonseca, P. Rodríguez and B. Magnini, Automatic Assessment of Students' free-text Answers underpinned by the Combination of a BLEU-inspired algorithm and LSA. In proceedings of FLAIRS-2005 [pdf].

E. Alfonseca, R. Carro, M. Freire, A. Ortigosa, D. Pérez and M. Freire, Educational Adaptive Hypermedia meets Computer-Assisted Assessment. Second International Workshop on Authoring of Adaptive and Adaptable Educational Hypermedia (AH-2004) [pdf].

E. Alfonseca and D. Pérez, Automatic Assessment of Open-ended Questions with a BLEU-inspired Algorithm and Shallow NLP. Advances in Natural Language Processing. Lecture Notes in Computer Science 3230, pp. 25-35, 2004 [Abstract and full paper] [local pdf].

D. Pérez, E. Alfonseca and P. Rodríguez, Upper bounds and extension of the Bleu algorithm applied to assessing student essays. In Proceedings of the IAEA-2004 Conference, Philadelphia, 2004 [pdf].

D. Pérez, E. Alfonseca and P. Rodríguez, Application of the Bleu method for evaluating free-text answers in an e-learning environment. In Proceedings of the Language Resources and Evaluation Conference (LREC-2004), Lisbon, 2004 [pdf].

Others

E. Alfonseca. Writing a compiler's compiler with APL. In Proceedings of the APL-98 Conference, Rome, 1998. Published as E. Alfonseca, Writing a compiler's compiler with APL, APL Quote Quad (ACM SIGAPL), Vol. 29:3, p. 69-75, Mar. 1999.

M. Alfonseca, E. Alfonseca and J. de Lara, Compiling a simulation language in APL. In Proceedings of the APL-98 Conference, Rome, 1998. Published as M. Alfonseca, E. Alfonseca and J. de Lara. Compiling a simulation language in APL, APL Quote Quad (ACM SIGAPL), Vol. 29:3, p. 105-109, Mar. 1999.

Ph.D. Thesis

An approach for automatic generation of on-line information systems based on the integration of Natural Language Processing and Adaptive Hypermedia techniques

Abstract

The Internet has become established as a widely used means of conveying information. It was soon recognised that different people access the web with different needs, which motivated the appearance of web sites that provide different information and are structured in different ways depending on the user. Nowadays many web-based systems store user profiles containing some characteristics of the users. These profiles are used to decide which information will be shown to each visitor, and how it will be organised.

Moreover, different kinds of applications need to know different characteristics of the users. For instance, e-commerce applications use the shopping history and the user's tastes in order to suggest further products; on-line educational systems keep track of the concepts that have already been studied, and the tests that have been successfully solved by the student; and on-line information systems and retrieval applications have to know precisely the information needs of the user in order to provide the most relevant data. In the same way, the procedures for deciding the contents and structure of the web sites according to the user profiles vary across applications.

Even though there are applications for authoring web sites, constructing them is not yet particularly easy. One limitation of current authoring tools for on-line information systems is that the kinds of information stored in the user profiles and the rules for adaptation are usually restricted to a few pre-defined types. Most importantly, these tools usually require the web author to write all the particular chunks of text that will be presented to the different users. Therefore, the web author probably has to write as many different versions of the same texts as there are possible user profiles that affect the contents of the site.

This work describes a framework that combines techniques from different fields in order to create, in a fully automatic way, on-line information systems from linear texts in electronic format, such as textbooks. It borrows ideas from User Modelling and Adaptive Hypermedia for storing and updating the user profiles, and for changing the contents and the structure of the web site according to them. Natural Language Processing techniques are also applied to automatically gather information about the relevant terms found in the original texts, and to adapt the output contents of the site using automatic filtering and summarisation techniques. The architecture is divided into two steps: an off-line processing step, which collects information about the original linear text, and an on-line step, which executes when a user connects to the system with a web browser, and in which the contents and hyperlinks are generated.

The framework has been implemented as the Welkin system, which has been used to build three adaptive on-line information sites in a quick and easy way. Controlled experiments performed with real users have provided positive feedback on the implementation of the system.
