Design of a Lexical Semantics Ontology in the MultiGenera and MultiComb Projects

Authors

  • Rosa Martín Gascueña UCM

DOI:

https://doi.org/10.17561/rilex.6.3.8083

Keywords:

Lexicography, Ontologies, Semantic Categories, Noun Phrase

Abstract

The aim of this paper is to present the design, development, and functionality of the ontology in the MultiGenera and MultiComb multilingual projects, (Spanish, German, and French) for the automatically generation of the noun phrases (FN) and its sentence contexts. It is important to highlight the primordial role of this ontology for the development of these, as it involved a formalisation of the fundamental properties of lexical items in accordance with valential grammar and lexicography. The ontology provides semantic tagging labels for the semantic classes that make up the syntagmatic environment of the twenty nouns, belonging to different areas of knowledge. Its purpose is to organise the lexical data for storage in the database and subsequent retrieval through the prototype applications Xera and Combinatoria, which automatically generate the nominal structures of the nouns and their combination in sentence contexts.

The ontology development process is divided into two phases: in the first phase, the same semantic classes as in Portlex to classify the actants of each noun used for the automatic generation of FN in the Xera 1. 0. in the three languages. In the second phase, the ontology is reconfigured by automatically connecting to the WordNet database synsets and other related ontologies to increase the lexical database of the projects. The latter ontology presents greater granularity, which favours programming for the generation of name environments and their sentence combination. The application of the ontology in language learning is shown by the user-friendly interface in various applications to select the possible lexical items combinations in FN. It should be noted that the ontology is still under development and is available online.

Downloads

Download data is not yet available.

References

Bentivogli, L., Forner, P., Magnini, B. & Pianta, E. (2004). Revising the Wordnet Domains Hierarchy: Semantics, Coverage and Balancing. En G. Sérasset, S. Armstrong, C. Boitet, A. Popescu-Belis & D. Tufis (eds.), Proceedings of the Workshop on Multilingual Linguistic Ressources MLR2004. Universidad de Ginebra. https://doi.org/10.3115/1706238.1706254

Borst, W. N. (1997). Construction of Engineering Ontologies for Knowledge Sharing and Reuse [Tesis doctoral. Universidad de Twente]. Centre for Telematics and Information Technology (CTIT). https://research.utwente.nl/en/publications/construction-of-engineering-ontologies-for-knowledge-sharing-and-

Bosque-Gil, J. (2019). Linguistic Linked Data for Lexicography [Tesis doctoral. Universidad Politécnica de Madrid]. Archivo Digital UPM. https://doi.org/10.20868/UPM.thesis.57887.

Bosque-Gil, J. & García, J. (eds.) (2019). The Ontolex Lemon Lexicography Module Specification. Ontology Lexica under the W3C Community Final Specification Agreement (FSA). https://www.w3.org/2019/09/lexicog/

Brown, C. (2002). Paradigmatic relations of inclusion and identity I: Meronymy. En A. Cruse, F. Hundsnurscher, J. Michael & P. R. Lutzeier (eds.), Lexikologie: Ein internazionales Handbuch zur Natur und Struktur von Wörtern und Wortschätzen 1. Halbband. Lexicology: An International Handbook on the Nature and Structure of Words and Vocabularies, vol. I. (pp. 480–485). de Gruyter.

Codina, L. & Pedraza-Jiménez, R. (2011). Tesauros y ontologías en sistemas de información documental. El profesional de la información, 20(5), 555–563. https://doi.org/10.3145/epi.2011.sep.10

Combinatoria = Domínguez Vázquez, M.ª J. (dir.), Valcárcel Riveiro, C., Bardanca Outeiriño, D., Calañas Continente, J. A., Catalá Torres, N., López Iglesias, N., Martín Gascueña, R., Mirazo Balsa, M., Sanmarco Bande, M. T. & Pino Serrano, L. (2020). Combinatoria. Prototipo online para la generación biargumental de la frase nominal en alemán, español y francés. Universidade de Santiago de Compostela. Consultado el 30 de mayo de 2023. http://portlex.usc.gal/combinatoria/usuario

Croft, W. & Cruse, D. A. (2004). Cognitive Linguistic. Cambridge University Press. https://doi.org/10.1017/CBO9780511803864

Cruse, A. (1986). Lexical Semantics. Cambridge University Press.

Cruse, A. (2002). Descriptive models for sense relations II: Cognitive semantics. En A. Cruse, F. Hundsnurscher, J. Michael & P. R. Lutzeier (eds.), Lexikologie: Ein internazionales Handbuch zur Natur und Struktur von Wörtern und Wortschätzen 1. Halbband. Lexicology: An International Handbook on the Nature and Structure of Words and Vocabularies, vol. I. (pp. 542–549). de Gruyter. https://doi.org/10.1515/9783110113082.1.15.542

Cruse, D. A. (2004). Meaning in Language: An Introduction to Semantics and Pragmatics. Oxford University Press.

Domínguez Vázquez, M.ª J. (2011). Kontrastive Grammatik und Lexikographie: spanisch-deutsches Wörterbuch zur Valenz des Nomens. IUDICIUM Verlag GmbH München.

Domínguez Vázquez, M.ª J., Solla Portela, M. A. & Valcárcel Riveiro, C. (2019). Resource Interoperability: Exploiting Lexicographic Data to Automatically Generate Dictionary Examples. En I. Kosem, T. Z. Kuhn, M. Correia, J. P. Ferreira, M. Jansen, I. Pereira, J. Kallas, M. Jakubíček, S. Krek & C. Tiberius (eds.), Electronic Lexicography in the 21st Century: Smart Lexicography. Proceedings of the eLex 2019 (pp. 51–71). Lexical Computing.

Domínguez Vázquez, M.ª J., Valcárcel Riveiro, C. & Lindemann, D. (2018). Multilingual Generation of Noun Valency Patterns for Extracting Syntactic-Semantical Knowledge from Corpora (MultiGenera). En J. Čibej, V. Gorjanc, I. Kosem & S. Krek (eds.), XVIII EURALEX International Congress. Lexicography in Global Contexts (pp. 847–854). Ljubljana University Press.

Gangemi, A., Navigli, R. & Velardi, P. (2003). The OntoWordNet Project: Extension and Axiomatization of Conceptual Relations in WordNet. En R. Meersman, Z. Tari & C. Schmidt (eds.), On the Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE (pp. 820–838). Springer. https://doi.org/10.1007/978-3-540-39964-3_52

Gómez Guinovart, X. & Solla Portela, M. A. (2018). Building the Galician wordnet: methods and applications. Language Resources and Evaluation, 52(1), 317–339. https://doi.org/10.1007/s10579-017-9408-5

Gruber, T. (1993). A Translation Approach to Portable Ontology Specifications. Knowledge Acquisition, 5(2), 199–220. https://doi.org/10.1006/knac.1993.1008

Guarino, N., Oberle, D. & Staab S. (2009). What Is an Ontology? En S. Steffen & R. Studer (eds.), Handbook on Ontologies. International Handbooks on Information Systems (pp. 1–17). Springer. https://doi.org/10.1007/978-3-540-92673-3_0

Lakoff, G. (1987). Women, Fire and Dangerous Things: what Categories Reveal about the Mind. The University of Chicago Press. https://doi.org/10.7208/chicago/9780226471013.001.0001

Lenat, D. & Guha, R. V. (1990). CYC: A Mid-Term Report. AI Magazine, 11(3), 32–59.

Lyons, J. (1977). Semántica. Editorial Teide. https://doi.org/10.1017/CBO9781139165693

Martin-Gascueña, R. (2013). La hiponimia en un área conceptual. Revista Pragmalingüística, 21, 86–106 http://revistas.uca.es/index.php/pragma/issue/view/139/showToc

Martin, P. (2003). Correction and Extension of WordNet 1.7 for Knowledge-based Applications. En B. Ganter, A. Moor & W. Lex (eds.), Conceptual Structures for Knowledge Creation and Communication (pp. 160–173). Springer. https://doi.org/10.1007/978-3-540-45091-7_11

Masolo, C., Borgo, S., Gangemi, A., Guarino, N. & Oltramari, A. (2003). DOLCE: Descriptive Ontology for Linguistic and Cognitive Engineering https://www.istc.cnr.it/it/content/dolce-descriptive-ontology-linguistic-and-cognitive-engineering

Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D. & Miller, J. (1990). Introduction to WordNet: An on-line Lexical Database. International Journal of Lexicography, 3(4), 235–244. https://doi.org/10.1093/ijl/3.4.235

Ontología léxica = Domínguez Vázquez, M.ª J., Valcárcel Riveiro, C. & Bardanca Outeiriño, D. (2021). Ontología léxica. Universidade de Santiago de Compostela. Consultado el 30 de mayo de 2023 http://portlex.usc.gal/ontologia/

Pease, A., Niles, I. & Li, J. (2002). The Suggested Upper Merged Ontology: A Large Ontology for the Semantic Web and its Application. Working Notes of the AAAI-2002 Workshop on Ontologies and the Semantic Web. July-August. Edmonton.

Pianta, E., Bentivogli, L. & Girardi, C. (2002). MultiWordNet Developing an aligned multilingual database. Proceedings of the 1st International WordNet Conference (pp. 293–302).

Portlex = Domínguez Vázquez, M.ª J. (dir.), Valcárcel Riveiro, C., Mirazo Balsa, M., Sanmarco Bande, M.ª T., Simões, A. & Vale, M. J. (2018). Portlex. Diccionario multilingüe de la valencia del nombre. Universidade de Santiago de Compostela. Consultado el 30 de mayo de 2023. http://portlex.usc.gal/portlex/

Pustejovsky, J. (1995) The Generative Lexicon. The MIT Press Cambridge.

Rodríguez, H., Climent, S., Vossen, P., Bloksma, L., Peters, W., Alonge, A., Bertagna, F. & Roventini, A. (1998). The Top-Down Strategy for Building EuroWordNet: Vocabulary Coverage, Base Concepts and Top Ontology. P. Vossen (ed.) EuroWordNet: A multilingual database with lexical semantic networks (pp. 45–80). Springer. https://doi.org/10.1007/978-94-017-1491-4_3

Rosch, E. (1978). Principles of Categorization. En B. Lloyd & E. Rosch (eds.), Cognition and categorization (pp. 27–48). Lawrence Erlbaum.

Solla Portela, M. A. & Gómez Guinovart, X. (2015). Galnet o WordNet do galego. Aplicacións lexicolóxicas e terminolóxicas. Revista galega de filoloxía, 16, 169–201. https://doi.org/10.17979/rgf.2015.16.0.1383

Studer, R., Benhjamins, V. R. & Fensel, D. (1998). Knowledge Engineering: Principles and methods. Data & Knowledge Engineering, 25(1-2), 161–197. https://doi.org/10.1016/S0169-023X(97)00056-6

Vossen, P. J. T. M. (1998). EuroWordnet: Building a Multilingual Database with Word Nets for European Languages. ELRA Newsletter, 3(1), s.p. https://doi.org/10.1007/978-94-017-1491-4_1

WordNet = The Trustees of Princeton University (2023). WordNet. Princeton University. Consultado el 30 de mayo de 2023. https://wordnet.princeton.edu/

Xera = Domínguez Vázquez, M.ª J. (dir.), Valcárcel Riveiro, C., Bardanca Outeiriño, D., Calañas Continente, J. A., Catalá Torres, N., López Iglesias, N., Martín Gascueña, R., Mirazo Balsa, M., Sanmarco Bande, M.T. & Pino Serrano, L. (2020). Xera. Prototipo online para la generación automática monoargumental de la frase nominal en alemán, español y francés. Universidade de Santiago de Compostela. Consultado el 30 de mayo de 2023. http://portlex.usc.gal/combinatoria/usuario

Published

2023-12-18

How to Cite

Martín Gascueña, R. (2023). Design of a Lexical Semantics Ontology in the MultiGenera and MultiComb Projects. RILEX. Revista Sobre Investigaciones léxicas, 6(3), 77-106. https://doi.org/10.17561/rilex.6.3.8083