LexFr: Adapting the LexIt Framework to Build a Corpus-Based French Subcategorization Lexicon

This paper introduces LexFr, a corpus-based French lexical resource built by adapting the LexIt framework, originally developed to describe the combinatorial potential of Italian predicates. As in the original framework, the behavior of a group of target predicates is characterized by statistical information, both syntactic (i.e., subcategorization frames) and semantic (i.e., selectional preferences), collectively known as distributional profiles, whose extraction process is mostly unsupervised. The first release of LexFr includes information for 2,493 verbs, 7,939 nouns and 2,628 adjectives. In these pages we describe the adaptation process and evaluate the final resource by comparing the information collected for 20 test verbs against the information available in a gold standard dictionary. In the best performing setting, we obtained 0.74 precision, 0.66 recall and 0.70 F-measure.
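To make the evaluation figures concrete, the following is a minimal sketch of how extracted subcategorization frames for one verb could be scored against a gold standard. It assumes the F-measure is the balanced harmonic mean of precision and recall, which the abstract does not state explicitly but which is consistent with the reported 0.74, 0.66 and 0.70. The frame labels and the function names `f_measure` and `evaluate_frames` are hypothetical illustrations, not LexFr's actual notation or code, and the abstract does not say how scores are aggregated over the 20 test verbs.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall (assumed)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def evaluate_frames(extracted: set[str], gold: set[str]) -> tuple[float, float, float]:
    """Score the frames extracted for one verb against the gold-standard frames."""
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall, f_measure(precision, recall)


# Hypothetical frame labels for a single verb (illustrative only).
extracted = {"subj#obj", "subj#comp-à", "subj#obj#comp-à", "subj"}
gold = {"subj#obj", "subj#obj#comp-à", "subj#comp-de"}

p, r, f = evaluate_frames(extracted, gold)
print(f"precision={p:.2f} recall={r:.2f} f-measure={f:.2f}")

# Sanity check on the figures reported in the abstract.
reported_f = f_measure(0.74, 0.66)
print(round(reported_f, 2))
```

Treating frames as sets keeps the sketch simple: precision is the share of extracted frames that appear in the gold standard, and recall is the share of gold frames that were extracted.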


Giulia Rambelli; Gianluca Lebani; Laurent Prévot; Alessandro Lenci

2016



Publication year: 2016
Volume title: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
Appears in types: 4.1 Paper in conference proceedings

Files in this record:

File	Size	Format
416_Paper.pdf (open access; type: publisher's version; license: Creative Commons)	187.83 kB	Adobe PDF

Documents in ARCA are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10278/3715943
