Automatic extraction of Word Combinations from corpora: evaluating methods and benchmarks