Comparable corpora are key translation resources for languages and domains with limited linguistic resources. Existing approaches for building comparable corpora are mostly based on ranking candidate documents in the target language for each source document using a cross-lingual retrieval model. These approaches also exploit other evidence of document similarity, such as proper names and publication dates, to build more reliable alignments. However, the weight of each type of evidence in the scores of candidate target documents is determined heuristically. In this paper, we employ a learning to rank method for ranking candidate target documents with respect to each source document. The ranking model is constructed by defining each type of evidence for the similarity of bilingual documents as a feature whose weight is learned automatically. Learning feature weights can significantly improve the quality of alignments, because the reliability of features depends on the characteristics of both the source and target languages of a comparable corpus. We also propose a method for generating appropriate training data for the task of building comparable corpora. We employed the proposed learning-based approach to build a multi-domain English–Persian comparable corpus covering twelve different domains obtained from the Open Directory Project. Experimental results show that the created alignments have a high degree of comparability. Comparison with existing approaches for building comparable corpora shows that our learning-based approach improves both the quality and coverage of alignments.
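To make the idea of learning evidence weights concrete, the following is a minimal sketch (not the authors' implementation) of a pairwise learning-to-rank setup: each similarity cue is a feature, and weights are fit so that true target-language alignments outscore non-aligned candidates. The specific features (cross-lingual retrieval score, proper-name overlap, date proximity), the toy data, and the use of a logistic-loss pairwise transform are illustrative assumptions.

```python
# Sketch of a feature-weighted ranker for candidate target documents.
# Feature names and toy values are assumptions for illustration only.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Features per candidate target document for one source document:
# [cross-lingual retrieval score, proper-name overlap, publication-date proximity]
candidates = np.array([
    [0.82, 0.60, 0.90],   # the correct alignment
    [0.75, 0.10, 0.40],
    [0.30, 0.05, 0.80],
])
labels = np.array([1, 0, 0])  # 1 = aligned, 0 = not aligned

# Pairwise transform (RankSVM-style, with logistic loss here): learn weights
# so that aligned candidates score higher than non-aligned ones.
pos, neg = candidates[labels == 1], candidates[labels == 0]
diffs = np.vstack([p - n for p in pos for n in neg] +
                  [n - p for p in pos for n in neg])
pair_labels = np.array([1] * (len(pos) * len(neg)) + [0] * (len(pos) * len(neg)))

ranker = LogisticRegression().fit(diffs, pair_labels)
weights = ranker.coef_.ravel()     # learned importance of each evidence type
scores = candidates @ weights      # combined score for each candidate
print(np.argsort(-scores))         # candidate indices, best-ranked first
```

In practice the training pairs would come from known aligned/non-aligned document pairs (for example, the training data generation method proposed in the paper), and the learned weights replace the heuristic combination of evidence used by earlier approaches.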