En-Ar Bilingual word Embeddings without Word Alignment: Factors Effects

Simon O'Keefe, Taghreed Alqaisi

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper introduces the first attempt to investigate morphological segmentation on En-Ar bilingual word embeddings using bilingual word embeddings model without word alignment (BilBOWA). We investigate the effect of sentence length and embedding size on the learning process. Our experiment shows that using the D3 segmentation scheme improves the accuracy of learning bilingual word embeddings up to 10 percentage points compared to the ATB and D0 schemes in all different training settings.
Original languageEnglish
Pages97-107
Number of pages11
Publication statusPublished - 2019

Bibliographical note

© 2019 Association for Computational Linguistics. Uploaded in accordance with the publisher’s self-archiving policy. Further copying may not be permitted; contact the publisher for details

Cite this