Dang, MN., Cao, TH., Lam, TD., Le, MH., Le, DN., Dinh, SD. (2026). Parallel Corpus Construction for Chinese and Vietnamese in Historical Texts. In: Nguyen, N.T., et al. Computational Intelligence in Engineering Science. ICCIES 2025. Communications in Computer and Information Science, vol 2585. Springer, Cham.

Abstract: This study builds a Vietnamese–ancient Chinese parallel corpus from the Đại Nam Thực Lục Tiền Biên and introduces a phrase-matching algorithm that surpasses BertAlign, offering a promising approach for historical text alignment.