Evaluation of Latent Semantic Analysis in Multilingual Information Retrieval

Authors

  • Priya Sethuraman
  • Muntather Muhsin Hassan
  • Roy P Veettil
  • Dr.A. Jayanthi
  • Dr.N. Kalyana Sundaram
  • Dr. Deepti Patnaik

DOI:

https://doi.org/10.51983/ijiss-2025.IJISS.15.3.18

Keywords:

Latent Semantic Analysis, Multilingual Information Retrieval, Cross-Lingual Retrieval, Natural Language Processing, Semantic Representation

Abstract

Multilingual Information Retrieval (MLIR) systems have become essential tools in a digitally integrated economy. Users require pertinent information in various languages and across linguistic frontiers. A technique rooted in linear algebra and statistical semantics known as Latent Semantic Analysis (LSA) offers a solution for revealing patterns buried within the data, which may cut across languages. In this paper, we investigate the efficiency of LSA in MLIR tasks with various language pairs compared to traditional vector space models and the machine translation approach. Using the Europarl and CLEF corpora, and employing mean average precision (MAP), precision at 10 (P@10), and normalized Discounted Cumulative Gain (DCG), we demonstrate that LSA facilitates reasonable cross-lingual alignment under specific conditions. Moreover, we assess the model's performance considering changes in the number of latent dimensions and various preprocessing techniques applied before the central processing.

Downloads

Published

30-09-2025

How to Cite

Sethuraman, P., Hassan, M. M., Veettil, R. P., Jayanthi, A., Kalyana Sundaram, N., & Patnaik, D. (2025). Evaluation of Latent Semantic Analysis in Multilingual Information Retrieval. Indian Journal of Information Sources and Services, 15(3), 160–169. https://doi.org/10.51983/ijiss-2025.IJISS.15.3.18

Most read articles by the same author(s)