Your browser doesn't support javascript.
loading
Bangla_MER: A unique dataset for Bangla mathematical entity recognition.
Aurpa, Tanjim Taharat; Jeba, Samiha Maisha; Ahmed, Md Shoaib; Ullah, Mohammad Aman; Mehzabin, Maria; Anwar, Md Musfique.
Afiliación
  • Aurpa TT; Bangabandhu Sheikh Mujibur Rahman Digital University, Bangladesh.
  • Jeba SM; IUBAT - International University of Business Agriculture and Technology, Bangladesh.
  • Ahmed MS; Boise State University, United States.
  • Ullah MA; Jahangirnagar University, Bangladesh.
  • Mehzabin M; IUBAT - International University of Business Agriculture and Technology, Bangladesh.
  • Anwar MM; IUBAT - International University of Business Agriculture and Technology, Bangladesh.
Data Brief ; 54: 110407, 2024 Jun.
Article en En | MEDLINE | ID: mdl-38708312
ABSTRACT
Mathematical entity recognition is essential for machines to define and illustrate mathematical substance faultlessly and to facilitate sufficient mathematical operations and reasoning. As mathematical entity recognition in the Bangla language is novel, to our best knowledge, there is no available dataset exists in any repository. In this paper, we present state of the art Bangla mathematical entity dataset containing 13,717 observations. Each record has a mathematical statement, mathematical type and mathematical entity. This dataset can be utilized to conduct research involving the recognition of mathematical operators, renowned mathematical terms (such as complex numbers, real numbers, prime numbers, etc.), and operands as numbers. The findings mentioned above, and their combination are also feasible with a modest tweak to the dataset. Furthermore, we have structured this dataset in raw format and made a CSV file, incorporating three columns text, math entity, and label. As an outcome, researchers may easily handle the data, facilitating a variety of deep learning and machine learning explorations.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Data Brief Año: 2024 Tipo del documento: Article País de afiliación: Bangladesh Pais de publicación: Países Bajos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Data Brief Año: 2024 Tipo del documento: Article País de afiliación: Bangladesh Pais de publicación: Países Bajos