Pesquisa | Portal Regional da BVS

Untangling the evolution of body-part terminology in Pano: conservative versus innovative traits in body-part lexicalization.

Zariquiey, Roberto; Vera, Javier; Greenhill, Simon J; Valenzuela, Pilar; Gray, Russell J; List, Johann-Mattis.

Interface Focus ; 13(1): 20220053, 2023 Feb 06.

Artigo em Inglês | MEDLINE | ID: mdl-36659979

RESUMO

Although language-family specific traits which do not find direct counterparts outside a given language family are usually ignored in quantitative phylogenetic studies, scholars have made ample use of them in qualitative investigations, revealing their potential for identifying language relationships. An example of such a family specific trait are body-part expressions in Pano languages, which are often lexicalized forms, composed of bound roots (also called body-part prefixes in the literature) and non-productive derivative morphemes (called here body-part formatives). We use various statistical methods to demonstrate that whereas body-part roots are generally conservative, body-part formatives exhibit diverse chronologies and are often the result of recent and parallel innovations. In line with this, the phylogenetic structure of body-part roots projects the major branches of the family, while formatives are highly non-tree-like. Beyond its contribution to the phylogenetic analysis of Pano languages, this study provides significative insights into the role of grammatical innovations for language classification, the origin of morphological complexity in the Amazon and the phylogenetic signal of specific grammatical traits in language families.

A comparative wordlist for the languages of The Gran Chaco, South America.

Brid, Nicolás; Messineo, Cristina; List, Johann-Mattis.

Open Res Eur ; 2: 90, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-37645292

RESUMO

Home to more than twenty indigenous languages belonging to six linguistic families, the Gran Chaco has raised the interest of many linguists from different backgrounds. While some have focused on finding deeper genetic relations between different language groups, others have looked into similarities from the perspective of areal linguistics. In order to contribute to further research of areal and genetic features among these languages, we have compiled a comparative wordlist consisting of translational equivalents for 326 concepts - representing basic and ethnobiological vocabulary - for 26 language varieties. Since the data were standardized in various ways, they can be analyzed both quantitatively and qualitatively. In order to illustrate this in detail, we have carried out an initial computer-assisted analysis of parts of the data by searching for shared lexicosemantic patterns resulting from structural rather than direct borrowings.

Using lexical language models to detect borrowings in monolingual wordlists.

Miller, John E; Tresoldi, Tiago; Zariquiey, Roberto; Beltrán Castañón, César A; Morozova, Natalia; List, Johann-Mattis.

PLoS One ; 15(12): e0242709, 2020.

Artigo em Inglês | MEDLINE | ID: mdl-33296372

RESUMO

Lexical borrowing, the transfer of words from one language to another, is one of the most frequent processes in language evolution. In order to detect borrowings, linguists make use of various strategies, combining evidence from various sources. Despite the increasing popularity of computational approaches in comparative linguistics, automated approaches to lexical borrowing detection are still in their infancy, disregarding many aspects of the evidence that is routinely considered by human experts. One example for this kind of evidence are phonological and phonotactic clues that are especially useful for the detection of recent borrowings that have not yet been adapted to the structure of their recipient languages. In this study, we test how these clues can be exploited in automated frameworks for borrowing detection. By modeling phonology and phonotactics with the support of Support Vector Machines, Markov models, and recurrent neural networks, we propose a framework for the supervised detection of borrowings in mono-lingual wordlists. Based on a substantially revised dataset in which lexical borrowings have been thoroughly annotated for 41 different languages from different families, featuring a large typological diversity, we use these models to conduct a series of experiments to investigate their performance in mono-lingual borrowing detection. While the general results appear largely unsatisfying at a first glance, further tests show that the performance of our models improves with increasing amounts of attested borrowings and in those cases where most borrowings were introduced by one donor language alone. Our results show that phonological and phonotactic clues derived from monolingual language data alone are often not sufficient to detect borrowings when using them in isolation. Based on our detailed findings, however, we express hope that they could prove to be useful in integrated approaches that take multi-lingual information into account.

Assuntos

Idioma , Modelos Teóricos , Entropia , Cadeias de Markov , Redes Neurais de Computação , Fonética , Análise de Regressão , Reprodutibilidade dos Testes

RESUMO

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA