Hatier, S., Augustyn, M., Yan, R., Tran, T. T. H., Tutin, A., & Jacques, M. - P. (2016). French cross-disciplinary scientific lexicon: extraction and linguistic analysis. Dans T. Margalitadze & G. Meladze (éD.), Proceedings Of The Xvii Euralex International Congress Lexicography & Linguistic Diversity (p. 355–365). Présenté à XVII EURALEX International congress, Tbilisi: Ivane Javakhishvili Tbilisi State University.
Résumé : This paper presents the work we carried out to extract and structure a specialized lexicon based on a corpus of French scientific articles in the fields of humanities and social sciences. The characteristics of the targeted lexicon may be summarized as follows: it is not domain-related inasmuch as it is shared by various disciplines; it serves to express the specific operations, naming the objects and exposing the results of research processes. In this view, the targeted lexicon studies here is genre-related. We designed this cross-disciplinary scientific lexicon (CSL) as a resource for several purposes: it may serve natural language processing, e.g. as a stoplist for automatic terms identification, as well as foreign language teaching. Indeed, students and scholars in the sciences need to acquire familiarity with the rhetoric of the research article, thus needing to master these words. We present here the two-stage creation of this lexicon: first, it was semi-automatically extracted from a corpus of 500 research articles spanning ten disciplines. Second, it was manually structured to reflect the semantics and rhetoric of science. This structure takes into account the lexico-syntactic properties of CSL nouns, adjectives, verbs and adverbs. The resource will be freely available for academic purposes.