Segmentation into sentences of spontaneous oral corpus, the case of averbal syntactic units in a corpus of spoken French and Vietnamese
Author: Vu, Thi Hieu
Under the direction of: Florence Lefeuvre and Huy-Linh Dao
Sorbonne Nouvelle University Paris 3
Texte français
Keywords: Language science, Vietnam, Segmentation, Adverbial phrase, Discourse marker, Assent period, Predicative period, Reduplication, Linguistic corpus, Vietnamese - French.
Abstract
This thesis studies two interesting oral questions. First, how do you segment the stream of words into sentences ? Secondly, are there many averbal sentences in speaking because the time to react is very short ? This work was carried out from two oral corpora, in French and in Vietnamese. We have segmented them according to the syntactic approach. The frequent difficulties encountered in this thesis demonstrates that it is a delicate work. The segmentation results showed that, in speaking, verbal sentences are more frequent than averbal sentences. Regarding averbal sentences, they include several types of predicates which are distributed heterogeneously in the two languages. Finally, discursive markers, either verbal or averbal, are used to regularize discourse.