Search results: 1-15 of 44 records found for “Theoretical Linguistics Corpus”. Query time: 0.093 seconds
Non-initial placement of agent constructions in spoken clauses: A corpus-based study of language production under time pressure
Sports commentary Rightward placement of subjects Japanese
2017/8/30
In this exploratory study we test the hypothesis that the retrieval from memory of proper noun Agents (PNAs) under processing pressure causes a greater proportion of such semantic arguments to be plac...
Frequential test of (S)OV as unmarked word order in Dutch and German clauses: A serendipitous corpus-linguistic experiment
OV markedness
2017/8/30
In a paper entitled “Against markedness (and what to replace it with)”, Haspelmath argues “that the term ‘markedness’ is superfluous”, and that frequency asymmetries often explain structural (un)marke...
Clitic placement in Serbian: Corpus and experimental evidence
Clitic placement Serbian Corpus experimental evidence
2015/8/7
The focus of our paper is the distribution of the so-called “second-position” clitics. Languages of this type fall into three classes: those in which the sentential position for clitics is after the firs...
Corpus-based Learning of Analogies and Semantic Relations
analogy metaphor semantic relations Vector Space Model cosine similarity noun-modifier pairs
2015/7/30
We present an algorithm for learning from unlabeled text, based on the Vector Space Model (VSM) of information retrieval, that can solve verbal analogy questions of the kind found in the SAT college e...
Generating data as a proxy for unavailable corpus data: the contextualized sentence completion task
Contextualized sentence completion task corpus datives genitives variety differences
2015/6/17
There is much interest in using large corpora to explore predictors of the probability of higher level linguistic structures, but suitable corpora are not available for all languages and their varieti...
Parse Selection on the Redwoods Corpus: 3rd Growth Results
Redwoods Corpus 3rd Growth Results
2015/6/12
This report details experimental results of using stochastic disambiguation models for parsing sentences from the Redwoods treebank (Oepen et al., 2002). The goals of this paper are two-fold: (i) to r...
Corpus-Based Induction of Syntactic Structure: Models of Dependency and Constituency
Corpus-Based Induction Syntactic Structure Dependency and Constituency
2015/6/12
We present a generative model for the unsupervised learning of dependency structures. We also describe the multiplicative combination of this dependency model with a model of linear constituency. The ...
A multimodal corpus of speech to infant and adult listeners
A multimodal corpus speech to infant adult listeners
2015/4/24
An audio and video corpus of speech addressed to 28 11-month-olds is described. The corpus allows comparisons between adult speech directed toward infants, familiar adults, and unfamiliar adult addre...
This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual Spanish (NCCSp). The corpus contains around 30 hours of recording...
A Blueprint for a Comprehensive Australian English Auditory-Visual Speech Corpus
Australian English Auditory Visual Speech Corpus
2015/4/7
Contemporary speech science is driven by the availability of large, diverse speech corpora. Such infrastructure underpins research and technological advances in various practical, socially beneficial ...
Clausal Coordinate Ellipsis and its Varieties in Spoken German: A Study with the TüBa-D/S Treebank of the VERBMOBIL Corpus
Clausal Coordinate Ellipsis Spoken German the TüBa-D/S Treebank VERBMOBIL Corpus
2015/4/7
Grammar rules for Clausal Coordinate Ellipsis (CCE) are based nearly exclusively on linguistic judgments (intuitions). For German, the extent to which grammar rules based on this type of empirical evid...
Preparing a Corpus of Dutch Spontaneous Dialogues for Automatic Phonetic Analysis
corpus creation conversational speech spontaneous dialogues reductions pronunciation variants automatic phonemic transcription
2015/4/3
This paper presents the steps needed to make a corpus of Dutch spontaneous dialogues accessible for automatic phonetic research aimed at increasing our understanding of reduction phenomena and the rol...
Comparing Linguistic Judgments and Corpus Frequencies as Windows on Grammatical Competence: A Study of Argument Linearization in German Clauses
Comparing Linguistic Judgments Corpus Frequencies Grammatical Competence Argument Linearization German Clauses
2015/4/3
When language users grammatically encode a communicative intention, they often avail of a range of linguistic means—each option yielding a member of a set of paraphrases. A rich source of paraphrases ...
A Flexible, Scalable Finite-State Transducer Architecture for Corpus-Based Concatenative Speech Synthesis
Transducer Architecture Concatenative Speech Synthesis
2015/3/11
Fundamental Frequency Modeling for Corpus-Based Speech Synthesis Based on a Statistical Learning Technique
Speech Synthesis Statistical Learning Technique
2015/3/11