



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
An overview of the field of computational linguistics, which explores the intersection of human language and computer science. It covers key topics such as psycholinguistics, speech processing, lexical access, syntactic processing, and the computational analysis of language. The document delves into the various applications of computational linguistics, including speech recognition, speech synthesis, text-to-speech conversion, computational morphology, computational syntax, compositional semantics, computational sign language, and computational forensic linguistics. It also discusses the use of computers in language analysis, such as frequency analysis, concordances, collocations, computational lexicography, information retrieval, and summarization. The document highlights the importance of computational linguistics in bridging the gap between human language and technological advancements, enabling more efficient and effective communication between humans and machines.
Typology: Summaries
1 / 6
This page cannot be seen from the preview
Don't miss anything!
A. Psycholinguistics is the field of linguistics study in which researchers investigate the psychological processes involved in the use of language, including language comprehension, language (speech or sign) production, and first and second language acquisition. . B. THE HUMAN MIND AT WORK: HUMAN LANGUAGE PROCESSING Speaking and comprehending speech can be viewed as a speech chain, a kind of brain-to-brain linking, A spoken utterance starts as a message in the speaker’s brain/mind. It is put into linguistic form and interpreted as articulation commands, emerging as an acoustic signal. The signal is processed by the listener’s ear and sent to their brain/mind, where it is interpreted. This means that language processing is more than grammar alone, linguistic performance tries to detail the psychological mechanisms that work with the grammar to permit language production and comprehension. C. Comprehension One of the aims of psycholinguistics is to describe the processes people normally use in speaking and understanding language. The various breakdowns in performance, such as “tip-of-the-tongue” phenomena, speech errors, and failure to comprehend tricky sentences tell us a great deal about how language is processed. o The speech signal In linguistics, speech is a system of communication that uses spoken words (or sound symbols). i. The vibrations of our vocal cords cause variations of air pressure, and sounds we produce can be described in terms of: ii. Fundamental frequency (pitch): How fast the variations of air pressure occur. iii. Intensity: The magnitude of acoustic signal, which is perceived as loudness.
iv. The quality of a speech sound is determined by the shape of the vocal tract:the shape affects how the sound waves travel. D. SPEECH PERCEPTION AND COMPREHENSION Speech is a continuous signal. The speech signal can be broken into strings of: phonemes, syllables, morphemes, words, and phrases. a) The "segmentation problem" - > how do listeners carve up the continuous speech signal into meaningful units? Lexical access, stress, and intonation. b) The "lack of invariance problem" - > how do listeners recognize different speech sounds when they are used in different contexts and spoken by different people? Listeners can normalize their perceptions to account for rate of speech and speaker pitch differences. E. BOTTOM-UP AND TOP-DOWN MODELS a) Top-down processing: proceeding from semantic and syntactic information to the lexical information from the sensory input b) Listeners can predict that if a speaker says the then an NP is coming c) In experiments, listeners seem to make much use of top-down information d) Bottom-up processing moving from the sensory phonetic input to phonemes. Then morphemes, etc. up to semantic interpretation e) Listeners wait to construct an NP until they hear the followed by a noun F. LEXICAL ACCESS AND WORD RECOGNITION In order to discover more about lexical access or word recognition, psycholinguists have devised several experiments: a) Lexical decision - > Task of subjects in psycholinguistic experiments that involve people deciding whether or not a string of letters or sounds is a word. Frequently used words such as car are responded to more quickly than infrequent words such as fig. This leads researchers to believe that frequent words are more easily accessed in the lexicon than infrequent words b) Lexical access experiments show that people retrieve all the meanings of a word. G. SYNTACTIC PROCESSING
Computational linguistics is a subfield of linguistics and computer science that focuses on with the interactions of human language and computers. Computational linguistics includes the analysis of: a) Written texts and spoken discourse b) The translation of text and speech from one language into another c) The use of human (not computer) languages for communication between computers and people d) The modelling and testing of linguistic theories. M. COMPUTATIONAL PHONETICS AND PHONOLOGY Computational phonetics and phonology is concerned with processing speech. There are two sides of computational phonetics and phonology: a) Speech recognition - > Process of analysing the speech signal into its component phones and phonemes, and producing, in effect, a phonetic transcription of the speech. b) Speech synthesis - > Process of creating electronic signals that simulate the phones and prosodic features of speech and assemble them into words and phrases for output to an electronic speaker, or for further processing, as in a speech-generation application.
Computers must also be able to determine syntactic structure. A parser is a program that uses grammar to assign phrase structure to a string of words. a) A top-down parser proceeds by first consulting the grammar rules (use a grammar containing the rules S → NP VP, NP → Det N and so forth) and then examining the input string to see if the first word could begin an S. b) A bottom-up parser looks at the input string first and then finds phrasal categories. c) A transition network composed of nodes (circles) and arcs (arrows) may be used to model syntactic processing.
1. COMPOSITIONAL SEMANTICS Compositional semantics is concerned with 1) producing a semantic representation of the input in the computer and 2) producing natural language to represent meanings. 2. COMPOSITIONAL PRAGMATICS Computers use semantic and pragmatic knowledge to analyze structurally ambiguous sentences. 3. COMPUTATIONAL SIGN LANGUAGE a) Linguists at Boston University are currently working on computer algorithms that will recognize sign language as spoken language can be b) The signer stands in front of a camera and the computer recognizes the distinctive features of sign language such as hand shape, movement, and orientation. 4. COMPUTER MODELS OF GRAMMAR a) Computers can be programmed to model the grammar of language b) This forces linguists to be explicit in formulating the rules grammar c) If the program cannot generate a possible grammatical sentence, then there is an error in the grammar d) If the program generates an ungrammatical sentence, then there is an error in the grammar Q. TEXT AND SPEECH ANALYSIS FREQUENCY ANALYSIS, CONCORDANCES, AND COLLOCATIONS Computers can be used to: a) Do frequency analyses to reveal the most common words in written (the, of, and, to, a, in, that, is, was, he) and spoken (I, and, the, to, that, you, it, of, a, know) American English. b) Do concordances, which specify the location of any particular word and its context. c) Do collocation analyses, which reveals the occurrences of two or more words within a short space of each other in a corpus and provides evidence that the presence of one word in a text affects the occurrence of other words.