Phonetics and Articulatory Phonetics Notes

2024/10/7 10:22:50

Glossary (English term, Chinese equivalent)

articulators 发音器官
cochlea 耳蜗
consonants 辅音
dialect 方言
eardrum 鼓膜
endolymph 内淋巴
Epiglottis 会厌
formants 共振峰
fricative 摩擦音
Larynx 喉
meatus 耳道
monosyllabic 单音节
pinna 耳廓
pitch 音调
pitch harmonics 音高泛音
phonation 发声
plosive 爆破音
perilymph 外淋巴
polysyllabic 多音节
Phonetic Transcription 音标
Phonology 音位学, 音韵学
Prosody 韵律学
resonator 谐振器
Response Magnitude 响应幅度
Spectrum 频谱
Syllable 音节
vibration 振动
Vocalisation 发声
Vocal Tract 声道
vocal folds/cords 声带
voiced 浊音
vowels 元音
unvoiced 清音
utterances 话语

L10 Speaking

Voice Source

Air pressure from the lungs builds up behind closed ‘vocal folds’ (often called ‘vocal cords’)

The vocal folds are repeatedly forced apart and pulled together again, producing a series of small pulses of air. This modulation of the airstream is known as phonation

The tension in the muscles attached to the vocal folds determines their rate of vibration and hence the ‘fundamental frequency’ (FX or F0) of the speech waveform

The fundamental frequency contributes to the perceived pitch of the voice

Because the vibration is not a pure sine wave, there is energy at frequencies that are integer multiples of the fundamental frequency (known as the pitch harmonics)
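The relationship between the fundamental frequency and its harmonics can be sketched in a few lines of Python. The function name and the example values (F0 = 100 Hz, upper limit 500 Hz) are illustrative assumptions, not taken from the lecture.

```python
def harmonic_frequencies(f0_hz, max_hz):
    """Return the harmonic frequencies k * f0 (k = 1, 2, ...) up to max_hz."""
    if f0_hz <= 0:
        raise ValueError("f0_hz must be positive")
    count = int(max_hz // f0_hz)  # number of harmonics that fit below max_hz
    return [k * f0_hz for k in range(1, count + 1)]


# Hypothetical voice with F0 = 100 Hz: energy sits at 100, 200, 300, ... Hz.
print(harmonic_frequencies(100.0, 500.0))  # [100.0, 200.0, 300.0, 400.0, 500.0]
```

Raising F0 spreads the harmonics further apart; lowering it packs them closer together.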

Voice Filter

The vocal tract forms a resonator with a complex shape

Resonances are known as formants

Speech is produced by using the articulators to change the shape of the vocal tract, hence modifying its resonant characteristics

Different configurations of the vocal tract enhance some of the pitch harmonics and suppress (damp) others

The principal articulator is the tongue, but the jaw, lips, soft palate and teeth are also involved
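A minimal sketch of this source-filter idea: each harmonic of the voice source is scaled by the gain of one or more resonances. The second-order resonance curve used here is a textbook idealisation, and the formant centre frequency and bandwidth (500 Hz, 100 Hz) are hypothetical values chosen only for illustration.

```python
import math


def resonator_gain(f_hz, centre_hz, bandwidth_hz):
    """Magnitude response of an idealised second-order resonance."""
    q = centre_hz / bandwidth_hz  # quality factor: a narrower band gives a sharper peak
    ratio = f_hz / centre_hz
    return 1.0 / math.sqrt((1.0 - ratio ** 2) ** 2 + (ratio / q) ** 2)


def shape_harmonics(f0_hz, n_harmonics, formants):
    """Return (frequency, gain) for each harmonic after all resonances are applied."""
    shaped = []
    for k in range(1, n_harmonics + 1):
        f = k * f0_hz
        gain = 1.0
        for centre_hz, bandwidth_hz in formants:
            gain *= resonator_gain(f, centre_hz, bandwidth_hz)
        shaped.append((f, gain))
    return shaped


# Hypothetical single formant at 500 Hz, excited by a 100 Hz voice source.
for freq, gain in shape_harmonics(100.0, 10, [(500.0, 100.0)]):
    print(f"{freq:6.0f} Hz  gain {gain:5.2f}")
```

The harmonic closest to the formant is enhanced most, while harmonics far from it are damped. Changing the formant list stands in for changing the shape of the vocal tract.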

The Excitation Spectrum

[Figure: excitation spectrum (image not available)]

The Speech Spectrum

[Figure: speech spectrum (image not available)]

Sound Source

exciting a resonance (e.g. a whistle)
vibrating an articulator (e.g. the tongue)
releasing a blockage (e.g. the lips)
• A voiced sound is one in which the vocal cords are vibrating
• An unvoiced sound is one in which the vocal cords are not vibrating
• A fricative sound results from a turbulent air flow at a constriction
• A plosive sound occurs after a blockage is released

L11 Hearing

The main function of the human ear is frequency analysis

The main percepts are …
– pitch
– loudness
– timbre

Outer Ear

The pinna protects the entrance to the ear canal, and its shape makes it directionally sensitive at high frequencies

The external canal (the meatus) is a tube (~2.7 cm long, ~0.7 cm in diameter) that leads from the pinna to the middle ear

The meatus terminates at the cone-shaped tympanic membrane (eardrum)

Sound waves entering the ear impinge upon the eardrum and cause it to vibrate
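As a rough worked example, the ~2.7 cm canal length can be used to estimate where the ear canal resonates. The sketch below treats the meatus as a uniform tube closed at one end by the eardrum (a quarter-wavelength resonator) and assumes a speed of sound of 343 m/s. Both are simplifying assumptions that do not come from the notes.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at about 20 °C


def quarter_wave_resonance_hz(tube_length_m, speed_m_s=SPEED_OF_SOUND_M_S):
    """First resonance of a tube open at one end and closed at the other."""
    if tube_length_m <= 0:
        raise ValueError("tube_length_m must be positive")
    return speed_m_s / (4.0 * tube_length_m)


# Canal length of ~2.7 cm, as given in the notes.
print(round(quarter_wave_resonance_hz(0.027)))
```

Under these assumptions the first resonance comes out a little above 3 kHz, which sits inside the frequency range that matters most for speech.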

Middle Ear

The middle ear transforms the vibration of the eardrum into oscillations of the liquid in the inner ear by vibrating the oval window

The necessary impedance matching (between air and liquid) is achieved by a group of bones, the ossicles, acting as a system of mechanical levers

The pressure at the oval window is ~35x greater than that arriving at the eardrum

This mechanical amplification allows us to hear sounds 1000x weaker than would otherwise be audible

Muscles attached to the ossicles protect the inner ear from potential damage due to high sound levels
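The ~35x and ~1000x figures above can be related with a short calculation. The notes do not say how the two numbers are connected; the sketch below assumes the usual relation that sound intensity is proportional to pressure squared, under which a 35x pressure gain is roughly a 1000x intensity gain, or about 31 dB.

```python
import math


def pressure_ratio_to_db(ratio):
    """Convert a pressure (amplitude) ratio to decibels."""
    return 20.0 * math.log10(ratio)


def pressure_to_intensity_ratio(ratio):
    """Intensity is assumed proportional to pressure squared."""
    return ratio ** 2


gain = 35.0  # pressure gain quoted in the notes
print(f"{pressure_ratio_to_db(gain):.1f} dB")
print(f"intensity ratio ~{pressure_to_intensity_ratio(gain):.0f}x")
```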

Inner Ear

The transformation from mechanical vibrations to electrical nerve impulses (neural transduction) takes place in the snail-like structure of the cochlea

The cochlea is ~35 mm long and is filled with a colourless liquid called perilymph

The cochlea is divided into two regions along its length by a membrane structure called the cochlear partition (a channel filled with a liquid called endolymph)

The cochlear partition is bounded by the basilar membrane and Reissner’s membrane

Function of the Cochlea

• The mechanical properties of the basilar membrane determine how the cochlea responds to sound
• Vibrations entering at the oval window set up travelling waves which lead to peaks of energy at different places along the cochlea depending on the frequency
• The peak of vibration is nearest the oval window for high-frequency sounds
• The organ of Corti transforms the mechanical movements into electrochemical pulses by bending the outer hair cells (of which there are ~25,000)
• These actions are equivalent to a bank of bandpass filters

Frequency Selectivity

• Low-frequency sounds can mask higher-frequency sounds because of the overlap between auditory filters
• The bandwidth over which masking operates is termed the critical band
• The shapes of the auditory filters are revealed by deriving psychophysical tuning curves

Spectrogram vs. Cochleogram

• Spectrogram
– Plot of log energy across time and frequency (linear frequency scale)
• Cochleogram
– Cochlear filtering by the gammatone filterbank (or other models of cochlear filtering)
– Quasi-logarithmic frequency scale, and filter bandwidth is frequency-dependent
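To make the quasi-logarithmic scale and the frequency-dependent bandwidth concrete, here is a sketch that places filter centre frequencies evenly on an ERB-rate scale. It uses the Glasberg and Moore (1990) equivalent rectangular bandwidth formulas, which are a common choice for gammatone filterbanks but are an assumption here. The notes do not name a specific scale, and the channel count and frequency range are hypothetical.

```python
import math


def erb_bandwidth_hz(f_hz):
    """Equivalent rectangular bandwidth of the auditory filter centred at f_hz."""
    return 24.7 * (4.37 * f_hz / 1000.0 + 1.0)


def hz_to_erb_rate(f_hz):
    """Map frequency in Hz to the ERB-rate scale."""
    return 21.4 * math.log10(4.37 * f_hz / 1000.0 + 1.0)


def erb_rate_to_hz(erb_rate):
    """Inverse of hz_to_erb_rate."""
    return (10.0 ** (erb_rate / 21.4) - 1.0) * 1000.0 / 4.37


def centre_frequencies(low_hz, high_hz, n_channels):
    """Centre frequencies equally spaced on the ERB-rate scale (n_channels >= 2)."""
    if n_channels < 2:
        raise ValueError("n_channels must be at least 2")
    lo, hi = hz_to_erb_rate(low_hz), hz_to_erb_rate(high_hz)
    step = (hi - lo) / (n_channels - 1)
    return [erb_rate_to_hz(lo + i * step) for i in range(n_channels)]


# Hypothetical 8-channel filterbank covering 100 Hz to 8 kHz.
for cf in centre_frequencies(100.0, 8000.0, 8):
    print(f"centre {cf:7.1f} Hz  bandwidth {erb_bandwidth_hz(cf):6.1f} Hz")
```

The printed channels crowd together at low frequencies and widen towards high frequencies, in contrast to the evenly spaced bins of a linear-frequency spectrogram.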

Binaural Auditory Processing

Possible mechanisms:
– inter-aural time differences (ITD)
– inter-aural level differences (ILD)
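For a sense of scale, the inter-aural time difference can be estimated with a simple geometric model. The sketch below uses Woodworth's spherical-head approximation, ITD = (r / c)(θ + sin θ). The head radius of 8.75 cm and the speed of sound of 343 m/s are assumed typical values, and none of this model appears in the notes themselves.

```python
import math

HEAD_RADIUS_M = 0.0875      # assumed typical adult head radius
SPEED_OF_SOUND_M_S = 343.0  # assumed speed of sound in air


def itd_seconds(azimuth_deg, radius_m=HEAD_RADIUS_M, speed_m_s=SPEED_OF_SOUND_M_S):
    """Woodworth spherical-head ITD for a distant source, azimuth between 0 and 90 degrees."""
    if not 0.0 <= azimuth_deg <= 90.0:
        raise ValueError("azimuth_deg must be between 0 and 90")
    theta = math.radians(azimuth_deg)
    return (radius_m / speed_m_s) * (theta + math.sin(theta))


for angle in (0, 30, 60, 90):
    print(f"{angle:2d} deg  ITD {itd_seconds(angle) * 1e6:6.1f} microseconds")
```

Under these assumptions the ITD is zero for a source straight ahead and reaches roughly 0.65 ms for a source directly to one side.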

Hearing Aids

Basic components of hearing aids
• Microphone
• Amplifier (plus a DSP in digital hearing aids)
• Loudspeaker
• Battery

Types of hearing aids
[Figure: types of hearing aids (image not available)]

Cochlear Implants

A dialect is when different words are used
An accent is when different sounds are used
Accents and dialects reflect regional and/or social differences

L13 Sounds and Symbols

Writing allows information to be:
transmitted over space
stored over time

The Syllable

The syllable is the shortest stretch of speech

Syllables consist of
vowels: sound segments produced using an unobstructed configuration of the vocal tract
consonants: sound segments in which the airflow is at least partly obstructed
A simple CVC (consonant-vowel-consonant) syllable corresponds to the opening and closing of the mouth

Words can be
monosyllabic (having one syllable)
polysyllabic (having two or more syllables)
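The consonant/vowel structure described above can be illustrated with a small sketch that turns a list of segments into a C/V skeleton and counts syllables. The vowel inventory is a small hypothetical subset, and counting one syllable per run of vowels is a simplification (it ignores syllabic consonants, for example).

```python
# Hypothetical, deliberately small vowel inventory for illustration only.
VOWELS = {"i", "ɪ", "e", "æ", "ʌ", "ə", "ɑ", "ɒ", "ɔ", "u", "ʊ"}


def cv_skeleton(segments):
    """Map each segment to 'V' (vowel) or 'C' (consonant)."""
    return "".join("V" if seg in VOWELS else "C" for seg in segments)


def count_syllables(segments):
    """Count syllables as maximal runs of vowels (a simplifying assumption)."""
    count = 0
    in_nucleus = False
    for seg in segments:
        is_vowel = seg in VOWELS
        if is_vowel and not in_nucleus:
            count += 1
        in_nucleus = is_vowel
    return count


def is_monosyllabic(segments):
    return count_syllables(segments) == 1


print(cv_skeleton(["k", "æ", "t"]))            # CVC
print(count_syllables(["f", "ʌ", "z", "ɪ"]))   # 2
```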

Consonants and vowels

Phonetic Transcription

International Phonetic Association (IPA)

L14 Articulatory Phonetics

The Resonant Cavities
Nasal Cavity
Pharyngeal Cavity
Laryngeal Cavity
Oral Cavity
The Articulators
Alveolar Ridge
Hard Palate
Soft Palate (Velum)
Upper Teeth
Upper Lip
Lower Lip
Uvula
Vocal Cords

Speech sounds are classified in articulatory phonetics as follows
– vowels & consonants (i.e. all sounds)
• where the air stream comes from
• whether air is going in or out
– consonants
• whether the vocal cords are vibrating: voice
• where the constriction is: the place of articulation
• how the sound is made: the manner of articulation
– vowels
• the position of the tongue
• the shape of the lips

• Articulation refers to the constriction of the vocal tract during speech production
• Articulation involves the movement of an active articulator (e.g. the tongue) towards a passive articulator (e.g. the top of the mouth)
• The place of articulation refers to the physical location of the constriction in the vocal tract

Place of Articulation

  1. Bilabial
  2. Labiodental
  3. Dental
  4. Alveolar
  5. Postalveolar
  6. Retroflex
  7. Palatal
  8. Velar
  9. Uvular
  10. Pharyngeal
  11. Glottal

Manner of Articulation
The manner of articulation refers to the way in which the airstream is modified by the primary and secondary articulators
• Degrees of stricture …
– closure: articulators in firm contact (stops)
– narrowing: articulators close together but not touching (fricatives)
– approximation: wide gap between articulators (approximants)

Stops
– complete blockage of the airstream
– can be produced at many different places of articulation
– stops made with a velic closure are called oral stops
– stops made without a velic closure (and with airflow through the nasal cavity) are called nasal stops

– oral stops with a rapid release are called plosives
– stops with a slower release are called affricates

Voice, Place, Manner (VPM) labels are the standard method of specifying consonants
voiceless, voiced, bilabial, velar, alveolar, postalveolar, glottal, plosive, fricative, affricate, nasal, lateral-approximant
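VPM labels map naturally onto a lookup table. The sketch below covers a handful of English consonants with their standard voice, place and manner labels. The table is a partial illustration and the helper names are hypothetical.

```python
# Partial table: symbol -> (voice, place, manner)
VPM = {
    "p": ("voiceless", "bilabial", "plosive"),
    "b": ("voiced", "bilabial", "plosive"),
    "t": ("voiceless", "alveolar", "plosive"),
    "d": ("voiced", "alveolar", "plosive"),
    "k": ("voiceless", "velar", "plosive"),
    "g": ("voiced", "velar", "plosive"),
    "s": ("voiceless", "alveolar", "fricative"),
    "z": ("voiced", "alveolar", "fricative"),
    "h": ("voiceless", "glottal", "fricative"),
    "tʃ": ("voiceless", "postalveolar", "affricate"),
    "dʒ": ("voiced", "postalveolar", "affricate"),
    "m": ("voiced", "bilabial", "nasal"),
    "n": ("voiced", "alveolar", "nasal"),
    "l": ("voiced", "alveolar", "lateral-approximant"),
}

FEATURES = ("voice", "place", "manner")


def describe(symbol):
    """Return the VPM label of a consonant, e.g. 'voiceless bilabial plosive'."""
    return " ".join(VPM[symbol])


def differing_features(a, b):
    """Name the features (voice, place, manner) on which two consonants differ."""
    return [name for name, x, y in zip(FEATURES, VPM[a], VPM[b]) if x != y]


print(describe("p"))                 # voiceless bilabial plosive
print(differing_features("s", "z"))  # ['voice']
```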

Vowels

Vowel quality is governed by …
– vowel height: high / low
– vowel location: front / back
– lip position: rounded / unrounded

The height of a vowel refers to the relationship between the highest point of the tongue and the roof of the oral cavity
close / high: [i], [u]
open / low: [æ], [ɑ]
mid ([ə]) / half-close ([e]) / half-open ([ʌ])

The location of a vowel refers to the part of the tongue which is highest
front vowel [i]
back vowel [u]
central vowel
The mid-central vowel [ə] is called schwa

Vowel quality can be indicated by
– placing a dot on the vowel quadrilateral
– relating it to a set of language-independent cardinal vowels

Vowels seem to act as a carrier signal that is modulated by the consonants
For these reasons, vowel quality is very variable and can drift over time (hence giving rise to different historical and contemporary accents)

L15 Acoustic Phonetics

Articulatory Phonetics is a description of speech sounds in terms of the physical actions performed in their production
Acoustic Phonetics is a description of speech sounds in terms of the acoustic consequences of their production
Voice Onset Time (VOT)
Spectrogram
Coarticulation refers to the influence of one sound on another
To a speech technologist/engineer, this is a form of context-dependency
To a phonetician, it is a consequence of efficient motor planning

L16 Phonology and Prosody

Phonemic Contrast

The contrastive phones in a language are called phonemes

A minimal pair in English consists of the words
– “fussy” [fʌsɪ]
– “fuzzy” [fʌzɪ]
Therefore the [s] and [z] sounds are distinct phonemes in English
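The minimal-pair test can be written down directly: two transcriptions of equal length that differ in exactly one segment. The sketch below represents each transcription as a list of segments and handles substitutions only; the function names are hypothetical.

```python
def is_minimal_pair(a, b):
    """True if the two segment sequences differ in exactly one position."""
    if len(a) != len(b):
        return False
    return sum(1 for x, y in zip(a, b) if x != y) == 1


def contrasting_segments(a, b):
    """Return the (segment_a, segment_b) contrast of a minimal pair, else None."""
    if not is_minimal_pair(a, b):
        return None
    return next((x, y) for x, y in zip(a, b) if x != y)


fussy = ["f", "ʌ", "s", "ɪ"]
fuzzy = ["f", "ʌ", "z", "ɪ"]
print(is_minimal_pair(fussy, fuzzy))       # True
print(contrasting_segments(fussy, fuzzy))  # ('s', 'z')
```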

The Phoneme

Daniel Jones defined the phoneme as “a family of uttered sounds (segmental elements of speech) in a particular language which count for practical purposes as if they were one and the same”

The phonemic inventory of a language is found by exploring all of the possible minimal pairs

Phonemes are written using IPA symbols

A phonemic transcription is an idealised representation of an utterance (whereas a phonetic transcription represents the actual sounds used)

Phonological Processes
– Assimilation (feature spreading)
– Elision (deletion)
– Epenthesis (insertion)
– Reduction (neutralised vowel quality)

Prosody

Lexical stress refers to the prominence of syllables in words
The location of the stress can be marked using a diacritic, e.g. [ˈbɪləʊ] vs. [bɪˈləʊ]

About half of the world’s languages use the pitch pattern (called lexical tone) to distinguish between one word and another
Languages with moving pitch patterns (such as Modern Standard Chinese) are called contour tone languages
Languages with entirely level tones (such as Yoruba) are called register tone languages
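Classic example: 施氏食獅史 (“Lion-Eating Poet in the Stone Den”), a text in which every syllable is “shi” in Mandarin and only the tones differ

Intonation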

Pitch variation that doesn’t affect the meaning of the words, but does affect the meaning of an utterance is known as intonation
