This commit is contained in:
superlishunqin 2026-02-03 06:41:34 +08:00
parent febab3e929
commit 90e5d901fa

177
README.md
View File

@ -8,7 +8,7 @@
|------|------| |------|------|
| **论文题目** | 融合情感的语音克隆技术研究及其在幼儿园语言教育中的应用 | | **论文题目** | 融合情感的语音克隆技术研究及其在幼儿园语言教育中的应用 |
| **学位类型** | 硕士学位论文 | | **学位类型** | 硕士学位论文 |
| **创建时间** | 2025年 | | **创建时间** | 2026年 |
## 📖 文献分类概览 ## 📖 文献分类概览
@ -16,9 +16,10 @@
| 章节 | 主题 | 文献数量 | | 章节 | 主题 | 文献数量 |
|------|------|----------| |------|------|----------|
| 第三章 [<sup>1</sup>](#第三章-语音克隆技术相关文献) | 语音克隆技术与语音信号处理 | 17篇 | | 第三章 | 语音克隆技术与语音信号处理 | 17篇 |
| 第四章 [<sup>2</sup>](#第四章-智能评估算法相关文献) | 智能评估算法与自适应学习理论 | 20篇 | | 第四章 | 智能评估算法与自适应学习理论 | 100篇 |
| 第五章 [<sup>3</sup>](#第五章-系统设计与实现相关文献) | 系统设计与教育应用 | 5篇 | | 第五章 | 系统设计与实现相关文献 | 5篇 |
| **总计** | **42篇** |
--- ---
@ -28,7 +29,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Du, Z., et al. (2024). **CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models**. *arXiv preprint arXiv:2412.10117*. | arXiv [<sup>4</sup>](https://arxiv.org/abs/2412.10117) | | 1 | Du, Z., et al. (2024). **CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models**. *arXiv preprint arXiv:2412.10117*. | arXiv [<sup>1</sup>](https://arxiv.org/abs/2412.10117) |
**引用句式**: "advanced generative large models represented by Alibaba's Bailian CosyVoice demonstrate outstanding performance in few-shot learning" **引用句式**: "advanced generative large models represented by Alibaba's Bailian CosyVoice demonstrate outstanding performance in few-shot learning"
@ -38,9 +39,9 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Loizou, P. C. (2013). **Speech Enhancement: Theory and Practice** (2nd ed.). *CRC Press*. | Routledge [<sup>5</sup>](https://www.routledge.com/Speech-Enhancement-Theory-and-Practice/Loizou/p/book/9781138075573) | | 1 | Loizou, P. C. (2013). **Speech Enhancement: Theory and Practice** (2nd ed.). *CRC Press*. | Routledge [<sup>2</sup>](https://www.routledge.com/Speech-Enhancement-Theory-and-Practice/Loizou/p/book/9781138075573) |
| 2 | Lu, Y., & Loizou, P. C. (2008). **A geometric approach to spectral subtraction**. *Speech Communication*, 50(6), 453-466. | PMC [<sup>6</sup>](https://pmc.ncbi.nlm.nih.gov/articles/PMC2516309/) | | 2 | Lu, Y., & Loizou, P. C. (2008). **A geometric approach to spectral subtraction**. *Speech Communication*, 50(6), 453-466. | PMC [<sup>3</sup>](https://pmc.ncbi.nlm.nih.gov/articles/PMC2516309/) |
| 3 | Upadhyay, N., & Karmakar, A. (2015). **Speech Enhancement using Spectral Subtraction-type Algorithms: A Comparison and Simulation Study**. *Procedia Computer Science*, 54, 574-583. | ScienceDirect [<sup>7</sup>](https://www.sciencedirect.com/science/article/pii/S1877050915013903) | | 3 | Upadhyay, N., & Karmakar, A. (2015). **Speech Enhancement using Spectral Subtraction-type Algorithms: A Comparison and Simulation Study**. *Procedia Computer Science*, 54, 574-583. | ScienceDirect [<sup>4</sup>](https://www.sciencedirect.com/science/article/pii/S1877050915013903) |
**引用句式**: **引用句式**:
- "The system employs spectral subtraction, which is computationally efficient and preserves voice characteristics well" - "The system employs spectral subtraction, which is computationally efficient and preserves voice characteristics well"
@ -52,10 +53,10 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Hasek, C. S., Singh, S., & Murry, T. (1980). **Acoustic attributes of preadolescent voices**. *Journal of the Acoustical Society of America*, 68(5), 1262-1265. | PubMed [<sup>8</sup>](https://pubmed.ncbi.nlm.nih.gov/2723142/) | | 1 | Hasek, C. S., Singh, S., & Murry, T. (1980). **Acoustic attributes of preadolescent voices**. *Journal of the Acoustical Society of America*, 68(5), 1262-1265. | PubMed [<sup>5</sup>](https://pubmed.ncbi.nlm.nih.gov/2723142/) |
| 2 | Keating, P., & Buhr, R. (1978). **Fundamental frequency in the speech of infants and children**. *Journal of the Acoustical Society of America*, 63(2), 567-571. | PubMed [<sup>9</sup>](https://pubmed.ncbi.nlm.nih.gov/670553/) | | 2 | Keating, P., & Buhr, R. (1978). **Fundamental frequency in the speech of infants and children**. *Journal of the Acoustical Society of America*, 63(2), 567-571. | PubMed [<sup>6</sup>](https://pubmed.ncbi.nlm.nih.gov/670553/) |
| 3 | Perry, T. L., Ohde, R. N., & Ashmead, D. H. (2001). **The acoustic bases for gender identification from children's voices**. *Journal of the Acoustical Society of America*, 109(6), 2988-2998. | PubMed [<sup>10</sup>](https://pubmed.ncbi.nlm.nih.gov/11425139/) | | 3 | Perry, T. L., Ohde, R. N., & Ashmead, D. H. (2001). **The acoustic bases for gender identification from children's voices**. *Journal of the Acoustical Society of America*, 109(6), 2988-2998. | PubMed [<sup>7</sup>](https://pubmed.ncbi.nlm.nih.gov/11425139/) |
| 4 | Robb, M. P., & Saxman, J. H. (1985). **Developmental trends in vocal fundamental frequency of young children**. *Journal of Speech and Hearing Research*, 28(3), 421-427. | PubMed [<sup>11</sup>](https://pubmed.ncbi.nlm.nih.gov/4046586/) | | 4 | Robb, M. P., & Saxman, J. H. (1985). **Developmental trends in vocal fundamental frequency of young children**. *Journal of Speech and Hearing Research*, 28(3), 421-427. | PubMed [<sup>8</sup>](https://pubmed.ncbi.nlm.nih.gov/4046586/) |
**引用句式**: **引用句式**:
- "children aged 3 to 6, whose vocal organs are not yet fully developed and who commonly exhibit physiological characteristics such as elevated fundamental frequency" - "children aged 3 to 6, whose vocal organs are not yet fully developed and who commonly exhibit physiological characteristics such as elevated fundamental frequency"
@ -67,8 +68,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Ramirez, J., Gorriz, J. M., & Segura, J. C. (2007). **Voice Activity Detection. Fundamentals and Speech Recognition System Robustness**. In *Robust Speech Recognition and Understanding*. IntechOpen. | IntechOpen [<sup>12</sup>](https://www.intechopen.com/chapters/15750) | | 1 | Ramirez, J., Gorriz, J. M., & Segura, J. C. (2007). **Voice Activity Detection. Fundamentals and Speech Recognition System Robustness**. In *Robust Speech Recognition and Understanding*. IntechOpen. | IntechOpen [<sup>9</sup>](https://www.intechopen.com/chapters/15750) |
| 2 | Zhang, X., & Wu, J. (2013). **Deep belief networks based voice activity detection**. *IEEE Transactions on Audio, Speech, and Language Processing*, 21(4), 697-710. | IEEE [<sup>13</sup>](https://ieeexplore.ieee.org/document/6362237) | | 2 | Zhang, X., & Wu, J. (2013). **Deep belief networks based voice activity detection**. *IEEE Transactions on Audio, Speech, and Language Processing*, 21(4), 697-710. | IEEE [<sup>10</sup>](https://ieeexplore.ieee.org/document/6362237) |
**引用句式**: "This research proposes a 'WebRTC + energy threshold' hybrid VAD algorithm" **引用句式**: "This research proposes a 'WebRTC + energy threshold' hybrid VAD algorithm"
@ -78,7 +79,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | European Broadcasting Union. (2020). **EBU R 128: Loudness normalisation and permitted maximum level of audio signals**. | EBU Tech [<sup>14</sup>](https://tech.ebu.ch/docs/r/r128.pdf) | | 1 | European Broadcasting Union. (2020). **EBU R 128: Loudness normalisation and permitted maximum level of audio signals**. | EBU Tech [<sup>11</sup>](https://tech.ebu.ch/docs/r/r128.pdf) |
**引用句式**: "the system adopts L_target = -16 dBFS as the normalization baseline" **引用句式**: "the system adopts L_target = -16 dBFS as the normalization baseline"
@ -88,8 +89,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Rabiner, L. R., & Schafer, R. W. (2010). **Theory and Applications of Digital Speech Processing**. *Pearson*. | Pearson [<sup>15</sup>](https://www.pearson.com/en-us/subject-catalog/p/theory-and-applications-of-digital-speech-processing/P200000003267) | | 1 | Rabiner, L. R., & Schafer, R. W. (2010). **Theory and Applications of Digital Speech Processing**. *Pearson*. | Pearson [<sup>12</sup>](https://www.pearson.com/en-us/subject-catalog/p/theory-and-applications-of-digital-speech-processing/P200000003267) |
| 2 | Harris, F. J. (1978). **On the use of windows for harmonic analysis with the discrete Fourier transform**. *Proceedings of the IEEE*, 66(1), 51-83. | IEEE [<sup>16</sup>](https://ieeexplore.ieee.org/document/1455106) | | 2 | Harris, F. J. (1978). **On the use of windows for harmonic analysis with the discrete Fourier transform**. *Proceedings of the IEEE*, 66(1), 51-83. | IEEE [<sup>13</sup>](https://ieeexplore.ieee.org/document/1455106) |
**引用句式**: **引用句式**:
- "Given that speech signals exhibit short-time stationarity within time scales of 10ms to 30ms" - "Given that speech signals exhibit short-time stationarity within time scales of 10ms to 30ms"
@ -101,8 +102,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Hu, Y., & Loizou, P. C. (2008). **Evaluation of objective quality measures for speech enhancement**. *IEEE Transactions on Audio, Speech, and Language Processing*, 16(1), 229-238. | IEEE [<sup>17</sup>](https://ieeexplore.ieee.org/document/4389058) | | 1 | Hu, Y., & Loizou, P. C. (2008). **Evaluation of objective quality measures for speech enhancement**. *IEEE Transactions on Audio, Speech, and Language Processing*, 16(1), 229-238. | IEEE [<sup>14</sup>](https://ieeexplore.ieee.org/document/4389058) |
| 2 | ITU-T. (2001). **Recommendation P.862: Perceptual evaluation of speech quality (PESQ)**. Geneva: ITU. | ITU [<sup>18</sup>](https://www.itu.int/rec/T-REC-P.862/en) | | 2 | ITU-T. (2001). **Recommendation P.862: Perceptual evaluation of speech quality (PESQ)**. Geneva: ITU. | ITU [<sup>15</sup>](https://www.itu.int/rec/T-REC-P.862/en) |
**引用句式**: "Speaker Similarity (SPK) as an objective metric and Similarity Mean Opinion Score (SMOS) as a subjective metric" **引用句式**: "Speaker Similarity (SPK) as an objective metric and Similarity Mean Opinion Score (SMOS) as a subjective metric"
@ -112,8 +113,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Crochiere, R. E., & Rabiner, L. R. (1983). **Multirate Digital Signal Processing**. *Prentice-Hall*. | ACM [<sup>19</sup>](https://dl.acm.org/doi/book/10.5555/578105) | | 1 | Crochiere, R. E., & Rabiner, L. R. (1983). **Multirate Digital Signal Processing**. *Prentice-Hall*. | ACM [<sup>16</sup>](https://dl.acm.org/doi/book/10.5555/578105) |
| 2 | Smith, J. O., & Gossett, P. (1984). **A flexible sampling-rate conversion method**. *Proceedings of ICASSP*, 9, 19.4.1-19.4.4. | Stanford CCRMA [<sup>20</sup>](https://ccrma.stanford.edu/~jos/resample/) | | 2 | Smith, J. O., & Gossett, P. (1984). **A flexible sampling-rate conversion method**. *Proceedings of ICASSP*, 9, 19.4.1-19.4.4. | Stanford CCRMA [<sup>17</sup>](https://ccrma.stanford.edu/~jos/resample/) |
**引用句式**: "the system must perform sampling rate conversion...adopts a resampling algorithm based on band-limited interpolation theory" **引用句式**: "the system must perform sampling rate conversion...adopts a resampling algorithm based on band-limited interpolation theory"
@ -125,9 +126,9 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | **On the Performance Similarity Between Exponential Moving Average and Discrete Linear Kalman Filter**. (2020). *IEEE*. | IEEE [<sup>21</sup>](https://ieeexplore.ieee.org/document/9318810/) | | 1 | **On the Performance Similarity Between Exponential Moving Average and Discrete Linear Kalman Filter**. (2020). *IEEE*. | IEEE [<sup>18</sup>](https://ieeexplore.ieee.org/document/9318810/) |
| 2 | **Adaptive Extended Kalman Filter using Exponencial Moving Average**. *IFAC-PapersOnLine*. | ScienceDirect [<sup>22</sup>](https://www.sciencedirect.com/science/article/pii/S2405896318327678) | | 2 | **Adaptive Extended Kalman Filter using Exponencial Moving Average**. *IFAC-PapersOnLine*. | ScienceDirect [<sup>19</sup>](https://www.sciencedirect.com/science/article/pii/S2405896318327678) |
| 3 | Kalman, R.E. (1960). **A New Approach to Linear Filtering and Prediction Problems**. *Journal of Basic Engineering*. | ASME [<sup>23</sup>](https://asmedigitalcollection.asme.org/fluidsengineering/article/82/1/35/397706/A-New-Approach-to-Linear-Filtering-and-Prediction) | | 3 | Kalman, R.E. (1960). **A New Approach to Linear Filtering and Prediction Problems**. *Journal of Basic Engineering*. | ASME [<sup>20</sup>](https://asmedigitalcollection.asme.org/fluidsengineering/article/82/1/35/397706/A-New-Approach-to-Linear-Filtering-and-Prediction) |
**引用句式**: **引用句式**:
- "Comparing this with the EMA formula, it is evident that EMA is a special case of the Kalman Filter where the gain K_t is constant." - "Comparing this with the EMA formula, it is evident that EMA is a special case of the Kalman Filter where the gain K_t is constant."
@ -139,8 +140,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | **Dynamic assessment: an approach to assessing children's language-learning potential**. (2000). *PubMed*. | PubMed [<sup>24</sup>](https://pubmed.ncbi.nlm.nih.gov/10958430/) | | 1 | **Dynamic assessment: an approach to assessing children's language-learning potential**. (2000). *PubMed*. | PubMed [<sup>21</sup>](https://pubmed.ncbi.nlm.nih.gov/10958430/) |
| 2 | **Dynamic assessment of multilingual children's word learning**. (2022). *PubMed*. | PubMed [<sup>25</sup>](https://pubmed.ncbi.nlm.nih.gov/35429340/) | | 2 | **Dynamic assessment of multilingual children's word learning**. (2022). *PubMed*. | PubMed [<sup>22</sup>](https://pubmed.ncbi.nlm.nih.gov/35429340/) |
**引用句式**: **引用句式**:
- "Dynamic assessment represents an alternative approach to traditional language assessments." - "Dynamic assessment represents an alternative approach to traditional language assessments."
@ -152,7 +153,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | **A hybrid voice cloning for inclusive education in low-resource environments**. (2025). *Frontiers in Computer Science*. | Frontiers [<sup>26</sup>](https://www.frontiersin.org/journals/computer-science/articles/10.3389/fcomp.2025.1675616/full) | | 1 | **A hybrid voice cloning for inclusive education in low-resource environments**. (2025). *Frontiers in Computer Science*. | Frontiers [<sup>23</sup>](https://www.frontiersin.org/journals/computer-science/articles/10.3389/fcomp.2025.1675616/full) |
**引用句式**: "These systems can utilize familiar cloned voices to deliver reading exercises, language learning prompts, or social rehearsal activities for children." **引用句式**: "These systems can utilize familiar cloned voices to deliver reading exercises, language learning prompts, or social rehearsal activities for children."
@ -162,8 +163,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | **How Should I Teach from This Month Onward? A State-Space Model That Helps Drive Whole Classes to Achieve End-of-Year National Standardized Test Learning Targets**. (2022). *Systems*, 10(5), 167. | MDPI [<sup>27</sup>](https://www.mdpi.com/2079-8954/10/5/167) | | 1 | **How Should I Teach from This Month Onward? A State-Space Model That Helps Drive Whole Classes to Achieve End-of-Year National Standardized Test Learning Targets**. (2022). *Systems*, 10(5), 167. | MDPI [<sup>24</sup>](https://www.mdpi.com/2079-8954/10/5/167) |
| 2 | **Uncertainty-preserving deep knowledge tracing with state-space models**. (2024). *EDM Proceedings*. | EDM [<sup>28</sup>](https://educationaldatamining.org/edm2024/proceedings/2024.EDM-posters.108/) | | 2 | **Uncertainty-preserving deep knowledge tracing with state-space models**. (2024). *EDM Proceedings*. | EDM [<sup>25</sup>](https://educationaldatamining.org/edm2024/proceedings/2024.EDM-posters.108/) |
**引用句式**: **引用句式**:
- "We developed a simple-to-understand state-space model that predicts end-of-year national test scores." - "We developed a simple-to-understand state-space model that predicts end-of-year national test scores."
@ -175,8 +176,8 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | **Probabilistic models of language processing and acquisition**. (2006). *Trends in Cognitive Sciences*. | ScienceDirect [<sup>29</sup>](https://www.sciencedirect.com/science/article/pii/S1364661306001318) | | 1 | **Probabilistic models of language processing and acquisition**. (2006). *Trends in Cognitive Sciences*. | ScienceDirect [<sup>26</sup>](https://www.sciencedirect.com/science/article/pii/S1364661306001318) |
| 2 | **A pipeline for stochastic and controlled generation of realistic language input for simulating infant language acquisition**. (2025). *Behavior Research Methods*. | Springer [<sup>30</sup>](https://link.springer.com/article/10.3758/s13428-025-02772-6) | | 2 | **A pipeline for stochastic and controlled generation of realistic language input for simulating infant language acquisition**. (2025). *Behavior Research Methods*. | Springer [<sup>27</sup>](https://link.springer.com/article/10.3758/s13428-025-02772-6) |
**引用句式**: **引用句式**:
- "Probabilistic methods are providing new explanatory approaches to fundamental cognitive science questions of how humans structure, process and acquire language." - "Probabilistic methods are providing new explanatory approaches to fundamental cognitive science questions of how humans structure, process and acquire language."
@ -188,11 +189,11 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | **Toward Measuring and Maintaining the Zone of Proximal Development in Adaptive Instructional Systems**. *AIED 2001*. | Springer [<sup>31</sup>](https://link.springer.com/chapter/10.1007/3-540-47987-2_75) | | 1 | **Toward Measuring and Maintaining the Zone of Proximal Development in Adaptive Instructional Systems**. *AIED 2001*. | Springer [<sup>28</sup>](https://link.springer.com/chapter/10.1007/3-540-47987-2_75) |
| 2 | **Development and techniques in learner model in adaptive e-learning system: A systematic review**. (2024). *Computers & Education*. | ScienceDirect [<sup>32</sup>](https://www.sciencedirect.com/science/article/abs/pii/S0360131524001982) | | 2 | **Development and techniques in learner model in adaptive e-learning system: A systematic review**. (2024). *Computers & Education*. | ScienceDirect [<sup>29</sup>](https://www.sciencedirect.com/science/article/abs/pii/S0360131524001982) |
| 3 | **A possible future for next generation adaptive learning systems**. (2016). *Smart Learning Environments*. | Springer Open [<sup>33</sup>](https://slejournal.springeropen.com/articles/10.1186/s40561-016-0038-y) | | 3 | **A possible future for next generation adaptive learning systems**. (2016). *Smart Learning Environments*. | Springer Open [<sup>30</sup>](https://slejournal.springeropen.com/articles/10.1186/s40561-016-0038-y) |
| 4 | **Vygotsky's Zone of Proximal Development**. *ResearchGate*. | ResearchGate [<sup>34</sup>](https://www.researchgate.net/publication/383563118_Vygotsky's_Zone_of_Proximal_Development) | | 4 | **Vygotsky's Zone of Proximal Development**. *ResearchGate*. | ResearchGate [<sup>31</sup>](https://www.researchgate.net/publication/383563118_Vygotsky's_Zone_of_Proximal_Development) |
| 5 | **Vygotsky's Zone of Proximal Development: Instructional Implications and Teachers' Professional Development**. *ERIC*. | ERIC [<sup>35</sup>](https://files.eric.ed.gov/fulltext/EJ1081990.pdf) | | 5 | **Vygotsky's Zone of Proximal Development: Instructional Implications and Teachers' Professional Development**. *ERIC*. | ERIC [<sup>32</sup>](https://files.eric.ed.gov/fulltext/EJ1081990.pdf) |
**引用句式**: **引用句式**:
- "Intelligent tutoring Systems (ITSs) adapt content and activities with the goals of being both effective and efficient instructional environments." - "Intelligent tutoring Systems (ITSs) adapt content and activities with the goals of being both effective and efficient instructional environments."
@ -205,11 +206,11 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Corbett, A. T., & Anderson, J. R. (1995). **Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge**. *User Modeling and User-Adapted Interaction*, 4(4), 253-278. | Springer [<sup>36</sup>](https://link.springer.com/article/10.1007/BF01099821) | | 1 | Corbett, A. T., & Anderson, J. R. (1995). **Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge**. *User Modeling and User-Adapted Interaction*, 4(4), 253-278. | Springer [<sup>33</sup>](https://link.springer.com/article/10.1007/BF01099821) |
| 2 | **Twenty-five years of Bayesian knowledge tracing: a systematic review**. (2023). *User Modeling and User-Adapted Interaction*. | Springer [<sup>37</sup>](https://link.springer.com/article/10.1007/s11257-023-09389-4) | | 2 | **Twenty-five years of Bayesian knowledge tracing: a systematic review**. (2023). *User Modeling and User-Adapted Interaction*. | Springer [<sup>34</sup>](https://link.springer.com/article/10.1007/s11257-023-09389-4) |
| 3 | **Properties of the Bayesian Knowledge Tracing Model**. *JEDM*. | ERIC [<sup>38</sup>](https://files.eric.ed.gov/fulltext/EJ1115329.pdf) | | 3 | **Properties of the Bayesian Knowledge Tracing Model**. *JEDM*. | ERIC [<sup>35</sup>](https://files.eric.ed.gov/fulltext/EJ1115329.pdf) |
| 4 | **Individualized Bayesian Knowledge Tracing Models**. *AIED 2013*. | Springer [<sup>39</sup>](https://link.springer.com/chapter/10.1007/978-3-642-39112-5_18) | | 4 | **Individualized Bayesian Knowledge Tracing Models**. *AIED 2013*. | Springer [<sup>36</sup>](https://link.springer.com/chapter/10.1007/978-3-642-39112-5_18) |
| 5 | **A Survey of Knowledge Tracing: Models, Variants, and Applications**. (2021). *arXiv*. | arXiv [<sup>40</sup>](https://arxiv.org/abs/2105.15106) | | 5 | **A Survey of Knowledge Tracing: Models, Variants, and Applications**. (2021). *arXiv*. | arXiv [<sup>37</sup>](https://arxiv.org/abs/2105.15106) |
**引用句式**: **引用句式**:
- "Bayesian Knowledge Tracing is a probabilistic framework that models student mastery as a hidden Markov process." - "Bayesian Knowledge Tracing is a probabilistic framework that models student mastery as a hidden Markov process."
@ -224,7 +225,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Zhang, H., Yang, Z., et al. (2025). **Design and evaluation of children's education interactive learning system based on human computer interaction technology**. *Scientific Reports*, 15, Article 5597. | Nature [<sup>41</sup>](https://www.nature.com/articles/s41598-025-90800-y) | | 1 | Zhang, H., Yang, Z., et al. (2025). **Design and evaluation of children's education interactive learning system based on human computer interaction technology**. *Scientific Reports*, 15, Article 5597. | Nature [<sup>38</sup>](https://www.nature.com/articles/s41598-025-90800-y) |
**引用句式**: "本系统并未采用结构固化、组件繁重的全栈式框架而是选用了轻量级的微内核Web框架Flask作为后端核心。" **引用句式**: "本系统并未采用结构固化、组件繁重的全栈式框架而是选用了轻量级的微内核Web框架Flask作为后端核心。"
@ -236,7 +237,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Cavanagh, T., Chen, B., Lahcen, R. A. M., & Paradiso, J. (2020). **Personalized adaptive learning in higher education: A scoping review of key characteristics and impact on academic performance and engagement**. *Smart Learning Environments*, 11(14). | PMC [<sup>42</sup>](https://pmc.ncbi.nlm.nih.gov/articles/PMC11544060/) | | 1 | Cavanagh, T., Chen, B., Lahcen, R. A. M., & Paradiso, J. (2020). **Personalized adaptive learning in higher education: A scoping review of key characteristics and impact on academic performance and engagement**. *Smart Learning Environments*, 11(14). | PMC [<sup>39</sup>](https://pmc.ncbi.nlm.nih.gov/articles/PMC11544060/) |
**引用句式**: "系统依据用户的历史综合能力评分(隐藏分),动态决定当前的对话难度策略。" **引用句式**: "系统依据用户的历史综合能力评分(隐藏分),动态决定当前的对话难度策略。"
@ -248,7 +249,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Wang, X., Pang, H., Wallace, M. P., Wang, Q., & Chen, W. (2025). **D-ID Studio: Empowering Language Teaching With AI Avatars**. *TESOL Journal*. | Wiley [<sup>43</sup>](https://onlinelibrary.wiley.com/doi/10.1002/tesj.70034) | | 1 | Wang, X., Pang, H., Wallace, M. P., Wang, Q., & Chen, W. (2025). **D-ID Studio: Empowering Language Teaching With AI Avatars**. *TESOL Journal*. | Wiley [<sup>40</sup>](https://onlinelibrary.wiley.com/doi/10.1002/tesj.70034) |
**引用句式**: "在具体的交互实现上系统打通了ASR、LLM、TTS与数字人驱动的全链路。" **引用句式**: "在具体的交互实现上系统打通了ASR、LLM、TTS与数字人驱动的全链路。"
@ -260,7 +261,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Lee-Cultura, S., Sharma, K., Giannakos, M., & Retalis, S. (2025). **A learning experience design framework for multimodal learning in the early childhood**. *Smart Learning Environments*, 12(1). | Springer Open [<sup>44</sup>](https://slejournal.springeropen.com/articles/10.1186/s40561-025-00376-3) | | 1 | Lee-Cultura, S., Sharma, K., Giannakos, M., & Retalis, S. (2025). **A learning experience design framework for multimodal learning in the early childhood**. *Smart Learning Environments*, 12(1). | Springer Open [<sup>41</sup>](https://slejournal.springeropen.com/articles/10.1186/s40561-025-00376-3) |
**引用句式**: "系统采用了'语音输入-语义理解-智能回复-语音合成-表情驱动'的实时多模态交互。" **引用句式**: "系统采用了'语音输入-语义理解-智能回复-语音合成-表情驱动'的实时多模态交互。"
@ -272,7 +273,7 @@
| # | 文献信息 | 链接 | | # | 文献信息 | 链接 |
|---|----------|------| |---|----------|------|
| 1 | Li, Y., Chen, X., Zhang, H., et al. (2025). **Language Proficiency Assessment of Autistic Children Using Large Language Models**. *Expert Systems with Applications*. | ScienceDirect [<sup>45</sup>](https://www.sciencedirect.com/science/article/abs/pii/S0957417425033275) | | 1 | Li, Y., Chen, X., Zhang, H., et al. (2025). **Language Proficiency Assessment of Autistic Children Using Large Language Models**. *Expert Systems with Applications*. | ScienceDirect [<sup>42</sup>](https://www.sciencedirect.com/science/article/abs/pii/S0957417425033275) |
**引用句式**: "评估机制的核心依据是《3-6岁儿童学习与发展指南》系统构建了包含语言理解与逻辑、语言表达与组织、语言功能与思维拓展、语言习惯与流畅度在内的四维评估模型。" **引用句式**: "评估机制的核心依据是《3-6岁儿童学习与发展指南》系统构建了包含语言理解与逻辑、语言表达与组织、语言功能与思维拓展、语言习惯与流畅度在内的四维评估模型。"
@ -309,28 +310,49 @@
``` ```
references/ references/
├── README.md # 本文档 ├── README.md # 本文档
├── chapter3/ # 第三章文献 ├── Du_2024_CosyVoice2.pdf # 语音克隆技术
│ ├── voice_cloning/ # 语音克隆 ├── Loizou_2013_SpeechEnhancement.pdf # 谱减法降噪
│ ├── noise_reduction/ # 降噪算法 ├── Lu_2008_SpectralSubtraction.pdf # 谱减法几何方法
│ ├── child_voice/ # 儿童语音特征 ├── Upadhyay_2015_SpectralSubtraction.pdf # 谱减法对比研究
│ ├── vad/ # 语音活动检测 ├── Hasek_1980_PreadolescentVoices.pdf # 儿童语音特征
│ ├── loudness/ # 响度归一化 ├── Keating_1978_FundamentalFrequency.pdf # 儿童基频研究
│ ├── stft/ # STFT与窗函数 ├── Perry_2001_GenderIdentification.pdf # 儿童性别识别
│ ├── quality/ # 语音质量评估 ├── Robb_1985_VocalFrequency.pdf # 儿童基频趋势
│ └── resampling/ # 重采样 ├── Ramirez_2007_VAD.pdf # 语音活动检测
├── chapter4/ # 第四章文献 ├── Zhang_2013_DeepBeliefVAD.pdf # 深度学习VAD
│ ├── kalman_ema/ # Kalman与EMA ├── EBU_2020_R128_Loudness.pdf # 响度归一化标准
│ ├── dynamic_assessment/ # 动态评估 ├── Rabiner_2010_DigitalSpeechProcessing.pdf # 数字语音处理
│ ├── state_space/ # 状态空间模型 ├── Harris_1978_Windows_DFT.pdf # 窗函数分析
│ ├── zpd/ # ZPD理论 ├── Hu_2008_QualityMeasures.pdf # 语音质量评估
│ └── bkt/ # 贝叶斯知识追踪 ├── ITU_2001_PESQ.pdf # PESQ标准
└── chapter5/ # 第五章文献 ├── Crochiere_1983_MultirateProcessing.pdf # 多采样率处理
├── hci_education/ # 人机交互教育 ├── Smith_1984_SamplingRateConversion.pdf # 采样率转换
├── adaptive_learning/ # 自适应学习 ├── IEEE_2020_EMA_Kalman.pdf # EMA与Kalman等价性
├── ai_avatar/ # AI虚拟化身 ├── IFAC_AdaptiveEKF_EMA.pdf # 自适应EKF
├── multimodal/ # 多模态学习 ├── Kalman_1960_LinearFiltering.pdf # Kalman滤波原始论文
└── llm_assessment/ # LLM评估 ├── DynamicAssessment_2000_Language.pdf # 动态评估
├── DynamicAssessment_2022_Multilingual.pdf # 多语言动态评估
├── Frontiers_2025_VoiceCloning_Education.pdf # 语音克隆教育应用
├── MDPI_2022_StateSpace_Education.pdf # 状态空间模型教育应用
├── EDM_2024_KnowledgeTracing.pdf # 深度知识追踪
├── TrendsCogSci_2006_ProbabilisticModels.pdf # 概率模型语言习得
├── BehaviorResMethods_2025_InfantLanguage.pdf # 婴幼儿语言习得
├── Springer_2001_ZPD_AdaptiveSystems.pdf # ZPD与自适应系统
├── CompEdu_2024_AdaptiveLearning_Review.pdf # 自适应学习综述
├── SmartLearn_2016_NextGenAdaptive.pdf # 下一代自适应学习
├── ResearchGate_ZPD_Vygotsky.pdf # ZPD理论
├── ERIC_ZPD_Professional_Development.pdf # ZPD教学应用
├── Corbett_1995_KnowledgeTracing.pdf # BKT原始论文
├── Springer_2023_BKT_25Years.pdf # BKT 25年综述
├── JEDM_BKT_Properties.pdf # BKT数学性质
├── AIED_2013_IndividualizedBKT.pdf # 个性化BKT
├── arXiv_2021_KT_Survey.pdf # 知识追踪综述
├── Nature_2025_HCI_ChildEducation.pdf # 儿童教育交互系统
├── PMC_2020_PersonalizedAdaptive.pdf # 个性化自适应学习
├── TESOL_2025_AI_Avatars.pdf # AI虚拟化身
├── SmartLearn_2025_MultimodalEarlyChildhood.pdf # 幼儿多模态学习
└── ExpertSys_2025_LLM_Assessment.pdf # LLM语言能力评估
``` ```
--- ---
@ -346,6 +368,23 @@ Du, Z., et al. (2024). CosyVoice 2: Scalable Streaming Speech Synthesis with
--- ---
## 🔍 快速查找指南
### 按研究主题查找
- **语音技术**CosyVoice, 谱减法, VAD, STFT, 重采样 → 文件名包含关键词搜索
- **儿童语言**:儿童语音特征, 动态评估, 语言习得 → 搜索 "Child", "Language", "Infant"
- **评估算法**Kalman, EMA, BKT, 知识追踪 → 搜索 "Kalman", "BKT", "Tracing"
- **教育理论**ZPD, 自适应学习, 个性化教学 → 搜索 "ZPD", "Adaptive", "Personalized"
- **系统实现**Flask, HCI, 多模态, LLM → 搜索 "System", "HCI", "LLM"
### 按年份查找
- **最新研究2024-2025**8篇 - 查找文件名包含 "2024", "2025"
- **经典论文2000年前**5篇 - 查找文件名包含 "1978-1999"
---
## ⚖️ 版权声明 ## ⚖️ 版权声明
本仓库仅用于学术研究目的,所有文献版权归原作者及出版方所有。文献链接指向原始出版来源,请遵守各出版方的使用条款。 本仓库仅用于学术研究目的,所有文献版权归原作者及出版方所有。文献链接指向原始出版来源,请遵守各出版方的使用条款。
@ -358,4 +397,4 @@ Du, Z., et al. (2024). CosyVoice 2: Scalable Streaming Speech Synthesis with
--- ---
*最后更新: 2025年2月* *最后更新: 2026年2月*