Page cover

5. Recording and validation

Chapter 5 overview:

This chapter is for anyone working on voice data projects. This may be with TWB Voice or any other platform. We explain the key principles for creating high-quality data sets.

We cover topics such as:

  • How to collect high-quality recordings

  • How to rate (validate) recordings in a fair and consistent way

  • Checklists so you can be more efficient and get good-quality recordings

  • Making sure your recordings are in line with TTS or ASR goals

Note: This section is all about action. You don’t need to be a technical expert. We focus on being clear, consistent and fair.

Recording and validation are the two key tasks in . To get high-quality voice , you need clear, varied, natural recordings. And you need to rate them with accurate validation. This will make sure they can be used to train voice technologies like and systems. Experts may also use the datasets you create for research or to develop other .

In this section, we go through the guidelines, processes, roles, and checks that are needed during a project. They ensure that the data is of a good quality and consistent. We also look at the key standards and approaches that you can use to collect data and create data sets. You can use these for any project to collect data, not just on TWB Voice, but also on other platforms.

Last updated

Was this helpful?