Recordings and Software

This page summarises the recordings and software yielded from projects of FIST.

Non-word repetition sets: Universal Non-Word Repetition (UNWR) and CNRep

Three sets of non-word repetition sets are available below, yielded from the following work.

Howell, P., Tang, K., Tuomainen, O., Chan, S. K., Beltran, K., Mirawdeli, A. and Harris, J. (2017) Identification of fluency and word-finding difficulty in samples of children with diverse language backgrounds. International Journal of Language & Communication Disorders 52(5). 595-611. http://dx.doi.org/10.1111/1460-6984.12305 [DOI] [pdf] [bib]
In this paper, we proposed a "universal" non-word repetition (UNWR) test designed for 20 Indo-European languages (English, Polish, Romanian, European Portuguese, Bulgarian, Serbo-Croat-Bosnian, Czech, Dutch, French, German, Hungarian, Slovene, Swedish, Danish, Norwegian, Russian, Latvian, Ukrainian, Urdu-Hindi and Bengali). To complement this test set, we made the test set available in both audio and phonetic transcription formats. Methodologically, this would allow researchers and school teachers to apply this test set with ease, as well as improving replicability of research findings.

(Please note: In the paper, the stimuli presented were slightly different from the ones listed here, however they conform with what is said here. In the paper, the stimuli were presented live (as opposed to pre-recorded), therefore they are most similar to the Standard Souther British English version of UNWR with slightly different stress patterns and vowel quality in some of the non-words.)

Currently, two versions of UNWR are available. They differ in terms of the pronunciation (phones and stress patterns). Furthermore, we also made the well-known CNRep (Gathercole et al. 1994) non-word repetition set available for ease of application and comparison with UNWR.

The speaker in the audio stimuli was Dr. Geoff Lindsey, a professional phonetician and a speech coach (http://englishspeechservices.com/).
The phonetic transcription in the written stimuli is based on CUBE (Current British English searchable transcriptions) http://seas3.elte.hu/cube/, a searchable pronunication dictionary that reflects the accent of contemporary Southern British English.

Please click on the following bars for more details of the three non-word repetition sets.

Non-word repetition set: Universal Non-Word Repetition (UNWR) -- Universal Version

Citation

If you have used this set of stimuli (printed or audio format) in any way, please cite:

Howell, P., Tang, K., Tuomainen, O., Chan, S. K., Beltran, K., Mirawdeli, A. and Harris, J. (2017) Identification of fluency and word-finding difficulty in samples of children with diverse language backgrounds. International Journal of Language & Communication Disorders 52(5). 595-611. http://dx.doi.org/10.1111/1460-6984.12305 [DOI] [pdf] [bib]

About the test set

This version of UNWR is the "Universal" set. It is not based on a particular accent/language in terms of phones and stress patterns. The stress patterns were "neutral" such that there isn't a particular syllable being stressed.
This set contains 12 items per syllable size, from 2 syllables to 5 syllables. For each syllable size, there are two practise items: they are Word 1 and Word 2 (2-Syllable), Word 13 and Word 14 (3-Syllable), Word 25 and Word 26 (4-Syllable), and Word 37 and Word 38 (5-Syllable). The sound files are named in the following format: Word[Number]_[Orthographic form of the item].wav.

About the test procedure

We recommend you start with the 2-syllable items, and gradually increase the syllable size complexity. To keep the task brief, a child has to get at least seven items out of ten items correct to progress to the next syllable length, otherwise the task ends there. The final score is the number of non-words correct.

About the speaker

Name: Dr. Geoff Lindsey
Background: The speaker is a professional phonetician and a speech coach (http://englishspeechservices.com/)
Native language and accent: British English (Standard Southern British English)
Gender: Male
Age: mid-50's

About the sound files

Format: wav
Channel: Mono
Sample rate: 44100 Hz
Description: All stimuli have the same absolute duration (padded with silence), in each the sound portion is centered. They have been high-pass filtered at 50 Hz, and intensity normalised at 65dB.
Processing Steps: (Click to see details)
1. The recording was made in a professional sound booth at University College London.
2. The recording was manually divided into individual stimuli with generous amount of silence before and after each stimulus.
3. ProsodyAligner was used to detect word boundaries objectively with an English Forced-alignment model.
4. 1s silence was added to the start and the end of each stimulus.
5. The word boundaries from ProsodyAligner were then shifted by 0.2 sec outwards to be conservative.
6. The stimuli were extracted again using the adjusted word boundaries.
7. The stimuli were duration-normalised (i.e. identical duration) by padding sufficient amount of silence before and after each stimulus to match the duration of the longest stimulus.
8. The duration-normalised stimuli were then high-pass filtered at 50Hz, and intensity normalised at 65dB using Praat.

Stimuli Breakdown for UNWR (Universal)

Syllable Size	Number of Items	Audio Samples	Download Sound (.wav in .zip)	Download Transcription (.xls and .pdf)
All	All		UNWR_Universal_trans_SyllAll.zip	UNWR_Universal_trans_SyllAll.xls UNWR_Universal_trans_SyllAll.pdf
2	12	"plumpom" [plum.pom]	UNWR_Universal_wavs_Syll2.zip	UNWR_Universal_trans_Syll2.xls UNWR_Universal_trans_Syll2.pdf
3	12	"ɡokimap" [ɡo.ki.map]	UNWR_Universal_wavs_Syll3.zip	UNWR_Universal_trans_Syll3.xls UNWR_Universal_trans_Syll3.pdf
4	12	"puplutende" [pu.plu.ten.de]	UNWR_Universal_wavs_Syll4.zip	UNWR_Universal_trans_Syll4.xls UNWR_Universal_trans_Syll4.pdf
5	12	"pluturukompi" [plu.tu.ru.kom.pi]	UNWR_Universal_wavs_Syll5.zip	UNWR_Universal_trans_Syll5.xls UNWR_Universal_trans_Syll5.pdf

Non-word repetition set: Universal Non-Word Repetition (UNWR) -- Standard Southern British English Version

Citation

If you have used this set of stimuli (printed or audio format) in any way, please cite:

Howell, P., Tang, K., Tuomainen, O., Chan, S. K., Beltran, K., Mirawdeli, A. and Harris, J. (2017) Identification of fluency and word-finding difficulty in samples of children with diverse language backgrounds. International Journal of Language & Communication Disorders 52(5). 595-611. http://dx.doi.org/10.1111/1460-6984.12305 [DOI] [pdf] [bib]

About the test set

This version of UNWR is the "Standard Southern British English" version. It is based on the Standard Southern British English accent in terms of phones and stress patterns. The stress patterns were determined by minimising the number of stresses per word and allowing stray syllables in words with odd number of syllables.
This set contains 12 items per syllable size, from 2 syllables to 5 syllables. For each syllable size, there are two practise items: they are Word 1 and Word 2 (2-Syllable), Word 13 and Word 14 (3-Syllable), Word 25 and Word 26 (4-Syllable), and Word 37 and Word 38 (5-Syllable). The sound files are named in the following format: Word[Number]_[Orthographic form of the item].wav.

About the test procedure

About the speaker

Name: Dr. Geoff Lindsey
Background: The speaker is a professional phonetician and a speech coach (http://englishspeechservices.com/)
Native language and accent: British English (Standard Southern British English)
Gender: Male
Age: mid-50's

About the sound files

Format: wav
Channel: Mono
Sample rate: 44100 Hz
Description: All stimuli have the same absolute duration (padded with silence), in each the sound portion is centered. They have been high-pass filtered at 50 Hz, and intensity normalised at 65dB.
Processing Steps: (Click to see details)
1. The recording was made in a professional sound booth at University College London.
2. The recording was manually divided into individual stimuli with generous amount of silence before and after each stimulus.
3. ProsodyAligner was used to detect word boundaries objectively with an English Forced-alignment model.
4. 1s silence was added to the start and the end of each stimulus.
5. The word boundaries from ProsodyAligner were then shifted by 0.2 sec outwards to be conservative.
6. The stimuli were extracted again using the adjusted word boundaries.
7. The stimuli were duration-normalised (i.e. identical duration) by padding sufficient amount of silence before and after each stimulus to match the duration of the longest stimulus.
8. The duration-normalised stimuli were then high-pass filtered at 50Hz, and intensity normalised at 65dB using Praat.

Stimuli Breakdown for UNWR (Standard Southern British English)

Syllable Size	Number of Items	Audio Samples	Download Sound (.wav in .zip)	Download Transcription (.xls and .pdf)
All	All		UNWR_SSBE_wavs_SyllAll.zip	UNWR_SSBE_trans_SyllAll.xls UNWR_SSBE_trans_SyllAll.pdf
2	12	"plumpom" [ˈplʌm.pəm]	UNWR_SSBE_wavs_Syll2.zip	UNWR_SSBE_trans_Syll2.xls UNWR_SSBE_trans_Syll2.pdf
3	12	"ɡokimap" [ɡə.ˈkɪ.məp]	UNWR_SSBE_wavs_Syll3.zip	UNWR_SSBE_trans_Syll3.xls UNWR_SSBE_trans_Syll3.pdf
4	12	"puplutende" [ˌpɵ.plə.ˈtɛn.di]	UNWR_SSBE_wavs_Syll4.zip	UNWR_SSBE_trans_Syll4.xls UNWR_SSBE_trans_Syll4.pdf
5	12	"pluturukompi" [plə.ˌtɵ.ɹə.ˈkɔm.pi]	UNWR_SSBE_wavs_Syll5.zip	UNWR_SSBE_trans_Syll5.xls UNWR_SSBE_trans_Syll5.pdf

Non-word repetition set: CNRep

Citation

If you have used this set of stimuli (printed or audio format) in any way, please cite BOTH of the following publications:

Howell, P., Tang, K., Tuomainen, O., Chan, S. K., Beltran, K., Mirawdeli, A. and Harris, J. (2017) Identification of fluency and word-finding difficulty in samples of children with diverse language backgrounds. International Journal of Language & Communication Disorders 52(5). 595-611. http://dx.doi.org/10.1111/1460-6984.12305 [DOI] [pdf] [bib]

Gathercole, S. E., Willis, C. S., Baddeley, A., & Emslie, H. (1994). The Children's Test of Nonword Repetition: a test of phonological working memory. 2(2), 103-27. Doi:10.1080/09658219408258940 [DOI] [bib]

About the test set

This version of CNRep is based on Gathercole et al (1994). The major differences are that the phonetic transcription is based on the Standard Southern British English accent in terms of phones and stress patterns as well as having two additional trial items per syllable size (kindly provided by Prof. Susan E. Gathercole).
This set contains 12 items per syllable size, from 2 syllables to 5 syllables. For each syllable size, there are two practise items: they are Word 1 and Word 2 (2-Syllable), Word 13 and Word 14 (3-Syllable), Word 25 and Word 26 (4-Syllable), and Word 37 and Word 38 (5-Syllable). The sound files are named in the following format: Word[Number]_[Orthographic form of the item].wav.

About the test procedure

About the speaker

Name: Dr. Geoff Lindsey
Background: The speaker is a professional phonetician and a speech coach (http://englishspeechservices.com/)
Native language and accent: British English (Standard Southern British English)
Gender: Male
Age: mid-50's

About the sound files

Format: wav
Channel: Mono
Sample rate: 44100 Hz
Description: All stimuli have the same absolute duration (padded with silence), in each the sound portion is centered. They have been high-pass filtered at 50 Hz, and intensity normalised at 65dB.
Processing Steps: (Click to see details)
1. The recording was made in a professional sound booth at University College London.
2. The recording was manually divided into individual stimuli with generous amount of silence before and after each stimulus.
3. ProsodyAligner was used to detect word boundaries objectively with an English Forced-alignment model.
4. 1s silence was added to the start and the end of each stimulus.
5. The word boundaries from ProsodyAligner were then shifted by 0.2 sec outwards to be conservative.
6. The stimuli were extracted again using the adjusted word boundaries.
7. The stimuli were duration-normalised (i.e. identical duration) by padding sufficient amount of silence before and after each stimulus to match the duration of the longest stimulus.
8. The duration-normalised stimuli were then high-pass filtered at 50Hz, and intensity normalised at 65dB using Praat.

Stimuli Breakdown for CNRep

Syllable Size	Number of Items	Audio Samples	Download Sound (.wav in .zip)	Download Transcription (.xls and .pdf)
All	All		CNRep_wavs_SyllAll.zip	CNRep_trans_SyllAll.xls CNRep_trans_SyllAll.pdf
2	12	"sermat" [ˈsəː.mət]	CNRep_wavs_Syll2.zip	CNRep_trans_Syll2.xls CNRep_trans_Syll2.pdf
3	12	"fonamit" [ˈfɔ.nə.mɪt]	CNRep_wavs_Syll3.zip	CNRep_trans_Syll3.xls CNRep_trans_Syll3.pdf
4	12	"munnatalic" [ˌmʌ.nə.ˈta.lɪk]	CNRep_wavs_Syll4.zip	CNRep_trans_Syll4.xls CNRep_trans_Syll4.pdf
5	12	"molipadary" [ˌməw.lɪ.ˈpɛj.də.ɹi]	CNRep_wavs_Syll5.zip	CNRep_trans_Syll5.xls CNRep_trans_Syll5.pdf