| The following article is still a work in progress and being improved for the benefit of all visitors. Please bear with us while improvements are being made, and as soon as an editor is available to do so assume good faith until the edits are complete. For information on how to help, see the manual of style guideline. We apologize for the inconvenience this may cause in the meantime.
Work-in-progress pages are automatically sorted into this category.
The following is a list of auxiliary phonemes that will alter the effect of a note or control some effects in a certain way. It is possible to use a VOCALOID voicebank without having to ever touch these set of data, however, use of them within a song can improve the results of a Vocaloid's ability to sound more human-like and emotional. In all cases, the data has to be entered manually through the note properties selection.
The following were added to Vocaloids during the first generation of the software.
|Symbol||Sample + notes||Applies to|
|[*in]||Breathing Inhalation sample, used by itself. Requires to be in a different track and doesn't stand long notes (it generates an audio loop)||Japanese voicebanks only|
|[*out]||Breathing Exhalation sample, used by itself. Requires to be in a different track and doesn't stand long notes (it generates an audio loop)||Japanese voicebanks only|
|[Asp]||According to one report, asp is possible to use in Vocaloid with mixed results. For instance, Miriam will make a gasp-like noise.||Full potential unknown|
- [*in] and [*out] are used to mimick a singer inhaling and exhaling while singing. Because the singer is breathing out as they sing, there actually is little need for [*out] and its usage is more limited. For use them correctly the best is work them in a different track from the main melody.
- [Asp] seems to be thought as an slot for a Breathing sample, however not all the first generation voicebanks seems to have. In the case of these ones it seems to alter the way how the sound is rendered, affecting the pronunciation.
Both English and Japanese Vocaloids from the 2nd version of software had access to some or all of the following.
|Symbol||Sample + notes||Applies to|
|[br1]||Breathing sample, used by itself with no additional phonemes.||All V2 voicebanks except Sweet Ann, Tonio and Sonika have this.|
|[br2]||Breathing sample, used by itself with no additional phonemes.||All V2 voicebanks except Sweet Ann, Tonio and Sonika have this.|
|[br3]||Breathing sample, used by itself with no additional phonemes.||All V2 voicebanks except Sweet Ann, Tonio and Sonika have this.|
|[br4]||Breathing sample, used by itself with no additional phonemes.||All V2 voicebanks except Sweet Ann, Tonio and Sonika have this.|
|[br5]||Breathing sample, used by itself with no additional phonemes.||All V2 voicebanks except Sweet Ann, Tonio and Sonika have this.|
|[Sil]||Silence Phoneme||Works with both Japanese and English voicebanks.|
|[Asp]||Deprecated Breath Phoneme. Alters the render.||Works with both Japanese and English voicebanks.|
|[R]||The rolling or trilled 'r' sound.||English voicebanks only; with exception of Megurine Luka and Sweet Ann|
- "Asp" is short for "aspiration".
- [Asp] seems to be an empty slot in the phonetic system carried from the first version of VOCALOID. However as empty slot, it affects the way how the sound is rendered, thus affecting the pronunciation.
- When placed between a vowel pair generates devoicing and blending of the second vowel. It has various potential applications.
- This one can be used as a blending phoneme, allowing to correct some choppy phoneme combinations, a common problem in the Japanese voicebanks. There is less need for [Asp] in English voicebanks as these ones tends to have the opposite issue (unwanted blending and slurry sounds), making [Sil] much more useful to them.
- The blending effect also allows to emulate some sounds, like produce a a faux /æ / combining the [e] and [a] phonemes.
- Big Al's use of [R] is much more limited then other English Vocaloids. Just it works at the beginning of a syllable and only if there's any adjacent note before this one,
- For achieve this effect with Japanese voicebanks, it's required blend succesive flaps into a trill, producing a sound similar to rolling R.
- Breaths ([br1] - [br5])
- These replace the [*in] and [*out] phonetics of VOCALOID, with the exception that the majority of Vocaloids for VOCALOID2 have inhaling sounds, with the exception of Big Al who has grunts and exhaling sounds.
- With the breathes ([br1] to [br5]) in VOCALOID2, there is one particular glitch to note. If several are placed together side by side with no breaks between them then depending on the note length, not all of them may be sounded out. This also may happen if they're placed to close a singing note, for this reason is recommendable work them in a different track.
VOCALOID3 and onwardsEdit
Some of the extra codes for English, Japanese, Korean, Chinese and Spanish Vocaloids.
|Symbol||Sample + notes||Applies to|
|[br1] - [br5]||Breathing sample, used by itself with no additional phonemes,||Imported VOCALOID2 voicebanks (See "VOCALOID2" section) and a few VOCALOID3 and VOCALOID4 vocals.|
|[-]||It can be used to carry a word across several notes (ex- (#C2), - (#B2), - (#C2), amp- (#D2), ple (#C2)). In VOCALOID3, the smoothness has been improved.||English, Japanese and Korean Vocaloids.|
|[Sil]||Silence Phoneme||Does not work with Korean Vocaloids.|
|[Asp]||Deprecated Breath Phoneme.||The glitch was removed and this phoneme no longer alters the render in the VOCALOID3 editor.|
|[R]||The rolling or trilled 'r' sound.||Some imported English VOCALOID2 voicebanks (See "VOCALOID2"). Deprecated for the V3 English voicebanks.|
|[*_0]||Devoiced Sonorant (replace the <*> for the intended phoneme). Adding _0 to a sonorant alters its pronunciation to be pronounced in a voiceless way, making it barely audible. The sonorants includes the vowels, glides, approximants, liquids (laterals and rhotics) and nasals, thus the available devoiced sonorants varies per language.||Available for all the currently released V3 and onwards voicebanks. Also works for V2 voicebanks imported to V3|
- "Asp" is short for "aspiration" and normally applies to consonants rather then vowels.
- From VOCALOID3 and onwards, the [Asp] it can't be used as a blending phoneme anymore, as apparently the rendering glitch was removed in the succeding version of the software.
- Now it has an easier usage for the Japanese voicebanks, as typing っ into the lyrics will put this phoneme automatically.
- Initial ㅇ = N (silent consonant/glottal stop) and Final ㅇ = Np (retroflex nasal consonant) can work for Korean Vocaloids in place of [Sil]
- Extends the vowel pronunciation across several notes.
- Japanese voicebanks now can use this phonetic data, however it's usage is not critical due to the way the language works. For this language, it allows a better control the pronunciation.
- Other languages such as English cannot connect certain pronunciations (diphthongs and rhotic vowels) without this and it helps smooth the transactions across notes. It still can be utilized as the old hyphen/slash input in the lyric, used in V2.
- Some voicebanks, like SONiKA or Miku English, may manifest some particular sound glitches when using this phoneme.
- Most of the VOCALOID3 and later voicebanks now insert their breaths via WAV sample insert, removing the need for [br1] - [br5]. However, OLIVER and various other VOCALOIDs still can make use of the VOCALOID2 system.
- The Devoiced Sonorant [*_0] hasn't much use in English voicebanks due to the way the language works (the reduced vowels tends to shift toward an unstressed vowel rather lose their voicing). Anyway, it still can be used as a supplementary resource.
- In the case of Japanese voicebanks, it allows a more natural or colloquial pronunciation. Usually the vowels [M], [i] (after a voiceless palatal consonant) and occasionally the [o] (after a plosive) becomes devoiced when they're placed between two voiceless consonants. However, this varies across the dialects and also depends of the emphasis or speech way.
- Many producers have found this particular phonetic data useful in extending the Vocaloids language capablities. For example, with a Japanese Vocaloid it allows them to produce a much closer level of English capabilities or another languages, as it allows to work around the limitations produced by the restrictive CV phonotactics of the Japanese language, allowing produce single or coda (end of syllable) consonant sounds.
- The [*_0] also allows some interesting effects, like imitate a whispering voice or some breath effects.
- When importing, the VOCALOID3 software is capable of making the Devoiced Sonorant ([*_0]) out of Phonetics the already available within the imported V2 voicebanks. It is unknown the way the software process the samples for achieve this, however it may be related to the manipulation of the harmonic content, an option previously available in V1 .
- [?] greatly increases a Vocaloids potential capabilities by allowing them to mimic certain dialects such as Cockney, North America and Scottish English.
- In Japanese, glottal stops occur at the end of interjections of surprise or anger, and are represented by the character っ. However adding っ into the lyrics will input [Sil] a phoneme instead.
- The Korean has a different symbol for the glottal stop, being replaced by [N].
Please note we are waiting for more information on some languages