| The subject of this article is controversial and content may be in dispute. This article exists to help ensure that the wiki covers all types of VOCALOID-related issues.
Before making any potentially controversial changes to the article, please carefully read the discussion-page dialogue to see if the issue has been raised before, and ensure that your edit meets all of Vocaloid Wikia's policies and guidelines.
Since VOCALOID™ was first released, there have been some controversial issues that have sprung up in regards to it. While VOCALOID™ is not without criticism, these issues have become subject to much attention or outcry. The issues discussed on this page are particularly upsetting for VOCALOID fans at times as they are often used to question the validity of VOCALOIDs or the software itself.
Validity of workEdit
The VOCALOID software is regarded as a virtual instrument in Japan, while overseas (outside of Japan and its neighboring Asian countries) VOCALOIDs are sold as virtual singers. This small difference in classification can be a controversial issue in the VOCALOID fandom; for example, it brings about the question as to whether Miku can be called a "singer" when she is just a synthesized vocal or an "instrument".
This, in part, has led some music fans to ask if music utilizing VOCALOID is real music when the vocals are not real and if the software is on par with real singers. The VOCALOID2 software was often commented to be far from perfect and was noted to be far from being a top rate singer due to its limitations. At times media will often also fail to report VOCALOID correctly, leading a general confusion as to what VOCALOID is. Some reports label it as a "game" based on Project Diva, others a "band" based on the concerts, and some mistake the PVs for part of an anime. Furthermore, because of Miku's fame there is also a tendency for some reports to presume all VOCALOIDs sing in a similar tone or pitch of voice to Miku. This renders all VOCALOIDs judged based solely on one vocal, causing further confusion.
During his tweets, GazettE's Aoi questioned if VOCALOID and such virtual singers should be compared on the same level when producers and bands such as his own exist. Criticizing the VOCALOID songs and questioning if they were even real music led to fan outburst.
Generally, VOCALOID producers regard their work as "real". Many famous song writers have gone onto other works beyond VOCALOID.
VOCALOID and politicsEdit
Miku was also the subject of one of the most controversial uses of the legal agreements of any VOCALOID producing studio from the Democratic Party of Japan, whose running candidate, Kenzo Fujisue, attempted to secure the use of Miku's image in the Japanese House of Councillors election of July 11, 2010. The hope was that the party could use her image to appeal to younger voters. Although Crypton Future Media rejected the party's use of her image or name for political purposes, Fujisue released the song "We Are the One" using her voice on YouTube, by simply replacing her image with the party's character in the music video.
Another political issue that has haunted VOCALOID is racism. Generally this comes mostly from the Japanese VOCALOID fandom, although it is not absent from the overseas fandom. Some producers like SolPie and VOCALOIDs like SeeU have failed to completely establish themselves because they were Chinese or Korean, as when they uploaded their work onto Niconico they were met with hate or flame. Some producers have managed to tolerate it, but others have been put off venturing into Japanese websites for it and instead use other websites like YouTube or CreCrew.
The racial intolerance issue has also been extended to VOCALOIDs and while SeeU is largely popular in the overseas fandom, within the Japanese fandom she is ignored or receives hatred. Her voicebank has received little use in Japan despite Yamaha's advertisements and the inclusion of a Japanese voicebank.
Legality of Producer worksEdit
Plagiarism and other forms of copyright infringement have caused both drama and serious consequences for those involved. Plagiarism and copyright infringement often overlap each other; it is only partially innocent when it affects the individuals who are unaware of what they or anyone else has done. Some specific cases of plagiarism or direct copyright violation, however, have become particularly famous.
Examples of direct violation of this level issue are the numerous Lady Gaga covers that use as background music her song "Bad Romance". This is one of the most common occurrences of this level of act, and even when granted that the uploader acknowledges the owner, some labels such as Sony Entertainment have been known to defend their property and pull the video. Karen-T, Crypton Future Media's own label, tends to only pull PVs and videos that have been re-uploaded from their original source.
Usually, when fans commit acts such as the usage of illegal VOCALOIDs, other VOCALOID fans may drop the license holder an alert of what videos the user has uploaded. The result may be that the entire uploaders account ends up being removed or closed. Cover songs, therefore, are dangerous grounds that easily violate copyright without care, and even experienced users, such as Giuseppe, have been a target for such acts. However, not all copyright claims have proven to be genuine with some false claims made every so often. For example, SEGA has been known to claim copyright on certain songs such as "World is Mine" even though their copyright of the song is only via the Project Diva games. This does not mean that the uploader was not in the clear, as Japanese producers rarely give permission for their works to be re-submitted outside of their own account and the vast majority find it upsetting.
Some YouTube reprinters also claim their re-submission, usually done without permission, help promote the VOCALOID artists despite upsetting them in the process. Regardless of circumstances, the main issue has always been a result of the fact that VOCALOID itself is a commercial product and open for anyone to use as they please and despite there being licensing agreements in place there is little studios can do to prevent this from occurring.
Yunomi-P was subjected to hate after being forced to admit plagiarism when VOCALOID fans began to notice works by the artist were often from other sources. The stress and drama surrounding the event caused the artist to retire from VOCALOID illustrations. Songs where Yunomi had done the illustrations were targeted by trolls and anti-Yunomi messages. Karen-T also was forced to remove some songs from their listings such as Magnet while the issue was resolved. In January 2011, Japanese boyband KAT-TUN were forced to admit plagiarism against their song "Never×Over~「-」Is Your Part~", after the producer of the song admitted it was influenced by the VOCALOID song "Dye" produced by AVTechNO, after fans expressed their outrage over the similarities of the two songs. AVTechNO went down on record as stating that the band was not at fault and it was the producer of the song who committed the plagiarism.
Often, the drama is uncalled for and unnecessary. Yunomi's plagiarism, for example, was not a direct violation of copyright material as the images were used mostly as source material or were already from stock images, the use of which is common in the art world already. KAT-TUN's indirect violation resulted in both the songs "Dye" and "Never×Over~「-」Is Your Part~" receiving a particularly large amount of attention, with "Dye" itself receiving a previously unseen amount of views on Nico Nico Douga and Youtube.
The song known to VOCALOIDs fans as "Secret" is an example.Originally it was stolen from the account of Ryuuuu by aRth (who stole many other songs from Ryuuuu and other Producers). However, aRth's act of stealing the song results in Ryuuuu's otherwise previously unknown song receiving much fame and attention on a song that otherwise may not have had the attention it had.
Pirated versions/illegal modificationsEdit
- Main article: POCALOID
One of the most common concerns for any software is pirated versions. It has been proven over the course of time that there is little that can be done to prevent software from being cracked by software crackers and the ensuing impact on legitimate copies of the software.
The main concerns about illegal software versions are normally as followed:
- Harm to user computers: Torrents can at times contain Trojans and viral software inflections. When a user downloads pirated versions of software from sources other than the legitimate source, they do not know what they are downloading and if it is safe.
- Scams: There are cases where voicebanks sold over the internet are advertised as legitimate, but when later inspected, are actually cracked copies.
- Lack of technical support: There is no technical support offered for illegal versions of software such as Vocaloid.
- Legality: The act of software downloading is considered a act of stealing; therefore, persons can be held accountable for their actions. As such, illegal versions of VOCALOID™ do not comply with the end user agreements of VOCALOID™.
- Publishing: Work published that utilizes illegal versions of software can lead to shame within the music industry and destroy producer reputation. In the case of the Japanese, shunning is a common reaction; however, this doesn't always work.
- Support for further development: It is only financially feasible to update software if it sells. Piracy can at times take away potential sales from the software.
- Capabilities: Sometimes the capabilities of pirated versions are different from those of legitimate versions of the software. Sometimes they contain unsupported functions, altered interfaces or "bug fixes" that can interfere with software updates or functionality. One of the most common function removals of illegal VOCALOID is the security measures put in place within the software.
- Software conflict: Lastly, illegal versions of a software are known to at times cause conflict with legal versions of the software.
These concerns aside, there is often little overall difference between the illegal versions of software and legal versions. However, due to the moral implications behind support of illegal Vocaloid, it is often shunned within the Vocaloid fandom. The illegal version of Vocaloid has also been known to have been sold as its legal counterpart, so buyers of Vocaloid need to check their software sources when purchasing Vocaloid to ensure they are purchasing from a legal vendor.
The studios are concerned about illegal software because of the loss of potential customers; there will always be users who are unwilling to ever purchase the software as long as illegal "free" versions exists. The impact of the loss of sales can potentially hurt Vocaloid development; the sales expectancy for vocal synthesizers is often as low as 1,000 units. If the software fails to sell, then there is less of a chance that the studio or company behind the software will produce more. For some studios this is an important issue, as the financial support they give to producing software is based on the success of previous releases, and less sales result in less money for the next vocal.
The use of illegal and legal versions often is determined by cultural trends, with some cultures being more likely to download pirated versions than others.
Within the Vocaloid fandom, there is support for alternative software such as UTAU as a legitimate method for producing works, rather than use of illegal VOCALOID™ software.
During the development of Ruby, Syo spoke out about his payment and involvement of her production. He noted he had worked on Ruby for 17 months mostly alone, taking 10 hours out of his school time a week and was only offered 1/4 of the minimum wage amount, plus a bonus if she sold well. He noted the fact that others will steal his work bummed him out.
Cross-synthesis ("XSY") was first introduced for this engine, but was limited to a set few VOCALOID packages. XSY between two VOCALOIDs or two languages were not programmed into the feature's capabilities.
The software can be, however, modified to allow XSY to occur between any VOCALOID of any language, creating a "Frankenstein-esque" vocal with a mixture of both voicebanks' traits. The resulting vocal can often sound very different in comparison to the products used to make it, making it difficult to tell which two were used.
While a number of fans do not see XSY modding as a harmful, several VOCALOIDs do not share the same licensing as each other. An example of two VOCALOIDs with different licenses are Hatsune Miku and IA. As a result, problems are caused when a producer publishes their work, as any work produced using modded XSY will not comply with the End User Agreement license of VOCALOID.
The issue with XSY modding is often overlooked and producers have been known to publish works using modded XSY on sites such as YouTube or Nico Nico Video. Not all producers see it as destructive, compared to issues related to general pirating, because XSY modding does not affect the sales of VOCALOIDs.
Most of the faults with XSY modding are caused by the mathematical equations used to render the results. When two voicebanks not designed to be used together for XSY are used, the softwares calculations cannot handle them so well. These are due to the technical differences between the vocals, such as extra phonetic data, differences in phoneme usages and set up. Whereas in the case of vocals set up to compliment and work well together (Example being Megpoid V4 Native and NativeFat) the results are higher in quality. This is also why in the modified version of XSY vocals such as V3 Megpoid and Megpoid V4 are not set up to XSY with each other. No matter what, the results of mixing the V3 and V4 versions is impossible to get quality results out of due to Megpoid V4 having increased triphones amount and vocal differences.
XSY of vocals between languages (such as English x Japanese) can produce unintelligible or low quality results, as the sounds were not designed to be combined. However, it is noted that the method can produce entirely new sounds for use, for example, mixing V3 Megpoid - Native and Megpoid English can be used to make the Megpoid package recreate other languages more easily.
Due to the differences between any Vocaloid, quality is almost always lost due to this and this means the mixing of two voicebanks not intended to be used together in XSY almost never produces high enough quality results to warrant the modifying of the vocal. As confirmed, mixing two vocals can at times produce a result that is the equivalent of an entirely new voicebank and the greater the differences between the voicebanks the more different the result is then vocals set up to be used with it. Effectively the potential tones the User has access to is double by just mixing two vocals, producing two possible variations in addition to the original two voices. The main attraction to XSY modding therefore falls upon the results that XSY itself offers, making a few voices produce many results.
Regardless of the advances that XSY modding can offer, producers should be aware that works published using the altered software violates the End User Agreement.
As with any part of Vocaloid, job plug-in support gave way to unsupported plug-ins being made. For example, one of the more common plug-ins are language conversion based and many perform basic executions like transforming English phonetics to Japanese and vice versa, a practice that already occurs within the Vocaloid fanbase without such plug-ins. Many of these Job plug-ins are harmless and often are just made to make life easier for users by speeding up the process. Though in the case of phonetic conversions even with supported plug-ins such as Maika's Spanish to English plug-in are not perfect and can even be bested by vocals of that language.
As with any unsupported adaptions of Vocaloid or its functions, users should be aware of devious practices such as hidden Trojan files within the download, though this does not mean all are designed with malicious intent.
Another form of illegal VOCALOID usage is the act of creating an entire new voicebank from existing ones or claiming to create a new product without creating a licensing agreement with Yamaha in the process. This latter issue is much rarer than illegal downloads as it requires the re-recording of entire vocal banks. To date this has yet to be a serious problem. All groups or parties claiming to have done so have proven to be hoaxes, either created using UTAU or XSY modding.
Vocaloids are created using a developers construction Kit or "Dev Kit". It was confirmed without the access code, the Dev Kit can only pack/unpack the script - a full working Vocaloid voicebank is not possible.
Controversy has been caused by users breaking the Vocaloid licence agreement. As explained by the End-User license agreement, a User is responsible for their own actions, and any misconduct will fall solely upon the actions of the user. For example, despite the existence of pirated versions, it is the user who chooses whether to buy legal Vocaloid, download illegal Vocaloid or not buy Vocaloid at all. One of the most notorious cases of user misconduct in the overseas fandom came from a producer known as "Planty-P".
Planty-P was an infamous producer who had a questionable reputation among overseas Vocaloid fans, owed to past controversies. The most serious of this, had been releasing the details of how to get hold of a trial version of Anon & kanon, despite the package not being released yet nor the trial having been declared free for distribution by the developers. Despite his reputation, he wanted to aid promotion of several projects within the overseas community and had a small following of fans as a result. This allowed him to gain a position of respect and trust despite the more questionable actions he had been known for.
The controversy which caused him to leave the community was "Stella". Stella was announced as an upcoming private Japanese Vocaloid being produced by the team PL Tech, for the charity "Americans for the Arts Foundation". It was stated that Stella was private because the expenses needed to release her as a commercial Vocaloid were too great. She had three 'voicebanks' listed: "core", "sun" and "moon".
Over time suspicion grew that Stella might be a fake; a turning point in public view occurring when a discrepancy was spotted in one of her demos, revealing the videos to be doctored. On 19th of Jan 2015, Planty finally confessed that Stella was a fake, leading to large amounts of vitriol within the community. Within hours of the confession it was revealed Stella was created using questionable methods.
Three elements made Stella controversial:
- Parties involved; The scale of trust involved between the project and its members. Three others were involved with PL Tech, who all stated they were in the dark regarding Stella's origins. These members worked on Stella on the basis of trust and friendship with Planty, causing large emotional fallout among the group when the illegitimacy of the project was revealed. The fact that he used the name of a well known charity to promote her development was considered a direct attempt to mislead individuals, as well as the forged emails provided to 'prove' her legitimacy. At the time Planty was also involved with the promotion of an UTAU known as "Jewel", a tribute to Ring Suzune, which also became caught up in the controversies. The sudden news of what happened with Stella led the team behind Jewel to overhaul large parts of their project to distance themselves from Planty, including renaming the character.
- Intentions of Stella; Stella was made by XSY modding. Consequently, any work that had been produced using her was illegal. Planty's aim to sell albums containing Stella's "voice" for charity would have been a breach of the licence agreement, regardless of transparency. The monetisation added to the seriousness of the situation, and Planty's charitable intentions were bought into question.
- Stella's "development"; Though two of the vocals were revealed eventually to be XSY, the third he stated was made using a Vocaloid Dev kit. He declared that he acquired them from a group called "VIBE" and Bplats, Inc., as he had additionally claimed to have aided in the development of the XSY function. VIBE was forced to speak out, stating that Planty's connection to them was fabricated. The team expressed frustration at Planty's statements, and worried that his lie may have put their own project in jeopardy. They confirmed they had not passed over a Dev Kit to him and the 3rd vocal was yet another XSY modded voice result.
Also, investigations by fans during and before the event revealed he hung around illegal vocaloid release sites, so it was presumed his Vocaloid were not necessarily all legal, if any at all. In addition, due to comments made at the time by himself on the Vocaloid wikia, his copy of Rana was also suspected of being illegal. This makes the situation even further problematic as published work did not naturally comply with the End-User agreement. Even if XSY modding was allowed, he would have not been in a position to be legally sell the albums regardless unless all copies of his vocals were legal versions. With doubt of their authenticity, his finical gain from possibly non-legal copies of software would have been dubious as well.
Some of his practices were known before the event, but were not really brought up until after and had no connection with the events at the time. The events brought scrutiny upon his actions, as a result, every action he had ever been known to have made was reveal and exposed. Even though Planty's actions did not lead to criminal charges against him by any legal authority, the subsequent reaction from the Vocaloid community made it impossible for him to continue under the name "Planty-P". Planty is not a isolated case, though it highlights the potential and consequences of fraudulent practices within the fanbase.
It is also important to note that his confession, Planty mentioned he was aware that he was breaching the licence agreement and was aware of the immoral nature of his actions.
Other cases have existed in other areas of the fandom besides Planty. For example, a number of producers in the Japanese fandom have been driven out of the Japanese community due to the large bulk of trolls and harassment that followed the unveiling of their practices, as well as forced removal of work(s). The issue went to the point it was impossible for them to return even under a different name, if discovered and have even ended careers of illustrators and musicians.
Singing vocal clonesEdit
One of the earliest concepts behind VOCALOID™ was to produce a vocal so near-perfect that there would be no need for the original vocalist. Alternatively, there is the temptation to publish a song while crediting the provider, rather than the Vocaloid, as the singer.
According to Crypton, because professional female singers refused to provide voice samples, in fear that the software might create their singing voice's clones, Crypton changed their focus from imitating certain singers to creating character vocals. This change of focus led to sampling the vocals of voice actors and the Japanese voice acting agency Arts Vision supporting the development.
Similar concerns have been expressed within the other studios creating VOCALOIDs, with Zero-G refusing to release the names of their providers. Miriam Stockley - who provided the voice for Miriam - remains the only known Zero-G voice provider who receives acknowledgement on the VOCALOID product page.
For more, see this page on Wikia Answers.
The agreements of the VOCALOID™ license prohibit users from producing works which are considered degrading, are aimed at undermining individuals, or are controversial, but this does not stop users from doing these things.
The result is that some songs, such as "Wash My Blood", become subject to outcry for their lyrics or subject matter. In the aforementioned song, Luka is often perceived as a nun who broke her vows and had sex, then aborted her unborn child. For some cultures, this is simply unacceptable behaviour, making the song controversial if this is taken as the interpretation.
The concern of the misuse of vocals in this way was raised by Miriam Stockley in regards to her Vocaloid Miriam and its release, noting that there was little that could be done once a vocal is in the hands of producers.
Voice recording competitions were held to find and select voice providers for certain VOCALOIDs, such as Aoki Lapis and Merli. Each competition have their own set of rules and voice providers were either picked by the companies themselves or the fans.
In July 2014, a voice provider competition was held for Chinese VOCALOIDs, Yuezheng Ling and Zhanyin Lorra. The first round of the competition was for Ling and required a speech and a singing file to be sent in by email to Shanghai He Nian, which would be later uploaded to Ling's official Weibo account. Fans were able to vote on their preferred voices here.
In August, the six finalists for Ling and additional four contestants for Lorra were revealed. From then on, fans were to vote on the voice actress based on the sound of their voice to suit the two VOCALOIDs. In September, three finalists were revealed for each VOCALOID and they were recorded as sample voicebanks to demonstrate how they sounded in the VOCALOID3 engine. These samples were uploaded onto their official website and once again, allowed fans to vote on the voicebank they preferred. However, due to the samples being labelled with the name of the voice provider, this caused a large number of unfair votes. A number of the voice providers were already well known and popular within the Eastern fandom, which lead fans to automatically vote for them due to their status. Even more, certain fans used multiple IP addresses and notified other people to vote for their choice, many of which complied.
At first, Shanghai He Nian caught this behavior and reverted the votes down to even and fair numbers, but the fans continued to "spam vote". After the first try, Shanghai He Nian made no additional effort to fix the issue. Not only had the company noticed this problem, the "winning" contestants had also caught the behavior and asked that fans should vote solely on how the competitors sounded and not by their fanbase. But this request went ignored.
Towards the last week, the top two contestants for Lorra, Gui Shen Ren and Xiao Lian Sha, had evened out, with Xiao Lian Sha pulling ahead. On the final day of the competition, Gui Shen Ren was given a high amount of spam votes and won. For Ling, QI Inory had over 100,000 votes at the last week, with the other two having much less. On the final day, Yu Wu Yue Shan gained over 100,000 votes as well and temporarily pulled ahead, but was defeated when QI Inory gained another 100,000 votes and became the winner for Ling's contest.
Lucía and LUANEdit
Lucía and LUAN were a pair of Spanish VOCALOIDs unveiled, produced and programmed by the known VOCALOID artist Giuseppe. Giuseppe had previously been involved in the development of several VOCALOIDs from different companies.
Lucía was first introduced in March, 2016 as a potential upcoming Spanish female vocalist, with two concept designs by Riisago and AkiGlancy (aka. EmpathP). However, Giuseppe could not release her commercially because of the lack of support from other VOCALOID companies, and his inability to pay for the VOCALOID license.
When the demos were showcased, they received criticism. Most comments from Spanish speakers noted that the voice of Lucía was too similar to the VOCALOID3, Clara, particularly in her lower notes, leading to several complaints about the lack of diversity in Spanish voicebanks. Other comments claimed that she was too "flat-toned" and that the voice was "quite generic". However, the most notable criticism was about her thick European Spanish accent. When Bruno, Clara and MAIKA were unveiled, the Latin American Vocaloid fandom brought discussions about the strong accent those libraries had, and asked for VOCALOIDs with a more Standard Spanish accent. Lucía also had a heavy Castillian accented, resulting in a general sense of discomfort expressed by both users and fans.
LUAN was unveiled in July 2016 as a brand new Spanish male VOCALOID, with Giuseppe once again involved in the development, producing a demo for the beta voice. The voicebank was intended to have a soft, mellow singing style and a particular, androgynous sound to increase his versatility; however, the demo instantly drew a large amount of criticism. This was predominantly due to the sound quality, the particular timbre that was compared once more to Clara's. More tension was caused when the voice provider was confirmed (via leaks) to be Akuo, a Vocaloid user with a divisive reputation within the Spanish Vocaloid community. In consequence, comparisons with Lucía were made, and some users felt the pair were worse than Clara, Bruno and MAIKA, being too robotic and/or produced to a mediocre standard.
Much of the criticism towards LUAN was not well received by Giuseppe, leading him to lash out against Spanish fans, insulting their taste and credibility. Though Giuseppe had a history of negative opinions regarding the community (leading to a questionable reputation amongst some fans), this was the first time he had directly insulted Vocaloid users. These comments were later deleted, along with LUAN's demo and all related social media accounts. Giuseppe revealed he will not be releasing the vocals and has now withdrawn from VOCALOID as a whole, resulting in the closure of all related social media accounts in his name.
Some producers treat VOCALOIDs like they are dolls, and believe that they can make a VOCALOID do anything they want.
As noted by Crypton Future Media in regards to their Project If..., releases presented as young children risk becoming subject to pedophilic sexual portrayals. In particular, AH Software's first three VOCALOIDs and their VOICEROID products caught attention and were accused of being aimed at the lolicon fandom.
The elements in Hatsune Miku's design were commonplace in the early days of VOCALOID and owe their birth to the numerous derivatives of Hatsune Miku and other Crypton Future Media VOCALOIDs; however, it was not found to be associated with VOCALOID prior to Miku's release. The overall layout of Hatsune Miku's boxart has been largely referenced within the Japanese culture and is often easily noticed. KEI himself has been fully aware of Miku's fame to the point that he has parodied his own artwork of Miku from time to time.
The design influence of Hatsune Miku led to other cultural sources referencing the design. One example is Kasane Teto, an UTAU, being one of the most easily acknowledgable example of this. Her success to pass as a Vocaloid during an April Fools' joke was partly owed to how similar in design she appeared to existing Crypton Future Media Vocaloids, though that was not the sole factor in this. Furthermore, mascots of other music related software and hardware have been created based on the popularity of Miku's own design, such as FL Chan who owes her character creation to Hatsune Miku's popularity.
Her design also influenced other Vocaloids. For example, cyber-style clothes, as seen on the Crypton Future Media VOCALOIDs, were implemented onto Gackpo's design in accordance with their belief that this was a popular element of the current VOCALOID design. Macne Nana was created as a "sound princess" for the Mac computer; her working name was actually "Macne Miku" and she took influence from Miku herself.
Despite its influence on popular culture, by 2010, Miku's mascot was the source of some controversial opinions within the fandom, causing incidences of possible influence to be met with dispute and at times negativity.
In 2012, a Chinese virtual singer concert hit a particular point by Miku fans for a "Miku-esque" appearance of the main singer Dong Fang Zhi Zi. The conclusion was made that she was an attempt to 'cash in' on Miku's popularity by borrowing elements from her design and using the same virtual diva concept. This caused a negative reaction from Miku fans, and videos of the singer received several thousand "dislikes". This proved that singers who are similar to the point where fans believe they are 'ripping off' Miku are met with controversial outcry. However, designs are not restricted to post-Miku designs, but other pre-Miku designs on occasion are brought up.
However, this is not always a common reaction as many companies use "Formulas" within their products, these are based on the popularity of certain trends from popular culture. "Formulas for success" are found in books, movies, games and television already and are also often applied to something to allow the developers time to focus on other aspects of the end product without having to worry about the consequences of experimenting with unknown or complex elements.
As VOCALOID becomes increasingly more popular, more VOCALOIDs are made, and therefore finding new and unique designs becomes more difficult for studios. Therefore, the creation of VOCALOIDs whose designs resemble other VOCALOIDs may occur more often. Some cite CUL and Big Al as such an example.
Despite the fact that artwork, also, has no effect on VOCALOID vocal performance, there are many who judge a VOCALOID upon their initial release getting excited or not based purely on design. Therefore, there is also a risk of a VOCALOID being judged by vocal performance based on the design, with fans downgrading VOCALOID vocals they like because they dislike the design. VOCALOIDs who have been effected by this include (but not restricted to) Sweet Ann, Megpoid/GUMI and Bruno.
Most VOCALOIDs have been noted to have pale or Caucasian coloured pigmentation of the skin during both the VOCALOID and VOCALOID2 era of voicebanks, with the latter being the most noteworthy because of its more driven marketing of VOCALOID character design. As with all character design issues, some minority groups feel left out because their ethnic group appears to not be represented in VOCALOID. This is a hard reality for them, because despite the fact that many influential musicians were "black" or of African descent, darker skinned musicians have difficulty getting recognized despite their influence on modern music genres.
For Japanese VOCALOIDs, this is all a result of their culture. In general characters tend to have pale pigments and therefore, since the avatars of their VOCALOIDs are released into their culture, their VOCALOIDs follow standard anime influenced designs.
Skin colour is an issue some take very seriously due to indifferences between racial representation and attitude towards ethnic groups. In Vocaloid itself, it's now been witnessed that this goes far to the point that it can determine if a fan enjoys a VOCALOID fully or not. Some cited their reason for liking the VOCALOID Merli upon her design unveiling because it featured "the first VOCALOID with dark skin". Merli was not the first VOCALOID to be released, however, with darker tones, as both Clara and Bruno originally were going to be released with dark toned skin. However, after a competition to pick out new designs since the fans reacted badly to the originals, Bruno and Clara are now depicted with paler skin; despite there being a variety of choices put forward by fans during their redesign contest. The result was based on a majority vote.
The most serious concern falls onto avatarless vocaloids, which often leads one particular design to be favoured over others or something new. In the case of Leon and Lola, this is more true, despite being confirmed in 2009 that their vocals were both provided by Black vocalists and that they were released to sing in genre of music known as "Soul". Some fans have expressed difficulty at viewing the avatarless vocaloids as anything but their most common design, particularly Leon. While there are pale skinned singers in Soul music, particularly British soul music (and being made by a British company, this is also plausible for LEON and LOLA to not be featured with darker skin), but these singers are still a small percentage of the singers within the Soul genre.
In the case of LEON's most common fan design, this design was merely based on the fact that LEON was made by a western studio as opposed to a Japanese one and the design is based on a stereotypical westerner design. Also, the design was conceived independently before it was known both Leon and Lola were voiced by "black" singers.
When RUBY's character design was revealed on July 4, 2015 at Anime Expo, Syo stated that PowerFX had gone against his wishes. He and Natasha Allegri worked on a final design for nearly a year, however, Anders relayed to them that PowerFX wanted a new illustration to appeal to their EDM audience. Syo stated he had been promised control over the design and was shocked to see a new illustration premiered at Anime Expo. The image was concealed from the developers until launch.
Anders, who acted as a relay between PowerFX and Syo, countered this, saying that he had not promised Syo complete control over the design. When Syo asked if his friend would be allowed creative input, Anders said: "I can't say that she will have the final say, but she can certainly come up with ideas for the design and if they're good we'll use them." This conversation occurred during December 2013, when Anders was communicating with Crypton about RUBY. After it became apparent Crypton would want more control over the design than they were comfortable with, Anders suggested moving to Zero-G or PowerFX because he believed they would offer them more freedom. Anders asserted that no promises were made and no contracts signed beyond NDAs.
PowerFX revealed they never received Misha's design or knew it existed. An email sent by Bil Bryant indicated that he had not seen Syo and Misha's concept until after Anime Expo. On July 7, Bryant stated that he had not been aware that Misha was Latina and wanted to be portrayed as such, nor that Syo had been promised input on the design.
The art that appeared was commissioned 6 months before its unveiling. PowerFX officially stated that only one artist was requested and paid to do artwork. The aim was to appeal to digital music makers and EDM producers while not totally alienating the existing user base.
After negotiations, the miscommunication between PowerFX and the developers was resolved. Misha's design would appear on RUBY's software installer and boxart. Misha stated that they believed that Anders had not intentionally misled any of the parties. She revealed that early whitewashing accusations were due to miscommunication, as they had never clarified that RUBY was supposed to represent her ethnicity.
A similar miscommunication occurred when RUBY's boxart was revealed. Once again, RUBY's team stated that the boxart PowerFX used was not the design they had created or intended. It was discovered that PowerFX had never received the boxart that the team designed due to the team members' miscommunication, each believing the other had already sent it to PowerFX. When PowerFX did not receive the boxart they had agreed to, they created their own. This contained factual errors and poor design, leading to fan outcry.
Vocaloid China controversyEdit
YANHE's voice was recorded prior to the choosing of an avatar and was intended to be the voicebank for VOCALOID CHINA PROJECT's runner up, Zhiyu Moke. YAMAHA became suspicious when money lent for the project disappeared and it was found that the president of Shanghai HENIAN Information Technology Co. Ltd. (SHN), Ren Li, had spent the funds at a brothel. YAMAHA removed the rights to all of the VOCALOID CHINA cast members and SHN was left with no other choice but to host a second design contest to find a new character for their completed voicebank. This caused an outrage from Chinese fans as they assumed the original cast members were to become VOCALOIDs after Luo Tianyi in addition to a lack of a proper explanation for the second contest. As a result, the fans shunned YANHE as they believed that she was replacing the original VOCALOID CHINA cast, which hurt her initial sales and reputation. The VOCALOID CHINA PROJECT cast members were unable to become VOCALOIDs until early 2014 when SHN repurchased the character rights from YAMAHA and ended collaborations with Bplats, Inc. to develop Chinese VOCALOIDs on their own.
Later in 2014, Flexin, a Chinese MMD user, tried to alert the rest of the Chinese fans about what happened behind the scenes. At Ren Li's request, SHN responded to Flexin with a lawsuit and claimed he was spreading false rumors. In the end, Flexin withdrew his statements and kept silent about the controversy. Other fans who discovered the controversial secrets avoided speaking about it due to SHN's reaction to Flexin.
When Ren Li "resigned" as president (in truth, he was fired for his actions), news of what had happened was able to be spoken and it was revealed by G.K., a former worker for SHN and member of Vanguard Sound.
Product results and expectationsEdit
A cause for controversial concern is aimed at individual vocals. Some a product of myth or bias, the Vocaloid can be misjudged by large audiences if misconception or opinion is taken too heavily as fact.
Sometimes the quality of a Vocaloid is not always felt equally across all areas of vocaloid. "Quality" within vocaloid is determined by a number of factors.
It is important to remember that Vocaloid itself was designed to be a professional user product; as a result, even the lowest quality vocal is considered by synthesizer standards to be a HQ product. The engine itself, however, can have impact on the vocal and not all vocalists are suitable for the engine, as Gackt was warned when he made his first recordings. In addition, the compatibility with the engine version can also render a vocal able to excel in unexpected ways. Despite its lower quality, the original Kaito product had high compatibility allowing it to excel as a vocal despite being a lesser quality version then the Kaito V3 version.
Regardless of all these factors, "Quality", or even a lack of, does not always mean the usefulness or results of the vocal are more guaranteed then another vocal of a different capabilities.
The following is a few (but not all) of the examples of possible contributions to the quality;
- Style; Prima was built for opera and when used to sing in the opera vocal style will excel over a Vocaloid not built for this style of music. This means, Prima produces, for her purpose, a high quality result. Using her for Soul, would producer a much lower quality result as she does not have a vocal built for this style.
- Vocalist; professional vocalists over voice actors, with professional voice providers such as Gackt producing much higher quality results then an amateur singer such as Haruna Ikezawa. This is due to issues such their training or experience singing, increasing their capabilities to match pitch, tone or sound.
- Version; the original VOCALOID engine did not produce good results compared to later versions of the software. Its vocal range capabilities were limited and it had a great amount of digital noise. the original Kaito vocal produces a much more low quality result then the KAITO V3 package. This is the same also for voices such as Megpoid and V3 Megpoid - Native, wherein the "Native" V3 update is much higher quality then the original Megpoid vocal. Updates to the engine add new functions as well as often improving existing ones and older vocals may not have access to them even if they are imported into the newer engine version. This is not to say that a vocal does not experience any difference once imported into a newer engine; Vocaloid2 vocals imported into Vocaloid3 displayed a large quality result once imported, though some experience errors they did not possess before such as Sonika.
- Script; Prior to VOCALOID4, problems with English vocals were reported. This is owed to the fact the standard English script produced in the pre-VOCALOID, contained errors. This was the script sold with every Vocaloid Dev Kit script even up until VOCALOID3, therefore every English vocals produced using this script was sold with the errors caused by it built in. The result was every Vocaloid contained incorrect sounds. Both Ruby and Cyber Diva were created using different built scripts intended to improve on the standard original English script offered by Yamaha with the Dev Kit.
- Pitch; pitch layering also adds to quality by giving a strong voice to build the vocal upon. The layers of pitching divided by at least two known types of pitch; stationary and articulation. The number of pitches per vocal varies; Maika for example has 6 pitch layers in total overall. The way the pitch layers affect the performance and quality of a voicebank is related to the synthesis engine itself, and the mathematical interpolation and extrapolation processes used for reconstitute the full range of the voice. In theory, with more pitch layers, the voicebank should behave more similar to its voice provider, although doing this also means an increase of the samples required thus incresing the filesize and computational resources needed for the voicebank. Most English vocaloids pre-VOCALOID4 were confirmed to have only 2 layers of articulation, the total layers of pitching for Yohioloid was confirmed as 3.5. In addition to number of layers, it is important to note that, a Vocaloid will end up sounding off pitch if just 1 of these layers is incorrect, in the same way an incorrect data input of just 1 variable can deviante the whole result of a regression model. A known example is Lily's VOCALOID2 vocal had one of her pitch layers too high. As a result the whole Lily VOCALOID2 voice was off pitch. This was later corrected with V3 Lily by creating a new voice.
- Phonetic data; In addition to what sounds are recorded via the script, what phonetics are constructed for the voicebank are also factored in. Phonetic data is recorded when the vocalist reads their respective languages script, but some advanced samples may have to be constructed by merging two or more samples together. Compared to pitch, these do not have an impact on the entirety of the voice, but do have an impact on constructed words. The construct samples are normally the "diaphonetic" and "triphonetic" samples. Prior to Vocaloid3, only diaphonetic data was included in vocals and the engine did not factor in triphonetic sounds. These were introduced in Vocaloid3 and resulted in a higher quality voicebank because they smoothed out the vocal results. This became a major factor in why there was a quality leap in Vocaloid 3 vocals over Vocaloid 2 ones. As noted by Internet co., Ltd, however, an increased amount of them does not necessarily result in a higher quality vocal. Triphonetic data does not always require new recordings and it is possible to construct them with previous recordings.
It is important to note that several factors are not a result of a vocaloid's quality. Unfortunately sometimes these become myths among the fans or users and have even been known to cost a Vocaloid sales such as the following;
- No matter how good the Vocaloid is, if the user cannot use the vocal well, the results will be not much more then a LQ result. This can be because be because of any number of factors such as they lack the skills, experience or mean to fix a problem with a voice.
- If the user tries and to make the Vocaloid do something it wasn't built for, this can produce a reason of lower quality then a Vocaloid vocal which was built for the same purpose. This is mentioned in Style, but applies to even things that are outside of Vocaloids domain of design such as basic speech or forcing a Vocaloid to sing in a language it wasn't built for (particularly if there is a large difference between the two).
- In addition, even though the user may enjoy the way a vocaloid sounds, this does not always mean the vocaloid is high in quality. Appeal falls purely down to a individual's opinion/taste and these can ignore the true traits or quality of the vocaloid in the process, even expressed with ignorance or naivety of the truth behind a vocaloids vocal results. The Kagamine Vocaloids represent some of the notable examples of quality issues with their VOCALOID2 vocals reflecting a variety of varying quality per package from the original LQ "Act1" package to the HQ "Append". Despite the issues, they remain popular and their fans support them regardless of their past issues.
- See also "Languages".
One of the oldest controversial towards individual Vocaloids is regarding the language that Vocaloid sings. Some fans report adoring one set of vocaloids for their language, only to put another down for the way that language sounds to them. This can be because they feel that language has more appeal or because they speak that language. The result is that entire sections of the Vocaloid library of voicebanks get ignored or generally shunned for their language alone and myths made up purely by fans about them plague these vocals.
As mentioned previously in "Quality", language has become one of the factors of myth among fans. However, the story of why this myth exists is not short.
The issue with languages began in VOCALOID2. During the VOCALOID2 era, the franchise became popular due to Hatsune Miku's fame. Vocaloid became associated with the Japanese culture and due to the overwhelming popularity the focus shifted from English vocals due to the high profile of Miku. Despite having more vocals released for the English version at the time of her release, and having done well for themselves, Miku was a record breaking vocal and sold a an overwhelming amount of units. This was never met by future Vocaloids from Japan, but even then the next few releases enjoyed success due to her popularity. Her success also resulted in the VOCALOID vocals Meiko and Kaito being "forgotten".
By the time fans became aware of there being other vocals beside Miku, Rin and Len, they had a overwhelming popularity. Though Meiko and Kaito were able to find their place among the growing Japanese fans, there was only a few able to use the English vocals. Further more overseas, western fans had grown to know Miku and other Crypton Future Media vocals and did not necessarily even know there was English vocals.
When finally the English vocals began to receive interest again, there was a mixed reaction. Since western fans had never been able to "hear" a Vocaloid's true nuisances due to language nativity as a result of listening to Japanese Vocaloid, they were able to hear it for the first time. Often they were branded as "Low quality" when compared to the Japanese vocaloids. This was not always true, despite how strong the myth was at the time. For example, Sweet Ann and Prima produced high quality singing results due to being voiced by professionals, while Hatsune Miku and the Kagamine package were built to act as characters. Therefore, both English vocals produced overall better singing results. In the case of English Vocaloid was true that English Vocaloids contained errors, this was owed to a incorrect script sold by Yamaha with the English language Dev Kit.
However, in contrast to what many western opinions were, often the Japanese producers reported contrasting results in regards to the same set of vocals. For example, Tonio was praised for having a "beautiful" voice by Japanese English vocaloid fans, whereas some westerners felt his voice was "ugly" due to its deepness. Japanese producers did not always see the same clarity issue with regards to voices such as Big Al or Sweet Ann and felt both were fairly clear, while western fans tried to claim the entire English vocals were unclear and Japanese vocals were much better in terms of clarity. This despite the fact this became a factor in why many Japanese songs had to include subtitles because of the lack of clarity of a Vocaloid in a particular song.
In the case of Megurine Luka's VOCALOID2 voice, some users felt her Japanese voice produced better singing results than her English voice. This was partly owed to Luka going to sale with sounds missing, resulting in the English voice being low quality even compared to some other English vocals. A myth was built within some fans that the Japanese Luka vocal was better at produce English results than Luka English. In turn, this fueled an on going myth that existed about Japanese language vocaloids and their capablities in languages. Despite this, the voice was set up with most of the basic sounds and therefore was capable of using most sounds within English Vocaloid. This meant, despite its quality issue, it still produced better English results then her Japanese vocal for the purpose.
This is not isolated to just English Vocaloids and has often been more of bias opinion rather then truth. However, the English version was most criticized because of how overall different the approach was between English and Japanese vocals were. The English language itself also has a reputation for being one of the most difficult to recreate in synthesizers due to its lack of distinction per sound. resulting in a sound varying much greater then most other languages and many other factors work at play. For instance, it is possible to find a voice intangible for its accent alone if the listener is not used to the accent within the voice.
As other Vocaloid languages entered the market with VOCALOID3, users began to be able to hear some of the problems repeated in other languages that plagued Japanese and English, as simple factors such as the VOCALOID engine noise were able be heard across all versions. In addition Vocaloids like Oliver and Avanna gained much popularity and many Japanese vocals like Gumi and Miku were given English voice version. As a result, the overall attitude towards the English version had improved by VOCALOID4.
Bias towards languages, also is brought up in regards to Spanish vocals. Since you can get a "decent" level of Spanish from Japanese vocals in contrast to making them sing English, some Spanish fans do not see the point of Spanish vocals being produced. This appeared among Spanish fans and their complaints during the controversy over Lucía and LUAN, though was not a major argument against the pair. In contrast to the structure gap between English and Japanese, the Japanese vocals used for Spanish is no where near as large due to the two languages having many similarities in basic word construction. Still, the quality even here is higher among Spanish vocals then Japanese ones used for Spanish.
Though quality has been improving on some of the other languages, this does not always get noticed and some even claim there has been no increase in quality on some such as English vocals. The other factor is that a large majority proportion of the western fans who are interested in Vocaloid are already fans of manga and anime, so the Japanese vocaloids appeal more to them. In addition, strong releases such as Maika bring into question the reluctance towards non-Japanese Vocaloid producers and fans. This is notable even when the user was a native speaker or adapt enough to use a vocaloid in that language.
However, regardless one factor remains; a Vocaloid is not LQ just because it is released for a particular language nor is one HQ for the same reason. In short language of a Vocaloid alone does not factor in quality at all, though the approach to the language (its build and version) and other factors previously mentioned in "Quality" do factor in.
Studios are not without fault and sometimes have made claims about their products, which often are true, are not always as good as they seem.
One VOCALOID who was a subject to controversy claims was SeeU. In the early VOCALOID3 demos, SBS Artech claimed you could use SeeU's voicebank to create English, even though it was set up for Korean and had no support for using it to make English. They even went so far as to label her a "trilingual" VOCALOID, even though with only two voicebanks she was bilingual. Upon inspection of the voicebank, some producers discovered that SeeU had several English phonemes that were not needed for Korean and SBS were again heavily criticised for both the claim and inclusion of phonetics SeeU didn't need. Inclusion of unneeded phonetics is nothing new as every English VOCALOID2 after Prima's release (with the exception of Luka) was given the rolling r phonetic data "R" because Prima couldn't do opera without it. Despite this, this led to jokes about SeeU being made for "Konglish" rather then "Korean" and as of note, criticism for the focus on her English capabilities against the Japanese capabilities she was given thanks to an additional Japanese voicebank.
Eventually SBS Artech addressed the language issue by confirming that they would make an English voicebank and stated the reason for the past claims was they wanted to release her with an English voicebank but also wanted to meet the VOCALOID3 release. In the end they did not have time to make a English voicebank and included the phonetic data to allow SeeU to create English as most sounds for English were already in the Korean language. Despite this, as many pointed out, the way the language is structured and the fact she is largely not smooth results in English that is either choppy/broken or odd sounding thanks to a Korean accent. They then confirmed a English voicebank that will allow her to fully do English.
SeeU was not the only vocaloid to be criticized for this. SONiKA was also noted for having a remark about with editing how she can be made to sing other languages. This is true for any Vocaloid vocal, though she was criticized for having it on her product page when no other Vocaloid had mentioned this at all.
MAIKA also, due to a custom dictionary, was released with the 16 sounds missing from Spanish that would allow her to sing in Catalan. However, these allowed her to also sing more closely to other languages such as English, Portuguese and Japanese. However, she will not sound like a native speaker.
There has been some noticeable concerns with the information on her from the Mikunopolis website in regards to Vocaloids by Crypton Future Media;
- On the Mikunopolis website Luka's profile contains the statement; "In the past, creating songs with English lyrics had always been somewhat awkward, but Luka is able to sing both in Japanese, English or a mix of the two - a worldwide virtual singer". While it is true it had always been awkward to use a Japanese VOCALOID for English, there had always existed native English capable VOCALOIDs so the English language had always been easy to access.
- As mentioned on Luka's product page, she has a number of issues with her English voicebank that leave some English speakers questioning her English capabilities overall. So ironically while she had the best English level of any Japanese based VOCALOID at the time of her release, just how much lack of awkwardness there is to her vocal results is questionable.
- Luka was aimed at the Japanese speaking market and was originally not intended to be a world wide release (hence the lack of an English interface).
- "The Kagamine twins are well known for their very clear and precise vocals". Considering their history of lack of clarity and/or pronunciation problems, as well as Act 2 of the software missing a pronunciation entirely, this is a little bit of an exaggeration on the webhost's part. Also the statement, "Just with a few tweaks here and there they could sing almost any other genres as well," is somewhat questionable due to their reputation of requiring previous experience to use and often at times needing more than just a few tweaks to make them work.
Both GUMI and MAYU had a song where they sung in "English" using their voicebanks ("Fly Me to the Moon" and "Dreamin'", respectively). Except, both were using a Japanese voicebank to sing in English. The original "Fly Me to the Moon" demonstration was more welcomed with Megpoid than the second version done for her V3 update. Back then, many fans were interested by her more accurate English pronunciations than past Japanese VOCALOIDs, as a common western practice was to use Japanese VOCALOIDs for English because some fans preferred their vocals over what the English vocals offered. However, by the second time the demo appeared it was questioned why Internet Co. was persisting with the same demo song when its also possible to do a version with Japanese lyrics. For MAYU's demonstration, English fans have also complained that they can barely understand MAYU's "English".
Some argue that showing a demo with the VOCALOID being forced to sing in a language they were not designed for is a bad demonstration of the VOCALOIDs capabilities, since it can generate a wrong impression of the real strengths and weakness of the voicebank, either making a voicebank appear more flexible than it is in reality. It can end up showing some flaws that aren't present in its original language as the voice is pushed away from its natural "safe zone". This wrong impression can be worse if the listener isn't a native speaker, or at least someone with a deep knowledge of that language.
This is particularly notable as the Producer of the song must be adapt with phoneme adaptations enough to manipulate the vocal to mimic another language. Demo makers such as Giuseppe have been known to be able to make Vocaloids sing in several languages they were not built for.
Other issues fall upon the tuning of the vocal. In regards to demos created by producers such as Cillia, the way they tune their vocals results in an unnatural portrayal of the vocal. It is not always possible to hear the VOCALOID's true traits for this reason. Other producers have a habit of using the wrong pitch as was heard in a few of Lily's V2 demos because producers are accustomed to using other vocals like Hatsune Miku.
- Zettai Ryouiki - Pixpedia's Article (English)
- TVTropes: Zettai Ryouiki
- VOCALOIDs Unofficial illustrations