Vocaloid Wiki
💻 Technology article work in progress. What is being worked on: Idea is to redirect all non-VOCALOID tech to this directory page and link related wikis when avalible. Discussions here
For information on how to help, see the guidelines.  More subjects categorized here.

Top Directory Music directory Merchandise directory Technology directory Synthesizers directory Tutorial directory Websites directory

Sub-root article guide to direct readers to information about various software.

Singing Voice Synthesizers (Lyric Input)[]


AquesTone.JPG Software-icon.png

AquesTone is a VSTi plugin developed by Aquest, there are four voice options: Female F1, Auto F1, Male HK, and Auto HK. UTAU Uta Utane (aka. Defoko) uses Female voice 1 as a source for the UTAU default voicebank.

The female voice is credited in Pop'n Music ("Chilt Featuring AquesTone") and other BEMANI rhythm games like Dance Dance Revolution.

External links

Examples of usage
  • "ちかてつ (Chikatetsu); Subway" by Calmwind01x ft. TILT (AquesTone)
  • "ミルキーポケット (Miruki Poketto); Milky Pocket" by Calmwind01x ft. TILT (AquesTone)
  • "Time's Intersection" by Calmwind01x ft. TILT (AquesTone)


Filoyo.jpg Software-icon.png

Alter/Ego is a : WORK IN PROGRESS



External links

Examples of usage

Emvoice One[]

Filoyo.jpg Software-icon.png

Emvoice One (formerly SING and working title SOHO) is a software by Emvoice for OSX and Windows. The software will allow music producers to create vocals without the need of a singer.

External links


Filoyo.jpg 7e61cc2f-02ff-e011-98e6-0025902c7e73 2 full-1-.png

Cadencii is a voice synthesizer program and frontend for several other voice synthesizers: VOCALOID, VOCALOID2, UTAU (or rather, UTAU resamplers), STRAIGHT with UTAU, WORLD, and AquesTone. It has its own engine (written by shuraba-P / HAL) named v.Connect-STAND. The source code used to be hosted on SourceForge.JP, but has moved to GitHub.

Cadencii's interface emulates the VOCALOID interface very closely. The piano roll can also change color depending on the synthesizer engine being used; for example, when VOCALOID2 is selected as the synthesizer, the piano roll becomes grey and green, and when UTAU is selected, it becomes blue and pink.


It is currently officially available for Windows and Mac OS X. The latest version available is v3.5.4 for Windows and v3.4.1 for Mac OS X. There is also an unofficial port in the Debian repositories at v3.3.9.[1]

On Mac OS X and other Unix-based operating systems, Cadencii requires Wine to be useful. For OS X there is jCadencii, a JAVA version of Cadencii for OS X. The JAVA frontend is actually compiled from a lot of #ifdef JAVA in the C# Code.[2] If you take a look into the OS X .app bundle, you'll still find minimized wine bundles to support the dlls with native Windows code.

It's not possible to use Mono to build and run the CSharp program cross-platform yet, since there is some native Windows code. Workarounds may be possible in a way like pipelight, a method to provide better SilverLight/Flash support in Unix(-like) systems using wine.

Cadencii has its own file format, .xvsq (not to be confused with VOCALOID3's .VSQx). Cadencii can also import and export other synthesizers' project files, such as VOCALOID's VSQ and UTAU's UST file formats. Notably, it can export as MusicXML, making it a popular choice for creating files that work with Sinsy, especially in combination with its ability to import VOCALOID and UTAU project files.

Besides simply being able to import VSQ and UST files, Cadencii can also read the pitchbends (old pitchbend type/Mode1-only for UST), which can be used in combination with Cadencii's ability to use multiple synthesizers, as seen in the example video below (Tori no Uta).

  • License
    • Cadencii is free software.
    • Source codes of Cadencii are the copyright of kbinani.
    • It's stated that Cadencii is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
    • Cadencii is released under the GNU General Public License, version 3.0.
External links

Examples of usage
  • "CadenciiをインストールしてSinsyに歌わせるまでの録画" by nwp8861 (Mimirobo-P) ft. Sinsy; export tutorial
  • "Tori no Uta" by FastSpeedy ft. Nukupoid; UTAU voicebank


Cantor.png CANTOR1.jpeg
Cantor2.jpg CANTOR2a.gif

Cantor (and its successor Cantor 2) is a synthesizer developed by VirSyn and utilizes Formant synthesis. It was released at the same time as MIRIAM, and was a rival to the original VOCALOID software. It was sold for £199.99 including VAT which was said to be expensive for its time, but that was due to offering a far greater selection of vocals.

Unlike VOCALOID, it had 50 voices for use, which was far greater than what VOCALOID had to offer upon its release. But because of its design it was more intended to resemble a virtual instrument than a virtual singer. It had capabilities for both German and English, and supported both Windows XP and Mac OS X, as opposed to VOCALOID which was restricted to just Windows XP. However, similar to VOCALOID, it worked as either a stand alone or plug-in and supported ReWire.


As with the early version of VOCALOID, Cantor was a victim of change in the way indie music was produced, and by Cantor 2 was being impacted by this change (alongside VOCALOID) as the demand for synthesized voices began to disappear.

The final version of the software, Cantor 2.1 was released on February 6, 2007. Though updates have since ceased, the software remains on sale, and is confirmed compatible with Windows XP/Vista/7 and Mac OS X 10.5/10.6 versions. A demo is able to still be downloaded from VirSyn's website, although it requires an eLicensed Syncrosoft dongle to be able to download the demo and the full software version itself.[3] The special dongle was included with the boxed version of the software, as well as other software sold by VirSyn. It was not able to be bought with the downloaded versions but could be purchased separately.[4]

Cantor 2 later became available for purchase on the Crypton Future Media Website in 2008, and a demo was released on Crypton's Youtube account.[5]

External links

Examples of usage
  • "Tears of an Angel" by Mike Oldfield ft. MIRIAM + CANTOR


Filoyo.jpg Dandyinterface.png

Chipspeech is a synthesizer developed by plogue. A retro styled vocal synthesizer created to reproduce vintage vocal synthesizers released in the 20th century. The software acts as stand alone or plug-in software to various DAWs. It can sing and talk and supports two languages: English and Japanese (though Japanese currently does not have talk capability). There are various means to adjust the vocal in the way the user wants, creating some very unique sounds and results.

The main strength of the engine is it can have a multiple number of synthesizer styles built into it. While some like Dandy 704 or Lady Parsec are based on samples recreation vocals much like VOCALOID or UTAU, others like Dee Klatt do not have samples and are fully synthetic sounds. They are instead based on direct input, meaning they recreate to various degrees faithful recreations of their engine, with Dee Klatt's being a fully rendered "live" feed back. For example, along with 5 new vocals since release, the Circuit Bending feature was added in version 1.032. This mimicked the circuit bending method of getting unique vocals from the classic old chips, which allowed for 'one of a kind' results.

As they are based on old technology they are all dated vocals and do not reflect the modern sounding ones. Therefore they at best barely sound human and do not attempt to even sound remotely like an uncanny effect. This can be off putting for those inexperienced with synthesizers of the past who and those who want realistic sounding vocals. The vocals themselves are difficult to find otherwise in their original chip forms, some being impossible to find due to how old their technology is.


Plogue Art et Technologie, Inc is a small company specialized in chipbased technology and aiming to recreate chip effects and sounds and apply them to modern technology. Chipspeech was one of their many ideas they planned for years to create, however, they were held back on the means to execute it. Though they had the technology, they lacked the knowledge. The Chipspeech software was born after they hired a member of their development team with Phonetic knowledge, allowing them to come up with ideas on how to create a vocal synthesizer technology.

The software acts as stand alone or plug-in software to various DAWs. It can sing and talk and supports two languages: English and Japanese (though Japanese currently does not have talk capability). There are various means to adjust the vocal in the way the user wants, creating some very unique sounds and results.

Unlike VOCALOID, CeVIO or other synthesizers words are typed as sentences. If used as a plug-in into a DAW, the synthesizer will play each sentence in the keyboard. The software is easy to use but requires some work to master.

The software currently has 12 characters, with an optional 13th being able to once downloaded known as "Daisy". Daisy was able to go into Alter/ego and is the only character able to do so. Daisy is, however. currently retired and unavailable for download. The other 12 characters are based on various synthesizers, such as Rotten.ST based on Atari ST’s STSPEECH.TOS or Dandy 704 based upon the IBM 704 computer. With the exception of Dandy 704, all have a cyberpunk-style character illustration representing each voice, with Dandy 704's instead being Steampunk. There is a basic "storyline" between the characters and a "canon" as per say. However, this does not impact the software itself.

External links


Filoyo.jpg Software-icon.png




External links

Examples of usage


Filoyo.jpg Software-icon.png

DeepVocal is a : WORK IN PROGRESS



External links

Examples of usage


Filoyo.jpg NiaoNiao.png

NIAONiao Virtual Singer (袅袅虚拟歌手 Niǎoniǎo xūnǐ gēshǒu) is a Chinese voice synthesizer program developed by dsound.[6]

The default voicebank is named Yu Niaoniao (余袅袅), however, users can create their own voicebank and take advantage of its larger file feature. NIAONiao can import MIDI files, VSQX files (VOCALOID3 only), and UST files, export tracks as the "Niao" file format (*.nn), and can render vocal tracks directly as WAV, MP3, or MIDI files.


The principle is the same as UTAU. Many Chinese fans have begun producing vocal banks for both programs. The voicebank format for NIAONiao is radically different from UTAU, the main difference being that the voice samples are packed in a large file. Due to being made for a Chinese audience, NIAONiao can have final consonants in a voice, also unlike UTAU. NIAONiao is not exclusive to singing in Chinese, just as UTAU is not restricted to Japanese. For example, a NIAONiao voicebank for Nagone Mako can be downloaded from the official NIAONiao website.

The interface is much closer in similarity to VOCALOID and (unlike UTAU) there is a panel at the bottom for controlling parameters, pitchbends, and vibrato.

External links


Filoyo.jpg Software-icon.png

NEUTRINO is a Japanese neural voice synthesizer program developed by SHACHI.[x]

It is compatible Windows, MacOS, and Linux. Web browser compatibility is based in Google Drive.



The user uploads data in the MusicXML format, which the NEUTRINO program reads to output a WAV file of the generated voice. Gender factor, vibrato intensity, and pitch shift can be adjusted prior to output.[x]

MusicXML files can be made in Symphony Pro, Cadencii, MuseScore, and finale NotePad.

  • Voices: NEUTRINO has two known voices: Tohoku Kiritan, and Yoko. NEUTRINO supports Japanese.
    • Tohoku Kiritan (東北きりたん) is a Japanese-only voice.
    • Yoko (謡子; f001j) is a Japanese-only voice.
External links
Examples of usage

  • "AIきりたんの『キリトリセン』 無調声ver" by 96s_kM4osM ft. Tohoku Kiritan
  • "AIきりたん調声有無聞き比べ" by 96s_kM4osM ft. Tohoku Kiritan

Piapro Studio for NT : WORK IN PROGRESS[]

Filoyo.jpg Software-icon.png

Piapro Studio for NT is a : WORK IN PROGRESS



External links


Filoyo.jpg Renoidplayer.png

RenoidPlayer is an online synthesizer created by g200kg. It is compatible with various web browsers and also works on the iPad and iPhone as long as they run iOS 6. Note: Safari running on Mac OS X cannot export the final product. Use Chrome/Firefox when exporting.


It has a built-in sequencer, unlike Sinsy. However this Editor can be a little confusing to new users, but for those experienced, there is MML available for use. RenoidPlayer is similar to AquesTone in the fact that pitchbends and other flags and tunings are ignored. This applies to all voices. Currently, there are only 8 voicebanks available to use, many of which are UTAU voicebanks which the author got permission from their creators to make RenoidPlayer compatible voice libaries for.

  • Data import: RenoidPlayer can accept file-drop as a sequence data. (Note infomation and basic lyric informations only. Pitch-bends and other additional informations are ignored.)
    • VOCALOID: VOCALOID Sequence files (.VSQ/.VSQx)
    • UTAU: UTAU Script files (.UST)
    • CeVIO: CeVIO Creative Studio files (.CCS)
    • MusicXML (.XML)
  • Parameters
    • Volume - Output volume control.
    • Transpose - Output pitch control, semi-note step.
    • Portamento - Pitch change smoothness control.
    • FormantCorrection - Keep formant independently of output pitch.
    • Formant - Formant control.
    • Humanize - Add some fluctuation to pitch and dynamics.
    • VibratoDepth - Amount of vibrato. Note that the vibrato has the delay time to start by 'VibratoDelay' parameter.
    • VibratoRate - Vibrato speed control.
    • VibratoDelay - Delay time to start vibrato.
External links


Filoyo.jpg Software-icon.png

Sinsy (Singing Voice Synthesis System) (しぃんしぃ) is an online HMM-based singing voice synthesis system by the Nagoya Institute of Technology that was created under the Modified BSD license.


The synthesizer is free to use, but will only generate tracks up to 5 minutes. The user uploads data in the MusicXML format, which the Sinsy website reads to output a WAV file of the generated voice. Gender factor, vibrato intensity, and pitch shift can be adjusted prior to output.[7]

MusicXML files can be made in Symphony Pro, Cadencii, MuseScore, and finale NotePad.

Some users have praised Sinsy for its realism. This can be attributed to Sinsy's voice source being a TTS (specifically, HTS, also by the Nagoya Institute of Technology), a process known for producing human-like results.

As of December 25, 2013 the official creators of the Sinsy are Keiichi Tokuda (Producer and designer), Keiichiro Oura (Design and Development), Nakamura Kazuhiro (Development and Main Maintainer), and Yoshihiko Nankaku.

  • Voices: Sinsy has four known voices: Yoko, Xiang-Ling, Matsuo-P, and Namine Ritsu S. Sinsy supports Japanese and English. A Chinese version has been released of Xiang-Ling as of Christmas 2015.
    • Yoko (謡子; f001j) is a Japanese-only voice.
    • Xiang-Ling (香鈴; f002j; f002e; f002m) is a Japanese, English, and Chinese (Mandarin) voice. The English voice was added on Christmas 2012. The Chinese voice was added on Christmas 2015.
    • Matsuo-P (松尾P; m003e_beta) is an English voice that was released to the public on December 25, 2013 along with the version 3.4 release of the Sinsy website. Unlike other banks on the Sinsy website that were created using female voices, Matsuo was voiced by a male voice actor. Its voice can be heard here singing "Who's Crying Now". Matsuo-P's voice actor is a YouTube and Niconico user who goes by the name Koya Matsuo.[8]
    • Namine Ritsu S (波音リツS; f004j_beta) is a Japanese voice. It can be heard here singing RIP=RELEASE. As of December 25, 2013, Namine Ritsu S became available for public use.
External links

Examples of usage
  • "CadenciiをインストールしてSinsyに歌わせるまでの録画" by nwp8861 (Mimirobo-P) ft. Tutorial
  • "MuseScoreで楽譜作成し、Sinsyに歌わせる手順(2010/09/02版)" by nwp8861 (Mimirobo-P) ft. Tutorial
  • "Golden Slumbers" by Koya Matsuo (Matsuo-P) ft. Tutorial

Synthesizer V[]

Filoyo.jpg Synthesizer V.png

Synthesizer V (also know as SynthV) is a vocal synthesizer created and developed by Dreamtonics, Co. Ltd. It is currently available on Windows, Mac, and Linux systems. The software is available in English, Japanese, Chinese and Korean. The engine was fully released on December 28, 2018.


It was first previewed in 2017. According to the developers, the project is a product of 7 years of work and is the fifth revision.

It was made available for download in 2018 with the first vocal "Eleanor Forte", downloadable for free. Chinese and Japanese vocals are in production. More languages and dialects are planned.

On December 28, 2018 the engine got fully released with the possibility to register three vocals for free, and it also became possible to purchase a permanent license for the engine.

In December 2019, a web version of Synthesizer V was released, for those who cannot or don't wish to use the program version.

On July 30, 2020, the second version of the engine (release 2) was released under the name Synthesizer V Studio; it is not compatible with voicebanks made for the first engine version (release 1).

  • Vocals:
    • Eleanor Forte (エレノア フォルテ; formerly known as ENG-F1) was the first American English vocal and the first vocal overall to be released for Synthesizer V. Her first name, "Eleanor", has the meaning of light or bright with a sense of nobility, and her last name, "Forte", has the meaning of strength and references the musical dynamic, forte, which literally translates to loud. Despite being a virtual singer, Eleanor has an interest in antiques. She is described to be honest, endeavoring, and to have an unwavering sense of morality.
    • ANRI (杏里), also known as Airi Lin, is an English vocal developed and distributed by AUDIOLOGIE Co., Ltd. in collaboration with Dreamtonics Co., Ltd., and released as an AI voice database for Synthesizer V Studio in December 2021. She is described to be a kind person with a warm and dynamic personality. Once she sets a goal, she becomes very determined. She loves bubble tea, strawberry mille-feuille, and her pet guinea pig, Chachamaru (茶茶丸; Chácháwán).
    • Yamine Renri (闇音レンリ) is a Japanese female vocal released previously for UTAU. She has since been released for Synthesizer V. Her download can be obtained from her own Japanese website and the Synthesizer V website downloads.[9]
    • Genbu (ゲンブ) is a Japanese male vocal and the first male vocal released for the program. He may seem a bit straightforward, to the point of being called rude, however he cares about the people around him. His voice is supposed to represent his personality: calm, soothing, and somewhat weak.
    • AiKO (艾可) is a female Chinese voice and the first Chinese voice released for the program. AiKO is an enthusiastic girl. She tends to be careless but won't let tough times get her down. She always happy to make progress, no matter how small. She is a hard worker and likes to wear her work clothes. She has four main attires. A maid one, a digital one, a "preach" (missionary) one, and a performer one. Her theme color is #FDD000.
    • Chiyu (赤羽) is a female Chinese voice and the first Synthesizer V vocal released from Beijing Photek S&T Development Co., Ltd., and the first of the Medium⁵ series. Chiyu is Xingchen's older sister and is 17 years old. She is based on the element of fire and her representative shape is the tetrahedron. Her birthday is November 26. Chiyu was voiced by You Mao UNEKO.
    • Shian (诗岸) is a female Chinese voice and the second vocal from the Medium⁵ series. She is the youngest of the sisters and is 14 years old. She is based on the element of earth and her representative shape is the cube. Her birthday is January 19. Shian was voiced by warma.
    • Cangqiong (苍穹) is a female Chinese voice and the third vocal from the Medium⁵ series. She made her debut in early 2019 using a different voice synthesizer before officially becoming a Synthesizer V vocal. Cangqiong is the eldest of the sisters and is 18 years old. She is based on the element of air and her representative shape is the octahedron. Her birthday is May 20. Cangqiong was voiced by Kinoko_Mogu.
    • Haiyi (海伊) is a female Chinese voice and the fourth vocal from the Medium⁵ series. She made her debut in late 2017 using a different voice synthesizer before officially becoming a Synthesizer V vocal. Haiyi is Xingchen's younger sister and is 15 years old. She is based on the element of water and her representative shape is the icosahedron. Her birthday is July 22. Haiyi was voiced by Hua Zheng Anly.
    • MAN-M1 an unreleased experimental Chinese (Mandarin) male vocal.
    • MAN-F1 an unreleased experimental Chinese (Mandarin) female vocal.
    • JA-F1 an unreleased experimental Japanese female vocal.
    • Saki (サキ, originally known as JA-F2) is a Japanese female vocal and the first AI voice database to be released (after its Standard version debuted along with SynthV Studio).
    • Qing Su (青溯, originally known as MAN-F3) is a Chinese (Mandarin) female vocal and the first Chinese AI voice database.
    • Kotonoha Akane & Aoi (琴葉 茜 ・ 葵) is a pair of Japanese female voices (sharing one single voicebank) by AH-Software Co. Ltd. that originally started as VOICEROID talking libraries. Their birthday is April 25, and both are voiced by Yui Sakakibara.
    • Muxin (牧心) is a male Chinese voice from Beijing Photek. He is 14 years old and is said to be an idealist with a great fondness for history and social sciences. He is voiced by YuukaNeko.
    • Minus, also known as Xingchen Minus (星尘Minus), is a female Chinese voice and the fifth vocal from the Medium⁵ series. She is based on the element of ether and her representative shape is the dodecahedron. She is voiced by Sawako Sui Hua Zi.
    • Koharu Rikka (小春六花) is a Japanese female voice by TOKYO6 ENTERTAINMENT and AH-Software, with a matching CeVIO AI talking library. Her birthday is May 16, and she is voiced by Yoshino Aoyama.
    • Tsurumaki Maki (弦巻マキ) is a bilingual Japanese and English female voice by AH-Software with matching CeVIO AI talking libraries. She originally debuted as a mascot member of Jam Band, then in late November 2010 became a Japanese-only VOICEROID+ speech library. Her birthday is September 15, and she is currently voiced by Manami Tanaka.
    • ROSA (ロサ) is an announced Japanese female voice by ZAN-SHIN Co., Ltd. with a matching CeVIO AI talking library (which released separately on December 17, 2021). She is 15 years old and is the younger sister of CUL, with pink being her theme color. She is voiced by Shirayuki.
    • SOLARIA (formerly known as SOLARIS) is an upcoming English female voice by Eclipsed Sounds, LLC, slated for release as an AI voice database on January 19, 2022. She is themed to be like a sun goddess. She is voiced by Emma Rowley.
    • Tsuina-chan (ついなちゃん) is a Japanese female voice by AH-Software who originally debuted as a VOICEROID2 in late 2019. She is 14 years old, her birthday is February 3, and her voice is provided by Mai Kadowaki.
    • Kyomachi Seika (京町セイカ) is an upcoming Japanese female voice by AH-Software that also originated as a VOICEROID+ (in mid-2016) and is expected to release as Standard and AI voice databases for Synthesizer V Studio on January 27, 2022. This 23-year-old serves as mascot to the city of Kyoto's Seika district. Her birthday is July 5 and her voice provider is Rika Tachibana.
    • Hanakuma Chifuyu (花隈千冬) is an upcoming Japanese female voice by TOKYO6 and AH-Software with a companion CeVIO AI talk library. She attends the same high school and music club as Koharu Rikka. Her birthday is March 2 and she is voiced by Kaya Okuno.
    • Natsuki Karin (夏色花梨) is another upcoming Japanese female voice by TOKYO6 and AH-Software, also with companion CeVIO AI talk library. She also attends the same school and club as Rikka and Chifuyu above. Her birthday is September 9, and Miyu Takagi voices her.
    • Xingchen Infinity is the upcoming migration to SynthV of Xingchen, after her VOCALOID contract with Shanghai HENIAN Information Technology Co. Ltd. ended in late August 2021. She will be again voiced by Chalili.
External links
Examples of usage

Virtual Singer[]

Filoyo.jpg HarmonyAssistantVS LMT.png

Virtual Singer is an plug-in module released late 2000 for Melody Assistant or Harmony Assistant and was made by Myriad.


Virtual Singer was a relatively small time package. It was a relatively cheap program at only $20, a total of $50 would be spent acquiring both it and Assistant programs. The software's forum is still seeing activity despite the software being dated and users were still producing works using the software in October 2011. There exists a work produced in March 2018, although this seems to be an anomaly, not a revival.

The results are comparable to other software in terms of clarity for its time including Cantor, but was a little more realistic in comparison to Cantor since it was based on human results. It was capable of singing in the following languages: British English, American English, French (Northern and Southern), Finnish, German, Latin, Spanish, Italian, Japanese, and Occitan. However, more scripts are being written by the users of the software that would allow it to produce more than this.[10]

It generated a "human" voice from the score lyrics. It came with the Real Singer II technology. Like UTAU a new voice was possible to be created from your own. However, if you downloaded the software several "free" voices could be downloaded from Myriad's website, some capable of multilingual results and others were made just for one language. Updates were also free of charge, however the latest version is 3.2.

External links

Vocal Synthesis Tool UTAU[]

Filoyo.jpg Utau-interface.png

Vocal Synthesis Tool UTAU (歌声合成ツール UTAU) is a voice synthesizer program currently available for Windows and Mac OS X systems (the Mac version being named UTAU-Synth), it was developed by Ameya/Ayame. UTAU is a shareware] vocal synthesizer program that allows users to create and distribute their own voicebanks. And is viewed as a well-supported alternative to more expensive software that share the same abilities.

UTAU has the advantage of having its development occur at a faster pace. It has plug-in support and users have made a number of plug-ins that greatly improve the software's handling and experience. This support was established fairly early in the software's existence, whereas VOCALOID did not gain this ability until VOCALOID3 in late 2011 and even now it only offers a limited access to source code and plug-in support. Therefore the plug-ins for UTAU can often prove invaluable to users as they can effect the software's results and quality greatly.

Some UTAU voicebanks have been put out as "real" VOCALOIDs, such as the April Fool's joke origins of Kasane Teto. Songs using both UTAU and VOCALOID are also not unheard of. Some users have also began to enforce their copyright ruling over their voicebanks; UTAU or fanmade VOCALOIDs who are guilty of plagiarizing an UTAU's name or using a voicebank without permission risk violating UTAU software agreements and voicebank copyright ownership.


UTAU, meaning "to sing" in Japanese, has its origin in "Jinriki VOCALOID" (人力ボーカロイド, "Manual VOCALOID"), the act of re-editing an existing singing voice, extracting tones as WAV files, and reassembling them. In December 2007, Ameya/Ayame (飴屋/菖蒲), using LOLI.COM's voice samples, released a beta software called Loliedit featuring a simple voicebank called "Loline Com" (a pun on the original voice provider and the "ne" particle originally used on Crypton's products). It features a simple interface with a piano roll, has limited mora (Japanese syllables) and works with a primitive beta engine (or "resampler") later updated on UTAU. This beta software can still be downloaded from Ameya/Ayame's website. In March 2008, Ameya/Ayame released a free, advanced support tool to aid a "Manual VOCALOID" process, called UTAU. In later years, 2010, user feedback and suggestions, as well as other such ideas, lead to the creation of triphone ("VCV"; vowel-consonant-vowel) voicebanks; VOCALOID did not gain this capability until 2011 when VOCALOID3 was released.

The program comes with a default voicebank of 142 samples of Japanese syllables generated from the default voice of A-QUEST's text-to-speech software AquesTalk. Any user can load their own voicebank into UTAU to use. However, without the explicit permission of the voice donor, it is a violation of copyright laws. Those laws protect the rights of any vocalist who may not wish for their voices to be used within the program, such as celebrities. Any music made through this program can be used in the commercial sector. UTAU can be downloaded for free from the home page. It will not run properly on computers which do not support Japanese text or AppLocale.

UTAU is one of the few programs able to convert VOCALOID data files for its own use. It saves data in the .UST (UTAU Sequence text) format and is capable of converting .VSQ files to .UST. However, .UST files itself do not hold as much data as the VOCALOID engines' VSQ or VSQX file extensions, and UTAU does not try to convert many things into even its rough equivalent, only placing the notes. As a result, loss of data may occur. It currently does not support the VOCALOID5 extension, VPR.

External links


Filoyo.jpg Vocalina.png

VOCALINA (보카리나) is a "text to speech" singing synthesizer and DAW for personal music-related content. It was developed in October 2011 by TGENS Co., Ltd. On September 25, 2017, it was announced by the CEO (Kang Woo-Mo) that VOCALINA's Service would be terminated on October 1, 2017. Contrary to this, on the 29th of September in 2017, the CEO announced they would work to extend the service for one more year, as thanks to the community for their encouragement and support. Additionally, VOCALINA would be free to use until the service expired.

In terms of the quality of singing results, the engine produces a large amount of noise, despite the noise reduction engine implemented into the program. This is likely due to the synthesis method itself being Diphonetic Synthesis, a type of concatenative speech synthesis notorious for sonic glitches. Many users of singing synthesizers are familiar with this type of synthesis, as it is used in singing synthesizers like UTAU.


VOCALINA Studio was first revealed before it was released with some interest-building events. After the first BETA 1.1 release, there were contests for songs and art that built the community. The community activities took place largely on Cafe Naver , but has since moved to another page on the same site. Once VOCALINA had been upgraded to version 2.0.16, the BETA got dropped from the name; it was now considered stable. The most recent version is 2.3.2, only featuring the vocalist Khylin. The program is Korean and relies solely on Hangul (Hanja isn't supported) input when using the singing synthesis. It has similar Parameters to that of many Piano Roll-styled MIDI Editors such as Pitch, Velocity, and Dynamics. VST effects like Reverb (REV), Equalizer (EQ), and Echo (ECO) are also listed as Parameters for mixing convenience. Select VSTs bundled within the Studio can be used within the VST Editor. Since it is a DAW in addition to a singing synthesizer, one is able to mix the vocals in real time.

The program launched with VORA, an idol living under cover as a high school girl. When SeeU, a Korean VOCALOID created around the same time VORA was released, there was brief rivalry between the two products. However, since then VORA has performed alongside SeeU in many songs,[11] such as the single "Abandoned" being performed LIVE at one of SeeU's concerts.[12]

Normally, in order to use Khylin, a user must buy a "ticket" from the VOCALINA website, which grants them usage of her voice for a month at minimum, and a year at maximum. After the period has ended, the user needs to renew the service by buying another ticket if they want to continue using Khylin.

  • Vocals:
    • Choi Bora 최보라 (known by her stage name VORA 보라) was the first VOCALINA vocalist, and can be used for free in VOCALINA Studio, no purchase necessary. She was retired[13] due to being incompatible with the new 2.3.0 update of the engine, but is still available with the 2.2.0 version. Her official demo song is Dream (드림). VORA's Official Album that served as her introduction demo is called "Kiss My Life", like her webcomic of the same name. It is available to be listened to here.

      She is 17 years old, 160 cm in height and 43 kg in weight. Her birthday is March 20 and has an AB blood type. Her vocal range is C4~F5 with a tempo of 70~180 BPM.

      Her favorite activities are singing, texting, and making phone calls. Her hobbies are listening to music and solving puzzles. Her "special skills" are organizing and managing blogs. She dislikes loud noises and raw ingredients. She is the only child in her family. She has a webcomic about her rise to stardom called Kiss My Life by Taeyang. (It's written in Korean.)
    • Khylin (formerly KHylin) 카일린 is the second VOCALINA vocalist. Her demo was first revealed on August 17, 2012 and in the video, her name is stylized as "THE KHYLIN". You can hear the demo here, but there is no indication of the song having a title. Her voice provider has sung with VORA before development, a notable duet is here. Khylin comes with every 2.0 version of VOCALINA Studio, but she is only being offered for free in version 2.3.2.

      She is 16 years old and her birthday is July 28. Her vocal range is E3~E5 with a tempo of 70~150 BPM.

      Khylin doesn't have any special information regarding her identity. This implies she does not have a place in VORA's "Kiss My Life" world and exists solely as an isolated vocalist. Her voice provider was revealed to be Hyo Bae Park.
External links

Examples of usage
  • "승인 (Seung-in)" by ??? ft. VORA & SeeU
    Favicon-yt.pngYouTubeFavicon-bb.pngbilibili - YT version set to privte
  • "GET OUT BOY" by 5일 ft. UNI & Khylin
  • "ENJOY THE PARTY" by 5일 ft. VORA
  • "Abandoned" by Sesilin (세시린) ft. VORA

Singing Voice Synthesizers (Samples Instrument)[]


Realivox - The Ladies.png Realivox - Blue.png

Realivox is a vocal synthesizer by Realitone. There are two software packages for Realivox entitled The Ladies and Blue. The software used to run the voices are powered by Kontakt which has a full version which goes by the same name, or Kontakt Player which is free but comes with less features. Kontakt runs on Windows and Mac meaning that Realivox does as well.


The Ladies consist of 5 vocals :

  • Cheryl; A voice described as airy and pretty that is perfect for ethereal film cues.
  • Teresa; Described as a Soprano opera diva.
  • Patty; A voice suited for pop and ethnic music.
  • Julie; For full range songs.
  • Toni; For smooth R'n'B songs.

The voices have 30 multi-sampled articulations, being Oo, Ah, Ee, Oh, Ey, Hmm, Mmm, La, Bah, Bee, Boh, Boo, Buh, Bop, Bow, Bah Fall, Dah, Dee, Doh, Doo, Duh, Boom, Bom, Hey, Ha, Ho, Hoo, Me, Shoo, and Yeah. On top of that the voices have true sampled legato. These voices can also be stacked to create choirs as well as solos.

The Ladies software comes with Kontakt Player in its installation.

Blue comes with a single voice that comes with 12,000 vocal samples. Each of the six vowels has thirty-two samples accompanying it. This is because when pronouncing different consonants before or at the end of different vowels, the mouth will open and close differently based off of what vowel has been pronounced. This reduces the amount of incorrectly sounding pronunciations and replicates sung English more accurately.

Like with The Ladies, Blue also has true sampled legato, but it also has polyphonic legato.

There is also an Ensemble Mode which allows the user to switch between 3 voices that comes with the package without the need to switch between tracks. This comes with settings for volume, pan, tuning, timbre and offset for each voice that is activated.

Blue produces a crystal clear sound.[14]

External links

Examples of usage

The Ladies

  • "How We Created "Mmmquiring Minds" with Realivox" by Realitone ft. The Ladies
  • "Mmquiring Minds" by Realitone ft. The Ladies
  • "Walking Through a Dark Town" by Realitone; Frank Raschke ft. Cheryl


  • "Realivox Blue Walkthrough" by Realitone ft. Blue
  • "Blue Demos" by Realitone ft. Blue; Kontakt
  • "The Wonderful Blue" by Realitone; Man Parrish ft. Blue

Symphonic Choirs[]

Filoyo.jpg QLChoirs-xlarge.jpg

Symphonic Choirs is a choir synthesizer produced by EastWest/Quantum Leap, able to recreate the effect of an entire choir for any song. It is popular with indie musicians.

External links

Ethera (from Zero-G): WORK IN PROGRESS[]

Filoyo.jpg Software-icon.png

Ethera is a : WORK IN PROGRESS



External links


Filoyo.jpg AcmeIku ss 02.jpg

MAIDLOID (メイドロイド) is a Text-to-Speech/Phrase Editor synthesizer developed and distributed by StudioS, and it was released December 27, 2007. The purpose was to create erotic sounds using 50 Japanese consonants along with several hundred voice clips and breathing sounds to sound like a recreation of an eroge.

Some users have attempted to make the vocals 'sing' and even proved successful by importing the Japanese syllables into Vocal Synthesis Tool, UTAU. However, the H-Voice Characters are not meant for musical purposes and – like with the Macne Series – users would need to own the full versions of the software to legally be able to use this in UTAU, as the Lite version of Iku and RIYONA only provide a small number of Japanese consonants and cannot make a fully working voicebank.

When ported to the UTAU software and used for 'singing', the Lite versions of Acme Iku & RIYONA have poor clarity and quality due to the demo files' compression and is difficult for even native Japanese speakers to understand. Once bought, the full versions of Acme Iku's and RIYONA's voice qualities are much better.


It is a voice synthesizing engine made to resemble VOCALOID, but it is made to create erotic sounds instead of singing. It can voice the 50 Japanese consonants along with several hundred voice clips and breathing sounds to sound like a recreation of an eroge. There are a few extras, such as the ability to insert customized special sound effects and a few BGMs. Acme Iku's voice was made to be used in games and animation and those works can be sold. However, it is illegal to distribute her full voicebank. There are exceptions with the Lite version.

On December 27, 2017, for Acme Iku's tenth anniversary, StudioS announced that there would be a new character added to the Character H-Voice Series named RIYONA,[15] and Acme Iku's page on DLsite was updated as well, adding contents from the physical CD extras to the MAIDLOID editor. Anyone with a valid purchase of her on DLsite can download the updated version for free.

On February 10, 2018, the new H-Voice character, RIYONA, was released onto the DLsite and DMM for 1,620円. Her files contained over 900+ new sounds and phrases, most of which consists eroge sounds. StudioS stated that RIYONA was made with the idea in mind of using her for not only H-Games/Eroge Games but also fighting games as well, with a whole folder of voice lines dedicated to those types phrases. RIYONA uses a brand new software built by StudioS just for her vocal usage, making her the first H-Voice Character so far to not use the MAIDLOID Editor.

In 2011, a Japanese producer by the name of Paraoka, created three songs using the software and released a doujin album, "Training Diary".[16]

  • Vocals: The demo versions of Iku and RIYONA only include a small portion of their original libraries, their sound quality manually degraded to make the usage of the demo files less desirable, provoking customers to buy the full versions.
    • Acme Iku (阿久女イク; (あくめいく)) is the first character of StudioS' H-Voice Character Series and uses the Flash made MAIDLOID software. Her voice is provided by the Japanese voice actress, Sayaka Keina (計名さや香; Keina Sayaka) and her illustration is by Shin. Her concept is a housemaid android who is meant to be a lewd and cute character. She is 1 years old, 145 cm in height with "NAISHO! (ナイショ!)" as her weight. Her vocal range is said to be 'good' with a lolita treble, her tempo is "H-mode" and her genres are Hentai and Ecchi.

      Her name Acme in kanji is the choice of corresponding pronunciation and has no valid literal interpretation. In katakana 'Acme' in Japanese literally means "orgasm." And Iku is a common erotic moan in Japan, meaning "to come, to orgasm." As such, her name translates into a sexual innuendo.

      Iku is featured as a playable character in the Japanese PC doujin game Super Strip Fighter IV (スーパーストリートファイターⅣ). Her voicebank is used, and some of her attacks consist of use of her hair, she is able to use her hair accessory and throw or swing it at her opponents.[17]
    • Princess Knight RIYONA (姫騎士リヨナ) is the second and newest character of StudioS' H-Voice Character Series labeled as KNIGHTROID. She uses software made for Windows, which subsequently shall be referred to as the KNIGHTROID Editor. Her voice is provided by an adult game Japanese voice actress, Miki Honda (本多未季; Honda Miki) and her illustration is by Shin.

      Her concept is a princess knight, meant to be proud and a strong ruler, but she's also more of an older sister figure, unlike Iku. Not much is currently known about her as a character, and little is known about her voice provider other than her few roles in some eroge games.

      RIYONA is featured as a playable character in the Japanese PC doujin game Super Strip Fighter 5 (スーパーストリートファイター5). Her voice provider provided the samples used, and some of her attacks consist of use of her using her sword as well as the ability to use lightning attacks.
External links

Examples of usage
  • "Chaosmaid" by Paraoka ft. Acme Iku

Macne Series[]

Main article: Macne Series

Mixed Voice Synthesizers (Singing + TTS)[]

CeVIO Creative Studio[]

Filoyo.jpg CeVIO interface

CeVIO Creative Studio (pronounced che-ˈvē-ˈo) is a commercial vocal synthesizer product released on September 26, 2013. CeVIO Creative Studio received two awards in 2013, the "MicrosoftⓇ Innovation Award 2013" and the "CEDEC Award". Its demo version, CeVIO Creative Studio FREE, is available in trial form. Previously, users could not edit Parameters in this version, but they are able to now, with few restrictions.


CeVIO has two capabilities, speaking function and singing, both of which need to be provided by their developers.

The speaking portion offers a large dictionary of words to which the vocalists can pronounce in a variety of ways and emotions. There are usually 3 different types of voices that can be cross-synthesized, or isolated to portray a single emotion. If they misinterpret kanji, phonemes can be edited. Velocity, Length, Tone, Accent, and Pitch can be edited in this mode.

The singing portion offers: Amplitude Timing, Pitchbends, Volume, and Vibrato rate and Depth. Gender is also available to be edited on the side bar of the piano roll. A recent development of the engine also introduced phoneme-input, previously not used. The ability to add and edit phonemes manually allows some "Engrish" words to be made, or allows smoother pronunciation of borrowed words. In addition to phoneme editing, up to 5 hiragana/katakana characters can be added onto a single space. This is a unique feature of CeVIO.

  • Parameters:
    • Amplitude Timing allows for phoneme editing. Sounds often are split into 6 segments in this section, and the beginning of the note is highlighted with a pink line. Dragging the last purple line back to the pink line of the next note will often help with vowel transitions. If there are lines between notes covering a blank space, this characterizes a breath or static sound. It cannot be deleted.
    • Pitch allows for editing of Pitchbends and addition of Portamento, which most vocals can produce on their own. Many advise using the Line Tool to draw pitchbends, as the Pencil Tool is very sensitive.
    • Volume can be used for dynamics, however it uses a very intense scale and minor adjustments to loudness can cause peaking/clipping, and lower volumes can cause an influx of static. Adjusting the volume of the track may be preferable.
    • Vibrato rate controls how fast vibrato cycles will happen. Within the program, the user isn't supposed to draw the vibrato, but rather a diagonal line/curve upwards to indicate the vibrato is accelerating, or a diagonal line/curve downwards to indicate the vibrato is decelerating.
    • Vibrato depth controls how deep the pitch of the vibrato cycles will be. In other words, the intensity of the vibrato. This is edited the same way as Vibrato rate, with the diagonal lines or curves.
  • Voices:
    • Sato Sasara is a speaking and singing CeVIO product. She is 16 years old.
    • Suzuki Tsudumi is a speaking CeVIO product. She is 17 years old and friends with Sasara.
    • Takahashi is a speaking CeVIO product. He is 20 years old and friends with Sasara.
    • ONE is a speaking and singing CeVIO product. She is the second installment of the - ARIA ON THE PLANETES -" project.
    • IA is a speaking and singing CeVIO product. She was the first installment of the - ARIA ON THE PLANETES -" project and a VOCALOID3 character.
    • Akasaki Minato is a singing CeVIO product and part of the Color Voice Series. He is 25 years old.
    • Midorizaki Kasumi is a singing CeVIO product and part of the Color Voice Series. She is 27 years old.
    • Ginsaki Yamato is a singing CeVIO product and part of the Color Voice Series. He is 50 years old.
    • Kinzaki Koharu is a singing CeVIO product and part of the Color Voice Series. She is 52 years old.
    • Shirosaki Yuudai is a singing CeVIO product and part of the Color Voice Series. He is 20 years old.
    • Kizaki Airi is a singing CeVIO product and part of the Color Voice Series. She is 18 years old.
    • HAL-O-ROID is a singing CeVIO product and based on the deceased Enka singer. He is 37.3 years old.
External links

Festival Speech Synthesis System[]

Fl.gif Software-icon.png

Festival Speech Synthesis System is a free (libre) / open-source software speech synthesizer developed at the Centre for Speech Technology Research (CSTR) of the University of Edinburgh. In order to enable Festival to sing, there is a plugin required called Flinger (Festival Singer) developed at the OGI School of Science and Engineering that was released in 2001. Free(b)soft has contributed Czech diphone voices and an accessible editor similar to the VOCALOID editor.

Flinger has been used by users in the western world for creating English demo songs.

Most voicebanks for Festival are available under a free license, whereas VOCALOID (and UTAU) use proprietary licenses. However, the voicebanks that come with Flinger are still proprietary. The experimental LMMS plugin Singerbot uses Festival for singing releases.

External links


Main article: Voiceful

Related Text-to-speech Synthesizers[]

Gynoid Talk[]

Main article: Gynoid Talk

Galaco Talk : WORK IN PROGRESS[]

Main article: Galaco Talk

Speech Synthesis (FL Studio plugin)[]

FL Studio.png FL Chan.png

FL Studio, formerly known as FruityLoops, is a digital audio workstation (DAW), developed by Belgian company Image-Line released December 18, 1997. FL Studio was originally the creation of Didier Dambrin, who is now the lead programmer responsible for the overall development.

In the western VOCALOID (and also UTAU) scene, FL Studio is among one of the more popular DAWs used by some producers, it is not as widely used in Japan since they also have access to other popular DAW systems as well.[18] It is mainly used to put all the components of the music together. FL Studio also recognizes VOCALOID as a Rewire plugin, and is able to actively feed their voice samples together, making the software overall more attractive to VOCALOID users.


FL Studio features a fully automatable workflow centered around a pattern-based music sequencer. The environment includes MIDI support and incorporates a number of features for the editing, mixing, and recording of audio. Completed songs or clips may be exported to Microsoft WAV, MP3, and the OGG Vorbis format. FL Studio saves work in a proprietary *.flp format, inadvertently sharing the same extension as a filetype created by Adobe Flash CS3.

The program is acclaimed for its professional DAW features at an attractive price point. Scoring to video is possible using the video-player plugin, but there is no support for traditional music notation.

FL Studio is written in Borland Delphi and thus until recently, there were no plans to develop a native Linux or Mac OS X port. Recent changes, however, has gleaned on 64-bit support and a self-packaged ASIO driver for low end sound cards. As of September 2015, the Image-Line website indicates that the company is heavily developing a native mac version which can currently be tested by all registered forum users (alpha 0.4); similarly, several FL studio plugins are in development as native versions and can also be tested (alpha 11 currently). In May 2018, Image-Line finally distributed version 20 of FL Studio, which has native MAC OS support.

Since the software was designed for a wide variety of uses and aimed to appeal to users of all music levels, many amateur musicians also use this software, and for the western VOCALOID fandom has proven to be popular because it allows even amateurs a chance to remix or make new songs that can compete with professional works. It has many tools that have abled to them to create work on level with more professional VOCALOID users and has been considered one of the more attractive software suites for purchase for this reason by new fans.

Some producers however prefer to prerender the voices from VOCALOID into WAV files (which is also a necessary step for UTAU-based voicebanks) and insert it in FL Studio as an audio sample, instead of using ReWire. Alternatively, a whole prerendered voice in a WAV file can be articulated into sliced samples, for easy access to note segments and remixing, as well as for conserving system resources.

  • FL-chan is a mascot inspired by the success of VOCALOID2 character Hatsune Miku. Didier Dambrin made a comment in an Internet forum that he also wanted a moe character like Habanero-tan for FL Studio. A Japanese artist by the name Uruido took the request and created a moe anthropomorphism of FL Studio called FL-Chan, and she became the official character of the DAW software since the release of ver. 8 in March 2008. Image-Line has adopted the character as a form of mascot, providing an array of purchasable merchandise and a dedicated page on the Image-Line website. You can find that here.

FL-chan's voice has also been successfully transferred into UTAU. Much like Acme Iku, users need to own the original software to legally use her voice. It is not the most realistic of the UTAU voicebanks as its results came from a Vecoder and were made by artificial means.

She has been featured in various VOCALOID videos, Meiko's Various Feelings being one.

External links

Megpoid Talk[]

Main article: Megpoid Talk

Talk Ex[]

Main article: Talk Ex


Main article: VOICEROID


Main article: VTalk

Miscellaneous : WORK IN PROGRESS[]

Linux MultiMedia Studio[]

Lmms logo.png Software-icon.png

Linux MultiMedia Studio or LMMS, is a free digital audio workstation (DAW) alternative to proprietary software such as FL Studio, Cakewalk, and GarageBand released March 7, 2015. It is distributed free of charge, unlike FL Studio and Cakewalk, and it is also free as in freedom, meaning anyone can see or modify the source code. It is available for various distributions of Linux, and despite its name, also for BSD, Microsoft Windows, and Mac OS X (with some limitations).


LMMS allows anyone to easily create loops using a variety of predefined synthesizer plugins, as well as external VST plugins. (There is a VST plugin version of the VOCALOID engine.) Effect chains can be added to almost any instrument through the use of LADSPA effects. The software also features the ability to import existing FL Studio project files.

Like the VOCALOID Editor, notes are entered using a piano-roll interface. For beats and basslines, LMMS also includes a transport bar similar to that of FL Studio. This editor can be changed to a piano-roll editor if more sophisticated beats and music loops are desired. LMMS can also sing if the experimental plugin Singerbot is installed.

Each instrument can be assigned to an "FX Channel". The FX Mixer provides a single area to change the volume and panning, as well as the effects chains, of all instruments assigned to a specific FX Channel at once. This works well for adjusting the master volume of a drumset (which is normally made up of several different samples, each sample corresponding to a different portion of the drumset; e.g., snare, bass drum, cymbal, etc.).

  • Included synthesizer plugins: LMMS includes various synthesizer plugins out-of-the-box. There are also a variety of community-developed presets for each synthesizer plugin, making it easy for beginners to pick a sound they like.
    • AudioFileProcessor provides a simple interface to use the piano-roll or beat editor with a sound clip. LMMS includes several instrument samples, as well as a variety of drum samples, represented as AudioFileProcessor presets.
    • BitInvader provides a highly customizable wavetable synthesizer.
    • Kicker is a kick/bass drum synthesizer.
    • LB302 Monophonic imitation of Roland TB-303.
    • Mallets synthesizes tuneful percussion sounds.
    • Monstro provides more advanced triple oscillator mixing
    • Nescaline emulates the Nintendo Entertainment System® APU.
    • Organic provides an organ-like synthesizer with adjustable "stop" levels for each virtual "pipe".
    • FreeBoy emulates the Nintendo Game Boy APU (audio processor unit).
    • SID emulates the Commodore 64 MOS Technology Sound Interface Device chip, with support for 6581 and 8580 variants.
    • TripleOscillator provides three versatile oscillators that can be mixed together in different ways to produce all kinds of sounds.
  • Vibed models vibrating string instruments.
    • Watsyn provides more advanced wavetable synthesizer
    • Embedded ZynAddSubFX engine with various presets.
External links


Filoyo.jpg IVoxeliPhone1Small.png

iVoxel is a singing vocoder with a vocal sequencer developed by VirSyn. It is a built in application for the iPhone and iPad. It has many features beyond a sequencer.

Like other VirSyn products it is based on the same ideas and concepts that IVOCALOID is based on. But like previous software, such as Cantor, it is not intended to be a realistic singing voice.

External links


Filoyo.jpg Sugarcape.png

SugarCape is a vocal synthesizer developed by sota, only available for use on Mac OS X Snow Leopard 1.6.8 or above. The newest version now called SaltCase Alpha 0.0.2 uses a tripitch voicebank, and it is assumed that it has been given a sort of "prefix.map", similar to that of an UTAU voicebank. This function allows transitions between pitches to be more natural, rather than giving in to the harsh distortion of the sample as it goes deeper.

In addition to the preset voice in SugarCape, one can add a voice to be compatible with the program. This has been done with Nagone Mako, and several other UTAU voicebanks. Some have thought of importing the Macne Series into SugarCape.

External links

Examples of usage
  • "sugarcape" by talc; なんてこったい on NND ft. N/A
  • "てくてく~地べたのスカイウォーカー~" by 呑気大王 ft. SugarCapePro
  • "DESTINY" by ねこ伯爵P  Vocalist missing!