Vocaloid Wiki
Vocaloid Wiki
Advertisement
💻 Technology article work in progress. What is being worked on: Idea is to redirect all non-VOCALOID tech to this directory page and link related wikis when avalible. Discussions here
For information on how to help, see the guidelines.  More subjects categorized here.
💻

Top Directory Music directory Merchandise directory Technology directory Synthesizers directory Tutorial directory Websites directory



Sub-root article guide to direct readers to information about various software.

Singing Voice Synthesizers (Lyric Input)[]

AquesTone[]

AquesTone Software-icon

AquesTone is a VSTi plugin developed by Aquest, there are four voice options: Female F1, Auto F1, Male HK, and Auto HK. UTAU Uta Utane (aka. Defoko) uses Female voice 1 as a source for the UTAU default voicebank.

The female voice is credited in Pop'n Music ("Chilt Featuring AquesTone") and other BEMANI rhythm games like Dance Dance Revolution.

External links


Examples of usage
  • "ちかてつ (Chikatetsu); Subway" by Calmwind01x ft. TILT (AquesTone)
    Favicon-ytYouTube
  • "ミルキーポケット (Miruki Poketto); Milky Pocket" by Calmwind01x ft. TILT (AquesTone)
    Favicon-ytYouTube
  • "Time's Intersection" by Calmwind01x ft. TILT (AquesTone)
    Favicon-ytYouTube

Alter/Ego : WORK IN PROGRESS[]

Filoyo Software-icon

Alter/Ego is a : WORK IN PROGRESS

History

PRODUCT_History : WORK IN PROGRESS

External links


Examples of usage
  • "TITLE" by PRODUCER ft. SINGER
    Favicon-ytYouTube

Emvoice One[]

Filoyo Software-icon

Emvoice One (formerly SING and working title SOHO) is a software by Emvoice for OSX and Windows. The software will allow music producers to create vocals without the need of a singer.

External links

Cadencii[]

Filoyo 7e61cc2f-02ff-e011-98e6-0025902c7e73 2 full-1-

Cadencii is a voice synthesizer program and frontend for several other voice synthesizers: VOCALOID, VOCALOID2, UTAU (or rather, UTAU resamplers), STRAIGHT with UTAU, WORLD, and AquesTone. It has its own engine (written by shuraba-P / HAL) named v.Connect-STAND. The source code used to be hosted on SourceForge.JP, but has moved to GitHub.

Cadencii's interface emulates the VOCALOID interface very closely. The piano roll can also change color depending on the synthesizer engine being used; for example, when VOCALOID2 is selected as the synthesizer, the piano roll becomes grey and green, and when UTAU is selected, it becomes blue and pink.

History

It is currently officially available for Windows and Mac OS X. The latest version available is v3.5.4 for Windows and v3.4.1 for Mac OS X. There is also an unofficial port in the Debian repositories at v3.3.9.[1]

On Mac OS X and other Unix-based operating systems, Cadencii requires Wine to be useful. For OS X there is jCadencii, a JAVA version of Cadencii for OS X. The JAVA frontend is actually compiled from a lot of #ifdef JAVA in the C# Code.[2] If you take a look into the OS X .app bundle, you'll still find minimized wine bundles to support the dlls with native Windows code.

It's not possible to use Mono to build and run the CSharp program cross-platform yet, since there is some native Windows code. Workarounds may be possible in a way like pipelight, a method to provide better SilverLight/Flash support in Unix(-like) systems using wine.


Cadencii has its own file format, .xvsq (not to be confused with VOCALOID3's .VSQx). Cadencii can also import and export other synthesizers' project files, such as VOCALOID's VSQ and UTAU's UST file formats. Notably, it can export as MusicXML, making it a popular choice for creating files that work with Sinsy, especially in combination with its ability to import VOCALOID and UTAU project files.

Besides simply being able to import VSQ and UST files, Cadencii can also read the pitchbends (old pitchbend type/Mode1-only for UST), which can be used in combination with Cadencii's ability to use multiple synthesizers, as seen in the example video below (Tori no Uta).

  • License
    • Cadencii is free software.
    • Source codes of Cadencii are the copyright of kbinani.
    • It's stated that Cadencii is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
    • Cadencii is released under the GNU General Public License, version 3.0.
External links


Examples of usage
  • "CadenciiをインストールしてSinsyに歌わせるまでの録画" by nwp8861 (Mimirobo-P) ft. Sinsy; export tutorial
    Favicon-nnNiconicoFavicon-ytYouTube
  • "Tori no Uta" by FastSpeedy ft. Nukupoid; UTAU voicebank
    Favicon-ytYouTube

Cantor[]

Cantor CANTOR1
Cantor2 CANTOR2a

Cantor (and its successor Cantor 2) is a synthesizer developed by VirSyn and utilizes Formant synthesis. It was released at the same time as MIRIAM, and was a rival to the original VOCALOID software. It was sold for £199.99 including VAT which was said to be expensive for its time, but that was due to offering a far greater selection of vocals.

Unlike VOCALOID, it had 50 voices for use, which was far greater than what VOCALOID had to offer upon its release. But because of its design it was more intended to resemble a virtual instrument than a virtual singer. It had capabilities for both German and English, and supported both Windows XP and Mac OS X, as opposed to VOCALOID which was restricted to just Windows XP. However, similar to VOCALOID, it worked as either a stand alone or plug-in and supported ReWire.

History

As with the early version of VOCALOID, Cantor was a victim of change in the way indie music was produced, and by Cantor 2 was being impacted by this change (alongside VOCALOID) as the demand for synthesized voices began to disappear.

The final version of the software, Cantor 2.1 was released on February 6, 2007. Though updates have since ceased, the software remains on sale, and is confirmed compatible with Windows XP/Vista/7 and Mac OS X 10.5/10.6 versions. A demo is able to still be downloaded from VirSyn's website, although it requires an eLicensed Syncrosoft dongle to be able to download the demo and the full software version itself.[3] The special dongle was included with the boxed version of the software, as well as other software sold by VirSyn. It was not able to be bought with the downloaded versions but could be purchased separately.[4]

Cantor 2 later became available for purchase on the Crypton Future Media Website in 2008, and a demo was released on Crypton's Youtube account.[5]

External links


Examples of usage
  • "Tears of an Angel" by Mike Oldfield ft. MIRIAM + CANTOR
    Favicon-ytYouTube

Chipspeech[]

Filoyo Dandyinterface

Chipspeech is a synthesizer developed by plogue. A retro styled vocal synthesizer created to reproduce vintage vocal synthesizers released in the 20th century. The software acts as stand alone or plug-in software to various DAWs. It can sing and talk and supports two languages: English and Japanese (though Japanese currently does not have talk capability). There are various means to adjust the vocal in the way the user wants, creating some very unique sounds and results.

The main strength of the engine is it can have a multiple number of synthesizer styles built into it. While some like Dandy 704 or Lady Parsec are based on samples recreation vocals much like VOCALOID or UTAU, others like Dee Klatt do not have samples and are fully synthetic sounds. They are instead based on direct input, meaning they recreate to various degrees faithful recreations of their engine, with Dee Klatt's being a fully rendered "live" feed back. For example, along with 5 new vocals since release, the Circuit Bending feature was added in version 1.032. This mimicked the circuit bending method of getting unique vocals from the classic old chips, which allowed for 'one of a kind' results.

As they are based on old technology they are all dated vocals and do not reflect the modern sounding ones. Therefore they at best barely sound human and do not attempt to even sound remotely like an uncanny effect. This can be off putting for those inexperienced with synthesizers of the past who and those who want realistic sounding vocals. The vocals themselves are difficult to find otherwise in their original chip forms, some being impossible to find due to how old their technology is.

History

Plogue Art et Technologie, Inc is a small company specialized in chipbased technology and aiming to recreate chip effects and sounds and apply them to modern technology. Chipspeech was one of their many ideas they planned for years to create, however, they were held back on the means to execute it. Though they had the technology, they lacked the knowledge. The Chipspeech software was born after they hired a member of their development team with Phonetic knowledge, allowing them to come up with ideas on how to create a vocal synthesizer technology.

The software acts as stand alone or plug-in software to various DAWs. It can sing and talk and supports two languages: English and Japanese (though Japanese currently does not have talk capability). There are various means to adjust the vocal in the way the user wants, creating some very unique sounds and results.

Unlike VOCALOID, CeVIO or other synthesizers words are typed as sentences. If used as a plug-in into a DAW, the synthesizer will play each sentence in the keyboard. The software is easy to use but requires some work to master.

The software currently has 12 characters, with an optional 13th being able to once downloaded known as "Daisy". Daisy was able to go into Alter/ego and is the only character able to do so. Daisy is, however. currently retired and unavailable for download. The other 12 characters are based on various synthesizers, such as Rotten.ST based on Atari ST’s STSPEECH.TOS or Dandy 704 based upon the IBM 704 computer. With the exception of Dandy 704, all have a cyberpunk-style character illustration representing each voice, with Dandy 704's instead being Steampunk. There is a basic "storyline" between the characters and a "canon" as per say. However, this does not impact the software itself.

External links

MUTA: WORK IN PROGRESS[]

Filoyo Software-icon

MUTA is a : WORK IN PROGRESS

History

PRODUCT_History : WORK IN PROGRESS

External links


Examples of usage
  • "TITLE" by PRODUCER ft. SINGER
    Favicon-ytYouTube


DeepVocal : WORK IN PROGRESS[]

Filoyo Software-icon

DeepVocal is a : WORK IN PROGRESS

History

PRODUCT_History : WORK IN PROGRESS

External links


Examples of usage
  • "TITLE" by PRODUCER ft. SINGER
    Favicon-ytYouTube

NIAONiao[]

Filoyo NiaoNiao

NIAONiao Virtual Singer (袅袅虚拟歌手 Niǎoniǎo xūnǐ gēshǒu) is a Chinese voice synthesizer program developed by dsound.[6]

The default voicebank is named Yu Niaoniao (余袅袅), however, users can create their own voicebank and take advantage of its larger file feature. NIAONiao can import MIDI files, VSQX files (VOCALOID3 only), and UST files, export tracks as the "Niao" file format (*.nn), and can render vocal tracks directly as WAV, MP3, or MIDI files.

History

The principle is the same as UTAU. Many Chinese fans have begun producing vocal banks for both programs. The voicebank format for NIAONiao is radically different from UTAU, the main difference being that the voice samples are packed in a large file. Due to being made for a Chinese audience, NIAONiao can have final consonants in a voice, also unlike UTAU. NIAONiao is not exclusive to singing in Chinese, just as UTAU is not restricted to Japanese. For example, a NIAONiao voicebank for Nagone Mako can be downloaded from the official NIAONiao website.

The interface is much closer in similarity to VOCALOID and (unlike UTAU) there is a panel at the bottom for controlling parameters, pitchbends, and vibrato.

External links

NEUTRINO : WORK IN PROGRESS[]

Filoyo Software-icon

NEUTRINO is a Japanese neural voice synthesizer program developed by SHACHI.[x]

It is compatible Windows, MacOS, and Linux. Web browser compatibility is based in Google Drive.

History

PRODUCT_History : WORK IN PROGRESS

The user uploads data in the MusicXML format, which the NEUTRINO program reads to output a WAV file of the generated voice. Gender factor, vibrato intensity, and pitch shift can be adjusted prior to output.[x]

MusicXML files can be made in Symphony Pro, Cadencii, MuseScore, and finale NotePad.

  • Voices: NEUTRINO has two known voices: Tohoku Kiritan, and Yoko. NEUTRINO supports Japanese.
    • Tohoku Kiritan (東北きりたん) is a Japanese-only voice.
    • Yoko (謡子; f001j) is a Japanese-only voice.
External links
Examples of usage



  • "AIきりたんの『キリトリセン』 無調声ver" by 96s_kM4osM ft. Tohoku Kiritan
    Favicon-ytYouTube
  • "AIきりたん調声有無聞き比べ" by 96s_kM4osM ft. Tohoku Kiritan
    Favicon-ytYouTube

Piapro Studio for NT : WORK IN PROGRESS[]

Filoyo Software-icon

Piapro Studio for NT is a : WORK IN PROGRESS

History

PRODUCT_History : WORK IN PROGRESS

External links

RenoidPlayer[]

Filoyo Renoidplayer

RenoidPlayer is an online synthesizer created by g200kg. It is compatible with various web browsers and also works on the iPad and iPhone as long as they run iOS 6. Note: Safari running on Mac OS X cannot export the final product. Use Chrome/Firefox when exporting.

History

It has a built-in sequencer, unlike Sinsy. However this Editor can be a little confusing to new users, but for those experienced, there is MML available for use. RenoidPlayer is similar to AquesTone in the fact that pitchbends and other flags and tunings are ignored. This applies to all voices. Currently, there are only 8 voicebanks available to use, many of which are UTAU voicebanks which the author got permission from their creators to make RenoidPlayer compatible voice libaries for.

  • Data import: RenoidPlayer can accept file-drop as a sequence data. (Note infomation and basic lyric informations only. Pitch-bends and other additional informations are ignored.)
    • VOCALOID: VOCALOID Sequence files (.VSQ/.VSQx)
    • UTAU: UTAU Script files (.UST)
    • CeVIO: CeVIO Creative Studio files (.CCS)
    • MusicXML (.XML)
  • Parameters
    • Volume - Output volume control.
    • Transpose - Output pitch control, semi-note step.
    • Portamento - Pitch change smoothness control.
    • FormantCorrection - Keep formant independently of output pitch.
    • Formant - Formant control.
    • Humanize - Add some fluctuation to pitch and dynamics.
    • VibratoDepth - Amount of vibrato. Note that the vibrato has the delay time to start by 'VibratoDelay' parameter.
    • VibratoRate - Vibrato speed control.
    • VibratoDelay - Delay time to start vibrato.
External links

Sinsy[]

Filoyo Software-icon

Sinsy (Singing Voice Synthesis System) (しぃんしぃ) is an online HMM-based singing voice synthesis system by the Nagoya Institute of Technology that was created under the Modified BSD license.

History

The synthesizer is free to use, but will only generate tracks up to 5 minutes. The user uploads data in the MusicXML format, which the Sinsy website reads to output a WAV file of the generated voice. Gender factor, vibrato intensity, and pitch shift can be adjusted prior to output.[7]

MusicXML files can be made in Symphony Pro, Cadencii, MuseScore, and finale NotePad.

Some users have praised Sinsy for its realism. This can be attributed to Sinsy's voice source being a TTS (specifically, HTS, also by the Nagoya Institute of Technology), a process known for producing human-like results.

As of December 25, 2013 the official creators of the Sinsy are Keiichi Tokuda (Producer and designer), Keiichiro Oura (Design and Development), Nakamura Kazuhiro (Development and Main Maintainer), and Yoshihiko Nankaku.

  • Voices: Sinsy has four known voices: Yoko, Xiang-Ling, Matsuo-P, and Namine Ritsu S. Sinsy supports Japanese and English. A Chinese version has been released of Xiang-Ling as of Christmas 2015.
    • Yoko (謡子; f001j) is a Japanese-only voice.
    • Xiang-Ling (香鈴; f002j; f002e; f002m) is a Japanese, English, and Chinese (Mandarin) voice. The English voice was added on Christmas 2012. The Chinese voice was added on Christmas 2015.
    • Matsuo-P (松尾P; m003e_beta) is an English voice that was released to the public on December 25, 2013 along with the version 3.4 release of the Sinsy website. Unlike other banks on the Sinsy website that were created using female voices, Matsuo was voiced by a male voice actor. Its voice can be heard here singing "Who's Crying Now". Matsuo-P's voice actor is a YouTube and Niconico user who goes by the name Koya Matsuo.[8]
    • Namine Ritsu S (波音リツS; f004j_beta) is a Japanese voice. It can be heard here singing RIP=RELEASE. As of December 25, 2013, Namine Ritsu S became available for public use.
External links


Examples of usage
  • "CadenciiをインストールしてSinsyに歌わせるまでの録画" by nwp8861 (Mimirobo-P) ft. Tutorial
    Favicon-nnNiconicoFavicon-ytYouTube
  • "MuseScoreで楽譜作成し、Sinsyに歌わせる手順(2010/09/02版)" by nwp8861 (Mimirobo-P) ft. Tutorial
    Favicon-ytYouTube
  • "Golden Slumbers" by Koya Matsuo (Matsuo-P) ft. Tutorial
    Favicon-ytYouTube

Synthesizer V[]

Filoyo Synthesizer V

Synthesizer V (also known as SynthV) is a vocal synthesizer created and developed by Dreamtonics, Co. Ltd. Its first release (R1) was available on Windows, Mac, and Linux systems, until it was succeeded by Synthesizer V Studio in July 2020, which has support for the usual "Standard" sample-based voice libraries and the new AI voice libraries. Synthesizer V was available in English, Japanese and Mandarin Chinese, although there were numerous community-made translations of the interface, and was fully released on December 28, 2018. Synthesizer V Studio expanded the language support to Cantonese Chinese in April 2023 and to Spanish in November 2023.

History

It was first previewed in 2017. According to the developers, the project is a product of 7 years of work and is the fifth revision.

It was made available for download in 2018 with the first vocal "Eleanor Forte", downloadable for free. Chinese and Japanese vocals were also produced. More languages and dialects are planned.

On December 28, 2018 the engine got fully released with the possibility to register three vocals for free, and it also became possible to purchase a permanent license for the engine.

In December 2019, a web version of Synthesizer V was released, for those who cannot or don't wish to use the program version.

On July 30, 2020, the second version of the engine (release 2) was released under the name Synthesizer V Studio; it is not compatible with voicebanks made for the first engine version (release 1).

External links
Examples of usage

Virtual Singer[]

Filoyo HarmonyAssistantVS LMT

Virtual Singer is an plug-in module released late 2000 for Melody Assistant or Harmony Assistant and was made by Myriad.

History

Virtual Singer was a relatively small time package. It was a relatively cheap program at only $20, a total of $50 would be spent acquiring both it and Assistant programs. The software's forum is still seeing activity despite the software being dated and users were still producing works using the software in October 2011. There exists a work produced in March 2018, although this seems to be an anomaly, not a revival.

The results are comparable to other software in terms of clarity for its time including Cantor, but was a little more realistic in comparison to Cantor since it was based on human results. It was capable of singing in the following languages: British English, American English, French (Northern and Southern), Finnish, German, Latin, Spanish, Italian, Japanese, and Occitan. However, more scripts are being written by the users of the software that would allow it to produce more than this.[9]

It generated a "human" voice from the score lyrics. It came with the Real Singer II technology. Like UTAU a new voice was possible to be created from your own. However, if you downloaded the software several "free" voices could be downloaded from Myriad's website, some capable of multilingual results and others were made just for one language. Updates were also free of charge, however the latest version is 3.2.

External links

Vocal Synthesis Tool UTAU[]

Filoyo Utau-interface

Vocal Synthesis Tool UTAU (歌声合成ツール UTAU) is a voice synthesizer program currently available for Windows and Mac OS X systems (the Mac version being named UTAU-Synth), it was developed by Ameya/Ayame. UTAU is a shareware vocal synthesizer program that allows users to create and distribute their own voicebanks. And is viewed as a well-supported alternative to more expensive software that share the same abilities.

UTAU has the advantage of having its development occur at a faster pace. It has plug-in support and users have made a number of plug-ins that greatly improve the software's handling and experience. This support was established fairly early in the software's existence, whereas VOCALOID did not gain this ability until VOCALOID3 in late 2011 and even now it only offers a limited access to source code and plug-in support. Therefore the plug-ins for UTAU can often prove invaluable to users as they can effect the software's results and quality greatly.

Some UTAU voicebanks have been put out as "real" VOCALOIDs, such as the April Fool's joke origins of Kasane Teto. Songs using both UTAU and VOCALOID are also not unheard of. Some users have also began to enforce their copyright ruling over their voicebanks; UTAU or fanmade VOCALOIDs who are guilty of plagiarizing an UTAU's name or using a voicebank without permission risk violating UTAU software agreements and voicebank copyright ownership.

History

UTAU, meaning "to sing" in Japanese, has its origin in "Jinriki VOCALOID" (人力ボーカロイド, "Manual VOCALOID"), the act of re-editing an existing singing voice, extracting tones as WAV files, and reassembling them. In December 2007, Ameya/Ayame (飴屋/菖蒲), using LOLI.COM's voice samples, released a beta software called Loliedit featuring a simple voicebank called "Loline Com" (a pun on the original voice provider and the "ne" particle originally used on Crypton's products). It features a simple interface with a piano roll, has limited mora (Japanese syllables) and works with a primitive beta engine (or "resampler") later updated on UTAU. This beta software can still be downloaded from Ameya/Ayame's website. In March 2008, Ameya/Ayame released a free, advanced support tool to aid a "Manual VOCALOID" process, called UTAU. In later years, 2010, user feedback and suggestions, as well as other such ideas, lead to the creation of triphone ("VCV"; vowel-consonant-vowel) voicebanks; VOCALOID did not gain this capability until 2011 when VOCALOID3 was released.

The program comes with a default voicebank of 142 samples of Japanese syllables generated from the default voice of A-QUEST's text-to-speech software AquesTalk. Any user can load their own voicebank into UTAU to use. However, without the explicit permission of the voice donor, it is a violation of copyright laws. Those laws protect the rights of any vocalist who may not wish for their voices to be used within the program, such as celebrities. Any music made through this program can be used in the commercial sector. UTAU can be downloaded for free from the home page. It will not run properly on computers which do not support Japanese text or AppLocale.


UTAU is one of the few programs able to convert VOCALOID data files for its own use. It saves data in the .UST (UTAU Sequence text) format and is capable of converting .VSQ files to .UST. However, .UST files itself do not hold as much data as the VOCALOID engines' VSQ or VSQX file extensions, and UTAU does not try to convert many things into even its rough equivalent, only placing the notes. As a result, loss of data may occur. It currently does not support the VOCALOID5 extension, VPR.

External links

OpenUtau[]

Filoyo Openutau

OpenUtau is an unofficial, open-source successor to UTAU that was created due to UTAU not receiving any updates since 2013. It is written in C#, and the lead developer is StAkira. It does not come with any resamplers presintalled. It has official support for many different languages, such as English, Chinese, Japanese, Korean, and more. OpenUtau is available on GitHub under the MIT license for Windows, macOS, and Linux systems.

VOCALINA[]

Filoyo Vocalina
VOCALINA (보카리나) is a "text to speech" singing synthesizer and DAW for personal music-related content. It was developed in October 2011 by TGENS Co., Ltd. On September 25, 2017, it was announced by the CEO (Kang Woo-Mo) that VOCALINA's Service would be terminated on October 1, 2017. Contrary to this, on the 29th of September in 2017, the CEO announced they would work to extend the service for one more year, as thanks to the community for their encouragement and support. Additionally, VOCALINA would be free to use until the service expired.

In terms of the quality of singing results, the engine produces a large amount of noise, despite the noise reduction engine implemented into the program. This is likely due to the synthesis method itself being Diphonetic Synthesis, a type of concatenative speech synthesis notorious for sonic glitches. Many users of singing synthesizers are familiar with this type of synthesis, as it is used in singing synthesizers like UTAU.

History

VOCALINA Studio was first revealed before it was released with some interest-building events. After the first BETA 1.1 release, there were contests for songs and art that built the community. The community activities took place largely on Cafe Naver , but has since moved to another page on the same site. Once VOCALINA had been upgraded to version 2.0.16, the BETA got dropped from the name; it was now considered stable. The most recent version is 2.3.2, only featuring the vocalist Khylin. The program is Korean and relies solely on Hangul (Hanja isn't supported) input when using the singing synthesis. It has similar Parameters to that of many Piano Roll-styled MIDI Editors such as Pitch, Velocity, and Dynamics. VST effects like Reverb (REV), Equalizer (EQ), and Echo (ECO) are also listed as Parameters for mixing convenience. Select VSTs bundled within the Studio can be used within the VST Editor. Since it is a DAW in addition to a singing synthesizer, one is able to mix the vocals in real time.

The program launched with VORA, an idol living under cover as a high school girl. When SeeU, a Korean VOCALOID created around the same time VORA was released, there was brief rivalry between the two products. However, since then VORA has performed alongside SeeU in many songs,[10] such as the single "Abandoned" being performed LIVE at one of SeeU's concerts.[11]

Normally, in order to use Khylin, a user must buy a "ticket" from the VOCALINA website, which grants them usage of her voice for a month at minimum, and a year at maximum. After the period has ended, the user needs to renew the service by buying another ticket if they want to continue using Khylin.

  • Vocals:
    • Choi Bora 최보라 (known by her stage name VORA 보라) was the first VOCALINA vocalist, and can be used for free in VOCALINA Studio, no purchase necessary. She was retired[12] due to being incompatible with the new 2.3.0 update of the engine, but is still available with the 2.2.0 version. Her official demo song is Dream (드림). VORA's Official Album that served as her introduction demo is called "Kiss My Life", like her webcomic of the same name. It is available to be listened to here.

      She is 17 years old, 160 cm in height and 43 kg in weight. Her birthday is March 20 and has an AB blood type. Her vocal range is C4~F5 with a tempo of 70~180 BPM.

      Her favorite activities are singing, texting, and making phone calls. Her hobbies are listening to music and solving puzzles. Her "special skills" are organizing and managing blogs. She dislikes loud noises and raw ingredients. She is the only child in her family. She has a webcomic about her rise to stardom called Kiss My Life by Taeyang. (It's written in Korean.)
    • Khylin (formerly KHylin) 카일린 is the second VOCALINA vocalist. Her demo was first revealed on August 17, 2012 and in the video, her name is stylized as "THE KHYLIN". You can hear the demo here, but there is no indication of the song having a title. Her voice provider has sung with VORA before development, a notable duet is here. Khylin comes with every 2.0 version of VOCALINA Studio, but she is only being offered for free in version 2.3.2.

      She is 16 years old and her birthday is July 28. Her vocal range is E3~E5 with a tempo of 70~150 BPM.

      Khylin doesn't have any special information regarding her identity. This implies she does not have a place in VORA's "Kiss My Life" world and exists solely as an isolated vocalist. Her voice provider was revealed to be Hyo Bae Park.
External links


Examples of usage
  • "승인 (Seung-in)" by ??? ft. VORA & SeeU
    Favicon-ytYouTubeFavicon-bbbilibili - YT version set to privte
  • "GET OUT BOY" by 5일 ft. UNI & Khylin
    Favicon-ytYouTube
  • "ENJOY THE PARTY" by 5일 ft. VORA
    Favicon-ytYouTube
  • "Abandoned" by Sesilin (세시린) ft. VORA
    Favicon-ytYouTube

Singing Voice Synthesizers (Samples Instrument)[]

Realivox[]

Realivox - The Ladies Realivox - Blue
Realivox is a vocal synthesizer by Realitone. There are two software packages for Realivox entitled The Ladies and Blue. The software used to run the voices are powered by Kontakt which has a full version which goes by the same name, or Kontakt Player which is free but comes with less features. Kontakt runs on Windows and Mac meaning that Realivox does as well.
History

The Ladies consist of 5 vocals :

  • Cheryl; A voice described as airy and pretty that is perfect for ethereal film cues.
  • Teresa; Described as a Soprano opera diva.
  • Patty; A voice suited for pop and ethnic music.
  • Julie; For full range songs.
  • Toni; For smooth R'n'B songs.

The voices have 30 multi-sampled articulations, being Oo, Ah, Ee, Oh, Ey, Hmm, Mmm, La, Bah, Bee, Boh, Boo, Buh, Bop, Bow, Bah Fall, Dah, Dee, Doh, Doo, Duh, Boom, Bom, Hey, Ha, Ho, Hoo, Me, Shoo, and Yeah. On top of that the voices have true sampled legato. These voices can also be stacked to create choirs as well as solos.

The Ladies software comes with Kontakt Player in its installation.


Blue comes with a single voice that comes with 12,000 vocal samples. Each of the six vowels has thirty-two samples accompanying it. This is because when pronouncing different consonants before or at the end of different vowels, the mouth will open and close differently based off of what vowel has been pronounced. This reduces the amount of incorrectly sounding pronunciations and replicates sung English more accurately.

Like with The Ladies, Blue also has true sampled legato, but it also has polyphonic legato.

There is also an Ensemble Mode which allows the user to switch between 3 voices that comes with the package without the need to switch between tracks. This comes with settings for volume, pan, tuning, timbre and offset for each voice that is activated.

Blue produces a crystal clear sound.[13]

External links


Examples of usage

The Ladies

  • "How We Created "Mmmquiring Minds" with Realivox" by Realitone ft. The Ladies
    Favicon-ytYouTube
  • "Mmquiring Minds" by Realitone ft. The Ladies
    Favicon-scSoundCloud
  • "Walking Through a Dark Town" by Realitone; Frank Raschke ft. Cheryl
    Favicon-scSoundCloud

Blue

  • "Realivox Blue Walkthrough" by Realitone ft. Blue
    Favicon-ytYouTube
  • "Blue Demos" by Realitone ft. Blue; Kontakt
    Favicon-scSoundCloud
  • "The Wonderful Blue" by Realitone; Man Parrish ft. Blue
    Favicon-scSoundCloud

Symphonic Choirs[]

Filoyo QLChoirs-xlarge
Symphonic Choirs is a choir synthesizer produced by EastWest/Quantum Leap, able to recreate the effect of an entire choir for any song. It is popular with indie musicians.
External links

Ethera (from Zero-G): WORK IN PROGRESS[]

Filoyo Software-icon
Ethera is a : WORK IN PROGRESS
History

PRODUCT_History : WORK IN PROGRESS

External links

MAIDLOID[]

Filoyo AcmeIku ss 02
MAIDLOID (メイドロイド) is a Text-to-Speech/Phrase Editor synthesizer developed and distributed by StudioS, and it was released December 27, 2007. The purpose was to create erotic sounds using 50 Japanese consonants along with several hundred voice clips and breathing sounds to sound like a recreation of an eroge.

Some users have attempted to make the vocals 'sing' and even proved successful by importing the Japanese syllables into Vocal Synthesis Tool, UTAU. However, the H-Voice Characters are not meant for musical purposes and – like with the Macne Series – users would need to own the full versions of the software to legally be able to use this in UTAU, as the Lite version of Iku and RIYONA only provide a small number of Japanese consonants and cannot make a fully working voicebank.

When ported to the UTAU software and used for 'singing', the Lite versions of Acme Iku & RIYONA have poor clarity and quality due to the demo files' compression and is difficult for even native Japanese speakers to understand. Once bought, the full versions of Acme Iku's and RIYONA's voice qualities are much better.

History

It is a voice synthesizing engine made to resemble VOCALOID, but it is made to create erotic sounds instead of singing. It can voice the 50 Japanese consonants along with several hundred voice clips and breathing sounds to sound like a recreation of an eroge. There are a few extras, such as the ability to insert customized special sound effects and a few BGMs. Acme Iku's voice was made to be used in games and animation and those works can be sold. However, it is illegal to distribute her full voicebank. There are exceptions with the Lite version.

On December 27, 2017, for Acme Iku's tenth anniversary, StudioS announced that there would be a new character added to the Character H-Voice Series named RIYONA,[14] and Acme Iku's page on DLsite was updated as well, adding contents from the physical CD extras to the MAIDLOID editor. Anyone with a valid purchase of her on DLsite can download the updated version for free.

On February 10, 2018, the new H-Voice character, RIYONA, was released onto the DLsite and DMM for 1,620円. Her files contained over 900+ new sounds and phrases, most of which consists eroge sounds. StudioS stated that RIYONA was made with the idea in mind of using her for not only H-Games/Eroge Games but also fighting games as well, with a whole folder of voice lines dedicated to those types phrases. RIYONA uses a brand new software built by StudioS just for her vocal usage, making her the first H-Voice Character so far to not use the MAIDLOID Editor.


In 2011, a Japanese producer by the name of Paraoka, created three songs using the software and released a doujin album, "Training Diary".[15]

  • Vocals: The demo versions of Iku and RIYONA only include a small portion of their original libraries, their sound quality manually degraded to make the usage of the demo files less desirable, provoking customers to buy the full versions.
    • Acme Iku (阿久女イク; (あくめいく)) is the first character of StudioS' H-Voice Character Series and uses the Flash made MAIDLOID software. Her voice is provided by the Japanese voice actress, Sayaka Kazuna (計名さや香; Kazuna Sayaka) and her illustration is by Shin. Her concept is a housemaid android who is meant to be a lewd and cute character. She is 1 years old, 145 cm in height with "NAISHO! (ナイショ!)" as her weight. Her vocal range is said to be 'good' with a lolita treble, her tempo is "H-mode" and her genres are Hentai and Ecchi.

      Her name Acme in kanji is the choice of corresponding pronunciation and has no valid literal interpretation. In katakana 'Acme' in Japanese literally means "orgasm." And Iku is a common erotic moan in Japan, meaning "to come, to orgasm." As such, her name translates into a sexual innuendo.

      Iku is featured as a playable character in the Japanese PC doujin game Super Strip Fighter IV (スーパーストリートファイターⅣ). Her voicebank is used, and some of her attacks consist of use of her hair, she is able to use her hair accessory and throw or swing it at her opponents.[16]
    • Princess Knight RIYONA (姫騎士リヨナ) is the second and newest character of StudioS' H-Voice Character Series labeled as KNIGHTROID. She uses software made for Windows, which subsequently shall be referred to as the KNIGHTROID Editor. Her voice is provided by an adult game Japanese voice actress, Miki Honda (本多未季; Honda Miki) and her illustration is by Shin.

      Her concept is a princess knight, meant to be proud and a strong ruler, but she's also more of an older sister figure, unlike Iku. Not much is currently known about her as a character, and little is known about her voice provider other than her few roles in some eroge games.

      RIYONA is featured as a playable character in the Japanese PC doujin game Super Strip Fighter 5 (スーパーストリートファイター5). Her voice provider provided the samples used, and some of her attacks consist of use of her using her sword as well as the ability to use lightning attacks.
External links
Examples of usage
  • "Chaosmaid" by Paraoka ft. Acme Iku
    Favicon-nnNiconico

Macne Series[]

Main article: Macne Series

Mixed Voice Synthesizers (Singing + TTS)[]

CeVIO Creative Studio[]

Filoyo CeVIO interface 6.1.22.0
CeVIO Creative Studio (pronounced che-ˈvē-ˈo) is a commercial vocal synthesizer product released on September 26, 2013. CeVIO Creative Studio received two awards in 2013, the "MicrosoftⓇ Innovation Award 2013" and the "CEDEC Award". Its demo version, CeVIO Creative Studio FREE, is available in trial form. Previously, users could not edit Parameters in this version, but they are able to now, with few restrictions.
History

CeVIO has two capabilities, speaking function and singing, both of which need to be provided by their developers.

The speaking portion offers a large dictionary of words to which the vocalists can pronounce in a variety of ways and emotions. There are usually 3 different types of voices that can be cross-synthesized, or isolated to portray a single emotion. If they misinterpret kanji, phonemes can be edited. Velocity, Length, Tone, Accent, and Pitch can be edited in this mode.


The singing portion offers: Amplitude Timing, Pitchbends, Volume, and Vibrato rate and Depth. Gender is also available to be edited on the side bar of the piano roll. A recent development of the engine also introduced phoneme-input, previously not used. The ability to add and edit phonemes manually allows some "Engrish" words to be made, or allows smoother pronunciation of borrowed words. In addition to phoneme editing, up to 5 hiragana/katakana characters can be added onto a single space. This is a unique feature of CeVIO.

  • Parameters:
    • Amplitude Timing allows for phoneme editing. Sounds often are split into 6 segments in this section, and the beginning of the note is highlighted with a pink line. Dragging the last purple line back to the pink line of the next note will often help with vowel transitions. If there are lines between notes covering a blank space, this characterizes a breath or static sound. It cannot be deleted.
    • Pitch allows for editing of Pitchbends and addition of Portamento, which most vocals can produce on their own. Many advise using the Line Tool to draw pitchbends, as the Pencil Tool is very sensitive.
    • Volume can be used for dynamics, however it uses a very intense scale and minor adjustments to loudness can cause peaking/clipping, and lower volumes can cause an influx of static. Adjusting the volume of the track may be preferable.
    • Vibrato rate controls how fast vibrato cycles will happen. Within the program, the user isn't supposed to draw the vibrato, but rather a diagonal line/curve upwards to indicate the vibrato is accelerating, or a diagonal line/curve downwards to indicate the vibrato is decelerating.
    • Vibrato depth controls how deep the pitch of the vibrato cycles will be. In other words, the intensity of the vibrato. This is edited the same way as Vibrato rate, with the diagonal lines or curves.
  • Voices:
    • Sato Sasara is a speaking and singing CeVIO product. She is 16 years old.
    • Suzuki Tsudumi is a speaking CeVIO product. She is 17 years old and friends with Sasara.
    • Takahashi is a speaking CeVIO product. He is 20 years old and friends with Sasara.
    • ONE is a speaking and singing CeVIO product. She is the second installment of the - ARIA ON THE PLANETES -" project.
    • IA is a speaking and singing CeVIO product. She was the first installment of the - ARIA ON THE PLANETES -" project and a VOCALOID3 character.
    • Akasaki Minato is a singing CeVIO product and part of the Color Voice Series. He is 25 years old.
    • Midorizaki Kasumi is a singing CeVIO product and part of the Color Voice Series. She is 27 years old.
    • Ginsaki Yamato is a singing CeVIO product and part of the Color Voice Series. He is 50 years old.
    • Kinzaki Koharu is a singing CeVIO product and part of the Color Voice Series. She is 52 years old.
    • Shirosaki Yuudai is a singing CeVIO product and part of the Color Voice Series. He is 20 years old.
    • Kizaki Airi is a singing CeVIO product and part of the Color Voice Series. She is 18 years old.
    • HAL-O-ROID is a singing CeVIO product and based on the deceased Enka singer. He is 37.3 years old.
External links

Festival Speech Synthesis System[]

Fl Software-icon
Festival Speech Synthesis System is a free (libre) / open-source software speech synthesizer developed at the Centre for Speech Technology Research (CSTR) of the University of Edinburgh. In order to enable Festival to sing, there is a plugin required called Flinger (Festival Singer) developed at the OGI School of Science and Engineering that was released in 2001. Free(b)soft has contributed Czech diphone voices and an accessible editor similar to the VOCALOID editor.

Flinger has been used by users in the western world for creating English demo songs.

Most voicebanks for Festival are available under a free license, whereas VOCALOID (and UTAU) use proprietary licenses. However, the voicebanks that come with Flinger are still proprietary. The experimental LMMS plugin Singerbot uses Festival for singing releases.

External links


Voiceful : WORK IN PROGRESS[]

Main article: Voiceful


Related Text-to-speech Synthesizers[]

Gynoid Talk[]

Main article: Gynoid Talk

Galaco Talk : WORK IN PROGRESS[]

Main article: Galaco Talk

Speech Synthesis (FL Studio plugin)[]

FL Studio FL Chan
FL Studio, formerly known as FruityLoops, is a digital audio workstation (DAW), developed by Belgian company Image-Line released December 18, 1997. FL Studio was originally the creation of Didier Dambrin, who is now the lead programmer responsible for the overall development.

In the western VOCALOID (and also UTAU) scene, FL Studio is among one of the more popular DAWs used by some producers, it is not as widely used in Japan since they also have access to other popular DAW systems as well.[17] It is mainly used to put all the components of the music together. FL Studio also recognizes VOCALOID as a Rewire plugin, and is able to actively feed their voice samples together, making the software overall more attractive to VOCALOID users.

History

FL Studio features a fully automatable workflow centered around a pattern-based music sequencer. The environment includes MIDI support and incorporates a number of features for the editing, mixing, and recording of audio. Completed songs or clips may be exported to Microsoft WAV, MP3, and the OGG Vorbis format. FL Studio saves work in a proprietary *.flp format, inadvertently sharing the same extension as a filetype created by Adobe Flash CS3.

The program is acclaimed for its professional DAW features at an attractive price point. Scoring to video is possible using the video-player plugin, but there is no support for traditional music notation.

FL Studio is written in Borland Delphi and thus until recently, there were no plans to develop a native Linux or Mac OS X port. Recent changes, however, has gleaned on 64-bit support and a self-packaged ASIO driver for low end sound cards. As of September 2015, the Image-Line website indicates that the company is heavily developing a native mac version which can currently be tested by all registered forum users (alpha 0.4); similarly, several FL studio plugins are in development as native versions and can also be tested (alpha 11 currently). In May 2018, Image-Line finally distributed version 20 of FL Studio, which has native MAC OS support.


Since the software was designed for a wide variety of uses and aimed to appeal to users of all music levels, many amateur musicians also use this software, and for the western VOCALOID fandom has proven to be popular because it allows even amateurs a chance to remix or make new songs that can compete with professional works. It has many tools that have abled to them to create work on level with more professional VOCALOID users and has been considered one of the more attractive software suites for purchase for this reason by new fans.

Some producers however prefer to prerender the voices from VOCALOID into WAV files (which is also a necessary step for UTAU-based voicebanks) and insert it in FL Studio as an audio sample, instead of using ReWire. Alternatively, a whole prerendered voice in a WAV file can be articulated into sliced samples, for easy access to note segments and remixing, as well as for conserving system resources.

  • FL-chan is a mascot inspired by the success of VOCALOID2 character Hatsune Miku. Didier Dambrin made a comment in an Internet forum that he also wanted a moe character like Habanero-tan for FL Studio. A Japanese artist by the name Uruido took the request and created a moe anthropomorphism of FL Studio called FL-Chan, and she became the official character of the DAW software since the release of ver. 8 in March 2008. Image-Line has adopted the character as a form of mascot, providing an array of purchasable merchandise and a dedicated page on the Image-Line website. You can find that here.

FL-chan's voice has also been successfully transferred into UTAU. Much like Acme Iku, users need to own the original software to legally use her voice. It is not the most realistic of the UTAU voicebanks as its results came from a Vecoder and were made by artificial means.

She has been featured in various VOCALOID videos, Meiko's Various Feelings being one.

External links


Megpoid Talk[]

Main article: Megpoid Talk

Talk Ex[]

Main article: Talk Ex

VOICEROID[]

Main article: VOICEROID

VTalk[]

Main article: VTalk


Miscellaneous : WORK IN PROGRESS[]

Linux MultiMedia Studio[]

Lmms logo Software-icon
Linux MultiMedia Studio or LMMS, is a free digital audio workstation (DAW) alternative to proprietary software such as FL Studio, Cakewalk, and GarageBand released March 7, 2015. It is distributed free of charge, unlike FL Studio and Cakewalk, and it is also free as in freedom, meaning anyone can see or modify the source code. It is available for various distributions of Linux, and despite its name, also for BSD, Microsoft Windows, and Mac OS X (with some limitations).
History

LMMS allows anyone to easily create loops using a variety of predefined synthesizer plugins, as well as external VST plugins. (There is a VST plugin version of the VOCALOID engine.) Effect chains can be added to almost any instrument through the use of LADSPA effects. The software also features the ability to import existing FL Studio project files.

Like the VOCALOID Editor, notes are entered using a piano-roll interface. For beats and basslines, LMMS also includes a transport bar similar to that of FL Studio. This editor can be changed to a piano-roll editor if more sophisticated beats and music loops are desired. LMMS can also sing if the experimental plugin Singerbot is installed.

Each instrument can be assigned to an "FX Channel". The FX Mixer provides a single area to change the volume and panning, as well as the effects chains, of all instruments assigned to a specific FX Channel at once. This works well for adjusting the master volume of a drumset (which is normally made up of several different samples, each sample corresponding to a different portion of the drumset; e.g., snare, bass drum, cymbal, etc.).

  • Included synthesizer plugins: LMMS includes various synthesizer plugins out-of-the-box. There are also a variety of community-developed presets for each synthesizer plugin, making it easy for beginners to pick a sound they like.
    • AudioFileProcessor provides a simple interface to use the piano-roll or beat editor with a sound clip. LMMS includes several instrument samples, as well as a variety of drum samples, represented as AudioFileProcessor presets.
    • BitInvader provides a highly customizable wavetable synthesizer.
    • Kicker is a kick/bass drum synthesizer.
    • LB302 Monophonic imitation of Roland TB-303.
    • Mallets synthesizes tuneful percussion sounds.
    • Monstro provides more advanced triple oscillator mixing
    • Nescaline emulates the Nintendo Entertainment System® APU.
    • Organic provides an organ-like synthesizer with adjustable "stop" levels for each virtual "pipe".
    • FreeBoy emulates the Nintendo Game Boy APU (audio processor unit).
    • SID emulates the Commodore 64 MOS Technology Sound Interface Device chip, with support for 6581 and 8580 variants.
    • TripleOscillator provides three versatile oscillators that can be mixed together in different ways to produce all kinds of sounds.
  • Vibed models vibrating string instruments.
    • Watsyn provides more advanced wavetable synthesizer
    • Embedded ZynAddSubFX engine with various presets.
External links

iVoxel[]

Filoyo IVoxeliPhone1Small
iVoxel is a singing vocoder with a vocal sequencer developed by VirSyn. It is a built in application for the iPhone and iPad. It has many features beyond a sequencer.

Like other VirSyn products it is based on the same ideas and concepts that IVOCALOID is based on. But like previous software, such as Cantor, it is not intended to be a realistic singing voice.

External links


SugarCape[]

Filoyo Sugarcape
SugarCape is a vocal synthesizer developed by sota, only available for use on Mac OS X Snow Leopard 1.6.8 or above. The newest version now called SaltCase Alpha 0.0.2 uses a tripitch voicebank, and it is assumed that it has been given a sort of "prefix.map", similar to that of an UTAU voicebank. This function allows transitions between pitches to be more natural, rather than giving in to the harsh distortion of the sample as it goes deeper.

In addition to the preset voice in SugarCape, one can add a voice to be compatible with the program. This has been done with Nagone Mako, and several other UTAU voicebanks. Some have thought of importing the Macne Series into SugarCape.

External links


Examples of usage
  • "sugarcape" by talc; なんてこったい on NND ft. N/A
    Favicon-ytYouTube
  • "てくてく~地べたのスカイウォーカー~" by 呑気大王 ft. SugarCapePro
    Favicon-nnNiconico
  • "DESTINY" by ねこ伯爵P  Vocalist missing!
    Favicon-nnNiconico


References[]

Navigation[]

Advertisement