Vocaloid Wiki
Advertisement
🛠 This subject is work in progress.
Please bear with us while improvements are being made and assume good faith until the edits are complete.
For information on how to help, see the guidelines. More subjects categorized here.
🛠
📰 This subject requires intervention. What is being worked on: Please note the details of this software are subject to change without notice.
For information on how to help, see the guidelines.  More subjects categorized here.
📰

VX-β was a prototype VST3/AU plug-in voice synthesis engine included within the VOCALOID β-STUDIO research studio developed by the company Yamaha Corporation and functions within various digital audio workstation (DAW) applications. VX-β was equipped with AI, deep learning technology, that uses singing voice synthesis technology currently in the research or "beta" stage.

The software was released via lottery application to eligible applicants beginning August 23, 2023.[1]

History[]

VOCALOID β-STUDIO was first teased on July 27, 2023, featuring a promotional image of the "β" (beta) Greek letter and a date of August 8, 2023 with no additional details provided.[2] Yamaha officially announced the project on August 8, revealed as a limited research studio to be made available to creators with the hope that they would use the rough prototype software provided that was still in the research stage and has unknown value, to create works that would open up new horizons in the field of singing voice synthesis. The first project, the VX-β digital audio workstation (DAW) plug-in was simultaneously announced the same day. It was announced that the VOCALOID β-STUDIO services would have further details released in late August, 2023. The date of services to end was announced to be March 31, 2024 and the VX-β DAW plugin service was announced to end on April 1, 2024.[3]

On August 22, 2023, Yamaha began accepting applications for participation in the VOCALOID β-STUDIO campaign and encouraged the creation of new music using the VX-β application. It was noted that in the future, Yamaha would recruit new applicants irregularly, and in addition to those who have won the lottery, the winning applicants who have been selected based on the answers to the questionnaire would receive the VX-β DAW plug-in, which uses singing voice synthesis technology that was still in the research stage, and would be distributed free of charge.[4][5] Music created using VX-β was subject to the individual character terms of use of each voicebank.[6] Users could freely publish within the scope of the terms. A total of nine voicebanks were announced for beta testing of the software including: prtv_0, prtv_1, prtv_2, prtv_3, Gazenβ, nagiβ, multiβ-N (which includes an additional seventeen individual voicebanks within the main multiβ-N voicebank package), Gekiyakuβ and Kazehikiβ.

Beginning August 23, 2023, the first round of recruitment began and winning applicants were presented with a download and serial codes for the VX-β plugin. Those who were not selected at the time would automatically be enrolled for the next recruitment.[7]

Development[]

The first project of the VOCALOID β-STUDIO was announced to be the release of VX-β, a DAW plug-in based on Yamaha's research-stage voice synthesis technology. It was noted that VX-β was not considered a beta version of the next version of VOCALOID products. It was an experimental product that incorporates functions and elements that propose a more futuristic world of voice synthesis. Yamaha's hope was that creators would challenge new singing voice synthesis with their own free ideas and make use of this product for their own creations. VOCALOID β-STUDIO would be available for all creators to participate. Random drawings were announced among those who apply for participation and serial codes to use VX-β sequentially would be issued to applicants. While Yamaha continues to enhance the project structure in order to deliver VX-β to as many people as possible, delivery times for new products would be limited. It was noted that after April 1, 2024, VX-β would no longer be available for activation due to the end of the usage period. Details on the application for participation would be available in late August 2023. Services offered by VOCALOID β-STUDIO would be limited to March 31, 2024.

Lottery Application[]

Applicants can apply to participate in VOCALOID β-STUDIO from the Yamaha Music Members campaign page. To apply, applicants must (1) obtain a Yamaha Music ID and (2) answer a questionnaire.

A random lottery would be held for those who apply during the period (until March 31, 2024), and the serial code and download link of the singing voice synthesis plug-in VX-β would be sent to the winners by email. It was noted that that some of the winners may be selected based on the answers to the questionnaire, in order to create many works that expand the possibilities of new singing voice synthesis with VX-β .

About the Lottery Procedure[]

The number and schedule of irregular lottery procedures and the number of winners have not been disclosed. Applicants can check the lottery status after applying from the "Application History" on the Yamaha Music Members website. Those who are not selected in one lottery would be eligible for the next lottery. The same person cannot win the lottery more than once. If applicants would like to modify their answers to the questionnaire after completing the application, applicants may complete the application procedure again. If applicants apply multiple times, only the last one would be valid. Note that Yamaha cannot answer questions about the lottery process and content.

Requirements[]

  • OS: Windows 10, Windows 11, macOS 13 Ventura
  • CPU: Haswell (4th generation) or later Intel Core series or Xeon series; Apple Silicon
  • RAM: 8 GB minimum recommended
  • HDD: At least 1 GB or more (including all voicebanks)
  • Graphic Minimum Open GL 3.2 or higher
  • Other: Audio device, internet connection (for authentication, deauthentication, software updates, etc.)
  • Monitor size:
    • Minimum operating environment: 1024×768 or higher resolution

Additional installation notes[]

Works created with VX-β can be freely published and can be used commercially. However, it was noted that participants would need to comply with the VX-β terms of use when publishing songs, and also comply with the rules of each individual voicebank character.

Usable Period[]

VX-β has a limited usable period. After March 31, 2024, the activity end date of this project, it will not start as a plug-in and cannot be used. It was encouraged that users save important work that is being worked on by exporting it onto their respective DAW software until then.

Releases[]

VOCALOID β-STUDIO logo blk

Vocal libraries released for the VOCALOID β-STUDIO VX-β engine.


Additional notes[]

Examples of usage[]

An example of solfège using VX-β technology. See also, a listing of vocal stats here.

To contribute an example- see this blog entry to download the VSQx.

prtv_0 File:Prtv 0 - Solfege.ogg prtv_1 File:Prtv 1 - Solfege.ogg prtv_2 File:Prtv 2 - Solfege.ogg prtv_3 File:Prtv 3 - Solfege.ogg Gazenβ File:Gazenβ - Solfege.ogg nagiβ File:Nagiβ - Solfege.ogg multiβ-N File:Multiβ-N - Solfege.ogg f00 File:F00 - Solfege.ogg f01 File:F01 - Solfege.ogg f02 File:F02 - Solfege.ogg f03 File:F03 - Solfege.ogg f04 File:F04 - Solfege.ogg f05 File:F05 - Solfege.ogg f06 File:F06 - Solfege.ogg f07 File:F07 - Solfege.ogg f08 File:F08 - Solfege.ogg f09 File:F09 - Solfege.ogg f10 File:F10 - Solfege.ogg f11 File:F11 - Solfege.ogg m00 File:M00 - Solfege.ogg m01 File:M01 - Solfege.ogg m02 File:M02 - Solfege.ogg m03 File:M03 - Solfege.ogg m04 File:M04 - Solfege.ogg Gekiyakuβ File:Gekiyakuβ - Solfege.ogg Kazehikiβ File:Kazehikiβ - Solfege.ogg Kanade Kanon β File:KanadeKanon - Solfege.ogg And Uge β File:AndUge - Solfege.ogg Ci-chan β File:CiChan - Solfege.ogg Kasukabe Tsumugi β KasukabeTsumugi - Solfege.ogg

New features[]

For a list of VX-βs new parameters see Parameters

VX-β offers new features including:

  • Voice: VX-β does not require downloading of individual voicebanks, as all voicebank data was included in the plug-in.
    • Note that voicebank data may be added or changed in future version upgrades.
  • Singer/Style: Typically, one voicebank represents one singer. This pull-down switches the singing style that was unique to the singer represented by a given voicebank. Switching between singing styles switches not only the pitch movement, but also the timbre of the singing voice. Selecting a voicebank would take time off of the sequence length, but switching between singing styles would be instantaneous. The exception was the voicebank multiβ-N, which contained multiple singers within it. multiβ-N uses this pulldown to switch the singer itself, not the singing style. f00 to f11 are female singers and m00 to m04 are male singers. Users are able to use the new editing tools to freely edit the accents, vibrato, rhythmic feel, and more.
  • Various Parameters: By manipulating various parameters, users can increase the expressiveness of the sound. Each parameter can be intuitively manipulated and followed in real time during playback. Any of the parameters can be changed by dragging with the mouse, entering numerical values by clicking on the numerical value section, or restoring default values by double clicking on the knob or slider.
    • Parameters include: Air, Formant, Attack, Vibrato, Power, Timing, Pitch, Fuzzy, Presence, Kero, Key, Master, Tune, & Output
  • Use of Automation: All parameters can be dynamically controlled using the DAW's automation functions.
  • Keep Voicing, Guide Tone, Visual Functions:
    • Keep Voicing Function: When the Keep Voicing function was turned on, the sound of the song position position can be continued when playback was stopped. This function allows users to make adjustments while checking the changes in voice tone and strength/weakness caused by each parameter in real time.
    • Guide Tone Function: A button that toggles on/off the guide sound for note input.
    • Visual Functions: The waveform can be displayed or hidden. By drawing a waveform of a singing voice being synthesized in real time, the pitch and volume can be visually captured. Turning it off reduces the drawing load.
  • Other Functions:
    • Breath Insertion and Cutting: If there was a gap between notes, it may be considered a rest and a breath may be automatically inserted. Similarly, when a note was separated, a breath was inserted, so if users do not wish to include a breath, users can input the lyric "っ" and it would be pronounced as a natural prompt. If users wish to force the insertion of a breath, users can do so by entering "BR" and the diacritical marks as shown in the list of special diacritical marks below.
    • Vowel Voicelessness: You can silence vowels by adding a "0" (half-width zero) to the lyrics. For target notes that users wish to silence, such as "de shi 0 ta" or "so shi 0 te," add a "0" to the lyrics.
  • Phonetic Symbols:
    • Basic Information on Pronunciation Symbols: VX-β does not allow the phonetic symbols for VOCALOID to be input as they are. When loading a .vpr file (VOCALOID5/6 sequence), the diacritics for VOCALOID written in the .vpr file are automatically converted to diacritics for VX-β. If you want to change the automatically converted diacritical marks, you can use the VOCALOID5/6 Editor's diacritical marks field for VX-beta You can directly write the diacritics for VX-β by prefixing them with a $. For example, To pronounce "fu" = [f u], enter $f $u in the VOCALOID5/6 diacritical marks field.


Gallery[]

Media Gallery[]


Tutorials[]


The reference manual for this program can be found here.

References[]

Navigation[]

Advertisement