On December 1, 2014, when asked about VOCALOID4 updates, Noboru responded that they wished to update all of their VOCALOID3s to the new engine but had no specific schedule at the time.[1] However, it was noted that Megpoid would be the first to be worked upon.[2]
On October 7, 2015, Noboru posted a SoundCloud sample that demonstrated GUMI's Japanese voicebanks in the VOCALOID4 engine. She gained growl samples, improved XSY capabilities, and some tweaks.[3] GUMI was confirmed to receive 5 new voicebanks along with updated versions of her VOCALOID3 voicebanks for a total of 10 voicebanks. Her release date was announced to be November 5.[4]
On November 11, trial versions of her voicebanks became available.[5]
This package acts as both a update to the old V3 Megpoid and V3 Megpoid - Native packages, while also providing an extension to the original voices. The product contains 5 new natural-sounding tones. Improvements to the older vocals make them easier to use for new producers, while the large selection of variations allows producers to find a suitable tone more easily. The complete package also suits more advanced producers due to the high XSY possibilities, while keeping the process relatively simple.
The package also contains demo song data and illustrations.
Vocal traits as noted:
All 10 voices have the same tempo and range. Changing tone mid-song should be less noticeable, particularly if the right set of voicebanks are used to bridge the gaps between voices.
All 5 older voices are newly recorded.
The 5 updated vocals have even more triphones than the V3 versions.
All 5 of the V3 Megpoid vocals have been reworked almost from scratch.
Noise issues related to certain sounds present in the V3 Megpoid vocals are fixed.
Quality of the 5 VOCALOID3 vocals has been improved as a overall result.
GWL is capable of being used with all 10 vocals. Megpoid in particular will create a rich, expressive sound when using this feature.
Software issues as noted:
The original 5 voices may not produce the same sounds as before due to being new recordings.
May be some loss of power to "Native", "Sweet", "Whisper" and "Adult". This is due to the improvements of word connections, which improve the formation of words but slightly softens them in the process. "Power", however, should remain unaffected.
One of the packages criticism is just on its overall value;
As with the VOCALOID3 package, the additional tones don't offer any major genre advantages, as all share the same tempo and range. This became even more of an issue when XSY was opened up in Ver.4.3.0 of the VOCALOID4 engine as there were now more impacting vocals available for XSY then the 10 Megpoid vocals had between them.
At the time of release, the overall package combined can work out to be one of the most expensive releases for VOCALOID that had ever been made. The total price was more then double many other Vocaloid releases both released during Vocaloid3 and Vocaloid4. This meant for the same price of Megpoid V4, you could potentially buy at least two other Vocaloids instead, even ones with XSY themselves.
For a basic exaplaination for how the main 5 voicebanks work together without the additional newer 5 voicebanks, see the previous release V3 Megpoid page. The 5 new voicebanks expand on that packages previous capablities and allow for even more realism and added expression.
All 10 vocals are capable of XSY.Due to the fact every version comes with at least 1 additional voice, V4 Megpoid will have access to the XSY function no matter what package the user buys.[7]
Voices have only been set to XSY between their respective pairs (i.e. Native and NativeFat) easily; XSY between vocals outside of their respective pair (i.e. Native and Power) may have unpredictable results. Users need to be careful of this when mixing the 10 voicebanks.
The Megpoid V4 package especially makes full use of the XSY feature. Despite the unpredictable results of mixing the vocals outside of their respective pairs, combining them will effectively create an effect much like having an additional voicebank entirely, allowing a large number of additional tones for GUMI.[8]
In total the number of combinations for XSY will create the equivalent of up to 90 additional possible variations for GUMI's character for VOCALOID4. Including the original 10 voicebanks used within XSY, the package offers up to 100 possible theoretical voicebank variations for her character.
Despite the number of variety of tones the package creates, at least 20 of those (the 5 intended XSY pairings) do not vary a great deal compared to the other 80 due to how similar they are.
Compared to Arsloid, there is more stability within any of the 90 XSY combinations. Since Megpoid V4's vocals are full voicebanks and not "Extended Libraries", the results are much smoother, HQ and predictable. Any of the 10 vocals and their XSY combinations are fully capable of acting as main vocals and supporting roles equally.
Compared to EVEC, the vocals are all within Vocaloid itself, as a result, there is less chance of a clash with Vocaloid itself, which is by-product of EVEC working independently from Vocaloid.
From Ver.4.3.0 of the Vocaloid engine, a XSY group "Internet" was added to Vocaloid. All vocals within the "Internet" group can XSY with each other. This vocal release is part of this group. If a User owns one or more vocals within the "Internet" group, XSY between them will open up as a result of owning this release.
The "Internet" group is currently the largest XSY group at 29 voicebanks. If old versions that have since been re-released are removed, there are currently 21 unique voicebanks. These are; "Gackpoid Native", "Gackpoid Whisper", "Gackpoid Power", "Lily Native", "Lily", "Cul", "Kokone", "Gachapoid", "Chika", "Otomachi Una Sugar", "Otomachi Una Spicy", "Megpoid Native", "Megpoid Sweet", "Megpoid Adult", "Megpoid Power", "Megpoid Whisper", "Megpoid Nativefat", "Megpoid Mellowadult", "Megpoid Powerfat", "Megpoid Naturalsweet" and "Megpoid Softwhisper".
In total if user was to own every VOCALOID3 and VOCALOID4 XSY capable INTERNET co., LTD vocal, this offers the equivalent of 812 additional voicebanks achievable via XSY. The total tone variation offered by the combined packages comes to a theoretical 841 voicebanks in total.[9]
Note that Users of the updated XSY list have to understand that as confirmed by Internet co., Ltd themselves, depending on the combination of the two vocals a level of extra unwanted noise may be generated.
Another note worthy detail on V4 and later vocals compared to V3 vocals within this group is that Internet Co., Ltd. increased the number of triphones from Chika onwards. Thus, many newer vocals do not match sound-for-sound with the older pre-Chika released vocals.
Post VOCALOID4 releases were also developed with XSY in mind, so are better for XSY then the older VOCALOID3 releases.
In addition to not being built with XSY in mind, VOCALOID3 packages with only 1 vocal in their contents that were released may be even more problematic for XSY. Internet co often made multi-pack releases have a quality that made switching mid-song less noticeable between voicebanks, making them slightly better for XSY then single vocal releases.
Note that also "GWL" does not become viable for use with any V3 vocals used as the primarily vocal. This should be considered when using any vocal for XSY within the "Internet" group from V3.
One of the advantages of the 10 vocals within this package when used for a secondary role is that they can be used for "control" during XSY. The slight adjustments between the vocals means that if the user is not quite satisfied with how one Megpoid vocals impact is, they can swap it out for another within the Megpoid V4 range since all have the same tempo and vocal range. This gives the release a high utility purpose compared to most vocals within the "Internet" group.
While Gumi's V3 Megpoid vocal can do this also, the Megpoid V4 package is superior for this role due to its increased capacities with XSY and increased voicebanks.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
NATIVE This is the latest updated version of the VOCALOID2Megpoid vocal and carries the same overall intended tone of the original. It acts as the "basic" or "default" tone of the entire Megpoid V3 package.
Vocal traits as noted:
Improves the Megpoid vocal even more so; further improving the voice than V3 Megpoid had done.
Phonetic notes as noted:
Word connections generally improved.
Voicebank sample
Megpoid V4 Native
Cross-Synthesis as noted:
XSY with "NativeFat" much more comfortably than the other 8 voices.
Once the 2 "Native" vocals are installed there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Native x NativeFat, NativeFat x Native) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175 BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
NATIVE FAT A variation of the "Native" voice with a thicker throat sound.
Vocal traits as noted:
The thicker throat sound allows her to handle higher ranges easier without sounding strained.
Voicebank sample
Megpoid V4 NativeFat
Cross-Synthesis as noted:
XSY with "Native" much more comfortably than the other 8 voices.
Once the 2 "Native" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Native x NativeFat, NativeFat x Native) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
ADULT This is an update of the the V3 Megpoid vocal.
Phonetic notes as noted:
Word connections generally improved.
Voicebank sample
Megpoid V4 Adult
Cross-Synthesis as noted:
XSY with "Mellow Adult" much more comfortably than the other 8 voices.
Once the 2 "Adult" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Adult x Mellow Adult, Mellow Adult x Adult) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175 BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
MELLOW ADULT As its name suggests, it has a a more relaxed tone than Adult.
Voicebank sample
Megpoid V4 MellowAdult
Cross-Synthesis as noted:
XSY with "Adult" much more comfortably than the other 8 voices.
Once the 2 "Adult" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Adult x Mellow Adult, Mellow Adult x Adult) are already set up as vocalsto use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
POWER This is an update of the the V3 Megpoid vocal.
Phonetic notes as noted:
Word connections generally improved without loss of power to the vocal.
Voicebank sample
Megpoid V4 Power
Cross-Synthesis as noted:
XSY with "PowerFat" much more comfortably than the other 8 voices.
Once the 2 "Power" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Power x PowerFat,PowerFat x Power) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
POWER FAT A variant of the "Power" vocal with a thicker throat sound to it.
Vocal traits as noted:
Holds the descent of notes better than "Power".
Holds a "Dark" tone of voice.
The thicker throat sound allows her to handle higher ranges easier without sounding strained.
Voicebank sample
Megpoid V4 PowerFat
Cross-Synthesis as noted:
XSY with "Power" much more comfortably than the other 8 voices.
Once the 2 "Power" vocals are installed there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Power x PowerFat,PowerFat x Power) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175 BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
SWEET This is an update of the the V3 Megpoid vocal.
Phonetic notes as noted:
Word connections generally improved.
Voicebank sample
Megpoid V4 Sweet
Cross-Synthesis as noted:
XSY with "Natural Sweet" much more comfortably than the other 8 voices.
Once the 2 "Sweet" vocals are installed there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Sweet x Natural Sweet, Natural Sweet x Sweet) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175 BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
NATURAL SWEET This has the "cute" traits of Sweet, but has a much more natural tone.
Vocal traits as noted:
This vocal suffers less issues related to how the original Sweet vocal sounded "forced" or artificial.
Voicebank sample
Megpoid V4 NaturalSweet
Cross-Synthesis as noted:
XSY with "Sweet" much more comfortably than the other 8 voices.
Once the 2 "Sweet" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Sweet x Natural Sweet, Natural Sweet x Sweet) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175 BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
WHISPER This is an update of the the V3 Megpoid vocal.
Vocal traits as noted:
Improvements to voice to prevent waning while going up the octaves.
Phonetic notes as noted:
Word connections generally improved.
Voicebank sample
Megpoid V4 Whisper
Cross-Synthesis as noted:
XSY with "Soft Whisper" much more comfortably than the other 8 voices.
Once the 2 "Whisper" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Whisper x Soft Whisper, Soft Whisper x Whisper) are already set up as vocals to use.
Optimum Range: F2 ~ A4 Optimum Tempo: 60 ~ 175 BPM Total Tempo (min-max): 115 BPM No. of Keys: W ~ 17, B ~ 12, Total ~ 29
Package details as noted:
SOFT WHISPER This is a much more breathier tone of voice than "Whisper".
Vocal traits as noted:
The voice is designed to make GUMI sound "smaller" then the normal Whisper voice.
Voicebank sample
Megpoid V4 SoftWhisper
Cross-Synthesis as noted:
XSY with "Whisper" much more comfortably than the other 8 voices.
Once the 2 "Whisper" vocals are installed, there will be listed 4 complete vocals. The additional voices are because both XSY options for XSY (Whisper x Soft Whisper, Soft Whisper x Whisper) are already set up as vocals to use.