The online demonstrator is free to use, but will only generate tracks up to 5 minutes. The user uploads data in the MusicXML format, which the Sinsy website reads to output a WAV file of the generated voice. Gender factor, vibrato intensity, and pitch shift can be adjusted prior to output.[2]
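As an illustration of the kind of input described above, the following is a minimal sketch of a MusicXML score with lyrics, built with Python's standard library. The helper name `build_score`, the fixed 4/4 meter, and the quarter-note durations are assumptions made for this example; whether Sinsy accepts this exact file has not been verified, and a complete MusicXML file would normally also carry a DOCTYPE declaration and tempo information.

```python
import xml.etree.ElementTree as ET


def build_score(notes):
    """Build a one-part MusicXML score (hypothetical helper).

    `notes` is a list of (step, octave, syllable) triples. Every note is
    written as a quarter note, four to a measure; a final measure with
    fewer than four notes is left underfull.
    """
    score = ET.Element("score-partwise", version="3.1")

    # The part list declares the single vocal part.
    part_list = ET.SubElement(score, "part-list")
    score_part = ET.SubElement(part_list, "score-part", id="P1")
    ET.SubElement(score_part, "part-name").text = "Voice"

    part = ET.SubElement(score, "part", id="P1")
    for index in range(0, len(notes), 4):
        measure = ET.SubElement(part, "measure", number=str(index // 4 + 1))

        if index == 0:
            # Attributes appear once: one division per quarter note, C major, 4/4, treble clef.
            attributes = ET.SubElement(measure, "attributes")
            ET.SubElement(attributes, "divisions").text = "1"
            key = ET.SubElement(attributes, "key")
            ET.SubElement(key, "fifths").text = "0"
            time = ET.SubElement(attributes, "time")
            ET.SubElement(time, "beats").text = "4"
            ET.SubElement(time, "beat-type").text = "4"
            clef = ET.SubElement(attributes, "clef")
            ET.SubElement(clef, "sign").text = "G"
            ET.SubElement(clef, "line").text = "2"

        for step, octave, syllable in notes[index:index + 4]:
            note = ET.SubElement(measure, "note")
            pitch = ET.SubElement(note, "pitch")
            ET.SubElement(pitch, "step").text = step
            ET.SubElement(pitch, "octave").text = str(octave)
            ET.SubElement(note, "duration").text = "1"
            ET.SubElement(note, "type").text = "quarter"
            # Each note carries one sung syllable.
            lyric = ET.SubElement(note, "lyric")
            ET.SubElement(lyric, "syllabic").text = "single"
            ET.SubElement(lyric, "text").text = syllable

    return score


# Example: a five-note phrase, serialized to a string.
example = build_score(
    [("C", 4, "la"), ("D", 4, "la"), ("E", 4, "la"), ("F", 4, "la"), ("G", 4, "la")]
)
xml_text = ET.tostring(example, encoding="unicode")
```

The resulting tree can be saved with `ET.ElementTree(example).write("song.musicxml", encoding="UTF-8", xml_declaration=True)`; the file name here is only a placeholder.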
As of December 25, 2015, the official developers of Sinsy were Keiichi Tokuda (Producer and designer), Keiichiro Oura (Design and Development),[3] Kazuhiro Nakamura (Development and Main Maintainer), and Yoshihiko Nankaku.[4]
Sinsy originally supported only Japanese and English; Mandarin was added later. Despite this, the website currently supports only English and Japanese.[5][6]
In 2016, Sinsy started using deep neural network (DNN) technology, a form of deep learning.[7]
Yoko (謡子), Japanese female vocal. She currently has two vocals for the service, both in Japanese, one being a beta and the other a fully released version.
Xiang-Ling (香鈴), Japanese female vocal. An English vocal was added on December 25, 2015, and Mandarin has also been added to her language capabilities.
Matsuo-P (松尾P), English male vocal.
Namine Ritsu S (波音リツS), Japanese male vocal, currently in beta. Originally produced for UTAU, it was released on December 25, 2013.
ITmedia ニュース - 初音ミクとも簡単に対話できる「MMDAgent」、その詳細を聞いてきた [ITmedia News - We asked for details about "MMDAgent", which makes it easy to converse even with Hatsune Miku]. Retrieved November 23, 2013.
Nakamura, K.; Oura, K.; Nankaku, Y.; Tokuda, K. (May 2014). "HMM-Based singing voice synthesis and its application to Japanese and English". 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 265–269. doi:10.1109/ICASSP.2014.6853599. ISBN 978-1-4799-2893-4. S2CID 384604.