ComputersSoftware

Synthesizers of speech with Russian voices. The best speech synthesizer. How to use a speech synthesizer?

Today speech synthesizers, used in stationary computer systems or mobile devices, do not seem unusual any more. Technologies have stepped far ahead and allowed to reproduce the human voice. How it works, where it is applied, what is the best speech synthesizer and what potential problems a user may encounter, see below.

What are speech synthesizers and where are they used?

Speech synthesizers are special programs consisting of several modules, which allow you to translate typed on the keyboard text into ordinary human speech in the form of soundtrack.

It would be naive to assume that the accompanying libraries contain absolutely all the words or possible phrases recorded in the studios by real people. It's just physically impossible. In addition, the libraries of phrases would be so large that it would simply not be possible to install them even on modern high-capacity hard drives, not to mention mobile devices.

For this, a technology was developed, called Text-to-Speech.

The most widely used speech synthesizers are in several areas, which can be attributed to the independent study of foreign languages (programs often have support in 50 languages or more), the code needs to hear the correct pronunciation of the word, listening to the texts of books instead of reading, creating speech and vocal parts in music , Their use by people with disabilities, the issuance of search queries in the form of voiced words and phrases, etc.

Variety of programs

Depending on the field of application, all programs can be divided into two main types: standard, directly convert text to speech, and speech or vocal modules used in music applications.

For a more complete understanding of the picture, let's look at both classes, but more emphasis will be placed on the synthesizers of speech in their immediate use.

Pros and cons of the simplest speech applications

As for the advantages and disadvantages of programs of this type, first consider all the same disadvantages.

First of all, it is necessary to clearly understand that the computer - it is a computer, which at this stage of development human speech can synthesize very approximately. In the simplest programs, there are often problems with word stress, reduced sound quality, and in mobile devices - increased power consumption, and sometimes unauthorized loading of speech modules.

But there are also many advantages, because a lot of audio information is perceived much better than the visual one. Convenience is obvious.

How to use a speech synthesizer?

Now a few words about the basic principles of using programs of this type. You can install any type of speech synthesizer without any problems. In stationary systems, a standard installer is used, where the main task will be the choice of supported language modules. For mobile devices, the installation file can be downloaded from an official store or repository such as Google Play or the AppStore, after which the application is installed automatically.

As a rule, when you first start, you do not need to make any settings other than setting the default language. True, sometimes the program can offer to choose the sound quality (in the standard version, applied everywhere, the sampling frequency is 4410 Hz, the depth is 16 bits and the bitrate is 128 kbps). In mobile devices, these figures are lower. Nevertheless, a certain voice is taken as the basis. Using a standard pronunciation pattern by applying filters and equalizers achieves the sound of just such a timbre.

In use, you can choose several options for translating text: entering text manually, scoring already text from the file, integrating into other applications (for example, web browsers) with activation of the search results output or reading of text content on online pages. It is enough to choose the necessary variant of actions, language and voice, with which all this will be pronounced. Many programs have several varieties of voices: both male and female. To activate the playback process, the start button is usually used.

If we talk about how to turn off the speech synthesizer, there may be several options. In the simplest case, the stop button is used in the program itself. In the case of integration into the browser, deactivation is performed in the extension settings or by the complete removal of the plug-in. But with mobile devices, despite a direct shutdown, there may be problems that will be discussed separately.

In music programs, settings and text input are much more difficult. For example, FL Studio has its own speech module, in which you can select several types of voices, change the tone settings, play speed, etc. To emphasize the stresses before the syllable, use the symbol "_". But such a synthesizer is suitable only for the creation of robotic voices.

But the package Yamaha Vocaloid refers to programs of a professional type. Text-to-Speech technology is realized here in the fullest extent. In the settings, in addition to the standard parameters, you can set articulation, glissando, use libraries with vocals of professional performers, make up words and phrases, adjusting them to notes, and a whole bunch of more. Not surprisingly, the package with only one vocal takes about 4 GB or more in the installation distribution, and after unpacking it is twice or three times as large.

Speech Synthesizers with Russian Voices: A Brief Overview of the Most Popular

But let's return to the simplest applications and consider the most popular ones.

RHVoice - according to most experts, the best speech synthesizer, which is the Russian development of the authorship of Olga Yakovleva. In the standard version, three voices are available (Alexander, Irina, Elena). The settings are simple. And the application itself can be used as a stand-alone program, compatible with SAPI5, and as a screen module.

Acapela is quite an interesting application, the main feature of which is almost perfect voice acting in more than 30 languages of the world. In the regular version, however, only one voice is available (Alain).

Vocalizer is a powerful application with the female voice of Milena. Very often this program is used in call-centers. There are many settings for setting accent, volume, speed reading and installing additional dictionaries. The main difference is that the speech engine can be integrated into programs like Cool Reader, Moon + Reader Pro or Full Screen Caller ID.

Festival is a powerful speech synthesis and recognition utility for Linux and Mac OS X. The application comes with open source and, in addition to standard language packs, supports even Finnish and Hindi.

ESpeak is a speech application that supports more than 50 languages. The main disadvantage is the preservation of files with synthesized speech exclusively in WAV format, which takes up a lot of space. But the program is cross-platform and can be used even in mobile systems.

Problems with speech synthesizer in Google Android

When installing the "native" speech synthesizer from Google, users constantly complain that it spontaneously includes loading additional language modules, which can not only take a long enough time, but also consumes traffic.

Get rid of this in Android-systems can be very simple. To do this, use the settings menu, then go to the language and voice input section, select the voice search and click on the cross (disconnect) on the speech recognition option offline. Additionally, it is recommended that you clean the application cache and reboot the device. Sometimes it may be necessary to disable notifications in the application itself.

What in the end?

To sum up, we can say that in most cases ordinary users will be approached by the simplest programs. In all ratings, RHVoice is in the lead. But for musicians who want to achieve a natural voice, so that the difference between live vocal and computer synthesis is not felt by ear, it's better to give preference to programs like Vocaloid, especially since they produce many additional voice libraries, and settings have so many possibilities that primitive Applications, as they say, and did not stand side by side.

Similar articles

 

 

 

 

Trending Now

 

 

 

 

Newest

Copyright © 2018 en.unansea.com. Theme powered by WordPress.