Close httplib2 connections.
generateVoiceCloningKey(body=None, x__xgafv=None)
Generates voice clone key given a short voice prompt. This method validates the voice prompts with a series of checks against the voice talent statement to verify the voice clone is safe to generate.
list(languageCode=None, x__xgafv=None)
Returns a list of Voice supported for synthesis.
close()
Close httplib2 connections.
generateVoiceCloningKey(body=None, x__xgafv=None)
Generates voice clone key given a short voice prompt. This method validates the voice prompts with a series of checks against the voice talent statement to verify the voice clone is safe to generate. Args: body: object, The request body. The object takes the form of: { # Request message for the `GenerateVoiceCloningKey` method. "consentScript": "A String", # Required. The script used for the voice talent statement. The script will be provided to the caller through other channels. It must be returned unchanged in this field. "languageCode": "A String", # Required. The language of the supplied audio as a [BCP-47](https://www.rfc-editor.org/rfc/bcp/bcp47.txt) language tag. Example: "en-US". See [Language Support](https://cloud.google.com/speech-to-text/docs/languages) for a list of the currently supported language codes. "referenceAudio": { # Holds audio content and config. # Required. The training audio used to create voice clone. This is currently limited to LINEAR16 PCM WAV files mono audio with 24khz sample rate. This needs to be specified in [InputAudio.audio_config], other values will be explicitly rejected. "audioConfig": { # Description of inputted audio data. # Required. Provides information that specifies how to process content. "audioEncoding": "A String", # Required. The format of the audio byte stream. "sampleRateHertz": 42, # Required. The sample rate (in hertz) for this audio. }, "content": "A String", # Required. The audio data bytes encoded as specified in `InputAudioConfig`. Note: as with all bytes fields, proto buffers use a pure binary representation, whereas JSON representations use base64. Audio samples should be between 5-25 seconds in length. }, "voiceTalentConsent": { # Holds audio content and config. # Required. The voice talent audio used to verify consent to voice clone. "audioConfig": { # Description of inputted audio data. # Required. Provides information that specifies how to process content. "audioEncoding": "A String", # Required. The format of the audio byte stream. "sampleRateHertz": 42, # Required. The sample rate (in hertz) for this audio. }, "content": "A String", # Required. The audio data bytes encoded as specified in `InputAudioConfig`. Note: as with all bytes fields, proto buffers use a pure binary representation, whereas JSON representations use base64. Audio samples should be between 5-25 seconds in length. }, } x__xgafv: string, V1 error format. Allowed values 1 - v1 error format 2 - v2 error format Returns: An object of the form: { # Response message for the `GenerateVoiceCloningKey` method. "voiceCloningKey": "A String", # The voice clone key. Use it in the SynthesizeSpeechRequest by setting [voice.voice_clone.voice_cloning_key]. }
list(languageCode=None, x__xgafv=None)
Returns a list of Voice supported for synthesis. Args: languageCode: string, Optional. Recommended. [BCP-47](https://www.rfc-editor.org/rfc/bcp/bcp47.txt) language tag. If not specified, the API will return all supported voices. If specified, the ListVoices call will only return voices that can be used to synthesize this language_code. For example, if you specify `"en-NZ"`, all `"en-NZ"` voices will be returned. If you specify `"no"`, both `"no-\*"` (Norwegian) and `"nb-\*"` (Norwegian Bokmal) voices will be returned. x__xgafv: string, V1 error format. Allowed values 1 - v1 error format 2 - v2 error format Returns: An object of the form: { # The message returned to the client by the `ListVoices` method. "voices": [ # The list of voices. { # Description of a voice supported by the TTS service. "languageCodes": [ # The languages that this voice supports, expressed as [BCP-47](https://www.rfc-editor.org/rfc/bcp/bcp47.txt) language tags (e.g. "en-US", "es-419", "cmn-tw"). "A String", ], "name": "A String", # The name of this voice. Each distinct voice has a unique name. "naturalSampleRateHertz": 42, # The natural sample rate (in hertz) for this voice. "ssmlGender": "A String", # The gender of this voice. }, ], }