https://nanonets.com/blog/chat-with-pdfs-using-chatgpt-and-openai-gpt-api/
domingo, 15 de octubre de 2023
lunes, 9 de octubre de 2023
Google tts audio generation
https://cloud.google.com/text-to-speech/docs/audio-profiles
To generate an audio file, make a POST
request and provide the appropriate request body. The following shows an example of a POST
request using curl
. The example uses the access token for a service account set up for the project using the Google Cloud Platform Cloud SDK. For instructions on installing the Cloud SDK, setting up a project with a service account, and obtaining an access token, see the Quickstarts.
The following example shows how to send a request to the text:synthesize
endpoint.
curl \
-H "Authorization: Bearer "$(gcloud auth print-access-token) \
-H "Content-Type: application/json; charset=utf-8" \
--data "{
'input':{
'text':'This is a sentence that helps test how audio profiles can change the way Cloud Text-to-Speech sounds.'
},
'voice':{
'languageCode':'en-us',
},
'audioConfig':{
'audioEncoding':'LINEAR16',
'effectsProfileId': ['telephony-class-application']
}
}" "https://texttospeech.googleapis.com/v1beta1/text:synthesize" > audio-profile.txt
If the request is successful, the Text-to-Speech API returns the synthesized audio as base64-encoded data contained in the JSON output. The JSON output in the audio-profiles.txt
file looks like the following:
{
"audioContent": "//NExAASCCIIAAhEAGAAEMW4kAYPnwwIKw/BBTpwTvB+IAxIfghUfW.."
}
To decode the results from the Cloud Text-to-Speech API as an MP3 audio file, run the following command from the same directory as the audio-profiles.txt
file.
sed 's|audioContent| |' < audio-profile.txt > tmp-output.txt && \ tr -d '\n ":{}' < tmp-output.txt > tmp-output-2.txt && \ base64 tmp-output-2.txt --decode > audio-profile.wav && \ rm tmp-output*.txt