Speech Synthesis¶
Text-to-speech and glottal flow models.
v_sapisynth
¶
V_SAPISYNTH - SAPI speech synthesis (stub).
This wraps Microsoft SAPI which is Windows-specific.
v_sapisynth
¶
Synthesize speech using Microsoft SAPI.
This is a Windows-specific function that interfaces with Microsoft SAPI. For cross-platform text-to-speech, consider using pyttsx3 or gTTS.
Raises:
| Type | Description |
|---|---|
NotImplementedError
|
SAPI is Windows-specific. |
Source code in pyvoicebox/v_sapisynth.py
v_glotros
¶
V_GLOTROS - Rosenberg glottal model.
v_glotros
¶
Rosenberg glottal model.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
d
|
int
|
Derivative order (0, 1, or 2). |
required |
t
|
array_like
|
Time in fractions of a cycle. Default: (0:99)/100. |
None
|
p
|
array_like
|
Parameters: p[0]=closure time, p[1]=+ve/-ve slope ratio. Default: [0.6, 0.5]. |
None
|
Returns:
| Name | Type | Description |
|---|---|---|
u |
ndarray
|
Output waveform (derivative of flow waveform if d>0). |
Source code in pyvoicebox/v_glotros.py
v_glotlf
¶
V_GLOTLF - Liljencrants-Fant glottal model.
v_glotlf
¶
Liljencrants-Fant glottal model.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
d
|
int
|
Derivative order (0, 1, or 2). Default: 0. |
0
|
t
|
array_like
|
Time in fractions of a cycle. Default: (0:99)/100. |
None
|
p
|
array_like
|
Parameters [te, E0/Ee, 1-tp/te]. Default: [0.6, 0.1, 0.2]. |
None
|
Returns:
| Name | Type | Description |
|---|---|---|
u |
ndarray
|
Output waveform. |
q |
dict
|
Structure with glottal model parameters. |