Changelog
1.3.1 (unreleased)
- Add experimental support for alignments (see docs/ALIGNMENTS.md)
- Raw phonemes no longer split sentences
1.3.0
- Moved development to OHF-Voice org
- Removed C++ code for now to focus on Python development
- A C API
libpiper
written in C++ is planned
- Embed espeak-ng directly instead of using separate
piper-phonemize
library
- Change license to GPLv3
- Use Python stable ABI (3.9+) so only a single wheel per platform is needed
- Change Python API:
PiperVoice.synthesize
takes a SynthesisConfig
and generates AudioChunk
objects
PiperVoice.synthesize_raw
is removed
- Add separate
piper.download_voices
utility for downloading voices from HuggingFace
- Allow text as CLI argument:
piper ... -- "Text to speak"
- Allow text from one or more files with
--input-file <FILE>
- Excluding any file output arguments will play audio directly with
ffplay
- Support for raw phonemes in text with
[[ <phonemes> ]]
- Adjust output volume with
--volume <MULTIPLIER>
(default is 1.0)