Sesame CSM 1B

Generate from CSM 1B (Conversational Speech Model). Code is available on GitHub: SesameAILabs/csm. Checkpoint is hosted on HuggingFace.

Voices

Select a predefined speaker

Select a predefined speaker

Each line is an utterance in the conversation to generate. Speakers alternate between A and B, starting with speaker A.

Choose input method

Direct text input Upload ebook file

Conversation

GPU time limited to 3 minutes, for longer usage duplicate the space.

Synthesized audio