Riffusion AI music generator can give Spoken Word with Accents

michaelrbarton · April 1, 2025, 4:52am

I prompted a theatrical, play-style conversation between a cop and a witness who saw a bank robbery. I have also prompted a Southern woman's voice.
I am just wondering if this can be used to create voices for iClone characters.
It sounds more authentic and realistic than text-to-speech, but you might not get what you want. Riffusion is free with unlimited use. Here is a link to the audio.

https://www.riffusion.com/song/1f30804d-547f-4f86-a910-ef2a0b42f2ff

Monologue female 1950 talking about prices

https://www.riffusion.com/song/d212c7c0-8708-409d-96f9-b12ee3e94c22

animagic · April 1, 2025, 2:43pm

If you save the voice as an MP3 or WAV file, you can use any voice with AccuLIPS. Just load the voice file and create a file with the text of the dialog (with the same name as the voice file and the extension .txt) and it should work.

I use ElevenLabs voices that way, but new options come out regularly.

EDIT: Just remembering: I think you use iClone 7. It would work similarly, but the lip-sync is not as good. AccuLIPS is only in iClone 8.
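
A minimal sketch of the file pairing described above, assuming only what the post states: a transcript saved under the same base name as the voice file, with a .txt extension. The function name and the example file name are hypothetical, and the script does not touch iClone or AccuLIPS itself; it only writes the text file next to the audio.

```python
from pathlib import Path


def write_transcript_for(voice_file: str, dialog_text: str) -> Path:
    """Write a .txt transcript next to the voice file, sharing its base name."""
    voice_path = Path(voice_file)

    # The post mentions MP3 and WAV; other formats are rejected here as an assumption.
    if voice_path.suffix.lower() not in {".mp3", ".wav"}:
        raise ValueError(f"expected an MP3 or WAV file, got {voice_path.name}")

    # witness_take1.wav -> witness_take1.txt in the same folder
    transcript_path = voice_path.with_suffix(".txt")
    transcript_path.write_text(dialog_text.strip() + "\n", encoding="utf-8")
    return transcript_path
```

For example, `write_transcript_for("witness_take1.wav", "I saw the whole thing, officer.")` would leave `witness_take1.txt` beside the audio, ready to load together with the voice file.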

mark · April 2, 2025, 1:14pm

Looks kinda cool. I can’t seem to get just “spoken/words” without the music. Any ideas?

AutoDidact · April 2, 2025, 1:31pm

Although I was not impressed with the quality of the voices on the site that the OP linked, here is a site that uses AI to separate vocals from their music tracks.

Caveat: you have to have a paid account to actually download the separated tracks, but there is a workaround.

They let you “preview” your separated results, so just use any free screen recording software, like OBS Studio, to capture a video (with audio) of the separated voice tracks playing in preview, then save out that audio with your video editor for use in iClone or CTA.

That is how I made this old couple sing a duet in CTA from an AI song I generated on UDIO.com.
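
The "save out that audio" step can also be scripted instead of done in a video editor. The sketch below is one possible approach and not what the poster used: it assumes the free ffmpeg command-line tool is installed and on the PATH, and the file names are hypothetical. It drops the video stream from a screen recording and writes the audio as a 16-bit, 44.1 kHz WAV.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional


def build_extract_command(video_file: str, wav_file: str) -> List[str]:
    """Build an ffmpeg command that keeps only the audio of a recording."""
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", video_file,        # input: the screen recording
        "-vn",                   # drop the video stream
        "-acodec", "pcm_s16le",  # uncompressed 16-bit PCM (standard WAV)
        "-ar", "44100",          # 44.1 kHz sample rate
        wav_file,
    ]


def extract_audio(video_file: str, wav_file: Optional[str] = None) -> Path:
    """Run ffmpeg on the recording; defaults to the same name with .wav."""
    source = Path(video_file)
    target = Path(wav_file) if wav_file else source.with_suffix(".wav")
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    subprocess.run(build_extract_command(str(source), str(target)), check=True)
    return target
```

Calling `extract_audio("preview_capture.mkv")` would write `preview_capture.wav` next to the recording, which can then be loaded as a voice file.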

mark · April 2, 2025, 4:08pm

That’s cool! Thanks!

planetstardragon · April 2, 2025, 5:40pm

An additional note on separating vocals: the Virtual DJ software, which has a free version, has a technology that separates tracks, so you can literally create stems from any song.
It's fun remixing classics that way. It separates drums, vocals, bass and keyboards as individual outputs, although you'd have to record it in real time because it's a DJ plug-in for performance and not directly a plug-in for breaking down tracks.

The basic version is free, but if you want to attach a physical controller to it, you have to buy the drivers per attachment, and if you want to remove any visual watermarks on the stream, there is an additional fee… but the extras are not necessary for basic functions.

AutoDidact · April 2, 2025, 9:41pm

Downloaded and installed Virtual DJ.
It's complex and deep, but I figured out how to separate the voice from the music pretty fast.
A bit disappointing that it has no “export to .wav” function, but I can just capture my separated tracks' audio with OBS.

Thanks

planetstardragon · April 2, 2025, 10:05pm

It does, but it's a live performance plug-in, so you have to record the mix as if you were playing a song live. It's very powerful though, and does an amazing job at sifting through the tracks. It would be easiest to record the live stream with third-party software; I use an older version of Audacity for that function myself. (Side note: don't use the latest version of Audacity. Version 2.4.2, the last one before it was sold to a new company, is the best; the new company adds some spy / tracking features for capitalist reasons!)

gordryd · April 2, 2025, 10:09pm

Audacity can break music into 4 stems (Bass, Drum, Vocal, Everything Else) using the FREE Intel OpenVINO Music Separation plugin. It did a good job on drums, but in my experience the vocal separation was not the best.

michaelrbarton · April 3, 2025, 11:16pm

Here is another monologue, unaccompanied female speaker. She is talking in front of an audience about the time she caught a foul ball in 1964, at a baseball game. No music. I think I am getting better at prompting for no music and only words. God Bless!

https://www.riffusion.com/song/35c4678f-afae-43b8-9304-fb70f2ed52a7

AutoDidact · April 4, 2025, 3:26pm

Original song created by suno.ai.
Vocal track for lip-sync animation separated by Virtual DJ.

michaelrbarton · April 9, 2025, 4:02pm

Here is some spoken word from the Riffusion AI music generator. I used a couple of lines from Riffusion to make the woman talk in the video. You can create spoken word with Riffusion. It is hit or miss, but it can work with any AI video or 3D-rendered video if you need some audio for your creation. Riffusion is in beta at the moment.

God Bless!

https://www.reddit.com/r/KlingAI_Videos/comments/1jv8vpl/woman_on_the_beach_i_used_riffusion_ai_music/

https://www.riffusion.com/song/e712ff6e-0f12-4240-903b-0f0b7283aff2

planetstardragon · April 9, 2025, 11:06pm

-chef's kiss- really well done!

amakaetokwu · May 2, 2025, 7:05pm

I agree!

amakaetokwu · May 6, 2025, 5:48am

Interesting! If you’re aiming for more believable and emotionally rich voice acting in your iClone projects, you might want to try AudioModify.

Unlike basic text-to-speech or even Riffusion’s style-based generation, AudioModify lets you shape delivery with detailed controls: pitch, pacing, emotion, and even regional accents like a Southern drawl. I’ve used it for scene work (witness/police-type interactions too), and it handles dialogue-driven storytelling really well.

The voices aren’t just realistic; they can be directed, which is huge for animation and game cutscenes. Worth a test run if you’re building dynamic character-driven content.


michaelrbarton · May 18, 2025, 10:52pm

Only if one of them gets stewed.

michaelrbarton · May 18, 2025, 11:01pm

From CC4 to Forge Flux to Hailuo AI.
Woman washing her hands at the sink: a Character Creator 4 input image run through Forge Flux, with Hailuo creating the animation of the faucet water and the hand washing. (Posted on r/HailuoAiOfficial.)