remotelove@lemmy.ca to

AI@lemmy.ml · 9 months ago

[voice recognition] Audio tools for generating datasets?

1

4

[voice recognition] Audio tools for generating datasets?

remotelove@lemmy.ca to

AI@lemmy.ml · 9 months ago

1

This is more of personal project to learn more about how speech recognition (SR) works and how AI training works at a low level. (Functionally, it’s pointless and is just a self-assigned “homework problem”)

To do this, I need to record a bit of audio to to use as training data.

Recording and chopping up .wav files is easy, but it’s time consuming. I am toying with my own teleprompter-like python app that will prompt for a word, record and tag, and save for later. However, is there a good app to automatically create utterances that is already built?

Ideally, unrecognized words in my own SR system would be automatically turned into tagged audio clips to be used for re-training or fine tuning.

I am shortcutting a bit of this work in python with Google SR for my first dataset. Unfortunately, calling external APIs is sidestepping my intent of this project so I’ll move away from that soon.

People that work with AI typically work with lots of data, so I figured here was a good place to ask.

You must log in or register to comment.

Chat

remotelove@lemmy.caOP
link
fedilink
arrow-up
1·
9 months ago
I found this as a start: https://github.com/cmusphinx/pocketsphinx/blob/master/cython/pocketsphinx/segmenter.py

AI@lemmy.ml

artificial_intel@lemmy.ml

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
2 users / week
83 users / month
2.15K users / 6 months
1 local subscriber
4.12K subscribers
422 Posts
1.27K Comments
Modlog

mods: