Open-source speech recognition and text-to-speech potentially usable with the Poppy robots

gfabre · May 11, 2016, 7:49am

Hi there,
Here a feedback from IntRoLab (university of Sherbrooke) : Interaction homme-robot par la voix : on se comprend mon ami ! http://cursus.edu/dossiers-articles/articles/27285/interaction-homme-robot-par-voix-comprend/

They have used : (open source)

Many Ears https://sourceforge.net/projects/manyears/ : ManyEars implements real-time microphone array processing to perform sound source localisation, tracking and separation. It was designed for mobile robot audition in dynamic environments.
Palaver : Open speech recognition for Linux https://github.com/JamezQ/Palaver
Disco : Collaborative Discourse Manager https://github.com/charlesrich/Disco

and non open source :
Google speech API

More information about open source component used can be found in https://introlab.3it.usherbrooke.ca/mediawiki-introlab/index.php/ManyEars

gfabre · July 7, 2016, 2:04pm

Hi there,

meSpeak.js is a Text-To-Speech solution on the Web ."speak.js is 100% clientside JavaScript. “speak.js” is a port of eSpeak, an open source speech synthesizer, which was compiled from C++ to JavaScript using Emscripten."
The project is under GPL. (thanks to Johann, who have quoted about meSpeak.js on a framapad, Johann makes nice open source SVG tests for children, including programming http://jlodb.poufpoufproduction.fr/tibibo.html?id=prog )

French is available. It’s look like “robot voice”, but as it’s used for robot, it’s maybe not a big problem.

Do you think it’s could be interesting for Poppy robots ?

And easily usable with ardiuno+wifi or nodemcu ?

gfabre · July 7, 2016, 4:23pm

Article published today about MyCroft, by Ubuntu team https://insights.ubuntu.com/2016/07/07/mycroft-the-open-source-answer-to-natural-language-platforms/

Thot · July 8, 2016, 10:29am

Very interesting the Mycroft project !!
Here is the community
Here is the code !!

Navigating in the forum, I also saw the very sexy AI samurai
But it seems to be NOT open source.

gfabre · November 20, 2016, 2:24pm

Hello,

Top 5 Open Source Speech Recognition Toolkits : http://blog.neospeech.com/2016/07/08/top-5-open-source-speech-recognition-toolkits/ (thanks to guildem to share it in http://linuxfr.org/nodes/110556/comments/1682139 )

gfabre · November 20, 2016, 2:41pm

And here the solutions mentioned in the wiki for kalliope (https://www.youtube.com/watch?v=t4J42yO2rkM amazing - : http://linuxfr.org/news/kalliope-votre-assistant-personnel-vocal )

TTS : https://github.com/kalliope-project/kalliope/blob/master/Docs/tts.md
STT : https://github.com/kalliope-project/kalliope/blob/master/Docs/stt.md
(dI didn’t checked it which are open source and not)

gfabre · November 24, 2016, 5:49pm

Hello,

On Kalliope, votre assistant personnel vocal - LinuxFr.org, Sylvain Chevalier talk about http://kaldi-asr.org/ :

Depuis déjà plusieurs années, le “standard” pour la reconnaissance vocale libre c’est kaldi,
en particulier grâce à ses modules pour l’apprentissage profond (Deep
Learning en anglais, d’ailleurs en passant je trouve le concept de
“neuron” pas bien choisi pour un projet dans ce domaine, où les “neural
nets” sont partout). La plupart des systèmes commerciaux l’utilisent.

Someone has already tried it for speech recognition ?

kookic · December 11, 2016, 6:16pm

Bonsoir,
Je viens de m’inscrire ici.
A ce sujet, j’utilise la reconnaissance vocale avec Snowboy (sur Pcduino et odroid x4) c’est du python,
https://snowboy.kitt.ai/
Ca marche plutôt bien.
Amusez-vous bien…
kookic

gfabre · December 11, 2016, 6:41pm

ça n’a pas l’air open source, non ? on cherche à mutualiser sur des solutions open source…

kookic · December 11, 2016, 7:18pm

exact, mais pour moi c’est “version free”

gfabre · January 25, 2018, 9:29pm

Mycroft : nouveau financement collaboratif, avec la part belle à l’open source pour les technos, où les solutions retenues sont mis en avant dans la campagne. En anglais. Une version open source gérant le français sera-t-elle portée par la communauté en étant catalysé par ce nouveau modèle de Mycroft ? à suivre… https://www.kickstarter.com/projects/aiforeveryone/mycroft-mark-ii-the-open-voice-assistant

gfabre · October 14, 2018, 9:10am

Sur primtux, Stéphane parle de gspeech pour du TTS : https://forum.primtux.fr/viewtopic.php?pid=14553#p14553 : https://github.com/lusum/gSpeech

Philippe nous indique “AccessDV Linux, une distribution destinée aux déficients auditifs, qui intègre de nombreux outils intéressants, notamment leur machine à lire, un ensemble de scripts bash permettant la lecture automatique depuis de nombreuses sources.”

gfabre · May 8, 2019, 3:57pm

De nouveaux liens :

Topic		Replies	Views
Starting with the Poppy Robot Technology	9	2697	April 20, 2016
I have a vague notion of a project in mind, [English] Community projects	0	502	January 4, 2020
The Poppy Show? Technology	15	3757	October 7, 2014
Faire parler Poppy avec Snap! / Make Poppy talk with Snap! Technology documentation , français , snap , software	3	1712	June 22, 2016
Projet Cherry, Audio system for Poppy Technology hardware	2	1657	March 9, 2015

Open-source speech recognition and text-to-speech potentially usable with the Poppy robots

Related Topics