Implement speech-to-text using arduino uno(or any microprocessor) and send this to laptop wirelessly

tooambitious · Aug 17, 2023

Hi
I am currently working on a project that requires me to convert user given speech input to text in real-time using an arduino(either locally or send the voice wirelessly to my laptop for recognition in my laptop). Is there a way I could implement this only by using an arduino uno and a microphone module as we have a tight budget. Any expensive solutions will also be appreciated(for e.g. using raspberry pie). Would be thankful if someone helps me as this is my first project of this kind. Thank you in advance

BobTPH · Aug 17, 2023

tooambitious said:
Is there a way I could implement this only by using an arduino uno and a microphone module as we have a tight budget

No. Text to speech requires way more compute power than an Arduino.

tooambitious · Aug 17, 2023

BobTPH said:
No. Text to speech requires way more compute power than an Arduino.

Got it, but first thing I want to actually convert Speech-To-Text and not Text-To-Speech. Secondly, is it possible to send the speech data to my laptop and use my laptop's computational power(or any AI speech recognition tool api) rather than do it locally within the Arduino to accomplish that? And Thank you for your time!

BobTPH · Aug 17, 2023

Sorry, I knew that, just wrote it backwards, so the same answer.

Ya’akov · Aug 18, 2023

Welcome to AAC.

Tensorflow Lite and a supported board might offer a solution, but it really depends on the capabilities you expect. If you want something like dictation, you will just have to use a much larger computer. If you are interested in particular voice commands, something like MicroSpeech would work.

Full blown TensorFlow on a Raspberry Pi 4 can do pretty well, like spchcat.

As far as sending the audio to a computer, you can just use any streaming solution.

tooambitious · Aug 18, 2023

Ya’akov said:
Welcome to AAC.

Tensorflow Lite and a supported board might offer a solution, but it really depends on the capabilities you expect. If you want something like dictation, you will just have to use a much larger computer. If you are interested in particular voice commands, something like MicroSpeech would work.

Full blown TensorFlow on a Raspberry Pi 4 can do pretty well, like spchcat.

As far as sending the audio to a computer, you can just use any streaming solution.

Thank you for the reply. I found some solutions for implementing using raspberry pie 4. But I wonder if somehow I can stream the voice over to my laptop using some extremely cheap microcontroller like Arduino Uno or Raspberry pie Pico and process everything inside my laptop and then send the text over to the microprocessor/controller. Your solution involves processing the voice inside the raspberry pie then streaming to my computer. Is my method even possible? Thanks in advance

tooambitious · Aug 18, 2023

Ya’akov said:
Welcome to AAC.

Tensorflow Lite and a supported board might offer a solution, but it really depends on the capabilities you expect. If you want something like dictation, you will just have to use a much larger computer. If you are interested in particular voice commands, something like MicroSpeech would work.

Full blown TensorFlow on a Raspberry Pi 4 can do pretty well, like spchcat.

As far as sending the audio to a computer, you can just use any streaming solution.

Basically I am currently working for a university project and trying to build something like the emo robot.

My take is that I will stream the Speech recognition of the user to an customized AI language model API for example ChatGPT. Then the response received will be read by the robot using a custom trained text-to- voice AI(All this processing will be done on my laptop and the communicated to the robot). For simulating eyes/facial expressions I thought of using an i2c LCD/OLED display. The thing is I completely new to this stuff so thought of seeking help online. Feel free to correct me if my ideas are too ambitious or silly to implement and that should I drop it. Thank you

tooambitious · Aug 18, 2023

tooambitious · Aug 18, 2023

I

Ya’akov said:
Welcome to AAC.

Tensorflow Lite and a supported board might offer a solution, but it really depends on the capabilities you expect. If you want something like dictation, you will just have to use a much larger computer. If you are interested in particular voice commands, something like MicroSpeech would work.

Full blown TensorFlow on a Raspberry Pi 4 can do pretty well, like spchcat.

As far as sending the audio to a computer, you can just use any streaming solution.

I liked this idea though. Using Raspberry pie my 90% problems will be solved but the problem is raspberry pie and it's mockups are very expensive here in India and it is not worth to spend this much for a few marks. Hence I am looking for a solution that includes a cheap microcontroller for just streaming and receiving data from my computer while all processing will be done inside my laptop

djsfantasi · Aug 18, 2023

I don’t think you need a microprocessor at all! Get a Bluetooth Microphone and send the audio directly to the laptop via Bluetooth.

tooambitious · Aug 18, 2023

O

djsfantasi said:
I don’t think you need a microprocessor at all! Get a Bluetooth Microphone and send the audio directly to the laptop via Bluetooth.

Ofcourse that was my first idea but my professor may not be satisfied with that, atleast if I do some additional steps he could be convinced. Also we would still raspberry pie as recording audio is not the only thing the robot will do .Thanks for the reply!

Ya’akov · Aug 18, 2023

If you use an RPi running Linux all kinds of things open up. As I mentioned above, you can use something like DarkIce to convert the microphone audio into a useful format, then stream it using something like Icecast.

This can be received and used on the PC as a normal audio stream. Alternatively, you could simply encode the audio as a bitstream of ~5kHz bandwidth and attach that to a UDP port, with a corresponding listener on the PC side.

kkiisshh · Aug 24, 2024

Ya’akov said:
If you use an RPi running Linux all kinds of things open up. As I mentioned above, you can use something like DarkIce to convert the microphone audio into a useful format, then stream it using something like Icecast.

This can be received and used on the PC as a normal audio stream. Alternatively, you could simply encode the audio as a bitstream of ~5kHz bandwidth and attach that to a UDP port, with a corresponding listener on the PC side.

Hi! I tried using a esp8266 and a microphone module to send the raw voltage data to my laptop at 8khz, using udp. It abruptly stops...
I thought I could take the voltage data and make an audio wave out of that on my pc itself.
Also after further research, i don't think an esp8266 could do that standalone. As of now, im trying to find some projects which use rstp on esp to directly stream audio after processing, but still esp isn't capable of this maybe.

Thread starter	Similar threads	Forum	Replies	Date
K	How to Implement Multiple Burden Resistors for a CT Leakage Current Sensor?	Analog & Mixed-Signal Design	11	Jul 22, 2025
	How to implement the combinational logic block of FSM using Combinational Analysis?	Digital Design	8	Jul 16, 2025
J	Which BMS ICs can I use to implement this?	Power Electronics	2	Mar 15, 2025
	I have a problem to implement a common source Cascode gain stage in the multistage amplifier design	Homework Help	36	Jul 5, 2024
F	Problem while trying to implement an stm32f103c8 UART driver	Microcontrollers	5	May 29, 2024

Implement speech-to-text using arduino uno(or any microprocessor) and send this to laptop wirelessly

Join our Engineering Community! Sign-in with:

Implement speech-to-text using arduino uno(or any microprocessor) and send this to laptop wirelessly

tooambitious

BobTPH

tooambitious

BobTPH

Ya’akov

tooambitious

tooambitious

tooambitious

tooambitious

djsfantasi

tooambitious

Ya’akov

kkiisshh

You May Also Like

ST’s New High-Precision Op Amp Takes Aim at the 4 V to 36 V Range

Diodes Inc. Releases Multi-Phase SPI Boost Controller for Auto Lighting

Mesh AI: Node-Level Intelligence with Non-Cellular 5G/6G Connectivity

EW ‘26 Exclusive—Nuvoton Talks MCUs: Low-Power, Edge AI, Automotive, and More