Getting audio input and processing it?

Michael Curry · Jan 9, 2013

I've been working on a Morse Code translator, I've made it work so that youcan input English and it will translate it to Morse and play the Audio. I now want to add a feature to the program that takes audio input, processes it and then outputs the English.

Are there any specific APIs that I can use to take input from a microphone,is there any way to detect frequencies or amplitudes, as well as find out for how long the wave lasts? If not, are there any implementations of this?Would there be a way to do it?

Dave Angel · Jan 10, 2013

I've been working on a Morse Code translator, I've made it work so that you can input English and it will translate it to Morse and play the Audio. I now want to add a feature to the program that takes audio input, processes it and then outputs the English.

Are there any specific APIs that I can use to take input from a microphone, is there any way to detect frequencies or amplitudes, as well as find out for how long the wave lasts? If not, are there any implementations of this? Would there be a way to do it?

If you find an approach that gets audio information from your hardware,
and on your operating system, then you presumably end up with a stream
of bytes (or of some other data type, if the sample size is bigger than
8 bits). If this were my project, I'd ignore the detecting of frequency
(though it can be done, by various filtering techniques), and just
measure energy. Presumably the signal is quiet for space, and noisy for
mark. So you set a threshold, and measure how long the signal is below
the threshold, and then how long it's above. The time above the
threshold is how long the "wave lasts." Presumably you can then set a
few length thresholds, and short values are dit and longer ones are dah.

So, how do you measure energy on a byte string? The way I've seen audio
encoded in 8 bits has 128 as the DC zero reference point. Values above
and below that represent a nonzero signal voltage. So you'd do
something like:
abs( value-128)

to get an instantaneous energy. Do a sanity check to make sure the
"quiet" signal gives you a very small energy (some noise expected, of
course).

There are lots of refinements possible. But I suspect someone else is
going to pop in and say that libary xyz on pypi will do it all for you
automatically. There's probably someone who's done the whole project
before. But what's the fun in that?

Processing user input as it's entered	0	Mar 26, 2013
Beginner's Guide to getting CipherSweet working with PDO and MYSQL	1	Dec 1, 2022
Python Multi-Channel Audio	0	Jul 8, 2010
Some advice on getting into coding?	5	Jun 14, 2021
Audio Processing in Ruby	3	Mar 4, 2008
Java audio recording	0	Jun 4, 2008
how to mix audio files	2	Mar 3, 2010
Help with Optimization of Python software: real-time audio controller	5	Feb 11, 2007

Getting audio input and processing it?

Michael Curry

Dave Angel

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads