Using KA9Q-Radio - Example with the RTL-SDR dongle


Preliminary

Important:

This document represents an effort on my part to understand the operation of "ka9q-radio" and is not intended to be authoritative.

As such, this is a work in progress and will certainly contain many "blank spots" and errors. What it is intended to do is to help the new user along and start to get the "feel" of how the pieces go together.

Please read EVERY document in the /docs directory of the "ka9q-radio" git - and refer back when you see something you don't understand!

For more information about ka9q-radio, go here:

Using KA9Q-Radio - link

This page has much more information about the internal workings of ka9q-radio and other examples of its use.

What is "KA9Q-Radio" - and why is it different from other SDR programs/suites?

One of the advantages of SDRs is the capability of receiving multiple signals at the same time - but this is typically exploited only in a limited fashion. The limit of this capability is a combination of both the bandwidth of the acquisition device (e.g. how much spectrum the device is inhaling) and the processing capability of the host computer. Usually it's the latter point that has limited the usefulness/capability of many wide-bandwidth SDRs: It is typical for each "instance" of a receiver used by a user to have to process data from the high-bandwidth acquisition stream - which may be several megasamples per second. Because each per-user instance requires so much processing, this can make a multi-user receiver system "un-scalable" - that is, each additional user requires a significant amount of CPU processing.

In 2006, an article was published ¹ that described what might be considered to be a mathematical "shortcut" when it comes to processing large amounts of data. Without going into detail, the "traditional" method for producing a single virtual receiver is to crunch the full-bandwidth data to yield - at least in an amateur radio application - only a narrow bandwidth: perhaps a few kHz for an SSB or FM signal or even a few 10s of kHz for a waterfall. If multiple receivers are required, it's necessary to "re-crunch" the large amount of raw input data for each individual receiver, even though that mathematical operation is expensive in CPU time and nearly identical for every receiver. A far more efficient method - potentially one that is many hundreds of times more efficient, depending on how much "economy of scale" is achieved - is to do the "expensive" number crunching just once and then use that already-processed data to synthesize each individual receiver. It is this method, generally referred to as "Overlap and Save", that is used by KA9Q-Radio.

As an example of the "former" method: If the "csdr" ² utility is used on, say, an RTL-SDR with 2 MHz of bandwidth, a Raspberry Pi4 is capable of handling only 4-8 simultaneous receivers before all available CPU cycles are used: This is one of the reasons why the open-source "OpenWebRX" server isn't particularly scalable to a large number (e.g. dozens) of users. Conversely, the PA3FWM WebSDR server (which is closed source) likely (unconfirmed!) uses the same techniques as KA9Q-Radio - which are noted in Footnote #1 - to allow hundreds of users on the same hardware platform as an OpenWebRX server that may only be able to handle a half-dozen or so.

Using the aforementioned "Overlap and Save" method in reference #1, a Raspberry Pi4 running ka9q-radio can simultaneously decode every FM channel in the same 2 MHz bandwidth with plenty of processing power to spare!
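To make the idea concrete, here is a minimal, pure-Python sketch of overlap-and-save filtering in which the forward transform of each input block is computed once and then reused by two different filters. This is not ka9q-radio's code: The function names, the tiny naive DFT (standing in for FFTW3), the filter taps and the test signal are all assumptions made for illustration, and the frequency shifting and sample-rate reduction that a real channelizer performs are left out.

```python
import cmath

def dft(values, inverse=False):
    """Naive O(N^2) DFT; a stand-in for a real FFT library."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(v * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                  for t, v in enumerate(values))
        out.append(acc / n if inverse else acc)
    return out

def forward_blocks(signal, taps_len, fft_size):
    """The 'expensive' step, done once: transform overlapping input blocks."""
    step = fft_size - (taps_len - 1)      # new samples consumed per block
    history = [0.0] * (taps_len - 1)      # samples carried over from the previous block
    spectra = []
    for start in range(0, len(signal), step):
        new = list(signal[start:start + step])
        new += [0.0] * (step - len(new))  # zero-pad the final block
        block = history + new
        spectra.append(dft(block))
        history = block[step:]            # keep the last taps_len - 1 samples
    return spectra

def channel_output(spectra, taps, fft_size, length):
    """The 'cheap' per-receiver step: apply one filter to the shared spectra."""
    response = dft(list(taps) + [0.0] * (fft_size - len(taps)))
    out = []
    for spectrum in spectra:
        block = dft([a * b for a, b in zip(spectrum, response)], inverse=True)
        out.extend(v.real for v in block[len(taps) - 1:])  # drop wrapped-around samples
    return out[:length]

def direct_fir(signal, taps):
    """Reference: plain time-domain convolution."""
    return [sum(h * signal[n - k] for k, h in enumerate(taps) if n - k >= 0)
            for n in range(len(signal))]

signal = [float((i * 7) % 5 - 2) for i in range(40)]
lowpass = [0.25, 0.5, 0.25]
highpass = [-0.25, 0.5, -0.25]

spectra = forward_blocks(signal, taps_len=3, fft_size=16)  # computed once
low = channel_output(spectra, lowpass, 16, len(signal))    # "receiver" 1
high = channel_output(spectra, highpass, 16, len(signal))  # "receiver" 2
```

Both outputs match what a direct convolution of the same signal would give, but the input blocks were transformed only once: Adding a third or a hundredth "receiver" repeats only the per-channel step.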

KA9Q-radio is open-source and it may be found here: https://github.com/ka9q/ka9q-radio/ - and the instructions for compiling it along with a list of dependencies may be found here: https://github.com/ka9q/ka9q-radio/blob/main/INSTALL.txt

IMPORTANT - READ THIS BEFORE PROCEEDING

Additional installation instructions:

Also, be sure to read this file: https://github.com/ka9q/ka9q-radio/blob/main/docs/notes.md as it contains information about configuring multicast and the local DNS needed to resolve the hostnames.

After installing and building ka9q-radio, run the following commands (sudo may be required):

mkdir /var/lib/ka9q-radio - Note: This may fail if the directory already exists.
chown <username> /var/lib/ka9q-radio - Substitute the user name under which you are running "ka9q-radio".

It may be worth verifying that /var/lib/ka9q-radio/wisdom is "owned" by the user running "ka9q-radio"

Also make sure that this directory - and the wisdom file - belong to the same group under which you are running "ka9q-radio" using "chgrp". If, when starting "radiod" you see an error related to the wisdom file it probably has to do with access to it.

FFT "wisdom" file:

Once you have installed ka9q-radio, execute the following to optimize the operation of the FFTW3 algorithm. The data that this produces - the "wisdom" file - is specific to every computer and running this optimizes performance on the hardware that you are using.

time fftwf-wisdom -v -T 1 -o nwisdom rof500000 cof36480 cob1920 cob1200 cob960 cob800 cob600 cob480 cob320 cob300 cob200 cob160

NOTE: This may take many minutes - or even hours to run, depending on your computer hardware.

Once this is done, take the resulting file - "nwisdom" - and place it in /etc/fftw - but back-up the previous version that was there!

For more information about this, see: https://github.com/ka9q/ka9q-radio/blob/main/docs/FFTW3.md

Also recommended:

It is recommended that you also install "Avahi" for local DNS name resolution of the multicast streams using the name rather than the IP address - one method being to do: snap install avahi

After this is installed, enable it by typing: sudo systemctl start avahi-daemon.service - and then verify that it is running by typing: sudo systemctl status avahi-daemon.service

The use of this will be discussed later.

To Do: See if there is a way to install Avahi using "apt install" rather than Snap as it is desirable to uninstall Snap as it can operate in the background and "break" an already-working system - particularly after a reboot.

IMPORTANT - Back up your .conf files!

As of the time of this writing (June, 2023) the default configuration files WILL BE OVERWRITTEN every time you do an update/make of ka9q-radio. Both the files in the home "~/ka9q-radio" and "/etc/radio" directories can be overwritten.

What this means is that if you modify the original configuration files (e.g. "/etc/radio/sdrplayd.conf", "/etc/radio/radiod@hf.conf" - and those in the "ka9q-radio" directory) you will LOSE those modifications when you do an update.

When you make changes to ANY configuration file, be sure to save a copy elsewhere, and be prepared to restore it after you do updates.

It is on the list of future updates to change this behavior.

The "magic" of "ka9q-radio": One input/multiple outputs:

In the simple case, let us presume that you have a single SDR of some sort: KA9Q-radio supports several receivers, including the RTL-SDR, HackRF, AirspyHF, Funcube Dongle, SDRPlay devices and the RX-888 - and more may be added in the future as this is open-source.

Let us suppose that you wish to receive audio from, say, 23 frequencies simultaneously for the purposes of propagation monitoring - all WSPR frequencies on the LF, MF and HF bands (14 frequencies - including two each on 60 and 80 meters), all WWV signals (2.5, 5, 10, 15, 20 and 25 MHz) and all CHU frequencies (3.330, 7.85, 14.66 MHz). Conventionally, one might configure 23 audio channels in ALSA/Pipewire/Pulse and while this is possible, it gets cumbersome: What if you also wanted to monitor the FT-8 and FT-4 channels as well - adding at least another 20 audio streams?

A usable work-around would be to convey these demodulated audio streams via multicast. This means of propagating data via UDP dates back to at least the late 1980s and is widely used to convey video streams over LANs: In this case, we could use this transport method to loop data to the same computer to which the receiver is connected - but we could also send this same data to one or more computers on the same LAN, each one taking the data that is relevant to its needs.

While this means of transport is elegant in its simplicity, it does have a few caveats:

Multicast uses UDP, meaning that the transport is considered in network parlance to be "unreliable": Unlike TCP/IP, it is not "connected" and does not acknowledge receipt or resend a packet if it is dropped. This, alone, means that it should not traverse the Internet in its native form.

This "unconnected" (connection-less) property means that its "broadcast" capability can be used to send received data to any number of computers on the same LAN without each, additional computer adding processing or data load to the source computer or network.

Multicast does not "play nice" with wireless LAN equipment - at least not the sort that is cheap and likely to be found around the home. To use multicast, you really should use a WIRED connection.

The "Opus" utilities included with "ka9q-radio" can help mitigate propagation of streams across "multicast-unfriendly" networks or the Internet in general by encoding the data and conveying it via conventional TCP/IP.

Multicast can flood a LAN if you are not careful. Because it is a broadcast, you can end up propagating many megabits per second of data across a network. For a small, home network with Gig-E connections, this may not be a big deal - but if you try this on a large network that isn't well architected (see below) then you won't make any friends!

Some older or simple network-connected devices (e.g. process controllers, alarm systems, clocks, etc.) may experience problems if they cannot deal with a large flood of multicast data - particularly if their IP stacks/hardware are not designed to do so.

Multicast might not propagate across networks between switches. Some routers/switches may not propagate Multicast - at least by default - to prevent the problems noted above.

A "good" router or smart switch will employ IGMP ³ to prevent multicast from "going everywhere" and allow management of where and how this data might be propagated (or not!) by the switch.
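As an illustration of the connection-less behavior described above, the following is a minimal sketch of how a program might join a multicast group and how a sender might limit propagation with the TTL. It uses only Python's standard "socket" module. The group address 239.1.2.3 and port 5004 in the usage comments are placeholders chosen for this example, not values taken from ka9q-radio, and the function names are hypothetical.

```python
import socket

def is_multicast(address):
    """IPv4 multicast addresses occupy 224.0.0.0 through 239.255.255.255."""
    first_octet = socket.inet_aton(address)[0]
    return 224 <= first_octet <= 239

def membership_request(group, interface="0.0.0.0"):
    """Build the 8-byte ip_mreq structure: group address, then local interface address."""
    return socket.inet_aton(group) + socket.inet_aton(interface)

def open_receiver(group, port, interface="0.0.0.0"):
    """Join a multicast group; the sender never learns that we are listening."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", port))
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                    membership_request(group, interface))
    return sock

def open_sender(ttl=1):
    """A TTL of 1 keeps the packets from being forwarded beyond the local network."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, ttl)
    return sock

# Example use (placeholder address and port, not taken from ka9q-radio):
#   receiver = open_receiver("239.1.2.3", 5004)
#   packet, sender_address = receiver.recvfrom(2048)
```

Joining the group is what prompts an IGMP-aware switch to start forwarding that stream to the receiver's port, so a well-managed network carries the data only to where it has been requested.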

Using KA9Q-radio:

IMPORTANT: This document - as is KA9Q-radio - is a work in progress and will evolve. I (the author of this document) am learning the various aspects of this utility as well and errors, misunderstandings and omissions are surely going to appear. Please consider this document to be a first step in getting acquainted with this software.

Comment: Various pieces of KA9Q-radio may be configured to start/operate as a service - but at the time of this writing, I could not get this to work reliably, so this remains a work in progress.

Example: A multi-channel receiver using the RTL-SDR dongle

The RTL-SDR dongle - and its many clones - is an inexpensive device that was originally intended to allow the reception of off-air digital television (using the DVB-T standards, which are not those used in the U.S., Mexico, Canada and several other countries.) Because it is essentially a frequency converter attached to an analog-to-digital converter, it can also be used as a front end SDR device.

The basic RTL-SDR dongle has the Rafael Micro R820T frequency converter followed by the Realtek RTL2832U and this combination allows the tuning of (more or less) any frequency from somewhere around 30 MHz to a bit higher than 1.5 GHz and operation in a bandwidth up to approximately 2 MHz. Some versions of the RTL-SDR dongle (such as the "RTL-SDR Blog Version 3") also include provisions that allow reception of HF frequencies (approx. 500 kHz to 30 MHz) by enabling the "Direct" ("Q") input, but the "rtlsdrd" program described here does not (yet?) have the ability to configure this input method, making it usable only for VHF and up.

Preparing the computer for use with the RTL-SDR Dongle

Unless you have already successfully installed and used the RTL-SDR dongle on your computer, there are a few suggested configurations that should be done for it to work properly: Even if you have used an RTL-SDR dongle before, it would be a good idea to verify that these have been done.

Add "rtl-sdr.rules" in /etc/udev/rules.d/rtl_sdr.rules

This file may be found online in many places, including the Osmocom rtl-sdr project and here - link. This file allows the computer to recognize an RTL-SDR device as such. The use of sudo may be required to install this file.

Add a blacklist entry to prevent the RTL-SDR from being recognized only as a DVB (Television) receiver device

This is done by adding the line: blacklist dvb_usb_rtl28xxu to the file "/etc/modprobe.d/blacklist.conf". The use of sudo may be required to edit this file.

After these changes are made, reboot the system for them to take effect.

Starting the receiver:

Unlike most other programs in ka9q-radio that interface with hardware, the program "rtlsdrd" does not have an external configuration (".conf") file associated with it at this time, so everything must be configured from the command line. This program also does not currently have the ability to be run as a service.

Consider this command line:

rtlsdrd -D rtlsdr-pcm.local -R rtlsdr-status.local -r 2048000 -f 162500000 &

Looking at each part in detail:

rtlsdrd - This invokes the program "rtlsdrd".
-D rtlsdr-pcm.local - This is the hostname ("rtlsdr-pcm.local") of the multicast stream on which the raw I/Q (PCM) data from the receiver will be emitted.
-R rtlsdr-status.local - This is the hostname ("rtlsdr-status.local") of the metadata (control) stream for this receiver.
-r 2048000 - This sets the sample rate of the RTL-SDR at 2.048 Msps.

The typical range of sample rates supported by RTL-SDRs is between 225 and 300 ksps and 900-2048 ksps. Rates higher than 2048 ksps may be supported by some combinations of dongles and USB hardware, but this must be tested on a case-by-case basis to see if this works without having problems with dropped samples.

-f 162500000 - This is the center frequency in Hz to which the receiver is tuned.

Important: Do not tune the receiver exactly to a frequency that you wish to monitor, but rather tune it a few kHz away (for SSB, AM, etc.) or at least 20 kHz away (for narrowband FM) signals. This should be done to avoid the narrow "zero Hz hole" that can be present on the output of the A/D converter: Signals with carriers that fall into this "hole" can be badly distorted.
Be sure that this frequency is chosen so that the receive frequencies that you wish to use are within the range of the receiver based on the sample rate.

In this example we can tune signals within half of the sample rate (1024 kHz) of the center frequency, or 162.500 MHz +/- 1.024 MHz.
In actuality, the available range is slightly less than this as we want to stay away from the extreme lower and upper edges of the sample passband (e.g. stay within +/- 1.0 MHz) as signals are rolled off near the edges and aliasing (false signals) can also appear there.

When using the R820T converter (e.g. not using "direct" mode) the lower frequency limit where signals may be received is typically around 30 MHz. The upper frequency limit is usually between 1500 and 1800 MHz.

& - The "&" at the end of the command line indicates that this program should run in the background.
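The sample-rate and tuning-range rules above can be checked with a few lines of arithmetic. The following is a rough sketch rather than anything that "rtlsdrd" itself does: The function names are hypothetical, the 24 kHz edge margin is simply what reduces +/- 1.024 MHz to the +/- 1.0 MHz mentioned above, and the 20 kHz center guard is the narrowband FM figure quoted earlier.

```python
# Typical RTL-SDR sample-rate ranges quoted in the text, in samples per second.
TYPICAL_RATE_RANGES = [(225_000, 300_000), (900_000, 2_048_000)]

def rate_is_typical(sample_rate):
    """True if the rate falls inside one of the typical supported ranges."""
    return any(low <= sample_rate <= high for low, high in TYPICAL_RATE_RANGES)

def classify(freq_hz, center_hz, sample_rate,
             edge_margin_hz=24_000, center_guard_hz=20_000):
    """Judge one receive frequency against the center frequency and sample rate."""
    offset = abs(freq_hz - center_hz)
    if offset > sample_rate / 2 - edge_margin_hz:
        return "out of range"          # beyond the usable part of the passband
    if offset < center_guard_hz:
        return "too close to center"   # may fall in the "zero Hz hole"
    return "ok"

CENTER = 162_500_000   # -f value from the example command line
RATE = 2_048_000       # -r value from the example command line
NWS_CHANNELS = [162_400_000, 162_425_000, 162_450_000, 162_475_000,
                162_500_000, 162_525_000, 162_550_000]

report = {freq: classify(freq, CENTER, RATE) for freq in NWS_CHANNELS}
for freq, verdict in report.items():
    print(f"{freq / 1e6:.3f} MHz: {verdict}")
```

With the center frequency of 162.500 MHz used in the example command line, the 162.500 MHz weather channel is reported as too close to center, which suggests choosing a center frequency that falls between channels if that particular channel matters to you.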

Additional optional parameters for controlling the receiver:

-I <device number> - "device number" is the serial number assigned to the RTL-SDR dongle.

Note that by default, the serial number of an RTL-SDR dongle is likely to be "00000000" or "00000001". It may be changed using the "rtl_eeprom" utility that you will have to install separately.
This parameter may be used to specify a particular RTL-SDR device on a system if more than one is present.
If this parameter is omitted, the first RTL-SDR device found will be used.

-L - This parameter, if present, sets "linearity" using gain tables. Without this parameter "sensitivity" is used.

Note: I don't quite know to what this refers.

-a - Enable software AGC. Without this parameter hardware AGC is used.
-b - Enable bias-Tee on the input RF connector. With this parameter, approximately 4.7 volts at up to 100 mA may be supplied from the RF connector to power a preamp, etc.
-c <offset> - This sets the frequency calibration, where <offset> is a positive/negative number used to set the frequency precisely. This is used to compensate for errors in the RTL-SDR's internal reference oscillator.

Note:

There is currently no support for:

Manual gain adjustment on the R820T converter.

This makes it impossible to calibrate a signal strength indicator that might be used, and it also is likely to result in brief signal disruptions if a strong signal appears in the passband when the A/D converter is briefly overloaded before the AGC reduces the gain.

This effect is more noticeable when running AX.25 packet as any brief "pop" caused by the appearance of a strong signal (e.g. local repeater keying up) and the subsequent overload will corrupt received packets.

There is no way to specify "direct" RF input via the "Q" channel as used for HF reception, limiting use only to be that through the internal frequency converter (the R820T). This means that only frequencies above (approximately) 30 MHz may be tuned.

Additional (optional) parameters related to data transfer from "rtlsdrd":

-A <iface> - This sets the interface used for the multicast data. "lo" is for loopback while "eth0" would be a typical value for the default Ethernet interface.
-S <ssrc> - This defines the SSRC (stream source identifier). If not specified, this is set automatically.
-v - Enable verbose mode (e.g. more information from "rtlsdrd")
-T <ttl> - TTL (Time to Live) for RTP data. Default is 1.
-t <ttl> - TTL (Time to Live) for metadata stream. Default is 1
-p <tos> - IP TOS (Type of Service). The default is 48 (e.g. AF12 << 2).
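The default TOS value of 48 follows from how the TOS byte is laid out: The DSCP code point occupies the upper six bits, so it is shifted left by two. A tiny sketch of that arithmetic, with a hypothetical helper name:

```python
AF12 = 12  # DSCP code point for "Assured Forwarding" class 1, drop precedence 2 (binary 001100)

def dscp_to_tos(dscp):
    """Place a 6-bit DSCP code point into the upper six bits of the TOS byte."""
    if not 0 <= dscp <= 63:
        raise ValueError("DSCP must fit in six bits (0-63)")
    return dscp << 2

print(dscp_to_tos(AF12))
```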

If you get an error:

Pay close attention to the errors that you might get. Typical causes of errors are:

Is the RTL device plugged in? It's worth checking!
If you are using the "-I" parameter, are you specifying the exact serial number of the device? You must specify this exactly!

If you are using more than one RTL-SDR dongle, be sure that each one has its own, unique serial number as set using the "rtl_eeprom" utility.

Omitting a required field. Our example above shows a reasonable minimum of parameters required to configure an RTL-SDR.
Invalid value within a field. Perhaps you put something wrong in a field?

Once rtlsdrd is running:

With rtlsdrd running, we need to set up our receivers which are invoked using "radiod" along with a configuration. "radiod" is the heart of ka9q-radio as it does the work of processing the massive amount of data coming in from our receiver hardware. Even on a modest processor it is capable of simultaneously demodulating hundreds of individual receive channels in a mix of frequencies, bandwidths, sample rates and modes.

Unlike "rtlsdrd", "radiod" uses a configuration file to set things up and here we will use "radiod@rtlsdr.conf".

Consider the following command line:

./radiod radiod@rtlsdr.conf &

The above will invoke "radiod" using configuration file "radiod@rtlsdr.conf" and if we peer into this we'll see which frequencies are configured - and the various receive modes. Multiple definitions of receive frequencies and modes may be included in various named sections. For more detailed information, see: https://github.com/ka9q/ka9q-radio/blob/main/docs/ka9q-radio.md

Our example "radiod@rtlsdr.conf" file:

Our "radiod@rtlsdr.conf" file used by "radiod" is used to define the virtual receiver(s) that we might want. Let's take a look at a minimum configuration:

The first - and required - section is the "global" section which contains the following:

[global]
overlap = 5
blocktime = 50
input = rtlsdr-status.local
status = fm.local
fft-threads = 4
samprate = 24000
mode = pm

Breaking this down:

blocktime - In most cases, this is "20", representing 20 milliseconds (or 0.02 seconds).

In the case of the configuration of the rtlsdr operating at 2048 ksps, this means that we have (2048000 * 0.02) = 40960 samples per FFT block.
A larger value for this can yield sharper channel filters and it somewhat relaxes timing constraints having to do with CPU scheduling - but it can increase latency and increase CPU utilization as the processing required per sample grows with the logarithm of the FFT block size.

overlap - This is typically 5.

When processing the FFTs, an overlap of 5 means that 1/5th of each block (e.g. 20%) consists of samples from the previous block, with the remaining 4/5 being "new" samples.
When a value of 5 is selected in conjunction with a blocktime of 20 ms, the overlap is therefore (20 / 5) = 4 milliseconds.
Smaller values of overlap (which means that a higher percentage of the data of each block is "old" data) permit sharper filters - but this also means that each FFT block contains a higher proportion of "old" data and more CPU power is needed overall.
Only certain values of overlap will work. A value of 5 is selected to reduce CPU loading, but for lower-bandwidth receivers like the RTL-SDR and/or if you have plenty of CPU power an overlap of 2 - which gives improved performance and better filters - could be used.

input - This is the name of the metadata (status) stream from the receiver, as given following the "-R" parameter in our invocation of rtlsdrd.
samprate - This is the default sample rate of the demodulated output streams: The example above is 24 kHz. This value may be overridden in the configuration of modes and receivers as needed.
mode - KA9Q-radio can demodulate several modes - and the above is defined as being "pm" as the default, which may be overridden in the configuration of modes and receivers as needed.

The mode "pm" is actually what is referred to as "FM" in amateur radio service. The "pm" mode includes de-emphasis: The highs are boosted on transmit (pre-emphasis) and subsequently de-boosted on receive (de-emphasis) to improve the noise performance of weak signals.

status - This defines the stream containing (what else?) the status of this thread.
fft-threads - This sets the number of threads used by "FFTW3" for the "forward FFT". On an Intel processor with at least 4 cores, 4 threads seems reasonable and it reduces latency of each FFT, but more threads implies a higher overall CPU utilization across all of the cores.
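The blocktime and overlap arithmetic described above can be reproduced in a few lines. This sketch simply follows the figures as given in this document. The function names are hypothetical, and "radiod" may size its FFTs somewhat differently internally.

```python
def samples_per_block(sample_rate_hz, blocktime_ms):
    """New input samples that arrive during one block time."""
    return sample_rate_hz * blocktime_ms // 1000

def blocks_per_second(blocktime_ms):
    """How many FFT blocks must be processed every second."""
    return 1000 / blocktime_ms

def overlap_time_ms(blocktime_ms, overlap):
    """Duration of the overlap, as described in the text (blocktime / overlap)."""
    return blocktime_ms / overlap

def old_data_fraction(overlap):
    """Portion of each block made up of samples from the previous block."""
    return 1 / overlap

RATE = 2_048_000  # RTL-SDR sample rate from the example command line

print(samples_per_block(RATE, 20))   # the typical 20 ms blocktime
print(samples_per_block(RATE, 50))   # the 50 ms blocktime in the example [global] section
print(overlap_time_ms(20, 5), old_data_fraction(5))
print(overlap_time_ms(20, 2), old_data_fraction(2))
```

Comparing the last two lines shows the trade-off: Going from an overlap of 5 to an overlap of 2 lengthens the overlap from 4 ms to 10 ms, but half of every block is then "old" data that has already been processed once.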

Let's take a look at the sub-sections located after the [global] section:

[NWS]
# NOAA/NWS weather frequencies
data = nws-pcm.local
mode = pm
freq = "162m400 162m425 162m450 162m475 162m500 162m525 162m550"

Examining in detail:

[NWS] - This names this section of the file "NWS"
# NOAA/NWS weather frequencies - The hash (#) symbol at the beginning of the line denotes that this is a comment
data = nws-pcm.local - This specifies the host name of the multicast stream on which the audio of our seven receive channels will be carried.
mode = pm - This specifies that we are receiving using "pm" - which is actually what amateur radio and the National Weather Service call "fm" - See the comment above.

Because we specified "pm" in the [global] section already, the definition here is redundant.

freq = "162m400 ... 162m550" - This is the list of frequencies to which our virtual receivers are tuned.

As noted above, be sure that the frequencies specified here are within range of the receiver at the current sample rate - also considering that one must stay slightly away from the upper/lower edges to avoid distorting the signal.

Note that the frequencies may be specified in a number of ways. For example, 162.55 MHz can be specified as:

162550000 - In Hz
162m550 - In MHz to the nearest 1 kHz
162m55 - In MHz to nearest 10 kHz
162m550000 - In MHz to the nearest Hz
162550k - In kHz to the nearest 1 kHz
162550k000 - In kHz, to the nearest Hz.

All of the above will specify the same frequency for reception.
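To show that these spellings really do describe the same frequency, here is a small sketch of a parser for the notation as it is described above. It is not the parser used by "radiod": The function name is hypothetical and only the "k" and "m" forms shown in the list are handled.

```python
from decimal import Decimal

# Unit letters shown in the list above; the letter also acts as the decimal point.
MULTIPLIERS = {"k": 1_000, "m": 1_000_000}

def parse_frequency(text):
    """Convert a frequency such as '162m550' or '162550k' to an integer number of Hz."""
    text = text.strip().lower()
    for letter, multiplier in MULTIPLIERS.items():
        if letter in text:
            whole, _, fraction = text.partition(letter)
            value = Decimal(f"{whole}.{fraction or '0'}")  # Decimal avoids float rounding
            return int(value * multiplier)
    return int(text)  # a plain number is already in Hz

spellings = ["162550000", "162m550", "162m55", "162m550000", "162550k", "162550k000"]
for spelling in spellings:
    print(spelling, "->", parse_frequency(spelling), "Hz")
```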

Here we define an output multicast stream that contains all seven NOAA (National Weather Service) frequencies used in the U.S. - and, in fact, we are receiving all of them simultaneously.

As mentioned above, a single multicast stream can carry multiple audio channels using the SSRC to identify/use the sub-stream for a particular receiver - and in the above we see that we have defined seven NWS frequencies on which we will receive. By default, the SSRC will be the frequency with any non-numeric characters removed.

Taking the example of the frequencies above and the SSRC values that they will produce when the non-numerical characters are removed:

162550000 = SSRC of 162550000
162m550 = SSRC of 162550
162m55 = SSRC of 16255
162m550000 = SSRC of 162550000
162550k = SSRC of 162550
162550k000 = SSRC of 162550000

The importance of being consistent in the way that the frequency is represented is very obvious!
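The default SSRC rule described above (keep only the digits) is easy to mimic, which makes the consequence of mixing notations plain. This is a sketch of the rule as stated in this document, with a hypothetical function name, and not code taken from "radiod".

```python
def default_ssrc(freq_text):
    """Default SSRC as described in the text: the frequency string with non-digits removed."""
    return int("".join(ch for ch in freq_text if ch in "0123456789"))

spellings = ["162550000", "162m550", "162m55", "162m550000", "162550k", "162550k000"]
ssrcs = {spelling: default_ssrc(spelling) for spelling in spellings}
for spelling, ssrc in ssrcs.items():
    print(f"{spelling:>12} -> SSRC {ssrc}")

print(sorted(set(ssrcs.values())))
```

Six ways of writing one frequency yield three different SSRC values, so a tool that selects a stream by SSRC needs the number that matches the notation actually used in the configuration file.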

Note:

If you have installed Avahi, you can use the name following the "data" statement ("nws-pcm.local") rather than needing the numerical IP address.
More in-depth information for starting "radiod" as a service.

Comment about FM/PM squelch:

As detailed later in this document, the "pm" mode has, by default, squelch enabled. What this means is that if there is no signal, there will be no multicast audio data emitted by "radiod" for the specified frequency.

For FM and PM reception in ka9q-radio there is currently no way to completely disable the FM squelch.

There must be a signal present on the frequency to which you are listening for there to be an audio stream.

"fftwf_export_wisdom_to_filename" errors when starting radiod

If you see the errors "fftwf_export_wisdom_to_filename" produced by "radiod" when it is starting, that does not mean that it won't work properly, but it likely will not be as CPU-efficient as it could be as the FFT "wisdom" needed to optimize operation of the algorithm is missing. To help resolve this - and potentially reduce CPU utilization - do the following: sudo chown <username> /var/lib/ka9q-radio/wisdom - substituting for <username> the name of the user under which you are running ka9q-radio.

Getting audio output (to speakers):

Being able to "hear" the demodulated audio is a quick and easy way to verify that everything is working - even if this isn't likely to be the main purpose to which ka9q-radio would be put. At this point it is recommended that you place a .wav file on your test system and then use "aplay" to test the speaker: If your filename were "music.wav", simply do: play music.wav and if all goes well, you should hear it play: If not, read the next section, below.

Having verified via "top" or "htop" that rtlsdrd and radiod are running, you can test it via a local speaker if you like but note that unless you need an analog audio output of some sort, it is not even necessary to have any audio playback devices on your system - but it's a nice tool to have. If your computer has a sound card, connect a speaker to it and do the following: "aplay -l": You should see a list of available devices such as the following:

**** List of PLAYBACK Hardware Devices ****
card 0: PCH [HDA Intel PCH], device 0: ALC662 rev3 Analog [ALC662 rev3 Analog]
Subdevices: 1/1
Subdevice #0: subdevice #0

You may see other devices, particularly if you have an HDMI or similar monitor that can convey audio - but the above is typical of an analog audio output device on a motherboard.

What to do if you see "No audio device" when you try to play your local audio file (e.g. "music.wav")

If you do not see any available sound cards - but you know that one is present (a plug-in card, on the motherboard), it may be that you have been the victim of a quirk of recent versions of Linux (e.g. Ubuntu 22.04) where parts of the sound system seem to "go away" at random - likely after a reboot/update. To repair this, try the procedure at the very top of this page: http://www.sdrutah.org/info/high_rate_loopback_websdr.html to re-load/restart the audio devices once again.

If you get an error like "play WARN alsa: can't encode 0-bit Unknown or not applicable"

This is a vexing problem to many trying to use their sound cards on recent versions of Ubuntu and it seems to be related to pulseaudio and/or pipewire. If you get this error - and audio does not play - try disabling pulseaudio:

systemctl --user stop pulseaudio.socket
systemctl --user stop pulseaudio.service

After doing the above, try playing the audio file again: It may work now, even if you still get the "can't encode 0-bit Unknown" error.

If you get an error related to pipewire and a report that no device is available - despite "aplay -l" showing devices, you may need to uninstall pipewire. This is done using the following commands:

systemctl --user unmask pulseaudio
systemctl --user --now disable pipewire-media-session.service
systemctl --user --now disable pipewire pipewire-pulse
systemctl --user --now enable pulseaudio.service pulseaudio.socket
sudo apt remove pipewire-audio-client-libraries pipewire

Once you have a working audio path

Remember: The audio stream will not exist unless there is a signal present on the input frequency - not even white noise!

Once you have verified that an audio device is present and will play the audio file that you have put on the computer, consider the following command:

monitor nws-pcm.local

We know from the output to the screen when "radiod" started, and from the [NWS] section of "radiod@rtlsdr.conf", that the name of the stream is "nws-pcm.local", which represents the multicast stream for the receivers (plural!). Upon this invocation we will see a screen like this:

KA9Q Multicast Audio Monitor: nws-pcm.local
                                                                 ------- Activity -------- Play
dB Pan     SSRC Tone Notch ID                                 Total   Current      Idle Queue
+0   0   162550                                                    9         9              80
+0 -25   162475                                                    6         6         2     0
+0 25   162500                                                    0         0         1     0

Comment:

The above shows the results from an RTL-SDR that is connected to an outside antenna and receiving three NWS broadcasts on 162.475, 162.500 and 162.550 MHz. Because there are no transmissions audible on the other four channels, they are squelched (which is enabled by default) and no data is being emitted from those receivers.

If all goes right, you should hear audio from the speaker containing every NWS transmitter that your RTL-SDR dongle can hear. To control these, press the "h" key to get a list of options - the most relevant for the current discussion shown below:

↑ ↓ - Select prev/next session
⤒ ⤓ - Home/End select first/last session
⇞ ⇟ - Page up/Page down select prev/next session page
d - Delete session
r - Reset playout buffer
m - Mute current session
M - Mute all sessions
u - Unmute current session
U - Unmute all sessions
A - Toggle start all sessions muted
- + - Volume -1/+1 dB
← → - Stereo position left/right (pan)
v - Toggle verbose display
h - Toggle help display
q - Toggle quiet mode

In other words, you can move up/down between receivers to select that on which the controls (keys) will operate.

When you first do this, it's recommended that you hit the "M" key (uppercase) to mute ALL receivers - and then use the up/down arrow to select which one(s) you wish to hear and then hit the "u" (lowercase) key to unmute that receiver. You can then use the plus and minus keys to adjust the volume, left/right arrows to pan (move between speakers), etc.

To exit "monitor" hit "CTRL-C".

Getting a specific audio source from a stream

While it is possible to use other tools to extract audio from a multicast stream (To Do: Discuss other methods in this or another document) the "pcmcat" tool allows you to do so. Taking the example of the NWS receivers again, consider the following line:

./pcmcat nws-pcm.local -s 162550 | aplay -r 12000 -c 1 -f s16_le

If all goes well (e.g. rtlsdrd and radiod are running) you will hear the 162.550 MHz NWS audio.

Again, remember that this signal must be present and being received by the RTL-SDR for the stream to exist: The stream will not exist if there is no signal and the virtual receiver is squelched.
Change the ssrc (the number after "-s") to reflect the frequency of your local NWS transmitter.

Similarly, if we wanted to hear a different NWS transmitter on 162.525 MHz we would use this line:

./pcmcat nws-pcm.local -s 162525 | aplay -r 12000 -c 1 -f s16_le

In the above we see the multicast address for the NWS receivers (here given by name as "nws-pcm.local"). Following the "-s" parameter we see the "ssrc" - in this case "162525", representing 162.525 MHz (or another number if we had used the "ssrc" parameter when we defined the receiver). Following this, we pipe the audio - via STDOUT - to "aplay", specifying a sample rate of 12 kHz (-r 12000), a monaural source (-c 1) and the format of our audio (-f s16_le), which is 16-bit signed, little-endian.
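As a rough illustration of these two details, here is a minimal Python sketch. It assumes, as the examples in this document suggest, that the default ssrc is simply the receiver frequency expressed in kHz. The helper names (ssrc_from_mhz, decode_s16le) are hypothetical and are not part of ka9q-radio.

```python
import struct

def ssrc_from_mhz(freq_mhz: float) -> int:
    """Frequency in MHz -> ssrc, ASSUMING the default ssrc is the frequency in kHz."""
    return round(freq_mhz * 1000)

def decode_s16le(raw: bytes) -> list[int]:
    """Unpack raw 16-bit signed, little-endian mono PCM into integer samples."""
    count = len(raw) // 2                      # two bytes per sample; drop an odd trailing byte
    return list(struct.unpack(f"<{count}h", raw[:count * 2]))

# Data rate of the stream fed to aplay: sample rate x bytes per sample x channels
BYTES_PER_SECOND = 12000 * 2 * 1

if __name__ == "__main__":
    print(ssrc_from_mhz(162.525))
    print(decode_s16le(b"\x00\x00\xff\x7f\x00\x80"))   # zero, largest and smallest sample values
    print(BYTES_PER_SECOND)
```

In other words, each second of audio from one of these receivers is 24000 bytes, with every pair of bytes holding one sample in the range -32768 to 32767.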

If desired, you could pipe the raw audio somewhere else - perhaps to a file - or use SoX to write it to a .wav file instead, as in this example:

./pcmcat nws-pcm.local -s 162525 | sox -t raw -r 12000 -b 16 -c 1 -L -e signed-integer - out.wav

This will record the 162.525 MHz NWS receiver, via "STDOUT" from pcmcat to the file "out.wav".

To make it record for 2 minutes, the following will work:

timeout 120 ./pcmcat nws-pcm.local -s 162525 | sox -t raw -r 12000 -b 16 -c 1 -L -e signed-integer - out.wav
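If SoX is not installed, the same job can be sketched in Python using only the standard-library "wave" module. This is a minimal sketch and not part of ka9q-radio: the function name, the script name in the usage comment and the idea of stopping after a fixed amount of audio (rather than using "timeout") are assumptions made for illustration. It expects the 12 kHz, mono, 16-bit little-endian stream described above.

```python
import sys
import wave

SAMPLE_RATE = 12000   # Hz, matching "-r 12000"
SAMPLE_WIDTH = 2      # bytes per sample (16-bit signed, little-endian)
CHANNELS = 1          # monaural

def pcm_to_wav(src, dst, seconds: float) -> int:
    """Copy up to `seconds` of raw PCM from `src` into a WAV written to `dst`.

    `src` is a binary file-like object; `dst` is a filename or a seekable
    binary file-like object. Returns the number of PCM bytes written.
    """
    remaining = int(seconds * SAMPLE_RATE) * SAMPLE_WIDTH * CHANNELS
    written = 0
    with wave.open(dst, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        while remaining > 0:
            chunk = src.read(min(4096, remaining))
            if not chunk:                  # input ended before the time limit
                break
            wav.writeframes(chunk)
            written += len(chunk)
            remaining -= len(chunk)
    return written

if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage:
    #   ./pcmcat nws-pcm.local -s 162525 | python3 pcm_to_wav.py out.wav 120
    pcm_to_wav(sys.stdin.buffer, sys.argv[1], float(sys.argv[2]))
```

At this sample rate, two minutes of audio amounts to 120 x 12000 x 2 = 2,880,000 bytes of sample data plus the WAV header.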

Within ka9q-radio is another utility called "pcmrecord" that can record every virtual receiver within a group defined in the radiod.conf file simultaneously (e.g. all six WWV signals could be recorded at once.) For more about "pcmrecord", see the page ka9q-radio command overview - Link.

Mode definitions

For more details about the "modes.conf" file and the parameters within it, see the page: Configuration files in KA9Q-Radio - link.

In the configuration file for "radiod" (e.g. "radiod@hf.conf") we see the use of modes such as "am" and "usb", but you might wonder how these are defined. The answer lies in the file "modes.conf" where each individual section defines one mode in terms of sample rate, actual method of demodulation, filter bandwidth, etc.

While many of the common "modes" are included in "modes.conf", you can define and add your own mode: Perhaps you need an upper-sideband receiver centered on 1500 Hz that is 400 Hz wide for WSPR - you could do that!

As an example of how these are defined, consider the [am] section of "modes.conf":

[am]
demod = linear
samprate = 12000
low = -5000
high = 5000
recovery-rate = 50
hang-time = 0
envelope = yes

demod - "Linear" defines modes where amplitude is a prime component of the modulation such as AM, SSB, HF digital modes - and any type of receiver where multiple signals/sidebands must be conveyed accurately at baseband.
samprate - 12000 defines a 12 kHz sample rate
low and high - The -5000 and 5000 define the lower and higher edges of the filter, respectively. Here, this defines a 10 kHz bandwidth and nominally 5 kHz of audio.
recovery-rate - The rate, in dB per second, at which the gain (AGC) is recovered once the signal has decreased in level and the "Hang Time" has expired
hang-time - The time, in seconds, after the signal has decreased to hold the gain constant before increasing it at the "recovery-rate"
envelope - When "yes" this enables the standard AM-type "envelope" detector - it defaults to OFF.
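Because "modes.conf" uses the familiar INI layout, its sections can be inspected with ordinary tools. The following Python sketch parses a copy of the [am] section shown above with the standard-library "configparser" module and derives the filter bandwidth. It is for illustration only: this is not how radiod itself reads the file, and the helper name load_mode is hypothetical.

```python
import configparser

AM_SECTION = """
[am]
demod = linear
samprate = 12000
low = -5000
high = 5000
recovery-rate = 50
hang-time = 0
envelope = yes
"""

def load_mode(text: str, name: str) -> dict:
    """Return selected settings of one mode section as typed Python values."""
    # ";" starts an inline comment in the sections quoted in this document
    parser = configparser.ConfigParser(inline_comment_prefixes=(";",))
    parser.read_string(text)
    sec = parser[name]
    return {
        "demod": sec.get("demod"),
        "samprate": sec.getint("samprate"),
        "low": sec.getfloat("low"),
        "high": sec.getfloat("high"),
        "hang_time": sec.getfloat("hang-time", fallback=0.0),
        "envelope": sec.getboolean("envelope", fallback=False),
    }

if __name__ == "__main__":
    am = load_mode(AM_SECTION, "am")
    print(am["high"] - am["low"])   # filter bandwidth in Hz
```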

Now, consider the [cwu] (upper-sideband CW) section:

[cwu]
demod = linear
samprate = 12000
low = -200
high = +200
shift = +500
hang-time = 0.2

low and high - "-200" and "200" specify a 400 Hz wide bandwidth.
shift - Here, "+500" moves the center to 500 Hz, meaning that our CW passband goes between 300 and 700 Hz. If we wanted lower-sideband CW (e.g. "cwl") we could make this "-500". Similarly, if you want a 700 Hz center frequency, this value would be 700 - the sign depending on whether you want upper or lower sideband reception.
hang-time - Here, 0.2 seconds (200 msec) is selected for the traditional fast recovery used during CW reception.
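The arithmetic behind "low", "high" and "shift" can be written out as a short Python sketch. It simply follows the description above (filter edges relative to the carrier, moved by "shift" to give the audio passband). The function name is hypothetical and nothing here is taken from the ka9q-radio source.

```python
def audio_passband(low: float, high: float, shift: float = 0.0) -> tuple[float, float]:
    """Return the (lower, upper) audio passband edges in Hz after applying shift."""
    return (low + shift, high + shift)

if __name__ == "__main__":
    print(audio_passband(-200, 200, 500))   # [cwu] as shown above
    print(audio_passband(-200, 200, 700))   # same filter with a 700 Hz center
    print(audio_passband(-5000, 5000))      # [am]: no shift applied
```

With the [cwu] values this gives edges of 300 and 700 Hz, matching the passband described above.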

Now, a "non-linear" mode:

[fm]
demod = fm
samprate = 24000
low = -8000
high = +8000
deemph-tc = 0
deemph-gain = 0
threshold-extend = no ; don't interfere with packet, digital, etc

Contrary to common parlance, the mode amateurs use on VHF and UHF and call "FM" is really phase modulation - but "fm" is operationally identical to "pm" if, during transmit, the audio is pre-emphasized (boosted) at a rate of 6 dB/octave and then de-emphasized (filtered) at the same rate on receive. "True" FM is used for digital modulation such as that used for D-Star, C4FM, etc.

demod - Here "fm" is specified. Unlike a linear mode, the amplitude of the received signal has no bearing on the demodulation.
samprate - Because the width of an FM signal is typically more than 10 kHz, the sample rate must be high enough to accommodate this. For an I/Q (complex) signal the usable bandwidth equals the sample rate, so a 24 kHz sample rate provides 24 kHz of bandwidth and easily accommodates typical amateur FM signals.
low and high - The +/-8000 specifies a 16 kHz bandwidth - appropriate for the +/-5 kHz deviation typically found in North America on the VHF and UHF amateur bands.
deemph-tc - To Do: The value of "0" disables de-emphasis?
deemph-gain - To Do: The value of "0" provides no gain?
threshold-extend - It is recommended that this be used ONLY for voice, not data
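The numbers in the [fm] section can be sanity-checked with Carson's rule, which estimates the occupied bandwidth of an FM signal as twice the sum of the peak deviation and the highest audio frequency. The Python sketch below assumes a 3 kHz top audio frequency (a typical figure for communications-grade voice, not something stated in "modes.conf") and also checks that the filter edges fit within the +/- (samprate / 2) span available with complex (I/Q) samples. The function names are hypothetical.

```python
def carson_bandwidth(deviation_hz: float, max_audio_hz: float) -> float:
    """Carson's rule: approximate occupied bandwidth of an FM signal in Hz."""
    return 2.0 * (deviation_hz + max_audio_hz)

def filter_fits(low: float, high: float, samprate: float) -> bool:
    """True if both filter edges lie within +/- samprate/2 (complex sampling)."""
    return -samprate / 2 <= low < high <= samprate / 2

if __name__ == "__main__":
    print(carson_bandwidth(5000, 3000))      # compare with high - low = 16000
    print(filter_fits(-8000, 8000, 24000))   # the [fm] values shown above
    print(filter_fits(-8000, 8000, 12000))   # a 12 kHz sample rate would be too low
```

With +/-5 kHz deviation and the assumed 3 kHz of audio, Carson's rule gives 16 kHz, which agrees with the +/-8000 filter edges.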

Traditionally, "threshold extension" is a means by which the quality of weak FM signals is improved by coupling a reduced detection bandwidth with a tracking local oscillator: this accepts an increase in distortion, caused by the "excessive" filtering of the sidebands, in exchange for a noise-reduced signal.
According to the documentation provided, this version of "threshold extend" deals with the instances when an FM detector using the 4-quadrant "atan2()" function "slips" 360 degrees under noisy (weak-signal) conditions: it blanks the audio when the signal amplitude drops below a threshold, reducing the intensity of the "pops" that would otherwise be present - hence the reason for not recommending its use for data. Consider this to be experimental.
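To show where such "slips" come from, here is a generic textbook FM discriminator in Python: it takes the angle (via atan2) of each I/Q sample multiplied by the conjugate of the previous one, and optionally blanks the output when the amplitude falls below a threshold. This is only a sketch of the general idea, not the code used in ka9q-radio. The function name, the blanking rule and the threshold value are all assumptions.

```python
import cmath
import math

def fm_demod(iq: list[complex], blank_below: float = 0.0) -> list[float]:
    """Phase-difference FM detector; output is the phase step per sample in radians."""
    out = []
    for prev, cur in zip(iq, iq[1:]):
        if abs(cur) < blank_below:
            out.append(0.0)                   # blank weak (noisy) samples
            continue
        product = cur * prev.conjugate()      # angle of product = phase change
        out.append(math.atan2(product.imag, product.real))
    return out

if __name__ == "__main__":
    fs, offset = 24000, 1000                  # Hz; a carrier offset by a constant 1 kHz
    step = 2 * math.pi * offset / fs
    tone = [cmath.exp(1j * step * n) for n in range(8)]
    print(fm_demod(tone))                     # each value is close to `step`
```

Because atan2 can only report angles between -pi and +pi, a noise-induced phase jump larger than pi between two samples wraps around to the opposite sign. This wrap is the 360-degree slip that is heard as a pop.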

Finally, "pm" - the receive mode that is appropriate for typical amateur "FM" operation on the VHF/UHF bands:

[pm]
demod = fm
samprate = 24000
low = -8000
high = +8000
squelchtail = 0
threshold-extend = yes ; PM assumes voice mode, so enable this

Here, the "squelchtail" is specified as a number of "blocks" (defined in the main receiver definition - typically 20 milliseconds each), so a value of "0" means no squelch tail. "threshold-extend" is turned on (OK for voice - not recommended for any sort of data) and, for +/- 5 kHz deviation, the same +/-8000 Hz (16 kHz) bandwidth is used. It's worth noting that de-emphasis is on by default when using "demod = fm", making it appropriate for the "pm" mode used by amateurs on the VHF and UHF bands.
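Two of the ideas above lend themselves to a short Python sketch: converting a squelch tail given in blocks to seconds, and a simple one-pole low-pass filter that rolls off at about 6 dB/octave, which is the general shape of de-emphasis. The 20 ms block time is the "typical" figure mentioned above. The 300 Hz corner frequency and both function names are assumptions for illustration and are not taken from ka9q-radio.

```python
import math

def squelch_tail_seconds(blocks: int, block_ms: float = 20.0) -> float:
    """Squelch tail length in seconds for a given number of blocks."""
    return blocks * block_ms / 1000.0

def deemphasize(samples: list[float], samprate: float, corner_hz: float = 300.0) -> list[float]:
    """One-pole low-pass filter: falls at about 6 dB/octave above corner_hz."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * corner_hz / samprate)
    out, state = [], 0.0
    for x in samples:
        state += alpha * (x - state)      # move part of the way toward the input
        out.append(state)
    return out

if __name__ == "__main__":
    print(squelch_tail_seconds(0))        # "squelchtail = 0" as in [pm]
    print(squelch_tail_seconds(5))        # five blocks of 20 ms each
    step = deemphasize([1.0] * 200, 24000)
    print(step[-1])                       # a steady input passes through nearly unchanged
```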

For more information about the parameters found in "modes.conf" and other files refer to:

Configuration files in KA9Q-Radio - link.
https://github.com/ka9q/ka9q-radio/blob/main/docs/ka9q-radio.md

References:

1 - Mark Borgerding, “Turning Overlap-Save into a Multiband Mixing, Downsampling Filter Bank”, IEEE Signal Processing Magazine, March 2006. https://www.iro.umontreal.ca/~mignotte/IFT3205/Documents/TipsAndTricks/MultibandFilterbank.pdf

2 - The csdr tools by HA7ILM may be found here: https://github.com/ha7ilm/csdr. This represents a "toolbox" of signal processing engines that can do things like filter, decimate, shift, demodulate, convert formats, provide AGC and more. These tools may be useful for additional filtering of signals.

3 - IGMP (Internet Group Management Protocol) is used to set up "local" groups of hosts. In the context of this article, a "group" might be a number of hosts that require multicast data from a source on specific portions of the network, but not everywhere. The ability to compartmentalize where multicast data is sent can prevent it from flooding to other devices on the network. See the article: https://en.wikipedia.org/wiki/Internet_Group_Management_Protocol

4 - The PA3FWM WebSDR at the University of Twente in the Netherlands uses an A/D converter that streams raw data via an Ethernet interface to a computer that uses graphics-card processors to do the heavy lifting. See http://websdr.ewi.utwente.nl:8901/ and http://www.pa3fwm.nl/projects/sdr for more information. P.T. de Boer, PA3FWM, provides a version of the WebSDR software that is similar to that operating at the University of Twente but does not use GPU cores or bespoke hardware, and is therefore more limited in bandwidth. It is still extremely economical with CPU power when servicing many users - even on computers of limited processing power - allowing many times the number of users compared to OpenWebRX. While I have no direct evidence that it does, I suspect that the PA3FWM WebSDR uses the technique described in Reference #1 to achieve this efficiency.

TO DO:

Find the best way to determine multicast addresses for PCM and status streams - DONE - use AVAHI
Do a better job of "getting in the mind" of the creator of this code to understand the methodology.
Wherever it says "To Do" or "Source code says..." provide better understanding/documentation

Additional information:

For general information about this WebSDR system - including contact info - go to the about page (link).

For the latest news about this system and current issues, visit the latest news page (link).

For more information about this server you may contact Clint, KA7OEI using his callsign at ka7oei dot com.

For more information about the WebSDR project in general - including information about other WebSDR servers worldwide and additional technical information - go to http://www.websdr.org
