Audio over IP, what is it, why should I care?

IP networks are the largest standardized data transfer networks worldwide.  These networks can be found in almost every business and home and are used for file transfer, storage, printing, etc.  The Internet Protocol over Ethernet (802.x) networks is widely understood and supported.  It is robust, inexpensive, well-documented, readily deployed, and nearly universal.  Many equipment manufacturers such as Comrex, Telos, and Wheatstone have developed audio equipment that uses IP networks to transfer and route audio within and between facilities.

IP protocol stack
IP protocol stack

Audio enters the system via an analog-to-digital converter (A/D converter), often a sound card, at which point a computer program stores it as a file.  These files can be .wav, .mp3, .mp4, apt-X, or some other format.  Once the audio is converted to a digital data format, it is handled much the same way as any other digital data.

IP stands for “Internet Protocol,” which is a communications protocol for transmitting data between computers connected to area networks.  In conjunction with a transmission protocol, either TCP (Transmission Control Protocol) or UDP (User Datagram Protocol) IP forms what is known as the Internet Protocol Suit known as TCP/IP.  The Internet Protocol Suit contains four layers:

  1. Application layer – This is the protocol that contains the end-use data.  Examples of these would be HTTP, FTP, DCHP, SMTP, POP3, etc.  Telos Systems uses its own application called “Livewire” for its equipment.  Wheatstone uses “WHEATNET.”  Digigram uses “Ethersound.”   This is an important distinction.
  2. Transfer layer – This contains the TCP or UDP header information that contains such things as transmitting, and receiving ports, checksum value for error checking, etc.  It is responsible for establishing a pathway through multiple IP networks, flow control, congestion routing, error checking, and retransmission.  TCP allows for multiple IP packets to be strung together for transmission, increasing transfer rate and efficiency.
  3. Internet layer – This is responsible for transporting data packets across networks using unique addresses (IP addresses).
  4. Link Layer – This can also be called the physical layer, using Ethernet (802.x), DSL, ISDN, and other methods.  The physical layer also means things like network cards, sound cards, wiring, switches, and routers.

Advantages:

An IP network can be established to transmit data over almost any path length and across multiple link layer protocols.  Audio, converted to data can thus be transmitted around the world, reassembled, and listened to with no degradation.  Broadband internet connections using a cable, DSL, ISDN, or T-1 circuits can be pressed into service as STL’s, ICR’s, and TSL’s.  This translates to fast deployment; no STL coordination or licensing issues, no antennas to install if on a wired network.  Cost reductions are also realized when considering this technology over dedicated point-to-point TELCO T-1’s.  Additionally, license-free spread spectrum radios that have either DS-1 or 10baseT Ethernet ports can be used, provided an interference-free path is available.

IP audio within facilities can also be employed with some brands of consoles and soundcards, thus greatly reducing audio wiring and distribution systems and corresponding expenses.  As network speeds increase, file transfer speeds and capacity also increase.

Disadvantages:

Dissimilar protocols in the application layer mean a facility can’t plug a Barix box into a Telos Xtream IP and make it work.  There are likely hundreds of application layer protocols, most of which do not speak to each other.  At some point in the future, an IP audio standard, like the digital audio AES/EBU may appear, which will allow equipment cross-connections.

Additionally, the quality of the physical layer can degrade performance over congested networks.  The installations must be carefully completed to realize the full bandwidth capacities of cables, patch panels, patch cords, etc.  Even something as little as stepping on a Category 6 cable during installation can degrade its high-end performance curve.  The cable should be adequately supported, not kinked, and not stretched (excessive pulling force) during installation.

TCP/IP reliability is another disadvantage over formats like ATM.  In a TCP/IP network, no central monitoring or performance check system is available.  TCP/IP is what could be called a “broadcast” protocol.  That is to say, it is sent out with a best-effort delivery and no delivery confirmation.  Therefore, it is referred to as a connection-less protocol and in network architecture parlance, an unreliable network.  Lack of reliability allows any of these faults to occur; data corruption, lost data packets, duplicate arrival, out of order data packets.  That is not to say that is does not work, merely that there is no alarm generated if an IP network begins to lose data.  Of course, the loss of data will affect the reconstruction of the audio.

Analog digital converter symbol
Analog digital converter symbol

Finally, latency can become an issue over longer paths.  Every A/D converter, Network Interface Card (NIC), cable, patch panel, router, etc has some latency in its circuitry.  These delays are additive and dependent on the length of the path and the number of devices in it.

Provided care is taken during design and installation, AOIP networks can work flawlessly.  Stocking adequate spare parts, things like ethernet switches, NICs, patch cables and a means to test wiring and network components is a requirement for AOIP facilities.

Listening to the Radio like doing Cocaine

Well now, this explains a few things.  Always interested in carrying science forward, I read with interest the article on Gawker which cites a study from McGill University in Montreal, Canada.  The gist of the article states that we seek out music we enjoy because of a chemical reaction in our brains:

If music-induced emotional states can lead to dopamine release, as our findings indicate, it may begin to explain why musical experiences are so valued. These results further speak to why music can be effectively used in rituals, marketing or film to manipulate hedonistic states. Our findings provide neurochemical evidence that intense emotional responses to music involve ancient reward circuitry and serve as a starting point for more detailed investigations of the biological substrates that underlie abstract forms of pleasure.

By extension, radio has previously been the venue for most new music discoveries.  Although this continues today, it is being supplanted by “new media” sources such as youtube.  As a point of reference, studies on cocaine addiction show that dopamine levels increase by about 22% during use.  When a listener is exposed to what is perceived as good music (a subjective term), average dopamine levels increased by about 21%.

Risk-taking behavior like computer file sharing, when known consequences are large, could enhance that by adding an element of danger.  The Gawker article lists Samuel Barber’s “Adagio for Strings,” which I always found somewhat depressing.  Something more like this a little more fun:

You can disregard the T-shirt pitch at the end if you want.

So there you are you erstwhile program directors, now you know why your job is important; you are to make us addicts, or not, depending…

MW/SW hybrid stations

There are four of these stations in Canada.  In my scanning of the shortwave frequencies, I have found CFRX on 6070 KHz to put a strong signal into my location pretty much 24/7. At night it is drowned out by interference but daytime is listenable from local sunrise until about an hour before local sunset with a 100 uv signal.  A look at the map reveals the CFRX transmitter site is roughly 300 miles away.  CFRX is a relay of CFRB, 1010 KHz in Toronto, Ontario, Canada.  CFRB does not come in here at all due to protecting 1010 WINS in NYC.

1964 CFRX/CFRB QSL card
1964 CFRX/CFRB QSL card

What I find very interesting is the station uses 1 KW TPO into a single 50-foot (117-degree) vertical tower.  That is a pretty low-tech transmission facility, nothing like the minimum 50 KW with a 10 dB antenna requirement the FCC stipulates for shortwave broadcasters in this country. Even so, it generates a big useable signal, in the case of CFRX, covering nearly 300,000 square miles very inexpensively.

CFRX has been relaying CFRB’s signal since 1937.  According to the ODXA site, the station uses a Bauer 701B modified to transmit on HF.    The Bauer transmitters were always solid units.  It went off the air in 2008 for several months while the transmitter was repaired, but eventually, it did return.  Obviously, the station ownership finds value in the service and it continues today.

I often wonder why the FCC won’t allow a similar HF relay service here in the US.  Daytime propagation on the tropical bands (75 and 60 Meters (3.9-4 MHz and 4.7 – 5.06 MHz respectively)) would allow low-power relays to cover large areas like CFRX.  Tropical band propagation is such that nighttime coverage may be degraded by interference from other, more powerful stations offshore.  Even so, it seems like a good way to cover a lot of ground in an economical way.

Other Canadian shortwave relay stations:

HF Call signPower/frequencyParent stationLocationPower/frequency
CFRX1000/6060CFRBToronto, ON50 KW/1010
CFVP1000/6030CKMXCalgary, AB50 KW/1060
CKZN300/6160CFGB (CBC R-1)St. Johns, NF4.5 KW/89.5 MHz
CKZU500/6160CBU (CBC R-1)Vancouver, BC50 KW/690

Power in watts, frequency is KHz unless otherwise noted.

I know the CRTC would like to do away with these stations, but I think that is foolish.