# Myths & Misconceptions

# More than Frequency Response?

The first thing to consider is whether or not FR is the only measurement that should be looked at, as EQ and the guide are based on the concept of FR being the dominant aspect of sound quality. In fact, the EQ guide works on the premise of IEMs being minimum phase systems for which the FR can be changed using minimum phase filters (used in most EQ software). The following sub sections cover different measurement types and their importance when assessing sound quality and EQ.

# Distortion/Nonlinearity

The most obvious measurement to look at is distortion/nonlinearity. In the case of IEMs which are low excursion transducers, nonlinearity is mostly characterized by harmonic distortion (HD), and therefore any increase in other distortion metrics will be reflected in HD. It is therefore common to see measurements of total harmonic distortion (THD) when distortion figures are provided along with the tested SPL level. Indeed, as the SPL output of an IEM increases, so does distortion; when using EQ, the preamp will reduce the digital volume, but most will also increase the volume a bit to counter the preamp. Therefore, any time a positive gain filter is used in an EQ, the distortion in the affected region will increase.

What is not common is the inclusion of psychacoustic models such as masking. Therefore, most distortion measurements will be shown with two options: the first presented as "raw" data with no indication for how humans will perceive it, and the second with a generalized rule of thumb concerning distortion audibility. In both cases, the technical data is presented without much consideration for the human perceptual system. However, in most cases, distortion will not be perceptually significant with IEMs, and will not be audible even with extensive EQ. It is therefore unecessary to consider nonlinearity as being a big factor when using EQ.

# Transients, Speed, & Responsiveness

A transient is often explained as being a short duration, single instance sound, such as the pop of a balloon or a gunshot. These are usually displayed in the time domain; instead of having frequencies on the x-axis, it is instead time.

The most common method way of seeing the transient response of IEMs (and other transducers) is by measuring the impulse response (IR). A single impulse is fed into the IEM, and the obtained output shows the response of the IEM over time. This single impulse contains all frequencies at the same amplitude, after which a Fourier transform of the IR is used to separate the frequencies and the corresponding amplitude of each frequency, which changed due to the FR of the IEM. In fact, FR measurements are all done by first measuring the IR of the IEM, then doing the Fourier transform of it to obtain the FR.

Below is a demonstration of the link between FR and IR shown with two IEMs; both were measured, and one was re-measured using EQ to approximate the other IEM within reason (+/- 0.5 dB). By using EQ, or any method of changing the FR, there is a corresponding change in the IR. These are causally linked, and therefore contain the same information. However, IR is not helpful when trying to correlate it with what is being heard, and is not a valid method for characterizing "speed".

The small differences in the modified IRs are due to non-perfect matching of the two FRs. The modified FR graph even shows visible discrepancies between the two FRs, which then lead to different IRs.

# CSD/Waterfall Plots

CSD, or waterfall plots, seem like the optimal way of displaying both frequency and time domain information. However, as discussed previously, time domain information, such as decay or resonance is causally linked to the FR by the minimum phase behavior of IEMs, as shown with the IR demonstration. Indeed, for minimum phase systems, any ringing and/or decay will reflect in amplitude in the FR: when these issues are fixed, so are the displayed ringing/decay features in waterfall plots.


# Tuning & Technicalities Dichotomy

Popularized by Crinacle and adopted by many, the tuning versus technicalities dichotomy has become the most common way of thinking about sound quality with relation to IEMs (and headphones). The prevalence of measurements combined with uncertainty and mismatch with perception has resulted in a split between "easy to explain" aspects, grouped under tuning for and in consequence FR, and "hard to correlate" aspects, grouped under technicalities. The latter is of importance, since it is often thought of as being separate from tuning and FR, when that is not the case; in fact, based on what has been discussed before, FR is the most important metric for predicting sound quality and dominates over every other one. Much of what is labeled technical is merely tuning under another name, colored by expectation and subject to interpretation. The frequency response remains the most comprehensive and reliable indicator of what is heard.

The division has been further reinforced by the rise of EQ, which allows competent transducers to be shaped into nearly any response: in turn, tuning becomes a matter of taste, while technicalities are assumed to reflect the underlying quality of the driver, unalterable. If EQ can replicate one IEM’s frequency response using another, and if that response fully defines the sound, then what remains to distinguish technical performance? As detailed previously, FR defines, in most cases, the totality of audible output. No other measurable metric can be used to explain technicalities as being separate from FR. Therefore, it is better to think of technicalities as perceptual consequences of tonal balance and tuning quirks. When listeners point to an IEM having better technicalities despite "identical" FRs, they are often responding to uncontrolled variables, such as differences in insertion depth, ear geometry, HRTF, unit variation, isolation, or even expectations.

For technicalities to become something to consider as separate from FR, there are a few steps to take from the academic and industry side.

  1. The first is to define the terms used under the technicalities umbrella in ways that are consistent with people's interpretation and perception.
  2. The second is to provide a theoretical framework under which testing and measurements can be done to quantify these terms.
  3. The third is to provide conclusive differences in the measured data for these variables between different IEMs
  4. The fourth is to demonstrate the audibility of such variables
  5. The fifth is to examine the statistical spread and correlation between these variables, IEMs, and preference

These five steps aim to not only examine technicalities as being separate from FR, but also to verify audbility and preference, both concepts that are assumed to exist and match perfectly in the hobby.


# Test Signals vs Music

Another common misconception involves the test signal used for measurmeents. Indeed, many argue that the sine sweep that is used currently is not representative of real-life "complex" signals like music, with many frequencies at different amplitudes occuring at the same time to the point where IEMs will have varying levels of "accuracy".

Depending on the algorithm, different stimulus signal can be the input, whether it be noise, music, sine sweep, etc. The resulting FR will always be the same (sometimes with miniscule differences). White noise, which contains all frequencies at the same amplitude, is similar in concept to IR with its single impulse containing all frequencies at the same amplitude. Music can be thought of as a form of noise, with the caveat of music having an FR of its own which will have to be removed by the algorithm in order to only obtain the transfer function of the IEM and not of both.

Nowadays, measurements use an exponentially-swept sine signal as detailed by Angelo Farina, which allows the user to measure both linear and nonlinear behavior of IEMs (or other DUTs) in a short time. In REW, options for measuring with sweeps and noise are available; a real-time analyzer (RTA) function is also present, allowing the user to measure the real-time response of an IEM (works best with white noise; if using music, the average FR of the music has to be subtracted).


# Driver Type & Configuration Differences

Similar to the tuning and technicalities dichotomy, there are also a lot of ideas and concepts related to the drivers themselves that have been created and perpetuated to explain various perceived sound characteristics, despite many of them being false.

Concepts such as speed and decay have been shown to relate directly to FR and are therefore not immutable properties of the drivers. Other factors such as driver size are either Thiele/Small parameters (TSP), or linked to them, and it is these parameters that ultimately influence both linear (FR) and nonlinear (distortion) behavior. These parameters are often discussed as being directly related to how sound is perceived, when they are instead parameters that are used for a desired system.

The same occurs with driver configurations, which are often used as an indicator of performance. These claims assume that the driver configuration determines how an IEM performs, but in reality, they are only meaningful because they influence the FR. An IEM with many drivers can sound bad compared to a single driver IEM not because it is inherently worse, but because its FR is less optimal. In this sense, driver configuration is a means, not an end, to a desired and targeted FR.