Just randomly thinking again about some things I noticed with audio at
low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
32000: Sounds good--- Synchronet 3.21a-Linux NewsLink 1.2
22050: Moderate
16000: OK, Modest size, acceptable quality.
Seems like best tradeoff if not going for high quality.
11025: Poor, muffled.
8000: Very poor, speech almost unintelligible (normally).
But, it is seeming like a "weird hack" may exist here.
  8000: Very poor, speech almost unintelligible (normally).
    But, it is seeming like a "weird hack" may exist here.
On 9/6/25 5:28 AM, BGB wrote:
   8000: Very poor, speech almost unintelligible (normally).
     But, it is seeming like a "weird hack" may exist here.
You might want to look at how AT&T did it. It has been a while but I think this
is near what they used. Back when phones were analog and digital was just getting started.
Just randomly thinking again about some things I noticed with audio
at low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
32000: Sounds good
22050: Moderate
16000: OK, Modest size, acceptable quality.
Seems like best tradeoff if not going for high quality.
11025: Poor, muffled.
8000: Very poor, speech almost unintelligible (normally).
But, it is seeming like a "weird hack" may exist here.
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at
low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
32000: Sounds good
22050: Moderate
16000: OK, Modest size, acceptable quality.
Seems like best tradeoff if not going for high quality.
11025: Poor, muffled.
8000: Very poor, speech almost unintelligible (normally).
But, it is seeming like a "weird hack" may exist here.
On Sat, 6 Sep 2025 05:28:16 -0500
BGB <cr88192@gmail.com> wrote:
Just randomly thinking again about some things I noticed with audio
at low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
32000: Sounds good
22050: Moderate
16000: OK, Modest size, acceptable quality.
Seems like best tradeoff if not going for high quality.
11025: Poor, muffled.
8000: Very poor, speech almost unintelligible (normally).
But, it is seeming like a "weird hack" may exist here.
8000 x 8bit (mu-law in USA, A-law in majority of the world) was a
standard sampling rate for digital back ends of analog wired telephony
for more than 50 years. I didn't check, but would assume that it still
is.
Most people founded it quite intelligible. Certainly more intelligible
than cellular telephony, until less then 20 years ago cellular improved
a little.
On 9/6/2025 11:21 AM, MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at
low sample rates.
For baseline, can note, basic sample rates:
   44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
Dunno. I mostly use headphones.
Seemingly, at least with the headphones I have, I can hear tones up to around 17 kHz, but above this, pretty much nothing.
I noticed when trying to get new headphones, I got some cheap ones at
first that sounded like muffled crap (they were around $10 IIRC). I
tried generating tones and with these headphones audio dropped off to nothing after around 11 kHz. Ended up needing to buy some slightly more expensive headphones (around $30 IIRC, from Logitech), which sounded a
bit better.
Ended up giving the cheap ones to my dad, they apparently worked fine
for him.
Below 1kHz, sine waves rapidly drop off in intensity, whereas square and sawtooth waves retain full loudness.
on the headphones, I can still hear sine waves (well under 1kHz) if the volume is fairly high.
IRL, I have noted that I am mostly unable to hear tuning forks.
My mom also recently got a "steel tongue drum" (with an apparent 432Hz tuning), which I had noted I can sorta hear, but the sound is very
quiet. I mostly hear the "thwap" sound when she uses the little rubber- tipped mallet on it.
If I put my hand near it, I can feel vibrations, but I don't really hear anything.
Personally, much over a 32 kHz sample rate, any difference rapidly drops off, so 44100 and 48000 seem to sound basically the same.
I was mostly trying to explore the area around 8000 though, where
normally I hear crap-all. But, seemingly, with some questionable
filtering, intelligible speech can come through, I just don't entirely understand how it works.
But, as noted, there are several variations of the trick:
 Feed audio through ADPCM;
   Works better with either 2-bit/sample IMA,
    or with encoder tuned to overshoot.
 Model audio as line-fitting during downsampling.
   This is likely similar to what ADPCM ends up doing.
 Model audio as B-spline fitting.
   Seems to preserve more perceptual quality than the line fitting.
But, what I am not entirely sure of is why this would make any real difference.
But, can note that it does differ from the more conventional
downsampling strategies of "just average stuff", in that both approaches tend to generate points outside the original curve.
   32000: Sounds good
   22050: Moderate
   16000: OK, Modest size, acceptable quality.
     Seems like best tradeoff if not going for high quality.
   11025: Poor, muffled.
    8000: Very poor, speech almost unintelligible (normally).
      But, it is seeming like a "weird hack" may exist here.
Seemingly, there is no general disagreement that 11025 and 8000 sound
kinda like crap?...
I guess 11025 worked OK for Doom and Quake.
 Quake 2 had used 22050 (but, still 8-bit PCM).
 Quake 3 had used 22050 (but 16-bit PCM now)
With Wolfenstein 3D, it wasn't until hearing some slightly better
quality versions of the sound effects from the iOS port that I realized
the enemies were saying stuff for their sound effects. Like, the low-
level enemies apparently saying "Achtung!" rather than "Aaah-Uuuh" (but, with the audio from the DOS version, just sorta heard a whole lot of the latter).
But, as noted, I mostly ended up preferring 16000 A-Law for sound
effects and similar as a good tradeoff for space and quality. Also
ADPCM, which uses less space.
Some people seem to try to use MP3 or OGG for sound effects, but:
 128 kbps: Bulky
  64 kbps: Poor
  32 kbps: Can full of broken glass.
In addition to both formats being complicated, computationally expensive
to decode, and typically needing to use a third party library to decode them.
Also, in this case, 2-bit IMA ADPCM seems to somewhat beat MP3 at the
low bitrate game (at least to my hearing).
Not sure of a good way to go lower, best way I have found in past
fiddling was, eg:
 Downsample by 1/16 or so to generate a reference line;
   Eg, spline-fitting the samples;
 Also generate the side-intensity
   Eg, standard deviation from samples and the spline.
 Store this line in some form, such as via ADPCM;
 Approximate the intermediate table with patterns from a table.
   The table of patterns itself derived partly from the frequencies.
   Stores the relative intensity above/below the spline curve.
Where, one way of storing the line is, say:
 4x 3-bit, each control-point sample, as ADPCM
 3 or 4-bit, side/intensity sample (eg, standard deviation channel). Pattern table might be stored as 4 or 8 bits per block.
 Pattern is chosen by whichever best fits the intermediate samples.
but with, say, 8 bits per sample block, but with a 16x internal
downsample, could work out to 0.5 bits/sample (or, 16kHz audio in
8kbps). With 8-bit patterns, it is 0.75 bits/sample.
Example patterns:
 0: Flat line, follow spline
 1: Positive (sin 8*PI)
 2: Positive Hump (sin PI)
 3: Negative Hump
 4: Positive (sin 2*PI)
 5: Negative (sin 2*PI)
 6: Positive (sin 3*PI)
 7: Negative (sin 3*PI)
 8/9: 4*PI
 A/B: 5*PI
 C/D: 6*PI
 E/F: 7*PI
 ...
If using 6 or 8-bit patterns, it can include a second (or 3rd) sub- frequency.
 00..0F: Same as above
 10..1F: Same main pattern as 00..0F
   Sub-frequency mirrored in frequency and polarity (+8 mod 16).
   Roughly 5/8 amplitude of main frequency.
 2x: Same, but lower intensity sub-frequency (3/8).
 3x: Same, but lower intensity sub-frequency (1/8).
 4x..7x: Same, but use a different sub-frequency index (+/-5 mod 16).
   Encodes offset sign and intensity (5/8 or 3/8).
 8x..Fx: Add a 3rd frequency, lower intensity than the second (1/8).
   Similar strategy to above.
 ...
Decoding algorithm would work in blocks, eg:
 Unpack spline points;
 Interpolate splines for each sample;
 Multiply deviation channel with the values from the pattern table.
   This is then added onto the base spline.
However, this sort of approach is somewhat more complicated than just
using a low-bitrate ADPCM (and I haven't used it much).
Also, quality is inferior to 2-bit ADPCM.
But, not a lot in this area that doesn't sound like total garbage...
I had noted in past experiments that seemingly a lower-limit scheme for intelligible speech for me, was:
Split audio into blocks of 128 samples (at a 16kHz sample rate);
Match a sine-wave between 4 and 8 kHz (picking the loudest sine wave);
Encode the frequency and intensity of this sign wave.
This can achieve ~ 0.125 bits per sample, or 2 kbps.
 However, speech is very unnatural sounding;
 Pretty much any non-speech audio becomes unrecognizable noise.
Where, say, frequency is a byte in steps of 16 Hz; and intensity is an
A-Law value.
Though, this pushes the limits of intelligibility, and it is possible
that others might find such a scheme unintelligible.
Has also experimented with schemes of encoding the relative intensity of
a series of 16 bands (between 4 and 8 kHz), but quality was also pretty
low here (and it won neither for quality or ability to achieve a low bitrate). Quality is better with more bands, but this quickly reaches a practical limit.
It being seemingly more effective to pick 1 or 2 sine waves, and then encoding the specific frequency and intensity of each.
For slightly more natural sound, can pick N sine waves from within
specific frequency ranges, say, 4 waves:
 2-3kHz, 3-4kHz, 4-6 kHz, 6-8kHz
Resulting in something slightly more like a normal human voice.
 But, still sounds unnatural.
 And, it still falls on its face for any non-speech audio.
Also, can note:
While I am saying sine waves here, wave shape is non critical, it also
seems to work if using square waves or similar.
for these experiments, had mostly ended up discarding everything below
2kHz, as it seems to not contain anything particularly relevant.
Can note that the "block sampling rate" for this approach seems to needs
to be over 100 Hz for best effect (a block size of 128 samples giving a 125Hz block-sampling frequency).
But, can note that seemingly no mainline audio codecs work this way...
On 9/6/2025 11:54 AM, BGB wrote:
On 9/6/2025 11:21 AM, MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Also, can note:
While I am saying sine waves here, wave shape is non critical, it also
seems to work if using square waves or similar.
for these experiments, had mostly ended up discarding everything below
2kHz, as it seems to not contain anything particularly relevant.
Can note that the "block sampling rate" for this approach seems to
needs to be over 100 Hz for best effect (a block size of 128 samples
giving a 125Hz block-sampling frequency).
But, can note that seemingly no mainline audio codecs work this way...
Playing around with WAV almost destroyed my eardrums and my speakers.
FWIW, I have an example of a wav experiment right here:
https://youtu.be/DrPp6xfLe4Q?t=63
On 9/6/2025 3:18 PM, Chris M. Thomasson wrote:
On 9/6/2025 11:54 AM, BGB wrote:
On 9/6/2025 11:21 AM, MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
...
Also, can note:
While I am saying sine waves here, wave shape is non critical, it
also seems to work if using square waves or similar.
for these experiments, had mostly ended up discarding everything
below 2kHz, as it seems to not contain anything particularly relevant.
Can note that the "block sampling rate" for this approach seems to
needs to be over 100 Hz for best effect (a block size of 128 samples
giving a 125Hz block-sampling frequency).
But, can note that seemingly no mainline audio codecs work this way...
Playing around with WAV almost destroyed my eardrums and my speakers.
FWIW, I have an example of a wav experiment right here:
https://youtu.be/DrPp6xfLe4Q?t=63
This is fairly quiet (apart from a slight warbling sound) until the
piano comes in at around 1:20...
I don't have any of my experiments here on YouTube; would likely need to find some good public domain audio test examples (and/or record myself speaking, but would rather not), and set something up here.
Though, in this case, it would be more examples of "pushing the limits
for poor audio quality".
Did find a video of another guy doing something vaguely similar to what
I have done in some experiments:
https://www.youtube.com/watch?v=qosYRO6WjkQ
But, his examples sound very different from mine (with a characteristic sound more like bad MP3 compression), I suspect because he was using different frequency bands or similar (as noted, mine ignored pretty much everything below 2kHz).
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at
low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at
low sample rates.
For baseline, can note, basic sample rates:
   44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
My ears are not good enough to notice the difference between CD quality, AAC/high sample rate MP3/ogg vorbis/etc, but according to my savant (?) cousin who could listen to a 16 min piece of music once and then write
down the score for all the instruments, none of them sound like live,
but they are close enough that he can listen and internally translate to what it would have sounded like in a concert.
Terje
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at
low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
My ears are not good enough to notice the difference between CD quality, AAC/high sample rate MP3/ogg vorbis/etc, but according to my savant (?) cousin who could listen to a 16 min piece of music once and then write
down the score for all the instruments, none of them sound like live,
but they are close enough that he can listen and internally translate to what it would have sounded like in a concert.
Terje
On 9/7/2025 5:26 AM, Terje Mathisen wrote:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at >>> low sample rates.
For baseline, can note, basic sample rates:
   44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
My ears are not good enough to notice the difference between CD quality, AAC/high sample rate MP3/ogg vorbis/etc, but according to my savant (?) cousin who could listen to a 16 min piece of music once and then write down the score for all the instruments, none of them sound like live,
but they are close enough that he can listen and internally translate to what it would have sounded like in a concert.
To me, 44100 and 48000 sound basically the same, so not much gain in
going higher.
The difference between 32000 and 44100 is slight.
BGB <cr88192@gmail.com> posted:
On 9/7/2025 5:26 AM, Terje Mathisen wrote:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at >>>>> low sample rates.
For baseline, can note, basic sample rates:
   44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
My ears are not good enough to notice the difference between CD quality, >>> AAC/high sample rate MP3/ogg vorbis/etc, but according to my savant (?)
cousin who could listen to a 16 min piece of music once and then write
down the score for all the instruments, none of them sound like live,
but they are close enough that he can listen and internally translate to >>> what it would have sounded like in a concert.
To me, 44100 and 48000 sound basically the same, so not much gain in
going higher.
The difference between 32000 and 44100 is slight.
The difference is in the phase of the high end spectrum 15K-22K
Meanwhile, decided to check the delta between:
 Audio downsampled from 16K to 8K via averaging pairs of samples;
 Audio downsampled from 16K to 8K via spline curve fitting.
On 9/7/25 6:58 PM, BGB wrote:
Meanwhile, decided to check the delta between:
  Audio downsampled from 16K to 8K via averaging pairs of samples;
  Audio downsampled from 16K to 8K via spline curve fitting.
Seems, inadequate to satisfy the Nyquist criteria.
Terje Mathisen <terje.mathisen@tmsw.no> posted:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at >>>> low sample rates.
For baseline, can note, basic sample rates:
44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
My ears are not good enough to notice the difference between CD quality,
AAC/high sample rate MP3/ogg vorbis/etc, but according to my savant (?)
cousin who could listen to a 16 min piece of music once and then write
down the score for all the instruments, none of them sound like live,
but they are close enough that he can listen and internally translate to
what it would have sounded like in a concert.
Just after graduating CMU I worked in a high end stereo store. The listening room was 4 walls none of them parallel and a slanted ceiling; so it had essentially no reverberation. The Pittsburgh string quartet rented out
the room for various practices, and we recorded on 9 track tape at 60"/s
and played it back on Dalquist speakers and other high end amplification; diddling with the equalization until the recording sounded like the live string quartet (only seconds apart live<->recorded).
I have/had 2 brothers who could listen to a movie and then go write down
the score of one or two of the tunes. I, personally, can't carry a tune
in a basket--but I admire those who can. I can hear things that others don't seem to. Things like whether the phono section of a pre-amp has a tube or not--its all in the harmonics!!
Dunno.
Averaging pairs would be the traditional method for downsample, but,
when downsampling to 8kHz, audio sounds muffled, and intelligibility of speech is poor.
On 07/09/2025 23:12, MitchAlsup wrote:
Terje Mathisen <terje.mathisen@tmsw.no> posted:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
Just randomly thinking again about some things I noticed with audio at >>>>> low sample rates.
For baseline, can note, basic sample rates:
    44100: Standard, sounds good, but bulky
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high
end of the audio spectrum.
Might sound "good" to someone who does not know what it is supposed
to actually sound like, though.
My ears are not good enough to notice the difference between CD quality, >>> AAC/high sample rate MP3/ogg vorbis/etc, but according to my savant (?)
cousin who could listen to a 16 min piece of music once and then write
down the score for all the instruments, none of them sound like live,
but they are close enough that he can listen and internally translate to >>> what it would have sounded like in a concert.
Just after graduating CMU I worked in a high end stereo store. The
listening
room was 4 walls none of them parallel and a slanted ceiling; so it had
essentially no reverberation. The Pittsburgh string quartet rented out
the room for various practices, and we recorded on 9 track tape at 60"/s
and played it back on Dalquist speakers and other high end amplification;
diddling with the equalization until the recording sounded like the live
string quartet (only seconds apart live<->recorded).
I have/had 2 brothers who could listen to a movie and then go write down
the score of one or two of the tunes. I, personally, can't carry a tune
in a basket--but I admire those who can. I can hear things that others
don't
seem to. Things like whether the phono section of a pre-amp has a tube or
not--its all in the harmonics!!
Having a good memory for tunes, or being able to replicate tunes, and
being able to distinguish the quality of sound reproduction is not
actually highly correlated. The former is primarily a higher-level
brain function, while the later is partly physical, partly low-level
brain (or software vs. hardware, to suit the group better!).
For those that work directly with music, age brings experience and
improves abilities like recognising or duplicating tunes. Age also
brings deterioration in the physical aspects of hearing - especially at higher frequencies.
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
One key difference, however, is that it is easy to appreciate when
people can listen to a tune once and play it again afterwards - you can watch them do it. For people who say they can distinguish CD audio from AAC or other high bps compressed audio, and other "golden ears" distinctions, it's a different matter - in double-blind tests, most fail badly. There are a great many factors involved in high-quality audio reproduction - the basic sample rate is only one of them.
On 9/8/2025 3:59 AM, David Brown wrote:
On 07/09/2025 23:12, MitchAlsup wrote:
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
I listen to music a lot, but usually House and EDM and similar.
... For people who say they can distinguish CD audio from
AAC or other high bps compressed audio, and other "golden ears" >distinctions, it's a different matter - in double-blind tests, most fail >badly. There are a great many factors involved in high-quality audio >reproduction - the basic sample rate is only one of them.
On Mon, 8 Sep 2025 10:59:50 +0200, David Brown
<david.brown@hesbynett.no> wrote:
... For people who say they can distinguish CD audio from
AAC or other high bps compressed audio, and other "golden ears" >distinctions, it's a different matter - in double-blind tests, most fail >badly. There are a great many factors involved in high-quality audio >reproduction - the basic sample rate is only one of them.
That's true, but the basic sample rate does make a significant
difference. I don't know if it is true, but I have read that to
/accurately/ reproduce a given note requires 10 to 11 harmonics: the
primary note, 7 higher, and 2 to 3 lower.
This means most notes will include sounds that are outside the range
of (normal) human hearing, but you can still /feel/ these sounds [even
the high ones] and miss them when they are absent.
C8 (high C) on the piano is ~4186 Hz. Assuming the need for the 7th
higher harmonic - 29302 Hz - Nyquist would demand a minimum sampling
rate of 58604/s to accurately reproduce C8.
In practice, unless you like orchestral, or certain folk or country,--- Synchronet 3.21a-Linux NewsLink 1.2
you are not likely to hear much difference between a CD and a decent
quality compressed version of it. But the CD itself is not a faithful reproduction of the live performance.
And, of course, if you like orchestral you are more likely to be
listening to vinyl rather than CD. 8-)
BGB <cr88192@gmail.com> writes:
On 9/8/2025 3:59 AM, David Brown wrote:
On 07/09/2025 23:12, MitchAlsup wrote:
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
I listen to music a lot, but usually House and EDM and similar.
Not that I listen to that, but IIRC, most of that is fairly narrow
in frequence range and mostly generated electronically instead
of by actual instruments (drum machines, synthesizers, etc.)
So you're starting with "artificial" digital signals.
While classical, progressive rock, jazz and classic rock music all leverage real-world analog instruments, most of which have unique and complex
harmonic elements and remain in the analog domain until converted
to final digital form.
On 9/8/2025 4:10 PM, Scott Lurndal wrote:
BGB <cr88192@gmail.com> writes:
On 9/8/2025 3:59 AM, David Brown wrote:
On 07/09/2025 23:12, MitchAlsup wrote:
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
I listen to music a lot, but usually House and EDM and similar.
Not that I listen to that, but IIRC, most of that is fairly narrow
in frequence range and mostly generated electronically instead
of by actual instruments (drum machines, synthesizers, etc.)
So you're starting with "artificial" digital signals.
Fair enough.
I had noted when looking at some of this that typically the frequency spectrum drops off sharply to pretty much nothing. Exactly where this
point is depends a lot on the song, but somewhere in the area of 11 to
16 kHz seems typical.
But, yeah, it appears like things often drop off steeply after a few
points: 8kHz, 11kHz, and 16kHz. With a few songs looked at having a sort
of "stair step" look in their spectrum.
If I take a song and do an 8kHz high-pass filter, what is left mostly
sounds like varying levels of white noise.
One of my other test cases (mostly for speech; ripped from the audio-
track of an episode of an animated TV show), has a drop-off wall at 8kHz (nothing over 8kHz).
Checking for another animated show, it seems to have an 8kHz wall for
the speech, but a 4kHz wall for the background music.
The presence of a sharp 8kHz frequency wall in several cases does imply
that 16kHz recording is likely popular for voice acting.
On 9/7/25 10:55 PM, BGB wrote:
Dunno.
Averaging pairs would be the traditional method for downsample, but,
when downsampling to 8kHz, audio sounds muffled, and intelligibility
of speech is poor.
It has been a couple of decades since that discrete time signal
processing course so the details have faded. But I do know that aliasing
can be a big problem. Hence the good low pass filter as part of the decimation process.
Assuming your 16KSPS data started with a good presample filter there was
no signal or noise (or at least negligible) above 8KHz, it is going to
have stuff between 4KHz and 8KHz. Fail to filter that adequately and it
gets folded/aliased to a lower frequency.
So you need a discrete time filter to remove most of the information in
your data above 4KHz while leaving what you want alone.
The moving average filter is not a good choice. Sure it has a zero at
your new sample rate but it has poor performance in general.
OK.
I was just seeing here if I could make stuff "less muffled".
 A cleaner averaging filter, like:
   (S0+3*S1+3*S2+S3)/8
Tends to have more muffle.
Whereas:
   (5*S1+5*S2-S0-S3)/4
Slightly boosts high frequencies (so less muffle) but may have other drawbacks.
On 9/8/2025 3:59 AM, David Brown wrote:
On 07/09/2025 23:12, MitchAlsup wrote:
Terje Mathisen <terje.mathisen@tmsw.no> posted:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
For those that work directly with music, age brings experience and
improves abilities like recognising or duplicating tunes. Age also
brings deterioration in the physical aspects of hearing - especially
at higher frequencies.
This is a concern for me, partly if I lose high frequencies, seemingly I wont have anything, as my hearing of low frequencies (sub 1kHz) is
seemingly already impaired.
Seemingly, most of my world of audio perception is located between 1kHz
and 8kHz.
Most lower frequencies are more felt than heard.
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
I listen to music a lot, but usually House and EDM and similar.
When I was younger, a lot of Goth (mostly of the Synthpop/Synthwave variaty), and Industrial.
There was Dubstep for a while, but seemingly the whole genre kind of imploded (though, never got much mainstream popularity aside from "Skrillex").
One key difference, however, is that it is easy to appreciate when
people can listen to a tune once and play it again afterwards - you
can watch them do it. For people who say they can distinguish CD
audio from AAC or other high bps compressed audio, and other "golden
ears" distinctions, it's a different matter - in double-blind tests,
most fail badly. There are a great many factors involved in
high-quality audio reproduction - the basic sample rate is only one of
them.
I am not really a "golden ear" AFAICT.
At high bitrates, or high sample rates, I can't hear much difference.
But, mostly just noting that it is at low bitrates where things like MP3
and similar start to sound like crap.
On Mon, 8 Sep 2025 10:59:50 +0200, David Brown
<david.brown@hesbynett.no> wrote:
... For people who say they can distinguish CD audio from
AAC or other high bps compressed audio, and other "golden ears"
distinctions, it's a different matter - in double-blind tests, most fail
badly. There are a great many factors involved in high-quality audio
reproduction - the basic sample rate is only one of them.
That's true, but the basic sample rate does make a significant
difference. I don't know if it is true, but I have read that to
/accurately/ reproduce a given note requires 10 to 11 harmonics: the
primary note, 7 higher, and 2 to 3 lower.
This means most notes will include sounds that are outside the range
of (normal) human hearing, but you can still /feel/ these sounds [even
the high ones] and miss them when they are absent.
C8 (high C) on the piano is ~4186 Hz. Assuming the need for the 7th
higher harmonic - 29302 Hz - Nyquist would demand a minimum sampling
rate of 58604/s to accurately reproduce C8.
In practice, unless you like orchestral, or certain folk or country,
you are not likely to hear much difference between a CD and a decent
quality compressed version of it. But the CD itself is not a faithful reproduction of the live performance.
And, of course, if you like orchestral you are more likely to be
listening to vinyl rather than CD. 8-)
On 9/8/25 11:28 PM, BGB wrote:
OK.
I was just seeing here if I could make stuff "less muffled".
A cleaner averaging filter, like:
(S0+3*S1+3*S2+S3)/8
Tends to have more muffle.
Whereas:
(5*S1+5*S2-S0-S3)/4
Slightly boosts high frequencies (so less muffle) but may have
other drawbacks.
Use a real FIR filter:
http://t-filter.engineerjs.com/
As an example, I tried:
16KSPS
Pass band: 0 to 3500Hz, 1dB dips
Stop band: 4KHz to 8KHz: 40dB attenuation
This resulted in a filter with 51 taps. The more brick wall like the
filter is, the more taps required.
On 08/09/2025 22:10, BGB wrote:
On 9/8/2025 3:59 AM, David Brown wrote:
On 07/09/2025 23:12, MitchAlsup wrote:
Terje Mathisen <terje.mathisen@tmsw.no> posted:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
For those that work directly with music, age brings experience and
improves abilities like recognising or duplicating tunes. Age also
brings deterioration in the physical aspects of hearing - especially
at higher frequencies.
This is a concern for me, partly if I lose high frequencies, seemingly
I wont have anything, as my hearing of low frequencies (sub 1kHz) is
seemingly already impaired.
Seemingly, most of my world of audio perception is located between
1kHz and 8kHz.
Most lower frequencies are more felt than heard.
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
I listen to music a lot, but usually House and EDM and similar.
Isn't that an oxymoron? "Music" and "House / EDM" don't belong together
in the same sentence :-)
When I was younger, a lot of Goth (mostly of the Synthpop/Synthwave
variaty), and Industrial.
There was Dubstep for a while, but seemingly the whole genre kind of
imploded (though, never got much mainstream popularity aside from
"Skrillex").
It sounds like you have likely damaged your hearing - but those kinds of "music" (not that I am opinionated...) are often played at very high volumes.
One key difference, however, is that it is easy to appreciate when
people can listen to a tune once and play it again afterwards - you
can watch them do it. For people who say they can distinguish CD
audio from AAC or other high bps compressed audio, and other "golden
ears" distinctions, it's a different matter - in double-blind tests,
most fail badly. There are a great many factors involved in high-
quality audio reproduction - the basic sample rate is only one of them.
I am not really a "golden ear" AFAICT.
At high bitrates, or high sample rates, I can't hear much difference.
But, mostly just noting that it is at low bitrates where things like
MP3 and similar start to sound like crap.
There are, as I said, /many/ factors involved - you are mixing up
bitrates and sample rates.
If everything else is "perfect", 44.1 kHz sample rate can reproduce frequencies (including phase information) up to 22.05 kHz - more than
enough for anyone but some young children.
But everything else is usually very far from perfect. A particular
issue is the dynamic range - 16-bit linear coding does not have enough
range for a lot of music. Either quite sounds are "pixelated", losing a lot of important information, or the dynamic range is compressed before
the CD quality image is generated - giving the music a "flat" sound.
When compressed audio formats are used, they may start off at higher bit depths and sample rates, but in effect the bit depth also gets
compressed and you lose resolution as well as sample rate and high
frequency information for high compression ratios. And just as high
jpeg compression produces artefacts for some images, such as ghosting,
so does high audio compression.
On 9/9/2025 8:06 AM, David Brown wrote:
On 08/09/2025 22:10, BGB wrote:
On 9/8/2025 3:59 AM, David Brown wrote:
On 07/09/2025 23:12, MitchAlsup wrote:
Terje Mathisen <terje.mathisen@tmsw.no> posted:
MitchAlsup wrote:
BGB <cr88192@gmail.com> posted:
For those that work directly with music, age brings experience and
improves abilities like recognising or duplicating tunes. Age also
brings deterioration in the physical aspects of hearing - especially
at higher frequencies.
This is a concern for me, partly if I lose high frequencies, seemingly
I wont have anything, as my hearing of low frequencies (sub 1kHz) is
seemingly already impaired.
Seemingly, most of my world of audio perception is located between
1kHz and 8kHz.
Most lower frequencies are more felt than heard.
There is /some/ overlap, because both groups spend a lot of time
listening to music, which exercises and improves both functions.
I listen to music a lot, but usually House and EDM and similar.
Isn't that an oxymoron? "Music" and "House / EDM" don't belong together >> in the same sentence :-)
Probably still more "music" than "Gansta Rap" though...
On 9/9/2025 12:13 PM, BGB wrote:
[...]
This is fairly nice:
https://youtu.be/RijB8wnJCN0?list=RDRijB8wnJCN0
:^)
My mom got a steel drum tuned to 432 Hz, it is barely audible to me. Can
put my hand near it and feel vibrations, but don't hear much.
Would be easier if these things generated square waves, I can hear
square waves.
On Tue, 9 Sep 2025 07:06:17 -0500
David Schultz <david.schultz@earthlink.net> wrote:
On 9/8/25 11:28 PM, BGB wrote:
OK.
I was just seeing here if I could make stuff "less muffled".
 A cleaner averaging filter, like:
   (S0+3*S1+3*S2+S3)/8
Tends to have more muffle.
Whereas:
   (5*S1+5*S2-S0-S3)/4
Slightly boosts high frequencies (so less muffle) but may have
other drawbacks.
Use a real FIR filter:
http://t-filter.engineerjs.com/
As an example, I tried:
16KSPS
Pass band: 0 to 3500Hz, 1dB dips
Stop band: 4KHz to 8KHz: 40dB attenuation
This resulted in a filter with 51 taps. The more brick wall like the
filter is, the more taps required.
IIRC, AT&T had Fpass = 3.2 KHz. That's significantly easier than 3.5.
Of course, AT&T used analog IIR filter rather than digital FIR. I have
no idea what sort of IIR it was, but would guess that 5th order Bessel
filter with -3dB point at 3.6 KHz could serve as a fair digital
imitation of their circuit.
But, there is some "weird hacks" that can be done in audio processing
when downsampling that seems to notably increase intelligibility at an
8kHz sample rate ...
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high end of
the audio spectrum.
On Sat, 6 Sep 2025 14:19:40 -0500, BGB wrote:
But, there is some "weird hacks" that can be done in audio processing
when downsampling that seems to notably increase intelligibility at an
8kHz sample rate ...
There are digital encoding formats used with mobile phones that are
optimized for speech. Ever heard a call where the other end sounded every
now and then like they were underwater? That’s the kind of compression artifact you get.
On Sat, 06 Sep 2025 16:21:12 GMT, MitchAlsup wrote:
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high end of
the audio spectrum.
I wonder how that works, given that the audio engineer that mastered the >recording was using speakers that cost a fraction of the price.
Lawrence =?iso-8859-13?q?D=FFOliveiro?= <ldo@nz.invalid> writes:
On Sat, 06 Sep 2025 16:21:12 GMT, MitchAlsup wrote:
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high end of >> the audio spectrum.
I wonder how that works, given that the audio engineer that mastered the >recording was using speakers that cost a fraction of the price.
Have you priced quality studio monitors? Obviously not.
A nice pair of intro electrostatics run about a USD1200 (magnapan lrs+).
A single studio monitor can easily cost more than USD12000.
Lawrence =?iso-8859-13?q?D=FFOliveiro?= <ldo@nz.invalid> writes:
On Sat, 06 Sep 2025 16:21:12 GMT, MitchAlsup wrote:
No it does not sound "good" on a system that accurately reproduces
22KHz; like systems with electrostatic speakers covering the high end of >>> the audio spectrum.
I wonder how that works, given that the audio engineer that mastered the
recording was using speakers that cost a fraction of the price.
Have you priced quality studio monitors? Obviously not.
A nice pair of intro electrostatics run about a USD1200 (magnapan lrs+).
A single studio monitor can easily cost more than USD12000.
On 08/09/2025 23:57, George Neuner wrote:
:
This means most notes will include sounds that are outside the range
of (normal) human hearing, but you can still /feel/ these sounds [even
the high ones] and miss them when they are absent.
Nope. Most notes are much lower, and harmonics of relevance are within
the range of human hearing. For high enough notes, you simply don't
hear as much harmonic information.
C8 (high C) on the piano is ~4186 Hz. Assuming the need for the 7th
higher harmonic - 29302 Hz - Nyquist would demand a minimum sampling
rate of 58604/s to accurately reproduce C8.
You can't accurately hear C8 even when live - you don't get the same >harmonic information as you do with C6, because your ears can't
distinguish the higher harmonics. Your ears have the same limitations
as any other senses in this manner - you can look at your cat's feet and >count its toes, but if you look at a fly's feet you can't count the toes.
In practice, unless you like orchestral, or certain folk or country,
you are not likely to hear much difference between a CD and a decent
quality compressed version of it. But the CD itself is not a faithful
reproduction of the live performance.
Good quality compressed formats are often better than CD quality. The >killer for CD quality is not the sample rate, it is the limited dynamic >range from the linear 16-bit range. Compressed formats will, in effect,
use a more logarithmic scale (like A-law and mu-law, used to get >comprehensible speech despite a much smaller sample size) that is more
in line with the way the human brain interprets sound.
And, of course, if you like orchestral you are more likely to be
listening to vinyl rather than CD. 8-)
In theory (but very rarely in practice), when combined with good enough >amplifiers and speakers, vinyl has a a higher dynamic range than CD
audio. But that is only the case when the record is new. Play it a few >times, and the wear from the needle will smooth out the tracks enough to >eliminate the difference.
But enjoying music is a psychologically, physically, mentally and >biologically complex hobby. The comfort of the chair you are sitting--- Synchronet 3.21a-Linux NewsLink 1.2
in, or the type of reflections and absorptions from the rest of the
room, can make a big difference. Knowing that you have spent a great
deal of money on your impressive-looking hifi system will improve your >listening experience regardless of what any audio measurement might say.
Some audiophiles prefer the "valve sound" over "transistor sound" -
not because the sound reproduction is more accurate (it is not - valves
add second harmonic distortion that is non-existent in transistor >amplifiers), but simply because they like it better.
You are forgetting the lower harmonics. If it is true about 3 lower,
then ~1/3 of notes on the piano will include an overtone that is below
the (average) hearing threshold.
On 9/12/25 12:01 PM, George Neuner wrote:
You are forgetting the lower harmonics. If it is true about 3 lower,
then ~1/3 of notes on the piano will include an overtone that is below
the (average) hearing threshold.
One of the coolest things I ever heard, felt really, were the beat tones between a couple of peddle notes on the pipe organ at the Meyerson in Dallas.
On 9/12/25 12:01 PM, George Neuner wrote:
You are forgetting the lower harmonics. If it is true about 3 lower,
then ~1/3 of notes on the piano will include an overtone that is below
the (average) hearing threshold.
One of the coolest things I ever heard, felt really, were the beat tones between a couple of peddle notes on the pipe organ at the Meyerson in Dallas.
David Schultz <david.schultz@earthlink.net> posted:
On 9/12/25 12:01 PM, George Neuner wrote:
You are forgetting the lower harmonics. If it is true about 3 lower,
then ~1/3 of notes on the piano will include an overtone that is below
the (average) hearing threshold.
One of the coolest things I ever heard, felt really, were the beat tones
between a couple of peddle notes on the pipe organ at the Meyerson in
Dallas.
Have you listened to a helicopter-style sub-woofer ??
Generally housed between stories in a building--a helicopter arranged set
of blades, that can go all the way down to 0 Hz--and up to about 30 Hz.
The low frequency components adjust the pitch of the blades through the cyclic.
David Schultz <david.schultz@earthlink.net> posted:
One of the coolest things I ever heard, felt really, were the beat tones
between a couple of peddle notes on the pipe organ at the Meyerson in
Dallas.
Have you listened to a helicopter-style sub-woofer ??
On 12/09/2025 19:23, David Schultz wrote:
On 9/12/25 12:01 PM, George Neuner wrote:
You are forgetting the lower harmonics. If it is true about 3 lower,
then ~1/3 of notes on the piano will include an overtone that is below
the (average) hearing threshold.
Harmonics are always integer multiples of the base frequency, not
fractions - that's the definition of a harmonic.
On Fri, 12 Sep 2025 20:32:48 +0200, David Brown
<david.brown@hesbynett.no> wrote:
On 12/09/2025 19:23, David Schultz wrote:
On 9/12/25 12:01 PM, George Neuner wrote:
You are forgetting the lower harmonics. If it is true about 3 lower,
then ~1/3 of notes on the piano will include an overtone that is below >>>> the (average) hearing threshold.
Harmonics are always integer multiples of the base frequency, not
fractions - that's the definition of a harmonic.
I learned them as "overtones" ... but it seems that musicians call
them all "harmonics" regardless of whether they are higher or lower.
MMV.
Sysop: | DaiTengu |
---|---|
Location: | Appleton, WI |
Users: | 1,070 |
Nodes: | 10 (0 / 10) |
Uptime: | 127:41:28 |
Calls: | 13,731 |
Calls today: | 1 |
Files: | 186,965 |
D/L today: |
1,258 files (486M bytes) |
Messages: | 2,417,820 |