[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Blind Source Separation by Sparse Decomposition

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: Blind Source Separation by Sparse Decomposition
From: "Richard J. Fabbri" <fabbri@xxxxxxxxxxx>
Date: Wed, 8 Sep 1999 18:33:45 -0400
Comments: To: Al Bregman <bregman@hebb.psych.mcgill.ca>
In-reply-to: <Pine.SUN.3.95.990908122648.13940D-100000@hebb.psych.mcgill .ca>
References: <3.0.3.32.19990908013047.00786490@mail.netaxis.com>
Reply-to: "Richard J. Fabbri" <fabbri@xxxxxxxxxxx>
Sender: AUDITORY Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Al,

If you have ever heard (voltage comparator per my email) clipped
speech (nth approximation to Licklider's "infinitely-clipped" speech)
then, you know the remarkable job our hearing does perceiving the
spoken word via zero-crossings.  Yes, assumptions such as band
limiting and zero-mean reduce the universe of possible signal
reconstructions once given a measured set of zero-crossings.
But, these ideas can also be appreciated as schema satisfying
Time-Domain boundary conditions where the "boundaries" are
the zero-crossings.

My reason for proposing Binaural audition w/ clipping is probably
becoming quite clear but, I will not write that reason hear.  Anyone
who wants to witness a startling demo of "stream segregation",
otherwise known to Cherry as "The Cocktail Party Effect", should
simply playback a Binaural recording thru voltage comparators
and listen to those comparators through stereo headphones.

While listening to these reconstructed (Binaural) spatial sources,
recognize that we're not discussing theory ... we are:
        - Demonstrating psychoacoustics
        - Demonstrating source "segregation" via zero-crossings.

Primarily period & interaural delay are encoded by zero-crossings.
Most of the additional (directional) encoding, caused by pinnae and
head diffraction, have been removed from the Binaural signals found
at the L/R comparator outputs, i.e., some degradation of Front/Back
and Up/Down discrimination will be experienced while listening to
clipped Binaural recordings.

But, can anyone explain these reconstructed spatial sources via a
frequency analysis of "infinitely-clipped" (Binaural) channels?
If not, then, how SHOULD we approach the CASA problem?
Are there other tools and models we have ignored?

Finally, there are better approaches to zero-crossing reconstruction,
i.e., perceived with high fidelity but, these require a time-domain
theory that will make no general progress thru our community
while frequency analysis remains unquestioned.

Try the above experiment.
How do zero-crossings help us separate spatial sources?
Can you explain this zero-crossing effect using frequency analysis?


Rich

Prev by Date: Re: Blind Source Separation by Sparse Decomposition
Next by Date: How many streams?
Previous by thread: Re: Blind Source Separation by Sparse Decomposition
Next by thread: Re: Blind Source Separation by Sparse Decomposition
Index(es):
- Date
- Thread