Dr. Dobb's is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.


Channels ▼
RSS

Web Development

Audio Watermarking


Test Statistic I

To determine the quality of the watermarking process, you can test the outcome. One metric is the difference between the population means of A and B rather than their values per se. The standardized test statistic is given by the equation in Figure 2(a).


Figure 2: Test statistic I.

Both sample means can be assumed normally distributed under the Central Limit Theorem, because of the large amount (M>>30) of Fourier coefficients. Therefore, '–' is normally distributed and you can make the estimations '','' and . The independence of the random variables and , from the same set AUB of Fourier coefficients, leads to the approximations 0 and 0.

Therefore, you can formulate the mutually exclusive propositions:

  • H0: (z)=N(0,1)
  • H1: m(z)=N(m,1), m(k/ 1 +k2)x1/

with N(,s2) the normal distribution, mean , and the variation coefficent defined in Figure 2(d). m can be derived with the approximation in Figure 2(b). With these equations, the threshold can be calculated as T=z1-PI. According to the symmetry of the normal distributions (z) and m(z), k can be calculated from m–T=z1PII and the approximation for m, see above. This is a reasonable assumption because and are both estimators calculated from the mixed sets A and B; see Figure 2(c).

Therefore, you can define the probabilities of correct detection (1-PII), rejection (1-PI), measure the variation coefficient in Figure 2(d) of the mean of set AUB, and calculate the necessary embedding factor k from the equation in Figure 2(c). But a factor k, ensuring the probabilities of correct detection or rejection according to Figure 2(c), may result in audible distortions. To satisfy both requirements, you can define the probabilities of correct detection (1–PII), rejection (1–PII), use the effective embedding factor k according to the psychoacoustic model, and calculate the number of Fourier coefficients from Figure 2(c).

Test Statistic II

According to the equation in Figure 2(a), you can extend the test statistic by normalizing the random variable z from equation >Figure 2(a) to the mean ('+')/2 of the whole set AUB; see Figure 3. Therefore, you expect z to be a random variable with mean 0 in the unmarked and an approximated mean of m2k in the marked case; again, see Figure 3.


Figure 3: Test statistic II.

One disadvantage of this kind of metric is the complicated PDF is not suitable for the calculation of the threshold and embedding parameter. I measured m for different k and verified the linear relation of equation in Figure 3.

Properties of the Algorithm

The general approach in quality evaluation is to compare the original signal with the watermarked signal for different k during subjective listener tests. To ensure the quality, the power of the watermarked noise is shifted just below the masking threshold. This results in an average embedding factor that is equivalent to k=0.15 derived without using the psychoacoustic model. Shifting the watermark noise to higher power levels decreases the quality and increases the average embedding factor. The embedding factor k=0.10 was used for robustness tests.


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.