.rs
.\" Troff code generated by TPS Convert from ITU Original Files
.\"                 Not Copyright ( c) 1991 
.\"
.\" Assumes tbl, eqn, MS macros, and lots of luck.
.TA 1c 2c 3c 4c 5c 6c 7c 8c
.ds CH
.ds CF
.EQ
delim @@
.EN
.nr LL 40.5P
.nr ll 40.5P
.nr HM 3P
.nr FM 6P
.nr PO 4P
.nr PD 9p
.po 4P

.rs
\v | 5i'
.sp 1P
.ce 1000
SECTION\ 7
.ce 0
.sp 1P
.ce 1000
\fBSUBJECTIVE\ OPINION\ TESTS\fR 
.ce 0
.sp 1P
.sp 2P
.LP
\fBRecommendation\ P.80\fR 
.FS
This Recommendation was numbered P.74 in the
\fIRed\ Book\fR .
.FE
.RT
.sp 2P
.ce 1000
\fBMETHODS\ FOR\ SUBJECTIVE\ DETERMINATION
.EF '%	Volume\ V\ \(em\ Rec.\ P.80''
.OF '''Volume\ V\ \(em\ Rec.\ P.80	%'
.ce 0
.sp 1P
.ce 1000
\fBOF\ 
TRANSMISSION\ QUALITY\fR 
.ce 0
.sp 1P
.LP
\fB1\fR 	\fBIntroduction\fR 
.sp 1P
.RT
.PP
This Recommendation contains advice to Administrations on
conducting subjective tests in their own laboratories. The tests carried out
in the CCITT Laboratory by using reference systems are described in
Section\ 3 of this Volume.
.PP
In the course of developing items of telephone equipment, it is
necessary to conduct various kinds of specialized tests to diagnose faults 
and shortcomings; such tests dedicated to the study of specific aspect 
of 
transmission quality are not discussed here. The present purpose is to 
indicate methods that have been found suitable for determining how satisfactory 
given 
telephone connections may be expected to be if offered as such for use 
by the public. 
.PP
The methods indicated here are intended to be generally applicable
whatever the form of any degrading factors present. Examples of degrading
factors include transmission loss (often frequency dependent), circuit 
and room noise, sidetone, talker echo, nonlinear distortion of various 
kinds, 
propagation time, deleterious affects of voice\(hyoperated devices and 
changes in characteristics of telephone sets, including loudspeaking sets. 
Combinations of two or more of such factors have to be catered for. 
.RT
.sp 2P
.LP
\fB2\fR 	\fBRecommended methods\fR 
.sp 1P
.RT
.PP
To be applicable for such a wide range of types of degrading factor given 
in \(sc\ 1, the assessment method must reproduce as far as possible all 
the relevant features present when customers converse over telephone connections. 
Suitable methods are referred to as \*Q 
Conversation Tests
\*U and detailed prescriptions on the conduct of such tests as carried 
out by British Telecom 
are given in Supplement\ No.\ 2 at the end of this volume.
.PP
If the rather large amount of effort needed is available and the
importance of the study warrants, transmission quality can be determined by
service observations and recommended ways of performing these, including the
questions to be asked when interviewing customers, are given in
Recommendation\ P.82.
.PP
A disadvantage of the 
service observation method
for many
purposes is that little control is possible over the detailed characteristics 
of the telephone connections being tested. A method that largely overcomes 
this disadvantage but retains many of the advantages of service observations 
is that used by the AT&T\ Co. and termed SIBYL (refer to Supplement\ No.\ 
5, Volume\ V, 
\fIRed\ Book\fR ). According to this method, members of the staff of Bell
Laboratories volunteer to allow a small proportion of their ordinary internal 
calls to be passed through special arrangements which modify the normal 
quality of transmission according to a test programme. If a particular 
call has been so treated the volunteer is asked to vote by dialling one 
of a set of digits to 
indicate his opinion. In this way all results are recorded by the controlling 
computer and complete privacy is retained. 
.bp
.RT
.sp 2P
.LP
\fB3\fR 	\fBSupplementary methods\fR 
.sp 1P
.RT
.PP
Under certain conditions, it is permissible to dispense with the
full conversation method and to use 
one\(hyway listening\(hyonly tests
.
Suitable conditions apply for using a listening test when the degrading
factor(s) under study affect the subjects only in their listening role.
Attenuation/frequency
.PP
distortion and nonlinear distortion caused by quantizing have been studied
successfully by 
listening tests
but it would be unwise to study the
effects of sidetone, for example, by this method. Listening\(hyonly tests 
may also be misleading when assessing the effects of a factor, like circuit 
noise, when the magnitude of the degradation caused is substantial. In 
any case, sufficient comparison with the results from full conversation 
tests should be made before the results from listening\(hyonly tests are 
accepted as reliable. 
\v'6p'
.RT
.sp 2P
.LP
\fBRecommendation\ P.81\fR 
.FS
This Recommendation was numbered\ P.70 in the
\fIRed\ Book\fR .FE
.RT
.sp 2P
.sp 1P
.ce 1000
\fBMODULATED\ NOISE\ REFERENCE\ UNIT\ (MNRU)\fR 
.EF '%	Volume\ V\ \(em\ Rec.\ P.81''
.OF '''Volume\ V\ \(em\ Rec.\ P.81	%'
.ce 0
.sp 1P
.ce 1000
\fI(Malaga\(hyTorremolinos, 1984; amended Melbourne, 1988)\fR 
.sp 9p
.RT
.ce 0
.sp 1P
.sp 2P
.LP
	The\ CCITT,
.sp 1P
.RT
.sp 1P
.LP
\fIconsidering\fR 
.sp 9p
.RT
.PP
(a)
that the use of digital processes (64\ kbit/s PCM\ A\(hylaw or \(*m\(hylaw, 
A/D/A encoder pairs, A/\(*m\(hylaw or \(*m/A\(hylaw converters, digital 
pads based on 8\(hybit\ PCM words, 32\ kbit/s ADPCM,\ etc.) in the international 
telephone 
network has grown rapidly over the past several years, and this growth is
expected to continue;
.PP
(b)
that new digital processes are being standardized,
e.g.\ 64\ kbit/s 7\ kHz wideband ADPCM;
.PP
(c)
that there is a need for standard tools to measure the
quantization distortion performance of digital processes [for example,
32\ kbit/s ADPCM (Recommendation\ G.721) and 64\ kbit/s 7\ kHz wideband codec
(Recommendation\ G.722)], so that the tools can be used for estimating the
subjective transmission performance of international connections containing
digital processes;
.PP
(d)
that an objective speech quality assessment method has not yet been established; 
.PP
(e)
that, at the present time, subjective tests incorporating reference system 
conditions represent the only suitable method for measuring 
the speech transmission performance of digital processes;
.PP
(f)
that expressing results in terms of a common reference
system may facilitate comparison of subjective test results obtained at
different laboratories,
.sp 1P
.LP
\fIrecommends\fR 
.sp 9p
.RT
.PP
(1)
the use of a narrow\(hyband Modulated Noise Reference Unit
(MNRU) as the reference system in terms of which subjective performance of
telephone bandwidth digital processes should be expressed;
.PP
This specification is subject to future enhancement and therefore
should be regarded as provisional.
.FE
(2)
the use of a wideband MNRU as the reference system in terms of which subjective 
performance of wideband digital processes should be 
expressed
.
.PP
\fINote\ 1\fR \ \(em\ The MNRU can be realized using laboratory equipment 
or by computer simulation. Further information on the MNRU is given in 
the 
references listed at the end of this Recommendation.
.bp
.PP
\fINote\ 2\fR \ \(em\ The listening\(hyonly method presently proposed when 
using the MNRU in subjective tests is described in Supplement\ No.\ 14 
at the end of this volume. See Recommendation\ P.80, \(sc\ 3, for precautions 
concerning the use of 
listening\(hyonly tests.
.PP
\fINote\ 3\fR \ \(em\ Objective measurement methods which suitably reflect
subjective quantization distortion performance of various types of digital
processes do not exist at present. (For example, the objective technique of
Recommendation\ G.712, based on sine\(hywave and band\(hylimited noise 
measurements, are designed for PCM and do not measure appropriately the 
distortion induced by other systems such as ADPCM.) The artificial voice 
described in 
Recommendation\ P.50 may be relevant. Even if an objective method is developed, 
subjective tests will be required to establish correlation of subjective 
results/objective results for particular digital process types.
.PP
\fINote\ 4\fR \ \(em\ The wideband MNRU without noise shaping as described in
this Recommendation is recommended
. Administrations are asked to comment on the need for a filter in the 
noise path after the multiplier (see 
Supplement\ No.\ 15), to shape the correlated noise spectrum. Some
Administrations suggest the use of such a filter while others do not.
.RT
.sp 2P
.LP
\fB1\fR 	\fBIntroduction\fR 
.sp 1P
.RT
.PP
The MNRU was originally devised to produce distortion subjectively similar 
to that produced by logarithmically companded PCM systems\ [1]. This 
approach was based on the views:
.RT
.LP
	1)
	that network planning would require extensive subjective
tests to enable evaluation of PCM system performance over a range of compandor 
characteristics, at various signal levels and in combination with various 
other transmission impairments (e.g.\ loss, idle circuit noise, etc.) at 
various 
levels, and
.LP
	2)
	that it would be as reliable and easier to define a
reference distortion system, itself providing distortion perceptually similar 
to that of PCM systems, in terms of which the performance of PCM systems 
could be expressed. This requires extensive subjective evaluation of the 
reference 
system when inserted in one or more simulated telephone connections, but 
leads to the possibility of simplified subjective evaluation of new digital 
processing techniques.
.PP
Various organizations (Administrations, scientific/industrial
organizations), as well as the CCITT itself, have made extensive use of the
MNRU concept for evaluating the subjective performance of digital processes 
(in arriving at Recommendations\ G.721 and G.722, for example). A modified 
version for use in evaluating codecs of wider bandwidth (70\(hy7000\ Hz) 
is now common 
practice. However, the actual devices used, while based on common principles, 
may have differed in detail, and hence the subjective results obtained 
may 
also have differed. (Differences in subjective testing methodology are also
relevant.) The purpose of this Recommendation is to define the narrow\(hyband 
and wideband versions of the MNRU as completely and in as much detail as 
possible in order to minimize the effects of the device, and of its objective 
calibration procedures, on subjective\(hytest results.
.sp 2P
.LP
\fB2\fR 	\fBGeneral description\fR 
.sp 1P
.RT
.PP
Simplified arrangements of the MNRU are shown in Figure\ 1a/P.81 for the 
narrow\(hyband version and Figure\ 1b/P.81 for the wideband version. Speech 
signals entering from the left are split between\ 2 paths, a signal path 
and a noise path. The signal path provides an undistorted (except for bandpass 
filtering) speech signal at the output. In the noise path, the speech signal
instantaneously controls a multiplier with an applied gaussian noise \*Qcarrier\*U 
which has a uniform spectrum between\ 0\ Hz and a frequency at least twice 
the 
cutoff frequency of the lowpass portion of the bandpass filter. The output 
of the multiplier consisting of the noise modulated by the speech signal, 
is then added to the speech signal to produce the distorted signal. 
.PP
The attenuators and switches in the signal and noise paths allow
independent adjustment of the speech and noise signal levels at the output.
Typically, the system is so calibrated that the setting of the attenuator
(in\ dB) in the noise path represents the ratio of instantaneous speech 
power to noise power, when both are measured at the output of the band\(hypass 
filter 
(terminal\ OT).
.bp
.PP
For this Recommendation, the decibel representation of the ratio is
called \fIQ\fR\d\fIN\fR\ufor the narrow\(hyband version and \fIQ\fR\d\fIW\fR\ufor 
the wideband 
version.
.RT
.LP
.rs
.sp 18P
.ad r
\fBFigure 1a/P.81, p.\fR 
.sp 1P
.RT
.ad b
.RT
.LP
.rs
.sp 18P
.ad r
\fBFigure 1b/P.81, p.\fR 
.sp 1P
.RT
.ad b
.RT
.sp 2P
.LP
\fB3\fR 	\fBPerformance specifications\fR 
.sp 1P
.RT
.sp 1P
.LP
3.1
	\fIGeneral\fR 
.sp 9p
.RT
.PP
The specifications in this section apply both to hardware
implementations and software simulations.
.PP
For practical implementations, the actual signal levels and noise
levels may be increased or decreased to meet special needs. In such cases, 
the level requirements detailed below will have to be modified accordingly. 
.bp
.RT
.sp 1P
.LP
3.2
	\fISignal path\fR 
.sp 9p
.RT
.PP
The requirements under this heading refer to the MNRU with infinite attenuation 
in the noise path of Figures\ 1a/P.81 and 1b/P.81; separate 
resistive terminations at the terminals\ T5 and T6 (unlinked) will achieve
this.
.PP
The frequency response of the signal path (i.e. between terminals\ IT and\ 
OT of Figures\ 1a/P.81 and 1b/P.81) should be within the limits of 
Figure\ 2a/P.81 for the circuit of Figure\ 1a/P.81 and within the limits of
Figure\ 2b/P.81 for the circuit of Figure\ 1b/P.81.
.PP
The loss between terminals IT and OT for a 0\ dBm, 1\ kHz input sine
wave should be 0\ dB. Over the input level range +10\ dBm to \(em50\ dBm, 
the loss 
should be 0\ dB\ \(+-\ 0.1\ dB.
.PP
Any harmonic component should be at least 50\ dB below the fundamental 
at the system output (terminal\ OT in Figures 1a/P.81 and 1b/P.81) for 
any 
fundamental frequency between 125\ Hz and 3000\ Hz in a narrow\(hyband 
system and 
100\ Hz and 6000\ Hz in a wideband system.
.PP
The idle noise generated in the signal path must be less than \(em60\ dBm, 
measured at terminal\ OT, in order to conform with\ \(sc\ 3.4. 
.PP
It is recommended that the level of speech signals applied to the
terminals\ IT should be less than \(em10\ dBm (mean power while active, 
i.e.\ mean 
active level according to Recommendation\ P.56) in order to avoid amplifier
peak\(hyclippings of the signal, and be greater than \(em30\ dBm to ensure 
sufficient speech signal\(hyto\(hynoise ratio. 
.RT
.sp 1P
.LP
3.3
	\fINoise path\fR 
.sp 9p
.RT
.PP
The requirements under this heading refer to the MNRU with infinite attenuation 
inserted into the signal path of Figures\ 1a/P.81 and 1b/P.81; 
separate resistive terminations at the terminals\ T1 and T2 (unlinked) will
achieve this.
.RT
.sp 1P
.LP
3.3.1
	\fILinearity as function of input level\fR 
.sp 9p
.RT
.PP
With a \fIQ\fR\d\fIN\fR\usetting of 0\ dB in the circuit of Figure\ 1a/P.81, 
or a \fIQ\fR\d\fIW\fR\usetting of 0\ dB in the circuit of Figure\ 1b/P.81, 
as the case may be, the noise level at the system output (terminal\ OT) 
should be numerically equal to the sine wave level at the input terminal 
(terminal\ IT). A correspondence 
within \(+-\ 0.5\ dB should be obtained for input levels from\ +5\ dBm 
to\ \(em45\ dBm, and for input frequencies from 125\ Hz to 3000\ Hz in 
a narrow\(hyband system and 100\ Hz to 6000\ Hz in a wideband system. 
.RT
.sp 1P
.LP
3.3.2
	\fINoise spectrum\fR 
.sp 9p
.RT
.PP
For a narrow\(hyband system, when \fIQ\fR\d\fIN\fR\uis set to 0\ dB, input 
sine waves applied to terminal\ IT in Figure\ 1a/P.81 with levels from\ 
+5 to\ \(em45\ dBm and frequencies from\ 125\ Hz to\ 3000\ Hz should result 
in a flat noise system 
spectrum density at the output of the multiplication device (terminal\ T3 of
Figure\ 1a/P.81) within\ \(+-\ 1\ dB over the frequency range\ 75\ Hz to\ 
5000\ Hz. The 
spectrum density should be measured with a bandwidth resolution of
maximum\ 50\ Hz.
.PP
For a wideband system, when \fIQ\fR\d\fIW\fR\uis set to\ 0\ dB, input sine 
waves applied to terminal\ IT in Figure\ 1b/P.81 with levels from\ +5 to 
\(em45\ dBm and 
frequencies from\ 100\ Hz to 6000\ Hz should result in a flat noise system
spectrum density at the output of the multiplication device (terminal\ T3 of
Figure\ 1b/P.81) within\ \(+-\ 1\ dB over the frequency range\ 75\ Hz to\ 
10\ 000\ Hz. The spectrum density should be measured with a bandwith resolution 
of 
maximum\ 50\ Hz.
.RT
.sp 1P
.LP
3.3.3
	\fIAmplitude distribution\fR 
.sp 9p
.RT
.PP
The amplitude distribution of the noise at the system output should be 
approximately gaussian. 
.PP
\fINote\fR \ \(em\ A noise source consisting of a gaussian nose generator
followed by a peak clipper with a flat spectrum from near zero to 20\ kHz 
will produce a satisfactory output noise at terminal\ OT. 
.bp
.RT
.LP
.rs
.sp 20P
.ad r
\fBFigure 2a/P.81 p.\fR 
.sp 1P
.RT
.ad b
.RT
.LP
.rs
.sp 21P
.ad r
\fBFigure 2b/P.81, p.\fR 
.sp 1P
.RT
.ad b
.RT
.LP
.bp
.sp 1P
.LP
3.3.4
	\fINoise attenuators\fR 
.sp 9p
.RT
.PP
The loss of the noise attenuator(s) i.e.\ between terminals\ T4
and\ T5 in
Figures\ 1a/P.81 and 1b/P.81, should be within\ \(+-\ 0.1\ dB of the nominal 
setting. The attenuator(s) should at least allow\ \fIQ\fR\d\fIN\fR\uand\ 
\fIQ\fR\d\fIW\fR\usettings in the 
range\ \(em5\ dB to\ 45\ dB, i.e.\ a\ 50\ dB range.
.RT
.sp 1P
.LP
3.4
	\fICombined path\fR 
.sp 9p
.RT
.PP
The requirements under this heading refer to the MNRU with both
speech and noise paths simultaneously in operation.
.PP
With \fIQ\fR\d\fIN\fR\uor \fIQ\fR\d\fIW\fR\u(as the case may be) set to 
zero, and the 
input terminated by an equivalent resistance, the idle noise generated 
in the combined path should be less than \(em60\ dBm when measured at the 
system output 
(terminal\ OT).
.RT
.sp 2P
.LP
	\fBReferences\fR 
.sp 1P
.RT
.LP
[1]
	LAW (H. | .), SEYMOUR (R. | .): A reference distortion system using
modulated noise, \fIThe Institute of Electrical Engineers\fR , pp.\ 484\(hy485,
November\ 1962.
.sp 2P
.LP
	\fBBibliography\fR 
.sp 1P
.RT
.LP
CCITT\ \(em\ Contribution COM\ XII\(hyNo.\ 63, \fISome considerations on\fR 
\fIspecifications for modulated noise reference unit\fR , NTT,\ Japan, Study
Period\ 1981\(hy1984.
.LP
CCITT\ \(em\ Contribution COM\ XII\(hyNo.\ R4, pp.\ 71\(hy79, Study Period\ 
1981\(hy1984. 
.LP
CCITT\ \(em\ Contribution COM\ XII\(hyNo.\ 119, \fIDescription and method 
of use of\fR 
\fIthe modulated noise reference unit (MNRU/MALT)\fR , France, Study
Period\ 1981\(hy1984.
\v'2P'
.sp 2P
.LP
\fBRecommendation\ P.82\fR 
.FS
This Recommendation was numbered P.77 in the
\fIRed Book\fR .FE
.RT
.sp 2P
.ce 1000
\fBMETHOD\ FOR\fR \ 
\fBEVALUATION\ OF\ SERVICE\ FROM\fR 
.EF '%	Volume\ V\ \(em\ Rec.\ P.82''
.OF '''Volume\ V\ \(em\ Rec.\ P.82	%'
.ce 0
.sp 1P
.ce 1000
\fBTHE\ STANDPOINT\ OF\ SPEECH\ TRANSMISSION\ QUALITY\fR 
.ce 0
.sp 1P
.ce 1000
\fI(Geneva, 1976; amended at Malaga\(hyTorremolinos, 1984)\fR 
.sp 9p
.RT
.ce 0
.sp 1P
.LP
\fB1\fR 	\fBGeneral\fR 
.sp 1P
.RT
.PP
The CCITT recommends that Administrations make use of telephone
users' surveys in the manner of Recommendation\ E.125\ [1] as a means of
measuring speech transmission quality on international calls.
.PP
Such surveys being call\(hyrelated (in this instance to the last
international call made) can be conducted either by the full use of the
.PP
Recommendation\ E.125 questionnaires (where other valuable information is
obtained on users' difficulties, e.g.\ knowing how to make the call,
difficulties in dialling or understanding tones, etc.) or by making use of
those questions solely related to transmission quality which appear in
Annex\ A.
.PP
\fINote\fR \ \(em\ The evaluation of the transmission performance may be 
altered by difficulties in setting\(hyup call. Hence the response to incomplete 
questionnaires should be considered with some reservation.
.RT
.LP
.sp 2
.bp
.sp 2P
.LP
\fB2\fR 	\fBConduct of surveys\fR 
.sp 1P
.RT
.PP
In order to make valid comparisons between data collected in
different countries, Recommendation\ E.125 should be strictly adhered to.
Specifically the preamble to the Recommendation, the notes of intended 
use of the questionnaires and the precise order and wording of the questions 
should be rigidly followed. In some cases, however, an exception will be 
made and 
Question\ 10.0 will be replaced by the wording indicated in Annex\ B (detailed
information is given in [3]).
.PP
\fINote\fR \ \(em\ This alternative version has the advantage of simplifying 
the classification of responses to open end probes by experts, as well 
as 
increasing the sensitivity to some types of impairments such as delay. These
advantages should be weighed against the additional interview time which 
may be required. 
.RT
.sp 2P
.LP
\fB3\fR 	\fBTreatment of results\fR 
.sp 1P
.RT
.PP
To provide quantitative information suitable for comparisons, the subjective 
assessments (e.g. those obtained from Question\ 9.0 of Annex\ A) of 
excellent, good, fair or poor (see Note) should be accorded scores of\ 
4, 3, 2 and\ 1, respectively and a mean opinion score (MOS) calculated 
for all 
associated responses. Similarly for all those experiencing difficulty (under
Question\ 10.0 of Annex\ A or, alternatively, Question\ 10.0 of Annex\ B) a
percentage
of the total responses should be calculated.  These two criteria of MOS and
percentage difficulty are now internationally recognized and have been 
measured under many different laboratory simulated connections and practical 
situations.
.PP
The results can be classified in a number of ways, e.g.\ in terms of the 
call\(hydestination countries or by nature/composition of the connection 
i.e.\ cable/satellite circuits, presence or otherwise of echo suppressors 
etc. Typical methods of presentation of the results are shown in\ [2], 
in this case for several countries. It should be noted that in all
presentations it is essential to show the number of responses.
.PP
\fINote\fR \ \(em\ Among the reasons which lead to the limitation of users'
opinions of transmission quality to four classes, i.e.\ excellent, good, fair
and poor, is the following. The experience gained in human factor
investigations has shown that when a question which requires a selection 
from several different classifications is posed in aural form, e.g.\ by 
face\(hyto\(hyface interview or by telephone as with Recommendation\ E.125, 
the respondent is 
frequently unable to carry a clear mental separation of more than four
.PP
categories. As a consequence, he is unable to draw on his short\(hyterm 
memory and judgement ability in a sufficiently precise manner to avoid 
confusion and gives an unreliable response. This restriction does not apply 
to other situations 
where a written presentation of the choices is used, in which case frequently 
five or more classes may be appropriate and shown to yield reliable 
responses.
.RT
.ce 1000
ANNEX\ A
.ce 0
.ce 1000
(to Recommendation P.82)
.sp 9p
.RT
.ce 0
.ce 1000
\fBExtract from the questionnaire annexed to Recommendation E.125\fR 
.sp 1P
.RT
.ce 0
.PP
Reproduced below are the questions relating to transmission
quality which appear in the questionnaire annexed to
Recommendation\ E.125.
.sp 1P
.RT
.PP
The CCITT recommends that this Annex should be used when
customers' general impressions of transmission performance are required.
.sp 1P
.LP
9.0
	
.sp 9p
.RT
.LP
	\fIWhich of these four words comes closest to describing the\fR 
\fIquality of the connection during conversation?\fR 
.LP
9.1
	\(em
	\fIexcellent\fR 
.LP
9.2
	\(em
	\fIgood\fR 
.LP
9.3
	\(em
	\fIfair\fR 
.LP
9.4
	\(em
	\fIpoor\fR .bp
.sp 1P
.LP
10.0
	\fIDid you or the person you were talking to have difficulty in\fR 
\fItalking or hearing over that connection?\fR 
.sp 9p
.RT
.LP
	(If answer is \*Qyes\*U) probe for nature of difficulty, but without
suggesting possible types of difficulty, and copy down answers verbatim:
e.g.\ \fI\*QCould you describe the difficulty a little more?\*U\fR 
.LP
\ 
	
.LP
\ 
	
.LP
	At end of interview, categorize the answers in terms of the items below:
.LP
10.1
	\(em
	low volume
.LP
10.2
	\(em
	noise or hum
.LP
10.3
	\(em
	distortion
.LP
10.4
	\(em
	variations in level, cutting on and off
.LP
10.5
	\(em
	crosstalk
.LP
10.6
	\(em
	echo
.LP
10.7
	\(em
	complete cut off
.LP
10.8
	\(em
	other (specify)
.PP
\fINote\fR \ \(em\ Responses to Questions 10.1 to 10.8 are only obtained
from customers who have expressed difficulty in Question\ 10.0.
.ce 1000
ANNEX\ B
.ce 0
.ce 1000
(to Recommendation P.82)
.sp 9p
.RT
.ce 0
.ce 1000
\fBAlternative version for Question 10.0 of questionnaire\fR 
.sp 1P
.RT
.ce 0
.ce 1000
\fBannexed to Recommendation E.125\fR 
.ce 0
.LP
\fR 
.PP
Studies at AT&T have shown that the verbatim responses
describing impairments (requested after Question\ 10.0 of Annex\ A) are 
often too imprecisely worded to permit accurate classification by interviewers 
who are 
not experienced in transmission studies. A typical solution to this problem 
has been to convene a panel of experts to classify the responses, a method 
which 
may become impractical as the size and number of user reaction tests increases. 
This annex presents an alternative approach developed in 1976 and used 
widely since then by AT&T to measure customer's perceptions of transmission 
quality on domestic and international telephone connections. The approach 
involves a more complicated technique of probing for impairments which 
simplifies the ultimate task of classifying the responses. The alternative 
of Question\ 10.0 is 
reproduced below.
.sp 1P
.RT
.PP
The CCITT recommends that this annex should be used for diagnostic purposes 
only. 
.sp 1P
.LP
10.0
	\fIDid you have any difficulty talking or hearing over that\fR 
\fIconnection?\fR 
.sp 9p
.RT
.LP
	Do not probe: If the person volunteers an explanation, write it
down.
.LP
\ 
	
.LP
	On question 10.1\(hy10.8, attempt to read entire text before respondent
replies.
.sp 1P
.LP
10.1
	\fINow I'd like to ask some specific questions about the connection.\fR 
.sp 9p
.RT
.LP
	If the person has already described difficulty, add:
.LP
	(\fIIn view of what you've already said, some of these may seem\fR 
\fIrepetitious, but please bear with me\fR ). \fIFirst, during your\fR 
\fIconversation on that call, did you hear your own voice echoing back,\fR 
\fIor did your own voice sound hollow to you?\fR 
.LP
10.1.1
	\(em
	echo hollow (own voice)
.LP
10.1.2
	\(em
	neither
.LP
10.1.3
	\(em
	don't remember/not sure
.LP
10.1.4
	\(em
	other (specify)
.bp
.sp 1P
.LP
10.2
	\fIDid you hear another telephone conversation on the telephone\fR 
\fInetwork at the same time as your own?\fR 
.sp 9p
.RT
.LP
10.2.1
	\(em
	other conversation
.LP
10.2.2
	\(em
	no
.LP
10.2.3
	\(em
	don't remember/not sure
.LP
10.2.4
	\(em
	other (specify)
.sp 1P
.LP
10.3
	 \fINow I'd like you to think about the voice of the person you were\fR 
\fItalking to. Was the volume of the voice low as if the person were faint 
and\fR \fIfar away; did the voice fade in and out; or was the voice interrupted 
or\fR 
\fIchopped up at times?\fR 
.sp 9p
.RT
.LP
10.3.1
	\(em
	low volume
.LP
10.3.2
	\(em
	fading
.LP
10.3.3
	\(em
	chopping
.LP
10.3.4
	\(em
	none
.LP
10.3.5
	\(em
	don't remember/not sure
.LP
10.3.6
	\(em
	other (specify)
.sp 1P
.LP
10.4
	 \fIHow did the voice of the person your were talking to sound to you:\fR 
\fIdid it echo or sound hollow and tinny; or did it sound fuzzy or unnatural?\fR 
.sp 9p
.RT
.LP
10.4.1
	\(em
	echo, hollow
.LP
10.4.2
	\(em
	fuzzy, unnatural
.LP
10.4.3
	\(em
	none
.LP
10.4.4
	\(em
	don't remember/not sure
.LP
10.4.5
	\(em
	other (specify)
.sp 1P
.LP
10.5
	 \fINow let me describe three kinds of noise. Tell me if you noticed\fR 
\fIany of these noises during your conversaiton: a rushing or hissing sound; 
a\fR \fIfrying and/or sizzling, crackling sound; or a humming or buzzing 
sound?\fR 
.sp 9p
.RT
.LP
10.5.1
	\(em
	rushing, hissing
.LP
10.5.2
	\(em
	frying and/or sizzling, cackling
.LP
10.5.3
	\(em
	humming, buzzing
.LP
10.5.4
	\(em
	none
.LP
10.5.5
	\(em
	don't remember/not sure
.LP
10.5.6
	\(em
	other (specify)
.sp 1P
.LP
10.6
	\fINow let me describe three more kind of noise. Tell me if you\fR 
\fInoticed any of these during your conversation: a clicking sound; a series 
of\fR \fImusical tones or beeps; or a continuous high\(hypitched tone?\fR 
.sp 9p
.RT
.LP
10.6.1
	\(em
	clicking
.LP
10.6.2
	\(em
	tones or beeps
.LP
10.6.3
	\(em
	high\(hypitched tone
.LP
10.6.4
	\(em
	none
.LP
10.6.5
	\(em
	don't remember/not sure
.LP
10.6.6
	\(em
	other (specify)
.bp
.sp 1P
.LP
10.7
	 \fIDid the other person seem slow to respond, as if there were delay\fR 
\fIor time lag in the conversation?\fR 
.sp 9p
.RT
.LP
10.7.1
	\(em
	yes
.LP
10.7.2
	\(em
	no
.LP
10.7.3
	\(em
	don't know
.LP
10.7.4
	\(em
	other (specify)
.sp 1P
.LP
10.8
	 \fIWould you please try to remember the background noise in the area\fR 
\fIaround your telephone (e.g.\ noise from air\(hyconditioning plant unit, 
road\fR 
\fItraffic, office equipment or other people talking) when you made the 
call.\fR \fIWhich of the following categories best describes it?\fR 
.sp 9p
.RT
.LP
10.8.1
	\(em
	very noisy
.LP
10.8.2
	\(em
	noisy
.LP
10.8.3
	\(em
	quiet
.LP
10.8.4
	\(em
	very quiet
.LP
10.8.5
	\(em
	other (specify)
.sp 1P
.LP
10.9
	 \fIWhich of the categories listed below best describes the extent to\fR 
\fIwhich you heard your own voice through your telephone when you were\fR 
\fItalking?\fR 
.sp 9p
.RT
.LP
10.9.1
	\(em
	could not hear it
.LP
10.9.2
	\(em
	could hear it now that you have drawn my attention to it
.LP
10.9.3
	\(em
	did notice it \(em not loud
.LP
10.9.4
	\(em
	did notice it \(em loud
.LP
10.9.5
	\(em
	other (specify)
.sp 1P
.LP
10.10
	\fIWas there anything else about the connection you'd like to\fR 
\fImention?\fR 
.sp 9p
.RT
.LP
	\fIYes \(em What?\fR (Write in)
.LP
\ 
	
.LP
\ 
	
.LP
\ 
	
.LP
	Coding instructions:
.LP
	\(em
	is there a written comment?
.LP
	\(em
	does the comment apply to this call?
.LP
	\(em
	does it mention an impairment?
.LP
	\(em
	has it been mentioned already?
.LP
	\(em
	other (specify)
.PP
\fINote\fR \ \(em\ The responses to the specific questions are only obtained 
from customers who have expressed difficulty in Question\ 10.0. This may 
prevent the diagnosis of certain impairments (the bias produced is more 
serious than 
that mentionned at the end of Annex\ A).
.sp 2P
.LP
	\fBReferences\fR 
.sp 1P
.RT
.LP
[1]
	CCITT Recommendation \fIInquiries among users of the international\fR 
\fItelephone service\fR , Red\ Book, Vol.\ II, Rec.\ E.125, ITU, Geneva, 1985.
.LP
[2]
	CCITT\ \(em\ Question 2/XII, Annex 2, Contribution COM XII\(hyNo. 1, Study
Period 1977\(hy1980, Geneva,\ 1977.
.LP
[3]
	CCITT\ \(em\ Question 2/XII, Annex, Contribution COM XII\(hyNo.\ 171, Study
Period 1977\(hy1980, Geneva, August\ 1979.
.bp
.sp 2P
.LP
\fBRecommendation\ P.84\fR 
.RT
.sp 2P
.ce 1000
\fBSUBJECTIVE\ LISTENING\ TEST\ METHOD\ FOR\ EVALUATING\fR 
.EF '%	Volume\ V\ \(em\ Rec.\ P.84''
.OF '''Volume\ V\ \(em\ Rec.\ P.84	%'
.ce 0
.sp 1P
.ce 1000
\fBDIGITAL\ CIRCUIT\ MULTIPLICATION\ AND\ \ PACKETIZED\ VOICE |
SYSTEMS\fR 
.FS
The specifications in this Recommendation are subject to
future enhancement and therefore should be regarded as provisional.
.FE
.ce 0
.sp 1P
.ce 1000
\fI(Melbourne, 1988)\fR 
.sp 9p
.RT
.ce 0
.sp 1P
.LP
\fB1\fR 	\fBIntroduction\fR 
.sp 1P
.RT
.sp 1P
.LP
1.1
	\fIPurpose\fR 
.sp 9p
.RT
.PP
The purpose of this Recommendation is to describe a subjective
listening test method which can be used to compare the performance of Digital 
Circuit Multiplication Equipment (DCME) and 
packetized voice systems
.
.PP
Many of the degradations found in DCME or packetized voice systems
have
not been tested before and their effects on other systems in the network are
unknown. Therefore the only definitive method is the conversation test where
the effects of non\(hylinearity, delay, echo,\ etc. and their interactions 
can be verified. 
.PP
For DCME systems, degradations can include not only the effects of
variable bit\(hyrate coding, DSI gain (channel allocation), clipping, freezeout
and noise contrast, but also those due to non\(hylinearities in the speech
detection system, such that the system may function differently for different 
speech input levels or activity factors. For packetized voice systems the 
subjective effect, for example, of \*Qlost packets\*U is unknown.
.PP
Listening tests play an important preliminary role in the assessment, and 
can supply useful information serving to narrow the range of conditions 
needing a complete conversation test. Moreover, listening tests of the 
effects of the impairments produced by DCME, in association with an evaluation 
of the effects of delay added by the DCME, using the echo tolerance method 
described in Recommendation\ G.131, can give a good indication of the overall 
performance of such systems and allow reasonable comparisons to be made. 
In addition, the 
.PP
delay evaluation should determine whether or not the use of DCME in a network 
setting will require additional echo control. This listening test method 
will not provide results useful for generating network application rules 
based on 
factors such as the quantizing distortion unit (qdu). Future improvements of
the test will allow such results to be obtained.
.PP
Evaluation of DCME in tandem with other DCME has not been considered at 
this stage nor have the effects of systems using encoding at different 
rates. This Recommendation will subsequently be updated when information on
these specific points becomes available.
.PP
This Recommendation confines itself solely to listening tests; a
separate Recommendation\ P.85, on conversation tests, will be formulated when
sufficient information on evaluation techniques is available. Alternatively,
this Recommendation may be revised to include conversation test methods.
.RT
.sp 2P
.LP
1.2
	\fIDefinitions\fR 
.sp 1P
.RT
.sp 1P
.LP
1.2.1
	\fBdigital circuit multiplication equipment (DCME)\fR 
.sp 9p
.RT
.PP
A general class of equipment which permits concentration of a
number of 64\ kbit/s PCM encoded input speech circuits onto a reduced number 
of transmission channels. 
.PP
This equipment allows an increase in the circuit capacity of the
system. The capacity of speech and voiceband data can both be increased 
by the use of DCME. 
.bp
.RT
.sp 1P
.LP
1.2.2
	\fBdigital circuit multiplication system\fR (DCMS)
.sp 9p
.RT
.PP
A telecommunication system comprised of two or more DCME terminals connected 
by a digital transmission system providing a pool of bearer channels. The 
DCMS supports: 
.RT
.LP
	i)
	64 kbit/s clear channels for ISDN services (can be used in   the bearer pool),
.LP
	ii)
	voiceband data (dial\(hyup) up to and including 9600 bit/s
V.29. Group\ III facsimile is also included under this heading,
.LP
	iii)
	 voice services in the frequency range 300\(hy3400 Hz, carried at 56 or 
64\ kbit/s, 
.LP
	iv)
	64 kbit/s clear (not ISDN dial\(hyup),
.LP
	v)
	sub\(hy64 kbit/s digital data.
.sp 1P
.LP
1.2.3
	\fICircuit versus packet mode\fR 
.sp 9p
.RT
.PP
Internally the DCME may employ a circuit or a packet mode for the transmission 
of speech or data. In the circuit mode, bearer channels are 
derived by providing suitable time slots on the transmission facility
interconnecting the DCME terminal equipment. In the packet mode virtual 
bearer channels are created and the speech or data samples are put into 
one or more 
packets of fixed or variable length. The packets are addressed to the
destination circuit and transmitted in a virtual channel on the transmission
facility one at a time. Thus, in the circuit mode the transmission facility 
can be thought of as carrying a number of bearer channels multiplexed together, 
while in the packet mode the facility is thought of as a single high speed
channel logically divided into virtual channels which transmits packets 
one at a time. 
.RT
.sp 1P
.LP
1.2.4
	\fBsingle clique working (point\(hyto\(hypoint operation)\fR 
.sp 9p
.RT
.PP
The system of two DCMEs interconnected by one set of bearer
channels. This working of a DCME is the most efficient mode of operation 
for a DCMS. It utilizes the maximum bearer pool capacity and the minimum 
inter\(hyDCME control information. It is an exclusive mode of operation. 
Another term for 
point\(hyto\(hypoint is circuit\(hybased DCMS. Figure\ 1/P.84 shows an 
example of 
point\(hyto\(hypoint or circuit\(hybased DCMS.
.RT
.LP
.rs
.sp 14P
.ad r
\fBFigure 1/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.sp 1P
.LP
1.2.5
	\fBmulti\(hyclique working (point\(hyto\(hymultipoint operation)\fR 
.sp 9p
.RT
.PP
A single DCME working to more than one DCME each on a
point\(hyto\(hypoint destination basis; designations are split and are 
therefore not interactive. Multi\(hyclique working reduces the traffic 
handling capacity 
compared
with point\(hyto\(hypoint operation, due to a reduction in bearer capacity. 
Single 
clique working is the equivalent of point\(hyto\(hypoint operation.
.bp
.RT
.sp 1P
.LP
1.2.6
	\fBmulti\(hydestination operation\fR 
.sp 9p
.RT
.PP
Many DCMEs working over a common bearer capacity pool, enabling
interactive working. This is the equivalent of a TDMA satellite system. 
Traffic handling capacity is drastically reduced since the bearer becomes 
very small, due to inter\(hyDCME control messages and inter\(hyterminal 
operation reducing the 
bearer capacity. Another term for multi\(hydestination DCMS is network\(hybased 
DCMS. Figure\ 2/P.84 shows an example of this. 
.RT
.LP
.rs
.sp 24P
.ad r
\fBFigure 2/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.sp 1P
.LP
1.2.7
	\fBlow rate encoding (LRE)\fR 
.sp 9p
.RT
.PP
Speech coding methods with bit rates less than 64 kbit/s, e.g.
the 32\ kbit/s ADPCM transcoder, (Recommendation\ G.721). This is one technique 
commonly used in DCME to increase the circuit capacity. 
.RT
.sp 1P
.LP
1.2.8
	\fBdigital speech interpolation (DSI)\fR 
.sp 9p
.RT
.PP
This is a technique whereby advantage can be taken of the inactive periods 
during a conversation, creating extra channel capacity. Speech activity 
is typically 30\(hy40%, on average, which can produce a DSI gain of up 
to 3 |  | , but generally in the range of 2 |  |  to 2,5 |  | . 
.RT
.sp 1P
.LP
1.2.9
	\fBLRE gain, DSI gain, DCME gain\fR 
.sp 9p
.RT
.PP
LRE gain is the factor by which the 64 kbit/s rate of the incoming circuits 
is reduced when LRE is used for coding within the DCME. For example, when 
a transcoder conforming to Recommendation\ G.721 is used, the LRE gain 
will equal\ 2. The LRE gain is 1 when no transcoding is used. 
.bp
.PP
DSI gain is the ratio of the number of active speech input circuits to 
the number of bearer channels used to transport this speech, where the 
same 
encoding rate is used for circuits and bearer channels. The DSI gain is
constrained by the number of input circuits and the speech activity factor 
and other input speech characteristics. The DSI gain is 1 when DSI is not 
used. 
.PP
The DCME gain is the product of the LRE and DSI gain factors.
.RT
.sp 1P
.LP
1.2.10
	\fBDCME overload\fR 
.sp 9p
.RT
.PP
The instant when the number of instantaneously active input
circuits exceeds the maximum number of \*Qnormal\*U bearer channels available 
for DSI. 
.RT
.sp 1P
.LP
1.2.11
	\fBfreezeout\fR 
.sp 9p
.RT
.PP
The condition when an input circuit becomes active with speech and cannot 
be immediately assigned to a bearer channel, due to lack of availability 
of such channels. 
.RT
.sp 1P
.LP
1.2.12
	\fBfreezeout fraction\fR 
.sp 9p
.RT
.PP
The percentage of speech lost, obtained by averaging over all
input circuits for a given time interval, e.g.\ one minute.
.RT
.sp 1P
.LP
1.2.13
	\fBtransmission overload\fR 
.sp 9p
.RT
.PP
The condition when the freezeout fraction goes beyond the value
set in accordance with the speech quality requirements.
.RT
.sp 1P
.LP
1.2.14
	\fBclipping\fR 
.sp 9p
.RT
.PP
An impairment occurring in DSI systems employing speech detectors whereby 
the detector, due to the time it takes to recognize that speech is 
present, can cut off (\*Qclip\*U) the start of the speech utterance. Competitive 
clipping is the impairment caused by the overload control strategy which
allows
.PP
freezeout to occur when bearer channels are temporarily unavailable. Another
name for the competitive clipping overload control strategy is 
sample
dropping
.
.RT
.sp 1P
.LP
1.2.15
	\fBvariable bit rate (VBR)\fR 
.sp 9p
.RT
.PP
An overload control strategy often used to cope with traffic peaks and 
hence freezeout problems. Temporary, additional bearer channels (overload 
channels) are created. Several VBR techniques are available: 
.RT
.LP
	i)
	 Graceful overload is one technique to reduce the bit rate. For example, 
a 4\(hybit sample 32\ kbit/s ADPCM channel can be reduced on demand to 
a minimum of a 3\(hybit sample 24\ kbit/s, and the VBR will average across 
the DCMS somewhere between 3 and 4\ bits. The dynamic load control (DLC) 
will operate 
when the predicted traffic loading rises above a preset VBR.
.LP
	ii)
	 Permanent 3\(hybit allocation set on block of channels. These channels 
operate solely in a 3\(hybit mode. 
.PP
The different reduction techniques available have different
subjective performances.
.sp 1P
.LP
1.2.16
	\fBqueuing\fR 
.sp 9p
.RT
.PP
An overload control strategy employing buffer memory in the DCME transmitter 
to store speech samples while waiting for a bearer channel to 
become available.
.RT
.sp 1P
.LP
1.2.17
	\fBdynamic load control (DLC)\fR 
.sp 9p
.RT
.PP
An overload control strategy in which the DCMS signals to the
associated switch that the traffic load the switch is generating, or is
predicted to generate, cannot be transmitted satisfactorily by the DCMS and
that the switch should reduce its demand on the DCMS by a holding signal 
sent to the circuits when they become idle. 
.RT
.sp 1P
.LP
1.2.18
	\fBload carrying capacity\fR 
.sp 9p
.RT
.PP
The load carrying capacity is defined as the maximum offered
speech load plus \*Qoverhead\*U load (see \(sc\ 1.2.19) that the DCME channels 
can 
carry without forced loss of any speech samples. DCME overload is defined to
occur when the instantaneously offered load exceeds the carrying capacity of
the DCME bearer channels.
.bp
.RT
.sp 1P
.LP
1.2.19
	\fBapplied and offerd load\fR 
.sp 9p
.RT
.PP
The applied load consists of the speech bursts entering the DCME on the 
active circuits. Thus, applied load is a function of the number of 
active circuits and the speech activity on the circuits.
.PP
The offered load consists of the applied load plus any additional load 
(overhead) generated by the DCME messages and control information. The 
offered load is the load presented to the DCME bearer channels. If the 
offered load is less than the load\(hycarrying capacity of the channels, 
then all the offered load is carried by the DCME. However, if the offered 
load exceeds the capacity of 
the bearer channels, then, depending upon the overload strategy of the DCME,
some of the offered load will be lost through competitive clipping (sample
dropping). The DCME may employ variable bit rate coding so that, should the
freezeout fraction exceed some preset limit, the DCME can momentarily increase 
the load\(hycarrying capacity of the bearer channels (creation of overload 
channels) in order to accommodate the extra load. Dynamic load control 
may also be used to limit the applied load. 
.PP
The instantaneous load is a function of the statistics of the input
speech and the 
DCME overhead traffic
, and is difficult to characterize mathematically. However, the long\(hyterm 
time average applied load can be 
calculated as follows:
\v'6p'
.RT
.sp 1P
.ce 1000
\fIL\fR\d\fIa\fR\u= \fIN\fR 
@ { (*a } over { (*a~+~\(*b } @ ,
.ce 0
.sp 1P
.LP
.sp 1
where \fIL\fR\d\fIa\fR\uis the average applied load, \(*a is the average 
speech burst 
length,
\(*b is the average silence length, and \fIN\fR is the number of circuits 
in use. The term \(*a/(\(*a\ +\ \(*b) is equal to the average speech activity. 
The applied load is 
measured at the input to the DCME on the circuits. Thus, the 
average load   on the DCME
can be externally controlled by varying the number of circuits in use, 
\fIN\fR , or the 
speech activity factor
, \(*a/(\(*a\ +\ \(*b).
.PP
Similarly, average offered load is a useful concept, and it can be calculated 
from this formula: 
\v'6p'
.sp 1P
.ce 1000
\fIL\fR\d\fIo\fR\u= \fIN\fR 
@ { (*a(\fIk\fR~+~1) } over { (*a~+~\(*b } @  +\ \fIG\fR ,
.ce 0
.sp 1P
.LP
.sp 1
.LP
where \fIL\fR\d\fIo\fR\uis the average load offered to the bearer channels, 
the term \fIk\fR is a constant which accounts for the 
\*Qstretching\*U
effect that the
speech
detector has on the activity factor, and the term \fIG\fR is a load factor that
accounts for the system overhead traffic (e.g. control messages). Thus, the
average offered load, \fIL\fR\d\fIo\fR\u, will almost always be larger than the
average applied load, \fIL\fR\d\fIo\fR\u.
.sp 1P
.LP
1.3
	\fITest philosophy\fR 
.sp 9p
.RT
.PP
In order for a test to satisfactorily evaluate DCME performance the test 
methodology should meet certain conditions. These are as 
follows:
.RT
.LP
	i)
	the method should use principles, procedures, and
instrumentation that are acceptable to CCITT;
.LP
	ii)
	 the method should be adaptable to different languages and should yield 
results that are comparable to previous test results; 
.LP
	iii)
	 the method should permit DCME performance to be compared subjectively 
(or objectively) to reference conditions. Examples of suitable 
reference conditions are 
hypothetical reference connections
(HRCs), white noise and speech correlated noise. The HRCs should model 
the facilities the DCME is designed to replace, when these facilities are 
known. The results of the comparisons should permit making \*Qequivalence 
statements\*U about the 
DCME, e.g.\ a DCME system is subjectively equivalent to \fIx\fR asynchronously
tandemed 64\ kbit/s PCM systems. Ideally, the method should yield results 
from which a network application rule can be derived; 
.LP
	iv)
	the DCME should be tested with a realistic traffic load
simulator and circuit\(hyunder\(hytest signal conditions applied. Most of the
transitory impairments arise when the DCME is operating in the range of 
applied load which forces the use of DSI. Therefore, to subjectively measure 
the 
effects of these impairments it is necessary to vary the applied load on the
DCME up to and including the maximum design load. The clipping produced 
by the speech detector is affected by the type of signal being transmitted 
on the 
circuit under test. Therefore, only a realistic speech signal which also
contains appropriate additive noise should be used on the circuit under test;
.bp
.LP
	v)
	 in most instances DCME is designed to be used in the network as a replacement 
for an existing facility. If the DCME introduces more delay 
than the facility replaced, then this additional delay will reduce the echo
tolerance (grade of service) unless it is compensated for by the use of
extra echo control measures
. Based on Recommendation\ G.131, the
magnitude of the reduction in the echo tolerance that will occur without 
extra echo control can be determined and hence a decision taken as to the 
need for 
additional echo control measures.
.LP
	vi)
	 The methodology should, ideally, yield results which can be used to produce 
new opinion models or modify existing models. 
.sp 1P
.LP
1.4
	\fIDescription of DCME\fR 
.sp 9p
.RT
.PP
Annex A contains a detailed description of the characteristics of DCME 
that can be evaluated with this methodology. This section contains a brief 
summary of these characteristics. 
.PP
The test methodology applies to two types of DCME: one type which uses 
DSI only to obtain a DSI gain and a second type which uses a combination 
of LRE and DSI to obtain both a LRE gain and a DSI gain. The test methodology 
accounts for the operation of the speech detector, recognizing that speech 
clipping is an impairment that may occur even though the DCME is not overloaded. 
.PP
The test methodology is applicable to DCME employing any one or a
combination of three methods of overload control: 1) sample dropping or
competitive clipping, 2) variable bit rate, and 3) queuing. The test plan 
also allows for testing of DCME having DLC capability. 
.PP
The test methodology recognizes that many of the impairments
produced by DCME occur only when a load is applied, and therefore provision 
is made to apply a controlled load to the DCME under test. The load is 
varied 
between zero and 100% of circuit capacity. Use of the packet mode in the 
DCME converts it into a packetized voice system, and this test methodology 
is 
applicable to these systems. At the present time only point\(hyto\(hypoint (and
possibly point\(hyto\(hymultipoint) DCME are covered by this methodology.
.RT
.sp 2P
.LP
\fB2\fR 	\fBSource recordings\fR 
.sp 1P
.RT
.sp 1P
.LP
2.1
	\fIApparatus and environment\fR 
.sp 9p
.RT
.PP
The talker should be seated in a quiet room having a volume of
between 40 and 120\ cubic meters and a reverberation time of less than 500\ ms
(preferably in the range\ 200 to 300\ ms). The room noise level must be below
30\ dBA with no dominant peaks in the spectrum.
.PP
Speech should be recorded from an 
Intermediate Reference
System
(IRS), as specified in Recommendation\ P.48, or an equivalent
circuit. The IRS is chosen as it is well documented and can be implemented 
by all laboratories. The IRS should be calibrated according to 
Recommendation\ P.64.
.PP
The recording equipment should be of high quality and of the type
agreed to by the test. The equipment selected should be capable of providing 
at least a 40\ dB signal\(hyto\(hynoise ratio. A suitable system might 
consist, for 
example, of a high\(hyquality digital audio tape recording system.
.PP
All the source speech material should be recorded so that the active speech 
level, as measured according to Recommendation\ P.56, is approximately 
23\ dB below the peak overload level of the recording system. This will 
assure that the speech peaks will not overload the recording system. 
.RT
.sp 1P
.LP
2.2
	\fISpeech material\fR 
.sp 9p
.RT
.PP
The speech material should consist of a sequence of simple,
meaningful, short sentences, chosen at random because easy to understand 
(from current non\(hytechnical literature or newspapers, for example). 
Very short and 
very long sequences should be avoided, the aim being that each sequence when
spoken should have a duration of at least 30\ s and the duration of any two
sequences should differ by no more than 5\ s. Administrations can use one 
of two approaches: 
.RT
.LP
	i)
	 to have as many different sequences as there are conditions (an example 
of suitable material from which sequences may be constructed is 
contained in Annex\ B), or
.LP
	ii)
	to have a more limited number, e.g. 10 sequences per
talker, where combinations of two sequences can be used (this is shown in
detail in Annex\ C).
.PP
Because of the opinion scales to be used the first approach is
recommended. Enough sequences should be available to cater for all the test
conditions, plus a sufficient number for use in a practice session.
.bp
.sp 1P
.LP
2.3
	\fIProcedure\fR 
.sp 9p
.RT
.PP
At least three sentences should be used for each sequence. A silent period 
containing only circuit noise of approximately one second should procede 
the first sequence and the sequence should end with a similar silent period 
containing only the circuit noise. One of the inter\(hysentence pauses 
containing circuit noise should last one to two seconds. Otherwise, the 
talker should 
speak so that pauses occur naturally.
.PP
To facilitate the processing of the recorded speech through the DCME, i.e. 
to allow for the starting and stopping of the recorders between sequences 
and to allow time for adjusting the DCME for the next test condition, sequences 
should be separated by a 5\ seconds gap on the tape. Therefore, the recorded 
source sequences will have the pattern on the tape shown in Figure\ 3/P.84.
.RT
.LP
.rs
.sp 11P
.ad r
\fBFigure 3/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.PP
Sequences should be played back to listeners beginning with the
one second silent period. After the sequence has ended, a 5\ s period of
complete silence should be provided to permit the listener to vote.
.PP
Talkers should pronounce the sequence of sentences fluently but not
dramatically and have no speech deficiencies such as \*Qstutter\*U.
.PP
At least two male\(hyfemale pairs of talkers shall be used, and more
pairs are desirable if the test\(hytime permits.
.PP
The method of presentation of the source sequences will be by
randomization of talkers by blocks; as shown in the following example:
.RT
.LP
.sp 1
Block 1
Block 2
Block 3
. |  | 
Block n
Talker
1 2 3 4
3 4 1 2
1 3 2 4
2 3 1 4
.LP
where talkers 1 and 2 are male and talkers 3 and 4 are female.
.sp 1P
.LP
2.4
	\fICalibration signals and speech levels\fR 
.sp 9p
.RT
.PP
When the recordings have been made, the active speech level of each speech 
sequence (excluding the preceding and following silent periods) should 
be measured, preferably according to Recommendation\ P.56. If necessary, 
the 
speech
should then be re\(hyrecorded onto the \fIright\fR channel of a second 
system with the necessary gain adjustments, so that all the sequences will 
be brought to the 
same speech level, namely 23\ dB below the peak overload level of the recording 
system. 
.PP
Thirty seconds of 1000 Hz tone should be inserted at the re\(hyrecording 
stage, 
at an r.m.s. level\ 17\ dB above the active speech level, i.e.\ 6\ dB below the
peak overload level of the recording system: the peak level of this tone 
will be 3\ dB higher still. This tone can then be used later to adjust 
the r.m.s. 
input speech level to be 20\ dB below the overload point of the DCME (a
peak/r.m.s. of tone of 3\ dB with the speech level 17\ dB below the r.m.s. 
tone level will give the 20\ dB figure). 
.bp
.PP
The \fIleft\fR channel of the source recording should contain a 1000 Hz
tone at a level 23\ dB below the peak overload level and of 0.5\ s duration,
recorded about 0.5\ s before the start and after the end of each sequence. 
These two signals may be used as checking and control signals in the processing 
of 
the source sequences through the DCME under test.
.RT
.sp 2P
.LP
\fB3\fR 	\fBSimulating system load\fR 
.sp 1P
.RT
.sp 1P
.LP
3.1
	\fIRequirements for a generic\fR 
\fIvoice load simulator\fR 
.sp 9p
.RT
.PP
Digital Circuit Multiplication Equipment (DCME), by definition, is used 
to gain an advantage in the number of circuits multiplexed onto a digital 
transmission facility. With this advantage, however, comes potential 
degradation of transmission quality when carried loads exceed that for which
the DCME was engineered. Thus, a rigorous performance evaluation of DCME
includes studying the behaviour of the DCME under conditions of no load,
engineered load, and overload. Because the transmission performance of DCME
under load depends critically upon the load characteristics, it is necessary 
to use \fIknown and controlled\fR simulated loads in order to properly 
assess DCME 
performance. This section describes the generic requirements for a voice 
load simulator for the purpose of facilitating DCME performance evaluations 
under 
conditions that are meaningful. Use of voice load simulators with the generic 
requirements described here will also enable the comparison of results 
from 
different studies of various DCME.
.PP
\fINote\ 1\fR \ \(em\ The load simulator specified here is to be used for the
performance evaluation of DCME using Digital Speech Interpolation (DSI). 
This excludes Type A DCME, for which load is not an issue by virtue of 
the fixed 
time\(hyslot assignment of the channels.
.PP
\fINote\ 2\fR \ \(em\ The load simulator specified here is an \*Qexternal\*U
simulator that produces simulated speech signals so as to exercise many
circuits being multiplexed onto a digital transmission facility. Prototype 
DCME frequently use \*Qinternal\*U load simulation of \*Qtrunk needs service\*U 
requests 
that simulate the output of multiple speech detector circuits and thus 
compete for transmission capacity, even though no simulated signals are 
actually 
transmitted; only the \*Qlive\*U channel under test is actually transmitting. 
This type of simulator can be very useful in the lab, but is not treated 
here 
because certain assumptions would have to be made regarding the performance
characteristics of the associated speech detector simulation.
.RT
.sp 1P
.LP
3.1.1
	\fIParameters\fR 
.sp 9p
.RT
.PP
A generic Voice Load Simulator (VLS) for DCME performance
evaluation has the following attributes (the parametric specification of 
which are detailed later in this section): 
.RT
.LP
	\(em
	talk\(hyspurt characteristics,
.LP
	\(em
	silence (gap) characteristics,
.LP
	\(em
	background noise\(hyfill for silent periods,
.LP
	\(em
	spectral properties of the simulated speech,
.LP
	\(em
	amplitude characteristics,
.LP
	\(em
	physical interface, including idle\(hycircuit
specifications.
.PP
The above are a minimum set of parameters that may have to be
expanded as required; for example, time variation of the number of simulated
calls might have to be studied, at which time a pertinent specification 
would have to be added. Also, only simulated speech signals are discussed. 
It may be desirable to add simulated tones, signalling frequencies, and 
voiceband data of various types at a later date. 
.sp 2P
.LP
3.1.2
	\fIRequirements\fR 
.sp 1P
.RT
.sp 1P
.LP
3.1.2.1
	\fIGeneral\fR 
.sp 9p
.RT
.PP
These requirements apply to a generic VLS testing a DCME.
Accordingly, the DCME must receive digital signals from the VLS that simulate 
multiple and independent sources of speech similar to that which is observed 
in telephone networks. To meet the \*Qmultiple and independent\*U condition, 
it will be assumed that the VLS output is to several T1 or CEPT interfaces. 
.bp
.PP
Where possible, existing Recommendations have been used in deriving
these requirements. The most notable exception are the requirements associated 
with speech activity and the underlying statistical distributions of 
talk\(hyspurts and silent periods (gaps). For these, the current technical
literature was surveyed; the results of\ [1] being both recent and based on
conversational speech, are used here.
.RT
.sp 1P
.LP
3.1.2.2
	\fITalk\(hyspurt characteristics\fR 
.sp 9p
.RT
.PP
The probability density function (p.d.f.) of talk\(hyspurt durations is 
modeled by two weighted geometric p.d.f.'s: 
\v'6p'
.RT
.sp 1P
.ce 1000
\fIf\fR\d\fIt\fR\u\fI(k\fR \fI)\fR = \fIC\fR\d1\u(1\(em\fIU\fR\d1\u)\fIU\fR 
$$Ei:\fIk\fR \(em1:1_ 
+ \fIC\fR\d2\u(1\(em\fIU\fR\d2\u)\fIU\fR $$Ei:\fIk\fR \(em1:2_, \fIk\fR 
= 1, 2, 3,  |  |  |  
.ce 0
.sp 1P
.LP
.sp 1
where
.LP
	\fIC\fR\d1\u= 0.60278
	\fIU\fR\d1\u= 0.92446
.LP
	\fIC\fR\d2\u= 0.39817
	\fIU\fR\d2\u= 0.98916.
.PP
Every increment of the variable \fIk\fR is equal to 5 ms in time. The cumulative 
distribution function of talk\(hyspurt durations is shown in 
Figure\ 4/P.84. The average talk\(hyspurt duration is \(*a\ =\ 227\ ms.
.LP
.rs
.sp 20P
.ad r
\fBFigure 4/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.sp 1P
.LP
3.1.2.3
	\fISilence (gap) characteristics\fR 
.sp 9p
.RT
.PP
The p.d.f. of silence durations is also modeled by two weighted
geometric p.d.f.'s:
\v'6p'
.RT
.sp 1P
.ce 1000
\fIf\fR\d\fIs\fR\u\fI(k\fR \fI)\fR = \fID\fR\d1\u(1\(em\fIW\fR\d1\u)\fIW\fR 
$$Ei:\fIk\fR \(em1:1_ 
+ \fID\fR\d2\u(1\(em\fIW\fR\d2\u)\fIW\fR $$Ei:\fIk\fR \(em1:2_, \fIk\fR 
= 1, 2, 3,  |  |  |  
.ce 0
.sp 1P
.LP
.sp 1
where
.LP
	\fID\fR\d1\u= 0.76693
	\fIW\fR\d1\u= 0.89700
.LP
	\fID\fR\d2\u= 0.23307
	\fIW\fR\d2\u= 0.99791.
.PP
The cumulative distribution function of silence (gap) durations is shown 
in Figure\ 4/P.84. 
.PP
The average silence duration of \(*b = 596 ms, combined with the 227 ms 
talk\(hyspurt duration average, yields a long\(hyterm speech activity factor 
of 
27.6\ percent.
.bp
.RT
.sp 1P
.LP
3.1.2.4
	\fIBackground noise\(hyfill for silent periods\fR 
.sp 9p
.RT
.PP
Noise should be inserted into the silent periods (gaps) so that the performance 
of DSI in the presence of noise can be studied. It is desirable to have 
the noise level adjustable; a default value of \(em58\ dbm0p is provisionally 
recommended. 
.RT
.sp 1P
.LP
3.1.2.5
	\fIProperties of the simulated speech\fR 
.sp 9p
.RT
.PP
The artificial voice signal of Recommendation P.51 shall be used as a basis 
for simulating the characteristics of human speech. Supplement\ No.\ 7 
to the Series\ P\ Recommendations describes a possible generation process 
of the 
artificial voice according to Recommendation\ P.51. This signal can then be
switched on/off according to the talk\(hyspurt and silence duration statistics
described in \(sc\(sc\ 3.1.2.3 and 3.1.2.4.
.RT
.sp 1P
.LP
3.1.2.6
	\fIPhysical interface\fR 
.sp 9p
.RT
.PP
The load simulator should have T1 and/or CEPT outputs which have
physical, electrical, coding, frame structure, alignment, and signalling
characteristics as per Recommendations\ G.703, G.704, G.711 and G.732
(2048\ kbit/s) or G.733 (1544\ kbit/s).
.RT
.sp 1P
.LP
3.2
	\fIDetermining\fR 
\fIload capacity of tested systems\fR 
.sp 9p
.RT
.PP
The average applied load equals the product of the number of
circuits in use, \fIN\fR , and the average speech activity. The load capacity 
of the tested system equals the maximum load that the system is designed 
to handle, 
\fIL\fR\dm\\da\\dx\u. The load capacity can be determined by:
.RT
.LP
	i)
	obtaining the manufacturer's specifications,
.LP
	ii)
	calculation.
.PP
After the load capacity is determined, the partial loads at which the system 
will be tested can be determined. The partial loads are: 
\v'6p'
.sp 1P
.ce 1000
\fIL\fR\di\u= \fIc\fR\di\u\fIL\fR\dm\\da\\dx\u
.ce 0
.sp 1P
.LP
.sp 1
where
\v'6p'
.LP
	\fIc\fR\di\u= 0.0, 0.50, 0.75 and 1.0.
.LP
.sp 1
.sp 1P
.LP
3.3
	\fIControlling load applied to tested systems\fR 
.sp 9p
.RT
.PP
The load applied to the DCME can be changed by varying \fIN\fR and the 
activity factor. For these tests the speech activity factor will be assumed 
constant at 28%. Therefore, to obtain a partial load, \fIL\fR\di\u, it is
necessary
to calculate the number of active circuits which come closest to achieving 
this desired value. 
.PP
For example, if \fIL\fR\dm\\da\\dx\u= 48 and if a partial load of
\fIL\fR\di\u\ =\ 0.50 \fIL\fR\dm\\da\\dx\uis desired and the speech activity 
factor 
of 28% is assumed, then the number of active circuits, \fIN\fR \dactive
\u, is
calculated thus:
\v'6p'
.RT
.sp 1P
.ce 1000
\fIN\fR \dactive
\u = \fIc\fR\di\u@ { fIL\fR\dm\\da\\dx\u\fI } over { speech\fR~~\fIactivity~factor) } @ \fR = 0.5
@ { 8 } over { .28 } @  = 86 active circuits.
.ce 0
.sp 1P
.LP
.sp 1
In the test, 86 circuits would carry speech load and the remainder would be
idled.
.PP
\fINote\fR \ \(em\ The following items are for future study:
.LP
	a)
	 Should DCME loads include voiceband data as well as speech? The effect 
of voiceband data traffic on speech quality is an important issue in the 
evaluation of DCME performance. Data percentage is defined as 
follows:
\v'6p'
.sp 1P
.ce 1000
\fIP\fR \ddata
\u = 
@ { umber~of~input~circuits~active~with~data } over { otal~number~of~active~circuits } @  \(mu 100%
.ce 0
.sp 1P
.LP
.sp 1
	b)
	Some Administrations report that speech activity on their
real circuits averages about 36% when using a highly sensitive speech detector 
having a short hangover time of about 30\ ms. Is it desirable to modify 
the 
speech load requirements given in \(sc\ 3.1, and, if so, what values are
recommended?
.bp
.LP
	c)
	Fractional values of speech load are given in \(sc\ 3.2. Some
DCME may operate so as to display significant changes in performance at
different fractional load points. Should the fractional load points be 
changed to accommodate this type of operation, and, if so, what changes 
are 
recommended?
.sp 2P
.LP
\fB4\fR 	\fBProcessing of the speech\fR 
.sp 1P
.RT
.PP
The DCME testing laboratory will take the source recordings, replay them 
through the circuit under test of the agreed DCME (using the calibration 
tone to set the agreed input level), operating the DCME at the agreed load, 
and record the output from the circuit under test in a predetermined arrangement 
(explained in \(sc\ 5). The recorded outputs will then be used to perform the
listening test. The DCME being tested must be connected to the load simulator 
and to the recording and playback equipment as shown in Figure\ 5/P.84. 
It may be necessary to make provision for special A/D and D/A interfaces 
to permit the selected load simulator and recording equipment to be connected 
to the DCME. 
.PP
All the processed outputs will be on the \fIleft\fR channel of the
recording medium. The corresponding original signal will be simultaneously
recorded on the \fIright\fR channel. The 1\ kHz tone will be available 
both in its original form (\fIright\fR channel) and as processed by passing 
through the DCME 
under test (\fIleft\fR channel).
.PP
The 1 kHz tone on the source recording (see \(sc 2) will be used to
adjust the r.m.s. input speech level to be 20, 30 or 38\ dB below the overload 
point of the DCME coder. 
.RT
.LP
.rs
.sp 26P
.ad r
\fBFigure 5/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.sp 2P
.LP
\fB5\fR 	\fBTest design\fR 
.sp 1P
.RT
.PP
Three separate tests are proposed to evaluate different aspects of DCME 
performance. The first verifies the effect of various loads on the 
performance. The second verifies the effect of errors in the DCME digital
control channel. The third test calculates the effect that the DCME delay 
has on the echo tolerance. This last test will be done using Recommendation\ 
G.131 and does not involve subjective testing. 
.bp
.RT
.sp 1P
.LP
5.1
	\fITest No. 1: Effect of applied load\fR 
.sp 9p
.RT
.PP
This test may be conducted twice, once to obtain a quality rating and (optionally) 
a second time to obtain a listening effort rating. The 
parameters for testing are as follows:
.RT
.LP
	a)
	\fIDCME test parameters:\fR 
.LP
	1.
	DCMEs under test: \fIN\fR 
.LP
	2.
	DCME loads: four values (0, 0.5, 0.75, 1.0) (see \(sc 3.2)
.LP
	3.
	speech activity factor: one value (28%)
.LP
	4.
	active circuit speech characteristics: one value (see \(sc 3.1)
.LP
	5.
	circuit under test (CUT) idle circuit noise (ICN): two
values
.FS
Time permitting, use of a third noise level of \(em58\ dBm0p is
suggested. This will permit a better characterization of the effect different 
noise levels have on the DCME. 
.FE
(\(em77 and \(em45\ dBm0p)
.LP
	6.
	input speech level to CUT: three values (20, 30 and 38 dB
below DCME coder overload)
.LP
	7.
	output listening levels: at least three values (preferred
and preferred \(+-10\ dB)
.LP
	8.
	talkers: four talkers, i.e. 2 male and 2 female.
.LP
	b)
	\fIReference parameters\fR 
.LP
	1.
	original source sequences: one value
.LP
	2.
	MNRU: four values (5\(hy35 dB in 10 dB steps)
.LP
	3.
	SNR: three values (20, 30 and 40 dB)
.LP
	4.
	 reference connections (HRCs): approximately four different cases to be 
decided by test team 
.LP
	5.
	listening levels: three levels (see above)
.LP
	6.
	talkers: four talkers, i.e. 2 male and 2 female.
.PP
For the stated set of parameters the number of test condition is:
.sp 1P
.ce 1000
4 \(mu 2 \(mu 3 \(mu 3 \(mu 4 \(mu \fIN\fR = 288 \(mu \fIN\fR DCME conditions
.ce 0
.sp 1P
.LP
plus
.sp 1P
.ce 1000
12 \(mu 3 \(mu 4 = 144 reference conditions.
.ce 0
.sp 1P
.LP
This totals (assuming \fIN\fR = 1 DCME):
.sp 1P
.ce 1000
432 test conditions + 36 practice = 468 conditions.
.ce 0
.sp 1P
.PP
The set of test conditions should be divided into about 13
segments (12\ test\ +\ 1\ practice) of 36\ conditions with the conditions 
within 
each segment put into a random order. Table\ 1/P.84 lists the conditions in a
basis non\(hyrandomized segment.
.PP
The basic balanced segment in Table 1/P.84 will be repeated for each of 
4\ talkers and 3\ listening levels to create 12\ test segments: A thru 
L. A 
practice segment\ P will also be created. The test segments\ A thru\ L 
plus\ P can then be ordered for playback in the listening test according 
to the procedure described in \(sc\ 6. 
.PP
Assuming each condition takes 35 s to present and obtain a vote, total 
test time is about 4.5\ hours. 
.RT
.sp 1P
.LP
5.2
	\fITest No. 2: Effect of digital errors in the DCME control channel\fR 
.sp 9p
.RT
.PP
The preceding test was done assuming that the digital transmission facility 
is operated error\(hyfree. Under real conditions errors will occur and 
errors in the DCME control channel may cause momentary disruption of the 
voice circuits. To determine the effect of digital errors on performance, 
Test\ No.\ 1 should be repeated while random errors at a rate of 10\uD\dlF261\u3\d 
are injected 
into the control channel. For this test only one listening level (preferred) 
is necessary, so the total number of test conditions is \fIN\fR \ \(mu\ 
96 plus 144 
reference conditions. With \fIN\fR \ =\ 1, the test time is 2.3\ hours.
.bp
.RT
.ce
\fBH.T. [T1.84]\fR 
.ce
TABLE\ 1/P.84
.ce
\fBBasic segment (assumes 1 DCME for testing)\fR 
.ps 9
.vs 11
.nr VS 11
.nr PS 9
.TS
center box;
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
Condition	Load	BCN (dBm0p)	Input | ua\d\u)\d (dB)	Q (dB)	SNR (dB)	HRC
_
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 1	0.00	\(em77	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 2	0.50	\(em77	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 3	0.75	\(em77	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 4	1.00	\(em77	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 5	0.00	\(em45	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 6	0.50	\(em45	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 7	0.75	\(em45	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 8	1.00	\(em45	20			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
\ 9	0.00	\(em77	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
10	0.50	\(em77	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
11	0.75	\(em77	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
12	1.00	\(em77	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
13	0.00	\(em45	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
14	0.50	\(em45	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
15	0.75	\(em45	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
16	1.00	\(em45	30			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
17	0.00	\(em77	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
18	0.50	\(em77	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
19	0.75	\(em77	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
20	1.00	\(em77	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
21	0.00	\(em45	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
22	0.50	\(em45	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
23	0.75	\(em45	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
24	1.00	\(em45	38			
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
25			20			Original
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
26			20	\ 5		
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
27			20	15		
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
28			20	25		
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
29			20	35		
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
30			20		20	
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
31			20		30	
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
32			20		40	
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
33			20			HRC1
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
34			20			HRC2
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
35			20			HRC3
.T&
cw(36p) | cw(30p) | cw(36p) | cw(36p) | cw(30p) | cw(30p) | cw(30p) .
36			20			HRC4
.TE
.LP
ICN
idle circuit noise
.LP
\ua\d\u)\d\ dB below DCME coder overload level.
.nr PS 9
.RT
.ad r
\fBtableau 1/P.84 [T1.84], p.\fR 
.sp 1P
.RT
.ad b
.RT
.LP
.sp 3
.sp 1P
.LP
5.3
	\fITest No. 3: Effect of delay\fR 
.sp 9p
.RT
.PP
In this test, using Recommendation G.131, the intent is to
calculate the transmission delay through the DCME, then determine if the 
delay will require the use of additional echo control measures. The answer 
to this 
question requires that we define the connections that the DCME will be 
used to provide, then determine the echo tolerance of these connections 
assuming that conventional transmission facilities are used in place of 
the DCME, and then 
finally determine the reduction in the echo tolerance that will occur by
inserting the DCME into the connections. If the reduction in tolerance falls
below acceptable limits then additional echo control measures will be required 
if the DCME is used. 
.bp
.RT
.sp 2P
.LP
\fB6\fR 	\fBListening test procedure\fR 
.sp 1P
.RT
.sp 1P
.LP
6.1
	\fIApparatus, calibration and environment\fR 
.sp 9p
.RT
.PP
The 
listening room
should meet the same conditions as the
recording room
with the exception that the environmental noise should be set to 45\ dBA 
(Hoth spectrum \(em\ Supplement\ No.\ 13, at the end of this 
fascicle.
.PP
The IRS receiving end (Recommendation P.48) or equivalent circuit will 
be used. The IRS should be calibrated according to Recommendation\ P.64. 
.PP
The gain of the system should be set in such a way that the 1 kHz tone 
played back from the recordings produces a sound pressure of 7\ dBPa when 
measured on the IEC\ 318 artificial ear (Recommendation\ P.51). Thus the 
speech level at that point will also be \(em10\ dBPa (84\ dB SPL) for undistorted 
speech 
which is close to the \*Qpreferred listening level\*U.
.RT
.sp 1P
.LP
6.2
	\fIInstructions to subjects\fR 
.sp 9p
.RT
.PP
The instructions are given in Annex D. When the subjects have read these 
instructions, they should listen to the practice conditions and give 
their response to each sample. No suggestions should be made to them that 
the practice conditions exhaust the range of qualities that they can expect 
to 
hear. Questions about procedure or about the meaning of the instructions 
should be answered, but any technical questions must be met with the response, 
\*QWe 
cannot tell you anything about that until the test is finished\*U.
.RT
.sp 1P
.LP
6.3
	\fIOpinion scale\fR 
.sp 9p
.RT
.PP
The methods agreed to are both of the single stimulus type based on the 
mandatory \*Qquality\*U scale and the optional \*Qlistening effort\*U scale. 
.RT
.sp 1P
.LP
6.3.1
	\fIOpinions based on the \*Qquality\*U scale\fR 
.sp 9p
.RT
.PP
The following five categories should be used for the quality
test:
.RT
.LP
	\(em
	Excellent
.LP
	\(em
	Good
.LP
	\(em
	Fair
.LP
	\(em
	Poor
.LP
	\(em
	Bad
.LP
or equivalent depending on language. (Supplement No. 2, at the end of this
fascicle.
.sp 1P
.LP
6.3.2
	\fIOpinions based on the effort required to understand the\fR 
\fImeaning of sentences (listening effort scale)\fR 
.sp 9p
.RT
.PP
The following five categories should be used for the optional
listening effort test:
.RT
.LP
	\(em
	complete relaxation possible, no effort required;
.LP
	\(em
	attention necessary, no appreciable effort required;
.LP
	\(em
	moderate effort required;
.LP
	\(em
	considerable effort required;
.LP
	\(em
	no meaning understood with any feasible effort.
.LP
or equivalent according to language. (Supplement No. 2, at the end of this
fascicle.)
.PP
\fINote\ 1\fR \ \(em\ It is expected that quality and listening effort 
scales are correlated. Therefore it is not generally required to use both 
scales. 
However, if, in a particular case, it is desirable to obtain ratings on both
scales, the test should first be performed by using the listening effort 
scale and then duplicated using the quality scale. This order of presentation 
is 
particularly important if the same listeners and the same speech sources are
used in both tests.
.PP
\fINote\ 2\fR \ \(em\ The rating scales associated with the categories 
defined in \(sc\(sc\ 6.3.1 and 6.3.2 are assumed to be linear interval 
scales. It is recommended to bring this assumption to the attention of 
the subjects in the test 
instructions, either in words or by presenting numbers of numerical scales in
.PP
the written instructions. Examples of how this can be done is given in 
Annex\ D. Alternatively, the scale can have more than 5\ grades (e.g.\ 
7 or 11\ grades) with the same five verbal definitions at equal distances. 
An additional possibility is to define the end points of the scale separately 
(e.g.\ Ideal and Unusable). These defined end points then serve as anchoring 
points but are not supposed to be used for the rating. Examples of such 
alternative subjective scales are 
found in Annex\ E.
.bp
.RT
.sp 1P
.LP
6.4
	\fISequence of operations\fR 
.sp 9p
.RT
.PP
The 12 test plus 1 practice segments (A\(hyL plus P) should be played back 
according to the augmented latin\(hysquares: 
.RT
.LP
.sp 1
\fIQuality test\fR \fIOptional listening effort test\fR P
	CABD . |  |  
	P
	ABDC . |  |  
	
P
	DBAC . |  |  
	P
	DCAB . |  |  
	
P
	ADCB . |  |  
	P
	BDCA . |  |  
	
P
	BCDA . |  |  
	P
	CABD . |  |  
	
.PP
In these squares, each row is used for each group of listeners,
who may listen either simultaneously or separately. The segments are played
back in the given order within each row. A pause will naturally occur between 
one segment and the next, while the right place on the recording medium 
is 
being found and possibly the calibration is checked; this pause will also be
welcomed by the listeners.
.sp 1P
.LP
6.5
	\fIListeners\fR 
.sp 9p
.RT
.PP
The listeners used in the tests should be drawn at random from the population 
of telephone service customers. About 40 but not less than 
30\ listeners should be solicited.
.RT
.sp 1P
.LP
6.6
	\fIData collection\fR 
.sp 9p
.RT
.PP
Subject's responses may be collected by any convenient method:
pencil and paper, press\(hybuttons controlling lamps recorded by the operator, 
or automatic data\(hylogging equipment, for example. But whatever method 
is used, 
care must be taken that subjects should not be able to observe other subjects' 
responses, nor should they be able to see the record of their own previous 
responses. Apart from the inevitable memory and practice effects, each 
response should be independent of every other. 
.RT
.sp 2P
.LP
\fB7\fR 	\fBStatistical analysis and reporting of results\fR 
.sp 1P
.RT
.PP
After the test is finished and all subject responses are collected, the 
experimenter will assign numerical scores to the responses as follows: 
.RT
.LP
.sp 1
\fIResponse\fR \fIScore\fR Excellent
5
Good
4
Fair
3
Poor
2
.LP
Bad
1
Complete relaxation possible, no effort required
5
Attention necessary, no appreciable effort required
4
Moderate effort required
3
Considerable effort required
2
No meaning understood with any feasible effort
1
.bp
.PP
The numerical mean score (over subjects) should be calculated for each 
condition, and these means listed (this is required so that effects due 
to male and female speech can be seen). 
.PP
As a further aid to rapid review of results, graphs should be prepared 
according to the formats shown in Figure\ 6/P.84. 
.PP
Note especially that the averaging of male and female results is here proposed 
purely to reduce the output to manageable proportions, and does not 
imply that this step would be warranted for the detailed study and
interpretation of the results (unless the significance tests justify it).
.PP
Calculation of separate standard deviations for each condition is not recommended. 
Confidence limits should be evaluated and significance tests 
performed by conventional analysis\(hyof\(hyvariance techniques.
.RT
.LP
.rs
.sp 40P
.ad r
\fBFigure 6/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.LP
.bp
.ce 1000
ANNEX\ A
.ce 0
.ce 1000
(to Recommendation P.84)
.sp 9p
.RT
.ce 0
.ce 1000
\fBDescription of\fR 
\fBdigital circuit multiplication equipment\fR 
.sp 1P
.RT
.ce 0
.LP
A.1
	\fIDefinition of DCME\fR 
.sp 1P
.RT
.PP
Digital circuit multiplication equipment (DCME) is defined in
\(sc\ 1.2.1. A working definition may be: any digital transmission method that
derives more voicegrade circuits than is possible using equipment conforming 
to Recommendation\ G.711. For our purposes the term circuit may at times 
refer to a circuit between two switching points (trunk) or between the 
customers premises and a switching point (loop). At other times it may 
refer to an end\(hyto\(hyend 
digital connection. The circuit may also be physical or virtual. The term
voicegrade means that the bandwidth of the circuit is nominally 3.1\ kHz. We
will attempt to avoid confusion by using suitable qualifiers, when necessary, 
to describe the kind of circuit we mean. 
.PP
Based on the above definitions we conclude that there are three basic types 
of DCME. These are: 
.RT
.LP
	\fIType\ A\fR \ \(em\ Uses only LRE (low rate encoding, < | 4 kbit/s) to
obtain a circuit multiplier larger than\ 1. Some LRE methods (e.g.,\ 32\ kbit/s
ADPCM) are amenable to the subjective testing methods described in
Recommendation\ P.70; other methods (e.g.\ 48\ kbit/s vocoding) may require new
subjective test methods.
.LP
	 \fIType\ B\fR \ \(em\ Uses only digital speech interpolation (DSI) to 
obtain a circuit multiplier larger than\ 1. DSI is defined in \(sc\ A.2. 
By definition the digital coding used in Type\ B DCME to derive a circuit, 
operates at 64\ kbit/s and conforms to Recommendation\ G.711. Thus, the 
coding provides a circuit 
multiplier of unity. During periods of DCME overload any of several overload
strategies may be used to resolve the contention for channels. The three 
basic overload strategies are defined in \(sc\ A.5. For example, during 
momentary periods of overload the channel coding rate may be reduced to 
increase the channel 
capacity. However, this recoding action is attributed to the DSI and the
circuit multiplier larger than\ 1 thus obtained is credited to the DSI.
.LP
	 \fIType\ C\fR \ \(em\ Combination of Types A and B. This hybrid type 
employs LRE to obtain a circuit multiplier larger than\ 1, and then DSI 
to obtain an 
additional circuit multiplier larger than\ 1. For example, if the LRE comforms 
to Recommendation\ G.721 32\ kbit/s ADPCM, then the coder has a circuit 
multiplier of \(*k\ =\ 2. The DSI may increase this multiplier by a further 
factor of\ 2 or\ 3, depending upon the DCME. The total multiplier, 4 to\ 
6, is equal to the product of the LRE and DSI multipliers. 
.sp 1P
.LP
A.2
	\fIDigital speech interpolation\fR \fI(DSI)\fR 
.sp 9p
.RT
.PP
Digital speech interpolation, is defined in \(sc 1.2.8. A working
definition of DSI may be: any method for assigning a voicegrade bearer 
channel on demand for the transmission of speech at the onset of the speech 
burst 
(talk\(hyspurt). The bearer channel comes from a pool maintained by the 
DCME and the speech comes from an active circuit connected to the DCME. 
When the speech burst stops the channel is either: 
.RT
.LP
	i)
	relinquished and put back into the pool, or
.LP
	ii)
	kept assigned to the circuit as long as the pool is not
empty and the channel is not needed to service another circuit.
.PP
In the above context the term \*Qbearer channel\*U refers to the
transmission paths between the DCME terminals, which are used to carry the
traffic on the circuits connected to the DCME. By definition, a bearer 
channel has the same bandwidth as a circuit, i.e.\ voicegrade. Bearer channels 
may be 
derived using time, space or even frequency or wavelength division multiplexing 
of the transmission medium used by the DCME. The transmission media may 
be 
copper wire, coaxial cable, radio path or fibre.
.sp 1P
.LP
A.3
	\fISpeech detection\fR 
.sp 9p
.RT
.PP
To perform DSI, the DCME must contain a 
speech detector
.
The speech detector monitors the circuits and determines when speech is 
present and when it is not. When speech is declared present the DCME attempts 
to assign an available bearer channel to the circuit. If no channel is 
available the DCME then invokes its overload strategy. When the speech 
burst ends the speech 
detector may provide some \*Qhangover\*U to avoid tail\(hyend clipping 
of the burst. Hangover extends the effective length of the burst. 
.bp
.PP
\*QFill\(hyin
\*U is another speech detector function sometimes
employed to bridge or eliminate the silence gaps less than a certain length
between speech bursts. Fill\(hyin does not extend the length of individual 
bursts the way hangover does, but requires a processing delay equal to 
the maximum 
filled\(hyin gap. Both hangover and fill\(hyin increase the activity factor 
of the 
speech on the bearer channels.
.PP
To avoid front\(hyend clipping of the speech burst, the speech detector 
sometimes employs delay of a few milliseconds to give it time to decide 
whether speech is present. 
.PP
Clipping or mutilation of the speech burst (both front\(hyend and
possibly tail\(hyend) may occur because the speech detector makes false or late
decisions. The operation of the speech detector and thus the clipping
performance of the DCME is a function of many factors characterizing the 
signal on the circuits, such as the signal level, signal\(hyto\(hynoise 
ratio, and echo path loss. 
.RT
.sp 1P
.LP
A.4
	\fIDefinition of load\fR 
.sp 9p
.RT
.PP
The frequency of DCME overloading is a function of the load on the system. 
The system load consists of the speech bursts generated on the incoming 
circuits plus DCME generated overhead traffic. Since the speech burst activity 
on the circuits varies from moment to moment, the load also has short\(hyterm 
variations.
.PP
In defining load we must distinguish between the applied load and the offered 
load. The applied load is the speech bursts entering the DCME on the 
circuits in use. Thus, 
applied load
is a function of the number of
circuits in use and the speech activity on the circuits. The 
offered
load consists of the applied load plus any additional load generated by the
DCME. The offered 
load
is the load presented to the DCME channels. It should be evident that the 
offered load is usually larger than the applied 
load, because:
.RT
.LP
	i)
	 the speech detector increases the activity factor, since it adds fill\(hyin 
or hangover to speech bursts; 
.LP
	ii)
	\*Qoverhead\*U information may have to be transmitted on the
channels along with the speech samples.
.PP
While the load varies continuously, subject to the statistics of the speech 
and the circuit activity, if we assume that the number of circuits in use, 
\fIN\fR , is a constant over some period of time in which we are observing 
the operation of the DCME, then the average applied and offered loads becomes 
useful concepts. Formulas for the average loads are defined in \(sc\ 1.2.19. 
While these formulas are somewhat simplistic and do not capture the information 
concerning the variance of the load about the average, they do allow useful
insight into the operation of the DCME.
.PP
The load carrying capacity of the DCME channels is also an important consideration. 
The load carrying capacity is defined as the maximum offered 
speech plus \*Qoverhead\*U load that the DCME channels can carry. If the 
offered 
load is less than the load carrying capacity of the channels, then all the
offered load is carried by the DCME. However, if the offered load exceeds 
the capacity of the channels, then depending upon the overload strategy 
of the 
DCME, (see \(sc\ A.5) some of the offered load will be lost through 
sample
dropping
, or variable bit rate coding will be used to momentarily increase the 
load carrying of the channels so that they can accommodate the extra load. 
Thus, overloading is defined to occur when the offered load exceeds the 
carrying capacity of the DCME channels.
.PP
In a sample dropping system the load capacity is fixed and is simply \(*k\fIM\fR 
, where \fIM\fR is the number of 64\ kbit/s equivalent channels provided 
and \(*k is the LRE factor which accounts for the difference in bit rates 
between the 
circuits (always 64\ kbit/s) and the channels. If 32\ kbit/s LRE is used 
on the channels, for example, then \(*k\ =\ 2. If LRE is not used then 
\(*k\ =\ 1. If variable bit rate (VBR) coding is used then the load capacity 
of the DCME is not fixed, and overloading may be avoided by temporarily 
creating extra bearer channels. If the coding rate drops from 32\ to 16\ 
kbit/s, for example, then during the 
period VBR is active \(*k\ =\ 4.
.PP
In these examples the number of channels available to carry speech is assumed 
to be constant. However, in DCME that carries voiceband data and other 
tones on the circuits, DSI cannot be used on these signals. The result 
is that these continuous signals capture channels for full\(hytime use, 
reducing the pool of channels available for carrying speech. 
.PP
By using the average load equations and the concept of load capacity, we 
can illustrate in Figure\ A\(hy1/P.84 the load curves for a sample dropping 
type\ C\ DCME. The slope of the offered load curves depends upon the speech
activity factor. \(*a/(\(*a\ +\ \(*b), and the speech detector \*Qstretch\*U 
factor,\ \fIk\fR . Load 
.PP
curves for three different activity factors are shown. If the number of
circuits in use, \fIN\fR , is less than \fIN\fR\dm\\di\\dn\u\ =\ \(*k\fIM\fR 
\(em\fIG\fR \ =\ 43 then the DSI will never activate, even if the momentary 
speech activity factor goes to unity on all active circuits. Since the 
DCME\(hycarried load cannot exceed \(*k\fIM\fR \ =\ 48, as the average 
offered load, \fIL\fR\d\fIo\fR\u, gets closer and closer to the maximum 
capacity, the frequency of overloading (sample dropping) will increase 
as the moment\(hyto\(hymoment fluctuations in the speech activities push 
the offered load 
above the limit.
.bp
.RT
.LP
.rs
.sp 21P
.ad r
\fBFigure A\(hy1/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.PP
Figure A\(hy2/P.84 illustrates the load curves for a variable bit
rate type C system which recodes at 16\ kbit/s during overload. In this 
example, when the offered load exceeds \(*k\fIM\fR \ =\ 48 the coding rate 
is dropped from 32 to 16\ kbit/s on the bearer channels. The capacity is 
thus increased to \(*k\fIM\fR \ =\ 96. The extra capacity absorbs the momentary 
overload and prevents sample dropping (freezeout) from occurring. If the 
offered load exceeds 96 then sample dropping will have to occur, because 
further VBR (e.g.\ down to 8\ kbit/s) is not provided for in this example. 
.RT
.LP
.rs
.sp 23P
.ad r
\fBFigure A\(hy2/P.84, p.\fR 
.sp 1P
.RT
.ad b
.RT
.LP
.bp
.PP
Thus, in summary, as long as \fIN\fR \(= \fIN\fR\dm\\di\\dn\uthe DCME will not
need to use the DSI function, because all circuits will have access to 
a bearer channel. Overload will not occur until the offered load exceeds 
the load 
carrying capacity. In overload, the DCME will start dropping samples or will
queue the samples, in which case \(*k will not change, or the DCME will 
decrease the coding rate, in which case \(*k will increase, thus momentarily 
increasing the capacity of the DCME. 
.RT
.sp 1P
.LP
A.5
	\fIOverload strategies\fR 
.sp 9p
.RT
.PP
When a number of active circuits connected to the DCME exceeds the number 
of available channels, the DCME will experience momentary overloads; an 
increase in speech bursts will sometimes require more channels than are 
available. When this happens the DCME must invoke its \*Qoverload strategy\*U. 
The strategy is designed to deal with the issue of how best to share the 
channel 
pool. A number of basic strategies are possible:
.RT
.LP
	\fIType\ 1\ \(em\ \fR \fICompetitive clipping\fR \fIor\fR 
\fIspeech sample\fR \fIdropping\fR . In this strategy, defined in \(sc\ 
1.2.14, samples are dropped from the front end of the speech burst that 
unsuccessfully bids for a channel. 
Sample dropping continues until a channel is available or the burst ends
normally. Perceptually, the effects of front\(hyend sample dropping and 
front\(hyend clipping, the latter caused by the speech detector, should 
be the same, even 
though they have different causes. Theoretically, however, they are not
entirely the same, because front\(hyend clipping is more likely to affect
low\(hylevel parts of the signal, whereas freezeout affects all levels 
with equal probability. 
.LP
	\fIType\ 2\ \(em\ Variable bit rate coding\fR . This strategy, defined in
\(sc\ 1.2.15, employs embedded speech coding algorithms or other means to
effectively multiply the number of bearer channels momentarily available 
to the circuits to carry the offered load. Since a lowering of the bit 
rate will have the effect of increasing the quantization noise produced 
by the coders, the 
perceptual effect of variable rate coding will be momentary increases in
quantizing noise, i.e.\ reductions in \fIQ\fR (for a discussion of \fIQ\fR 
, see 
Recommendation\ P.81, \(sc\ 2).
.LP
	 \fIType\ 3\ \(em\ Queueing\fR . This strategy, defined in \(sc 1.2.16, 
employs buffers (memories) for the speech burst samples to occupy while 
waiting for a channel. The perceptual effect of pure queueing, without 
buffer overflow, is a time shift of the speech bursts. No samples are lost, 
and there is no increase in noise. The impairment introduced can be called 
\*Q 
silence duration
modulation
\*U. From the listener's point of view a given speech burst when
queued will begin somewhat later in time relative to its predecessor burst 
than it would have without queueing. Also the succeeding burst may be perceived 
as beginning somewhat sooner. Since the buffers must, of necessity, be 
finite this strategy cannot be employed alone, but it must be coupled with 
either sample 
dropping or variable rate coding. Thus, a queueing system can have speech
mutilation or recoding noise as well as time shifting.
.LP
	\fIType\ 4\ \(em\ Dynamic load control\fR . An overload control strategy,
defined in \(sc\ 1.2.17, in which the DCME signals to the associated switch 
that 
the traffic load which the switch is generating, or is predicted to generate, 
cannot be transmitted satisfactorily by the DCME, and the switch should 
reduce its demand on the DCME by a holding signal sent to the circuits 
when they 
become idle.
.sp 1P
.LP
A.6
	\fISilence reconstruction methods\fR 
.sp 9p
.RT
.PP
Since the DCME does not transmit silences between speech bursts at the 
receiving end, the silences must be artificially recreated. Several 
different methods for doing this are possible. The simplest is to insert a
white
noise at a fixed level in the receiver during silences. Careful selection of
the level is necessary to avoid noise contrast, that is, an apparent and
annoying contrast between the noise in the silences and the background noise
during speech bursts. Other methods are possible which attempt to adapt the
noise level automatically to the circuit conditions; these methods require
careful filtering and estimation of source noise power.
.RT
.sp 1P
.LP
A.7
	\fICircuit versus packet mode\fR 
.sp 9p
.RT
.PP
Internally the DCME may employ a circuit or a packet mode for the transmission 
of speech bursts. In the circuit mode, bearer channels are derived by providing 
suitable time slots on the transmission facility interconnecting the DCME 
terminal equipment. In the packet mode, the speech burst samples are put 
into one or more packets 
.bp
.PP
of fixed or variable length. The packets  are
addressed to the destination circuit and transmitted over the transmission
facility one at a time. Thus, in the circuit mode the transmission facility 
can be thought of as carrying a number of channels multiplexed together, 
while in the packet mode the facility is thought of as a single high speed 
channel which transmits packets one at a time. 
.PP
In the packet mode, performance of the system depends on how the
packets are serviced. Two servicing methods are:
.RT
.LP
	a)
	All packets from all circuits enter a first\(hyin first\(hyout
(FIFO) queue and are serviced by the high speed channel one at a time. Each
packet is treated independently. Each packet experiences a variable delay in
arriving at the receiving end that is a function of the fill of the FIFO 
queue. If packets arrive too late, after a given reconstruction delay, 
they will be 
lost or discarded by the receiver. This is called 
packet dropping
and   it is a function of the system load. Packet dropping can cause speech
mutilation at any point in the burst. It gives rise to \*Qmid\(hyburst\*U 
sample 
dropping. Packets can also be dropped in the FIFO queue if it experiences
overflow. The fill of the queue is monitored and the overload strategy is
invoked when necessary to prevent excessive packet dropping.
.LP
	b)
	Once a circuit has seized the high speed channel for
transmission of a packet all the packets on the circuit for that burst are
transmitted before the high speed line is free to transmit another circuit's
packets. Thus the circuit is \*Qcut\(hythrough\*U during the burst. Cut\(hythrough 
operation avoids mid\(hyburst speech sample loss. However, since only one
.LP
circuit at a time can use the high speed channel, other circuits with packets 
to transmit must await their turn. The packets must be queued while they 
await the channel. Load\(hydependent queueing delays must be equalized 
at the receiving end. This is usually done by employing some form of time 
stamp on the packet. The possibility always exists that packet queues will 
overflow before the 
packets can be transmitted. When this happens the overload strategy is 
invoked to prevent excessive packet dropping. 
.PP
Packet mode introduces more delay than a non\(hypacket mode DCME. The extra 
delay has three components. The first is the 
packetization time
. Packetization time is a function of packet length and circuit coding 
rate. The second is the 
reconstruction delay
. Reconstruction delay is chosen to minimize the probability of packet 
loss. The third is 
packet queueing
delay
. All three components may be load\(hydependent and thus variable.
.PP
In summary, use of packet mode rather than circuit mode may introduce these 
additional performance\(hyaffecting aspects: 
.RT
.LP
	i)
	mid\(hyburst sample dropping,
.LP
	ii)
	 additional delay equal to the sum of the packetization and reconstruction 
delays, 
.LP
	iii)
	packet queueing delay.
.sp 1P
.LP
A.8
	\fIPacket reconstruction\fR 
.sp 9p
.RT
.PP
In a packet mode, system loss of a packet presents the receiver
with a dilemma, namely, what to use in place of the speech samples carried 
in the lost packet. Several methods are employed and they have different 
performance consequences. One method is to insert noise samples in place 
of the lost speech samples. Another method repeats samples in a previous 
packet to 
replace the lost samples. Other methods are also employed.
.RT
.sp 1P
.LP
A.9
	\fICircuit versus network systems\fR 
.sp 9p
.RT
.PP
With the above definitions in mind there appears to be yet another way 
to classify DCME. We can talk about DCME using non\(hyswitched channels 
and 
DCME using switched channels. The first type, non\(hyswitched channels, 
is called a circuit\(hybased DCME. The second type, using switched channels, 
is called a 
connection\(hybased DCME.
.PP
A circuit\(hybased system would be used to provide circuits, either
trunks or loops. All switching is done outside the DCME. The connection\(hybased 
system incorporates circuit\(hy or packet\(hyswitching and thus is more 
properly 
thought of as a network solution rather than a circuit solution.
.PP
The testing of a connection\(hybased DCME is likely to be more
complicated than is the testing of a circuit\(hybased DCME. One reason 
is that the size of a connection\(hybased system may make it difficult 
to test in a 
laboratory. Another reason is that loading such a system with a controlled 
load is difficult. 
.bp
.RT
.ce 1000
ANNEX\ B
.ce 0
.ce 1000
(to Recommendation P.84)
.sp 9p
.RT
.ce 0
.ce 1000
\fBSpeech material used to construct speech sequences\fR 
.sp 1P
.RT
.ce 0
.sp 1P
.ce 1000
(The following narratives are examples used by Bell |
Communications Research)
.ce 0
.sp 1P
.ce 1000
ORWELL
.sp 1P
.RT
.ce 0
.PP
George Orwell began his classic novel 1984 with, \*QIt was a
bright cold day in April,\*U but he gave no further hint as to what the 
weather might be during the fateful year. From the succession of untoward 
weather 
events that marked 1983, many have come to believe that the world's weather 
has undergone an unprecedented change for the worse and that we might be 
headed for a series of natural disasters this year to match the demise 
of free democratic thought and speech described in Orwell's book. 
.sp 9p
.RT
.PP
Since we do not have the ability to predict what individual
weather events might occur during 1984, let us turn the calendar back a 
hundred years and see what happened throughout the country in 1884. The 
year opened 
with the arrival of arctic air from northern Canada which drove the thermometer 
down to \(em40\(de |  at Rockford, Illinois, and to \(em25\(de |  at Indianapolis, 
Indiana, both records that still stand. Sub\(hyzero temperatures penetrated 
into the South, and a hard freeze hit citrus groves in Florida. 
.PP
In early February, heavy rains falling on a deep snow cover caused the 
Ohio River to flood. Crests were of record height from Cincinnati to the 
river's mouth at Cairo, Illinois.
.PP
Late February brought an outbreak of tornados in the South and the
Ohio Valley, where some sixty individual funnels descended. More than 420 
were killed, and more than 1000\ injured. Nothing approached this visitation 
in 
severity or extent until the tornado outbreak in April in Durango, Colorado,
for seventy\(hysix days ending April\ 16.
.PP
In May, out\(hyof\(hyseason rainstorms in the deserts of the Southwest
caused widespread floods. Rail traffic from Salt Lake City to the south was
interrupted for three weeks, and the Rio Grande River flooding at El Paso,
Texas, caused $1\ million in damage.
.PP
Heavy frosts occurred in late May, when the thermometer dropped to
22\(de |  in Massachusetts, and snow fell in Vermont on Memorial Day.
.PP
California got more heavy rain in June; Los Angeles had 1.39 inches
and San Francisco 2.57 inches, both all\(hytime June records. And as a 
result of rain in Wisconsin the flooding Chippewa River did more than $1.5\ 
million in 
damages and left 2,000 homeless at Eau Claire.
.PP
The great Oregon snow blockade followed 34 inches of snowfall at
Portland in the middle of December. Rail communication was cut off from the
east and south for many days, and mail from California had to come by ocean
steamer.
.PP
If you think the weather that made so many headlines in 1983 was
unprecedented, hark back to 1884. We do not know whether El Nino was active
then or whether some other atmospheric or oceanic force was the culprit. 
All we can do now is wait and see what 1984 brings. 
.RT
.ce 1000
FOG
.sp 1P
.RT
.ce 0
.PP
One of winter's most spectacular sights is a smokelike fog
that rises from openings in the arctic ice fields and occasionally appears
above the open waters of unfrozen lakes and harbors in our temperate zone.
Various names for the phenomenon are \*Qfrost smoke\*U, \*Qsea smoke\*U, 
\*Qsteam fog\*U, \*Qwarm water fog\*U, and \*Qwater smoke\*U. The fog is 
caused by the passage of a 
stream of arctic or polar air with a temperature near zero Fahrenheit over
unfrozen water. Within the lower forty\(hyeight states, it occurs principally 
over unfrozen areas of the Great Lakes and over harbor waters of the north 
Atlantic coast. 
.sp 9p
.RT
.PP
\*QSea smoke\*U occurs because the vapor pressure at the surface of
the
water is greater than that in the air above. Water vapor evaporates into the
air faster than the air can accommodate it. The excess moisture condenses 
and forms a layer of fog, like steam or smoke rising off the water. Usually 
a clear space exists between the water's surface and the bottom of the 
fog, and its 
upper limit is generally 10 to 25\ feet. If an atmospheric inversion develops
near the water's surface, the fog may be confined there and becomes thick,
resulting in a hazard to navigation.
.bp
.PP
If the air temperature is severely cold, \(em20\(de |  or below, the rising 
moisture may form ice crystals in the layer of air just above the water. 
This is called \*Qfrost smoke\*U, and it makes a beautiful sight, especially 
when 
sunlight glitters on the thin ice needles.
.PP
\*QSteam fog\*U can occur over lakes and streams in the autumn following 
a clear, still night during which the air has cooled. The differences in 
vapor 
pressures cause the warm water to steam into the cold air, and whole valleys
and basins can be covered with a thin layer of fog while the hillside remains 
clear. 
.RT
.ce 1000
ANNEX\ C
.ce 0
.ce 1000
(to Recommendation P.84)
.sp 9p
.RT
.ce 0
.ce 1000
\fBInstructions on the use of a limited number of sentences\fR 
.sp 1P
.RT
.ce 0
.sp 1P
.ce 1000
(Contribution by the Swedish Telecommunication Administration)
.ce 0
.sp 1P
.PP
If \fIN\fR sentences per talker are used there will be \fIN\fR (\fIN\fR 
\(em1) possible sentence combinations per talker. The first 16 results 
are tabulated below: 
.sp 9p
.RT
.LP
	 \fIN\fR \ 2\ \ 3\ \ 4\ \ 5\ \ 6\ \ 7\ \ 8\ \ 9\ 10\ \ 11\ \ 12\ \ 13\ 
\ 14\ \ 15\ \ 16\ \ 17 
.LP
	\fIN\fR (\fIN\fR \(em1)
	\ 2\ \ 6\ 12\ 20\ 30\ 42\ 56\ 72\ 90\ 110\ 132\ 156\ 182\ 210\ 240\ 272
.PP
Either of two reasons for wanting to limit the number of sentences can 
be put forth: 
.LP
	\(em
	 the wish to save time by not having to author lists of more than 2\(mu85 
sentence combinations per talker. Separate recording of all the 
combinations is of course still needed unless sophisticated editing equipment 
for digital types is at hand, or 
.LP
	\(em
	the need to organize the test in a way that fulfills the
requirements for an analysis of variance.
.PP
Depending on which of the motives above is invoked, different
methods can be adopted. These are:
.LP
	1)
	 All possible \fIN\fR (\fIN\fR \(em1) sentence combinations per talker 
are recorded. 
.LP
	a)
	 The same \fIN\fR sentences are used for all 4 talkers. The same sentence 
pair should then not be used for the same test conditions from talker to 
talker, in order to avoid possible systematic interaction between test 
conditions and phonetic content, or
.LP
	b)
	Four different sets of \fIN\fR sentences (\fIN\fR 1, \fIN\fR 2,
\fIN\fR 3 and \fIN\fR 4) are authored. Then no precautions corresponding to a)
are needed. However, interaction will still be possible and uncontrolled.
.LP
	2)
	To allow for an analysis of variance, subjects must judge
the same speech material for all test conditions and all talkers. The number 
of sentences will then be limited to \fIM\fR \(mu2 where \fIM\fR is the 
number of pairs that will be used in the test. If \fIM\fR \ =\ 1 the test 
may appear too tedious for the 
subjects and the phonetic coverage may be insufficient. If an analysis of
variance is to be justified, and the test is still to be practically possible, 
an expansion of the number of presentations is therefore recommended. \fIM\fR 
\ =\ 2 or 3 should be enough. This will lengthen the test time for each 
subject, but experience shows that tests of 2.5\ hours per subject are 
quite possible. 
Adjustments for such an expansion must then be made when deciding the
presentation order.
.ce 1000
ANNEX\ D
.ce 0
.ce 1000
(to Recommendation P.84)
.sp 9p
.RT
.ce 0
.ce 1000
\fBInstructions to subjects\fR 
.sp 1P
.RT
.ce 0
.LP
D.1
	\fIQuality scale \(em DCME test\fR 
.sp 1P
.RT
.PP
In this test we are evaluating systems that might be used for
telecommunications service between separate places.
.PP
You are going to hear a number of samples of speech reproduced in the earpiece 
of the handset. Each sample will consist of a 30 to 35\ seconds long 
sequence of three or more sentences.
.PP
Please listen to the complete sequence, then indicate your opinion of the 
overall sound quality. If you hear any noises or other interference in 
the pauses before, between or following the sentences you should include 
the effect of this interference in your judgement of the overall quality. 
.bp
.PP
For indicating your opinion you are requested to use the following
5\(hypoint rating scale:
.RT
.LP
.sp 1
\fIScore\fR \fIQuality opinion\fR 5
Excellent
4
Good 
3
Fair 
2
Poor 
1
Bad or Unsatisfactory 
.PP
After listening to a sample sequence, either (1) please write down on your 
response sheet a score, or\ (2) please press the appropriate button 
which on this rating scale represents your opinion of the sound quality 
of the sample just heard. 
.PP
After you have given your opinion there will be a short pause before the 
next sample begins. 
.PP
For practice, you will first hear \*Q\fIn\fR \*U samples and give an opinion 
on each; then there will be a break to make sure that everything is clear. 
.PP
From then on you will have a break after every \*Q\fIk\fR \*U samples. There
will be a total of \*Q\fIt\fR \*U samples in the test. The test will last 
a total of 
about \*Q\fItime\fR \*U hours.
.RT
.sp 1P
.LP
D.2
	\fIListening effort scale \(em DCME test\fR 
.sp 9p
.RT
.PP
In this test we are evaluating systems that might be used for
telecommunications service between separate places.
.PP
You are going to hear a number of samples of speech reproduced in the earpiece 
of the handset. Each sample will consist of a 30 to 35\ seconds long 
sequence of three or more sentences.
.PP
Please listen to the complete sequence, then indicate your opinion of the 
effort required to understand the meaning of the sentences. 
.PP
For indicating your opinion you are requested to use the following
5\(hypoint rating scale:
.RT
.LP
.sp 1
\fIScore\fR \fIListening effort opinion\fR 5
Complete relaxation possible, no effort required 
4
Attention necessary, no appreciable effort required 
3
Moderate effort required 
2
Considerable effort required 
1
No meaning understood with any feasible effort 
.PP
After listening to a sample sequence, either (1) please write down on your 
response sheet a score, or\ (2) please press the appropriate button 
which on this rating scale represents your opinion of the effort required to
understand the meaning of the sample just heard.
.PP
After you have given your opinion there will be a short pause before the 
next sample begins. 
.PP
For practice, you will first hear \*Q\fIn\fR \*U samples and give an opinion 
on each; then there will be a break to make sure that everything is clear. 
.PP
From then on you will have a break after every \*Q\fIk\fR \*U samples. There
will be a total of \*Q\fIt\fR \*U samples in the test. The test will last 
a total of 
about \*Q\fItime\fR \*U hours.
.bp
.RT
.ce 1000
ANNEX\ E
.ce 0
.ce 1000
(to Recommendation P.84)
.sp 9p
.RT
.ce 0
.ce 1000
\fBExamples of other\fR 
\fBsubjective scales\fR 
.sp 1P
.RT
.ce 0
.LP
E.1
	\fIEleven\(hygrade quality scale\fR 
.sp 1P
.RT
.LP
.sp 1
10
\ 9
\ 8
\ 7
\ 6
.
Excellent
.
Good
.
The number 10 denotes a reproduction that is perfectly faithful
to the ideal. No improvement is possible.
.PP
\ 5
\ 4
\ 3
\ 2
\ 1
\ 0
Fair
.
Poor
.
Bad
.
The number 0 denotes a reproduction that has no similarity
to the ideal. A worse reproduction cannot be imagined.
(See IEC Report 268\(hy13, Annex A.)
.sp 1P
.LP
E.2
	\fISeven point quality scale\fR 
.sp 9p
.RT
.LP
.sp 1
\fIScore\fR \fIQuality description\fR 6
Ideal circuit
5
Excellent circuit. Possible to relax completely during call,
very agreeable 
4
Good circuit. Necessary to pay attention, but not necessary
to make a special effort. Agreeable circuit 
.PP
3
Fair circuit. A moderate, but not too great,
effort is necessary. Not a very agreeable circuit 
2
Poor circuit. Listening is possible, but somewhat difficult.
Listening disagreeable 
1
Bad circuit. Can be used only with great difficulty.
Listening very disagreeable 
0
Very bad circuit. Practically unusable 
(See CCIR Report 751, Volume VIII.3, 1986.)
.bp
.sp 1P
.LP
E.3
	\fIFive\(hygrade impairment scale\fR 
.sp 9p
.RT
.LP
	5
	Imperceptible.
.LP
	4
	Perceptible, but not annoying.
.LP
	3
	Slightly annoying.
.LP
	2
	Annoying.
.LP
	1
	Very annoying.
.PP
(See Supplement No. 14, Annex B.)
.sp 2P
.LP
	\fBReference\fR 
.sp 1P
.RT
.LP
[1]
	 LEE and UN: A study of ON\(hyOFF characteristics of conversational speech, 
\fIIEEE Trans. Comm.\fR , Vol.\ COM\(hy34, No.\ 6, June\ 1986. 
.LP
.rs
.sp 40P
.ad r
Blanc
.ad b
.RT
.LP
.bp
.LP
\fBMONTAGE: PAGE 234 = BLANCHE\fR 
.sp 1P
.RT
.LP
.bp