Apple, Inc. v. Motorola, Inc. et al

Filing 92

Declaration of Christine Saunders Haskett filed by Plaintiffs Apple, Inc., NEXT SOFTWARE, INC. re: 90 Motion Requesting Claims Construction (Attachments: # 1 Ex. 1 Moto Infring. Cont. Ex. A, # 2 Ex. 2 '157 patent, # 3 Ex. 3 '179 patent, # 4 Ex. 4 '329 patent, # 5 Ex. 5 '230 file history, # 6 Ex. 6 Oxford dictionary definition, # 7 Ex. 7 '559 file history, # 8 Ex. 8 The OSI Model, # 9 Ex. 9 ISO Standard, # 10 Ex. 10 Japanese file history, # 11 Ex. 11 Japanese prosecution appeal, # 12 Ex. 13 Moto Infring. Cont. Ex. E, # 13 Ex. 14 IEEE Standard, # 14 Ex. 15 '333 patent, # 15 Ex. 16 '721 file history, # 16 Ex. 17 '193 file history, # 17 Ex. 18 Moto Infring. Cont. Ex. F, # 18 Ex. 19 Merriam Webster Dictionary, # 19 Ex. 20 Webster's Dictionary) (Haslam, Robert)

Download PDF

EXHIBIT 4 U'nited States Patent Taguchi [il] Best Available [54] SPEECH ANALYSIS AND SYNTHESIS APPARATUS (75) Inventor: Tetsu Taguchi, Tokyo, Japan . [73] Assignee: Nippon Electric Co., Ltd., Tokyo, Japan [21] [30] Jan. 4, 1979 Foreign Application Priority Data Jan. 9, 1978 [IP] Ian. 9, 1978 [JP] Nov. 10, 1978 [IP] [51] [52] [58] Japan .................................... 53-1282 Japan .................................... 53-1283 Japan ................................ 53-138690 Int. CI) ................................................ GI0L 1/00 U.S. Cl. .................................................. 179/1 SA Field of Search ........................... 179/1 SA, 1 SD [56] References Cited U.S. PATENT DOCUMENTS 3,715,512 4,038,495 Nov. 17, 1981 the normalized predictive residual power falls to low levels, for example in high-pitched speech, the calculation of linear predictor coefficients from the autocorrelation coefficients of the speech sound is stopped when the normalized predictive residual power falls below a predetermined threshhold level. Either a variable stage synthesis fIlter is used having its number of stages determined by the number of linear predictor coefficients actually calculated, or a fixed number of stages can be used and a zero value ftlter stage coefficient supplied to those stages in excess of the number of coefficients calculated. Degradation of speech quality due to quantization and transmission errors can be alleviated by computing the normalized predictive residual power on the synthesis side from the transmitted predictor coefficients and using it· to excite the input to the synthesis filter. In one embodiment especially suitable for high ambient noise conditions, both a sound source and a noise source are employed and two different conversion and window processing channels are provided; one for noise-affected speech and the other for pure noise. Autocorrelations in each channel are performed along with correllations between charmels, and the autocorrelations and correllations are then appropriately combined to provide an autocorrelation coefficient of the speech sound. AppL No.: 942 [22J Filed: COPY 4,301,329 [45] [19] 2/1973 Kelly ................................ 179/1 SA 7/1977 White ............................... 179/1 SA Primary Examiner-Mark E. Nusbaum Assistant Examiner-E. S. Kemeny Attorney, Agent, or Firm-Sughrue, Mion, Zinn, Macpeak & Seas ABSTRACT In order to prevent errors and instability which may occur in a speech analysis and synthesis apparatus when [57] 10 Claims, 7 Drawing Figures Iffi 110 111 Vj!Jv ;16 '-------"'~,40 ADlIOI{cl , 1i5 117 u.s. Patent Nov. 17, 1981 Sheet 1 of 5 4,301,329 FIC; I ffilOR NIT 106 8KHz 1(6 (Ill p (~ {c 100 ~Hz LINEAR ffiEllIcrOR U COCFFlCENT METER 101 KI KZ- ---- Kp V/UV OUANTIZER 110 III DEMODULATffi V/UV 20 FROM 101l0j 127 128 PIlCH u.s. Patent Nov. 17, 1981 4,301,329 Sheet 2 of 5 fit l FOO\1104 S(tl 106 104 103 T I 1 or I~ DIAl} r-- BUFFER ,.. WINDON MEMffiY PIffiID1 LPf PIlCH PICKER FfiU~~I{ol FROJIOlla) v/UV l JUDGING K)5 I AUTO- I 109 CORRELATOR fromjloHbl ---- f UNOO 107---- PREDICTOR COEFRClENT IUER U COORQIIR ~ )08 {UTI It ~IOI(cl QUANTIZER fl(J 1 u.s. Patent Sheet 3 of 5 Nov. 17, 1981 FROM IOlle) 4,301,329 [[MODULATOR 1[3 FILTER STAGE CONTROLLER , '---1 I PITCH 414' 116 IMPlJUf GENERATm 117 FROMIOI(a) 127 128 129 514 501 r+ FIC 5 CORRELATK)l 9J8TRACTOR S@)))) ~ • CI.l • ~ ;,)""i)"'I N(c»)):"'I"""" ~ 405)J)).,) ~)) ('1) :s :JJ) 1"1" z 0 ~ 109 VNi JUDQNG CONTROLLER J-.301 I ~r PICKER 108 10 FROM 101k) --I 'I QLl4NTIZER 106 ...... :J ...... \0 00 VJ ::r ('b ('b ..... .j:::.. 0 """+l FI(J 7 VI 712 700 I,r-l... ~ ! A --;--i I F.'\/ 710 I 704 I 720 i MPUT ~ "" w 0 ....... 'II W .---t-- J 713 740 U-----.. 711 N \0 U.s. Patent 'OL Nov. 17, 1981 103 ly2 LPF 8 BUFFER A7D I~ K:l4 i WINDOW I ~ ----fl!;I MEMCffl PITCH PlO<ER PROCESSOR 109 50Hz. 8KHz (0) SOlffi i::1 V/UV I lOS 10{ TlMr~G ~ ~W ~ AUTO P ffiRRELATOR tt-- ~ 4,301,329 Sheet 5 of 5 {t07 , / LINEAR PI{OICTOO COEFFICIENT METER 3~1 V/Uv ICONTIULER PITCH Kz K, Kn ----- j)~ 110 C(I[R rill 112 } FROM 100(c) DECODER Kn 113 ,-001 K2 KJ - ---- Kjv< ffiWERTER -.l p UI METER I . lu {ro:p V/uv 602 -[ L PfltH I 401 ~---- ~2 I 127,\ FROM IOl{c) rI D/A t 1 LPF I liB ~ ~0N1IDJ£R 1:(1 116 r i I"x' SYNTHESIZING FILTER -.J 40 l Jls A IM~ GENERAlffi ~ 117 0 L WHITE NOiSE GENERATOR 4,301,329 1 SPEECH ANALYSIS AND SYNTHESIS APPARATUS BACKGROUND OF THE INVENTION 5 The present invention relates to a speech analysis and synthesis apparatus and, more particularly, to an apparatus of this type having a digital filter of improved stability for speech synthesis and having minimized deterioration of speech quality and minimized reduction 10 in transmission information arising from transmission error and quantizing error. Further reduction in the frequency band used in the encoding of voice signals has been increasingly demanded as a result of the gradually increasing practice 15 of the composite transmission of the speech-facsimile signal combination or the speech-telex signal combination or the use of multiplexed speech signals for the purpose of more effective use of telephone circuits. In the band reduction encoding, the speech sound is 20 expressed in terms of two characteristic parameters, one for speech sound source information and the other for the transfer function of the vocal tract. In the speech analysis and synthesis technique, the speech waves voiced by a human are assumed to be radiation output 25 signals radiated through the vocal tract which is excited by the vocal cords to function as a speech sound source, and the spectral distribution information equivalent to the speech sound source information and the transfer function information of the vocal tract is sampled and 30 encoded on the speech analyzer side for transfer to the synthesizer side. Upon receipt of the coded information, the synthesizer side uses the spectral distribution information to determine the coefficient of a digital filter for speech synthesis and applies the speech source informa- 35 tion to the digital fllter to reproduce the original speech signal. Generally, the spectral distribution information is expressed by the spectral envelope representative of spectral distribution and the resonance characteristic of 40 the vocal tract. As is well known, the speech sound information is the residual signal resulting from the subtraction of the spectral envelope component from the speech sound spectrum. The residual signal has a spectral distribution over the entire frequency range of 45 the speech sound, and has a complex waveform to represent the residual signal in terms of digitized information is not consistent with the aim of band reduction encoding. In general, however, a voiced sound produced by vibration of the vocal cords is represented by 50 a train of impulses which has an envelope shape analogous to the waveform of the voiced sound and the same pitch as that of the voiced sound while, unvoiced sound produced by air passing turbulently through constrictions in the tract is expressed by the white noise. There- 55 fore, the band reduction of the speech sound information is usually carried out by using the impulse train and the white noise for representing the voiced and unvoiced sounds. As described above, the spectral envelope is used to 60 express the spectral distribution information and to distinguish between the voiced and· unvoiced sounds, while pitch period and sound intensity are employed for the speech sound source information. A spectral variation of the speech wave is relatively slow because the 65 speech signal is produced through motions of the sound adjusting organs such as tongue and lips. Accordingly, a spectral variation for a 20 to 30 msec period can be 2 held constant. For analysis and synthesis purposes, therefore, every 20 msec portion of the speech signal is handled as an analysis segment or frame, which serves as a unit for the extraction of the parameters to be transfer red to the synthesis side. On the synthesis side, the parameters transferred from the analysis side are used to control the coefficients of a synthesizing filter and as the exciting input on the analysis frame-by-analysis frame basis, for the reproduction of the original speech. To extract the above-mentioned, parameters, the so-called linear prediction method is generally used (For details, reference is made to an article titled "Linear Prediction: A Tutorial Review" by JOHN MAKHOUL, PROCEEDINGS OF THE IEEE, VOL. 63, No.4, APRIL 1975). The linear prediction method is based on the fact that a speech waveform is predictable from linear combinations of immediately preceding waveforms. Therefore, when applied to the speech sound analysis, the speech wave data sampled is generally given as p S(n) = } ; 1 aj· Sen - 1) + Uj = Sen) + (I) Un where S(n) is the sam.£le value of the speech voice at a given time point; S(n-i), the sample value at the time point i samples prior thereto; P, the linear predictor; Sn, the predicted value of the sample at the given time point, Un is the predicted residual difference; and ai, the predictor coefficient. The linear predictor coefficient ai has a predetermined relation with the correlation coefficients taken from the samples. It is therefore obtainable recursively from the extraction ofthe correlation coefficients, which are then subjected to the so-called Durbin method (Reference is made to the above-cited article by JOHN MAKHOUL). The linear predictor coefficient ai thus obtained indicates the spectrogram envelope information and is used as the coefficient for the digital filter on the synthesis side. As the parameter representing the spectral envelope of the speech sound, the variation in the cross sectional area of the vocal tract with respect to the distance from the larynx is often employed, the variation meaning the reflection coefficient of the vocal tract and being called the partial autocorrelation coefficient, PARCOR coefficient or K parameter hereunder. The K parameter determines the coefficient of a filter for synthesizing the speech sound. When IK I > 1, the filter is unstable, as is known, so that the stability of the filter can be checked by using the K parameter. Thus, the K parameter is of importance. Additionally, the K parameter is coincident with a K parameter appearing as an interim parameter in the course of the computation by the above-mentioned recursive method and is expressed as a function of a normalized predictive residual power (see the above-mentioned article by J. MAKHOUL). The normalized predictive residual power is defined as a value resulting from dividing u in the equation (1) by the power of the speech sound in the analysis frame. The exposition of the speech analysis and synthesis is discussed in more detail in an article "Speech Analysis and Synthesis by Linear Prediction of the Speech Wave " by B. S. ATAL AND SUZANNE L. HANAVER, The J oumal of the Acoustic Society of America. VoL 50, Number 2 (Part 2), 1971, pp. 637 to 655. The conventional speech analysis and synthesis apparatus of this kind has a very limited computational speed 3 4,301,329 4 due to the limitation on the scale of the apparatus alof a noise-affected speech sound; N, the number of lowed therefor. The arithmetic unit of a limited accusamples of a waveform to be analyzed; and i, the numracy arithmetic such as one based on a limited word ber of each sample. The right side of the above equation length with fixed decimal point is usually employed for is rewritten in the form of the autocorrelation: such apparatus. The normalized predictive residual 5 power is relatively small in the voiced sound with high p(SN)(SNjT = P(S)(S)T - P(l\)(!')T + P(lII)(SN)T periodicity but relatively large in the unvoiced sound with low periodicity, and its value is lower as the anawhere lyzing order is higher (see the article by AT AL et aI, FIG. 5 on page 642, for example). 10 N-l The conventional speech analysis and synthesis appap(S)(S)T = i::O Si· Si + T ratus has a synthesis filter of a fixed number of stages N-I p(N)(N)r = i':O Ni· Ni + T corresponding to the number of order for the linear predictor coefficient. Therefore, when a waveform of N-l p(SN)(N)T = i::O (Si + i'lI) . Ni + T extremely high periodicity, i.e., of clear spectrogram 15 structure, such as the stationary part of a voiced sound, N-l p(l'l)(SN)T = i=:O Ni· (Si + T + Ni +T) is processed, the normalized predictive residual power tends to be smaller than the smallest significant value that can be handled by the above-mentioned limited Generalizing the delay T, P(SN)(SN)T is defined as the first accuracy arithmetic. More definitely, this means that 20 autocorrelation coefficient and the K parameters, which are given as a function of the (P(Slv)(N)r - P(N)(lv)T+ P(N)(SN)T) is defined as the second normalized predictive residual power, tend to be coefficient. Under the IK I > 1, adversely affecting the stability of the synthesis autocorrelation of a speech sound isthis definition, difautocorrelation expressed as a filter. The window processing applied to successive prefixed lengths of sound waveform may help increase 25 ference between the first and second autocorrelation coefficients. the normalized predictive residual power, because the As described above, to obtain the parameter to corwindow length rarely equals an integral multiple of the rectly express only the feature of the speed sound under pitch period of the sound even if it is of high periodicity high ambient noise, -the autocorrelation of the speech and, consequently because the spectral structure of the sound waveform within a single window length has a 30 sound is expressed in terms of the difference between the first and second autocorrelation coefficients. More lower clarity. Such increased normalized predictive specifically, a conventional method employs an acousresidual power may help avoid the above-mentioned tic-to-electrical signal converting unit for noise detecinstability of the synthesis filter. However, the use of tion as well as an acoustic-to-electrical signal convertthe window processing does not necessarily mean an increase in the predictive residual power sufficient to 35 ing unit for speech signal detection. With these units, the acoustic signal from a noise source .and the acoustic contribute to the stability of the synthesis filter, because signal from a speaker are detected as a synthesis acousa high-pitched voice sound, such as a female voice, has tic signal while at the same time only the acoustic signal a sufficient periodicity within a very short window derived from the noise source is detected. Then, the length to lower the predictive residual power. When the linear predictor coefficient for the analysis 40 autocorrelation coefficient of the noise-affected speech sound and the autocorrelation coefficient of. the noise is made to be of high order while the number of stages are measured. Following this, the correlation coefficiof the synthesizing digital filter is reduced to overcome ent between the noise-affected speech signal is measuch difficulty, the approximation of the spectral envesured from the above two kinds of signals. Similarly, the lope of a less stationary speech sound or of the voiced sound having a relatively large predictive residual com- 45 correlation coefficient between the noise and the noiseaffected speech signal is measured. Then, the autocorrepared power with the arithmetic accuracy is considerably reduced, deteriorating the quality of the synthesized lation coefficient of the speech sound signal is measured speed sound. on the basis of the two autocorrelation coefficients, and The calculation of the linear predictor coefficient the linear-predictor coefficient is measured on the basis under a high ambient noise involves errors since the 50 of the autocorrelation coefficient of the speech signal. signal wave to be analysed is the superposition of the In the conventional method, however, when the spatial ambient noise on the speech wave. The spectral envedistances from the noise source to the acoustic to eleclope calculated from the linear predictor coefficient trical signal converters for signal detection and noise affected by the ambient noise is different from the specdetection are different from each other, no linearity or tral envelope of the original speech wave. Under the 55 analogy exists between the input speech signals to both influence of the ambient noise, the linear predictor coefconverting units. Therefore, the relation established ficient must be analyzed to remove the influence by the may be inaccurate among the autocorrelation coefficiambient noise. Such analysis is usually carried out by ent of the speech signal relative to the autocorrelation using an autocorrelation coefficient as follows. The coefficient of the noise-affected speech signal, the autoautocorrelation coefficient p(SN)(SN)T of a noise- 60 correlation coefficient of the noise, the correlation coefaffected speech sound at a delay T is given as ficient between the noise-affected speech signal and the noise, and the correlation coefficient between the noise and the noise-affected speech signal. N-l p(SN) (SN)T = }: (Si + Ni)(Si + T + Ni + T) As a result, there is a possibility that the autocorrelai=O 65 tion coefficient measured of the speech sound at delay T becomes larger than that of the sound· perse. Specifiwhere So, S!, S2, ... are a series of samples of a speech cally, when the autocorrelation value at delay T is norsound wave; no, n I, n2, ... , a series of samples of a noise malized to "1", the autocorrelation value of the speech wave; So+No, SI +N], S2+N2, ... , a series of samples 5 4,301,329 sound measured at delay 7 may be closer to "I", compared to that of the speech sound per se, and, as the case may be, it exceeds "I". 'Vhen the autocorrelation value exceeds "1", the synthesizing filter with the coefficient which is the linear predictor coefficient calculated from the autocorrelation coefficient becomes unstable. This is seen, for example, from the fact that when the linear predictor coefficient is of first degree, the K parameter which is the interim parameter in the calculation of the linear predictor coefficient by the Durbin method exceeds "I". The above-mentioned conventional method to obtain the linear predictor coefficient for the purpose of expressing correctly only the feature of the speech sound under the condition of high ambient noise, has a disadvantage that the speech synthesis filter with the obtained linear predictor coefficient as its coefficient becomes unstable because of the influence of noise. As described above, the conventional method first measures the autocorrelation coefficient of the speech sound on the basis of the autocorrelation coefficient of the noise-affected speech sound, the autocorrelation of noise, the correlation coefficient between the noiseaffected speech sound and noise, and the correlation coefficient between noise and the noise-affected speech sound, and then obtains the linear predictor coefficient depending on the autocorrelation coefficient measured of the speech sound. Evidently, the conventional method suffers from the same disadvantage when the noise source has a spatially large volume, or when the transfer function in the acoustic area ranging from the noise source to the converter for speech sound detection is different from that in the acoustic area from the noise source to the converter for noise detection. In the characteristic parameters of the speech sound obtained on the analysis side, the speech sound source information, particularly the normalized predictive residual power representative of the amplitude information or the complex parameter of a short time average power and a normalized predictive residual power, have a much larger rate of time variation than that of the linear predictor coefficient a or the K parameter. This arises from the fact that, while K parameter representative of the reflection coefficient of the vocal tract depends on the cross sectional area of the vocal tract changing with muscular motion of a human and therefore slowly varies with time, the normalized predictive residual power U as expressed by (2) 6 the original speech sound to the synthesizing filter, the reproducibility of the amplitude is, of course, poor. Specifically, in the conventional apparatus, the linear predictor coefficient is exactly coincident with the nor5 malizedpredictive residual power representative of the spectral envelope of the speech sound on the analysis side, while, on the synthesis side, the normalized predictive residual power is largely influenced by the above errors but the linear predictor coefficient is little ef10 fected by errors. Therefore, the speech sound synthesized by using both the factors is poor in amplitude reproducibility. SUMMARY OF THE INVENTION 15 20 25 30 35 40 45 Accordingly, an object of the invention is to provide a speech analysis and synthesis apparatus capable of making speech analysis and synthesis with high stability even when the nomalized predictor residual power is below the limited accuracy of the apparatus as in the stationary part of voiced sound stationary part. Another object ofthe invention is to provide a speech analysis and synthesis apparatus which is stably operable even under high ambient noise. Still another object of the invention is to provide a speech analysis and synthesis apparatus which can compensate for deterioration of the amplitude reproducibility due to quantization error and transmission error and is capable of making speech analysis and synthesis with high stability even when the amount of information to be transmitted is little. According to the invention, the normalized residual power obtained on the analysis side is monitored and when it falls below a predetermined value, the synthesis filter is controlled to be the number of stages corresponding to the orderin such a case or the linear predictor coefficient with higher order than that is transmitted with zero to thereby eliminate the instability of the synthesis filter. Further, the normalized residual power is obtained from the linear predictor coefficient on the synthesis side and is used to excite the synthesis filter to thereby prevent speech quality from being degraded due to quantization error and transmission error. In one embodiment especially suitable for high ambient noise conditions, both a sound source and a noise source are employed and two different conversion and window processing channels are provided; one for noise-affected speech and the other for pure noise. Autocorrelations in each channel are performed along with correllations between channels, and the autocorrelations and correllations are then appropriately combined to provide an autocorrelation coefficient of the speech sound. Other objects and features of the invention will be apparent from the following description taken in conjunction with the accompanying drawings, in which: 50 where Ki is the K parameter of i-th order and p is the number of order, is affected by the amplification of all the changes of the respective Ki's and therefore its variation is complicated and steep. For this reason, in the analysis of the parameter in- 55 eluding the normalized predictive residual power, the analysis frame length must be set shorter than that ofthe BRIEF DESCRIPTION OF THE DRAWINGS analysis frame required for analyzing the other parameFIG. 1 shows a blo(;;k diagram of an ordinary speech ters such as the linear predictor coefficient and the like, 60 analysis and synthesis apparatus; resulting in the increase of transmission capacity. FI G. 2 shows a block diagram of a part of the circuit Since the time variation of the parameters including shown in FIG. 1; the normalized predictive residual power is signficant, FIG. 3 shows a block diagram of the analysis side of the parameters are easily influenced by transmission a speech analysis and synthesis apparatus according to error due to external and internal causes in the course of the transmission. Further, when the parameters are 65 the invention; FIG. 4 shows a block diagram of the synthesis side of quantized they involve quantization error. When the the speech analysis and synthesis apparatus according to normalized predictive residual power influenced by such errors is applied as the amplitude information of the invention; 7 4,301,329 8 normalized predictive residual power U to an amplitude FIG. 5 shows a block diagram of the analysis side of signal meter 108. the apparatus which is another embodiment according The amplitude signal meter 108 measures an exciting to the invention; amplitude as VU.P from the short time average power FIG. 6 shows a block diagram of a speech analysis and synthesis which is another embodiment of the in- 5 P supplied from the autocorrelator 105 and the normalized predictive residual power U supplied from the vention and includes the analysis side and synthesis side; linear predictor coefficient meter 107 and supplies the and measured exciting amplitude to the quantitizer 110. FIG. 7 shows a block diagram of another example of The pitch picker 106 measures the pitch period from a speech synthesizing digital filter. 10 the speech voiced wave representing word code supDESCRIPTION OF THE PREFERRED plied from the window processing memory 104 by a EMBODIMENTS known autocorrelation method, or the· Cepstrum method, as described in an article "Automatic Speaker Reference is first made to FIG. 1 illustrating an ordiRecognition Based on Pitch Contours" by B. S. Atal, nary speech analysis and synthesis apparatus. In operation, a speech sound signal is applied through a wave- 15 Ph D thesis Polytech. Brooklyn (1968) and in an article "Cepstrum Pitch Determination" by A. M. Noll, J. form input terminal 100 to an analog to digital (A-D) Acoust. Soc. Amer., Vol. 41, pp. 293 to 309, Feb. 1967. converter 102. In the A-D converter 102, a high freThe result of the measurement is applied as the pitch quency component of the speech sound signal is filtered period information to the quantitizer 110. out by a low-pass filter with a cut-off frequency of 3,400 A voic,d/unvoiced jUdging unit 109 judges voiced or Hz and the speech signal filtered out is sampled by 20 unvoiced signal by a well known method using paramesampling pulses of 8,000 Hz derived from terminal (a) of ters such as K parameters measured by the linear pretiming source 101. The sampled signal is then converted dictor coefficient meter 107, and the normalized predicinto a digital signal with 12 bits per one sample for tive residual power. This method is discussed in detail in storage in a buffer memory 103. The buffer memory 103 temporarily stores the digitized speech wave for ap- 25 an article "A Pattern Recognition Approach to VoiceUnvoiced-Silence Classification with Application to proximately one analysis frame period (for example, 20 Speech Recognition", IEEE TRANSACTION ON msec) and supplies the speech wave stored for every ACOUSTIC, SPEECH, AND SIGNAL PROCESSone analysis frame period to a window processing memING, VOL. ASSP-24,.No. 3, June 1976. ory 104, in response to the signal from the output termiThe quantitizer 110 quantitizes K parameters Kl, K2 nal (b) of the timing source 101. The window process- 30 . . . Kp supplied from the linear predictor coefficient ing memory 104 includes a memory capable of storing measuring unit 107, the exciting amplitude information the speech wave of one analysis window length, for v'lJ:j> fed from the amplitude signal meter 108, the example, 30 msec,· and stores the speech wave of the judging information supplied from the voice/unvoiced total of 30 msec; 10 msec of the speech wave transferred from the buffer memory 103 in the preceding frame, the 35 judging unit 109, and the pitch period information fed from the pitch picker 106, into 71 bits. With one bit 10 msec part being adjacent to the present frame, and derived from the output terminal (c) of the timing the whole speech wave in the present frame transferred source 101 added to the 71 bit code for the transmission from the buffer memory 103. The window processing frame synchronization, the quantization output is transmemory 104 then multiplies the speech wave stored by a window such as the Hamming window and then ap- 40 mitted in the form of 72 bit transmission frames through a transmission line 111. plies the multiplied speech wave to an autocorrelator The transmission line 111 is capable of transmitting 105 and a pitch picker 106. data of 3600 bits/sec, for example, and leads the data of The autocorrelator 105 calculates an autocorrelation each 72 bit frame and 20 msec frame period, i.e., of 3600 coefficient in delay T from a delay 1, for example, 125 JLsec to a delay p, for example, 1250 JLsec (P= 10), by 45 Baud, to a. demodulator 112. The demodulator 112 detects the frame synchronizusing a speech wave representative of word code in ing bit of the data fed through the transmission line 111, accordance with the following equation (3): and delivers demodulated K parameters to a K/a converter 113, the exciting amplitude information to a mulN-I-T (3) SO tiplier 114, the voiced/unvoiced decision information to l: Si·Si+T i-O a switch 115, the pitch period information to an impulse PT = -..!..::;!:l\:,.,',---,l---generator 116. l: Si2 i=O The impulse generator 116 generates a train of impulses with the same period as the pitch period obtained Further, the autocorrelator 105 supplies to an amplitude 55 from the pitch period information and supplies it to one signal instrument 108 the energy of the speech wave of the fixed contacts of the switch 115. A noise generacode word within one window length, that is, short tor 117 generates· white noise for transfer to the other time average power fixed contact of the switch 115. The switch 115 couples the impulse generator through the movable contact N-l 60 with the multipler 114, when the voiced/unvoiced l: sP. judging information indicates the voiced sound. On the i=O other hand, when the jUdging information indicates the A linear predictor coefficient instrument 107 meaunvoiced sound, the switch 115 couples the noise genersures K parameter of p and the normalized predictive ator 117 with the multiplier 114. residual power U from the autocorrelation coefficient ·65 The multiplier 114 multiplies the impulse train or the supplied from the autocorrelator 105 by the method white noise passed through the switch i15 by the excitknown as an autocorrelating method and distributes the ing amplitude information, i.e., the amplitude coefficiK parameters measured to a quantitizer 110 and the ent, and sends the multiplied signal to an. adder 118. The 9 4,301,329 10 ler 301.and to an amplitude signal instrument 108. The adder 118 provides a summation of the output signal from the mUltiplier 114 and the signal delivered from an controller 301 checks to determine whether the normaladder 120 and delivers the sum to a one-sample-period ized predictive residual power is larger than a predeterdelay 121 and a digital"to analog (D-A)converter 127. mined value or not, corresponding to the limited accuThe delay 121 delays the input signal by one sampling 5 racy of the apparatus. When it is smaller than the predeperiod of the A-D converter 102 and sends the output termined value, a calculation stop signal is applied to signal to the multiplier 124 and to a one-sample-period linear predictor coefficient instrument 107. Upon redelay 122. Similarly, the output signal of the one-samceipt of the calculation stop signal, the linear predictor pIe-period delay 122 is applied to a multiplier 125 and coefficient instrument 107 stops its calculation. When the next stage one-sample-period delay. In a similar 10 no calculation stop signal is applied thereto, it calculates manner, the output of the adder 118 is successively the linear predictor coefficient of the second order and delayed finally through one-sample-period delay 123 the normalized predictive residual power of the second and then is applied to a multiplier 126. order by using the autocorrelation coefficient representTne multiplier factors of the multipliers 124, 125 and ing the waveform reproducibility, the predictor coeffi126 are determined by a parameters supplied from Kia IS cient of the first order, and the normalized predictive converter 113. The result of the multiplication of each residual powder of the first order. Succeedingly, the multiplier is successively added in adders 119 and 120. instrument 107 recursively calculates the linear predicThe Kia converter 113 converts K parameters to linear tor coefficient until the controllel' 301 produces the predictor coefficients al, az, a3, ... ap by the recursive calculation stop signai. Alternatively, the maximum method mentioned above, and delivers at to the multi- 20 predictor order Nt may be present to thereby stop the plier 124, a2 to the multiplier 125, . . . and ap to the calculation of the coefficient measuring unit 107 automultiplier 126. matically when it calculates the maximum one Nl, reThe adders 118 to 120, the one-sample delays 121 to gardless of the calculation stop signal, preventing the 123, and the multipliers 124 to 126 cooperate to form a need for the increased order number for the linear prespeech sound synthesizing filter. The synthesized 25 dictor coefficient. speech sound is converted into analog form by the D-A If the measuring unit 107 stops its calculation after converter 127 and then is passed through a low-pass calculating the linear predictor coefficient of the N2 filter 128 of 3400 Hz so that the synthesized speech order, the N2 order linear predictor coefficient is apsound is obtained at the speech sound output terminal 30 plied to a variable sage synthesis filter 40 in the synthe129. sis side shown in FIG. 4. The controller 301 applies a In the circuit thus far described, the speech analysis variable filter control signal for controlling the number part from the speech sound input terminal 100 to the of the fIlter stages corresponding to the N2 order to the quantitizing circuit 110 may be disposed at the transmitvariable stage synthesis filter 40. The filter coefficient of ting side, the transmission line 111 may be constructed by an ordinary telephone line, and the speech synthesis 35 the filter 40 is controlled by the linear predictor coefficient of the N2 order and the number of filter stages of part from the demodulator 112 to the output terminal the fIlter 40 is controlled by the variable stage filter 129 may be disposed at the receiving side. control signal. Under such controls, the filter 40 is exThe autocorrelation measuring unit shown in FIG. 1 cited by an exciting signal and produces a synthesized may be of the product-summation type shown in FIG. 2. With S(O), S(l), ... S(N-l) for the speech wave code 40 speech sound signal to the D-A converter 127. As shown in FIG. 4, synthesis filter 40 is comprised of an words which are input signals to the window processadder 118, adders 410 to 414 of the same number as the ing memory (in the designation, N designates the numfilter stage number n previously set, multipliers 420 to ber of sampling pulses within one window length), 424, one-sample delays 430 to 434 and switches for wave data 8(t) corresponding to one sampling pulse and controlling the number of filter stages. A control signal another wave data S(t+2) spaced by i sample periods 45 fed from the controller 301 on the analysis side is defrom the wave data 8(t) are applied to a multiplier 201 modulated by a demodulator 112 on the synthesis side of which the output signal is applied to an adder 202. and is then sent to the filter stage controller 401. The The output signal from the adder 202 is applied to a controller 401, in response to the control signal, turns register 203 of which the output is coupled with the other input of the adder 202. Through the process in the 50 on switches SWo to SWn2 (in the drawing, SW4 is expressed SWn2) and turns off the remaining switches. instrument shown in FIG. 2, the numerator components With respect to the coefficient ofthe synthesis filter, the of the autocorrelation coefficient PT shown in Eq. (3) K parameter of the N2 order calculated on the synthesis are obtained as the output signal from the coefficient side is converted into an a parameter by the K-a conmeasuring unit 105 (the denominator component, i.e., the short time average power, corresponds to the out- 55 verter 113. The a parameter of the N2 order is applied to the corresponding multiplier 420 to 424. In the drawput signal at delay 0). The autocorrelation coefficient PT ing, the a parameter corresponding to the N2 order is is calculated by using these components in accordance applied to the multiplier 423 for setting the filter coeffiwith the equation (3). cient. In place of the arrangement having the measuring Turning now to FIGS. 3 and 4, there are shown block diagrams of the analysis side and the synthesis side in 60 unit 107 supplying the linear predictor coefficient of the N2 order and the controller 301 supplying the variable the apparatus of the invention. In these drawings, like stage synthesis filter control signal to the synthesis filreference numerals denote like parts or portions in FIG. ter, the linear predictor coefficient of the N3 order can 1. Linear predictor coefficient instrument 107 calculates be always transferred and the linear predictor coefficithe linear predictor coefficient of the first order and the normalized predictive residual power from the autocor- 65 ents from the (N2)+ 1 to the N3 order set to zero. In this alternative, the use of the fixed stage synthesis filter of relation coefficient representing the reproducihilityof a waveform delivered from the autocorrelator 105. The n3 stages can attain approximately the same effect as normalized predictive residual power is fed to a controlthat attained by the variable stage synthesis filter. 11 4,301,329 In the above-mentioned example according to the invention, when the normalized predictor residual power of the high order exceeds the accuracy range in the limited accuracy arithmetic because of high predictivity, as in the stationary part of voiced sound, the control 301 detects this to stop the calculation of the linear predictor coefficients of the superfluous order. The filter stage control signal is used corresponding to the order where the normalized predictive residual power is within the accuracy range of the apparatus. Further, the linear predictive coefficient of a higher order than that limiting order is treated as zero. For this, the speech sound may be stably synthesized at all times. Turning now to FIG. 5, there is shown another embodiment of the sppech analysis and synthesis apparatus according to the invention which is operable stably even under high ambient noise. FIG. 5 illustrates in block form the construction of the analysis side as in FIG. 3. In the figure, like reference numerals denote like structural elements shown in FIG. 3. An acoustic signal generated by a noise source 405 is applied to an acoustic-to-electrical signal converter 501 and to another similar type converter 502, each of which may be a microphone. The converter 501 converts a signal mixed with acoustic signals generated by a speech sound and noise source N into an electrical signal an supplies the converted electrical signal to a window processing memory 503, through an A-D converter 102 and a buffer memory 103. The converter 502 converts the acoustic signal from the noise source into an electrical signal which in turn is applied to the window processing memory 503. The window processor 503 segments an electrical signal into windows such as rectangular windows or Hamming windows, and stores the segmented signals and produces the stored data at the fixed delay speech sound output terminal 505 and the variable delay speech sound output terminal 506. The window processing memory 504 segments an electrical signal derived from the converter 502 into windows such as rectangular windows or Hamming windows, stores the segmented signals therein and then produces them at the fixed delay noise output terminal 507 and the variable delay noise output terminal 508. Correlation instrumental memories 509 to 512 measure the correlation coefficients from delay 0 to T and store them therein. The correlation instrumental memory 509 measures the autocorrelation coefficient of a noise-affected speech sound signal from delay 0 to T by using a noiseaffected speech sound signal which is derived from the fixed delay speech sound output terminal and has no delay relative to the output signal derived from the variable delay speech sound output terminal 506, and by using a noise-affected speech sound signal which is derived from terminal 506 and has delays from 0 to T relative to the output signal from the output terminal 505. The correlation instrumental memory 509 then stores the autocorrelation coefficient measured. Similarly, the remaining correlation instrumental memories 510 to 512 each measure the autocorrelation coefficient of noise from delay 0 to T by using the correlation coefficient between a noise-affected speech sound and noise and the correlation coefficient between noise and a noise-affected speech sound. Each memory stores the autocorrelation coefficient measured. A correlation adder/subtractor 513 performs the following calculation on the three kinds of the correlation coefficients with respect to delay from 0 to T; (correlation coeffici- 5 10 15 20 25 30 35 40 45 50 55 60 '65 12 ent between a noise-affected speech sound and noise)+(correlation coefficient between noise and a noiseaffected speech sound) - (autocorrelation coefficient of noise). The adder/subtractor 513 then applies to result of the calculation as the second autocorrelation coefficient to a correlation subtractor 514. The correlation subtractor 514 is supplied with the autocorrelation coefficient of the noise-affected speech sound stored in the correlation instrument 509. The autocorrelation coefficient in this case is treated as a first correlation coefficient. Then subtracted from the first correlation coefficient is a second correlation coefficient linearly, nonlinearly or linearly in weighted manner. The result of the subtraction is applied as a third correlation coefficient to a linear predictive coefficient calculator 107. The subtracting method in nonlinear manner or in linear but weighted manner may be enumerated below: Third correlation coefficient = first correlation coefficient-f (first correlation coefficient at delay 0, second correlation coefficient at delay O)Xsecond correlation coefficient Third correlation coefficient = first correlation coefficient-f (T)Xsecond correlation coefficient Third correlation coefficient = first correlation coefficient-f (first correlation coefficient at delay 0, second correlation coefficient at delay O,T)Xsecond correlation coefficient where 7 represents a delay ranging from 0 to T; f (first correlation coefficient at delay 0, second correlation coefficient at delay 0) is a function expressed by by m1-m2. exp (-m3xsecond correlation coefficient at delay O/first correlation coefficient at delay 0); K1 to K3 are constants; f(7) is a function which monotonously increases with 7 and satisfies the relation O<f(0)<f(7)21; and f(first correlation coefficient at delay 0, second correlation at delay 0,7) is a function expressed by f(first correlation coefficient at delay 0, second correlation coefficient at delay O)Xf (7). The linear predictor coefficient measuring unit 107 measures the next predictor coefficient and the normalized predictive residual power in a similar manner as described relating to FIG. 1, by using the third correlation coefficient representing the autocorrelation coefficient of a speech sound. The normalized predictive residual power is applied to the controller 301. The controller 301judges whether the normalized predictive residual power is larger than a predetermined value, for exmaple, zero or a minute positive value. When the predictive residual power is below the predetermined value, there is a high possibility that the stability of the synthesis filter is deteriorated. Therefore, a calculation stop signal is applied to the linear predictive coefficient instrument 107. Upon receipt of the stop signal, the linear predictive coefficient instrument 107 stops its calculation. When no stop signal is applied to it, it calculates the linear predictor coefficient of the second order and the normalized predictive residual power by using the linear predictor coefficient of the first order and the normalized predictive residual power. Successively, the calculator 107 continues its calculation of the linear predictive coefficient until the controller 301 produces a calculation stop signal. As in the case of FIGS. 3 and 4, modification is possible in which the maximum predictive order Nl is previously set and the linear predictor coefficient calculator 107 is automatically stopped after the maximum predictive order N1 is calculated, regardless of the calculation stop signal, thereby eliminating unnecessary increase of the order of the linearpredic- 13 4,301,329 14 tive coefficient. When the calculation is stopped by the stages. Under such controls, the filter 40 is excited by an calculation stop signal after the linear predictor coefficiexciting signal. Consequently, a synthesized speech ent of the N2 order, the linear predictive coefficient of sound signal is obtained from the output of the low-pass filter 128. In case where the linear predictive coefficient the N2 order is applied to the variable stage synthesis . filter. 5 demodulated are those other than the K parameters The controller 301 supplies a variable filter control (partial autocorrelation coefficient), the normalized predictive residual power instrument 602 can obtain the signal to a variable synthesis filter as shown in FIG. 4. The filter coefficient of the variable synthesis filter is normalized predictive residual power by means for controlled by the linear predictive coefficient of the N2 converting them into the partial autocorrelation coeffiorder and the number of the filter stages is controlled by 10 cients or another equivalent means. The transmission the variable stage synthesis filter control signal. The parameters such as the short time average power transvariable stage synthesis filter is excited by the filter mitted from the analysis side are also affected by the exciting signal and produces a synthesis speech sound condition of the transmission line. The time variation of signal. As in the previous example, in place of the arthe short time average power is gentle, compared to rangement that the linear predictive coefficient instru- 15 that of the normalized predictive residual power. Acment 107 applies the linear predictive coefficient of the cordingly, if it is smoothed in the receiving side, it has N2 order and the controller 301 applies the variable little effect on the quality of the synthesis sound. Therestage synthesis filter control signal to the variable stage fore, transmission error may easily be alleviated, withsynthesis filter, the linear predictive coefficient of the out being contrary to the object of the invention. N3 order can be transferred at all times, and the linear 20 Obviously, the present invention is applicable to the predictive coefficient from (N2)+ 1 to N3 order can be linear predictor speech sound analysis and synthesis treated as zero. Under this condition, the use of a fixed apparatus of the voice exciting method (see. B. S. stage synthesis filter of the N3 order can attain approxiAPAL, M. R. SCHROEGER, V. STOVR, BELL TELEPHONE LABORATORIES MURRAY HILL, mately the effect as that attained by using the variable stage synthesis filter. In this example, when the noise 25 N.J. 07974 "Voice Excited Predictive Cording system power of two acoustic-to-electric converters are differforLow Bit Rate Transmission of Speech" IEEE CATent, the output signal of one or both of the converters ALOG NUMBER 75 CH0971-2SCB ICC75. JUNE 16 may be adjusted by using an amplifier or an attenuator to 18), since the present invention is not directly related so that both the outputs are coincident to each other. to the transmission method of the speech sound source Another embodiment of the invention which can 30 information. alleviate the deterioration of the amplitude reproducIn a speech analysis and synthesis apparatus of the ibility of a synthesis speech signal due to transmission predictive residual wave exciting method (see, error and quantitizing error, will be described referring CHONG KWAN UN, ~"lD D. THOMAS MAGILL to FIG. 6, which shows in block form the construction ''The Residual-Excited Linear Prediction Vocoder of the analysis side and the synthesis side. Like refer- 35 with Transmission Rate Below 9.6 K bits/s" IEEE ence numerals denote like structural elements in the Transactions on Communications, Vol. COM-23, No. previous embodiments. 12, December 1975), the predictive residual waveform In this example, the short time average power obis divided by the normalized predictive residual power tained by the correlation measuring unit 105 ori the on the analysis side, and the amplitude variation range analysis side is directly applid to quantiiizer 110 where 40 of the predictive residual waveform is compressed and is is quantized, and the quantized signal is transmitted to then is transmitted to the synthesis side. On the synthethe synthesis side. In this case, the normalized predicsis side, the predictive residual waveform is multiplied tive residual power obtained by the linear predictive by the normalized predictive residual power calculated coefficient measuring unit t07 is not transmitted. Confrom the linear predictive coefficient so that it is possitroller 30t stops the calculation of the linear predictor 45 ble to prevent the amplitude reproducibility of the syncoefficients of higher order when normalized predictive tnesized speech sound deteriorated by the transmission residual power supplied from measuring unit 107 falls error of the linear predictor coefficient. below a predetermined value, and transmits a control As described above, in this embodiment, the synthesignal representative of the order of the last linear presizing filter is excited by the normalized predictive redictive coefficien,t obtained before the calculation is 50 sidual power obtained from the linear predictor coeffistopped. On the other hand, the synthesis side receives cient which are affected by quantitizing error and transmission error so that the relation betwen the linear preand demodulates K parameters including quantization error or transmission error which are transmitted from dictor coefficients and the normalized predictive residthe analysis side. The demodulated signal is applied to ual power is not greatly damaged, unlike the conventhe normalized predictive residual power (NPRP) in- 55 tional apparatus of this kind. Since there is no need for strument 601. The instrument 60t measures the normaltransmission of the normalized predictive residual ized predictive residual power in accordance with the power, the amount of information to be transmitted is equation (2) and applies the result of the measurement reduced accordingly. When the linear predictive coeffito the amplitude signal instrument 602. Thus, compocient in the analysis frame period shorter than the analynents 601 and 602 actually serve as part of the analysis 60 sis frame on the analysis side is interpolated on the synportion of the system although located with the synthethesis side by using the transmitted linear interpolated sis ponion. The instrument 602 measures the exciting by using the linear predictor coefficients, the amount of amplitude by using the short time average power P and information to be transmitted can be reduced and the normalized predictive residual power U, through the synthesized speech quality may be improved. operation of IV. P. The filter stage controller 401 turns 65 Although the speech synthesizing filter used in the on all the switches and turns off the remaining switches above examples is constructed by a recursive filter with included in the filter 40 as shown in FIG. 4, in response the coefficient of determined by a parameters, it may be to the control signal for controlling the number of filter replaced by a lattice type filter with the coefficient 15 4,301,329 determined by K parameters. An example of the use of the lattice type filter is illustrated in FIG. 7. As shown, the synthesizing filter is comprised of one-sample delays 701 to 703, multipliers 704 to 709 and adders 710 to 715. A first stage filter 730 with the coefficient of K parameters Kl of the first order, a second stage filter 740 with the coefficient of K parameters K2 of the second order, and a P-th stage filter 740 with the coefficient of K parameter Kp of the Pth order are connected in cascade fashion to constitute the filter. An exciting signal is applied to the adder 714 in the final stage filter 750 and the synthesized speech sound is outputted from the input of the first stage one-sample delay 701. What is claimed is: 1. A speech analysis and synthesis apparatus including a speech analysis part and a speech synthesis part, in which said speech analysis part comprises: means for converting a speech sound into an electrical signal; a filter for removing the frequency components of the electrical signal higher than a predetermined frequency; an analog to digital converter for converting into a train of digital code words the otuput of said filter; a memory for temporarily storing a given-length segment of the digital code word train during a predetermined frame period; a window processor supplied with said code word read out from said memory for each predetermined frame period for window processing it and for storing the result of window processing; autocorrelation means for determining the autocorrelation coefficient for each of said code words included in said one frame period; calculating means for receiving the output of said autocorrelation means and calculating and providing a series of linear predictor coefficients of successively higher order representative of the spectrum information of said speech sound and a normalized predictive residual power forming speech sound source information of said speech sound; a controller coupled to said calculating means for stopping the calculation of said linear predictor coefficients with higher order when said normalized predictive residual power falls below a predetermined value while at the same time transmitting a control signal representative of the order of the last linear predictor coefficient obtained before the calculating is stopped; sound information means for generating sound information signals representing characteristics such as the voiced/unvoiced condition, the amplitude or the pitch of said code words; means coupled to said calculating means, controller and sound information means for qunatizing said linear predictor coefficients, said sound information signals and said control signal for transmission; and in which said synthesis part comprises: combining means for generating a filter input signal from said sound information signals; a synthesizing digital filter receiving as an input said filter input signal having a coefficient determined by said linear predictor coefficients, the number of stages of said filter being variable; means for controlling the number of stages of said synthesizing digital filter corresponding to the order of the last linear predictor coefficient obtained in the analysis part; and 5 10 15 20 2S 30 35 40 45 50 55 60 65 16 means for converting the output signal of said synthesizing digital filter into an analog signal. 2. A speech analysis and synthesis apparatus including a speech analysis part and a speech synthesis part in which said speech analysis part comprises: means for converting a speech sound into an electric signal; a filter for removing the frequency components of the electrical signal higher than a predetermined frequency; an analog to digital converter for converting into a train of digital code words the output of said filter; a memory for temporarily storing a given length segment of the digital code word train during a predetermined frame period; a window processor supplied with said code word read out from said memory for each predetermined frame period for window processing it and for storing the result of window processing; autocorrelation means for determining the autocorrelation coefficient for each of said code words included in said one frame period; calculating means for receiving the output of said autocorrelation means and for calculating and providing a series of linear predictor coefficients of successively higher order representative of the spectrum information of said speech sound and a normalized predictive residual power forming speech sound source information of said speech sound; control means coupled to said calculating means for producing a control signal representative of zero for the value of any linear predictor coefficients with higher order than the last calculated linear predictor coefficient when said normalized predictive residual power falls below a predetermined value; sound information means for generating sound information signals representing characteristics such as the voiced/unvoiced condition, the amplitude or the pitch of said code words; means coupled to said calculating means,' control means and sound information means for quantizing said linear predictor coefficients, said sound information signals and said control signal for transmission; and in which said synthesis part comprises: combining means for generating a filter input signal from said sound information signals; a synthesis digital filter receiving said filter input signal as its input and having ;t filter coefficient determined by said linear predictor coefficients, said synthesizing digital filter comprising a plurality of stages of successively higher order corresponding to the orders of said linear predictor coefficients and each said filter stage receiving as its filter stage coefficient the linear predictor coefficient having the same order, whereby filter stages of order higher than the order of said last calculated linear predictor coefficient receive a zero value filter stage coefficient; and means for converting the output signal of said synthesizing digital filter into an analog signal. 3. A speech analysis and synthesis apparatus according to claim 1 or 2, in which the highest order of linear predictor coefficient to be obtained is predetermined. 4. A speech analysis and synthesis apparatus including a speech analysis part and a speech synthesis part in which said speech analysis part comprises: 17 4,301,329 18 a first converting means for converting a noisea first converting means for converting a noiseaffected speech sound signal into an electrical sigaffected speech sound signal into an electrical signal, sampling with given sampling pulses the elecnal, sampling with given sampling pulses the electrical signal for conversion into a train of first digitrical signal for conversion into a trai,n of first digital code words, and subsequently window processtal code words, and subsequently window process- 5 ing these first digital code words for every predeing these first digital code words. for every predetermined frame period; termined frame period; a second converting means for generating an electria second converting means for generating an electrical signal representative of noise-affecting said cal signal representative of noise affecting said speech sound signal, sampling the noise-representspeech sound signal, sampling the noise-represent- 10 ing signal for conversion into a train of second ing signal for conversion into a train of second digital code words, and subsequently window prodigital code words, and subsequently window processing these second digital code words for every cessing these second digital code words for every predetermined frame period; predetermined frame period; a first autocorrelation measuring means coupled to a first autocorrelation measuring means (509) for 15 said first converting means for producing as a first producing as a first autocorrelation coefficient the autocorrelation coefficient with respect to the outautocorrelation coefficient the autocorrelation coput signal of said first converting means; efficient with resp.ect to the output of said first a second autocorrelation measuring means (512) for converting means; producing a second autocorrelation coefficient 20 a second autocorrelation measuring means coupled to with respect to the output signal of said second said second converting means for producing a secconverting means; ond autocorrelation coefficient with respect to the a correlation means (510,511) for producing a pair of output signal of said second converting means; a correlation means for producing a pair of correlacorrelation coefficients with respect to the output 25 tion coefficients with respect to the output of said of said first and second means; a correlation subtractor (513,514) for subtracting the first and second converting means; . output of said correlation means from that of said a correlation subtractor for subtracting the outputs of first and second autocorrelation means; said correlation means from that of said first and second autocorrelation means; calculating means responsive to outputs from said 30 calculating means responsive to outputs from said correlation subtractor for calculating and providcorrelation subtractor for calculating and providing a series of linear predictor coefficients of successively higher order representative of the specing a series of linear predictor coefficients of suctrum information of said speech sound and a norcessively higher order representative of the spectrum information of said speech sound and a nor35 malized predictive residual power; malized predictive residual power; a controller coupled to said calculating means for stopping the calculation of any linear predictor means for producing as zero the value of any linear coefficient of higher order in response to said norpredictor coefficient with higher order when said malized predictive residual power falling below a normalized predictive residual power falls below a predetermined value while transmitting at the same 40 predetermined value; time a control signal representative of the order of sound information means for generating sound information signals representing characteristics such as the last linear predictor coefficient obtained before calculation is stopped; the voiced/unvoiced condition, the amplitude or sound information means for generating sound inforthe pitch of said code words; means for receiving and quantizing for transmission mation signals representing characteristics such as 45 the voiced/unvoiced condition, the amplitude or said· linear predictor coefficients and said sound the pitch of said code words; information signals; and in which said synthesis means coupled to said calculating means, said conpart comprises: troUer and said sound information means for quancombining means for generating a filter input signal tizing said linear predictor coefficients and said 50 from said sound information signals; sound information signals and said control signal a synthesizing digital filter receiving said filter input signal as its input and comprising a plurality of for transmission; and in which said synthesis part comprises: stages of successively higher order corresponding combining means for generating a filter input signal to .the orders of said linear predictor coefficients from said sound information signals; and each said filter stage receiving as its filter stage 55 a synthesizing digital filter receiving said filter input coefficient the linear predictor coefficient having signal as its input and having a fIlter coefficient the same order, whereby filter stages of order determined by said linear predictor coefficients, higher than the order of said last calculated linear the number of stages of said filter being variable; predictor coefficient receive zero value filter stage means for controlling the number of stages of said 60 coefficients; and synthesizing digital filter corresponding to the means for converting the output signal of said syntheorder of said last calculated liner predictor coefficisizing digital filter into an analog signal. ent obtained in the analysis part; and 6, A speech analysis and synthesis apparatus accordmeans for converting the ouptut signal of said syntheing to claim 4 or 5, further comprising means for adjustsizing digital filter into an analog signal. 65 ing the output signals from said first and second con5. A speech analysis and synthesis apparatus includverting means so that the noise component outputs from ing a speech analysis part and a speech synthesis part in said first and second converting means are equal to each which said speech analysis part comprises: other. 19 4,301,329 20 7. A speech analysis and synthesis apparatus accorda recursive filter having a filter coefficient which is ing to claim 4 or 5 wherein said correlation subtractor is determine4 by a plurality of filter stage coefficients in a a subtractor for nonlinearly subtracting. linear predictive method and in which the data value at 8. A speech analysis and synthesis apparatus accordany point in time is linearly predictable from past data ing to claim 1, 2, 4, or 5 further comprising in the syn- 5 values. thesis part means for calculating a normalized predic10. A speech analysis and synthesis apparatus accordtive residual power from said linear predictor coefficiing to claim 1, 2, 4, or 5, in which said synthesizing ents, said normalized predictive residual power being digital filter is of a lattiee type having its filter coefficipart of the input signal to said synthesizing digital filter. ent determined by a partial autocorrelation coefficient 9. A speech analysis and synthesis apparatus accord- 10 i.e., K parameter. ing to claim 7 in which said synthesizing digital filter is * * * * * 15 20 25 30 35 40 45 50 55 60 65 UNITED STATES PATENT AND TRADEMARK OFFICE CERTIFICATE OF CORRECTION PATENT NO. 4,301,329 DATED November 17, 1981 INVENTOR(S) Tetsu Taguchi Page 1 of 3 It is certified that error appears in the above~identified patent and that said Letters Patent is hereby corrected as shown below: Column 1, line 46 - after "waveform" insert --. Therefore, an attempt-- Column 2, line 12 - change "(For" to --(for-formula (1), second occurrence, change . · 1 lne 26 - c h ange "s (n-l )" to -- S ( n-l-A.. .) line 27 -change "Sn" to •• "s" to --S-- --tn-- line 29 - change "Un" to --Un -Column 3, line 28 - after "consequently" insert Column 4, lines 52-53 - change "acoustic to electrical" to --acoustic-to-electrical-- 4~ • • Column 6, line 36 - change "orderin l l to --order in-- f. UNITED STATES PATENT AND TRADEMARK OFFICE CERTIFICATE OF CORRECTION DATED • INVENTOR(S) . Page 2 of 3 4,301,329 November 17, 1981 Tetsu Taguchi PATENT NO. It is certified that error appears in the above-identified patent and that said Letters Patent is hereby corrected as shown below: Colunm 8, line 34 - change "voice" to --voiced-Column 9, line • 4 - change "to analog" to --to-analog-- - line 45 change "S(t+2)" to --S(t+i)-- Colunm 10, line 30 - change "sage" to --stage-Colunm 10, line 67 - change "n3" to --N3-- • 6 - change "coIJ.trol" to --controller-- line 15 - change lfsppech" to --speech-- Colunm 11, line line 26 - change "an: • Colunm 12, line (2nd. occurrence) to --and-- 4 - change "to" to --the-- line 30 - delete "by" line 47 • - change " e:xmaple" to --example-- Column 13, line 24 - before "effect" insert --same-line 40 - change "applid" to --applied-- line 41, change "is" (first occurrence) to --it-line 65 - change" /U.P." to -- JU.P. -- • o UNITED STATES PATENT AND TRADEMARK OFFICE CERTIFICATE OF CORRECTION DATED INVENTOR(S) Page 3 of 3 4,301,329 November 17, 1981 Tetsu Taguchi PATENT NO. It is certified that error appears in the above-identified patent and that said Letters Patent is hereby corrected as shown below: Column 14, line 46 - after "sound" insert --from being-- line 52 - change "betwen" to --between-- o lines 61-62,. delete "interpolated by using the linear" IN THE CLAIMS: Column 15, line 55 - change "qunatizing" to --quantizing-- Signed and Scaled this Twentieth ISEALl Da), of April 1981 Attest: GERALD J. MOSSINGHOFF Attesting Officer • • Commissioner of Patents and Trademarks

Disclaimer: Justia Dockets & Filings provides public litigation records from the federal appellate and district courts. These filings and docket sheets should not be considered findings of fact or liability, nor do they necessarily reflect the view of Justia.

Why Is My Information Online?