Graeme Clark Collection

Permanent URI for this collection

Search Results

Now showing 1 - 7 of 7
  • Item
    Thumbnail Image
    A review of the biological, psychophysical, and speech processing principles used to design the tickle talker
    Blamey, P. J. ; Cowan, R. S. C. ; Alcantara, J. I. ; Whitford, L. A. ; Galvin, K. L. ; Sarant, J. Z. ; Clark, Graeme M. ( 1992)
    The Tickle Talker is a wearable electrotactile speech processor, designed to be used by profoundly hearing-impaired children and adults in conjunction with lipreading and residual hearing. The effectiveness of such a device is affected by an interaction between biological, human engineering, psychophysical, and speech processing considerations. The requirements, the design principles, and the performance of the Tickle Talker in each of these areas will be discussed. Electrical stimulation of the nerve bundles lying along the sides of the fingers was chosen to provide safe, comfortable, energy-efficient stimulation of a well-organised and sensitive part of the tactile sensory system. This is achieved at a small cost to the appearance and mobility of one hand when using the Tickle Talker. The biphasic pulse waveform used to stimulate the nerve bundles has been chosen to ensure a biologically safe stimulus. The electrical parameters (pulse duration, pulse rate, and electrode position) that are used to encode speech information are varied within ranges that are matched to the characteristics of the tactile sense. The usable ranges and information-carrying potential of each of these parameters have been assessed in psychophysical experiments. A comparison of these results with similar experimental data for cochlear implant and hearing aid users is instructive in assessing the possible limitations of tactile and auditory speech processors. The results discussed will include the discrimination and identification of stimuli differing in intensity, duration and pulse rate; the identification of different spatial patterns of stimulation, and the detection of gaps in stimuli. In most respects, the tactile results are similar to the corresponding auditory measures. The resolution of temporal differences such as pulse rate discrimination or gap detection are generally not as good as in the auditory case, but may be as good or better than the corresponding results for some profoundly hearing-impaired individuals. The speech processor used in the Tickle Talker is a "feature extraction" device that explicitly estimates the second formant frequency, amplitude envelope, and fundamental frequency of the voice and encodes them in terms of electrode position, pulse width and pulse rate of the electrical stimulation pattern. Consideration of the psychophysical results and the speech information available from these parameters allows optimization of the Tickle Talker's operation and a broad estimation of its potential performance in speech discrimination. The perception of duration and place of articulation (front/back) of vowels, and the manner and voicing of consonants are expected to be improved by the Tickle Talker. Prosodic variations conveyed by pulse rate are expected to be perceived by some users, but not all. High frequency consonants such as: /s/,/z/./?/, and /t?/ are encoded in a particularly salient manner by the Tickle Talker.
  • Item
    Thumbnail Image
    Design fundamentals for a tactile speech processor: i) encoding of speech information, and ii) biomedical safety considerations [Abstract]
    Cowan, Robert S. C. ; Blamey, Peter J. ; Sarant, Julia Z. ; Galvin, Karyn L. ; Clark, Graeme M. ( 1992)
    Approaches to providing speech information through the tactual modality have varied in: number and spatial location of transducers; method of interfacing with the skin's sensory apparatus; and content of speech information presented. Use of a multiple speech feature encoding approach to design of a tactile device was implemented in the wearable multichannel electrotactile speech processor or Tickle Talker developed at the University of Melbourne. Psychophysical studies established that subjects could discriminate salient electrical parameters in the tactual display, and that this information could be used to discriminate acoustic speech feature contrasts. Results with normally-hearing and hearing-impaired adults and children using an FOF2 encoding strategy showed improved discrimination scores for closed-set speech feature discrimination batteries, closed-set vowel and consonant identification tasks, as well as for open-set word and sentence comprehension. Based on analyses of tactual encoding of speech features, alternative speech processing strategies designed to increase the quality of speech information available were evaluated. Results for two hearing-impaired adults showed improved feature discrimination with the addition of a voicing signal to the FOF2 strategy. Biomedical safety investigations conducted concurrently have established that the electrical parameters of the stimulus waveform, electrode handset design, and electrical circuitry of the device are free from potential risks. Longer-term physiological assessments included measures of possible effects of electrical stimulation on tactual sensitivity, finger temperature, finger and hand blood flow, electrical thresholds and maximum comfortable levels, and on central nervous system, function as measured by EEG. Results of the kinesthetic, vascular and neurological assessments showed no significant contraindications which might limit application or long-term use of the device.
  • Item
    Thumbnail Image
    Implantation of the new CI-24M Cochlear Limited receiver-stimulator and its electrode array
    Clark, Graeme M. ; Pyman, B. ; Webb, Robert L. (Monduzzi Editore, 1997)
    Cochlear Limited has released the new CI-24M model Nucleus 24 electrode cochlear implant system encompassing a series of new features. It offers the potential for improvements in hearing and cosmesis, as well as telemetry.
  • Item
    Thumbnail Image
    Effects of rate and pulse manipulations of the spectral maxima speech processor upon speech production
    Tobey, Emily A. ; Blamey, Peter J. ; McDermott, Hugh J. ; McKay, Colette M ( 1993)
    Postlingually deafened adults experience many changes in their speech as a consequence of late-onset deafness. Previous studies have indicated changes in fundamental frequency, duration, intensity, and vowel formant frequencies in postlingually deafened adults. Postlingually deafened adults who receive multichannel cochlear implants demonstrate improved control of fundamental frequency and intensity. Shifts in formant frequencies to values similar to normal hearing. speakers also appears in some adult implant users. In order to examine how adult implant users adjust their vowel production in response to map changes, we examined vowel production following manipulations to the processing strategy of the Spectral Maxima Speech Processor. This processor uses a speech processing scheme in which six spectral maxima from the outputss of 16 bandpass filters stimulate the cochlea on a place basis at a constant rate. The rate of sampling of the filterbank output is 250 Hz, so six biphasic pulses are presented every 4 msec and there is no attempt to extract fundamental frequency or to find the formant peaks in the speech signal. Two manipulations to the scheme were examined. In the first condition, the rate of sampling remained at 250Hz but eight biphasic pulses were presented rather than six. In the second condition, six biphasic pulses were presented but the rate of sampling of the filterbank output was increased to 400 Hz. Speech samples also were acquired using the standard spectral maxima processor .and with no auditory feedback when the processor was turned off. Speech samples from three �subjects were acquired immediately after receiving the manipulated speech processors and after two weeks experience with the various processors. Preliminary data indicate one subject experienced increased fundamental frequencies while using the 400 Hz high rate strategy. No significant changes were observed in fundamental frequency between the normal SMSP processor and the eight pulse variation. Elimination of feedback resulted in significantly lower fundamental frequencies. The high rate and eight pulse variations resulted in significantly higher first formant frequencies. Second formant frequencies also appear to be influenced by the processing strategies. Comparisons of values produced immediately after receiving a new strategy versus those produced after two weeks use show shifts in primarily first formant values. Data will be presented for all three subjects and discussed in regard to sensitivity to variations in speech processing schemes and the influence of experience with manipulated schemes. (Work supported by the N1H-NIDCD).
  • Item
    Thumbnail Image
    An extension of the Multipeak speech processing strategy for the MSP/MINI 22 cochlear implant system
    Jones, P. A. ; McDermott, H. J. ; Sellgman, P. M. ; Millar, J. B. ( 1992)
    The speech perception of three post-linguistically deaf adults using the Nucleus MSP/Mini System 22 cochlear implant system programmed with a new speech processing strategy, MPEAK+AO. was evaluated. The MPEAK+AO strategy retains all the information of the standard Multipeak speech processing strategy and additionally presents acoustic components below 400Hz to the most-apical electrode. This extra spectral Information may help implantees understand speech, particularly in noise. Since the estimated fundamental frequency is presented as the rate of stimulation at a fixed intracochlear site and is thereby potentially perceived more easily. and the amplitude of the stimulation on the apical electrode, associated with the voice fundamental, Is directly determined from the estimated energy in the relevant spectral region. these coding factors may provide a better representation ot the prosodic information in speech and a more complete auditory feedback signal. The comparison between Multipeak and MPEAK+AO included tests of vowel, consonant and CNC word recognition. Speech materials were presented with both a male and female speaker. Sentence material. presented with background masking noise (four-speaker babble) was also used. The results showed that the new strategy significantly improved the ability of these MSP users to recognise words in open-set sentences in noisy conditions.
  • Item
    Thumbnail Image
    A new algorithm for voicing detection and voice pitch estimation based on the neocognitron
    Moxham, James R. E. ; Jones, Peter A. ; McDermott, Hugh D. ; Clark, Graeme M. ( 1992)
    Over the last decade cochlear implants have been used increasingly to restore hearing to the profoundly deaf. One of the more widely used implants is the Nucleus multi-electrode implant, developed by the University of Melbourne and Cochlear Pty. Ltd. The speech processor used with this implant is the MSP, programmed with the multipeak strategy. This device incorporates circuits to estimate the fundamental frequency (FO) of speech signals, and to decide whether voicing is present. This paper describes a new FO estimator and voicing detection algorithm based on the neocognitron; a neural network modelled on the retina and early visual system. Performance was compared with that of three other FO estimation algorithms: linear predictive coding (LPC), cepstral analysis and the algorithm used in the Multipeak-MSP processor. For the speech samples tested, the neocognitron performed more reliably than the other three systems. On the basis of these results, this work may be able to provide benefits to existing and future cochlear implant users.
  • Item
    Thumbnail Image
    Development of an advanced electrotactile speech processor [Abstract]
    Cowan, Robert S. C. ; Blamey, Peter J. ; Glavin, K. L. ; Sarant, J. Z. ; Millard, R. ; Clark, Graeme M. ( 1992)
    Initial studies with a multiple channel electrotactile speech processor ('Tickle Talker') showed that profoundly hearing-impaired adults and children could achieve significant improvements on speech perception tests through use of tactual-encoded estimates of fundamental frequency, second formant frequency and amplitude envelope. Results showed that the device provided tactual cues which could be used to discriminate speech features such as vowel length and place, and consonant manner and voicing. However, discrimination of initial consonant voicing was not as consistent as for other features. Accurate discrimination of consonant voicing and manner (particularly high frequency information) is critical, since this Information is generally inaudible through hearing aids, and difficult to lipread.