• International Journal of Technology (IJTech)
  • Vol 11, No 3 (2020)

Design of Electrolarynx to Increase Energy Efficiency by Varying the Driving Source

Madhushankara Maila, Somashekara Bhat, Keerthana Prasad

Corresponding email: madhushankar.m@manipal.edu


Cite this article as:
Maila, M., Bhat, S., Prasad, K., 2020. Design of Electrolarynx to Increase Energy Efficiency by Varying the Driving Source. International Journal of Technology. Volume 11(3), pp. 554-563

64
Downloads
Madhushankara Maila Manipal School of Information Sciences, Manipal Academy of Higher Education, Manipal, 576104, Karnataka, India
Somashekara Bhat Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, 576104, Karnataka, India
Keerthana Prasad 1Manipal School of Information Sciences, Manipal Academy of Higher Education, Manipal, 576104, Karnataka, India
Email to Corresponding Author

Abstract
image

The larynx is an organ in the human respiratory system that forms the basis of the speech production system. An electrolarynx is a device used by most laryngectomees to regain their verbal communication. Our objective was to design an electrolarynx with increased energy efficiency by modifying the driving source. Initially, we designed the electrolarynx to accommodate various driving signals. We then compared the voice output obtained by different input signals for quality and measured energy consumption using a digital multimeter. When compared to existing methods, the proposed driving source was found to be 11.1% more energy efficient. Voice quality was also better than the traditional driving source by 31.6%.

Driving signal; Electrolarynx; Energy efficiency; Laryngectomee; Larynx; Speech

Introduction

Voice is a powerful mechanism to express ourselves. Voice production is an organized control of sensory and motor nervous systems (Simonyan and Horwitz, 2011). The cortical and subcortical areas of the brain control the speech production system. The lungs, larynx, vocal tract, and oral cavity are the main motor organs for speech production in human beings (Ackermann et al., 2014). The energy for producing speech originates from the lungs through the exhalation of breath. This vibrates the larynx and produces regular movement, resulting in laryngeal vibrations known as glottal waves (Bouchard et al., 2016). These periodic movements resonate, and intelligible speech is produced by the articulators in the oral cavity, such as the tongue, teeth, and lips (Visser, 2006).

    A person loses his ability to speak when the larynx is removed surgically due to unavoidable circumstances, such as laryngeal cancer. The electrolarynx is an electro-mechanical device that acts as a larynx substitute. This device has an oscillator and a vibrator, which is held against the neck when the laryngectomee intends to speak. The vibrational energy passes through the neck surface, and speech is produced by proper articulator movement (Gironda and Fabus, 2011). Apart from laryngectomees, this device can also help patients who are critically ill and rely on artificial ventilation (Tuinman et al., 2015; Sato et al., 2016; Rose et al., 2018). The electrolarynx being a handheld device, one of the hands must be engaged in operating the device. This could prove awkward for laryngectomees as they socialize. Many researchers are working toward reducing the device’s size to ease consumers’ lives.

A wearable device has been designed that uses a thin vibrator on a plastic support attached to the neck region (Hashiba et al., 2007). This vibrator is controlled by a wireless controller kept in the user’s pocket. Due to the required 9 v energy supply, this entire system is still substantial. The vibrator size reduction method was also employed in Madden et al. (2011), where the researchers designed an actuator for handsfree operation using a pager motor. However, pager motors’ current handling capacity is very low at about 100 ma, meaning there is insufficient vibration at the neck region to produce audible speech, as sound level affects speech intelligibility (Knox and Anneberg, 1973; Weiss et al., 1979).

The recent work on electrolarynxes suggests a variation of fundamental frequencies to produce different tones for tonal languages (Guo et al., 2016; Ifukube, 2017; Wang et al., 2017a; Wang et al., 2017b). However, extensive training is required to produce pitch variations.

Control electronics designed using the electromyography activity of the neck muscles (Arifin et al., 2014; Fuchs et al., 2015; Oe et al., 2017) and video cameras to capture lip movement (Wu et al., 2013) provide an automatic control over the electrolarynx. A statistical method for varying fundamental frequencies suggests that, this allows for more natural speech (Tanaka et al., 2016; Tanaka et al., 2017). Based on the electrolarynx speech signals’ feedback, the excitation signal is varied to accommodate differences in fundamental frequency. To decrease the electrolarynx’s self-radiated noise, an interpolation of ultrasonic waves (Mills and Zara, 2014; Massey and Yilmaz, 2016) and time domain enhancement methods (Xiao et al., 2018) are used. The findings from Sardjono et al. (2009) and Coetzee et al. (2018) suggested that voices change due to advancing age after examining the mel-frequency cepstral coefficients, implying the possible changes in voice excitation sources. A simulated modified excitation source design revealed improved voice quality over conventional vibration methods (Madhushankara et al., 2017).

Being a battery-driven device, an electrolarynx’s size should be as small as possible. However, battery technology is not advanced enough for new technology, so alternative energy-saving methods must also be accommodated (Madhushankara et al., 2015; Fuchs et al., 2016; Sofyan et al., 2016; Ejidokun et al., 2018). In our research, we designed an energy-efficient electrolarynx by modifying the driving source, thereby reducing battery size, which is proportional to battery capacity. We also employed li-ion batteries, which are commonly found in mobile phones due to their light weight and high energy capacity.

Conclusion

We have shown how an alternative electrolarynx driving signal leads to less power consumption and hence more energy efficiency with either improved or analogous performance in terms of formant distance, amplitude, quality factor, and leakage noise.

     In addition, the present work was carried out using an Arduino Uno development platform, which consists of as many as 20 pin chips, along with other peripherals that are not required for the proposed electrolarynx. The design of an application-specific integrated circuit with fewer pins and constituents will lower the inactive current and further improve battery performance.

References

Ackermann, H., Hage, S.R., Ziegler, W., 2014. Brain Mechanisms of Acoustic Communication in Humans and Nonhuman Primates: An Evolutionary Perspective. Behavioral and Brain Sciences, Volume 37, pp. 529546

Arifin, F., Sardjono, T.A., Purnomo, M.H., 2014. The Relationship between Electromyography Signal of Neck Muscle and Human Voice Signal for Controlling Loudness of Electrolarynx. Biomedical Engineering: Applications, Basis and Communications, Volume 26, pp. 1450054-1–1450054-7

Bouchard, K.E., Conant, D.F., Anumanchipalli, G.K., Dichter, B., Chaisanguanthum, K.S., Johnson, K., Chang, E.F., 2016. High-Resolution, Non-invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings. PLoS One, Volume 11(3), p. e0151327

Boylestad, R.L., Nashelsky, L., Li, L., 2002. Electronic Devices and Circuit Theory. Prentice Hall, Englewood Cliffs, NJ, USA

Brookes, M., 2003. VOICEBOX: A Speech Processing Toolbox for MATLAB. Available Online at www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html, Accessed on January 5, 2018

Childers, D.G., 1995. Glottal Source Modeling for Voice Conversion. Speech Communication, Volume 16(2), pp. 127138

Coetzee, P., Lamfel, J., Rubin, D.M., Aharonson, V., 2018. An Age Adapting Electrolarynx–A Feasibility Study. In: International Conference on Applied Human Factors and Ergonomics 2018, Editors, Ahram Tareq Z., Falcao Christianne. Springer, pp. 880886

Ejidokun, T., Yesufu, T., Ayodele, K., Ogunseye, A., 2018. Implementation of an On-board Embedded System for Monitoring Drowsiness in Automobile Drivers. International Journal of Technology, Volume 9(4), pp. 819827

Fuchs, A.K., Amon, C., Hagmüller, M., 2015. Speech/Non-speech Detection for Electro-Larynx Speech using EMG. Biosignals, pp. 138144

Fuchs, A.K., Hagmüller, M., Kubin, G., 2016. The New Bionic Electro-Larynx Speech System. IEEE Journal of Selected Topics Signal Processing, Volume 10(5), pp. 952961

Gironda, F., Fabus, R., 2011. Assessment of Articulation and Phonological Disorders. In: A Guide to Clinical Assessment and Professional Report Writing in Speech-Language Pathology, Stein-Rubin, C., Fabus, R. (eds.), Nelson Education, USA, pp. 139176

Guo, L., Nagle, K.F., Heaton, J.T., 2016. Generating Tonal Distinctions in Mandarin Chinese using an Electrolarynx with Preprogrammed Tone Patterns. Speech Communication, Volume 78, pp. 3441

Hashiba, M., Sugai, Y., Izumi, T., Ino, S., Ifukube, T., 2007. Development of a Wearable Electro-larynx for Laryngectomees and Its Evaluation. In: Engineering in Medicine and Biology Society, 2007, 29th Annual International Conference of the IEEE, 2007, IEEE, pp. 52675270

Hugeng, H., Kurniawan, R., 2016. Development of the ‘Healthcor’ System as a Cardiac Disorders Symptoms Detector using an Expert System based on Arduino Uno. International Journal of Technology, Volume 7(1), pp. 7887

Ifukube, T., 2017. Assistive Tool Design for Speech Production Disorders. In: Sound-Based Assistive Technology, Springer, pp. 169197

Knox, A.W., Anneberg, M., 1973. The Effects of Training in Comprehension of Electrolaryngeal Speech. Journal of Communication Disorders, Volume 6(2), pp. 110120

Madden, B., Nolan, M., Burke, T., Condron, J., Coyle, E., 2011. Intelligibility of Electrolarynx Speech using a Novel Hands-Free Actuator. In: International Conference on Bio-inspired Systems and Signal Processing, pp. 265–269 doi:10.5220/0003167902650269

Madhushankara, M., Bhat, S., Prasad, K., 2017. Design of an Electrolarynx Driving Source for Improved Vowel Quality. Journal of Advanced Research in Dynamical and Control Systems, Volume 9(15), pp. 870876

Madhushankara, M., Prasad, K., Chaitanya, C., Bhat, S., 2015. A Low Power Low Frequency Oscillator for Driving Electrolarynx. In: International Conference on VLSI Systems, Architecture, Technology and Applications (VLSI-SATA 2015), IEEE, pp. 13

Massey, J.W., Yilmaz, A.E., 2016. AustinMan and AustinWoman: High-Fidelity, Anatomical Voxel Models Developed from the VHP Color Images. In: 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, pp. 33463349

Meltzner, G.S., Kobler, J.B., Hillman, R.E., 2003. Measuring the Neck Frequency Response Function of Laryngectomy Patients: Implications for the Design of Electrolarynx Devices. The Journal of the Acoustical Society of America, Volume 114, pp. 10351047

Mills, P., Zara, J., 2014. 3D Simulation of an Audible Ultrasonic Electrolarynx using Difference Waves. PloS One, Volume 9, p. e113339

Oe, K., Kokushi, S., Nakamura, R., 2017. Proposal of New Control Parameter for Neck Myoelectric Control-type Electrolarynx. In: 2017 International Symposium on Micro-NanoMechatronics and Human Science (MHS), IEEE, pp. 12

Rose, L., Istanboulian, L., Smith, O.M., Silencieux, S., Cuthbertson, B.H., Amaral, A.C.K.B., Fraser, I., Grey, J., Dale, C., 2018. Feasibility of the Electrolarynx for Enabling Communication in the Chronically Critically Ill: The EECCHO Study. Journal of Critical Care, Volume 47, pp. 109113

Sardjono, T., Hidayati, R., Purnami, N., Noortjahja, A., Verkerke, G., Purnomo, M.A., 2009. Preliminary Result of Voice Spectrum Analysis from Laryngectomised Patients with and without Electro Larynx: A Case Study in Indonesian Laryngectomised Patients. In: 2009 International Conference on Instrumentation, Communications, Information Technology, and Biomedical Engineering (ICICI-BME), IEEE, pp. 15

Sato, K., Okajima, M., Taniguchi, T., 2016. The Electrolarynx as a Communication Tool for Mechanically Ventilated Critically Ill Patients: A Prospective Feasibility Study. Intensive Care Medicine, Volume 42(8), pp. 1299–1300

Simonyan, K., Horwitz, B., 2011. Laryngeal Motor Cortex and Control of Speech in Humans. The Neuroscientist, Volume 17(2), pp. 197208

Sofyan, N., Putro, D.Y., Zulfia, A., 2016. Performance of Vanadium-Doped LiFePO4/C Used as a Cathode for a Lithium Ion Battery. International Journal of Technology, Volume 7(6), pp. 13071315

Stevens, K., 1996. Articulatory-Acoustic-Auditory Relationship. In: The Handbook of Phonetic Sciences, Hardcastle, W., Laver, J. (eds.), Blackwell, Oxford, UK

Tanaka, K., Toda, T., Nakamura, S., 2016. Evaluation of Electrolarynx Controlled by Real-Time Statistical F0 Prediction. The Journal of the Acoustical Society of America, Volume 140(4), pp. 29642964

Tanaka, K., Toda, T., Nakamura, S., 2017. A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction. IEICE Transactions on Information and Systems, Volume 100(9), pp. 21652173

Tuinman, P., Ten Hoorn, S., Aalders, Y., Elbers, P., Girbes, A., 2015. The Electrolarynx Improves Communication in a Selected Group of Mechanically Ventilated Critically Ill Patients: A Feasibility Study. Intensive Care Med, Volume 41(3), pp. 547548

Visser, L., 2006. The Teacher’s Voice. Cengage Learning Australia

Wang, L., Feng, Y., Yang, Z., Niu, H., 2017a. Development and Evaluation of Wheel-Controlled Pitch-Adjustable Electrolarynx. Medical & Biological Engineering & Computing, Volume 55, pp. 14631472

Wang, L., Feng, Y., Yang, Z., Niu, H., 2017b. Development and Perceptual Evaluation of Pitch Controllable Electrolarynx. In: 2017 International Conference on Biomedical & Health Informatics (BHI), IEEE EMBS, pp. 305308

Weiss, M.S., Yeni?Komshian, G.H., Heinz, J.M., 1979. Acoustical and Perceptual Characteristics of Speech Produced with an Electronic Artificial Larynx. The Journal of the Acoustical Society of America, Volume 65(5), pp. 12981308

Wu, L., Wan, C., Wang, S., Wan, M., 2013. Development and Evaluation of On/Off Control for Electrolaryngeal Speech via Artificial Neural Network based on Visual Information of Lips. Journal of Voice, Volume 27, pp. 259.e7259.e16

Xiao, K., Wang, S., Wan, M., Wu, L., 2018. Radiated Noise Suppression for Electrolarynx Speech based on Multiband Time-Domain Amplitude Modulation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 26, pp. 15851593