Periodic Acoustic Noise Cancelation with Intelligent and Active Method

Author

shiraz university- PhD candidate

Abstract

In this paper a new method based on temporal difference on reinforcement learning algorithm is proposed to active noise control of periodic acoustic signal. This method does not need dynamic information and therefore it is fully robust with respect to the dynamic change. Very low computational burden and low needed memory are some advantages of this method. In the first step, with well definition of the state variables, actions, and reward signal, a reinforcement learning problem is formed and then it is solved by Q-learning technique. In the next step the problem is simplified by help of frequency domain modeling information and it is solved by temporal difference method. Finally, a multi-level approach is presented which it can increase the precision without increasing of the memory size. It is shown by simulation that this method works well and the improvement of the multi-level temporal difference method with respect to the Q-learning method is addressed.

Keywords

Main Subjects


[1] Russell S, Norvig P (2009) Artificial Intelligence: A Modern Approach: Prentice Hall.
[2] Sutton R, Barto A (1998) Reinforcement Learning:An Introduction. Cambridge, MA: MIT Press.
[3] Kaelbling LP, Littman LM, Moore AW (1996) Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4: 237–285.
[4] Kuo SM, Morgan DR (1999) Active noise control: a tutorial review. in IEEE Proceedings 87: 943-973.
[5] Raeisy B, Golbahar Haghighi S (2012) Active Noise Controller with Reinforcement Learning. presented at the The 16th CSI International Symposiums on Artificial Intelligence & Signal Processing (AISP 2012): 74-79, Shiraz, IRAN.
[6] Zhou YL, Zhang QZ, Li XD, Gan WS (2005) Analysis and DSP implementation of an ANC system using a filtered-error neural network. Journal of Sound and Vibration 285: 1–25.
[7] Qizhi Z, Yongle J (2002) Active noise hybrid feedforward/feedback control using neural network compensation. Journal of vibration and acoustics 124: 100-104.
[8] Watkins CJCH (1989) Learning from delayed rewards. PhD Thesis.
[9] Zhou YL, Zhang QZ, Li XD, Gan WS (2008) On the use of an SPSA-based model free feedback controller in active noise control for periodic disturbances in a duct. Journal of Sound and Vibration 317: 456–472.