1. Feng, Z., Tan, L., Li, W. and Gulliver,
T.A., "Reinforcement learning based dynamic network self-optimization for
heterogeneous networks", in Communications, Computers and Signal
Processing, PacRim, Pacific Rim Conference on, IEEE, (2009), 319-324.
G.C. and Shamma, J.S., "Distributed dynamic reinforcement of efficient
outcomes in multiagent coordination and network formation", Dynamic Games and
Applications, Vol. 2, No. 1,
3. Brooks, R.A.,
"A robust layered control system for a mobile robot", IEEE
Journal of Robotics and Automation,
Vol. 2, No. 1, (1986), 14-23.
4. Gosavi, A.,
"A tutorial for reinforcement learning", Department of Engineering Management and
Systems Engineering, (2011).
5. Busoniu, L.,
Babuska, R. and De Schutter, B., "A comprehensive survey of multiagent
reinforcement learning", Systems, Man, and Cybernetics, Part C: Applications
and Reviews, IEEE Transactions on,
Vol. 38, No. 2, (2008), 156-172.
6. Qiao, J.,
Hou, Z. and Ruan, X., "Application of reinforcement learning based on
neural network to dynamic obstacle avoidance", in Information and
Automation, 2008. ICIA 2008. International Conference on, IEEE, (2008),
S., Ghaderi, R., Ebrahimzade, A., "A q-learning based continuous tuning of fuzzy wall tracking
without exploration", International Journal of Engineering-Transactions
A: Basics, Vol. 25, No. 4, (2012), 355-366.
8. Abdi, J.,
Khalili, G.F., Fatourechi, M., Lucas, C. and Sedigh, A.K., "Control of
multivariable systems based on emotional temporal difference learning
controller", International Journal of Engineering-Transactions A: Basics, Vol. 17, No. 4, (2004), 363-376.
9. Mirmomeni, M.
and Yazdanpanah, M., "An unsupervised learning method for an attacker
agent in robot soccer competitions based on the kohonen neural network", International
Journal of Engineering-Transactions A: Basics, Vol. 21, No. 3, (2008), 255-268.
10. Koohi, H.,
Nadernejad, E. and Fathi, M., "Employing sensor network to guide firefighters
in dangerous area", International Journal of Engineering-Transactions
C: Aspects, Vol. 32, No. 2,
11. Ganesan, D.,
Krishnamachari, B., Woo, A., Culler, D., Estrin, D. and Wicker, S., "Complex behavior at scale: An experimental
study of low-power wireless sensor networks", (2002), Technical Report
12. Aslam, J., Li,
Q. and Rus, D., "Three power‐aware routing algorithms for sensor
networks", Wireless Communications and Mobile Computing, Vol. 3, No. 2, (2003), 187-208.
13. Baldwin, P.,
Kohli, S., Lee, E.A., Liu, X. and Zhao, Y., "Modeling of sensor nets in
ptolemy ii", in Proceedings of the 3rd international symposium on
Information processing in sensor networks, ACM, (2004),