新利luck在线·（中国）有限公司官网

名师风采

当前位置: 首页 > 师资队伍 > 名师风采 > 正文

殷苌茗教授

发布时间: 2020-03-16 10:30:10 浏览量:

新利luck在线·（中国）有限公司官网计算机与通信工程学研究生导师基本信息表
1、个人基本信息：
姓名：殷苌茗		性别：男
出生年月：1964年5月		技术职称：教授
毕业院校：上海大学		学历（学位）：博士
所在学科：运筹学与控制论专业		研究方向：算法与计算机软件；机器学习与智能控制
2、教育背景：
	北京师范大学		学士
1998	国防科技大学		硕士
2006	上海大学		博士

3、目前研究领域：
算法与计算机软件；机器学习与智能控制
4、已完成或已在承担的主要课题：
1. 智能体在部分可观测马尔可夫环境下的激励学习研究，国家自然科学基金,2002-2005 2. 多时间尺度风险敏感度MDP研究，理工大学科研基金 3. 湖南省青年骨干教师培养对象，湖南省教育厅 4. 1火力发电厂分布式数据采集与故障诊断系统，湖南省电力局科研项目（1998年），已结题，6万元，主持。 5. 智能体在部分可观测马尔可夫环境下的激励学习研究，国家自然科学基金项目，在研，20万元，主研。 6. 江西省地区电网负荷预测与分析系统，江西省电力总公司，已结题，50万元，主研。 7. 教学管理软件的开发与推广，长沙电力学院教研项目（2000年），已结题，0.5万元，主研。 8. 激励学习算法的收敛性研究，湖南省教委科研项目（2000年），已结题，0.5万元，主研。 9. 激励学习智能体最优控制策略及其在微经济环境下的决策问题，湖南省教育厅科研基金项目（2007），在研，1万元，主持。 10. 7、多时间参数风险敏感度MDP研究，新利luck在线·（中国）有限公司官网科研基金项目（2006），在研，3万元，主持。 11.
5、已出版的主要著作：

6、已发表的学术论文：
1. Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes，Proceedings inISCST, 2005,Ningbo, China 2. Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields，Proceedings in ICNC, 2005: 384-391. 3. 求解POMDP的动态合并激励学习算法，计算机工程, No.19,2005 4. 基于动态规划的激励学习遗忘算法，计算机工程与应用, 2004,Vol 40, No.20 5. Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming，Journal of Computer Engineering and Applications, 2004, Vol 40, No.20. 6. Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4). 7. Reinforcement Learning Algorithm for Solving RTDP with Variational Environment. ICGST International Journal on Artificial Intelligence and Machine Learning (AIML), Volume (7), Issue (I), pp17-21. 8. Reinforcement Learning Algorithms Based on mGA and EA with Policy Iterations. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Bio-Inspired Computational Intelligence and Applications - International Conference on Life System Modeling and Simulation, LSMS 2007, Proceedings v 4688 LNCS 2007. 9. Risk-Sensitive Reinforcement Learning Algorithms with Generalized Average Criterion. Applied Mathematics and Mechanics-English Edition, 2007, V28, N3 ( MAR ) , pp405-416. 10. Global Attractor for KGS Lattice System.Applied Mathematics and Mechanics-English Edition, 2007, V28, N5 (MAC), pp619-628. 11. Fused Sarsa（lambda）Learning Algorithm Based-on Multi-agent. Journal of Computer Engineering and Applications,2008, 44 (4), pp182-183. 12. Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields. 2005 International Conference on Natural Computation/2005 International Conference on Fuzzy Systems and knowledge Discovery (ICNC'05-FSKD'05), IEEE. 27-29 August 2005, Changsha , China . (Lecture Notes in Computer Science, v 3612, n PART III, Advances in Natural Computation: First International Conference, ICNC 2005. Proceedings, 2005, pp384-391) 13. Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes. Proceedings in the International Symposium on Computer Science and Technology 2005, Ningbo , China . 14. Reinforcement Learning Algorithm Based-on Policy Iteration for Solving RTDP. 2006.8, ISAI’2006, Beijing , China . 15. U-Clustering: A Reinforcement Learning Algorithm Based on Utility Clustering. Journal of Computer Engineering and Applications, 2005, No.20. 16. Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming. Journal of Computer Engineering and Applications, 2004, No.20. 17. The Dynamic Merge Reinforcement Learning Algorithm for Solving POMDP. Journal of Computer Engineering. 2005, 11. 18. Multi-Time Scale Risk-Sensitive Hierarchical Structure Control Problem. DCABES2006, Hangzhou , China , 2006.10. 19. Utility Clustering for Reinforcement Learning with Partial Observability. In Proceedings of Conference of Chinese Intelligence Automatization, HongKong , China , 2003.(IJCAI03). 20. Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4). 21. Nonlinear Control Based on Q-learning Algorithms. Journal of Changsha University of Electric Power, Val.18, No.1, 2003 (1). 22. A Relative Value Iteration Q-Learning Algorithm and Its Convergence Based-on Finite Samples. Journal of Computer Research and Development. Sept.2002, Vol.39, No.9. 23. Optimality Cost Relative Value Iteration Q-Learning Algorithm Based on Finite Samples. Journal of Computer Engineering and Applications, 2002, No.14. 24. Generalize Average Algorithm for Reinforcement Learning Its Convergence. Journal of Computer Engineering and Applications, 2002, No.20. 25. Reinforcement Learning Algorithm Based on average Cost Optimization for Each Stage. Journal of Computer Applications, Val.22, No.4, 2002 (4). 26. Classification for Un-labeled Context Based on Maximum Expectation Learning Algorithm. Proceedings of 14th CDC (Annul Conference of Control and Decision, China ). 27. ATD(lambda) Learning Forgetting Algorithm. Proceedings of 4th Machine and Electric Engineering Association of Hunan , China , Aug. 2002. 28. Distributed Real-time System for Electric Power Enterprise Based on Intranet/Web. Journal of Applications of the Computer Systems, 2002(4). 29. The Uniform of Security Policy in Distributed System. Journal of Information Engineering University , 2001. (Proceedings of Annual Conference of Chinese Networks and Information Security, Zhengzhou , China , 2001). 30. Design of Distributed Real Time Database System Based on JDBC/Web. Journal of Computer Development and Applications. 2001,No.36. 31. The Application Delphi Multi-thread for Distributed Real time Multi-task System. Journal of Changsha University of Electric Power, Val.15, No.1, 2001 (1). 32. Comparing ARP of IPv4 with Neighbor Discovery Protocol of IPv6. Journal of Changsha University of Electric Power, Val.16, No.1, 2001 (1). 33. Study and Application of Distributed Real Time Multimedia Database. Journal of Changsha University of Electric Power, Val.16, No.2, 2001 (2). 34. The Design of Real-time Monitor Database System Based on Distributed Heterogeneous Networks Environment. Journal of Changsha University of Electric Power, Val.16, No.3, 2001 (3). 35. Distributed Real-time Multi-task System Study and Application for Monitoring and Supervising in Electric Power Plant. Proceedings of 1st Machine and Electric Engineering Association of Hunan , China , Aug, 1999. 36. The Principles and Design Methods for Domain Service System of Campus Networks. Journal of Changsha University of Electric Power, Val.13, No.1, 1998(1). 37. Security Study for Windows NT Network Management. Journal of Changsha University of Electric Power, Val.13, No.2, 1998(2). 38. The Weighed Lorentz Norm Inequality of Generalization Maximum Operator. Annual of Hunan Mathematics, Val 17, No.2, 1997. 39. The Weighted boundary of Operator and its interpolation on Mixed Lebesgue Space. Journal of Changsha University of Electric Power, Val.12, No.3, 1997 (3). 40. The Alternativeness of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.8, No.2, 1993 (2). 41. The Combiner Theory of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.6, No.2, 1991 (2). 42. The Equivalence Conditions for Reductionable Elements on Complex Commutative Banach Algebra. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.1, 1990 (1). 43. F-Set on Unit square-cube under n-Dimension Euclid Space. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.2, 1990 (2).
7、所获学术荣誉及学术影响：
1. 1998年度获长沙电力学院“优秀教师” 2. 1998年度获系“优秀毕业实习指导教师” 3. 2000年度获长沙电力学院“优秀教师” 4. 2000年度获长沙电力学院“优质课奖” 5. 2001年度获长沙电力学院“优秀教师” 6. 2002年度获长沙电力学院“优秀教师” 7. 2002年度获“华中电力集团奖教基金奖”三等奖 8. 2003年度湖南省高等学校青年骨干教师培养对象

上一篇：蒋加伏教授

下一篇：徐学军教授

上一篇：蔡烁教授

下一篇：潘青