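Abstract (translated from the Chinese)

As the problems tackled by deep learning grow more complex, the number of layers, the number of neurons, and the connections between neurons in neural networks keep increasing, and the parameter count swells sharply. Optimizing hyperparameters to improve the predictive performance of neural networks has therefore become an important task. Methods in the literature for finding optimal parameters, such as sensitivity pruning and grid search, are algorithmically complex and computationally expensive. This paper proposes a hyperparameter optimization strategy called "junk neuron deletion". Neurons whose mean weight in the weight matrix is small contribute negligibly to the prediction and are called junk neurons. The strategy deletes these junk neurons to obtain a streamlined network structure, which effectively shortens computation time while improving prediction accuracy and the model's generalization ability. With this strategy, the prediction performance of long short-term memory (LSTM) network models on several typical chaotic dynamical systems is significantly improved.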

Abstract

As real-world problems grow more complex, the size of deep neural networks, including the number of layers, neurons, and connections, is increasing explosively. Optimizing hyperparameters to improve the prediction performance of neural networks has become an important task. Methods in the literature for finding optimal parameters, such as sensitivity pruning and grid search, are complicated and computationally expensive. In this paper, a hyperparameter optimization strategy called junk neuron deletion is proposed. A neuron with a small mean weight in the weight matrix contributes negligibly to the prediction and is defined as a junk neuron. The strategy obtains a simplified network structure by deleting the junk neurons, which effectively shortens the computation time and improves the prediction accuracy and the generalization capability of the model. An LSTM model is trained on time series data generated by the Logistic, Henon and Rossler dynamical systems, and a relatively optimal parameter combination is obtained by grid search with a fixed step length. Under this parameter combination, the part of the weight matrix that can influence the model output is extracted, and the neurons with smaller mean weights are eliminated using different thresholds. It is found that, with a weight mean value of 0.1 as the threshold, identifying and deleting junk neurons significantly improves the prediction efficiency. As the threshold is increased, the accuracy gradually falls back to the initial level, but more computing cost is saved for the same prediction quality. Reducing the network still further drops the prediction ability below the initial level because of underfitting. Using this strategy, the prediction performance of the LSTM model for several typical chaotic dynamical systems is improved significantly.

Keywords:

Authors and contacts

Corresponding author: Yang Hui-Jie, hjyang@usst.edu.cn

Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 11875042, 11505114).

Model	train	test	win	L₁	L₂	D₁	D₂	Accuracy
Logistic (μ = 3.6)	5000	15	22	16	4	4	1	82.90%
Logistic (μ = 3.7)	5000	15	22	16	4	4	1	70.70%
Logistic (μ = 3.8)	5000	15	2	20	4	4	1	68.60%
Logistic (μ = 3.9)	5000	15	2	16	4	2	1	60.00%
Logistic (μ = 3.99)	5000	15	2	16	4	4	1	57.10%
Henon	5000	15	2	22	2	2	1	67.10%
Rossler	5000	15	2	16	4	4	1	77.10%
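
The Logistic rows in the table above differ only in the control parameter μ. A minimal sketch of how such a series and its sliding-window samples could be generated is given below. It assumes the standard logistic map x_{n+1} = μ x_n (1 - x_n); the initial value `x0`, the helper names, and the reading of `win` as the input window length are assumptions made for illustration, not details confirmed by this page.

```python
def logistic_series(mu, n, x0=0.3):
    """Iterate the logistic map x -> mu * x * (1 - x) and return n values."""
    xs = []
    x = x0  # assumed initial value; the page does not state one
    for _ in range(n):
        xs.append(x)
        x = mu * x * (1.0 - x)
    return xs


def sliding_windows(series, win):
    """Pair each run of `win` consecutive values with the value that follows."""
    inputs, targets = [], []
    for i in range(len(series) - win):
        inputs.append(series[i:i + win])
        targets.append(series[i + win])
    return inputs, targets


series = logistic_series(mu=3.99, n=5000)  # 5000 matches the `train` column
inputs, targets = sliding_windows(series, win=2)
print(len(inputs), inputs[0], targets[0])
```

Each input is a list of `win` consecutive values and each target is the next value in the series, which is the usual shape for one-step-ahead forecasting.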

Structure	Start	End
input_gate	0	units
forget_gate	units	2×units
cell	2×units	3×units
output_gate	3×units	4×units
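
The start and end points above describe how a concatenated LSTM weight matrix splits into four gate blocks, each `units` wide. A minimal pure-Python sketch of that slicing follows. The function name `split_gates` and the toy matrix are hypothetical, and placing the blocks along the column axis is an assumption (it matches the layout Keras uses for LSTM kernels, although this page does not name the library).

```python
def split_gates(matrix, units):
    """Split each row of a (rows x 4*units) matrix into the four gate blocks."""
    bounds = {
        "input_gate": (0, units),
        "forget_gate": (units, 2 * units),
        "cell": (2 * units, 3 * units),
        "output_gate": (3 * units, 4 * units),
    }
    for row in matrix:
        if len(row) != 4 * units:
            raise ValueError("each row must have 4 * units columns")
    return {
        name: [row[start:end] for row in matrix]
        for name, (start, end) in bounds.items()
    }


# Toy matrix with 2 rows and units = 2, so 8 columns per row.
toy = [list(range(8)), list(range(10, 18))]
gates = split_gates(toy, units=2)
print(gates["forget_gate"])
```

With `units = 2`, the forget gate occupies columns 2 and 3 of every row, following the `units` to `2×units` range in the table.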

	w1	w2	w3	w4	w5	w6	w7	w8	w9	w10	w11	w12	w13	w14	w15	w16
x1	0.032129	0.144328	0.007293	0.003557	0.070465	0.011652	0.11142	0.048287	0.105727	0.039649	0.010162	0.119949	0.0771	0.126937	0.001947	0.089396
x2	0.070867	0.013232	0.054503	0.073496	0.005734	0.064071	0.145835	0.083217	0.025553	0.033065	0.113584	0.028327	0.054085	0.177761	0.010123	0.046546
x3	0.083647	0.046366	0.066226	0.044534	0.010054	0.077919	0.071001	0.018548	0.035294	0.071642	0.053888	0.028448	0.102273	0.096744	0.078535	0.114879
x4	0.039123	0.113002	0.161581	0.113866	0.070651	0.085571	0.01836	0.015408	0.032978	0.018375	0.068342	0.059137	0.00701	0.070451	0.032602	0.099853
x5	0.013762	0.055823	0.147481	0.055493	0.093444	0.027264	0.082384	0.058674	0.032903	0.033342	0.045937	0.035937	0.110378	0.004487	0.161653	0.041037
x6	0.038079	0.089649	0.013102	0.021696	0.042833	0.109787	0.001024	0.0673	0.036916	0.134038	0.066291	0.146953	0.009803	0.081372	0.098701	0.042456
x7	0.013865	0.001702	0.057869	0.072264	0.104456	0.029761	0.062669	0.07539	0.05715	0.102282	0.017876	0.012626	0.020022	0.100243	0.153076	0.11875
x8	0.002042	0.006925	0.060008	0.13031	0.136406	0.056203	0.061606	0.028064	0.006926	0.099129	0.055122	0.117276	0.06846	0.014505	0.078184	0.078834
x9	0.044205	0.171724	0.153162	0.13818	0.029189	0.025947	0.049391	0.012338	0.100584	0.028133	0.004946	0.017914	0.008463	0.014741	0.066944	0.134139
x10	0.094619	0.0563	0.040223	0.096283	0.10152	0.145036	0.051991	0.075623	0.075216	0.061209	0.057986	0.066076	0.004787	0.01945	0.042341	0.011339
x11	0.078909	0.005603	0.08149	0.125202	0.069081	0.10143	0.122451	0.027058	0.057647	0.016226	0.03275	0.050667	0.036795	0.11072	0.081767	0.002204
x12	0.046159	0.005688	0.006237	0.004618	0.014815	0.005272	0.04598	0.037005	0.190933	0.065535	0.005131	0.015155	0.065812	0.099804	0.172294	0.21956
x13	0.111226	0.027026	0.027497	0.074868	0.139154	0.084413	0.080342	0.038769	0.088824	0.047083	0.056548	0.002081	0.10549	0.049929	0.020529	0.04622
x14	0.092772	0.03999	0.055938	0.128114	0.036386	0.013061	0.083943	0.051033	0.106374	0.007257	0.063049	0.091929	0.084821	0.020458	0.089496	0.035379
x15	0.131586	0.038161	0.176303	0.041758	0.049173	0.096633	0.0033	0.045529	0.084262	0.050839	0.003322	0.063406	0.029601	0.045323	0.116047	0.024757
x16	0.070289	0.05378	0.092472	0.03372	0.052087	0.110236	0.055639	0.124221	0.029371	0.06142	0.04904	0.043376	0.012261	0.041226	0.109564	0.061299
Mean	0.060205	0.054331	0.075087	0.072372	0.064091	0.065266	0.065459	0.050404	0.066666	0.054326	0.043998	0.056204	0.049428	0.067134	0.082113	0.072915
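
The last row of the table gives the mean weight of each of the 16 columns, and the junk-neuron idea is to drop the columns whose mean is smallest. A minimal sketch of that ranking, using the mean row copied from the table, is shown below. How the paper's threshold values map onto these means is not recoverable from this page, so the sketch simply removes the `k` lowest-ranked columns; `k` and the function name are hypothetical.

```python
# Column means copied from the last row of the table (w1 .. w16).
means = [
    0.060205, 0.054331, 0.075087, 0.072372, 0.064091, 0.065266,
    0.065459, 0.050404, 0.066666, 0.054326, 0.043998, 0.056204,
    0.049428, 0.067134, 0.082113, 0.072915,
]


def lowest_mean_columns(col_means, k):
    """Return the 1-based labels of the k columns with the smallest mean."""
    order = sorted(range(len(col_means)), key=lambda j: col_means[j])
    return ["w%d" % (j + 1) for j in order[:k]]


junk = lowest_mean_columns(means, k=4)  # k = 4 is an illustrative choice
kept = ["w%d" % (j + 1) for j in range(len(means))
        if "w%d" % (j + 1) not in junk]
print(junk)
print(len(kept))
```

The 16 means sum to 1 within rounding, which is consistent with each row of the table having been normalized to sum to 1.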

Metric (Logistic, μ = 3.99)	Before adjustment	Threshold 0.09	Threshold 0.1	Threshold 0.11
L₁	16	15	12	10
Accuracy	57.10%	59.30%	56.40%	51.40%

Model	L₁ before	L₁ after	Accuracy before	Accuracy after	Change in neuron count
Logistic (μ = 3.6)	16	15	82.90%	90.70%	-1
Logistic (μ = 3.7)	16	13	70.70%	71.40%	-3
Logistic (μ = 3.8)	20	16	68.60%	68.60%	-4
Logistic (μ = 3.9)	16	12	60.00%	60.00%	-4
Logistic (μ = 3.99)	16	15	57.10%	59.30%	-1
Henon	22	21	67.10%	70.00%	-1
Rossler	16	14	77.10%	83.60%	-2
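
The changes summarized in the table can be recomputed directly from its before and after columns. The short sketch below does this for the neuron count and for the accuracy (in percentage points). The data are copied from the table; the variable and function names are illustrative.

```python
# (model, L1 before, L1 after, accuracy before in %, accuracy after in %)
rows = [
    ("Logistic (mu=3.6)", 16, 15, 82.90, 90.70),
    ("Logistic (mu=3.7)", 16, 13, 70.70, 71.40),
    ("Logistic (mu=3.8)", 20, 16, 68.60, 68.60),
    ("Logistic (mu=3.9)", 16, 12, 60.00, 60.00),
    ("Logistic (mu=3.99)", 16, 15, 57.10, 59.30),
    ("Henon", 22, 21, 67.10, 70.00),
    ("Rossler", 16, 14, 77.10, 83.60),
]


def changes(table):
    """Map each model to (change in L1, change in accuracy in points)."""
    out = {}
    for model, l1_before, l1_after, acc_before, acc_after in table:
        out[model] = (l1_after - l1_before, round(acc_after - acc_before, 2))
    return out


result = changes(rows)
for model, (d_neurons, d_acc) in result.items():
    print(f"{model}: {d_neurons:+d} neurons, {d_acc:+.2f} points")
```

No model loses accuracy in this summary, and the two that gain nothing (μ = 3.8 and μ = 3.9) still shed four neurons each.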

Metric (Logistic, μ = 3.6)	Before adjustment	Threshold 0.08	Threshold 0.09	Threshold 0.095
L₁	16	15	12	10
Accuracy	82.90%	90.70%	87.90%	78.60%

Metric (Logistic, μ = 3.7)	Before adjustment	Threshold 0.075	Threshold 0.09	Threshold 0.105
L₁	16	13	11	9
Accuracy	70.70%	71.40%	65.00%	60.70%

Metric (Logistic, μ = 3.8)	Before adjustment	Threshold 0.085	Threshold 0.095	Threshold 0.105
L₁	20	18	16	12
Accuracy	68.60%	68.60%	68.60%	65.00%

Metric (Logistic, μ = 3.9)	Before adjustment	Threshold 0.09	Threshold 0.095	Threshold 0.1
L₁	16	14	12	10
Accuracy	60.00%	60.00%	60.00%	55.00%

Metric (Henon)	Before adjustment	Threshold 0.1	Threshold 0.11	Threshold 0.12	Threshold 0.14
L₁	22	21	19	16	14
Accuracy	67.10%	70.00%	66.40%	65.70%	65.00%

Metric (Rossler)	Before adjustment	Threshold 0.085	Threshold 0.095	Threshold 0.105	Threshold 0.115
L₁	16	14	12	11	8
Accuracy	77.10%	83.60%	81.40%	80.70%	71.40%
