In:
Concurrency and Computation: Practice and Experience, Wiley, Vol. 35, No. 18 ( 2023-08-15)
Abstract:
Sparse multinomial logistic regression (SMLR) is widely used in image classification and text classification due to its feature selection and probabilistic output. However, the traditional SMLR algorithm cannot satisfy the memory and time needs of big data, which makes it necessary to propose a new distributed solution algorithm. The existing distributed SMLR algorithm has some shortcomings in network strategy and cannot make full use of the computing resources of the current high‐performance cluster. Therefore, we propose communication‐efficient sparse multinomial logistic regression (CESMLR), which adopts the efficient network strategy of each node to solve the SMLR subproblem and achieve a large number of data partitions, taking full advantage of the computing resources of the cluster to achieve an efficient SMLR solution. The big data experimental results show that the performance of our algorithm exceeds those of state‐of‐the‐art algorithms. CESMLR is suitable for processing tasks with high‐dimensional features and consumes less running time while maintaining high classification accuracy.
Type of Medium:
Online Resource
ISSN:
1532-0626
,
1532-0634
Language:
English
Publisher:
Wiley
Publication Date:
2023
detail.hit.zdb_id:
2052606-4
SSG:
11
Permalink