GLORIA — GEOMAR Library Ocean Research Information Access

1

Online Resource

A distributed system for large-scale n-gram language models at Tencent

Long, Qiang ; Wang, Wei ; Deng, Jinfu ; [et al.]

Association for Computing Machinery (ACM) ; 2019

In: Proceedings of the VLDB Endowment Vol. 12, No. 12 ( 2019-08), p. 2206-2217

add to mindlist on the mindlist

Details

In: Proceedings of the VLDB Endowment, Association for Computing Machinery (ACM), Vol. 12, No. 12 ( 2019-08), p. 2206-2217

Abstract: n-gram language models are widely used in language processing applications, e.g., automatic speech recognition, for ranking the candidate word sequences generated from the generator model, e.g., the acoustic model. Large n-gram models typically give good ranking results; however, they require a huge amount of memory storage. While distributing the model across multiple nodes resolves the memory issue, it nonetheless incurs a great network communication overhead and introduces a different bottleneck. In this paper, we present our distributed system developed at Tencent with novel optimization techniques for reducing the network overhead, including distributed indexing, batching and caching. They reduce the network requests and accelerate the operation on each single node. We also propose a cascade fault-tolerance mechanism which adaptively switches to small n-gram models depending on the severity of the failure. Experimental study on 9 automatic speech recognition (ASR) datasets confirms that our distributed system scales to large models efficiently, effectively and robustly. We have successfully deployed it for Tencent's WeChat ASR with the peak network traffic at the scale of 100 millions of messages per minute.

Type of Medium: Online Resource

ISSN: 2150-8097

URL: Article

DOI: 10.14778/3352063.3352136

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2019

detail.hit.zdb_id: 2478691-3

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

2

Online Resource

Real-time Transportation Prediction Correction using Reconstruction Error in Deep Learning

Liu, Shuai ; Song, Guojie ; Huang, Wenhao

Association for Computing Machinery (ACM) ; 2020

In: ACM Transactions on Knowledge Discovery from Data Vol. 14, No. 2 ( 2020-04-30), p. 1-20

add to mindlist on the mindlist

Details

In: ACM Transactions on Knowledge Discovery from Data, Association for Computing Machinery (ACM), Vol. 14, No. 2 ( 2020-04-30), p. 1-20

Abstract: In online complex systems such as transportation system, an important work is real-time traffic prediction. Due to the data shift, data model inconsistency, and sudden change of traffic patterns (like transportation accident), the prediction result derived from an offline-built model would be unreliable. Retraining the model is usually not time affordable for online prediction, especially when the prediction model is very complex and costs a lot of training time (for example, deep neural networks). A real-time prediction correction strategy would be of great value under this situation. Traditionally, the prediction correction usually relies on the prediction error in several previous time intervals. They assume that the error pattern is similar in the current time interval, so that it is time-delayed to some extent. In this article, we propose the prediction correction strategy using the reconstruction error in the deep neural network. The reconstruction error can reflect the model’s ability on feature representation and then determine the fitness of an input data to the model. We first build the relationship between reconstruction error and prediction error. From the perspective of the prediction interval, we demonstrate that the reconstruction error is in positive relation with the prediction interval. Thus the prediction result is more reliable when the reconstruction error is smaller. Then we propose two mechanisms of real-time prediction correction using the reconstruction error. The data driven prediction correction approach selects several training instances with similar reconstruction errors to the current instance and using their average prediction error in correcting the prediction result. The model-driven approach builds several component deep neural networks in training. The component training set for each network is selected according to the reconstruction error of training instances. For a predicting instance, it first computes the reconstruction error of the sample in each component network and then averages the results by the reconstruction error and prediction interval. The model-driven approach is actually a reconstruction error-based deep neural network ensemble approach. Finally, a series of experiments demonstrated that reconstruction error based prediction correction approaches are effective in several prediction problems in transportation including traffic flow prediction on road, traffic flow prediction in entrance and exit station and travel time prediction. Besides the high overall accuracy, our approach can also provide many observations of using the reconstruction error in transportation prediction.

Type of Medium: Online Resource

ISSN: 1556-4681 , 1556-472X

URL: Article

DOI: 10.1145/3369871

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2020

detail.hit.zdb_id: 2257358-6

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

3

Online Resource

Distributed Deep Forest and its Application to Automatic Detection of Cash-Out Fraud

Zhang, Ya-Lin ; Zhou, Jun ; Zheng, Wenhao ; [et al.]

Association for Computing Machinery (ACM) ; 2019

In: ACM Transactions on Intelligent Systems and Technology Vol. 10, No. 5 ( 2019-09-30), p. 1-19

add to mindlist on the mindlist

Details

In: ACM Transactions on Intelligent Systems and Technology, Association for Computing Machinery (ACM), Vol. 10, No. 5 ( 2019-09-30), p. 1-19

Abstract: Internet companies are facing the need for handling large-scale machine learning applications on a daily basis and distributed implementation of machine learning algorithms which can handle extra-large-scale tasks with great performance is widely needed. Deep forest is a recently proposed deep learning framework which uses tree ensembles as its building blocks and it has achieved highly competitive results on various domains of tasks. However, it has not been tested on extremely large-scale tasks. In this work, based on our parameter server system, we developed the distributed version of deep forest. To meet the need for real-world tasks, many improvements are introduced to the original deep forest model, including MART (Multiple Additive Regression Tree) as base learners for efficiency and effectiveness consideration, the cost-based method for handling prevalent class-imbalanced data, MART based feature selection for high dimension data, and different evaluation metrics for automatically determining the cascade level. We tested the deep forest model on an extra-large-scale task, i.e., automatic detection of cash-out fraud, with more than 100 million training samples. Experimental results showed that the deep forest model has the best performance according to the evaluation metrics from different perspectives even with very little effort for parameter tuning. This model can block fraud transactions in a large amount of money each day. Even compared with the best-deployed model, the deep forest model can additionally bring a significant decrease in economic loss each day.

Type of Medium: Online Resource

ISSN: 2157-6904 , 2157-6912

URL: Article

DOI: 10.1145/3342241

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2019

detail.hit.zdb_id: 2584437-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

4

Online Resource

Learning to manipulate amorphous materials

Zhang, Yunbo ; Yu, Wenhao ; Liu, C. Karen ; [et al.]

Association for Computing Machinery (ACM) ; 2020

In: ACM Transactions on Graphics Vol. 39, No. 6 ( 2020-12-31), p. 1-11

add to mindlist on the mindlist

Details

In: ACM Transactions on Graphics, Association for Computing Machinery (ACM), Vol. 39, No. 6 ( 2020-12-31), p. 1-11

Abstract: We present a method of training character manipulation of amorphous materials such as those often used in cooking. Common examples of amorphous materials include granular materials (salt, uncooked rice), fluids (honey), and visco-plastic materials (sticky rice, softened butter). A typical task is to spread a given material out across a flat surface using a tool such as a scraper or knife. We use reinforcement learning to train our controllers to manipulate materials in various ways. The training is performed in a physics simulator that uses position-based dynamics of particles to simulate the materials to be manipulated. The neural network control policy is given observations of the material (e.g. a low-resolution density map), and the policy outputs actions such as rotating and translating the knife. We demonstrate policies that have been successfully trained to carry out the following tasks: spreading, gathering, and flipping. We produce a final animation by using inverse kinematics to guide a character's arm and hand to match the motion of the manipulation tool such as a knife or a frying pan.

Type of Medium: Online Resource

ISSN: 0730-0301 , 1557-7368

URL: Article

DOI: 10.1145/3414685.3417868

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2020

detail.hit.zdb_id: 2006336-2

detail.hit.zdb_id: 625686-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

5

Online Resource

Learning to dress : synthesizing human dressing motion via deep reinforcement learning

Clegg, Alexander ; Yu, Wenhao ; Tan, Jie ; [et al.]

Association for Computing Machinery (ACM) ; 2018

In: ACM Transactions on Graphics Vol. 37, No. 6 ( 2018-12-31), p. 1-10

add to mindlist on the mindlist

Details

In: ACM Transactions on Graphics, Association for Computing Machinery (ACM), Vol. 37, No. 6 ( 2018-12-31), p. 1-10

Abstract: Creating animation of a character putting on clothing is challenging due to the complex interactions between the character and the simulated garment. We take a model-free deep reinforcement learning (deepRL) approach to automatically discovering robust dressing control policies represented by neural networks. While deepRL has demonstrated several successes in learning complex motor skills, the data-demanding nature of the learning algorithms is at odds with the computationally costly cloth simulation required by the dressing task. This paper is the first to demonstrate that, with an appropriately designed input state space and a reward function, it is possible to incorporate cloth simulation in the deepRL framework to learn a robust dressing control policy. We introduce a salient representation of haptic information to guide the dressing process and utilize it in the reward function to provide learning signals during training. In order to learn a prolonged sequence of motion involving a diverse set of manipulation skills, such as grasping the edge of the shirt or pulling on a sleeve, we find it necessary to separate the dressing task into several subtasks and learn a control policy for each subtask. We introduce a policy sequencing algorithm that matches the distribution of output states from one task to the input distribution for the next task in the sequence. We have used this approach to produce character controllers for several dressing tasks: putting on a t-shirt, putting on a jacket, and robot-assisted dressing of a sleeve.

Type of Medium: Online Resource

ISSN: 0730-0301 , 1557-7368

URL: Article

DOI: 10.1145/3272127.3275048

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2018

detail.hit.zdb_id: 2006336-2

detail.hit.zdb_id: 625686-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

6

Online Resource

Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection

Zhu, Chunbiao ; Zhang, Wenhao ; Li, Thomas H. ; [et al.]

Association for Computing Machinery (ACM) ; 2019

In: ACM Transactions on Intelligent Systems and Technology Vol. 10, No. 3 ( 2019-05-31), p. 1-20

add to mindlist on the mindlist

Details

In: ACM Transactions on Intelligent Systems and Technology, Association for Computing Machinery (ACM), Vol. 10, No. 3 ( 2019-05-31), p. 1-20

Abstract: Saliency detection aims to detect the most attractive objects in images and is widely used as a foundation for various applications. In this article, we propose a novel salient object detection algorithm for RGB-D images using center-dark channel priors. First, we generate an initial saliency map based on a color saliency map and a depth saliency map of a given RGB-D image. Then, we generate a center-dark channel map based on center saliency and dark channel priors. Finally, we fuse the initial saliency map with the center dark channel map to generate the final saliency map. Extensive evaluations over four benchmark datasets demonstrate that our proposed method performs favorably against most of the state-of-the-art approaches. Besides, we further discuss the application of the proposed algorithm in small target detection and demonstrate the universal value of center-dark channel priors in the field of object detection.

Type of Medium: Online Resource

ISSN: 2157-6904 , 2157-6912

URL: Article

DOI: 10.1145/3319368

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2019

detail.hit.zdb_id: 2584437-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

7

Online Resource

Marvista: Exploring the Design of a Human-AI Collaborative News Reading Tool

Chen, Xiang “Anthony” ; Wu, Chien-Sheng ; Murakhovs’ka, Lidiya ; [et al.]

Association for Computing Machinery (ACM) ; 2023

In: ACM Transactions on Computer-Human Interaction Vol. 30, No. 6 ( 2023-12-31), p. 1-27

add to mindlist on the mindlist

Details

In: ACM Transactions on Computer-Human Interaction, Association for Computing Machinery (ACM), Vol. 30, No. 6 ( 2023-12-31), p. 1-27

Abstract: We explore the design of Marvista—a human-AI collaborative tool that employs a suite of natural language processing models to provide end-to-end support for reading online news articles. Before reading an article, Marvista helps a user plan what to read by filtering text based on how much time one can spend and what questions one is interested to find out from the article. During reading, Marvista helps the user reflect on their understanding of each paragraph with AI-generated questions. After reading, Marvista generates an explainable human-AI summary that combines AI’s processing of the text, the user’s reading behavior, and user-generated data in the reading process. In contrast to prior work that offered (content-independent) interaction techniques or devices for reading, Marvista takes a human-AI collaborative approach that contributes text-specific guidance (content-aware) to support the entire reading process.

Type of Medium: Online Resource

ISSN: 1073-0516 , 1557-7325

URL: Article

DOI: 10.1145/3609331

RVK:

ST 640

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2023

detail.hit.zdb_id: 2006332-5

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

8

Online Resource

A Deep Bayesian Tensor-Based System for Video Recommendation

Lu, Wei ; Chung, Fu-Lai ; Jiang, Wenhao ; [et al.]

Association for Computing Machinery (ACM) ; 2019

In: ACM Transactions on Information Systems Vol. 37, No. 1 ( 2019-01-31), p. 1-22

add to mindlist on the mindlist

Details

In: ACM Transactions on Information Systems, Association for Computing Machinery (ACM), Vol. 37, No. 1 ( 2019-01-31), p. 1-22

Abstract: With the availability of abundant online multi-relational video information, recommender systems that can effectively exploit these sorts of data and suggest creatively interesting items will become increasingly important. Recent research illustrates that tensor models offer effective approaches for complex multi-relational data learning and missing element completion. So far, most tensor-based user clustering models have focused on the accuracy of recommendation. Given the dynamic nature of online media, recommendation in this setting is more challenging as it is difficult to capture the users’ dynamic topic distributions in sparse data settings as well as to identify unseen items as candidates of recommendation. Targeting at constructing a recommender system that can encourage more creativity, a deep Bayesian probabilistic tensor framework for tag and item recommendation is proposed. During the score ranking processes, a metric called Bayesian surprise is incorporated to increase the creativity of the recommended candidates. The new algorithm, called Deep Canonical PARAFAC Factorization (DCPF), is evaluated on both synthetic and large-scale real-world problems. An empirical study for video recommendation demonstrates the superiority of the proposed model and indicates that it can better capture the latent patterns of interactions and generates interesting recommendations based on creative tag combinations.

Type of Medium: Online Resource

ISSN: 1046-8188 , 1558-2868

URL: Article

DOI: 10.1145/3233773

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2019

detail.hit.zdb_id: 602352-6

detail.hit.zdb_id: 2006337-4

SSG: 24,1

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

9

Online Resource

Learning symmetric and low-energy locomotion

Yu, Wenhao ; Turk, Greg ; Liu, C. Karen

Association for Computing Machinery (ACM) ; 2018

In: ACM Transactions on Graphics Vol. 37, No. 4 ( 2018-08-31), p. 1-12

add to mindlist on the mindlist

Details

In: ACM Transactions on Graphics, Association for Computing Machinery (ACM), Vol. 37, No. 4 ( 2018-08-31), p. 1-12

Abstract: Learning locomotion skills is a challenging problem. To generate realistic and smooth locomotion, existing methods use motion capture, finite state machines or morphology-specific knowledge to guide the motion generation algorithms. Deep reinforcement learning (DRL) is a promising approach for the automatic creation of locomotion control. Indeed, a standard benchmark for DRL is to automatically create a running controller for a biped character from a simple reward function [Duan et al. 2016]. Although several different DRL algorithms can successfully create a running controller, the resulting motions usually look nothing like a real runner. This paper takes a minimalist learning approach to the locomotion problem, without the use of motion examples, finite state machines, or morphology-specific knowledge. We introduce two modifications to the DRL approach that, when used together, produce locomotion behaviors that are symmetric, low-energy, and much closer to that of a real person. First, we introduce a new term to the loss function (not the reward function) that encourages symmetric actions. Second, we introduce a new curriculum learning method that provides modulated physical assistance to help the character with left/right balance and forward movement. The algorithm automatically computes appropriate assistance to the character and gradually relaxes this assistance, so that eventually the character learns to move entirely without help. Because our method does not make use of motion capture data, it can be applied to a variety of character morphologies. We demonstrate locomotion controllers for the lower half of a biped, a full humanoid, a quadruped, and a hexapod. Our results show that learned policies are able to produce symmetric, low-energy gaits. In addition, speed-appropriate gait patterns emerge without any guidance from motion examples or contact planning.

Type of Medium: Online Resource

ISSN: 0730-0301 , 1557-7368

URL: Article

DOI: 10.1145/3197517.3201397

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2018

detail.hit.zdb_id: 2006336-2

detail.hit.zdb_id: 625686-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

10

Online Resource

FuncPipe: A Pipelined Serverless Framework for Fast and Cost-efficient Training of Deep Learning Models

Liu, Yunzhuo ; Jiang, Bo ; Guo, Tian ; [et al.]

Association for Computing Machinery (ACM) ; 2023

In: ACM SIGMETRICS Performance Evaluation Review Vol. 51, No. 1 ( 2023-06-26), p. 35-36

add to mindlist on the mindlist

Details

In: ACM SIGMETRICS Performance Evaluation Review, Association for Computing Machinery (ACM), Vol. 51, No. 1 ( 2023-06-26), p. 35-36

Abstract: Training deep learning (DL) models in the cloud has become a norm. With the emergence of serverless computing and its benefits of true pay-as-you-go pricing and scalability, systems researchers have recently started to provide support for serverless-based training. However, the ability to train DL models on serverless platforms is hindered by the resource limitations of today's serverless infrastructure and DL models' explosive requirement for memory and bandwidth. This paper describes FUNCPIPE, a novel pipelined training framework specifically designed for serverless platforms that enable fast and low-cost training of DL models. FUNCPIPE is designed with the key insight that model partitioning can be leveraged to bridge both memory and bandwidth gaps between the capacity of serverless functions and the requirement of DL training. Conceptually simple, we have to answer several design questions, including how to partition the model, configure each serverless function, and exploit each function's uplink/downlink bandwidth. We implement FUNCPIPE on two popular cloud serverless platforms and show that it achieves 7%-77% cost savings and 1.3X-2.2X speedup compared to state-of-the-art serverless-based frameworks.

Type of Medium: Online Resource

ISSN: 0163-5999

URL: Article

DOI: 10.1145/3606376.3593543

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2023

detail.hit.zdb_id: 199353-7

detail.hit.zdb_id: 2089001-1

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher