GLORIA — GEOMAR Library Ocean Research Information Access

1

Online Resource

Error Selection Methods for Machine Translation Error Analysis

Akabe, Koichi ; Neubig, Graham ; Sakti, Sakriani ; [et al.]

Association for Natural Language Processing ; 2016

In: Journal of Natural Language Processing Vol. 23, No. 1 ( 2016), p. 87-117

add to mindlist on the mindlist

Details

In: Journal of Natural Language Processing, Association for Natural Language Processing, Vol. 23, No. 1 ( 2016), p. 87-117

Type of Medium: Online Resource

ISSN: 1340-7619 , 2185-8314

Uniform Title: 機械翻訳システムの誤り分析のための誤り箇所選択手法

URL: Article

DOI: 10.5715/jnlp.23.87

Language: English , Japanese

Publisher: Association for Natural Language Processing

Publication Date: 2016

detail.hit.zdb_id: 3050641-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

2

Online Resource

Improving Pivot Translation by Remembering the Pivot

Miura, Akiva ; Neubig, Graham ; Sakti, Sakriani ; [et al.]

Association for Natural Language Processing ; 2016

In: Journal of Natural Language Processing Vol. 23, No. 5 ( 2016), p. 499-528

add to mindlist on the mindlist

Details

In: Journal of Natural Language Processing, Association for Natural Language Processing, Vol. 23, No. 5 ( 2016), p. 499-528

Type of Medium: Online Resource

ISSN: 1340-7619 , 2185-8314

Uniform Title: 中間言語情報を記憶するピボット翻訳手法

URL: Article

DOI: 10.5715/jnlp.23.499

Language: English , Japanese

Publisher: Association for Natural Language Processing

Publication Date: 2016

detail.hit.zdb_id: 3050641-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

3

Online Resource

An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering

Sugiyama, Kyoshiro ; Mizukami, Masahiro ; Neubig, Graham ; [et al.]

Association for Natural Language Processing ; 2016

In: Journal of Natural Language Processing Vol. 23, No. 5 ( 2016), p. 437-461

add to mindlist on the mindlist

Details

In: Journal of Natural Language Processing, Association for Natural Language Processing, Vol. 23, No. 5 ( 2016), p. 437-461

Type of Medium: Online Resource

ISSN: 1340-7619 , 2185-8314

Uniform Title: 言語横断質問応答に適した機械翻訳評価尺度の調査

URL: Article

DOI: 10.5715/jnlp.23.437

Language: English , Japanese

Publisher: Association for Natural Language Processing

Publication Date: 2016

detail.hit.zdb_id: 3050641-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

4

Online Resource

A comparative study of dictionaries and corpora as methods for language resource addition

Mori, Shinsuke ; Neubig, Graham

Springer Science and Business Media LLC ; 2016

In: Language Resources and Evaluation Vol. 50, No. 2 ( 2016-6), p. 245-261

add to mindlist on the mindlist

Details

In: Language Resources and Evaluation, Springer Science and Business Media LLC, Vol. 50, No. 2 ( 2016-6), p. 245-261

Type of Medium: Online Resource

ISSN: 1574-020X , 1574-0218

URL: Article

DOI: 10.1007/s10579-016-9354-7

Language: English

Publisher: Springer Science and Business Media LLC

Publication Date: 2016

detail.hit.zdb_id: 2195235-8

SSG: 24

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

5

Online Resource

A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation

TANAKA, Kou ; TODA, Tomoki ; NEUBIG, Graham ; [et al.]

Institute of Electronics, Information and Communications Engineers (IEICE) ; 2014

In: IEICE Transactions on Information and Systems Vol. E97.D, No. 6 ( 2014), p. 1429-1437

add to mindlist on the mindlist

Details

In: IEICE Transactions on Information and Systems, Institute of Electronics, Information and Communications Engineers (IEICE), Vol. E97.D, No. 6 ( 2014), p. 1429-1437

Type of Medium: Online Resource

ISSN: 0916-8532 , 1745-1361

URL: Article

DOI: 10.1587/transinf.E97.D.1429

Language: English

Publisher: Institute of Electronics, Information and Communications Engineers (IEICE)

Publication Date: 2014

detail.hit.zdb_id: 2214518-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

6

Online Resource

Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior

MAKI, Hayato ; TODA, Tomoki ; SAKTI, Sakriani ; [et al.]

Institute of Electronics, Information and Communications Engineers (IEICE) ; 2016

In: IEICE Transactions on Information and Systems Vol. E99.D, No. 6 ( 2016), p. 1437-1446

add to mindlist on the mindlist

Details

In: IEICE Transactions on Information and Systems, Institute of Electronics, Information and Communications Engineers (IEICE), Vol. E99.D, No. 6 ( 2016), p. 1437-1446

Type of Medium: Online Resource

ISSN: 0916-8532 , 1745-1361

URL: Article

DOI: 10.1587/transinf.2015CBP0008

Language: English

Publisher: Institute of Electronics, Information and Communications Engineers (IEICE)

Publication Date: 2016

detail.hit.zdb_id: 2214518-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

7

Online Resource

A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models

TAKAMICHI, Shinnosuke ; TODA, Tomoki ; NEUBIG, Graham ; [et al.]

Institute of Electronics, Information and Communications Engineers (IEICE) ; 2016

In: IEICE Transactions on Information and Systems Vol. E99.D, No. 10 ( 2016), p. 2490-2498

add to mindlist on the mindlist

Details

In: IEICE Transactions on Information and Systems, Institute of Electronics, Information and Communications Engineers (IEICE), Vol. E99.D, No. 10 ( 2016), p. 2490-2498

Type of Medium: Online Resource

ISSN: 0916-8532 , 1745-1361

URL: Article

DOI: 10.1587/transinf.2016SLP0020

Language: English

Publisher: Institute of Electronics, Information and Communications Engineers (IEICE)

Publication Date: 2016

detail.hit.zdb_id: 2214518-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

8

Online Resource

DIRE and its Data: Neural Decompiled Variable Renamings with Respect to Software Class

Dramko, Luke ; Lacomis, Jeremy ; Yin, Pengcheng ; [et al.]

Association for Computing Machinery (ACM) ; 2023

In: ACM Transactions on Software Engineering and Methodology Vol. 32, No. 2 ( 2023-04-30), p. 1-34

add to mindlist on the mindlist

Details

In: ACM Transactions on Software Engineering and Methodology, Association for Computing Machinery (ACM), Vol. 32, No. 2 ( 2023-04-30), p. 1-34

Abstract: The decompiler is one of the most common tools for examining executable binaries without the corresponding source code. It transforms binaries into high-level code, reversing the compilation process. Unfortunately, decompiler output is far from readable because the decompilation process is often incomplete. State-of-the-art techniques use machine learning to predict missing information like variable names. While these approaches are often able to suggest good variable names in context, no existing work examines how the selection of training data influences these machine learning models. We investigate how data provenance and the quality of training data affect performance, and how well, if at all, trained models generalize across software domains. We focus on the variable renaming problem using one such machine learning model, DIRE . We first describe DIRE in detail and the accompanying technique used to generate training data from raw code. We also evaluate DIRE ’s overall performance without respect to data quality. Next, we show how training on more popular, possibly higher quality code (measured using GitHub stars) leads to a more generalizable model because popular code tends to have more diverse variable names. Finally, we evaluate how well DIRE predicts domain-specific identifiers, propose a modification to incorporate domain information, and show that it can predict identifiers in domain-specific scenarios 23% more frequently than the original DIRE model.

Type of Medium: Online Resource

ISSN: 1049-331X , 1557-7392

URL: Article

DOI: 10.1145/3546946

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2023

detail.hit.zdb_id: 2006459-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

9

Online Resource

In-IDE Code Generation from Natural Language: Promise and Challenges

Xu, Frank F. ; Vasilescu, Bogdan ; Neubig, Graham

Association for Computing Machinery (ACM) ; 2022

In: ACM Transactions on Software Engineering and Methodology Vol. 31, No. 2 ( 2022-04-30), p. 1-47

add to mindlist on the mindlist

Details

In: ACM Transactions on Software Engineering and Methodology, Association for Computing Machinery (ACM), Vol. 31, No. 2 ( 2022-04-30), p. 1-47

Abstract: A great part of software development involves conceptualizing or communicating the underlying procedures and logic that needs to be expressed in programs. One major difficulty of programming is turning concept into code , especially when dealing with the APIs of unfamiliar libraries. Recently, there has been a proliferation of machine learning methods for code generation and retrieval from natural language queries , but these have primarily been evaluated purely based on retrieval accuracy or overlap of generated code with developer-written code, and the actual effect of these methods on the developer workflow is surprisingly unattested. In this article, we perform the first comprehensive investigation of the promise and challenges of using such technology inside the PyCharm IDE, asking, “At the current state of technology does it improve developer productivity or accuracy, how does it affect the developer experience, and what are the remaining gaps and challenges?” To facilitate the study, we first develop a plugin for the PyCharm IDE that implements a hybrid of code generation and code retrieval functionality, and we orchestrate virtual environments to enable collection of many user events (e.g., web browsing, keystrokes, fine-grained code edits). We ask developers with various backgrounds to complete 7 varieties of 14 Python programming tasks ranging from basic file manipulation to machine learning or data visualization, with or without the help of the plugin. While qualitative surveys of developer experience are largely positive, quantitative results with regards to increased productivity, code quality, or program correctness are inconclusive. Further analysis identifies several pain points that could improve the effectiveness of future machine learning-based code generation/retrieval developer assistants and demonstrates when developers prefer code generation over code retrieval and vice versa. We release all data and software to pave the road for future empirical studies on this topic, as well as development of better code generation models.

Type of Medium: Online Resource

ISSN: 1049-331X , 1557-7392

URL: Article

DOI: 10.1145/3487569

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2022

detail.hit.zdb_id: 2006459-7

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

10

Online Resource

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Liu, Pengfei ; Yuan, Weizhe ; Fu, Jinlan ; [et al.]

Association for Computing Machinery (ACM) ; 2023

In: ACM Computing Surveys Vol. 55, No. 9 ( 2023-09-30), p. 1-35

add to mindlist on the mindlist

Details

In: ACM Computing Surveys, Association for Computing Machinery (ACM), Vol. 55, No. 9 ( 2023-09-30), p. 1-35

Abstract: This article surveys and organizes research works in a new paradigm in natural language processing, which we dub “prompt-based learning.” Unlike traditional supervised learning, which trains a model to take in an input x and predict an output y as P ( y|x ), prompt-based learning is based on language models that model the probability of text directly. To use these models to perform prediction tasks, the original input x is modified using a template into a textual string prompt x′ that has some unfilled slots, and then the language model is used to probabilistically fill the unfilled information to obtain a final string x̂ , from which the final output y can be derived. This framework is powerful and attractive for a number of reasons: It allows the language model to be pre-trained on massive amounts of raw text, and by defining a new prompting function the model is able to perform few-shot or even zero-shot learning, adapting to new scenarios with few or no labeled data. In this article, we introduce the basics of this promising paradigm, describe a unified set of mathematical notations that can cover a wide variety of existing work, and organize existing work along several dimensions, e.g., the choice of pre-trained language models, prompts, and tuning strategies. To make the field more accessible to interested beginners, we not only make a systematic review of existing works and a highly structured typology of prompt-based concepts but also release other resources, e.g., a website NLPedia–Pretrain including constantly updated survey and paperlist.

Type of Medium: Online Resource

ISSN: 0360-0300 , 1557-7341

URL: Article

DOI: 10.1145/3560815

RVK:

SA 2943

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2023

detail.hit.zdb_id: 215909-0

detail.hit.zdb_id: 1495309-2

detail.hit.zdb_id: 626472-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher