GLORIA

GEOMAR Library Ocean Research Information Access

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
Filter
  • Oxford University Press (OUP)  (11)
  • Pan, Yi  (11)
Material
Publisher
  • Oxford University Press (OUP)  (11)
Language
Years
  • 1
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2016
    In:  Bioinformatics Vol. 32, No. 12 ( 2016-06-15), p. 1788-1796
    In: Bioinformatics, Oxford University Press (OUP), Vol. 32, No. 12 ( 2016-06-15), p. 1788-1796
    Abstract: Motivation: Advances of next generation sequencing technologies and availability of short read data enable the detection of structural variations (SVs). Deletions, an important type of SVs, have been suggested in association with genetic diseases. There are three types of deletions: blunt deletions, deletions with microhomologies and deletions with microsinsertions. The last two types are very common in the human genome, but they pose difficulty for the detection. Furthermore, finding deletions from sequencing data remains challenging. It is highly appealing to develop sensitive and accurate methods to detect deletions from sequencing data, especially deletions with microhomology and deletions with microinsertion. Results: We present a novel method called Sprites (SPlit Read re-alIgnment To dEtect Structural variants) which finds deletions from sequencing data. It aligns a whole soft-clipping read rather than its clipped part to the target sequence, a segment of the reference which is determined by spanning reads, in order to find the longest prefix or suffix of the read that has a match in the target sequence. This alignment aims to solve the problem of deletions with microhomologies and deletions with microinsertions. Using both simulated and real data we show that Sprites performs better on detecting deletions compared with other current methods in terms of F-score. Availability and implementation: Sprites is open source software and freely available at https://github.com/zhangzhen/sprites. Contact:  jxwang@mail.csu.edu.cn Supplementary data:  Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4811 , 1367-4803
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2016
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 2
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2019
    In:  Bioinformatics Vol. 35, No. 7 ( 2019-04-01), p. 1142-1150
    In: Bioinformatics, Oxford University Press (OUP), Vol. 35, No. 7 ( 2019-04-01), p. 1142-1150
    Abstract: Scaffolding is an essential step during the de novo sequence assembly process to infer the direction and order relationships between the contigs and make the sequence assembly results more continuous and complete. However, scaffolding still faces the challenges of repetitive regions in genome, sequencing errors and uneven sequencing depth. Moreover, the accuracy of scaffolding greatly depends on the quality of contigs. Generally, the existing scaffolding methods construct a scaffold graph, and then optimize the graph by deleting spurious edges. Nevertheless, due to the wrong joints between contigs, some correct edges connecting contigs may be deleted. Results In this study, we present a novel scaffolding method SCOP, which is the first method to classify the contigs and utilize the vertices and edges to optimize the scaffold graph. Specially, SCOP employs alignment features and GC-content of paired reads to evaluate the quality of contigs (vertices), and divide the contigs into three types (True, Uncertain and Misassembled), and then optimizes the scaffold graph based on the classification of contigs together with the alignment of edges. The experiment results on the datasets of GAGE-A and GAGE-B demonstrate that SCOP performs better than 12 other competing scaffolders. Availability and implementation SCOP is publicly available for download at https://github.com/bioinfomaticsCSU/SCOP. Supplementary information Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4803 , 1367-4811
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2019
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 3
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2015
    In:  Bioinformatics Vol. 31, No. 6 ( 2015-03-15), p. 825-833
    In: Bioinformatics, Oxford University Press (OUP), Vol. 31, No. 6 ( 2015-03-15), p. 825-833
    Abstract: Motivation: In genome assembly, the primary issue is how to determine upstream and downstream sequence regions of sequence seeds for constructing long contigs or scaffolds. When extending one sequence seed, repetitive regions in the genome always cause multiple feasible extension candidates which increase the difficulty of genome assembly. The universally accepted solution is choosing one based on read overlaps and paired-end (mate-pair) reads. However, this solution faces difficulties with regard to some complex repetitive regions. In addition, sequencing errors may produce false repetitive regions and uneven sequencing depth leads some sequence regions to have too few or too many reads. All the aforementioned problems prohibit existing assemblers from getting satisfactory assembly results. Results: In this article, we develop an algorithm, called extract paths for genome assembly (EPGA), which extracts paths from De Bruijn graph for genome assembly. EPGA uses a new score function to evaluate extension candidates based on the distributions of reads and insert size. The distribution of reads can solve problems caused by sequencing errors and short repetitive regions. Through assessing the variation of the distribution of insert size, EPGA can solve problems introduced by some complex repetitive regions. For solving uneven sequencing depth, EPGA uses relative mapping to evaluate extension candidates. On real datasets, we compare the performance of EPGA and other popular assemblers. The experimental results demonstrate that EPGA can effectively obtain longer and more accurate contigs and scaffolds. Availability and implementation: EPGA is publicly available for download at https://github.com/bioinfomaticsCSU/EPGA. Contact:  jxwang@csu.edu.cn Supplementary information:  Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4811 , 1367-4803
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2015
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 4
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2017
    In:  Bioinformatics Vol. 33, No. 3 ( 2017-02-01), p. 458-460
    In: Bioinformatics, Oxford University Press (OUP), Vol. 33, No. 3 ( 2017-02-01), p. 458-460
    Abstract: Increasing evidences have demonstrated that long noncoding RNAs (lncRNAs) play important roles in many human diseases. Therefore, predicting novel lncRNA-disease associations would contribute to dissect the complex mechanisms of disease pathogenesis. Some computational methods have been developed to infer lncRNA-disease associations. However, most of these methods infer lncRNA-disease associations only based on single data resource. Results In this paper, we propose a new computational method to predict lncRNA-disease associations by integrating multiple biological data resources. Then, we implement this method as a web server for lncRNA-disease association prediction (LDAP). The input of the LDAP server is the lncRNA sequence. The LDAP predicts potential lncRNA-disease associations by using a bagging SVM classifier based on lncRNA similarity and disease similarity. Availability and Implementation The web server is available at http://bioinformatics.csu.edu.cn/ldap Supplementary information Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4803 , 1367-4811
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2017
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 5
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2015
    In:  Bioinformatics Vol. 31, No. 24 ( 2015-12-15), p. 3988-3990
    In: Bioinformatics, Oxford University Press (OUP), Vol. 31, No. 24 ( 2015-12-15), p. 3988-3990
    Abstract: Motivation: In genome assembly, as coverage of sequencing and genome size growing, most current softwares require a large memory for handling a great deal of sequence data. However, most researchers usually cannot meet the requirements of computing resources which prevent most current softwares from practical applications. Results: In this article, we present an update algorithm called EPGA2, which applies some new modules and can bring about improved assembly results in small memory. For reducing peak memory in genome assembly, EPGA2 adopts memory-efficient DSK to count K-mers and revised BCALM to construct De Bruijn Graph. Moreover, EPGA2 parallels the step of Contigs Merging and adds Errors Correction in its pipeline. Our experiments demonstrate that all these changes in EPGA2 are more useful for genome assembly. Availability and implementation: EPGA2 is publicly available for download at https://github.com/bioinfomaticsCSU/EPGA2. Contact:  jxwang@csu.edu.cn Supplementary information:  Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4811 , 1367-4803
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2015
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 6
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2019
    In:  Bioinformatics Vol. 35, No. 11 ( 2019-06-01), p. 1893-1900
    In: Bioinformatics, Oxford University Press (OUP), Vol. 35, No. 11 ( 2019-06-01), p. 1893-1900
    Abstract: Reconstructing gene regulatory networks (GRNs) based on gene expression profiles is still an enormous challenge in systems biology. Random forest-based methods have been proved a kind of efficient methods to evaluate the importance of gene regulations. Nevertheless, the accuracy of traditional methods can be further improved. With time-series gene expression data, exploiting inherent time information and high order time lag are promising strategies to improve the power and accuracy of GRNs inference. Results In this study, we propose a scalable, flexible approach called BiXGBoost to reconstruct GRNs. BiXGBoost is a bidirectional-based method by considering both candidate regulatory genes and target genes for a specific gene. Moreover, BiXGBoost utilizes time information efficiently and integrates XGBoost to evaluate the feature importance. Randomization and regularization are also applied in BiXGBoost to address the over-fitting problem. The results on DREAM4 and Escherichia coli datasets show the good performance of BiXGBoost on different scale of networks. Availability and implementation Our Python implementation of BiXGBoost is available at https://github.com/zrq0123/BiXGBoost. Supplementary information Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4803 , 1367-4811
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2019
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 7
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2016
    In:  Briefings in Bioinformatics
    In: Briefings in Bioinformatics, Oxford University Press (OUP)
    Type of Medium: Online Resource
    ISSN: 1467-5463 , 1477-4054
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2016
    detail.hit.zdb_id: 2036055-1
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 8
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2018
    In:  Bioinformatics Vol. 34, No. 19 ( 2018-10-01), p. 3357-3364
    In: Bioinformatics, Oxford University Press (OUP), Vol. 34, No. 19 ( 2018-10-01), p. 3357-3364
    Type of Medium: Online Resource
    ISSN: 1367-4803 , 1460-2059
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2018
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 9
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2016
    In:  Bioinformatics Vol. 32, No. 17 ( 2016-09-01), p. 2664-2671
    In: Bioinformatics, Oxford University Press (OUP), Vol. 32, No. 17 ( 2016-09-01), p. 2664-2671
    Abstract: Motivation: Drug repositioning, which aims to identify new indications for existing drugs, offers a promising alternative to reduce the total time and cost of traditional drug development. Many computational strategies for drug repositioning have been proposed, which are based on similarities among drugs and diseases. Current studies typically use either only drug-related properties (e.g. chemical structures) or only disease-related properties (e.g. phenotypes) to calculate drug or disease similarity, respectively, while not taking into account the influence of known drug–disease association information on the similarity measures. Results: In this article, based on the assumption that similar drugs are normally associated with similar diseases and vice versa, we propose a novel computational method named MBiRW, which utilizes some comprehensive similarity measures and Bi-Random walk (BiRW) algorithm to identify potential novel indications for a given drug. By integrating drug or disease features information with known drug–disease associations, the comprehensive similarity measures are firstly developed to calculate similarity for drugs and diseases. Then drug similarity network and disease similarity network are constructed, and they are incorporated into a heterogeneous network with known drug–disease interactions. Based on the drug–disease heterogeneous network, BiRW algorithm is adopted to predict novel potential drug–disease associations. Computational experiment results from various datasets demonstrate that the proposed approach has reliable prediction performance and outperforms several recent computational drug repositioning approaches. Moreover, case studies of five selected drugs further confirm the superior performance of our method to discover potential indications for drugs practically. Availability and Implementation:  http://github.com//bioinfomaticsCSU/MBiRW . Contact:  jxwang@mail.csu.edu.cn Supplementary information:  Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4811 , 1367-4803
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2016
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 10
    Online Resource
    Online Resource
    Oxford University Press (OUP) ; 2019
    In:  Bioinformatics Vol. 35, No. 19 ( 2019-10-01), p. 3642-3650
    In: Bioinformatics, Oxford University Press (OUP), Vol. 35, No. 19 ( 2019-10-01), p. 3642-3650
    Abstract: The development of single-cell RNA-sequencing (scRNA-seq) provides a new perspective to study biological problems at the single-cell level. One of the key issues in scRNA-seq analysis is to resolve the heterogeneity and diversity of cells, which is to cluster the cells into several groups. However, many existing clustering methods are designed to analyze bulk RNA-seq data, it is urgent to develop the new scRNA-seq clustering methods. Moreover, the high noise in scRNA-seq data also brings a lot of challenges to computational methods. Results In this study, we propose a novel scRNA-seq cell type detection method based on similarity learning, called SinNLRR. The method is motivated by the self-expression of the cells with the same group. Specifically, we impose the non-negative and low rank structure on the similarity matrix. We apply alternating direction method of multipliers to solve the optimization problem and propose an adaptive penalty selection method to avoid the sensitivity to the parameters. The learned similarity matrix could be incorporated with spectral clustering, t-distributed stochastic neighbor embedding for visualization and Laplace score for prioritizing gene markers. In contrast to other scRNA-seq clustering methods, our method achieves more robust and accurate results on different datasets. Availability and implementation Our MATLAB implementation of SinNLRR is available at, https://github.com/zrq0123/SinNLRR. Supplementary information Supplementary data are available at Bioinformatics online.
    Type of Medium: Online Resource
    ISSN: 1367-4803 , 1367-4811
    Language: English
    Publisher: Oxford University Press (OUP)
    Publication Date: 2019
    detail.hit.zdb_id: 1468345-3
    SSG: 12
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...