bioconductor-dupchecker

downloads

Meta-analysis has become a popular approach for high-throughput genomic data analysis because it often can significantly increase power to detect biological signals or patterns in datasets. However, when using public-available databases for meta-analysis, duplication of samples is an often encountered problem, especially for gene expression data. Not removing duplicates would make study results questionable. We developed a Bioconductor package DupChecker that efficiently identifies duplicated samples by generating MD5 fingerprints for raw data.
Home http://bioconductor.org/packages/3.7/bioc/html/DupChecker.html
Versions 1.18.0, 1.16.0
License GPL (>= 2)
Recipe https://github.com/bioconda/bioconda-recipes/tree/master/recipes/bioconductor-dupchecker
Links biotools: dupchecker

Installation

With an activated Bioconda channel (see 2. Set up channels), install with:

conda install bioconductor-dupchecker

and update with:

conda update bioconductor-dupchecker

docker

A Docker container is available at https://quay.io/repository/biocontainers/bioconductor-dupchecker.