rhdf5 - R Interface to HDF5
This package provides an interface between HDF5 and R. HDF5's main features are the ability to store and access very large and/or complex datasets and a wide variety of metadata on mass storage (disk) through a completely portable file format. The rhdf5 package is thus suited for the exchange of large and/or complex datasets between R and other software package, and for letting R applications work on datasets that are larger than the available RAM.
Last updated
infrastructuredataimporthdf5rhdf5curlopensslcpp
16.77 score 72 stars 230 dependents 6.3k scripts 43k downloads
biomaRt - Interface to BioMart databases (i.e. Ensembl)
In recent years a wealth of biological data has become available in public data repositories. Easy access to these valuable data resources and firm integration with data analysis is needed for comprehensive bioinformatics data analysis. biomaRt provides an interface to a growing collection of databases implementing the BioMart software suite (<https://www.ensembl.org/info/data/biomart/index.html>). The package enables retrieval of large amounts of data in a uniform way without the need to know the underlying database schemas or write complex SQL queries. The most prominent examples of BioMart databases are maintained by Ensembl, which provides biomaRt users direct access to a diverse set of data and enables a wide range of powerful online queries from gene annotation to database mining.
Last updated
annotationbioconductorbiomartensembl
16.50 score 49 stars 214 dependents 17k scripts 41k downloadsrhdf5 - R Interface to HDF5
This package provides an interface between HDF5 and R. HDF5's main features are the ability to store and access very large and/or complex datasets and a wide variety of metadata on mass storage (disk) through a completely portable file format. The rhdf5 package is thus suited for the exchange of large and/or complex datasets between R and other software package, and for letting R applications work on datasets that are larger than the available RAM.
Last updated
infrastructuredataimporthdf5rhdf5curlopensslcpp
14.29 score 72 stars 229 dependents 6.3k scriptsDESeq2 - Differential gene expression analysis based on the negative binomial distribution
Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution.
Last updated
sequencingrnaseqchipseqgeneexpressiontranscriptionnormalizationdifferentialexpressionbayesianregressionprincipalcomponentclusteringimmunooncologyopenblascpp
14.00 score 459 stars 123 dependents 25k scripts
biomaRt - Interface to BioMart databases (i.e. Ensembl)
In recent years a wealth of biological data has become available in public data repositories. Easy access to these valuable data resources and firm integration with data analysis is needed for comprehensive bioinformatics data analysis. biomaRt provides an interface to a growing collection of databases implementing the BioMart software suite (<https://www.ensembl.org/info/data/biomart/index.html>). The package enables retrieval of large amounts of data in a uniform way without the need to know the underlying database schemas or write complex SQL queries. The most prominent examples of BioMart databases are maintained by Ensembl, which provides biomaRt users direct access to a diverse set of data and enables a wide range of powerful online queries from gene annotation to database mining.
Last updated
annotationbioconductorbiomartensembl
13.89 score 50 stars 213 dependents 18k scripts
vsn - Variance stabilization and calibration for microarray data
The package implements a method for normalising microarray intensities from single- and multiple-color arrays. It can also be used for data from other technologies, as long as they have similar format. The method uses a robust variant of the maximum-likelihood estimator for an additive-multiplicative error model and affine calibration. The model incorporates data calibration step (a.k.a. normalization), a model for the dependence of the variance on the mean intensity and a variance stabilizing data transformation. Differences between transformed intensities are analogous to "normalized log-ratios". However, in contrast to the latter, their variance is independent of the mean, and they are usually more sensitive and specific in detecting differential transcription.
Last updated
microarrayonechanneltwochannelpreprocessing
11.61 score 55 dependents 1.3k scripts 9.1k downloads
Rhdf5lib - hdf5 library as an R package
Provides C and C++ hdf5 libraries.
Last updated
infrastructurebioconductorhdf5hdf5-library
11.48 score 7 stars 343 dependents 29 scripts 44k downloadsEBImage - Image processing and analysis toolbox for R
EBImage provides general purpose functionality for image processing and analysis. In the context of (high-throughput) microscopy-based cellular assays, EBImage offers tools to segment cells and extract quantitative cellular descriptors. This allows the automation of such tasks using the R programming language and facilitates the use of other tools in the R environment for signal processing, statistical modeling, machine learning and visualization with image data.
Last updated
visualizationbioinformaticsimage-analysisimage-processingcpp
10.91 score 77 stars 44 dependents 2.0k scriptsRarr - Read Zarr Files in R
The Zarr specification defines a format for chunked, compressed, N-dimensional arrays. It's design allows efficient access to subsets of the stored array, and supports both local and cloud storage systems. Rarr aims to implement this specification in R with minimal reliance on an external tools or libraries.
Last updated
dataimportbioconductorome-ngffome-zarron-diskout-of-memoryzarrc-blosclibzstd
10.52 score 53 stars 7 dependents 92 scriptsRarr - Read Zarr Files in R
The Zarr specification defines a format for chunked, compressed, N-dimensional arrays. It's design allows efficient access to subsets of the stored array, and supports both local and cloud storage systems. Rarr aims to implement this specification in R with minimal reliance on an external tools or libraries.
Last updated
dataimportbioconductorome-ngffome-zarron-diskout-of-memoryzarrc-blosclibzstd
10.52 score 53 stars 7 dependents 91 scripts 598 downloads
rhdf5filters - HDF5 Compression Filters
Provides a collection of additional compression filters for HDF5 datasets. The package is intended to provide seamless integration with rhdf5, however the compiled filters can also be used with external applications.
Last updated
infrastructuredataimportcompressionfilter-pluginhdf5
10.12 score 5 stars 230 dependents 9 scripts 42k downloads
Rhdf5lib - hdf5 library as an R package
Provides C and C++ hdf5 libraries.
Last updated
infrastructurebioconductorhdf5hdf5-library
9.84 score 7 stars 341 dependents 29 scriptsDEXSeq - Inference of differential exon usage in RNA-Seq
The package is focused on finding differential exon usage using RNA-seq exon counts between samples with different experimental designs. It provides functions that allows the user to make the necessary statistical tests based on a model that uses the negative binomial distribution to estimate the variance between biological replicates and generalized linear models for testing. The package also provides functions for the visualization and exploration of the results.
Last updated
immunooncologysequencingrnaseqdifferentialexpressionalternativesplicingdifferentialsplicinggeneexpressionvisualization
8.43 score 10 stars 5 dependents 462 scripts
rhdf5filters - HDF5 Compression Filters
Provides a collection of additional compression filters for HDF5 datasets. The package is intended to provide seamless integration with rhdf5, however the compiled filters can also be used with external applications.
Last updated
infrastructuredataimportcompressionfilter-pluginhdf5
8.41 score 5 stars 228 dependents 9 scriptsIHW - Independent Hypothesis Weighting
Independent hypothesis weighting (IHW) is a multiple testing procedure that increases power compared to the method of Benjamini and Hochberg by assigning data-driven weights to each hypothesis. The input to IHW is a two-column table of p-values and covariates. The covariate can be any continuous-valued or categorical variable that is thought to be informative on the statistical properties of each hypothesis test, while it is independent of the p-value under the null hypothesis.
Last updated
immunooncologymultiplecomparisonrnaseqihwpvalue-adjustment
8.13 score 16 stars 2 dependents 403 scriptsarrayQualityMetrics - Quality metrics report for microarray data sets
This package generates microarray quality metrics reports for data in Bioconductor microarray data containers (ExpressionSet, NChannelSet, AffyBatch). One and two color array platforms are supported.
Last updated
microarrayqualitycontrolonechanneltwochannelreportwritingbioconductor
8.09 score 1 stars 337 scripts 1.0k downloadsarrayQualityMetrics - Quality metrics report for microarray data sets
This package generates microarray quality metrics reports for data in Bioconductor microarray data containers (ExpressionSet, NChannelSet, AffyBatch). One and two color array platforms are supported.
Last updated
microarrayqualitycontrolonechanneltwochannelreportwritingbioconductor
7.02 score 1 stars 301 scriptslemur - Latent Embedding Multivariate Regression
Fit a latent embedding multivariate regression (LEMUR) model to multi-condition single-cell data. The model provides a parametric description of single-cell data measured with treatment vs. control or more complex experimental designs. The parametric model is used to (1) align conditions, (2) predict log fold changes between conditions for all cells, and (3) identify cell neighborhoods with consistent log fold changes. For those neighborhoods, a pseudobulked differential expression test is conducted to assess which genes are significantly changed.
Last updated
transcriptomicsdifferentialexpressionsinglecelldimensionreductionregressionquartoopenblascpp
6.87 score 101 stars 92 scriptslpsymphony - Symphony integer linear programming solver in R
This package was derived from Rsymphony_0.1-17 from CRAN. These packages provide an R interface to SYMPHONY, an open-source linear programming solver written in C++. The main difference between this package and Rsymphony is that it includes the solver source code (SYMPHONY version 5.6), while Rsymphony expects to find header and library files on the users' system. Thus the intention of lpsymphony is to provide an easy to install interface to SYMPHONY. For Windows, precompiled DLLs are included in this package.
Last updated
infrastructurethirdpartyclientcoinor-symphony
6.54 score 3 dependents 28 scripts 2.0k downloadsHilbertVis - Hilbert curve visualization
Functions to visualize long vectors of integer data by means of Hilbert curves
Last updated
visualization
4.79 score 2 dependents 17 scripts 552 downloadsDepInfeR - Inferring tumor-specific cancer dependencies through integrating ex-vivo drug response assays and drug-protein profiling
DepInfeR integrates two experimentally accessible input data matrices: the drug sensitivity profiles of cancer cell lines or primary tumors ex-vivo (X), and the drug affinities of a set of proteins (Y), to infer a matrix of molecular protein dependencies of the cancers (ß). DepInfeR deconvolutes the protein inhibition effect on the viability phenotype by using regularized multivariate linear regression. It assigns a “dependence coefficient” to each protein and each sample, and therefore could be used to gain a causal and accurate understanding of functional consequences of genomic aberrations in a heterogeneous disease, as well as to guide the choice of pharmacological intervention for a specific cancer type, sub-type, or an individual patient. For more information, please read out preprint on bioRxiv: https://doi.org/10.1101/2022.01.11.475864.
Last updated
softwareregressionpharmacogeneticspharmacogenomicsfunctionalgenomics
4.36 score 1 stars 23 scripts 319 downloadssplots - Visualization of high-throughput assays in microtitre plate or slide format
This package is here to support legacy usages of it, but it should not be used for new code development. It provides a single function, plotScreen, for visualising data in microtitre plate or slide format. As a better alternative for such functionality, please consider the platetools package on CRAN (https://cran.r-project.org/package=platetools and https://github.com/Swarchal/platetools), or ggplot2 (geom_raster, facet_wrap) as exemplified in the vignette of this package.
Last updated
visualizationsequencingmicrotitreplateassay
3.30 score 1 scripts 454 downloads
