kebabs

Kernel-Based Analysis of Biological Sequences


Bioconductor version: Release (3.19)

The package provides functionality for kernel-based analysis of DNA, RNA, and amino acid sequences via SVM-based methods. As core functionality, kebabs implements the following sequence kernels: spectrum kernel, mismatch kernel, gappy pair kernel, and motif kernel. Apart from an efficient implementation of the standard position-independent functionality, the kernels are extended in a novel way to take the position of patterns into account in the similarity measure. Because of the flexibility of the kernel formulation, other kernels such as the weighted degree kernel or the shifted weighted degree kernel with constant weighting of positions are included as special cases. An annotation-specific variant of the kernels uses annotation information placed along the sequence together with the patterns in the sequence. For all available kernels, the package can generate a kernel matrix or an explicit feature representation in dense or sparse format, which can be used with methods implemented in other R packages. With a focus on SVM-based methods, kebabs provides a framework that simplifies the use of the existing SVM implementations in kernlab, e1071, and LiblineaR. Binary and multi-class classification as well as regression tasks can be handled in a unified way without having to deal with the different functions, parameters, and formats of the selected SVM. To support the choice of hyperparameters, the package provides cross-validation (including grouped cross-validation), grid search, and model selection functions. For easier biological interpretation of the results, the package computes feature weights for all SVMs as well as prediction profiles, which show the contribution of individual sequence positions to the prediction result and indicate the relevance of sequence sections for the learning result and the underlying biological functions.
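To make the idea of a position-independent sequence kernel concrete, here is a minimal base-R sketch of a spectrum kernel: each sequence is mapped to its vector of k-mer counts, and the kernel value is the dot product of two such vectors. This is an illustration under simplifying assumptions, not the package's implementation; the function names `kmer_counts`, `spectrum_kernel`, and `spectrum_kernel_matrix` are hypothetical and are not part of the kebabs API.

```r
# Illustrative sketch only: hypothetical helper names, base R, no kebabs calls.

# Count all overlapping k-mers in a single sequence string.
kmer_counts <- function(s, k) {
  n <- nchar(s)
  if (n < k) return(table(character(0)))
  starts <- seq_len(n - k + 1)
  table(substring(s, starts, starts + k - 1))
}

# Unnormalized spectrum kernel: dot product of the two k-mer count vectors.
spectrum_kernel <- function(x, y, k) {
  cx <- kmer_counts(x, k)
  cy <- kmer_counts(y, k)
  shared <- intersect(names(cx), names(cy))
  sum(as.numeric(cx[shared]) * as.numeric(cy[shared]))
}

# Kernel matrix for a character vector of sequences.
# With normalized = TRUE, each entry is divided by sqrt(K[i, i] * K[j, j]),
# which assumes every sequence has at least k characters.
spectrum_kernel_matrix <- function(seqs, k, normalized = TRUE) {
  n <- length(seqs)
  K <- matrix(0, n, n, dimnames = list(names(seqs), names(seqs)))
  for (i in seq_len(n)) {
    for (j in seq_len(n)) {
      K[i, j] <- spectrum_kernel(seqs[i], seqs[j], k)
    }
  }
  if (normalized) {
    d <- sqrt(diag(K))
    K <- K / outer(d, d)
  }
  K
}

seqs <- c(s1 = "ACGTACGT", s2 = "ACGTTTTT", s3 = "GGGGCCCC")
K <- spectrum_kernel_matrix(seqs, k = 2)
print(round(K, 3))
```

In this toy example, s1 and s2 share the 2-mers AC, CG, and GT, so their unnormalized kernel value is 6, while s3 shares no 2-mer with either and gets a kernel value of 0. A matrix like `K` is the kind of object a kernel-based SVM consumes; the position-dependent and annotation-specific variants described above change which pattern occurrences are allowed to match.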

Author: Johannes Palme [aut], Ulrich Bodenhofer [aut,cre]

Maintainer: Ulrich Bodenhofer <ulrich at bodenhofer.com>

Citation: from within R, enter citation("kebabs").

Installation

To install this package, start R (version "4.4") and enter:


if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("kebabs")

For older versions of R, please refer to the appropriate Bioconductor release.

Documentation

To view documentation for the version of this package installed in your system, start R and enter:

browseVignettes("kebabs")
Vignette: KeBABS - An R Package for Kernel Based Analysis of Biological Sequences (PDF, R Script)
Reference Manual (PDF)
NEWS (Text)

Details

biocViews: Classification, Clustering, Regression, Software, SupportVectorMachine
Version: 1.38.0
In Bioconductor since: BioC 3.0 (R-3.1) (10 years)
License: GPL (>= 2.1)
Depends: R (>= 3.3.0), Biostrings (>= 2.35.5), kernlab
Imports: methods, stats, Rcpp (>= 0.11.2), Matrix (>= 1.5-0), XVector (>= 0.7.3), S4Vectors (>= 0.27.3), e1071, LiblineaR, graphics, grDevices, utils, apcluster
System Requirements:
URL: https://github.com/UBod/kebabs
Suggests: SparseM, Biobase, BiocGenerics, knitr
Linking To: IRanges, XVector, Biostrings, Rcpp, S4Vectors
Enhances:
Depends On Me: procoil
Imports Me: odseq
Suggests Me:
Links To Me:
Build Report

Package Archives

Follow the Installation instructions above to use this package in your R session.

Source Package: kebabs_1.38.0.tar.gz
Windows Binary: kebabs_1.38.0.zip
macOS Binary (x86_64): kebabs_1.38.0.tgz
macOS Binary (arm64): kebabs_1.38.0.tgz
Source Repository: git clone https://git.bioconductor.org/packages/kebabs
Source Repository (Developer Access): git clone git@git.bioconductor.org:packages/kebabs
Bioc Package Browser: https://code.bioconductor.org/browse/kebabs/
Package Short Url: https://bioconductor.org/packages/kebabs/
Package Downloads Report: Download Stats