goFish

module
v0.0.0-...-549aca6 Latest
Published: May 23, 2022 License: BSD-2-Clause

README

Go (lang) genomic scripts

To quickly install the goFish packages, run the command:

go get -v github.com/edotau/goFish

For a full golang build with GOROOT, GOPATH, and GOBIN set up:

git clone https://github.com/edotau/goFish.git; ./goFish/setup.sh

GoDocs are available: goFishDocs

Directories

Path Synopsis
Package algorithms contains basic implementations of popular algorithms in computer science
Package api is a collection of open source golang projects repurposed or simplified into goFish functions and/or packages
Package axt contains data structures to operate on pairwise alignment files produced by Blastz, an alignment tool available from Webb Miller's lab at Penn State University.
The following code was forked and modified from grailbio.
bash package implements simple bash commands that are sometimes useful when writing go scripts
Package bed provides an interface along with functions to operate on Browser Extensible Data (BED) formatted files.
Package bigWig is used to process bigWig files, a compressed binary format used to visualize aligned sequencing data
Package chain contains data structures and functions to process genome-to-genome pairwise alignments that allow gaps in both sequences simultaneously.
cmd
alleleStats
alleleStats is a tool to investigate heterozygous SNP variance and calculate Fisher's exact test to examine the significance of the association (or contingency) between heterozygous alleles
chainReader
General tool to examine chain alignments between two whole genomes, find inversions, or return a simple print of all chain regions
genePreDict
genePredDict is a software tool used to operate (concat, find, query, or collapse dups) on the UCSC geneSeq format
geneSeQuery
geneSeQueryL processes data tables from GTF, GFF3, and genePred gene feature formats, custom data frames, and SQL queries from Ensembl and UCSC databases
goIndels
goIndels is a quick script to find indels in SAM/BAM alignments
goToBed
goToBed is a simplified version of the UCSC overlap section to analyze overlapping and non-overlapping genomic regions in a dataset and filter variants of interest
nucmerParse
nucmerParse is used to parse and extract interesting genomic regions from nucmer suffix tree alignments
statFastQc
vimFastqc quickly accesses fastq quality control metrics
templates
Package templates contains golang template scripts for quick and easy golang programming
vimBam
vimBam
vimUrl
vimUrl can view HTTP URL links and print the data stream to stdout
Package code describes biological sequence letters coded by bytes and sometimes encoded into hash int64s
Package concurrency contains functions used to build concurrent processes and/or tasks in both pipelines and workflows. Very much still in development.
Package csv contains data structures to parse comma-separated value tables and perform statistical analysis
dataflow package is an implementation combining the workflow/task managing concepts most popular in data science.
Package fasta contains data structures for processes and functions that operate on fasta files containing base sequences
Package fastq contains data structures and functions to process sequencing data coming off the sequencer
GeneSeq contains functions and other data structures for processing gene models and annotations from different public databases like ENSEMBL and UCSC. Currently supports genePred and GTF formats.
GraphQL is a simple low-level client modified from https://github.com/machinebox/graphql
Package keras implements many functionalities from the popular Keras API which is a deep learning API written in Python, running on top of the machine learning platform TensorFlow.
Modified from https://github.com/cdipaolo/goml for the sole purpose of teaching myself how to implement machine learning models from scratch. Original code and documentation can be found at: https://github.com/cdipaolo/goml
Package reference contains reference information useful for building hash lookups of genomes
Package simpleio contains core utils for reading in data, optimized for memory allocation, speed, and performance
Package stats contains functions to perform statistical tests
Package vcf declares vcf struct data fields and contains methods and functions that operate on the Vcf struct
