Basepair Blog

Preparing your NGS data for reproducibility and auditing

Audits are probably the most dreaded aspect of NGS data analysis, and like specters, they come in many forms to haunt you. You might need to publish results from several years back, only to realize that the open source tools you used have dramatically changed, forcing...

read more

Basepair API features

We’ve heard the war stories from our fellow bioinformaticians and lab directors: waking up bleary-eyed in the middle of the night to check on the results of an NGS data analysis pipeline, converting files to different formats, setting up the next step of their...

read more

Visualizing your NGS results with Basepair

Writers write, painters paint, genomics researchers… wait patiently for algorithm pipelines to finish processing NGS data, only to spend as much time visualizing results? As long-time researchers and bioinformaticians, we’re well aware that in NGS data analysis, every...

read more

How automated workflows simplify NGS analysis

NGS Data Analysis Bottlenecks One of the biggest bottlenecks in next generation sequencing (NGS) today is data analysis, which is surprising considering that we have massive compute available at our fingertips, the available infrastructure to process thousands of...

read more

Meet Basepair at Bio-IT World 2019

Basepair will be at this year’s Bio-IT World Meet Basepair in Boston, MA from April 16-18, and connect with us to learn how you can run NGS analysis at scale. Our platform is trusted by top institutions such as Partners Healthcare, Harvard Medical School, NYU,...

read more

Trimming for RNA-Seq data

Quality trimming decreases the overall number of reads, but increases to the total and proportion of uniquely mapped reads. Thus, you get more useful data for downstream analyses. Too aggressive quality trimming can negatively impact downstream analysis (in our example, estimation of gene expression). Hence, our own findings and that of the research community motivate the incorporation of a light amount of trimming in RNA-seq data (we use a Q threshold of 10).
read more