Genovar

A Detection and Visualization Tool
   for Genomic Variants

Kwang Su Jung1, Kiejung Park1, Sanghoon Moon2, Young Jin Kim2, Bong-jo Kim2


1 Division of Bio-Medical informatics, Cente for Genome Science,Korea National Institute of Health
2 Division of Structural and Functional Genomics, Center for Genome Science, Korea National Institute of Health

 

Availability:
freely distributed to public users and private institutions at http://genovar.sourceforge.net/

 

Contact: Kwang Su Jung (ksjung76.at.gmail.com)

 

Distribution: Download Genovar

 

Quick User Guide: Download Guide

 

Test files for detecting Copy Number Variation

Array CGH file
Probe and gene mapping file

 

Example Sequence Alignment Results (*.bam, *bai) on 1000 genomes :BAM file and its index ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data/NA19403/alignment/NA19403.chrom1.LS454.ssaha2.LWK.exon_targetted.20100311.bam
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data/NA19403/alignment/NA19403.chrom1.LS454.ssaha2.LWK.exon_targetted.20100311.bam.bai
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data/NA19397/alignment/NA19397.chrom1.LS454.ssaha2.LWK.exon_targetted.20100311.bam
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data/NA19397/alignment/NA19397.chrom1.LS454.ssaha2.LWK.exon_targetted.20100311.bam.bai

 

UCSC reference sequence file hg 19(GRCh 37) :FastA format

http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/chromFa.tar.gz
Indices of reference sequence files



Background

Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals.

 


Results

A JAVA-based stand-alone program called Genovar, which utilizes the Smith-Waterman Array (SW-ARRAY) algorithm to detect CNV regions, was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files.

 


Conclusions

Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. Genovar is available for download from theGenovar website (http://genovar.sourceforge.net/).

 

 

 

Screen shots

Figure 1. Chromosomal view and statistical summary .