Technology&Platforms
Solexa Data Processing

Data processing & QC

The major research objective of this team is Solexa sequencing data processing and quality control. We do some research on the latest sequencing technology. After data processing, sequence files are transferred from the images. Data processing consists of three steps: image analysis, base calling and sequence analysis. In addition, we will take sequence files up with quality control. QC is an important and effectively measure for determining the sample libraries’ qualities, and it also serves for pointing out whether the sequencing succeeded or failed. For instance, in the pair-end sequencing, the insert size of pair-end is one of the quality index of succeeded library. We map our sequence to the reference. In alignment result, we call the distance between the coordinates of two reads in a pair-end as span. The overwhelming majority of span value of the pair-end mapped reads should be the normal insert size. So if the span does not match the expected insert size, the library was fail in building.   

Projects and brief description:        

 

 

1) Yanhuang Project

2) The International Giant Panda Genome Project

3) The Cucumber Genome Initiative

4) The International 1000 Genomes Project

Solexa Automated Pipeline

We evaluate the Genome Analyzer pipeline, including image analysis, base calling, and quality calibration,analyze all the factors which might affect the data quality, try to find solutions for the problems, and provide technical support to data processing & QC. We are making efforts to improve the performance and overcome the negative factors.

 Projects and brief description:

Developing a quality control system for the whole production process, and giving fast feedback to the experimental department; Applying the Genome Analyzer pipeline to BGI IT environment and making it efficient, automatic and flexible; Combining the data processing with the project management, data storage, and quality control system.

 

For all the projects in  BGI-SHENZHEN, the sequence data are yielded from our team. Meanwhile we implement strict quality control, so that we could build a well channel for estimating the sequencing eligiblility and value for advanced analysis.These large projects including below:

  News
  Related Information
·Solexa Data Processing
·De novo Sequencing
·Evolution & Comparative Genomics
·Transcription and Regulation Analysis
·Metagenomics & Bacteria
·Molecular Breeding
·Software Development/Database/IT and System
   |   BMC   |   Legal   |   Site Map   |   Privacy   |   Contact Copyright © 1999-2010 BGI