Protein coding sequences (CDS) predicted from HTC and genome sequences from SGN BAC clone.
13227 HTCs from Kazusa DNA Research Institute and 987 BACs from SGN (November 2008) were used.
1. BAC clones were searched by BLASTN (1e-50) between HTCs and SGN BAC clones.
2. To obtain putatibe exon and intron sequences,
HTC sequences and genomic sequences from BAC clones were aligned by est2genome.
Putative CDS were obtained from exons in genomic sequences .
4. Translated CDS with 3 ORFs were obtained as putative protein sequences.