Gene Syncc9605_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2666 
Symbol 
ID3736311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp2476630 
End bp2477697 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content62% 
IMG OID637777250 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_382946 
Protein GI78214167 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.515135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA CCTCCGATTT GCATGTGGTG GAGACACGTC CCCTGGTGGC CCCTGCTGTG 
TTGCATCAGG AACTGCCCAT GGACGCGGCG GCGCTCGAGA CCGTGGCGTC AGCTCGCCGA
CGCATTCAAG ACTTTCTCAG TGGCCGTGAT CAGCGCTTGC TGGTGGTGGT GGGCCCTTGC
TCTGTTCACG ACGTCAAAGC TGCCCGGGAG TACGCCCAGC GCCTGGCTCC GATCCGCGAG
CGCTTGAAGG ATCAGCTCGA GGTGGTGATG CGGGTGTATT TCGAGAAGCC GCGCACCACG
GTTGGTTGGA AGGGACTTAT CAATGACCCC CACCTCGATA ACTCCTACGA CATCAACACC
GGTCTGCGGC GAGCCCGAGG ACTGCTGTTG GATCTCGCCC GGGAGGGGAT GCCTGCTGCA
ACGGAACTGC TCGATCCCGT GGTTCCCCAG TACATCGCTG ATTTGATCAG CTGGACTGCG
ATTGGCGCCA GGACGACGGA AAGTCAGACC CACCGCGAAA TGGCCTCAGG GCTGTCGATG
CCGATCGGCT ACAAGAACAG CACCAACGGC AGCGCCACCA TCGCGATCAA TGCCATGCAG
GCAGCGGGGA AGCCACATCA CTTCCTCGGA ATCAATCGCG ATGGGCATGC TTCGATCGTT
AGCACCACGG GCAACCCCTA CGGCCACCTC GTGCTGCGGG GCGGCAGCCA GGGCAGCAAT
TACCACCTGG AGGCTGTGCA GGAAGCCGCA GCCGAGTTGA GCCAGGCCGG TCTGCAGGAT
CGACTGATGG TGGATTGCAG CCATGCCAAC AGCAACAAAG ACTTCCGTCG ACAGGCAGAC
GTGCTGGCCA GCGTTGCTGA GCAGTTGCGA GGGGGCTCCA ACCACGTGAT GGGCGTGATG
ATTGAGAGCC ATCTGGTGGA AGGCAACCAG AAGCTCAACG CCGACCTGAC GCAGCTGACC
TATGGCCAGA GCGTCACGGA TGCCTGCATC AGCCTCGAGA CCACCGAAAC CTTGCTGGAG
GATCTGGCCG CGGCCGTGGC CAGTCGCAAA CAGACGGTCA CCGCATGA
 
Protein sequence
MATTSDLHVV ETRPLVAPAV LHQELPMDAA ALETVASARR RIQDFLSGRD QRLLVVVGPC 
SVHDVKAARE YAQRLAPIRE RLKDQLEVVM RVYFEKPRTT VGWKGLINDP HLDNSYDINT
GLRRARGLLL DLAREGMPAA TELLDPVVPQ YIADLISWTA IGARTTESQT HREMASGLSM
PIGYKNSTNG SATIAINAMQ AAGKPHHFLG INRDGHASIV STTGNPYGHL VLRGGSQGSN
YHLEAVQEAA AELSQAGLQD RLMVDCSHAN SNKDFRRQAD VLASVAEQLR GGSNHVMGVM
IESHLVEGNQ KLNADLTQLT YGQSVTDACI SLETTETLLE DLAAAVASRK QTVTA