Gene Cyan8802_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2999 
Symbol 
ID8392327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3032992 
End bp3034677 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content46% 
IMG OID644980946 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003138680 
Protein GI257060792 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.398415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATA ACCTTAGAAG TCGAATTGTT ACCCAAGGAA GTCAACGAAC CCCAAACCGG 
GCTATGCTCA GGGCTGTAGG GTTTGGAGAC AATGACTTTA TTAAACCAAT CGTTGGTGTA
GCTAATGGAT ATAGTACCAT TACGCCCTGT AATATGGGAC TCAATGACCT AGCTTTGCGG
GCCGAAGCGG GATTAAAAAG TGCCGGAGCC ATGCCACAAA TGTTTGGTAC CATTACCATT
AGTGATGGTA TCTCCATGGG GACAGAAGGA ATGAAATATT CCCTTGTCTC ACGGGAAGTT
ATCGCAGACT CCATCGAAAC CGCTTGTAAT GGTCAAAGTA TGGATGGAGT CATTGCCATT
GGAGGGTGTG ATAAGAATAT GCCAGGGGCT ATGATTGCTA TAGCCCGAAT GAATATCCCT
GCTATTTTCG TCTATGGGGG TACGATTAAA CCCGGCCATT ACCAGGGTGA AGATTTAACC
GTTGTCAGTG CCTTTGAAGC CGTAGGAAAG TATAGCGCGG GTAAAATAGA TGATAACGAA
TTATTAGCCA TTGAACGCAA TGCTTGTCCG GGTGCGGGGT CTTGTGGGGG AATGTTTACT
GCTAACACCA TGTCATCCGC GTTTGAAGCG ATGGGGATGA GTTTACCCTA TTCTTCTACC
ATGGCCGCAG AAGATGCTGA AAAAGCCGAT AGTACCGAAC AATCGGCCTT TGTTTTGGTT
GAGGCTATCC GTAAACAGAT TTTACCTAGT CAGATTTTAA CCCGTAAAGC CTTTGAAAAT
GCGATCGCGG TCATTATGGC TGTTGGAGGG TCAACCAATG CAGTATTACA CCTATTAGCC
ATTGCTAATA CCATCGGGGT TGAGTTGAGT ATCGACGACT TTGAAACCAT TCGTAAAAAA
GTTCCAGTTT TGTGTGATCT CAAACCATCG GGACGCTACG TTACCGTTAA TTTACATCAA
GCGGGGGGCA TTCCCCAAGT GATGAAAATG CTGTTAAACC ATGGATTATT ACACGGGGAT
GCGTTAACCA TTTCCGGACA AACTATTGCG GAAGTTTTGC AAGATATTCC CGATGAACCT
CCCGCTAATC AAGATGTCAT TCGTCCTTGG AATAACCCGG TTTATCCAGA AGGACATTTA
GCCATCCTCA AAGGGAATTT AGCCGCAGAA GGTGCGGTAG CTAAAATTAG TGGGGTCAAA
AAACCTAAGA TGACCGGTCC AGCAAGGGTT TTTGAGTCAG AAGAAGCGTG TTTAGACGCA
ATTTTAGCCG GAAAAATTAG CGCGGGAGAT GTCGTTATCG TTCGCTACGA AGGACCCAAA
GGAGGCCCCG GAATGCGAGA AATGTTAGCC CCCACGTCTG CTATTATTGG CGCAGGATTA
GGTGATTCAG TGGGATTAAT TACCGATGGA CGGTTCTCTG GAGGAACCTA CGGGTTAGTA
GTTGGCCATG TCGCTCCTGA AGCCTTTGTT GGCGGTACAA TTGCCTTAGT TAACGAGGGA
GATAGTATCA CCATTGATGC AGAAAAACGG CTATTGCAAT TAAATGTTTC TGACGAAGAA
TTAACTACCC GTCGCGCTCA TTGGACTCCC CCTAAACCGC GCTATCAACG GGGAATTTTA
GGGAAGTATG CTAAGTTAGT TTCTTCGAGT AGTTTAGGCG CAGTGACCGA TGTAGAGCTA
TTCTAG
 
Protein sequence
MSDNLRSRIV TQGSQRTPNR AMLRAVGFGD NDFIKPIVGV ANGYSTITPC NMGLNDLALR 
AEAGLKSAGA MPQMFGTITI SDGISMGTEG MKYSLVSREV IADSIETACN GQSMDGVIAI
GGCDKNMPGA MIAIARMNIP AIFVYGGTIK PGHYQGEDLT VVSAFEAVGK YSAGKIDDNE
LLAIERNACP GAGSCGGMFT ANTMSSAFEA MGMSLPYSST MAAEDAEKAD STEQSAFVLV
EAIRKQILPS QILTRKAFEN AIAVIMAVGG STNAVLHLLA IANTIGVELS IDDFETIRKK
VPVLCDLKPS GRYVTVNLHQ AGGIPQVMKM LLNHGLLHGD ALTISGQTIA EVLQDIPDEP
PANQDVIRPW NNPVYPEGHL AILKGNLAAE GAVAKISGVK KPKMTGPARV FESEEACLDA
ILAGKISAGD VVIVRYEGPK GGPGMREMLA PTSAIIGAGL GDSVGLITDG RFSGGTYGLV
VGHVAPEAFV GGTIALVNEG DSITIDAEKR LLQLNVSDEE LTTRRAHWTP PKPRYQRGIL
GKYAKLVSSS SLGAVTDVEL F