Gene PCC8801_3121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3121 
Symbol 
ID7105096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3263154 
End bp3264839 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content46% 
IMG OID643476147 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002373258 
Protein GI218247887 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATA ACCTTAGAAG TCGAATTGTT ACCCAAGGAA GTCAACGAAC CCCAAACCGG 
GCTATGCTCA GGGCTGTAGG GTTTGGAGAC AATGACTTTA TTAAACCAAT CGTTGGTGTA
GCTAATGGAT ATAGTACCAT TACGCCCTGT AATATGGGAC TCAATGACCT AGCTTTGCGG
GCCGAAGCGG GATTAAAAAG TGCCGGAGCC ATGCCACAAA TGTTTGGTAC CATTACCATT
AGTGATGGTA TCTCCATGGG GACAGAAGGA ATGAAATATT CCCTTGTCTC ACGGGAAGTT
ATCGCAGACT CCATCGAAAC CGCTTGTAAT GGTCAAAGTA TGGATGGAGT CATTGCCATT
GGAGGGTGTG ATAAGAATAT GCCAGGGGCT ATGATTGCTA TAGCCCGAAT GAATATCCCT
GCTATTTTCG TCTATGGGGG TACGATTAAA CCCGGCCATT ACCAGGGTGA AGATTTAACC
GTTGTCAGTG CCTTTGAAGC CGTAGGAAAG TATAGCGCGG GTAAAATAGA TGATAACGAA
TTATTAGCCA TTGAACGCAA TGCTTGTCCG GGTGCGGGGT CTTGTGGGGG AATGTTTACT
GCTAACACCA TGTCATCCGC GTTTGAAGCG ATGGGGATGA GTTTACCCTA TTCTTCTACC
ATGGCCGCAG AAGATGCTGA AAAAGCCGAT AGTACCGAAC AATCGGCCTT TGTTTTGGTT
GAGGCTATCC GTAAACAGAT TTTACCTAGT CAGATTTTAA CCCGTAAAGC CTTTGAAAAT
GCGATCGCGG TCATTATGGC TGTCGGAGGG TCAACCAATG CAGTATTACA CCTATTAGCC
ATTGCTAATA CCATGGGGGT TGAGTTGACT ATCGACGACT TTGAAACCAT TCGTAAAAAA
GTTCCAGTTT TGTGTGATCT CAAACCATCG GGACGCTACG TTACCGTTAA TTTACATCAA
GCAGGGGGCA TTCCCCAAGT GATGAAAATG CTGTTAAACC ATGGATTATT ACACGGGGAT
GCGTTAACCA TTTCCGGACA AACTATCGCG GAAGTTTTGC AAGATATTCC CGATGAACCT
CCCGCGAATC AAGATGTCAT TCGTCCTTGG AATAACCCGG TTTATCCAGA AGGACATTTA
GCCATCCTCA AAGGGAATTT AGCCGCAGAA GGTGCGGTAG CTAAAATTAG TGGGGTCAAA
AAACCTAAGA TGACCGGTCC AGCAAGGGTT TTTGAGTCAG AAGAAGCGTG TTTAGACGCA
ATTTTAGCCG GAAAAATTAG CGCGGGAGAT GTCGTTATTG TTCGCTACGA AGGACCCAAA
GGAGGCCCCG GAATGCGAGA AATGTTAGCC CCCACGTCTG CTATTATTGG CGCAGGATTA
GGTGATTCAG TGGGATTAAT TACCGATGGA CGGTTCTCTG GAGGAACCTA CGGGTTAGTG
GTTGGCCATG TCGCTCCTGA AGCCTTTGTT GGCGGTACAA TTGCCTTAGT TAACGAGGGA
GATAGTGTCA CCATTGATGC AGAAAAACGG CTATTGCAAT TAAATGTTTC TGACGAAGAA
TTAGCTACCC GTCGCGCTCA TTGGACTCCC CCTAAACCGC GCTATCAACG GGGAATTTTA
GGGAAGTATG CTAAGTTAGT TTCTTCGAGT AGTTTAGGCG CAGTGACCGA TGTAGAGCTA
TTCTAG
 
Protein sequence
MSDNLRSRIV TQGSQRTPNR AMLRAVGFGD NDFIKPIVGV ANGYSTITPC NMGLNDLALR 
AEAGLKSAGA MPQMFGTITI SDGISMGTEG MKYSLVSREV IADSIETACN GQSMDGVIAI
GGCDKNMPGA MIAIARMNIP AIFVYGGTIK PGHYQGEDLT VVSAFEAVGK YSAGKIDDNE
LLAIERNACP GAGSCGGMFT ANTMSSAFEA MGMSLPYSST MAAEDAEKAD STEQSAFVLV
EAIRKQILPS QILTRKAFEN AIAVIMAVGG STNAVLHLLA IANTMGVELT IDDFETIRKK
VPVLCDLKPS GRYVTVNLHQ AGGIPQVMKM LLNHGLLHGD ALTISGQTIA EVLQDIPDEP
PANQDVIRPW NNPVYPEGHL AILKGNLAAE GAVAKISGVK KPKMTGPARV FESEEACLDA
ILAGKISAGD VVIVRYEGPK GGPGMREMLA PTSAIIGAGL GDSVGLITDG RFSGGTYGLV
VGHVAPEAFV GGTIALVNEG DSVTIDAEKR LLQLNVSDEE LATRRAHWTP PKPRYQRGIL
GKYAKLVSSS SLGAVTDVEL F