Gene Cyan8802_2521 details


Gene Information

Locus tag: Cyan8802_2521
Symbol:
ID: 8391846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2549772
End bp: 2550959
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 45%
IMG OID: 644980485
Product: glucose sorbosone dehydrogenase
Protein accession: YP_003138222
Protein GI: 257060334
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
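
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2549772, 2550959)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```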


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.21854
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.577738
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGAC TATCATTAAC CGTTAAACCA GTTTTCAACT CTGGACTCGG AGTGCTTATT 
TTTTTACTGC TAGGAGGATG TGTATTTAAA ACCAATTCCA CTGCCTCCCA ACCGTCTCAA
ATCCAAGAAT CCCAGAGAGA AGTCAATCAA GTTACTGTTG TTGAAGGATT AGAGCATCCC
TGGAGTATGG CCTGGCTTCC CAATGGGGAT ATGTTGATTA CAGAACGCCC TGGACGACTT
CGGTTAGTGA AAAATGGGGT ACTACAACCC ACTCCTATTG CTGGCGTGAT GGAGGTGCTT
CAATTGGGAC AGGGAGGGTT AATGGAGGTG TCATTACACC CTAACTTTAG CGAAAATCGC
CTTGTTTATT TCACCTATGC CCACGGAACT GCTGAAGCTA ATCGTACCCG CATTGCTCGT
GCTACCTTTG ATGGACAAGC GTTACGAGAC GTACAGGTGA TCTTTGAAGT GACTCCCGCT
AAACCAGGGG GACAACATTT TGGCTCTCGT TTAGTTTGGC TCAAAGACCA AACGATGTTG
ATTTCCATTG GAGATGGCGG AAATCCTCCG CTTTCTCTTG ATGGTGAACT GATTCGTCTA
CAAGCACAAA ATCTCGGTAA TCCTTTGGGT AAAATTATTC GCTTGAAGGA CGATGGCAGT
ATTCCCGATG ATAATCCCTT TGTTGGACAA AAGGAGGCAC AAAAAGCCAT TTGGAGCTAT
GGACATCGTA ATATTCAGGG ATTAGCGTTT AATCAAGCTA CTGGGCAAAT TTGGGCGACA
GAACACGGTT CTCGTGGTGG GGATGAACTC AATCAAATTC AAGCGGGTGA GAATTATGGT
TGGCCGCTGG TGACTCACAG TGAAGAATAT TTTGGGGGTG AAATTTCCAG TGAACGCTCC
CGCTCAGGAA TGATTGATCC TTTAATTGTT TGGACTCCTG CGATCGCTCC GTCTGGGTTA
GCTATTTATC AGGGGACTCG CTTTCCCCAG TGGCAAGGTG ATTTATTTGC GGGAGGATTG
GTTGGTAAAG AAGTTCGTCA TATTGACTTG GATAGTTCTG GTCAAGTGAT AGAACAAAAA
TCAATTCCCT TCTCTCAAAG AGTCCGTGAT GTTAAACAGG GTCCTGACGG GTTTCTTTAT
GTTTTAACCG ATGATACGAA TGGCAAGTTA ATCCGTCTTG AACCCTAA
 
Protein sequence
MNRLSLTVKP VFNSGLGVLI FLLLGGCVFK TNSTASQPSQ IQESQREVNQ VTVVEGLEHP 
WSMAWLPNGD MLITERPGRL RLVKNGVLQP TPIAGVMEVL QLGQGGLMEV SLHPNFSENR
LVYFTYAHGT AEANRTRIAR ATFDGQALRD VQVIFEVTPA KPGGQHFGSR LVWLKDQTML
ISIGDGGNPP LSLDGELIRL QAQNLGNPLG KIIRLKDDGS IPDDNPFVGQ KEAQKAIWSY
GHRNIQGLAF NQATGQIWAT EHGSRGGDEL NQIQAGENYG WPLVTHSEEY FGGEISSERS
RSGMIDPLIV WTPAIAPSGL AIYQGTRFPQ WQGDLFAGGL VGKEVRHIDL DSSGQVIEQK
SIPFSQRVRD VKQGPDGFLY VLTDDTNGKL IRLEP
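
The sequences above are laid out in blocks of 10 characters separated by spaces. As a rough sketch of how the gene sequence relates to the protein sequence and to the GC content field, the code below strips that layout, computes the G+C fraction, and translates codons. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for amino acid assignments; table 11's alternative start codons are not handled here, which is adequate for this gene only because it starts with ATG.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAATAGAC TATCATTAAC CGTTAAACCA GTTTTCAACT CTGGACTCGG AGTGCTTATT"
print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.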