Gene Cyan8802_2070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2070 
Symbol 
ID8391386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2084648 
End bp2086525 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content48% 
IMG OID644980048 
Producthypothetical protein 
Protein accessionYP_003137793 
Protein GI257059905 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.436802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAA TACCGTGGAA TCCGATTAGC CTTTTATCCC TTTCTTTAGT CTCAACCTTC 
GTGATTTATT GGGCAACATC TGGGGAGCTT CCCGCTAATA ATCCCTCAAT TAGCCTCTCT
CCCCAAACCC TACCCGCAGA AGCGGCGGAT GGGTTAGAAC AGGGAGAGGA AATCATACTC
AATGGCAAAA AATTCAAAAT CAGTTGGACT CAATGGACTC AAGGCAATGG CAACCGCATC
GGGATCAGTG ACATCGGGGC AAAGGATCTT CTAGGGTTAG AACTCCTCAG TACCAGTCAG
CCAGACCTAC AACCCGTCCA ATGGTTTGCC ACAGAGTCCC GCCAAACTCT TCCCGTTTTA
GCCCGATTTA TTCCTCCTTA TCGCTATTTA GATGTAACAG AACTGATTCA ATTAGCCGGG
GGACAACTGC AAGTTAGGGG CAATACCCTA GATATTACTT TACCCCCCGC TCGTATTAGT
ACAGTACGCG AAGGAACTCA AGACTGGGGT AAGCGCATTG TCGTAGAAGT TGATCGCCCG
ACGTTTTGGC AAGTCAGTCA GGCGAAAAAT CAAGGAGTCG TGATGATTTC GGGTAATACT
AACGCTCCTA CTAACAATAA TAATAATTCT TCCCCGTTTC CCTTTAATTT AAGCCCTGGA
AATGATGCCG ACGAAGATGA TCTCGGTAGC GGAGGGACTA CGCCCACTAA TTCTAAGCTG
TTTTCTGTAG AAAACGGCGG TGAAATTACT AAAATTCATG TTAACTTACC TACAGCCCAC
GGCTTAAAGG TTTTTAGCCT CTCTAATCCT AACAGAATCG TCATTGATGT TCGCCCCGAT
GCCATGACCC CTAAAGAAAT TGCCTGGACG CGGGGAATTA CTTGGCGACA GCAGTTAGTG
AAAGTTGCAG GGGGAATCTT TCCGGTTCAT TGGCTAGAAA TTGACGGGCG ATCGCCTAAT
ATTAGCCTAA AACCCATTAC CGCTAGTCCG AACCAACAAC AGGGTACAGC CCCCCTCGTG
ACCATGGCAC AAAGCTGGAA AGCCTCAGCA GCCATCAATG CGGGATTTTT TAACCGCAAT
AATCAATTAC CCCTAGGGGC AATGCGATCG CAGTCTCGCT GGTTATCAGG TCCGATTTTA
GGACGGGGGG CGATCGCCTG GAACGATGAA GGACGCATGA AAATTGGCCG CCTGAGTTGG
CAAGAAACCT TAGTAACCAG TAGCGGACAA CGCCTTCCCA TCCGTTTCCT CAACAGTGGC
TATGTGGAAG GGGGAATGGC AAGGTATACC CCCGACTGGG GACCCAATTA CACCCCCTTA
ACCGATAACG AGACGATTAT CTTAGTGCAG AATAATGGGG TGATTAGTCA AAGAAATGGC
GGAAAAGCCG GACAAAATGC CATTTTAATT CCTTCTAATG GCTATTTGTT AACCATTCGT
AAAAACACCG TTGCAGCTTC TGCGTTAGCC GTTGGGACGG GAGTTACCCT CGAAAGTAAT
ACAATTCCGT CTGATTTTAG TCAATACCCT CATATTCTGG GGGCTGGACC TTTGTTAGTT
AATAATAACC GTATCGTGGT CAATGCAGCC TTAGAACAGT TTAGCAAAGG CTTTCAGCAA
CAAATGGCCT CCCGTAGTGC GATCGGGATG ACCAACCAAG GGACAATGAT GTTAGTGGCA
GTCCATAATC GGGTTGGGGG ACGGGGAGCA ACTTTAGGGG AAATGGCACA AATTATGCAG
CAATTGGGGG CAGTGGATGC GTTAAACCTC GATGGAGGCA GTTCAACATC CCTCTCGTTG
GGAGGACAGT TAATTGATCG TTCCCCCGTT ACCGCAGCAA GGGTTCATAA TGCGATTGGA
GTGTTCGTTA ATCGTTAA
 
Protein sequence
MTRIPWNPIS LLSLSLVSTF VIYWATSGEL PANNPSISLS PQTLPAEAAD GLEQGEEIIL 
NGKKFKISWT QWTQGNGNRI GISDIGAKDL LGLELLSTSQ PDLQPVQWFA TESRQTLPVL
ARFIPPYRYL DVTELIQLAG GQLQVRGNTL DITLPPARIS TVREGTQDWG KRIVVEVDRP
TFWQVSQAKN QGVVMISGNT NAPTNNNNNS SPFPFNLSPG NDADEDDLGS GGTTPTNSKL
FSVENGGEIT KIHVNLPTAH GLKVFSLSNP NRIVIDVRPD AMTPKEIAWT RGITWRQQLV
KVAGGIFPVH WLEIDGRSPN ISLKPITASP NQQQGTAPLV TMAQSWKASA AINAGFFNRN
NQLPLGAMRS QSRWLSGPIL GRGAIAWNDE GRMKIGRLSW QETLVTSSGQ RLPIRFLNSG
YVEGGMARYT PDWGPNYTPL TDNETIILVQ NNGVISQRNG GKAGQNAILI PSNGYLLTIR
KNTVAASALA VGTGVTLESN TIPSDFSQYP HILGAGPLLV NNNRIVVNAA LEQFSKGFQQ
QMASRSAIGM TNQGTMMLVA VHNRVGGRGA TLGEMAQIMQ QLGAVDALNL DGGSSTSLSL
GGQLIDRSPV TAARVHNAIG VFVNR