Gene PCC8801_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2044 
Symbol 
ID7105401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2115717 
End bp2117594 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content48% 
IMG OID643475102 
Producthypothetical protein 
Protein accessionYP_002372234 
Protein GI218246863 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGAA TACCGTGGAA TCCGATTAGC CTTTTATCCC TTTCTTTAGT CTCAACCTTC 
GTGATTTATT GGGCAACATC TGGGGAGCTT CCCGCTAATA ATCCCTCAAT TAGCCTCTCT
CCCCAAACCC TACCCGCAGA AGCGGCGGAT GGGTTAGAAC AGGGAGAGGA AATCATACTC
AATGGCAAAA AATTCAAAAT CAGTTGGACT CAATGGACTC AAGGCAATGG CAACCGCATC
GGGATCAGTG ACATCGGGGC CAAGGATCTT CTAGGGTTAG AACTCCTCAG TACCAGTCAG
CCAGACCTAC AACCCGTCCA ATGGTTTGCC ACAGAGTCCC GCCAAACTCT TCCCGTTTTA
GCCCGATTTA TTCCTCCTTA TCGCTATTTA GATGTAACAG AACTGATTCA ATTAGCCGGG
GGACAACTGC AAGTTAGGGG CAATACCCTA GATATTACTT TACCCCCCGC TCGTATTAGT
ACAGTACGCG AAGGAACTCA AGACTGGGGT AAGCGCATTG TCGTAGAAGT TGATCGCCCG
ACGTTTTGGC AAGTCAGTCA GGCGAAAAAT CAAGGAGTCG TGATGATTTC GGGTAATACT
AACGCTCCTA CTAACAATAA TAATAATTCT TCCCCGTTTC CCTTTAATTT AAGCCCTGGA
AATGATGCCG AGGAAGATGA TCTCGGTAGC GGAGGGACTA CGCCCACTAA TTCTAAGCTG
TTTTCTGTAG AAAACGGCGG TGAAATTACT AAAATTCATG TTAACTTACC TACAGCCCAC
GGCTTAAAGG TTTTTAGCCT CTCTAATCCT AACAGAATCG TCATTGATGT TCGCCCCGAT
GCCATGACCC CTAAAGAAAT TGCCTGGACG CGGGGAATTA CTTGGCGACA GCAGTTAGTG
AAAGTTGCAG GGGGAATCTT TCCGGTTCAT TGGCTAGAAA TTGACGGGCG ATCGCCTAAT
ATTAGCCTAA AACCCATTAC CGCTAGTCCG AACCAACAAC AGGGTACAGC CCCCCTCGTG
ACCATGGCAC AAAGCTGGAA AGCCTCAGCA GCCATCAATG CGGGATTTTT TAACCGCAAT
AATCAATTAC CCCTAGGGGC AATGCGATCG CAGTCTCGCT GGTTATCAGG TCCGATTTTA
GGACGGGGGG CGATCGCCTG GAACGATGAA GGACGCATGA AAATTGGCCG CCTGAGTTGG
CAAGAAACCT TAGTGACCAG TAGCGGACAA CGCCTTCCCA TCCGTTTCCT CAACAGTGGC
TATGTGGAAG GGGGAATGGC AAGGTATACC CCCGACTGGG GACCCCATTA CACCCCCTTA
ACTGATAACG AGACGATTAT CTTAGTGCAG AATAATGGGG TGATTACTCA AAGAAATGGG
GGAAAAGCCG GACAAAATGC CATTTTAATT CCTTCTAATG GCTATTTGTT AACCATTCGT
AAAAACGCCG TTGCAGCTTC TGCGTTAGCC GTTGGGACGG GAGTTACCCT CGAAAGTAAT
ACAATTCCGT CTGATTTTAG TCAATACCCT CATATTCTGG GGGCTGGACC TTTGTTAGTT
AATAATAACC GTATCGTGGT CAATGCAGCC TTAGAACAGT TTAGCAAAGG CTTTCAGCAA
CAAATGGCCT CCCGTAGTGC GATCGGGATG ACCAACCAAG GGACAATGAT GTTAGTGGCC
GTCCATAACC GGGTTGGGGG ACGGGGAGCA ACTTTAGGCG AAATGGCACA AATTATGCAG
CAATTGGGGG CAGTGGATGC GTTAAACCTC GATGGAGGCA GTTCAACGTC CCTCGCGTTG
GGAGGACAGT TAATTGATCG TTCCCCCGTT ACCGCAGCAA GGGTTCATAA TGCGATTGGA
GTGTTCGTTA ATCGTTAA
 
Protein sequence
MTRIPWNPIS LLSLSLVSTF VIYWATSGEL PANNPSISLS PQTLPAEAAD GLEQGEEIIL 
NGKKFKISWT QWTQGNGNRI GISDIGAKDL LGLELLSTSQ PDLQPVQWFA TESRQTLPVL
ARFIPPYRYL DVTELIQLAG GQLQVRGNTL DITLPPARIS TVREGTQDWG KRIVVEVDRP
TFWQVSQAKN QGVVMISGNT NAPTNNNNNS SPFPFNLSPG NDAEEDDLGS GGTTPTNSKL
FSVENGGEIT KIHVNLPTAH GLKVFSLSNP NRIVIDVRPD AMTPKEIAWT RGITWRQQLV
KVAGGIFPVH WLEIDGRSPN ISLKPITASP NQQQGTAPLV TMAQSWKASA AINAGFFNRN
NQLPLGAMRS QSRWLSGPIL GRGAIAWNDE GRMKIGRLSW QETLVTSSGQ RLPIRFLNSG
YVEGGMARYT PDWGPHYTPL TDNETIILVQ NNGVITQRNG GKAGQNAILI PSNGYLLTIR
KNAVAASALA VGTGVTLESN TIPSDFSQYP HILGAGPLLV NNNRIVVNAA LEQFSKGFQQ
QMASRSAIGM TNQGTMMLVA VHNRVGGRGA TLGEMAQIMQ QLGAVDALNL DGGSSTSLAL
GGQLIDRSPV TAARVHNAIG VFVNR