Gene Information |
Locus tag | Cyan8802_2070 |
Symbol | |
ID | 8391386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 2084648 |
End bp | 2086525 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644980048 |
Product | hypothetical protein |
Protein accession | YP_003137793 |
Protein GI | 257059905 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
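The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2084648, 2086525)
print(length, protein_length(length))  # 1878 625
```

Both values match the Gene Length (1878 bp) and Protein Length (625 aa) fields listed in the record.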
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.436802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAA TACCGTGGAA TCCGATTAGC CTTTTATCCC TTTCTTTAGT CTCAACCTTC GTGATTTATT GGGCAACATC TGGGGAGCTT CCCGCTAATA ATCCCTCAAT TAGCCTCTCT CCCCAAACCC TACCCGCAGA AGCGGCGGAT GGGTTAGAAC AGGGAGAGGA AATCATACTC AATGGCAAAA AATTCAAAAT CAGTTGGACT CAATGGACTC AAGGCAATGG CAACCGCATC GGGATCAGTG ACATCGGGGC AAAGGATCTT CTAGGGTTAG AACTCCTCAG TACCAGTCAG CCAGACCTAC AACCCGTCCA ATGGTTTGCC ACAGAGTCCC GCCAAACTCT TCCCGTTTTA GCCCGATTTA TTCCTCCTTA TCGCTATTTA GATGTAACAG AACTGATTCA ATTAGCCGGG GGACAACTGC AAGTTAGGGG CAATACCCTA GATATTACTT TACCCCCCGC TCGTATTAGT ACAGTACGCG AAGGAACTCA AGACTGGGGT AAGCGCATTG TCGTAGAAGT TGATCGCCCG ACGTTTTGGC AAGTCAGTCA GGCGAAAAAT CAAGGAGTCG TGATGATTTC GGGTAATACT AACGCTCCTA CTAACAATAA TAATAATTCT TCCCCGTTTC CCTTTAATTT AAGCCCTGGA AATGATGCCG ACGAAGATGA TCTCGGTAGC GGAGGGACTA CGCCCACTAA TTCTAAGCTG TTTTCTGTAG AAAACGGCGG TGAAATTACT AAAATTCATG TTAACTTACC TACAGCCCAC GGCTTAAAGG TTTTTAGCCT CTCTAATCCT AACAGAATCG TCATTGATGT TCGCCCCGAT GCCATGACCC CTAAAGAAAT TGCCTGGACG CGGGGAATTA CTTGGCGACA GCAGTTAGTG AAAGTTGCAG GGGGAATCTT TCCGGTTCAT TGGCTAGAAA TTGACGGGCG ATCGCCTAAT ATTAGCCTAA AACCCATTAC CGCTAGTCCG AACCAACAAC AGGGTACAGC CCCCCTCGTG ACCATGGCAC AAAGCTGGAA AGCCTCAGCA GCCATCAATG CGGGATTTTT TAACCGCAAT AATCAATTAC CCCTAGGGGC AATGCGATCG CAGTCTCGCT GGTTATCAGG TCCGATTTTA GGACGGGGGG CGATCGCCTG GAACGATGAA GGACGCATGA AAATTGGCCG CCTGAGTTGG CAAGAAACCT TAGTAACCAG TAGCGGACAA CGCCTTCCCA TCCGTTTCCT CAACAGTGGC TATGTGGAAG GGGGAATGGC AAGGTATACC CCCGACTGGG GACCCAATTA CACCCCCTTA ACCGATAACG AGACGATTAT CTTAGTGCAG AATAATGGGG TGATTAGTCA AAGAAATGGC GGAAAAGCCG GACAAAATGC CATTTTAATT CCTTCTAATG GCTATTTGTT AACCATTCGT AAAAACACCG TTGCAGCTTC TGCGTTAGCC GTTGGGACGG GAGTTACCCT CGAAAGTAAT ACAATTCCGT CTGATTTTAG TCAATACCCT CATATTCTGG GGGCTGGACC TTTGTTAGTT AATAATAACC GTATCGTGGT CAATGCAGCC TTAGAACAGT TTAGCAAAGG CTTTCAGCAA CAAATGGCCT CCCGTAGTGC GATCGGGATG ACCAACCAAG GGACAATGAT GTTAGTGGCA GTCCATAATC GGGTTGGGGG ACGGGGAGCA ACTTTAGGGG AAATGGCACA AATTATGCAG CAATTGGGGG CAGTGGATGC GTTAAACCTC GATGGAGGCA GTTCAACATC CCTCTCGTTG 
GGAGGACAGT TAATTGATCG TTCCCCCGTT ACCGCAGCAA GGGTTCATAA TGCGATTGGA GTGTTCGTTA ATCGTTAA
|
Protein sequence | MTRIPWNPIS LLSLSLVSTF VIYWATSGEL PANNPSISLS PQTLPAEAAD GLEQGEEIIL NGKKFKISWT QWTQGNGNRI GISDIGAKDL LGLELLSTSQ PDLQPVQWFA TESRQTLPVL ARFIPPYRYL DVTELIQLAG GQLQVRGNTL DITLPPARIS TVREGTQDWG KRIVVEVDRP TFWQVSQAKN QGVVMISGNT NAPTNNNNNS SPFPFNLSPG NDADEDDLGS GGTTPTNSKL FSVENGGEIT KIHVNLPTAH GLKVFSLSNP NRIVIDVRPD AMTPKEIAWT RGITWRQQLV KVAGGIFPVH WLEIDGRSPN ISLKPITASP NQQQGTAPLV TMAQSWKASA AINAGFFNRN NQLPLGAMRS QSRWLSGPIL GRGAIAWNDE GRMKIGRLSW QETLVTSSGQ RLPIRFLNSG YVEGGMARYT PDWGPNYTPL TDNETIILVQ NNGVISQRNG GKAGQNAILI PSNGYLLTIR KNTVAASALA VGTGVTLESN TIPSDFSQYP HILGAGPLLV NNNRIVVNAA LEQFSKGFQQ QMASRSAIGM TNQGTMMLVA VHNRVGGRGA TLGEMAQIMQ QLGAVDALNL DGGSSTSLSL GGQLIDRSPV TAARVHNAIG VFVNR
|
| |
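The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It strips the spaces that split the sequence into 10-base blocks, then translates codon by codon until the first stop. The codon table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocking and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    Characters other than A, C, G, T raise KeyError in this sketch.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two and last two 10-base blocks of the gene sequence in this record.
print(translate("ATGACCCGAA TACCGTGGAA"))  # MTRIPW
print(translate("GTGTTCGTTA ATCGTTAA"))    # VFVNR
```

The two fragments reproduce the first six and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` should give the complete 625-residue product if the record is internally consistent.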