Gene Information |
Locus tag | PCC8801_2044 |
Symbol | |
ID | 7105401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2115717 |
End bp | 2117594 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643475102 |
Product | hypothetical protein |
Protein accession | YP_002372234 |
Protein GI | 218246863 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
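The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2115717
END_BP = 2117594


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1878, matching "Gene Length"
    print(protein_length(length))  # 625, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.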
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCCGAA TACCGTGGAA TCCGATTAGC CTTTTATCCC TTTCTTTAGT CTCAACCTTC GTGATTTATT GGGCAACATC TGGGGAGCTT CCCGCTAATA ATCCCTCAAT TAGCCTCTCT CCCCAAACCC TACCCGCAGA AGCGGCGGAT GGGTTAGAAC AGGGAGAGGA AATCATACTC AATGGCAAAA AATTCAAAAT CAGTTGGACT CAATGGACTC AAGGCAATGG CAACCGCATC GGGATCAGTG ACATCGGGGC CAAGGATCTT CTAGGGTTAG AACTCCTCAG TACCAGTCAG CCAGACCTAC AACCCGTCCA ATGGTTTGCC ACAGAGTCCC GCCAAACTCT TCCCGTTTTA GCCCGATTTA TTCCTCCTTA TCGCTATTTA GATGTAACAG AACTGATTCA ATTAGCCGGG GGACAACTGC AAGTTAGGGG CAATACCCTA GATATTACTT TACCCCCCGC TCGTATTAGT ACAGTACGCG AAGGAACTCA AGACTGGGGT AAGCGCATTG TCGTAGAAGT TGATCGCCCG ACGTTTTGGC AAGTCAGTCA GGCGAAAAAT CAAGGAGTCG TGATGATTTC GGGTAATACT AACGCTCCTA CTAACAATAA TAATAATTCT TCCCCGTTTC CCTTTAATTT AAGCCCTGGA AATGATGCCG AGGAAGATGA TCTCGGTAGC GGAGGGACTA CGCCCACTAA TTCTAAGCTG TTTTCTGTAG AAAACGGCGG TGAAATTACT AAAATTCATG TTAACTTACC TACAGCCCAC GGCTTAAAGG TTTTTAGCCT CTCTAATCCT AACAGAATCG TCATTGATGT TCGCCCCGAT GCCATGACCC CTAAAGAAAT TGCCTGGACG CGGGGAATTA CTTGGCGACA GCAGTTAGTG AAAGTTGCAG GGGGAATCTT TCCGGTTCAT TGGCTAGAAA TTGACGGGCG ATCGCCTAAT ATTAGCCTAA AACCCATTAC CGCTAGTCCG AACCAACAAC AGGGTACAGC CCCCCTCGTG ACCATGGCAC AAAGCTGGAA AGCCTCAGCA GCCATCAATG CGGGATTTTT TAACCGCAAT AATCAATTAC CCCTAGGGGC AATGCGATCG CAGTCTCGCT GGTTATCAGG TCCGATTTTA GGACGGGGGG CGATCGCCTG GAACGATGAA GGACGCATGA AAATTGGCCG CCTGAGTTGG CAAGAAACCT TAGTGACCAG TAGCGGACAA CGCCTTCCCA TCCGTTTCCT CAACAGTGGC TATGTGGAAG GGGGAATGGC AAGGTATACC CCCGACTGGG GACCCCATTA CACCCCCTTA ACTGATAACG AGACGATTAT CTTAGTGCAG AATAATGGGG TGATTACTCA AAGAAATGGG GGAAAAGCCG GACAAAATGC CATTTTAATT CCTTCTAATG GCTATTTGTT AACCATTCGT AAAAACGCCG TTGCAGCTTC TGCGTTAGCC GTTGGGACGG GAGTTACCCT CGAAAGTAAT ACAATTCCGT CTGATTTTAG TCAATACCCT CATATTCTGG GGGCTGGACC TTTGTTAGTT AATAATAACC GTATCGTGGT CAATGCAGCC TTAGAACAGT TTAGCAAAGG CTTTCAGCAA CAAATGGCCT CCCGTAGTGC GATCGGGATG ACCAACCAAG GGACAATGAT GTTAGTGGCC GTCCATAACC GGGTTGGGGG ACGGGGAGCA ACTTTAGGCG AAATGGCACA AATTATGCAG CAATTGGGGG CAGTGGATGC GTTAAACCTC GATGGAGGCA GTTCAACGTC CCTCGCGTTG 
GGAGGACAGT TAATTGATCG TTCCCCCGTT ACCGCAGCAA GGGTTCATAA TGCGATTGGA GTGTTCGTTA ATCGTTAA
Protein sequence | MTRIPWNPIS LLSLSLVSTF VIYWATSGEL PANNPSISLS PQTLPAEAAD GLEQGEEIIL NGKKFKISWT QWTQGNGNRI GISDIGAKDL LGLELLSTSQ PDLQPVQWFA TESRQTLPVL ARFIPPYRYL DVTELIQLAG GQLQVRGNTL DITLPPARIS TVREGTQDWG KRIVVEVDRP TFWQVSQAKN QGVVMISGNT NAPTNNNNNS SPFPFNLSPG NDAEEDDLGS GGTTPTNSKL FSVENGGEIT KIHVNLPTAH GLKVFSLSNP NRIVIDVRPD AMTPKEIAWT RGITWRQQLV KVAGGIFPVH WLEIDGRSPN ISLKPITASP NQQQGTAPLV TMAQSWKASA AINAGFFNRN NQLPLGAMRS QSRWLSGPIL GRGAIAWNDE GRMKIGRLSW QETLVTSSGQ RLPIRFLNSG YVEGGMARYT PDWGPHYTPL TDNETIILVQ NNGVITQRNG GKAGQNAILI PSNGYLLTIR KNAVAASALA VGTGVTLESN TIPSDFSQYP HILGAGPLLV NNNRIVVNAA LEQFSKGFQQ QMASRSAIGM TNQGTMMLVA VHNRVGGRGA TLGEMAQIMQ QLGAVDALNL DGGSSTSLAL GGQLIDRSPV TAARVHNAIG VFVNR
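The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. The helper names are hypothetical, and the stop codons listed are the standard ones used by translation table 11; the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the last three bases form one of the given stop codons."""
    return seq[-3:] in stops


if __name__ == "__main__":
    # First two blocks of the gene sequence above, used only as a small example.
    head = clean_sequence("ATGACCCGAA TACCGTGGAA")
    print(head)               # ATGACCCGAATACCGTGGAA
    print(gc_fraction(head))  # 0.5

    # Last two blocks of the gene sequence above.
    tail = clean_sequence("GTGTTCGTTA ATCGTTAA")
    print(ends_with_stop(tail))  # True
```

Running `gc_fraction` on the full joined gene sequence would give a value to compare against the GC content field; the result for the complete sequence is not stated here because it has not been computed.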