Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3648 |
Symbol | |
ID | 8139022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4224474 |
End bp | 4226348 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644871269 |
Product | glycosyltransferase 36 associated |
Protein accession | YP_003023427 |
Protein GI | 253702238 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 177 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCCACT TTATCACGCC GCGCTTGTGT TACCACCCGG AGTACGGCGA GCGGGTCGCC TTCGCGGCCA TGACCCCCGG CGCCGACGAC TACGGCGGCG ACCGCTCCGC CTTCATCGGG CGCAACCGCT CTCTTGCCGC CCCTGCGGCC ATGGAAGCGA CCCGCCTGTC CCGAAGGACG GGGGCCGGTC TCGATCCCTG CGCGGTCCAG CGGGTCACCC TGGAGCTGGC TCCGGGCGAG CAGCGCGACA TCGTCTGCAT GCTGGGCCAG GAGGCTTCCG TCCCTGAGGC CCGGGAGCTG GTCCTTCGCT ACCGGCAGGA CCGGTCCTTC GAAGACGCCT TCGATAGGAC CCGCGCCTGG TGGGATGAAC TGCTGGGCAC GGTCCAGGTG CAGACCCCGG AGCCTGCCGC CGATCTCCTG ATCAACCGCT GGCTTCAGTA CCAGTCCTTG AGCTGCCGTA TCTGGGGGCG CTCCGGCTTT TACCAGTCGG GGGGCGCCTT CGGTTTTCGC GACCAGTTGC AGGACGTGAT GGCGTTTCTC TATGCGAGGC CGGAGCTTGC CGCCGAGCAG ATCCTATTGG CCGCCAGCCG CCAGTTCAGC GAGGGGGACG TTCAGCACTG GTGGCACGAG CCGGCCGGCG CCGGCATCCG GTCGCGCATC TCCGACGACC TGCTCTGGCT CCCCTATGTG GTCGCGCAGT ACGTCCGGAC CACAGGGGAT CTGGCGATCC TGGAGGCGGA GGTCCCCTTC CTGAACGCCC CGCCACTTTC GGACGATCAG CACGAGGTGT TCTCCGTCCC GCAGGTAAGT CTCGAGCGGG CGACACTCTT CGAGCACTGC AGGCGGGCGG TGGCCCGCGG CCTCACCACA GGCCCCCACG GCTTACCGCT GATGGGGACA GGCGACTGGA ACGACGGCAT GAACCTGGTT GGCGCCGGCG GCAAGGGCGA GAGCGTATGG CTTGCCTGGT TTCTTTGCGA CATCCTGAAG GGAATGGCGG AGATCTCCAC CCTCCTAAAG CTGCCGGAAC TGGCCCGGGG GTATCTGGAG GAGAGGTCCG CCCTGGTGCT TCGTACCGAA AAGGCCGGCT GGGACGGGGA GTGGTATCTA AGGGGGACCT TCGACGACGG CACACCGCTC GGCTCCTCGT CGAACAGCGA GGCGAGGATC GATTCGCTCC CGCAATCGTG GGCTTGGCTT TCCGGCGCAG CCGACCGTGA ACGCGCCGGG AAGGGACTGG AAGCGGCCTG GCAGCATCTG GTGCGCGAGG ATGAGGGGCT CGTGCTTCTT TTCGAACCCC CCTTCGACAC CTCGGAACCG TCTCCAGGGT ACATCAAGGG GTATCCGCCG GGGGTGCGCG AAAACGGCGG GCAGTACACC CACGCGGCAC TCTGGATGGT CATGGCCCTG GCGAAAAAGG GAGAAGGGGA TCGCGCGGCG CAGCTTTTGC GCGTGCTCAA TCCGATCGAG CACGCGCGGG ACGCTCAGGC CGCCTGGCTT TACGGGGTCG AGCCCTACGT GGTAGCCGCC GACGTGTACC GTTTGCCGGG ACGTATCGGA CAAGGAGGCT GGTCCTGGTA CACCGGCTCC GCCGCCTGGA TGTACCGGGC CTGGATCGAA GAGGTGTTGG GACTTAAGGT GAGAGGAGAC GAGCTGCGGA TGAATCCGGT CATCCCCGCT GCGTGGCCCG GCTTCAGCAT GAGCTATCGG CATGGAGAGG CGGTCTACGC GATTCGAGTG GAGAACCCGG ACGGCTGCCA GTGCGGGGTC GCCCAGGTGG AAATGGATGG CCGCCGTGCC GACGGCGGCG TGATCAAACT GGAACGGGGT CTGGTGAAGC ATCAGGTCGT GGTGCGGATG GGTACCCGGA AGTAG
|
Protein sequence | MSHFITPRLC YHPEYGERVA FAAMTPGADD YGGDRSAFIG RNRSLAAPAA MEATRLSRRT GAGLDPCAVQ RVTLELAPGE QRDIVCMLGQ EASVPEAREL VLRYRQDRSF EDAFDRTRAW WDELLGTVQV QTPEPAADLL INRWLQYQSL SCRIWGRSGF YQSGGAFGFR DQLQDVMAFL YARPELAAEQ ILLAASRQFS EGDVQHWWHE PAGAGIRSRI SDDLLWLPYV VAQYVRTTGD LAILEAEVPF LNAPPLSDDQ HEVFSVPQVS LERATLFEHC RRAVARGLTT GPHGLPLMGT GDWNDGMNLV GAGGKGESVW LAWFLCDILK GMAEISTLLK LPELARGYLE ERSALVLRTE KAGWDGEWYL RGTFDDGTPL GSSSNSEARI DSLPQSWAWL SGAADRERAG KGLEAAWQHL VREDEGLVLL FEPPFDTSEP SPGYIKGYPP GVRENGGQYT HAALWMVMAL AKKGEGDRAA QLLRVLNPIE HARDAQAAWL YGVEPYVVAA DVYRLPGRIG QGGWSWYTGS AAWMYRAWIE EVLGLKVRGD ELRMNPVIPA AWPGFSMSYR HGEAVYAIRV ENPDGCQCGV AQVEMDGRRA DGGVIKLERG LVKHQVVVRM GTRK
|
| |