Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0115 |
Symbol | |
ID | 8135418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 141287 |
End bp | 144271 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644867735 |
Product | maltooligosyl trehalose synthase |
Protein accession | YP_003019959 |
Protein GI | 253698770 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.136948 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGA ATCAAAAGCC GGCTGCGAGA ATCCCGACGG CCACCTACCG TCTGCAGTTC AACGCCGGGT TCACCTTCGC CGACGCCACC AGGATAGTAG GATACCTGCA CGACCTGGGT ATCAGCGACG TGTACGCCTC CTCGTACCTG GCTGCCAAGG AAGGGAGCGT CCACGGCTAC GACGTGGTGA ACCAGACCGT GCTGAACAAG GAAGTGGGGG ACGAGCAGAG CCACCTAGCC ATGGTGGAGG AACTGCAGCG GCACGGGATG GGGCACATCC TGGACTTCGT CCCCAACCAC ATGTGCATCG AGAGCGCGGA GAACCTCTGG TGGATGGACG TCCTCGAAAA CGGGATGAGC TCCCCTTACG CGCACTTCTT CGATATCGAT TGGGAGCCGG TGAAAAAGGA GCTGACCGGG AAGGTGCTGC TGCCGCTTCT GGGGGACCAG TACGGCAGGG TGCTGGAAAA CGGCGGCCTG CAGCTTCTTT TCAGGGATGG GGCCTTCTAC GTGCAGGTTT ACGCGCTGCA GATACCGCTG GAGCCTAAGA GCTGCCTGCA GATCCTCCAG CACCGGCTGG ACGCGTTGAA GGAGAAGTTC CCGGCCGAGG CGGCGCCGGT CGAGGAGCTT CTCAGCATCG AGACGGCGCT GCAGCACCTG CCGCTGGCGA CCGAGCAGGA CCCGGAGAAG ATGGGAGAGC GGCACCGCGA AAAAGAGATC ATCAAGAAGA GGCTCTGGCA GCTCTGCCAG GAGTCGCCGG AAGTGGCGGC CTTCATCGCC GACAACGTGA AGAGCTTCAA CGGCAGCAAA GGGGACCCGC GCAGCTTCGA TGCCATGGAC AAGCTCTTGC GGGACCAGGC GTACCGCCTC TCCTACTGGC GGGTGGCGAC GGAGGAGATC AACTACCGGC GCTTCTTCGA CATCAACGGC CTTGCGGCCA TCAGGATGGA AGACCAAGCG GTTTACGACC TGACCCACAC GCTCTTGTTC CGGCTGATCC GGGAGGGGAA GGTCACCGGC GTCCGCATCG ACCACGTGGA CGGGCTCTAC GATCCCGTCT CCTACCTGCA GAACCTACAG AAAAGCAGTT ATTTTCAGTT GCGGCAGGCA GGAGAGCCGC TCTCCTCGAA CAACGGGGAG GAGAAGAAAG AGGCGCTCGA GAAGGAATAC AACGCGCTCC TGGAGAAGGA CCCCTGCTAC AAGCCTTTCT ACGCCGTGGT GGAGAAGATC CTGATGAAGG GGGAACTGCT CCCGGATCAG TGGCCGGTGT TCGGGACCAC TGGGTACGAC TTTTTGAACA GCTTGAACGG CATCTTCGTC GCCACCGAGA AGGCGAAGCA GATGGACCGG CTCTACGACC GGTTCGTGAA GTGGGGGGGA GATTTCCCGG ACCTGGTCTA CGAGAAGAAG AAGCTGGTGA TGCAGGTCTC GCTCTCCGGC GAGGGGAACA TGCTGGCGCA CCAGTTGAAC AACATCGCCG AGCAGGACCG GCTCACCCGC GACTTCACCC TCAACAGCCT GGCCCGCGCC ATCAGCGAGG TGATCGCCTG CTTCCCGGTG TACCGCACCT ATGCCAACTC CGCCTCCGTG CGCGACAAGG ACGTGCAGTA CATCGAGGCG GCCGTATATA AGGCCAAGCG GCGCAATCCT GCCATCAGCG GCTCGGTGTT CGACTTCGTG AGGGACGTGC TCTTGTTGAA ATCCCCGGAG CGCGCGAGCG AGGACGACCG CCGCTCGTGG CTCTATTTCG CCATGCGCTT CCAGCAGATA ACCGGGCCGG TGATGGCCAA AGGGCTCGAA GACACAGCCT TCTACGTCTA CAACCGGCTG GTCTCGTTGA ACGACGTCGG GGGGATGCCC GGGAAGTTCG GCACCACCTT GGAGGCCTTC CATGGCCAGA ACCTCGATCG GAACAAGACC TTCCCCCACG CCATGATCGC CACGGCGACC CACGACTCCA AGCGCGGCGA GGACATACGC ACCAGGATCG ACGCGCTCTC GGAGATCCCG GAACTCTGGC AGAAATCCTT GATCAGGTGG AGCCGATTCA ACAAGGGGAA GACCATCTCC ATCGAGAACC AGCCGGTGCC CGATCGCAAC GAGGAGTACC TCCTGTACCA GATCCTTCTG GGGGTCTGGC CGGCAGGGGA GATGGACGAT GAAGGGTACA AGAGCCTCAA GGGGAGGGTC CGGGACTACA TGGTCAAGGC GCTGCGGGAG GCCAAGGTCA ACACCAGTTG GGTGAGCCCG AACACCGCCT ACGAGGAGGG GGTGACCTCA TTCGTGGACC GGGTCCTGGA ACCGGGAGCG TCGAACCTTT TCCTGGGCGA GTTCCTGCCG CTGCAGAGGC GGCTGGCGCG CTGCGGCATC TTCAGTTCGC TTTCGCAGAC CTTCCTGAAG ATGGTCTCCC CCGGGATCCC CGACTTCTAC CAGGGGACCG AGCTCTTCGA ATTCACCCTG GTCGACCCGG ACAACCGCCG CCAGGTCGAC TACGGCAAGA GGATGGAGGC GCTCTCAGGC CTGAAGGCGC GCGAGGCGGA ATCGGGGCCG GAAGCGCTCT GCCGCGAGCT GATGGGAACG GCGGAAGACG GCAGGATCAA GCTCTATCTC ATCCACCGGG TCTTGAATTA CCGCAGGGAC AACCGCGGCC CCTTCGAGGG GGGCGAGTAC CTGCCGCTGG AGGCAAAGGG GACGCGGGAG CGGCACGTCT GCGCCTTTGC CCGGAAGGGG AAGGAAAAGA CCGTTATAGC CGTCGCCGCG CGGCTGGTGG CGACCCTGAT GCCGGCGGAG GGAAGCTTCC CCTTGGGGGA GGAGGCCTGG CAGGAGACGG TCCTGGTGCT TCCGGAAGGA TGCGGCGGCA GGTTCAGAAA CATCGTCAAC GGCGAGGAGC TGAATGCGCA GGAGCACGGG GGGGAGCAGG TGATCGTGCT CTCGCGGCTA TTCGGGCAGA TTAGCGTGGC GCTGTTGGAG TCTGTTTCCG GCTAA
|
Protein sequence | MAENQKPAAR IPTATYRLQF NAGFTFADAT RIVGYLHDLG ISDVYASSYL AAKEGSVHGY DVVNQTVLNK EVGDEQSHLA MVEELQRHGM GHILDFVPNH MCIESAENLW WMDVLENGMS SPYAHFFDID WEPVKKELTG KVLLPLLGDQ YGRVLENGGL QLLFRDGAFY VQVYALQIPL EPKSCLQILQ HRLDALKEKF PAEAAPVEEL LSIETALQHL PLATEQDPEK MGERHREKEI IKKRLWQLCQ ESPEVAAFIA DNVKSFNGSK GDPRSFDAMD KLLRDQAYRL SYWRVATEEI NYRRFFDING LAAIRMEDQA VYDLTHTLLF RLIREGKVTG VRIDHVDGLY DPVSYLQNLQ KSSYFQLRQA GEPLSSNNGE EKKEALEKEY NALLEKDPCY KPFYAVVEKI LMKGELLPDQ WPVFGTTGYD FLNSLNGIFV ATEKAKQMDR LYDRFVKWGG DFPDLVYEKK KLVMQVSLSG EGNMLAHQLN NIAEQDRLTR DFTLNSLARA ISEVIACFPV YRTYANSASV RDKDVQYIEA AVYKAKRRNP AISGSVFDFV RDVLLLKSPE RASEDDRRSW LYFAMRFQQI TGPVMAKGLE DTAFYVYNRL VSLNDVGGMP GKFGTTLEAF HGQNLDRNKT FPHAMIATAT HDSKRGEDIR TRIDALSEIP ELWQKSLIRW SRFNKGKTIS IENQPVPDRN EEYLLYQILL GVWPAGEMDD EGYKSLKGRV RDYMVKALRE AKVNTSWVSP NTAYEEGVTS FVDRVLEPGA SNLFLGEFLP LQRRLARCGI FSSLSQTFLK MVSPGIPDFY QGTELFEFTL VDPDNRRQVD YGKRMEALSG LKAREAESGP EALCRELMGT AEDGRIKLYL IHRVLNYRRD NRGPFEGGEY LPLEAKGTRE RHVCAFARKG KEKTVIAVAA RLVATLMPAE GSFPLGEEAW QETVLVLPEG CGGRFRNIVN GEELNAQEHG GEQVIVLSRL FGQISVALLE SVSG
|
| |