Gene GM21_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0115 
Symbol 
ID8135418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp141287 
End bp144271 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content63% 
IMG OID644867735 
Productmaltooligosyl trehalose synthase 
Protein accessionYP_003019959 
Protein GI253698770 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3280] Maltooligosyl trehalose synthase 
TIGRFAM ID[TIGR02401] malto-oligosyltrehalose synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.136948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA ATCAAAAGCC GGCTGCGAGA ATCCCGACGG CCACCTACCG TCTGCAGTTC 
AACGCCGGGT TCACCTTCGC CGACGCCACC AGGATAGTAG GATACCTGCA CGACCTGGGT
ATCAGCGACG TGTACGCCTC CTCGTACCTG GCTGCCAAGG AAGGGAGCGT CCACGGCTAC
GACGTGGTGA ACCAGACCGT GCTGAACAAG GAAGTGGGGG ACGAGCAGAG CCACCTAGCC
ATGGTGGAGG AACTGCAGCG GCACGGGATG GGGCACATCC TGGACTTCGT CCCCAACCAC
ATGTGCATCG AGAGCGCGGA GAACCTCTGG TGGATGGACG TCCTCGAAAA CGGGATGAGC
TCCCCTTACG CGCACTTCTT CGATATCGAT TGGGAGCCGG TGAAAAAGGA GCTGACCGGG
AAGGTGCTGC TGCCGCTTCT GGGGGACCAG TACGGCAGGG TGCTGGAAAA CGGCGGCCTG
CAGCTTCTTT TCAGGGATGG GGCCTTCTAC GTGCAGGTTT ACGCGCTGCA GATACCGCTG
GAGCCTAAGA GCTGCCTGCA GATCCTCCAG CACCGGCTGG ACGCGTTGAA GGAGAAGTTC
CCGGCCGAGG CGGCGCCGGT CGAGGAGCTT CTCAGCATCG AGACGGCGCT GCAGCACCTG
CCGCTGGCGA CCGAGCAGGA CCCGGAGAAG ATGGGAGAGC GGCACCGCGA AAAAGAGATC
ATCAAGAAGA GGCTCTGGCA GCTCTGCCAG GAGTCGCCGG AAGTGGCGGC CTTCATCGCC
GACAACGTGA AGAGCTTCAA CGGCAGCAAA GGGGACCCGC GCAGCTTCGA TGCCATGGAC
AAGCTCTTGC GGGACCAGGC GTACCGCCTC TCCTACTGGC GGGTGGCGAC GGAGGAGATC
AACTACCGGC GCTTCTTCGA CATCAACGGC CTTGCGGCCA TCAGGATGGA AGACCAAGCG
GTTTACGACC TGACCCACAC GCTCTTGTTC CGGCTGATCC GGGAGGGGAA GGTCACCGGC
GTCCGCATCG ACCACGTGGA CGGGCTCTAC GATCCCGTCT CCTACCTGCA GAACCTACAG
AAAAGCAGTT ATTTTCAGTT GCGGCAGGCA GGAGAGCCGC TCTCCTCGAA CAACGGGGAG
GAGAAGAAAG AGGCGCTCGA GAAGGAATAC AACGCGCTCC TGGAGAAGGA CCCCTGCTAC
AAGCCTTTCT ACGCCGTGGT GGAGAAGATC CTGATGAAGG GGGAACTGCT CCCGGATCAG
TGGCCGGTGT TCGGGACCAC TGGGTACGAC TTTTTGAACA GCTTGAACGG CATCTTCGTC
GCCACCGAGA AGGCGAAGCA GATGGACCGG CTCTACGACC GGTTCGTGAA GTGGGGGGGA
GATTTCCCGG ACCTGGTCTA CGAGAAGAAG AAGCTGGTGA TGCAGGTCTC GCTCTCCGGC
GAGGGGAACA TGCTGGCGCA CCAGTTGAAC AACATCGCCG AGCAGGACCG GCTCACCCGC
GACTTCACCC TCAACAGCCT GGCCCGCGCC ATCAGCGAGG TGATCGCCTG CTTCCCGGTG
TACCGCACCT ATGCCAACTC CGCCTCCGTG CGCGACAAGG ACGTGCAGTA CATCGAGGCG
GCCGTATATA AGGCCAAGCG GCGCAATCCT GCCATCAGCG GCTCGGTGTT CGACTTCGTG
AGGGACGTGC TCTTGTTGAA ATCCCCGGAG CGCGCGAGCG AGGACGACCG CCGCTCGTGG
CTCTATTTCG CCATGCGCTT CCAGCAGATA ACCGGGCCGG TGATGGCCAA AGGGCTCGAA
GACACAGCCT TCTACGTCTA CAACCGGCTG GTCTCGTTGA ACGACGTCGG GGGGATGCCC
GGGAAGTTCG GCACCACCTT GGAGGCCTTC CATGGCCAGA ACCTCGATCG GAACAAGACC
TTCCCCCACG CCATGATCGC CACGGCGACC CACGACTCCA AGCGCGGCGA GGACATACGC
ACCAGGATCG ACGCGCTCTC GGAGATCCCG GAACTCTGGC AGAAATCCTT GATCAGGTGG
AGCCGATTCA ACAAGGGGAA GACCATCTCC ATCGAGAACC AGCCGGTGCC CGATCGCAAC
GAGGAGTACC TCCTGTACCA GATCCTTCTG GGGGTCTGGC CGGCAGGGGA GATGGACGAT
GAAGGGTACA AGAGCCTCAA GGGGAGGGTC CGGGACTACA TGGTCAAGGC GCTGCGGGAG
GCCAAGGTCA ACACCAGTTG GGTGAGCCCG AACACCGCCT ACGAGGAGGG GGTGACCTCA
TTCGTGGACC GGGTCCTGGA ACCGGGAGCG TCGAACCTTT TCCTGGGCGA GTTCCTGCCG
CTGCAGAGGC GGCTGGCGCG CTGCGGCATC TTCAGTTCGC TTTCGCAGAC CTTCCTGAAG
ATGGTCTCCC CCGGGATCCC CGACTTCTAC CAGGGGACCG AGCTCTTCGA ATTCACCCTG
GTCGACCCGG ACAACCGCCG CCAGGTCGAC TACGGCAAGA GGATGGAGGC GCTCTCAGGC
CTGAAGGCGC GCGAGGCGGA ATCGGGGCCG GAAGCGCTCT GCCGCGAGCT GATGGGAACG
GCGGAAGACG GCAGGATCAA GCTCTATCTC ATCCACCGGG TCTTGAATTA CCGCAGGGAC
AACCGCGGCC CCTTCGAGGG GGGCGAGTAC CTGCCGCTGG AGGCAAAGGG GACGCGGGAG
CGGCACGTCT GCGCCTTTGC CCGGAAGGGG AAGGAAAAGA CCGTTATAGC CGTCGCCGCG
CGGCTGGTGG CGACCCTGAT GCCGGCGGAG GGAAGCTTCC CCTTGGGGGA GGAGGCCTGG
CAGGAGACGG TCCTGGTGCT TCCGGAAGGA TGCGGCGGCA GGTTCAGAAA CATCGTCAAC
GGCGAGGAGC TGAATGCGCA GGAGCACGGG GGGGAGCAGG TGATCGTGCT CTCGCGGCTA
TTCGGGCAGA TTAGCGTGGC GCTGTTGGAG TCTGTTTCCG GCTAA
 
Protein sequence
MAENQKPAAR IPTATYRLQF NAGFTFADAT RIVGYLHDLG ISDVYASSYL AAKEGSVHGY 
DVVNQTVLNK EVGDEQSHLA MVEELQRHGM GHILDFVPNH MCIESAENLW WMDVLENGMS
SPYAHFFDID WEPVKKELTG KVLLPLLGDQ YGRVLENGGL QLLFRDGAFY VQVYALQIPL
EPKSCLQILQ HRLDALKEKF PAEAAPVEEL LSIETALQHL PLATEQDPEK MGERHREKEI
IKKRLWQLCQ ESPEVAAFIA DNVKSFNGSK GDPRSFDAMD KLLRDQAYRL SYWRVATEEI
NYRRFFDING LAAIRMEDQA VYDLTHTLLF RLIREGKVTG VRIDHVDGLY DPVSYLQNLQ
KSSYFQLRQA GEPLSSNNGE EKKEALEKEY NALLEKDPCY KPFYAVVEKI LMKGELLPDQ
WPVFGTTGYD FLNSLNGIFV ATEKAKQMDR LYDRFVKWGG DFPDLVYEKK KLVMQVSLSG
EGNMLAHQLN NIAEQDRLTR DFTLNSLARA ISEVIACFPV YRTYANSASV RDKDVQYIEA
AVYKAKRRNP AISGSVFDFV RDVLLLKSPE RASEDDRRSW LYFAMRFQQI TGPVMAKGLE
DTAFYVYNRL VSLNDVGGMP GKFGTTLEAF HGQNLDRNKT FPHAMIATAT HDSKRGEDIR
TRIDALSEIP ELWQKSLIRW SRFNKGKTIS IENQPVPDRN EEYLLYQILL GVWPAGEMDD
EGYKSLKGRV RDYMVKALRE AKVNTSWVSP NTAYEEGVTS FVDRVLEPGA SNLFLGEFLP
LQRRLARCGI FSSLSQTFLK MVSPGIPDFY QGTELFEFTL VDPDNRRQVD YGKRMEALSG
LKAREAESGP EALCRELMGT AEDGRIKLYL IHRVLNYRRD NRGPFEGGEY LPLEAKGTRE
RHVCAFARKG KEKTVIAVAA RLVATLMPAE GSFPLGEEAW QETVLVLPEG CGGRFRNIVN
GEELNAQEHG GEQVIVLSRL FGQISVALLE SVSG