Gene GM21_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0117 
Symbol 
ID8135420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp146614 
End bp149964 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content63% 
IMG OID644867737 
Producttrehalose synthase 
Protein accessionYP_003019961 
Protein GI253698772 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.0236355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACAGC GCAGCGACAA AAACCCGCTC TGGTTCAAGG ACGCCATCGT CTACGAGGTG 
CACATCAGGA GTTTCTGCGA CAGCAACGGG GACGGCATCG GAGACTTCCG GGGGCTGTTG
CAAAAACTCC CCTACCTGCG CGACCTGGGG ATCACCGCCG TCTGGGTGCT CCCCTTCTAC
CCCTCGCCGC TTAAGGACGA CGGCTACGAC ATCGCCGACT ACCGCAGCGT CCACCCGGAC
TACGGCACCC TGCGCGACTT CCAGGACTTC CTGAAGGGCG CCCACGCCCT CGGCATGCGG
GTCATCACCG AGCTGGTCTT AAACCACACC TCGGACCAGC ATCCCTGGTT CCAGAAATCC
CGCACCTCTA AGCCCGGCTC CCCCTGGCGC GACTTCTACG TCTGGTCCGA CACCCCCGAC
AAGTACCTGG ACGCCCGCAT CATCTTCAAG GATTTCGAGG TCTCCAACTG GACCCGCGAT
CCCGTCGCCA AGAGCTATTT CTGGCACAGG TTCTATTCGC ACCAGCCCGA CCTCAATTAC
GACAACCCCC GGGTGCACGA GGCGATGTTC CGGGTCATCG ACTTCTGGCT CGGGATGGGG
GTGGACGGCC TGAGGCTCGA CGCGGTTCCC TACCTCTACG AGCGGGAGGG GACCAACTGC
GAGAACCTCC CGGAGACCTA CGAGTTCCTG AAGAAGCTGC GCGCCTACAT CGACGCGAAG
TATCCGGACC GGATGCTCCT GGCCGAGGCC AACCAGTGGC CGGAGGACGC CGCCTCCTAC
TTCGGGGGGG GAGCCGCCTG CCAGATGTGC TTCCATTTCC CGCTCATGCC CCGCATGTTC
ATGGCGCTGC AGATGGAGGA TTCCTTCCCC ATCATCAACA TCCTGCAGCA GACCCCGGTC
ATCCCGCAGC AGTGCCAGTG GGCGCTGTTT CTTAGAAACC ACGACGAGCT CACCCTGGAG
ATGGTGACCG ACGAGGAGCG GGACTACATG TACCGGATCT ACGCCCGGGA CCCCAGGGCG
CGCATCAACC TTGGGATCCG GCGCCGGCTG GCGCCGCTCA TGGGGGACGA CCGGCGCAAG
ATCGAGCTGA TGAACGTCCT TCTTTTCTGC CTACCCGGCA CCCCCATCAT CTACTACGGC
GACGAGATCG GCATGGGGGA CAACTATTAC CTGGGCGACC GCAACGGCGT GCGCACGCCG
ATGCAGTGGA GCCCCGACCG AAACGCCGGC TTCTCCCCGG TGAACCCGCA AAAGCTCTAC
CTCCCCGCCA TCATCGATCC GGAGTACCAC TACGAGGCGC GTAACGTCGA GAACCAGGCG
AAGAACCCCT CTTCGCTTCT TTGGTGGATG AAGCGGATGA TCGATCTGAG GAAGCGGTTC
AAGGCCTTCG GCTGGGGGAG CGTCGAGTTC ATGCCGCTGG AGAACTCAAA GGTGCTGGCC
ATGGTCAGGA AGTACGAGGA CCAGGCCATC CTGGTGCTGA TCAACCTCTC CCGCGCCACG
CAGTTCGTGC AGGTGAACCA GCCGCGTTTC GTCGGCTTTT ATCCCGAGGA GATGTTCAGC
CGCAACCGGT TCCCGGTGAT CAAGGACTCC CCCTACGGCG TCATCCTGGG AGCTTACGAC
TACCACCTTC TGCTGATGAA GGACGGAAGC GAAGAGGTGA AGCCCCGGGA GGAGGGGGTG
CTGGAGGAGG TCCCGGTCAC CGGCAGCTGG GAGAACGTGT TGAGGTCGGC GGGGCTCAGG
CAGCTGGAAC AGTCGGTGCT TCCGGAATAC CTGAAGAGGT CCCGCTGGTT CCAGGGGAAG
AGCCGGATCA TGGTTCGCTT CTCGGTGATG GAGAAGATTC CCGTCCCGGT CAACAGCTCT
TCGGTGATCC TCACGCTTTT GGAGGTGAGC TACAGCGAGG GGGCGCCTGA GACCTACCTG
CTGCCGCTTC ACTTCGTCCC CATGGACGGC GGAGGCGAGG GTATTCTCAC CGAGACGCCG
GCCGCCGTCG TCTGCAAGCT GCGCATCGGC GACAAGGTCG GGATCCTCTA CGACGGCACC
TACAACTCCC ATTTCCGCTG GGCGCTCTTC GAGATGATCT GGCGCCGCAA GGCGATCCGC
ACCGACGGGG GGAGGTTCTC CGGGCGCCCG GCCAGCGGCA TGGGGGCGCT CATGGAAGGG
AAGGAGCCTC CGTTCGTCTC CCAGGTGGGG AAGTCGGAAC AAAGCAACAC CTCGATGCTC
TTCGACAAGC ACTTCTTCCT GAAGCTGTAC CGGCGGCTGG AAGAGGGGGC GCATCCGGAA
GTCGAGATCG GGAGGTTCCT CTCGGACCGG ATCCGTTTCC AGCACGTGGC GCCCCTGGCC
GGGACCATCG AGTACCGGCG ACCCGGCCTG GAACCGGTGG CCATCGGGAT GCTGCAGGCC
TTCGTCCCCA ACCAGGGGGA CGCCTGGGCC TTTACCCTCG GTGAAGCGGG GCAGTTCGTC
GATCGCGTGC TGGCCCATCG CGAGGAGGCA AAGGAATCCG GGACTCCCGC CGTCTCCCAG
CCGGACACCG CGGCCACCGG CTTCTCCACG GTGCTGCATG ACCTGATCCA GGGGTTGTAC
CCGGAGATGG TGACCCTGAT CGGAAAGCGC ACGGCGGAGC TGCATTTGGC TCTTTCCTCC
CGCAGCGACG ACCCCTCCTT CGCGCCCGAA CCCTTTGCGC TTCTTTACCA GCGCTCGGTG
TACCAGTCGA TGCGCAGCCG CACCAGGAAG GCCTTCGACC TCTTGCGCCG GAACCTGGGC
CGGCTTCCCC AGGAACTGGT GCAGGAGGCC GAGGCGCTCC TGGGGATGGA GCCGGAAGTG
CAGGGCGCCC TGCAGAAGTT CACGATCAGC AAGTTTTCGG CGATGAAGAC GCGGGTCCAC
GGCGACTACC ACCTGGGGCA GCTACTCTAC ACCGGCGACG ATTTCCTGAT CATGGACTTC
GAAGGGGAAC CGGTGAAATC CCTGGGAGAG CGAAGGATCA AGCAGTCGCC GCTCAGGGAC
GTGGCGGCGA TGATGCGCTC CTTCGAGTAC GCGGGTCACG CCGTACTGAT GCAGAGGACC
CAGGTCCGGG AAGAGGACGT CGCCTTCCTG CTCCCTTGGA TCCAGGCTTG GTGCCGCTAC
AACTCCTCCC TTTTCCTCGC CTCCTACCAA AAGAAGGTGG AAGGGTGCAA TTTCATGCCG
GACAACCCGC AGGATGTGGA GACCATGCTG CGCTGCTTCA TGTTGGACAA GGCGGTCTAT
GAACTGGGGT ACGAACTGAA CAACCGCCCG GACTGGGTCT CCATACCGCT GCGCGGCATT
ATGAACCTGC TGGCGTCGGA AAAGAGCGTT CCGAAGCTGG AGGGGCAATG A
 
Protein sequence
MPQRSDKNPL WFKDAIVYEV HIRSFCDSNG DGIGDFRGLL QKLPYLRDLG ITAVWVLPFY 
PSPLKDDGYD IADYRSVHPD YGTLRDFQDF LKGAHALGMR VITELVLNHT SDQHPWFQKS
RTSKPGSPWR DFYVWSDTPD KYLDARIIFK DFEVSNWTRD PVAKSYFWHR FYSHQPDLNY
DNPRVHEAMF RVIDFWLGMG VDGLRLDAVP YLYEREGTNC ENLPETYEFL KKLRAYIDAK
YPDRMLLAEA NQWPEDAASY FGGGAACQMC FHFPLMPRMF MALQMEDSFP IINILQQTPV
IPQQCQWALF LRNHDELTLE MVTDEERDYM YRIYARDPRA RINLGIRRRL APLMGDDRRK
IELMNVLLFC LPGTPIIYYG DEIGMGDNYY LGDRNGVRTP MQWSPDRNAG FSPVNPQKLY
LPAIIDPEYH YEARNVENQA KNPSSLLWWM KRMIDLRKRF KAFGWGSVEF MPLENSKVLA
MVRKYEDQAI LVLINLSRAT QFVQVNQPRF VGFYPEEMFS RNRFPVIKDS PYGVILGAYD
YHLLLMKDGS EEVKPREEGV LEEVPVTGSW ENVLRSAGLR QLEQSVLPEY LKRSRWFQGK
SRIMVRFSVM EKIPVPVNSS SVILTLLEVS YSEGAPETYL LPLHFVPMDG GGEGILTETP
AAVVCKLRIG DKVGILYDGT YNSHFRWALF EMIWRRKAIR TDGGRFSGRP ASGMGALMEG
KEPPFVSQVG KSEQSNTSML FDKHFFLKLY RRLEEGAHPE VEIGRFLSDR IRFQHVAPLA
GTIEYRRPGL EPVAIGMLQA FVPNQGDAWA FTLGEAGQFV DRVLAHREEA KESGTPAVSQ
PDTAATGFST VLHDLIQGLY PEMVTLIGKR TAELHLALSS RSDDPSFAPE PFALLYQRSV
YQSMRSRTRK AFDLLRRNLG RLPQELVQEA EALLGMEPEV QGALQKFTIS KFSAMKTRVH
GDYHLGQLLY TGDDFLIMDF EGEPVKSLGE RRIKQSPLRD VAAMMRSFEY AGHAVLMQRT
QVREEDVAFL LPWIQAWCRY NSSLFLASYQ KKVEGCNFMP DNPQDVETML RCFMLDKAVY
ELGYELNNRP DWVSIPLRGI MNLLASEKSV PKLEGQ