Gene Information |
Locus tag | GM21_0117 |
Symbol | |
ID | 8135420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 146614 |
End bp | 149964 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644867737 |
Product | trehalose synthase |
Protein accession | YP_003019961 |
Protein GI | 253698772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.0236355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACAGC GCAGCGACAA AAACCCGCTC TGGTTCAAGG ACGCCATCGT CTACGAGGTG CACATCAGGA GTTTCTGCGA CAGCAACGGG GACGGCATCG GAGACTTCCG GGGGCTGTTG CAAAAACTCC CCTACCTGCG CGACCTGGGG ATCACCGCCG TCTGGGTGCT CCCCTTCTAC CCCTCGCCGC TTAAGGACGA CGGCTACGAC ATCGCCGACT ACCGCAGCGT CCACCCGGAC TACGGCACCC TGCGCGACTT CCAGGACTTC CTGAAGGGCG CCCACGCCCT CGGCATGCGG GTCATCACCG AGCTGGTCTT AAACCACACC TCGGACCAGC ATCCCTGGTT CCAGAAATCC CGCACCTCTA AGCCCGGCTC CCCCTGGCGC GACTTCTACG TCTGGTCCGA CACCCCCGAC AAGTACCTGG ACGCCCGCAT CATCTTCAAG GATTTCGAGG TCTCCAACTG GACCCGCGAT CCCGTCGCCA AGAGCTATTT CTGGCACAGG TTCTATTCGC ACCAGCCCGA CCTCAATTAC GACAACCCCC GGGTGCACGA GGCGATGTTC CGGGTCATCG ACTTCTGGCT CGGGATGGGG GTGGACGGCC TGAGGCTCGA CGCGGTTCCC TACCTCTACG AGCGGGAGGG GACCAACTGC GAGAACCTCC CGGAGACCTA CGAGTTCCTG AAGAAGCTGC GCGCCTACAT CGACGCGAAG TATCCGGACC GGATGCTCCT GGCCGAGGCC AACCAGTGGC CGGAGGACGC CGCCTCCTAC TTCGGGGGGG GAGCCGCCTG CCAGATGTGC TTCCATTTCC CGCTCATGCC CCGCATGTTC ATGGCGCTGC AGATGGAGGA TTCCTTCCCC ATCATCAACA TCCTGCAGCA GACCCCGGTC ATCCCGCAGC AGTGCCAGTG GGCGCTGTTT CTTAGAAACC ACGACGAGCT CACCCTGGAG ATGGTGACCG ACGAGGAGCG GGACTACATG TACCGGATCT ACGCCCGGGA CCCCAGGGCG CGCATCAACC TTGGGATCCG GCGCCGGCTG GCGCCGCTCA TGGGGGACGA CCGGCGCAAG ATCGAGCTGA TGAACGTCCT TCTTTTCTGC CTACCCGGCA CCCCCATCAT CTACTACGGC GACGAGATCG GCATGGGGGA CAACTATTAC CTGGGCGACC GCAACGGCGT GCGCACGCCG ATGCAGTGGA GCCCCGACCG AAACGCCGGC TTCTCCCCGG TGAACCCGCA AAAGCTCTAC CTCCCCGCCA TCATCGATCC GGAGTACCAC TACGAGGCGC GTAACGTCGA GAACCAGGCG AAGAACCCCT CTTCGCTTCT TTGGTGGATG AAGCGGATGA TCGATCTGAG GAAGCGGTTC AAGGCCTTCG GCTGGGGGAG CGTCGAGTTC ATGCCGCTGG AGAACTCAAA GGTGCTGGCC ATGGTCAGGA AGTACGAGGA CCAGGCCATC CTGGTGCTGA TCAACCTCTC CCGCGCCACG CAGTTCGTGC AGGTGAACCA GCCGCGTTTC GTCGGCTTTT ATCCCGAGGA GATGTTCAGC CGCAACCGGT TCCCGGTGAT CAAGGACTCC CCCTACGGCG TCATCCTGGG AGCTTACGAC TACCACCTTC TGCTGATGAA GGACGGAAGC GAAGAGGTGA AGCCCCGGGA GGAGGGGGTG CTGGAGGAGG TCCCGGTCAC CGGCAGCTGG GAGAACGTGT TGAGGTCGGC GGGGCTCAGG CAGCTGGAAC AGTCGGTGCT TCCGGAATAC CTGAAGAGGT CCCGCTGGTT CCAGGGGAAG 
AGCCGGATCA TGGTTCGCTT CTCGGTGATG GAGAAGATTC CCGTCCCGGT CAACAGCTCT TCGGTGATCC TCACGCTTTT GGAGGTGAGC TACAGCGAGG GGGCGCCTGA GACCTACCTG CTGCCGCTTC ACTTCGTCCC CATGGACGGC GGAGGCGAGG GTATTCTCAC CGAGACGCCG GCCGCCGTCG TCTGCAAGCT GCGCATCGGC GACAAGGTCG GGATCCTCTA CGACGGCACC TACAACTCCC ATTTCCGCTG GGCGCTCTTC GAGATGATCT GGCGCCGCAA GGCGATCCGC ACCGACGGGG GGAGGTTCTC CGGGCGCCCG GCCAGCGGCA TGGGGGCGCT CATGGAAGGG AAGGAGCCTC CGTTCGTCTC CCAGGTGGGG AAGTCGGAAC AAAGCAACAC CTCGATGCTC TTCGACAAGC ACTTCTTCCT GAAGCTGTAC CGGCGGCTGG AAGAGGGGGC GCATCCGGAA GTCGAGATCG GGAGGTTCCT CTCGGACCGG ATCCGTTTCC AGCACGTGGC GCCCCTGGCC GGGACCATCG AGTACCGGCG ACCCGGCCTG GAACCGGTGG CCATCGGGAT GCTGCAGGCC TTCGTCCCCA ACCAGGGGGA CGCCTGGGCC TTTACCCTCG GTGAAGCGGG GCAGTTCGTC GATCGCGTGC TGGCCCATCG CGAGGAGGCA AAGGAATCCG GGACTCCCGC CGTCTCCCAG CCGGACACCG CGGCCACCGG CTTCTCCACG GTGCTGCATG ACCTGATCCA GGGGTTGTAC CCGGAGATGG TGACCCTGAT CGGAAAGCGC ACGGCGGAGC TGCATTTGGC TCTTTCCTCC CGCAGCGACG ACCCCTCCTT CGCGCCCGAA CCCTTTGCGC TTCTTTACCA GCGCTCGGTG TACCAGTCGA TGCGCAGCCG CACCAGGAAG GCCTTCGACC TCTTGCGCCG GAACCTGGGC CGGCTTCCCC AGGAACTGGT GCAGGAGGCC GAGGCGCTCC TGGGGATGGA GCCGGAAGTG CAGGGCGCCC TGCAGAAGTT CACGATCAGC AAGTTTTCGG CGATGAAGAC GCGGGTCCAC GGCGACTACC ACCTGGGGCA GCTACTCTAC ACCGGCGACG ATTTCCTGAT CATGGACTTC GAAGGGGAAC CGGTGAAATC CCTGGGAGAG CGAAGGATCA AGCAGTCGCC GCTCAGGGAC GTGGCGGCGA TGATGCGCTC CTTCGAGTAC GCGGGTCACG CCGTACTGAT GCAGAGGACC CAGGTCCGGG AAGAGGACGT CGCCTTCCTG CTCCCTTGGA TCCAGGCTTG GTGCCGCTAC AACTCCTCCC TTTTCCTCGC CTCCTACCAA AAGAAGGTGG AAGGGTGCAA TTTCATGCCG GACAACCCGC AGGATGTGGA GACCATGCTG CGCTGCTTCA TGTTGGACAA GGCGGTCTAT GAACTGGGGT ACGAACTGAA CAACCGCCCG GACTGGGTCT CCATACCGCT GCGCGGCATT ATGAACCTGC TGGCGTCGGA AAAGAGCGTT CCGAAGCTGG AGGGGCAATG A
|
Protein sequence | MPQRSDKNPL WFKDAIVYEV HIRSFCDSNG DGIGDFRGLL QKLPYLRDLG ITAVWVLPFY PSPLKDDGYD IADYRSVHPD YGTLRDFQDF LKGAHALGMR VITELVLNHT SDQHPWFQKS RTSKPGSPWR DFYVWSDTPD KYLDARIIFK DFEVSNWTRD PVAKSYFWHR FYSHQPDLNY DNPRVHEAMF RVIDFWLGMG VDGLRLDAVP YLYEREGTNC ENLPETYEFL KKLRAYIDAK YPDRMLLAEA NQWPEDAASY FGGGAACQMC FHFPLMPRMF MALQMEDSFP IINILQQTPV IPQQCQWALF LRNHDELTLE MVTDEERDYM YRIYARDPRA RINLGIRRRL APLMGDDRRK IELMNVLLFC LPGTPIIYYG DEIGMGDNYY LGDRNGVRTP MQWSPDRNAG FSPVNPQKLY LPAIIDPEYH YEARNVENQA KNPSSLLWWM KRMIDLRKRF KAFGWGSVEF MPLENSKVLA MVRKYEDQAI LVLINLSRAT QFVQVNQPRF VGFYPEEMFS RNRFPVIKDS PYGVILGAYD YHLLLMKDGS EEVKPREEGV LEEVPVTGSW ENVLRSAGLR QLEQSVLPEY LKRSRWFQGK SRIMVRFSVM EKIPVPVNSS SVILTLLEVS YSEGAPETYL LPLHFVPMDG GGEGILTETP AAVVCKLRIG DKVGILYDGT YNSHFRWALF EMIWRRKAIR TDGGRFSGRP ASGMGALMEG KEPPFVSQVG KSEQSNTSML FDKHFFLKLY RRLEEGAHPE VEIGRFLSDR IRFQHVAPLA GTIEYRRPGL EPVAIGMLQA FVPNQGDAWA FTLGEAGQFV DRVLAHREEA KESGTPAVSQ PDTAATGFST VLHDLIQGLY PEMVTLIGKR TAELHLALSS RSDDPSFAPE PFALLYQRSV YQSMRSRTRK AFDLLRRNLG RLPQELVQEA EALLGMEPEV QGALQKFTIS KFSAMKTRVH GDYHLGQLLY TGDDFLIMDF EGEPVKSLGE RRIKQSPLRD VAAMMRSFEY AGHAVLMQRT QVREEDVAFL LPWIQAWCRY NSSLFLASYQ KKVEGCNFMP DNPQDVETML RCFMLDKAVY ELGYELNNRP DWVSIPLRGI MNLLASEKSV PKLEGQ
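
Note: the coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API. The `gc_percent` helper accepts the space-grouped sequence format used above.

```python
# Values copied from the Gene Information table of this record.
START_BP = 146614
END_BP = 149964
GENE_LENGTH_BP = 3351
PROTEIN_LENGTH_AA = 1116


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace and case are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Both checks hold for the values listed in this record.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Passing the full Gene sequence string to `gc_percent` would allow a comparison with the GC content field, which is reported here as a rounded percentage.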