Gene Information |
Locus tag | Moth_1811 |
Symbol | |
ID | 3830729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1868385 |
End bp | 1871252 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829738 |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_430654 |
Protein GI | 83590645 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
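
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The record does not state either convention, though both agree with the listed values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(1868385, 1871252)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed 2868 bp and 955 aa.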
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGCC CGCACATTCC CACCGCTACC TACCGGCTGC AGTTTAACCG GCAGTTTGGC TTCATCGAAG CACGCGAGGT GGTACCGTAC CTGCAGGCGC TGGGTATTAC CGATATTTAT GCCTCCCCCC TGCTGAAAGC GAGGAAGGAT AGCCCCCACG GGTACGACGT GACTGACCCT GGGCAGTTAA ACCCGGAACT GGGAAGCAGG GAGGATTTTA CCTCCCTGGC TGACACCCTG AAGCAGCATG GGATGGGGCT GTTGCTGGAC GTCGTGCCCA ACCACATGGC AGCCAGCGTT GATAATCCGT GGTGGCGGGA CGTCCTCCGC CATGGGCGCG CTTCTACTTA CGCAGCTTAC TTTGATATTG ACTGGCAACC AGCCAGGCCG GGGCTGGTAA ACAAAGTCCT CCTGCCAGTC CTGGGTGAAC CCTTCGGGAA AGTATTGGAA AACCAGCAAC TGGCCCTGAA ACTGGCAGAA GATGGGTTCC GAGTCTGCTA CTATGAAAAG GAATTCCCTC TTAGTCCCTT TTCTTCCCGC CGGATTCTAG GCGGCTGGGC GCAAACCCTG GCCGAAGACG GGGGTGCTGC CGAACAGGCG CTATCACAAT TGCGGGACCT GCTTGCTTCC CTCTCGGCGC TCCCTCTCCC CCGGGCTGGC GAGCTGTCGA CTCCCTGGCA ACAGGCCTGG AAAAGCCTCT GGCATCTATA CAATACCAAC CCAGCGGTAA AAGCATTTAT TGACAGGAAC CTGCGCAAAT TAAATGGTAA AAAAGGTGAC CCGCAAAGTT TCAATCAACT GGAGGGTATC CTGGCGGAAC AGGCCTACCG GCTCGCCTAC TGGCGGGTGG CCAATGAAGA AATCAATTAC CGGCGCTTCT TTGACGTTAG CGACCTGGTA GCTATCCGCA TGGAAGACAA AAGGGTTTTT GAGGCGGTGC ACGCCCTGAT TTTTCAATTG GTCGGGGCTG GGCAAGTTAC CGGCCTCAGA ATCGACCATA TTGATGGCCT GTATGACCCC CAGGAGTACC TCAACAGGCT GCAGGAACAC CTTTCTGCAG CAGGAAGTTC TCCCGGTTTT TACGTGGTAG CAGAGAAAAT ACTAAGCGAC GGGGAGGAAC TGCCGGCAAC CTGGCGTACT CAGGGAACAA CGGGCTACGA TTTTTTAAAT TCGTTGAACG GACTCTTCGT TGACGAAGAA GGGCTCGCCG CCCTGGAAGA GTTTTATGCC AGGTACAGCG GCGCCGAAAC CGACTTTACC AGGGTGGTCT ATAACCAGAA GAAGCTGGTA ATGACCAGGT TGTTCGCCAG CGAGGTGCGT AACCTGGTAG GGGAGCTGGG ACGCCTGGCT GAAGAGGATC GCCTGGGCCA CGACCTTACC TTAGCAGAGC TGGAAGAGGC CCTGGTCGCA GTCACTGCCA GCCTGGGTAT ATACCGGACC TACATCCACG ACTTCACGGT AGCGCCGCAG GACCGGCATT ACATCGAAAC TGCCATAGCC GAGGCCGTCC GGAGGTGCCC GGCCGCCGGC CCGGCCTGTC GTTTCCTGCG CCAGGTGCTG TTGCTGGATT TTCCCGTCTC CCTGCCCCCC GAACAACGGC AGGCCTGGCT GCGTTTCGTA ATGCGCTGGC AACAGTTTAC GGGACCGGTT ATGGCCAAGG GTTACGAGGA TACCTCCCTC TACATCTACA ACCCTTTGGT TTCCCTCAAT GAGGTAGGTA GCAGTCCCCG GACCAGGTGC ATATCGGTGG CTGAGTTCCA CCGCCGCAAT AAAACCCGGC AGGAGCGCTG GCCCCATACC 
CTCAACGCCA CGTCCACCCA TGATACCAAG CGCAGTGAGG ATGTCCGGGC GCGGATTAAT GTTCTAACGG AAATCCCTAA CGCCTGGGTA GAGAGGGTTG AGCGCTGGCG TCGCTGGAAT GGGCCCAAAA AGTTAAATAT AAAAGGCGAG CCTGTACCTG ATGGCAATAT GGAGCTATTT ATCTACCAGA CGCTAATCGG CGCCTGGCCC CTTTTAGAAG AAGAAATACC CGCTTTTAAG GAACGGCTCC GGACTTATAT GGTCAAGGCG GCCCGGGAGG CCAAAACCCG GACCAGCTGG CTTGACCCTG ATACCGATTA CGAAAACGCC CTGATAGAGT TTGTACTCTC TATTTTAACG CCGGAGCCAG GAAACCGGTT TCTGCCGGAC TTCCTCAGCT TTCAAAAAGT CGTTGCTTTT TATGGTGCCT GGAACTCGTT AGCCCAGATA CTGCTGAAGA TAACGAGTCC GGGGGTGCCT GATTTCTACC AGGGTACAGA ACTGTGGAAC CTCAGCCTGG TTGATCCTGA CAACCGGCGT CCGGTTGACT TTAAAACCAG GGCCAGGCTT TTACAAAGGC TCAAAGAAGA AGAGACAAAA GGCCAGCTGG CTCTGGTGAG AAACCTTTTG ACCAGCTGGC AGGATGGCCG GGTAAAGCTC TACCTGACTT ATAAATCCCT TCACTTTCGC TGCGACCACC GGGAATTATT TGCGACGGGT GAATATATCC CCCTGGCAGT CACCGGGTCC AGCTCGGGAC ACGCCTGCGC CTTCGCCAGG CATCTGGGCA GGGAGTGGGC TCTGGTGGTC GTTCCCCGCC TACCGGCCCG GATGCTGACG GGCAAGGTAA TACCAGCAAA TGGTGGGTTG CCGGCCCCCG GGTTTCTCCC AGGGGAAACT CTATGGCAAG GAACAAACCT TGTGCTACCT GAACAGGCCC CGGGCAACTG GCATAATATC TTAACAGGAG AGGTTCTGGC TTCTATCCCT TCGCCTGAAG GTAAAGTTCT TACCCTGGCC GACACCTGGC GCAATTTTCC AGTTGCTTTA CTGACCGAAG ACCAATAA
|
Protein sequence | MASPHIPTAT YRLQFNRQFG FIEAREVVPY LQALGITDIY ASPLLKARKD SPHGYDVTDP GQLNPELGSR EDFTSLADTL KQHGMGLLLD VVPNHMAASV DNPWWRDVLR HGRASTYAAY FDIDWQPARP GLVNKVLLPV LGEPFGKVLE NQQLALKLAE DGFRVCYYEK EFPLSPFSSR RILGGWAQTL AEDGGAAEQA LSQLRDLLAS LSALPLPRAG ELSTPWQQAW KSLWHLYNTN PAVKAFIDRN LRKLNGKKGD PQSFNQLEGI LAEQAYRLAY WRVANEEINY RRFFDVSDLV AIRMEDKRVF EAVHALIFQL VGAGQVTGLR IDHIDGLYDP QEYLNRLQEH LSAAGSSPGF YVVAEKILSD GEELPATWRT QGTTGYDFLN SLNGLFVDEE GLAALEEFYA RYSGAETDFT RVVYNQKKLV MTRLFASEVR NLVGELGRLA EEDRLGHDLT LAELEEALVA VTASLGIYRT YIHDFTVAPQ DRHYIETAIA EAVRRCPAAG PACRFLRQVL LLDFPVSLPP EQRQAWLRFV MRWQQFTGPV MAKGYEDTSL YIYNPLVSLN EVGSSPRTRC ISVAEFHRRN KTRQERWPHT LNATSTHDTK RSEDVRARIN VLTEIPNAWV ERVERWRRWN GPKKLNIKGE PVPDGNMELF IYQTLIGAWP LLEEEIPAFK ERLRTYMVKA AREAKTRTSW LDPDTDYENA LIEFVLSILT PEPGNRFLPD FLSFQKVVAF YGAWNSLAQI LLKITSPGVP DFYQGTELWN LSLVDPDNRR PVDFKTRARL LQRLKEEETK GQLALVRNLL TSWQDGRVKL YLTYKSLHFR CDHRELFATG EYIPLAVTGS SSGHACAFAR HLGREWALVV VPRLPARMLT GKVIPANGGL PAPGFLPGET LWQGTNLVLP EQAPGNWHNI LTGEVLASIP SPEGKVLTLA DTWRNFPVAL LTEDQ
|