Gene Moth_1811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1811 
Symbol 
ID3830729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1868385 
End bp1871252 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content56% 
IMG OID637829738 
Productmalto-oligosyltrehalose synthase 
Protein accessionYP_430654 
Protein GI83590645 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3280] Maltooligosyl trehalose synthase 
TIGRFAM ID[TIGR02401] malto-oligosyltrehalose synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGCC CGCACATTCC CACCGCTACC TACCGGCTGC AGTTTAACCG GCAGTTTGGC 
TTCATCGAAG CACGCGAGGT GGTACCGTAC CTGCAGGCGC TGGGTATTAC CGATATTTAT
GCCTCCCCCC TGCTGAAAGC GAGGAAGGAT AGCCCCCACG GGTACGACGT GACTGACCCT
GGGCAGTTAA ACCCGGAACT GGGAAGCAGG GAGGATTTTA CCTCCCTGGC TGACACCCTG
AAGCAGCATG GGATGGGGCT GTTGCTGGAC GTCGTGCCCA ACCACATGGC AGCCAGCGTT
GATAATCCGT GGTGGCGGGA CGTCCTCCGC CATGGGCGCG CTTCTACTTA CGCAGCTTAC
TTTGATATTG ACTGGCAACC AGCCAGGCCG GGGCTGGTAA ACAAAGTCCT CCTGCCAGTC
CTGGGTGAAC CCTTCGGGAA AGTATTGGAA AACCAGCAAC TGGCCCTGAA ACTGGCAGAA
GATGGGTTCC GAGTCTGCTA CTATGAAAAG GAATTCCCTC TTAGTCCCTT TTCTTCCCGC
CGGATTCTAG GCGGCTGGGC GCAAACCCTG GCCGAAGACG GGGGTGCTGC CGAACAGGCG
CTATCACAAT TGCGGGACCT GCTTGCTTCC CTCTCGGCGC TCCCTCTCCC CCGGGCTGGC
GAGCTGTCGA CTCCCTGGCA ACAGGCCTGG AAAAGCCTCT GGCATCTATA CAATACCAAC
CCAGCGGTAA AAGCATTTAT TGACAGGAAC CTGCGCAAAT TAAATGGTAA AAAAGGTGAC
CCGCAAAGTT TCAATCAACT GGAGGGTATC CTGGCGGAAC AGGCCTACCG GCTCGCCTAC
TGGCGGGTGG CCAATGAAGA AATCAATTAC CGGCGCTTCT TTGACGTTAG CGACCTGGTA
GCTATCCGCA TGGAAGACAA AAGGGTTTTT GAGGCGGTGC ACGCCCTGAT TTTTCAATTG
GTCGGGGCTG GGCAAGTTAC CGGCCTCAGA ATCGACCATA TTGATGGCCT GTATGACCCC
CAGGAGTACC TCAACAGGCT GCAGGAACAC CTTTCTGCAG CAGGAAGTTC TCCCGGTTTT
TACGTGGTAG CAGAGAAAAT ACTAAGCGAC GGGGAGGAAC TGCCGGCAAC CTGGCGTACT
CAGGGAACAA CGGGCTACGA TTTTTTAAAT TCGTTGAACG GACTCTTCGT TGACGAAGAA
GGGCTCGCCG CCCTGGAAGA GTTTTATGCC AGGTACAGCG GCGCCGAAAC CGACTTTACC
AGGGTGGTCT ATAACCAGAA GAAGCTGGTA ATGACCAGGT TGTTCGCCAG CGAGGTGCGT
AACCTGGTAG GGGAGCTGGG ACGCCTGGCT GAAGAGGATC GCCTGGGCCA CGACCTTACC
TTAGCAGAGC TGGAAGAGGC CCTGGTCGCA GTCACTGCCA GCCTGGGTAT ATACCGGACC
TACATCCACG ACTTCACGGT AGCGCCGCAG GACCGGCATT ACATCGAAAC TGCCATAGCC
GAGGCCGTCC GGAGGTGCCC GGCCGCCGGC CCGGCCTGTC GTTTCCTGCG CCAGGTGCTG
TTGCTGGATT TTCCCGTCTC CCTGCCCCCC GAACAACGGC AGGCCTGGCT GCGTTTCGTA
ATGCGCTGGC AACAGTTTAC GGGACCGGTT ATGGCCAAGG GTTACGAGGA TACCTCCCTC
TACATCTACA ACCCTTTGGT TTCCCTCAAT GAGGTAGGTA GCAGTCCCCG GACCAGGTGC
ATATCGGTGG CTGAGTTCCA CCGCCGCAAT AAAACCCGGC AGGAGCGCTG GCCCCATACC
CTCAACGCCA CGTCCACCCA TGATACCAAG CGCAGTGAGG ATGTCCGGGC GCGGATTAAT
GTTCTAACGG AAATCCCTAA CGCCTGGGTA GAGAGGGTTG AGCGCTGGCG TCGCTGGAAT
GGGCCCAAAA AGTTAAATAT AAAAGGCGAG CCTGTACCTG ATGGCAATAT GGAGCTATTT
ATCTACCAGA CGCTAATCGG CGCCTGGCCC CTTTTAGAAG AAGAAATACC CGCTTTTAAG
GAACGGCTCC GGACTTATAT GGTCAAGGCG GCCCGGGAGG CCAAAACCCG GACCAGCTGG
CTTGACCCTG ATACCGATTA CGAAAACGCC CTGATAGAGT TTGTACTCTC TATTTTAACG
CCGGAGCCAG GAAACCGGTT TCTGCCGGAC TTCCTCAGCT TTCAAAAAGT CGTTGCTTTT
TATGGTGCCT GGAACTCGTT AGCCCAGATA CTGCTGAAGA TAACGAGTCC GGGGGTGCCT
GATTTCTACC AGGGTACAGA ACTGTGGAAC CTCAGCCTGG TTGATCCTGA CAACCGGCGT
CCGGTTGACT TTAAAACCAG GGCCAGGCTT TTACAAAGGC TCAAAGAAGA AGAGACAAAA
GGCCAGCTGG CTCTGGTGAG AAACCTTTTG ACCAGCTGGC AGGATGGCCG GGTAAAGCTC
TACCTGACTT ATAAATCCCT TCACTTTCGC TGCGACCACC GGGAATTATT TGCGACGGGT
GAATATATCC CCCTGGCAGT CACCGGGTCC AGCTCGGGAC ACGCCTGCGC CTTCGCCAGG
CATCTGGGCA GGGAGTGGGC TCTGGTGGTC GTTCCCCGCC TACCGGCCCG GATGCTGACG
GGCAAGGTAA TACCAGCAAA TGGTGGGTTG CCGGCCCCCG GGTTTCTCCC AGGGGAAACT
CTATGGCAAG GAACAAACCT TGTGCTACCT GAACAGGCCC CGGGCAACTG GCATAATATC
TTAACAGGAG AGGTTCTGGC TTCTATCCCT TCGCCTGAAG GTAAAGTTCT TACCCTGGCC
GACACCTGGC GCAATTTTCC AGTTGCTTTA CTGACCGAAG ACCAATAA
 
Protein sequence
MASPHIPTAT YRLQFNRQFG FIEAREVVPY LQALGITDIY ASPLLKARKD SPHGYDVTDP 
GQLNPELGSR EDFTSLADTL KQHGMGLLLD VVPNHMAASV DNPWWRDVLR HGRASTYAAY
FDIDWQPARP GLVNKVLLPV LGEPFGKVLE NQQLALKLAE DGFRVCYYEK EFPLSPFSSR
RILGGWAQTL AEDGGAAEQA LSQLRDLLAS LSALPLPRAG ELSTPWQQAW KSLWHLYNTN
PAVKAFIDRN LRKLNGKKGD PQSFNQLEGI LAEQAYRLAY WRVANEEINY RRFFDVSDLV
AIRMEDKRVF EAVHALIFQL VGAGQVTGLR IDHIDGLYDP QEYLNRLQEH LSAAGSSPGF
YVVAEKILSD GEELPATWRT QGTTGYDFLN SLNGLFVDEE GLAALEEFYA RYSGAETDFT
RVVYNQKKLV MTRLFASEVR NLVGELGRLA EEDRLGHDLT LAELEEALVA VTASLGIYRT
YIHDFTVAPQ DRHYIETAIA EAVRRCPAAG PACRFLRQVL LLDFPVSLPP EQRQAWLRFV
MRWQQFTGPV MAKGYEDTSL YIYNPLVSLN EVGSSPRTRC ISVAEFHRRN KTRQERWPHT
LNATSTHDTK RSEDVRARIN VLTEIPNAWV ERVERWRRWN GPKKLNIKGE PVPDGNMELF
IYQTLIGAWP LLEEEIPAFK ERLRTYMVKA AREAKTRTSW LDPDTDYENA LIEFVLSILT
PEPGNRFLPD FLSFQKVVAF YGAWNSLAQI LLKITSPGVP DFYQGTELWN LSLVDPDNRR
PVDFKTRARL LQRLKEEETK GQLALVRNLL TSWQDGRVKL YLTYKSLHFR CDHRELFATG
EYIPLAVTGS SSGHACAFAR HLGREWALVV VPRLPARMLT GKVIPANGGL PAPGFLPGET
LWQGTNLVLP EQAPGNWHNI LTGEVLASIP SPEGKVLTLA DTWRNFPVAL LTEDQ