Gene Moth_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1810 
Symbol 
ID3830728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1865957 
End bp1868392 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content55% 
IMG OID637829737 
Productglycoside hydrolase family protein 
Protein accessionYP_430653 
Protein GI83590644 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGGT ACATTTGCAT CCACGGCCAT TTTTACCAGC CGCCACGGGA AAACCCCTGG 
CTGGAGGACA TCGAGCTCCA GGACTCGGCT TACCCCTACC ACGACTGGGA TGAACGGATT
ACCGCTGAAT GCTACGAACC TAATACCGCC TCACGCATTC TTGATGGTGA CGGGTGGATC
AGGAAAATCG TCAACAACTA TAGCAAAATC AGTTTCAACT TTGGTCCCAC CCTCCTTTCC
TGGATGGAAA CCAATGCCCC GGAGGTTTAC CGGGCGATCA TTGAAGCCGA CCGGGAAAGC
CAGCGGCGCT TCTCGGGACA CGGTTCCGCC CTGGCCCAGG CCTATAACCA CATGATCATG
CCCCTGGCCA ACACGCGTGA TAAATACACC CAGGTAATCT GGGGGATTAA AGATTTTGAG
CATCGCTTTG GCCGGCGGCC GGAAGGAATG TGGTTACCTG AGACCGCCGT TGATCTGGAA
ACCCTCGCCA TCCTGGCCCA GCAGGGCATC CGTTTTACCA TCCTGGCCCA CTGGCAGGCC
CGCCGCATCC GTCCCCTGGG TAGCGATAAC TGGCAGGAGG TCCACGGCGG CCTAGATACC
ACCATGCCTT ATCAGGTCCG CCTGCCCAAT ACGGACCGGA CCATCAATGT CTTCTTTTAT
AATGGCGAAG TCGCCCGCGC CGTAGCCTTC GAGAAACTGC TCGATAACGG CGAACGCTTT
GCCAAACGTC TACTGAGTGT CTTTAACGAG GGACGCAACG CGCCCCAGCT TGTCAATATT
GCTACTGACG GGGAGACCTA CGGCCACCAC CACCGCCACG GGGATATGGC CCTGGCCTAC
GCCCTGGATT ATATTGAAGC CAATAAACTG GCACGTCTAA CCAACTATGG AGAATACCTG
GAAAAACACC CGCCCACCCA TGAGGTGGAG ATAAATAATA ATAGCTCCTG GAGCTGTGCC
CATGGAGTGG AAAGGTGGCG AACTAATTGT GGCTGTAACA CGGGTATGCA TCCAGGGTGG
AGCCAGGCCT GGCGGGCACC CCTGCGCGAT AGCCTCAACT GGCTGCGGAA CACCCTGGCA
CCCAAGTTTG AGGGAAGGGC GCGCCAGTTT CTAAAAGATC CCTGGGCCGC GCGTAACGAC
TACATCGCGG TCATCCTCGA CCGTTCACCG GAAAACTTCG ACCGCTTCCT CGGACAACAT
GCCACCCGTA TACTAAACCA GGAAGAAAAA ATTACGGTGT TGAAGCTGTT GGAGCTCCAG
CGACACGCCA TGTTAATGTT CACCAGCTGC GGCTGGTTCT TTGATGAGAT CTCAGGCATT
GAGACAGTGC AGGTTATTAA ATACGCCGGC CGTGTGATTC AATTAGCTCA GGAACTATTT
AACGAGTCGC CAGAGCCCCG TTTTATGGAG ATGCTGGCCC AGGCCAAGAG CAACATCCCT
GAACACCGTG ACGGAGCCCA TATCTATGAA AAATTCGTCA AGCCGGCAAT GGTCGACTTG
CTTAAGGTAG GTGCCCATTA TGCCCTGTGT TCCCTGTACG AAACCTATGA CCAGCATTCC
CGCATCTTTT GCTACGATGT CTACCGTGAA GACTACCAGA ATCGTTTAGC AGGTACAGCC
AGGTTAGCTG TAGGCCGGGC GCAGGTCACC TCCCAGATTA CCCAGGAGTC GATCAAGATT
AGTTATGGCG CCGTTACTTT AGGTAACCAT AATGTTAGTG GCGGTGTCCA GGTTTACACC
AATGACGATT CCTACCAGCG AATGGTACAG GAATTAACCG GTGCCTTCGA CCGGGCCGAT
TTCAATGAGG TCATCAGGCT CCTGGATCAA CACTTTGCAG GGGCCACCTA TTCCTTAAGG
CAGCTTTTCC GGGACAAACA GCGGATGGTA CTGGATATCA TCCTGGAGTC CACCCTGGCG
GAAGCGGCGG AAGATTACCG GCGCATTTAC GATCGTCATG CTCCTTTAAT GCGGTTCTTA
AAGGATTTAA ATATCCCCCA GCCCCGGGCC CTGCAAGCCG CTGCCGAGTT CGTCTTGAAC
ACCAGCCTCC GCCAGGCTTT CGCAGGTGAC AATCTGGACC TGGAACACAT AAAAGCGCTC
TTGGGGGAAG CCGAGATGGC CGGTGTTCCC CTCGACGGCG AAGGTCTGGG GTATGTCCTG
GAACAGACCC TGAAACAGAT GGCTGAGAAA CTGCTAGGCC AACCTGACGA CCAGGTCTTC
ATCGGGCGTT TGGACGCGGT GATCAGCCTG GTGCGCTCGT TACCCTTTGA AGTAAACCTC
TGGAAAGTCC AAAACGCTTA TTACCGTCTG TTGCAGACAG TTTACCCCGG ATACCGGGAG
AAAGCCCGGC AGGGGGATGG GGAAGCCCGG GCGTGGCTTG ACCTGTTCAA CTCCCTGGGT
GACAAACTGC AGGTACGGAG AGGCAAAAAT GGCTAG
 
Protein sequence
MERYICIHGH FYQPPRENPW LEDIELQDSA YPYHDWDERI TAECYEPNTA SRILDGDGWI 
RKIVNNYSKI SFNFGPTLLS WMETNAPEVY RAIIEADRES QRRFSGHGSA LAQAYNHMIM
PLANTRDKYT QVIWGIKDFE HRFGRRPEGM WLPETAVDLE TLAILAQQGI RFTILAHWQA
RRIRPLGSDN WQEVHGGLDT TMPYQVRLPN TDRTINVFFY NGEVARAVAF EKLLDNGERF
AKRLLSVFNE GRNAPQLVNI ATDGETYGHH HRHGDMALAY ALDYIEANKL ARLTNYGEYL
EKHPPTHEVE INNNSSWSCA HGVERWRTNC GCNTGMHPGW SQAWRAPLRD SLNWLRNTLA
PKFEGRARQF LKDPWAARND YIAVILDRSP ENFDRFLGQH ATRILNQEEK ITVLKLLELQ
RHAMLMFTSC GWFFDEISGI ETVQVIKYAG RVIQLAQELF NESPEPRFME MLAQAKSNIP
EHRDGAHIYE KFVKPAMVDL LKVGAHYALC SLYETYDQHS RIFCYDVYRE DYQNRLAGTA
RLAVGRAQVT SQITQESIKI SYGAVTLGNH NVSGGVQVYT NDDSYQRMVQ ELTGAFDRAD
FNEVIRLLDQ HFAGATYSLR QLFRDKQRMV LDIILESTLA EAAEDYRRIY DRHAPLMRFL
KDLNIPQPRA LQAAAEFVLN TSLRQAFAGD NLDLEHIKAL LGEAEMAGVP LDGEGLGYVL
EQTLKQMAEK LLGQPDDQVF IGRLDAVISL VRSLPFEVNL WKVQNAYYRL LQTVYPGYRE
KARQGDGEAR AWLDLFNSLG DKLQVRRGKN G