Gene Moth_1809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1809 
Symbol 
ID3830727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1864101 
End bp1865957 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content59% 
IMG OID637829736 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_430652 
Protein GI83590643 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTATA GTGTTAGCCT GGGCGCCAAT TACCTGGGGG ATGGCCGCTG CCGCTTCCAC 
CTCTGGGCAC CCCTGGCCGA AAGCGTCGCG GTACAGATCG TGGCACCTGA AGAAAGACTC
GAACCTTTAA CCCGAAAAGA ACGTGGTTAC TTTCAGGGCG AGGTAGCGGG GGTCGTACCA
GGTAGTCTCT ATTACTACCT CCTGGACGGG CAGCAGTTAC CCGACCCCGC TTCGCGCTAT
CAGCCACGGG GGGTCCACGG CCCTTCCCAG GTAGTAGACG CCAAAGCCTT TTCCTGGTCG
GATCGCTGCT GGCCGGGGCC CAAGGGTGAG GATTTAATCT TTTATGAGCT GCATGTAGGG
ACCTTCACCC CGGAGGGCAC TTTTGAAGCC ATCATCGCCC ACCTAGACGA TTTGCGCACC
CTGGGGGTTA CCGCCATTGA ATTAATGCCC GTGGCCCAGT TCCCGGGATG CCGCAACTGG
GGTTACGACG GCGTTTTCCC CTTCGCCGTC CAGAACTCTT ACGGTGGACC GGAGGGGCTC
AGGCGGCTGG TGGATACCTG CCACAGGTAC GGCCTGGCCG TATTCCTCGA TGTGGTCTAC
AACCACCTGG GCCCGGAAGG AAACTACCTG GCCAAATTCG GGCCTTATTT TACCGATCGT
TACCGTACCC CCTGGGGACA GGCCCTCAAC TTTGATGGAC CCGGCAGCGA CGAAGTCCGG
CGCTTCTTTA TCGAAAACGC CCTTTACTGG TTAACCGAGT TTCACCTGGA TGGCTTACGG
TTGGACGCCA TTCATGCCAT TATGGATAAG TCTGCGTTAC CTTTCCTGGA GGAACTGGCC
GCTGCCGTGA AGCTTCAGGC CGAACGGCTG GATCGCCGGG TTTATATTGT TGCCGAGAGT
GACTTGAACG ACCCCCGGGT AATCCGGCCG CAAGAATTGG GGGGATACGG CCTTGATGCC
CAGTGGTGCG ACGACTTTCA CCACGCCCTG CATGCGCTGC TGACGGGGGA AAGAAACGGC
TACTATCGTG ACTTCGGCAC CCTCGGCAAC CTGGCCCGGG CCTTCCGGGA AGGTTATGTT
TACACCGGTC AGTACTCTGC CTACCGGCAG CGGCGGCACG GCCGGCGGCC GCATCCGTGC
AAGGGTAACC AGTTCGTGGT TTTTACCCAG AACCATGATC AAGTAGGTAA TCGGGCCCGG
GGTGAAAGGT TAAGCACCCT GGTTCCCTTC GTCAAACTCA AACTGGCCGC CGCCGTAGTG
CTCCTCTCCC CCTTCGTACC ACTGCTCTTC ATGGGCGAGG AATACGGTGA AACAGCACCT
TTTCAATACT TTACCAGCCA TTCCGATCCC CGTCTGATCG CAGCGGTACG CCGGGGGCGG
CGGGAAGAAT TCGCCGGCCA CAACTGGACA GGCGAGGTGC CCGATCCCCA GGATGAGGCC
ACCTTCCGAC GTTCCCGTCT TAACCACGGC CTCAGCCTTC AGGGACAGCA CCGGGTACTC
TGGGAGTTCT ACCGGCAGCT AATCCAGTTA CGCCGGGAAC TACCCTCCCT TACGGAGCTA
AACCTGGAAA ATATGGAGGT TATCACCTGC GAGGAGGACC TGGTCCTGTT CGTACGCCGC
TGGAGTAGGG ATAGTGAGGT GGGCATCATA TTTTCCTTTA GCAACACCGC AACGGCCCCT
ACCCTGCCCC TGCCGGCAGG TCGCTGGCGC AAACGCCTGG ATGCCGCTGA AGAGCGCTGG
CTGGGCGATG GTAGTACCAT ACCTGCCCTG CTCGTATCCC AGGGAAAAAT GCAGGTGCCT
TTAACCCCGG GAGCATGCCT GTTGTTTGAA CGAATAAAGG AGGCTCAGGC TGACTAA
 
Protein sequence
MTYSVSLGAN YLGDGRCRFH LWAPLAESVA VQIVAPEERL EPLTRKERGY FQGEVAGVVP 
GSLYYYLLDG QQLPDPASRY QPRGVHGPSQ VVDAKAFSWS DRCWPGPKGE DLIFYELHVG
TFTPEGTFEA IIAHLDDLRT LGVTAIELMP VAQFPGCRNW GYDGVFPFAV QNSYGGPEGL
RRLVDTCHRY GLAVFLDVVY NHLGPEGNYL AKFGPYFTDR YRTPWGQALN FDGPGSDEVR
RFFIENALYW LTEFHLDGLR LDAIHAIMDK SALPFLEELA AAVKLQAERL DRRVYIVAES
DLNDPRVIRP QELGGYGLDA QWCDDFHHAL HALLTGERNG YYRDFGTLGN LARAFREGYV
YTGQYSAYRQ RRHGRRPHPC KGNQFVVFTQ NHDQVGNRAR GERLSTLVPF VKLKLAAAVV
LLSPFVPLLF MGEEYGETAP FQYFTSHSDP RLIAAVRRGR REEFAGHNWT GEVPDPQDEA
TFRRSRLNHG LSLQGQHRVL WEFYRQLIQL RRELPSLTEL NLENMEVITC EEDLVLFVRR
WSRDSEVGII FSFSNTATAP TLPLPAGRWR KRLDAAEERW LGDGSTIPAL LVSQGKMQVP
LTPGACLLFE RIKEAQAD