Gene Information |
Locus tag | Moth_1809 |
Symbol | |
ID | 3830727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1864101 |
End bp | 1865957 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829736 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_430652 |
Protein GI | 83590643 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
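
The coordinate and length fields above are tied together by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1864101
END_BP = 1865957


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1857 618
```

Both results agree with the Gene Length (1857 bp) and Protein Length (618 aa) fields.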
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTATA GTGTTAGCCT GGGCGCCAAT TACCTGGGGG ATGGCCGCTG CCGCTTCCAC CTCTGGGCAC CCCTGGCCGA AAGCGTCGCG GTACAGATCG TGGCACCTGA AGAAAGACTC GAACCTTTAA CCCGAAAAGA ACGTGGTTAC TTTCAGGGCG AGGTAGCGGG GGTCGTACCA GGTAGTCTCT ATTACTACCT CCTGGACGGG CAGCAGTTAC CCGACCCCGC TTCGCGCTAT CAGCCACGGG GGGTCCACGG CCCTTCCCAG GTAGTAGACG CCAAAGCCTT TTCCTGGTCG GATCGCTGCT GGCCGGGGCC CAAGGGTGAG GATTTAATCT TTTATGAGCT GCATGTAGGG ACCTTCACCC CGGAGGGCAC TTTTGAAGCC ATCATCGCCC ACCTAGACGA TTTGCGCACC CTGGGGGTTA CCGCCATTGA ATTAATGCCC GTGGCCCAGT TCCCGGGATG CCGCAACTGG GGTTACGACG GCGTTTTCCC CTTCGCCGTC CAGAACTCTT ACGGTGGACC GGAGGGGCTC AGGCGGCTGG TGGATACCTG CCACAGGTAC GGCCTGGCCG TATTCCTCGA TGTGGTCTAC AACCACCTGG GCCCGGAAGG AAACTACCTG GCCAAATTCG GGCCTTATTT TACCGATCGT TACCGTACCC CCTGGGGACA GGCCCTCAAC TTTGATGGAC CCGGCAGCGA CGAAGTCCGG CGCTTCTTTA TCGAAAACGC CCTTTACTGG TTAACCGAGT TTCACCTGGA TGGCTTACGG TTGGACGCCA TTCATGCCAT TATGGATAAG TCTGCGTTAC CTTTCCTGGA GGAACTGGCC GCTGCCGTGA AGCTTCAGGC CGAACGGCTG GATCGCCGGG TTTATATTGT TGCCGAGAGT GACTTGAACG ACCCCCGGGT AATCCGGCCG CAAGAATTGG GGGGATACGG CCTTGATGCC CAGTGGTGCG ACGACTTTCA CCACGCCCTG CATGCGCTGC TGACGGGGGA AAGAAACGGC TACTATCGTG ACTTCGGCAC CCTCGGCAAC CTGGCCCGGG CCTTCCGGGA AGGTTATGTT TACACCGGTC AGTACTCTGC CTACCGGCAG CGGCGGCACG GCCGGCGGCC GCATCCGTGC AAGGGTAACC AGTTCGTGGT TTTTACCCAG AACCATGATC AAGTAGGTAA TCGGGCCCGG GGTGAAAGGT TAAGCACCCT GGTTCCCTTC GTCAAACTCA AACTGGCCGC CGCCGTAGTG CTCCTCTCCC CCTTCGTACC ACTGCTCTTC ATGGGCGAGG AATACGGTGA AACAGCACCT TTTCAATACT TTACCAGCCA TTCCGATCCC CGTCTGATCG CAGCGGTACG CCGGGGGCGG CGGGAAGAAT TCGCCGGCCA CAACTGGACA GGCGAGGTGC CCGATCCCCA GGATGAGGCC ACCTTCCGAC GTTCCCGTCT TAACCACGGC CTCAGCCTTC AGGGACAGCA CCGGGTACTC TGGGAGTTCT ACCGGCAGCT AATCCAGTTA CGCCGGGAAC TACCCTCCCT TACGGAGCTA AACCTGGAAA ATATGGAGGT TATCACCTGC GAGGAGGACC TGGTCCTGTT CGTACGCCGC TGGAGTAGGG ATAGTGAGGT GGGCATCATA TTTTCCTTTA GCAACACCGC AACGGCCCCT ACCCTGCCCC TGCCGGCAGG TCGCTGGCGC AAACGCCTGG ATGCCGCTGA AGAGCGCTGG CTGGGCGATG GTAGTACCAT ACCTGCCCTG CTCGTATCCC AGGGAAAAAT GCAGGTGCCT 
TTAACCCCGG GAGCATGCCT GTTGTTTGAA CGAATAAAGG AGGCTCAGGC TGACTAA
|
Protein sequence | MTYSVSLGAN YLGDGRCRFH LWAPLAESVA VQIVAPEERL EPLTRKERGY FQGEVAGVVP GSLYYYLLDG QQLPDPASRY QPRGVHGPSQ VVDAKAFSWS DRCWPGPKGE DLIFYELHVG TFTPEGTFEA IIAHLDDLRT LGVTAIELMP VAQFPGCRNW GYDGVFPFAV QNSYGGPEGL RRLVDTCHRY GLAVFLDVVY NHLGPEGNYL AKFGPYFTDR YRTPWGQALN FDGPGSDEVR RFFIENALYW LTEFHLDGLR LDAIHAIMDK SALPFLEELA AAVKLQAERL DRRVYIVAES DLNDPRVIRP QELGGYGLDA QWCDDFHHAL HALLTGERNG YYRDFGTLGN LARAFREGYV YTGQYSAYRQ RRHGRRPHPC KGNQFVVFTQ NHDQVGNRAR GERLSTLVPF VKLKLAAAVV LLSPFVPLLF MGEEYGETAP FQYFTSHSDP RLIAAVRRGR REEFAGHNWT GEVPDPQDEA TFRRSRLNHG LSLQGQHRVL WEFYRQLIQL RRELPSLTEL NLENMEVITC EEDLVLFVRR WSRDSEVGII FSFSNTATAP TLPLPAGRWR KRLDAAEERW LGDGSTIPAL LVSQGKMQVP LTPGACLLFE RIKEAQAD
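
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate a fragment, assuming only the Python standard library. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon map is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons). Only the first 30 and last 57 bases from the record are used here, not the full sequence.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; raises KeyError on ambiguous bases."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGACCTATA GTGTTAGCCT GGGCGCCAAT"
last_57 = "TTAACCCCGG GAGCATGCCT GTTGTTTGAA CGAATAAAGG AGGCTCAGGC TGACTAA"

print(translate(first_30))  # MTYSVSLGAN
print(translate(last_57))   # LTPGACLLFERIKEAQAD*
```

The two translated fragments match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon. Running `gc_fraction` on the full gene sequence would be the way to compare against the GC content field.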