Gene Information |
Locus tag | Moth_1810 |
Symbol | |
ID | 3830728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1865957 |
End bp | 1868392 |
Gene Length | 2436 bp |
Protein Length | 811 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829737 |
Product | glycoside hydrolase family protein |
Protein accession | YP_430653 |
Protein GI | 83590644 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
|
|
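The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; both assumptions agree with the values listed, but the record itself does not state them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, when has_stop is True,
    that the final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - (1 if has_stop else 0)


# Values taken from the Gene Information table for Moth_1810.
length_bp = gene_length(1865957, 1868392)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the table's coordinates this gives 2436 bp and 811 aa, matching the Gene Length and Protein Length fields.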
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGGT ACATTTGCAT CCACGGCCAT TTTTACCAGC CGCCACGGGA AAACCCCTGG CTGGAGGACA TCGAGCTCCA GGACTCGGCT TACCCCTACC ACGACTGGGA TGAACGGATT ACCGCTGAAT GCTACGAACC TAATACCGCC TCACGCATTC TTGATGGTGA CGGGTGGATC AGGAAAATCG TCAACAACTA TAGCAAAATC AGTTTCAACT TTGGTCCCAC CCTCCTTTCC TGGATGGAAA CCAATGCCCC GGAGGTTTAC CGGGCGATCA TTGAAGCCGA CCGGGAAAGC CAGCGGCGCT TCTCGGGACA CGGTTCCGCC CTGGCCCAGG CCTATAACCA CATGATCATG CCCCTGGCCA ACACGCGTGA TAAATACACC CAGGTAATCT GGGGGATTAA AGATTTTGAG CATCGCTTTG GCCGGCGGCC GGAAGGAATG TGGTTACCTG AGACCGCCGT TGATCTGGAA ACCCTCGCCA TCCTGGCCCA GCAGGGCATC CGTTTTACCA TCCTGGCCCA CTGGCAGGCC CGCCGCATCC GTCCCCTGGG TAGCGATAAC TGGCAGGAGG TCCACGGCGG CCTAGATACC ACCATGCCTT ATCAGGTCCG CCTGCCCAAT ACGGACCGGA CCATCAATGT CTTCTTTTAT AATGGCGAAG TCGCCCGCGC CGTAGCCTTC GAGAAACTGC TCGATAACGG CGAACGCTTT GCCAAACGTC TACTGAGTGT CTTTAACGAG GGACGCAACG CGCCCCAGCT TGTCAATATT GCTACTGACG GGGAGACCTA CGGCCACCAC CACCGCCACG GGGATATGGC CCTGGCCTAC GCCCTGGATT ATATTGAAGC CAATAAACTG GCACGTCTAA CCAACTATGG AGAATACCTG GAAAAACACC CGCCCACCCA TGAGGTGGAG ATAAATAATA ATAGCTCCTG GAGCTGTGCC CATGGAGTGG AAAGGTGGCG AACTAATTGT GGCTGTAACA CGGGTATGCA TCCAGGGTGG AGCCAGGCCT GGCGGGCACC CCTGCGCGAT AGCCTCAACT GGCTGCGGAA CACCCTGGCA CCCAAGTTTG AGGGAAGGGC GCGCCAGTTT CTAAAAGATC CCTGGGCCGC GCGTAACGAC TACATCGCGG TCATCCTCGA CCGTTCACCG GAAAACTTCG ACCGCTTCCT CGGACAACAT GCCACCCGTA TACTAAACCA GGAAGAAAAA ATTACGGTGT TGAAGCTGTT GGAGCTCCAG CGACACGCCA TGTTAATGTT CACCAGCTGC GGCTGGTTCT TTGATGAGAT CTCAGGCATT GAGACAGTGC AGGTTATTAA ATACGCCGGC CGTGTGATTC AATTAGCTCA GGAACTATTT AACGAGTCGC CAGAGCCCCG TTTTATGGAG ATGCTGGCCC AGGCCAAGAG CAACATCCCT GAACACCGTG ACGGAGCCCA TATCTATGAA AAATTCGTCA AGCCGGCAAT GGTCGACTTG CTTAAGGTAG GTGCCCATTA TGCCCTGTGT TCCCTGTACG AAACCTATGA CCAGCATTCC CGCATCTTTT GCTACGATGT CTACCGTGAA GACTACCAGA ATCGTTTAGC AGGTACAGCC AGGTTAGCTG TAGGCCGGGC GCAGGTCACC TCCCAGATTA CCCAGGAGTC GATCAAGATT AGTTATGGCG CCGTTACTTT AGGTAACCAT AATGTTAGTG GCGGTGTCCA GGTTTACACC AATGACGATT CCTACCAGCG AATGGTACAG GAATTAACCG GTGCCTTCGA CCGGGCCGAT 
TTCAATGAGG TCATCAGGCT CCTGGATCAA CACTTTGCAG GGGCCACCTA TTCCTTAAGG CAGCTTTTCC GGGACAAACA GCGGATGGTA CTGGATATCA TCCTGGAGTC CACCCTGGCG GAAGCGGCGG AAGATTACCG GCGCATTTAC GATCGTCATG CTCCTTTAAT GCGGTTCTTA AAGGATTTAA ATATCCCCCA GCCCCGGGCC CTGCAAGCCG CTGCCGAGTT CGTCTTGAAC ACCAGCCTCC GCCAGGCTTT CGCAGGTGAC AATCTGGACC TGGAACACAT AAAAGCGCTC TTGGGGGAAG CCGAGATGGC CGGTGTTCCC CTCGACGGCG AAGGTCTGGG GTATGTCCTG GAACAGACCC TGAAACAGAT GGCTGAGAAA CTGCTAGGCC AACCTGACGA CCAGGTCTTC ATCGGGCGTT TGGACGCGGT GATCAGCCTG GTGCGCTCGT TACCCTTTGA AGTAAACCTC TGGAAAGTCC AAAACGCTTA TTACCGTCTG TTGCAGACAG TTTACCCCGG ATACCGGGAG AAAGCCCGGC AGGGGGATGG GGAAGCCCGG GCGTGGCTTG ACCTGTTCAA CTCCCTGGGT GACAAACTGC AGGTACGGAG AGGCAAAAAT GGCTAG
|
Protein sequence | MERYICIHGH FYQPPRENPW LEDIELQDSA YPYHDWDERI TAECYEPNTA SRILDGDGWI RKIVNNYSKI SFNFGPTLLS WMETNAPEVY RAIIEADRES QRRFSGHGSA LAQAYNHMIM PLANTRDKYT QVIWGIKDFE HRFGRRPEGM WLPETAVDLE TLAILAQQGI RFTILAHWQA RRIRPLGSDN WQEVHGGLDT TMPYQVRLPN TDRTINVFFY NGEVARAVAF EKLLDNGERF AKRLLSVFNE GRNAPQLVNI ATDGETYGHH HRHGDMALAY ALDYIEANKL ARLTNYGEYL EKHPPTHEVE INNNSSWSCA HGVERWRTNC GCNTGMHPGW SQAWRAPLRD SLNWLRNTLA PKFEGRARQF LKDPWAARND YIAVILDRSP ENFDRFLGQH ATRILNQEEK ITVLKLLELQ RHAMLMFTSC GWFFDEISGI ETVQVIKYAG RVIQLAQELF NESPEPRFME MLAQAKSNIP EHRDGAHIYE KFVKPAMVDL LKVGAHYALC SLYETYDQHS RIFCYDVYRE DYQNRLAGTA RLAVGRAQVT SQITQESIKI SYGAVTLGNH NVSGGVQVYT NDDSYQRMVQ ELTGAFDRAD FNEVIRLLDQ HFAGATYSLR QLFRDKQRMV LDIILESTLA EAAEDYRRIY DRHAPLMRFL KDLNIPQPRA LQAAAEFVLN TSLRQAFAGD NLDLEHIKAL LGEAEMAGVP LDGEGLGYVL EQTLKQMAEK LLGQPDDQVF IGRLDAVISL VRSLPFEVNL WKVQNAYYRL LQTVYPGYRE KARQGDGEAR AWLDLFNSLG DKLQVRRGKN G
|
| |
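The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the spaces in the listing are only 10-character display grouping, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used, not the full record.

```python
from itertools import product

# Codon-to-amino-acid mapping in the usual compact TCAG order.
# Translation table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon; '*' marks a stop."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three display blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGGAACGGT ACATTTGCAT CCACGGCCAT"))

# Final six codons of the gene sequence, ending in the stop codon TAG.
print(translate("AGAGGCAAAAATGGCTAG"))
```

The first call yields `MERYICIHGH`, the first block of the protein sequence; the second yields `RGKNG*`, matching the end of the protein followed by the stop codon.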