Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2026 |
Symbol | |
ID | 3831401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2114823 |
End bp | 2115965 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829955 |
Product | xylose isomerase-like TIM barrel |
Protein accession | YP_430865 |
Protein GI | 83590856 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4952] Predicted sugar isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGATT TAAGCTACCA GGCCACCCGC CGCTCGCCGG AAGAGTTGAT CAGGCACTTG CAGAACTTTG AGCTCAACCT CCGCTTCTCA GCCGGAATTT GGTTTTTTTC CGGCAGCAAC AGCCGCTTCC ATATCCGCTA CGGCCAAGAA ATGAGCATTG AGGAACGCTT GGAGAAATTC GCCAGCCTGA AGGAGTACGG GCTGGAGGGT ATCGAGGCCC ATTACCCCAA TGAGATTAAC GAACATAACC TGCCGCTTTA TAAGGATTTT TGCCGGGACA CCGGCATGAA GGTGGTCACC GTTGTACCCA ACCTTTTCTA TGAAGAACAG TACTGCTACG GTTCTTTATC GTCGCCCCTT CCGGCTGCCA GGCAGTCCGC CATCCAGCGC GTTAAAGAAA CGCTAGAAAT AAATAAGGAA CTGGGGACAG AGTTCATGGT TGTCTGGCCG GGTATCGACG GTTACGAAAA TCCTTTCGGT ATCGATTTCA GCGAGATGCG CCGTCGGTTC GCCGCCGGCC TGGCAGAAGC TATGGACGCC GTACCGGGAA TTAGGGTGGC GATGGAGCCC AAGCCCTACG AACCCCGGGG CAGAATAATC TATGGCACCA CACCGGAAGG AATTCTCTTG GCGGAAAAAG TAGAAGGATT GTTACAAAAT CCTGTTAACA AAGAACTGCT GCAGCAGGGT TATACCCTGC TGGGACTCAA CCCGGAAATC GGCCACGTCA TCATGGGTTA TGAAGACCTG CCCTATGCCT TGAGCCTGCC CCTGGAATAC GGTCGCCTGG TGCATACCCA TTGGAACAGC CAGCCCCTGG GTAATTACGA TCAGGACCTA AACGTCGGTG TAATCGCGCC CGAACAGGCC GAGGCCGCCC TTTACGTCCT GAAGATGCAC GGTTATCAGG GGTGGTTCGG TCTGGATATC AATCCGGAGC GGATGCCGGT GGAAAGGGCG GTTATTAATA GCATGGATGC CATCCGGGCC ATGAACGACC GCATCAACAA TCTCGATCAC GAGCTGGTAG TTGCCTGCGT CCAGGACCCG GAACGTTATG CCGGTTACTT AGAAGCTCTC CTTATCCGGG CCCGGGCTAG AAACCCTGAG ATCCTCAGCC CCTTAAGCCT CCAAGGAGTA TAA
|
Protein sequence | MQDLSYQATR RSPEELIRHL QNFELNLRFS AGIWFFSGSN SRFHIRYGQE MSIEERLEKF ASLKEYGLEG IEAHYPNEIN EHNLPLYKDF CRDTGMKVVT VVPNLFYEEQ YCYGSLSSPL PAARQSAIQR VKETLEINKE LGTEFMVVWP GIDGYENPFG IDFSEMRRRF AAGLAEAMDA VPGIRVAMEP KPYEPRGRII YGTTPEGILL AEKVEGLLQN PVNKELLQQG YTLLGLNPEI GHVIMGYEDL PYALSLPLEY GRLVHTHWNS QPLGNYDQDL NVGVIAPEQA EAALYVLKMH GYQGWFGLDI NPERMPVERA VINSMDAIRA MNDRINNLDH ELVVACVQDP ERYAGYLEAL LIRARARNPE ILSPLSLQGV
|
| |