Gene Information |
Locus tag | Moth_0103 |
Symbol | |
ID | 3831993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 99766 |
End bp | 101808 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828037 |
Product | serine phosphatase |
Protein accession | YP_428985 |
Protein GI | 83588976 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR02865] stage II sporulation protein E |
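The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Moth_0103.
length_bp = gene_length(99766, 101808)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2043 680
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" rows of the record.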
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGTC AAACCCGTGC CCACTCTATG CGACAGCAAA CCGGGGTTTT GGCAGGGAGC GAGTTTTTTC TCGATGTTTT GCTTCTGGGG GCAGTATTTT TCCTTGTTCG GGCGGCCCTT CTGGAGGAAC TCTTTCCCTT CGGTCCGGCG GTAATAATTG CTGTCGGTGC CAGGCGGCGC CTCCTCTGGC CGGCGGTAGT GGTAGCAGCC GTCAGCGCCT GGCTGGCCGG CCTGCCCCAG GTATACAGCC GCCTGCTGAT TTTCCTGTTC CTGGGCCTGG CCTGCACCCT TTATCCTTCC CTCAACCAGC GCGGTCCCCT TACCCGGGCT ACCCTGGCAC CGGTAGCCAT TACCCTCGTA AGGGGCCTGG GCCTGACCCT CTGGCAACCC TCCTTTTATG GCTGGGTGCA GGTGATCTTT GAAGCTCTTC TGGCCTGGGG ACTGAGCCTG GGGTTGCTGG AAACCGCCAC GGCCAGGCGC CAGGAGGAGC GTCTGCTGGG CGGGGGGCTC TTCCTCCTGG GGCTCCTCCT GGGTTTACAG GGGTGGCAGG TATCCGGCCT TTCCATCCAG GGGATCATCA GCCGTTATAT CCTGCTCCTG GCAGCCCTGG CCGGGGGGGC AGGAACAGGA GCTGCCGCCG GGGCGGCTGT AGGTTTCCTA CCCAGCCTTT CAGCCCTGAT TACACCCTCC CTGGCGGGGT TAATGGCCTT CACCGGCCTG GTGGCCGGTT CTTTAAAGAA CCTGGGCAAA CCAGGAGTGA TTAGCGGCTT TTTGCTGGCC CACCTGGTAC TGGCCAACTA CTTCCTAGGC AGTGACGGTG TCCAGGCCGC CCTGAGGGAA AGCGGCCTGG CTGTCCTGTT CCTGGTGGCT ACACCGCCCC TCCTGGTTTA TTACTTTCGG GAATTCCTGG CGGTACCGGT GGCCGCCCAG CAGGCGCCGG CAGATACCAG CCCCCGGGAC AACCTCAAAG TTGCCCTGAA AAGCCTGGCT CAAAACCTCA AATTCCATGG TTTCAACGAA AGTCCCCTGG AGACCGTTCG TCAGGTGGCC AGGGCTGCCT GCCGCGGGTG CCCGGCAGGT AAGGTTTGCT GGGAGCTTGA AGGGGAACAG ATGCTCAATA CTTTACAGGA GCTATTGCAC CGGGGCAGCC AGGGCCCCCT GACCTTGGCC GCCCTCCCCG AATGGCTGGC TTCGCGTTGC AATCGCGGCC GTGAACTCCT GGCCGCCCTG ACCACCAGGG CCGGCAAGGG GCAACCCCAG CCCTTGGAGG ATGGCCTGAC CAACTGGCTG GCCAGTATTT TTGATAACCT GGCGGTCATG GTCGAAAACA GGGGGATCAA AAAAGAAAAT TCAGCGGGCA GCGGCACCGG TCAACCGGCC TTGAAGATCA GCGTTGGTAT GGCTGCTACC CCGCGCCACC GGGCGGAGGT CACTGGCGAC GCCTTCGTCG CAGCTACCCT GGAACCAGGC AGGCAACTGT TAATTCTGGG CGACGGTATG GGCGCCGGGC GGGAAGCGGC AGACGCCAGC GGCACAGCCC TGGAGTTGCT CCAGGATTTA CTGGCCGCCG GATTTAGCCC CGAGCTCGCC CTGCGGACAG TCAACATGGT GCTACTGCTA AGAACGACCC GCGAGAACTT TACCACCATT GACCTGGCCA TGGTCAATTG CCACAATGGC CAGACTGAAT TCTATAAGCT GGGGGCCTGT CCCAGCTTTA TCCAGGGGAA GGATGGCGTC AAGATCTTGC GGAGCCACTC CCTGCCGGTA GGTATCCTTG AAGACCTCCA GGTAGAAGCC 
CTCAAGGAAG AACTGCAGGA AGGAGACCTG CTGGTAATGG TCAGCGACGG CGTCCTGGAG GCCCACCGCG ATTTAAATGA AAAGGAAAAG TGGGTGAGTA AAGCCCTGCA GCGAGCCGGG GATGCCCGGC CCCAGGAGAT TGCCGATCGT CTGTTGAAAC AGGCCCGGGC CCTGGCCGGC GGCAATCCTC CCGATGACAT GACGGTGGTA GTTGCCAGGG TGGAGAAGGC CGCCAGCGCA TAA
Protein sequence | MSGQTRAHSM RQQTGVLAGS EFFLDVLLLG AVFFLVRAAL LEELFPFGPA VIIAVGARRR LLWPAVVVAA VSAWLAGLPQ VYSRLLIFLF LGLACTLYPS LNQRGPLTRA TLAPVAITLV RGLGLTLWQP SFYGWVQVIF EALLAWGLSL GLLETATARR QEERLLGGGL FLLGLLLGLQ GWQVSGLSIQ GIISRYILLL AALAGGAGTG AAAGAAVGFL PSLSALITPS LAGLMAFTGL VAGSLKNLGK PGVISGFLLA HLVLANYFLG SDGVQAALRE SGLAVLFLVA TPPLLVYYFR EFLAVPVAAQ QAPADTSPRD NLKVALKSLA QNLKFHGFNE SPLETVRQVA RAACRGCPAG KVCWELEGEQ MLNTLQELLH RGSQGPLTLA ALPEWLASRC NRGRELLAAL TTRAGKGQPQ PLEDGLTNWL ASIFDNLAVM VENRGIKKEN SAGSGTGQPA LKISVGMAAT PRHRAEVTGD AFVAATLEPG RQLLILGDGM GAGREAADAS GTALELLQDL LAAGFSPELA LRTVNMVLLL RTTRENFTTI DLAMVNCHNG QTEFYKLGAC PSFIQGKDGV KILRSHSLPV GILEDLQVEA LKEELQEGDL LVMVSDGVLE AHRDLNEKEK WVSKALQRAG DARPQEIADR LLKQARALAG GNPPDDMTVV VARVEKAASA
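The sequences above are printed in blocks of ten characters separated by spaces, and the gene sequence wraps onto a second line, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the whitespace, compute GC content, and check that a CDS looks complete. The helper names are hypothetical, and treating ATG as the only start codon is a simplification, since translation table 11 also permits alternative starts.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG,
    and it ends in one of the table 11 stop codons (TAA, TAG, TGA)."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in ("TAA", "TAG", "TGA")
    )


# Toy input for illustration only: the first three codons of the record
# followed by a stop codon, not the full gene.
toy = "ATGTCCGGT TAA"
print(len(clean_sequence(toy)), looks_like_complete_cds(toy))
print(round(gc_content(toy), 2))
```

Pasting the full "Gene sequence" field into `gc_content` would allow a comparison with the "GC content" row, and `len(clean_sequence(...))` with the "Gene Length" row.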