Gene Moth_0103 details

Gene Information

Locus tag: Moth_0103
Symbol:
ID: 3831993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 99766
End bp: 101808
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 62%
IMG OID: 637828037
Product: serine phosphatase
Protein accession: YP_428985
Protein GI: 83588976
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR02865] stage II sporulation protein E
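
The start, end, gene length and protein length fields can be cross-checked with a little arithmetic. The sketch below is not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 99766
end_bp = 101808
gene_length_bp = 2043
protein_length_aa = 680

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)

print(span, codons - 1, remainder)  # 2043 680 0
```

Under these assumptions the four fields agree with each other: 2043 bp is 681 codons, and dropping the stop codon leaves 680 residues.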


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGTC AAACCCGTGC CCACTCTATG CGACAGCAAA CCGGGGTTTT GGCAGGGAGC 
GAGTTTTTTC TCGATGTTTT GCTTCTGGGG GCAGTATTTT TCCTTGTTCG GGCGGCCCTT
CTGGAGGAAC TCTTTCCCTT CGGTCCGGCG GTAATAATTG CTGTCGGTGC CAGGCGGCGC
CTCCTCTGGC CGGCGGTAGT GGTAGCAGCC GTCAGCGCCT GGCTGGCCGG CCTGCCCCAG
GTATACAGCC GCCTGCTGAT TTTCCTGTTC CTGGGCCTGG CCTGCACCCT TTATCCTTCC
CTCAACCAGC GCGGTCCCCT TACCCGGGCT ACCCTGGCAC CGGTAGCCAT TACCCTCGTA
AGGGGCCTGG GCCTGACCCT CTGGCAACCC TCCTTTTATG GCTGGGTGCA GGTGATCTTT
GAAGCTCTTC TGGCCTGGGG ACTGAGCCTG GGGTTGCTGG AAACCGCCAC GGCCAGGCGC
CAGGAGGAGC GTCTGCTGGG CGGGGGGCTC TTCCTCCTGG GGCTCCTCCT GGGTTTACAG
GGGTGGCAGG TATCCGGCCT TTCCATCCAG GGGATCATCA GCCGTTATAT CCTGCTCCTG
GCAGCCCTGG CCGGGGGGGC AGGAACAGGA GCTGCCGCCG GGGCGGCTGT AGGTTTCCTA
CCCAGCCTTT CAGCCCTGAT TACACCCTCC CTGGCGGGGT TAATGGCCTT CACCGGCCTG
GTGGCCGGTT CTTTAAAGAA CCTGGGCAAA CCAGGAGTGA TTAGCGGCTT TTTGCTGGCC
CACCTGGTAC TGGCCAACTA CTTCCTAGGC AGTGACGGTG TCCAGGCCGC CCTGAGGGAA
AGCGGCCTGG CTGTCCTGTT CCTGGTGGCT ACACCGCCCC TCCTGGTTTA TTACTTTCGG
GAATTCCTGG CGGTACCGGT GGCCGCCCAG CAGGCGCCGG CAGATACCAG CCCCCGGGAC
AACCTCAAAG TTGCCCTGAA AAGCCTGGCT CAAAACCTCA AATTCCATGG TTTCAACGAA
AGTCCCCTGG AGACCGTTCG TCAGGTGGCC AGGGCTGCCT GCCGCGGGTG CCCGGCAGGT
AAGGTTTGCT GGGAGCTTGA AGGGGAACAG ATGCTCAATA CTTTACAGGA GCTATTGCAC
CGGGGCAGCC AGGGCCCCCT GACCTTGGCC GCCCTCCCCG AATGGCTGGC TTCGCGTTGC
AATCGCGGCC GTGAACTCCT GGCCGCCCTG ACCACCAGGG CCGGCAAGGG GCAACCCCAG
CCCTTGGAGG ATGGCCTGAC CAACTGGCTG GCCAGTATTT TTGATAACCT GGCGGTCATG
GTCGAAAACA GGGGGATCAA AAAAGAAAAT TCAGCGGGCA GCGGCACCGG TCAACCGGCC
TTGAAGATCA GCGTTGGTAT GGCTGCTACC CCGCGCCACC GGGCGGAGGT CACTGGCGAC
GCCTTCGTCG CAGCTACCCT GGAACCAGGC AGGCAACTGT TAATTCTGGG CGACGGTATG
GGCGCCGGGC GGGAAGCGGC AGACGCCAGC GGCACAGCCC TGGAGTTGCT CCAGGATTTA
CTGGCCGCCG GATTTAGCCC CGAGCTCGCC CTGCGGACAG TCAACATGGT GCTACTGCTA
AGAACGACCC GCGAGAACTT TACCACCATT GACCTGGCCA TGGTCAATTG CCACAATGGC
CAGACTGAAT TCTATAAGCT GGGGGCCTGT CCCAGCTTTA TCCAGGGGAA GGATGGCGTC
AAGATCTTGC GGAGCCACTC CCTGCCGGTA GGTATCCTTG AAGACCTCCA GGTAGAAGCC
CTCAAGGAAG AACTGCAGGA AGGAGACCTG CTGGTAATGG TCAGCGACGG CGTCCTGGAG
GCCCACCGCG ATTTAAATGA AAAGGAAAAG TGGGTGAGTA AAGCCCTGCA GCGAGCCGGG
GATGCCCGGC CCCAGGAGAT TGCCGATCGT CTGTTGAAAC AGGCCCGGGC CCTGGCCGGC
GGCAATCCTC CCGATGACAT GACGGTGGTA GTTGCCAGGG TGGAGAAGGC CGCCAGCGCA
TAA
 
Protein sequence
MSGQTRAHSM RQQTGVLAGS EFFLDVLLLG AVFFLVRAAL LEELFPFGPA VIIAVGARRR 
LLWPAVVVAA VSAWLAGLPQ VYSRLLIFLF LGLACTLYPS LNQRGPLTRA TLAPVAITLV
RGLGLTLWQP SFYGWVQVIF EALLAWGLSL GLLETATARR QEERLLGGGL FLLGLLLGLQ
GWQVSGLSIQ GIISRYILLL AALAGGAGTG AAAGAAVGFL PSLSALITPS LAGLMAFTGL
VAGSLKNLGK PGVISGFLLA HLVLANYFLG SDGVQAALRE SGLAVLFLVA TPPLLVYYFR
EFLAVPVAAQ QAPADTSPRD NLKVALKSLA QNLKFHGFNE SPLETVRQVA RAACRGCPAG
KVCWELEGEQ MLNTLQELLH RGSQGPLTLA ALPEWLASRC NRGRELLAAL TTRAGKGQPQ
PLEDGLTNWL ASIFDNLAVM VENRGIKKEN SAGSGTGQPA LKISVGMAAT PRHRAEVTGD
AFVAATLEPG RQLLILGDGM GAGREAADAS GTALELLQDL LAAGFSPELA LRTVNMVLLL
RTTRENFTTI DLAMVNCHNG QTEFYKLGAC PSFIQGKDGV KILRSHSLPV GILEDLQVEA
LKEELQEGDL LVMVSDGVLE AHRDLNEKEK WVSKALQRAG DARPQEIADR LLKQARALAG
GNPPDDMTVV VARVEKAASA
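
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is an illustration only, not the method used to produce this record. It builds the standard codon table, whose sense-codon assignments translation table 11 shares (table 11 differs in the start codons it permits). The function name `translate` is hypothetical, and only the first three 10-base blocks of the gene sequence are used.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGTCCGGTC AAACCCGTGC CCACTCTATG"
print(translate(first_blocks))  # MSGQTRAHSM
```

The ten residues produced match the first block of the protein sequence listed above.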