Gene Moth_0900 details

Gene Information

Locus tag: Moth_0900
Symbol:
ID: 3831442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 935847
End bp: 937220
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 63%
IMG OID: 637828831
Product: sun protein
Protein accession: YP_429760
Protein GI: 83589751
COG category: [J] Translation, ribosomal structure and biogenesis
              [K] Transcription
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
        [COG0781] Transcription termination factor
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
            [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
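The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state but which is consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 935847
end_bp = 937220
gene_length_bp = 1374
protein_length_aa = 457

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The final codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span, 3)

print(span)        # 1374
print(remainder)   # 0
print(codons - 1)  # 457
```

Under that assumption the three fields agree: 1374 bp is 458 codons, and dropping the stop codon leaves 457 residues.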


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGTAAAA TCAAGCCCGC CGTTTCGGCC CGGGAGGCGG CATTGCAGGT AATTTACCGG 
GTAACTGAGG AAGGGGCCTA TGCCGGCCTG GCTCTGGATG AGGTGTTGAA GTCCGCCGGC
CTGGATGGCC GCGAGAGGGC CCTGGCTACT GAACTGGCTT ATAGCGCCAT TAAGGCCTGG
GGAACCCTGG ATTGGGCTCT GGGCCTTTTT TTACGGCAAC CCTTGGAGAA ACTGCCGCCC
TGGATTCGCT GTGTTCTGCG CCTGGGAGCC ACCCAGTTGC TGTATGTGCC CCGGATACCG
CCCCGGGCGG CCATTTATGA AACAGTGGAG CTGGCCAAAA AGTACGGTCA CCGGGGTACG
ACGGGCCTGG TCAACGGTGT CCTGCGCCAC CTGGACCGGC AAAAGGACGC CCTGCCCTAT
CCCGATTGGA AAACCGACCC GGCCGGCTAC CTGGCCCTGC GCTATTATCA CCCTCGCTGG
CTGGTAGAAC GCTGGCTGGA AGAGTTCGGG TACCAGGAGA CCGAATATCT CTGCCGGGCG
GATAATGAAC CCCCTCCCAC AATAGCCCGG GTCAACACCC TGAAGACGAG AAAAGATGTA
CTGGCCGCGC GCCTCCAGGC GGAGGGGGCG ACCGTCAGGC CGGCCCGTTA CGCCCCGGAA
GGGCTGGTGG TCGAGGGGCT GGGAGCGCTA GAGGCCAGTC CCTCCTTCCA GGAGGGGTTG
TTTTATGTCC AGGACGAGGG TTCCCAGCTG GTCAGCCATG CCCTGCACCC GGACTCTGGT
GCCTGGGTAA TCGATGCCAG CGCCGCACCG GGCGGTAAGA CAACCCATCT GGCCCAGCTG
ATGGCCGATC GGGGGACGAT TCTGGCCTGC GATGTTCACC GGGGGAGGTT GGATTTGATC
GCCGCCAACT GCCGTCGCCT GGGGGTTACC TGCGTTCGCA CCGTCCTGGT AGATGCCCGG
GAACTGGGGG AACGCTACCC GGCGGCTGCA GATTACCTCC TAATTGATGC CCCCTGCTCC
GGGCTGGGGG TATTGCGGCG GCGGCCCGAC GCCCGCTGGC GGAAAGAAGC CCCCCGCACC
CGGGAGCTGG CCCGGCTACA ACTGGCCATT CTGATGGGAG CCAGGCAGGC CCTGAAACCG
GGAGGTGTCC TGGTTTACAG TACCTGCACC CTGCTGCCGG AAGAAAACCA GGAGGTGGTA
CGGGAGTTTC TGGAACGGGC GGGGGAATTC AGACCGGACT CCCTGGAGCC TTGGTTGCCG
GTCCTGCCAC CGGACCTGAT GGTCACCGCC CGCCAGGGCT GGGTCCAGTT TTTGCCCCAG
CGTCACGGGA CGGACGGCTT TTTTATCGCC AGGATAAAAA AGCTAGAAAA ATAA
 
Protein sequence
MGKIKPAVSA REAALQVIYR VTEEGAYAGL ALDEVLKSAG LDGRERALAT ELAYSAIKAW 
GTLDWALGLF LRQPLEKLPP WIRCVLRLGA TQLLYVPRIP PRAAIYETVE LAKKYGHRGT
TGLVNGVLRH LDRQKDALPY PDWKTDPAGY LALRYYHPRW LVERWLEEFG YQETEYLCRA
DNEPPPTIAR VNTLKTRKDV LAARLQAEGA TVRPARYAPE GLVVEGLGAL EASPSFQEGL
FYVQDEGSQL VSHALHPDSG AWVIDASAAP GGKTTHLAQL MADRGTILAC DVHRGRLDLI
AANCRRLGVT CVRTVLVDAR ELGERYPAAA DYLLIDAPCS GLGVLRRRPD ARWRKEAPRT
RELARLQLAI LMGARQALKP GGVLVYSTCT LLPEENQEVV REFLERAGEF RPDSLEPWLP
VLPPDLMVTA RQGWVQFLPQ RHGTDGFFIA RIKKLEK
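
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This matches translation table 11, where TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that relationship and of a GC-content calculation, not the database's own method. The helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, and the start-codon set is an assumption based on the NCBI definition of table 11.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in which codons may start a CDS.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiation codons of NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to lay the sequence out in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate codon by codon, reading an initial start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
head = "TTGGGTAAAA TCAAGCCCGC CGTTTCGGCC"
tail = "AGGATAAAAA AGCTAGAAAA ATAA"

print(translate_cds(head))       # MGKIKPAVSA
print(translate_cds(tail))       # RIKKLEK
print(gc_percent("TTGGGTAAAA"))  # 30.0
```

The two translated fragments match the first ten and last seven residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.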