Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0900 |
Symbol | |
ID | 3831442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 935847 |
End bp | 937220 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828831 |
Product | sun protein |
Protein accession | YP_429760 |
Protein GI | 83589751 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGTAAAA TCAAGCCCGC CGTTTCGGCC CGGGAGGCGG CATTGCAGGT AATTTACCGG GTAACTGAGG AAGGGGCCTA TGCCGGCCTG GCTCTGGATG AGGTGTTGAA GTCCGCCGGC CTGGATGGCC GCGAGAGGGC CCTGGCTACT GAACTGGCTT ATAGCGCCAT TAAGGCCTGG GGAACCCTGG ATTGGGCTCT GGGCCTTTTT TTACGGCAAC CCTTGGAGAA ACTGCCGCCC TGGATTCGCT GTGTTCTGCG CCTGGGAGCC ACCCAGTTGC TGTATGTGCC CCGGATACCG CCCCGGGCGG CCATTTATGA AACAGTGGAG CTGGCCAAAA AGTACGGTCA CCGGGGTACG ACGGGCCTGG TCAACGGTGT CCTGCGCCAC CTGGACCGGC AAAAGGACGC CCTGCCCTAT CCCGATTGGA AAACCGACCC GGCCGGCTAC CTGGCCCTGC GCTATTATCA CCCTCGCTGG CTGGTAGAAC GCTGGCTGGA AGAGTTCGGG TACCAGGAGA CCGAATATCT CTGCCGGGCG GATAATGAAC CCCCTCCCAC AATAGCCCGG GTCAACACCC TGAAGACGAG AAAAGATGTA CTGGCCGCGC GCCTCCAGGC GGAGGGGGCG ACCGTCAGGC CGGCCCGTTA CGCCCCGGAA GGGCTGGTGG TCGAGGGGCT GGGAGCGCTA GAGGCCAGTC CCTCCTTCCA GGAGGGGTTG TTTTATGTCC AGGACGAGGG TTCCCAGCTG GTCAGCCATG CCCTGCACCC GGACTCTGGT GCCTGGGTAA TCGATGCCAG CGCCGCACCG GGCGGTAAGA CAACCCATCT GGCCCAGCTG ATGGCCGATC GGGGGACGAT TCTGGCCTGC GATGTTCACC GGGGGAGGTT GGATTTGATC GCCGCCAACT GCCGTCGCCT GGGGGTTACC TGCGTTCGCA CCGTCCTGGT AGATGCCCGG GAACTGGGGG AACGCTACCC GGCGGCTGCA GATTACCTCC TAATTGATGC CCCCTGCTCC GGGCTGGGGG TATTGCGGCG GCGGCCCGAC GCCCGCTGGC GGAAAGAAGC CCCCCGCACC CGGGAGCTGG CCCGGCTACA ACTGGCCATT CTGATGGGAG CCAGGCAGGC CCTGAAACCG GGAGGTGTCC TGGTTTACAG TACCTGCACC CTGCTGCCGG AAGAAAACCA GGAGGTGGTA CGGGAGTTTC TGGAACGGGC GGGGGAATTC AGACCGGACT CCCTGGAGCC TTGGTTGCCG GTCCTGCCAC CGGACCTGAT GGTCACCGCC CGCCAGGGCT GGGTCCAGTT TTTGCCCCAG CGTCACGGGA CGGACGGCTT TTTTATCGCC AGGATAAAAA AGCTAGAAAA ATAA
|
Protein sequence | MGKIKPAVSA REAALQVIYR VTEEGAYAGL ALDEVLKSAG LDGRERALAT ELAYSAIKAW GTLDWALGLF LRQPLEKLPP WIRCVLRLGA TQLLYVPRIP PRAAIYETVE LAKKYGHRGT TGLVNGVLRH LDRQKDALPY PDWKTDPAGY LALRYYHPRW LVERWLEEFG YQETEYLCRA DNEPPPTIAR VNTLKTRKDV LAARLQAEGA TVRPARYAPE GLVVEGLGAL EASPSFQEGL FYVQDEGSQL VSHALHPDSG AWVIDASAAP GGKTTHLAQL MADRGTILAC DVHRGRLDLI AANCRRLGVT CVRTVLVDAR ELGERYPAAA DYLLIDAPCS GLGVLRRRPD ARWRKEAPRT RELARLQLAI LMGARQALKP GGVLVYSTCT LLPEENQEVV REFLERAGEF RPDSLEPWLP VLPPDLMVTA RQGWVQFLPQ RHGTDGFFIA RIKKLEK
|
| |