Gene Information |
Locus tag | Cthe_3160 |
Symbol | |
ID | 4809610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3733544 |
End bp | 3734914 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108593 |
Product | NOL1/NOP2/sun family RNA methylase |
Protein accession | YP_001039548 |
Protein GI | 125975638 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase |
|
|
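The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3733544
END_BP = 3734914
GENE_LENGTH_BP = 1371
PROTEIN_LENGTH_AA = 456


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3734914 - 3733544 + 1 gives 1371 bp, and 1371 / 3 - 1 gives 456 aa, which agrees with the Gene Length and Protein Length fields.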
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTC CTGAAGAGTT TTTAAGAAAG ATGGAAGGAC TTTTTGATGC CGGAGAATTT GAGGAATTTT TAAAATCCTA CGATATGCCA AGATTCTACG GACTTCGGGT AAACACACTT AAAATCGGAG TGGAGGAGTT TAAAAAGCTT TCACCCTTTG AGCTTGAACC AATACCGTGG ACAAAGGACG GTTTTTATTA TAATGAAGGG GAAAATCCGG GAAAGCATCC GTATTATCAT GCCGGACTTT ATTATATTCA GGAACCCAGT GCCATGCTTC CGGGAGCTGT TATAAATGCC GAAGAAGGGG ATTATGTACT GGACCTTTGT GCCGCCCCCG GAGGAAAAAC GGTGCAAATG GCGGCCGGCA TGAAGGGGAA AGGCCTTTTG ATTGCCAATG ACATAAGCTC TGACAGGGTA AAAGCTCTGG TGAAGAACAT TGAGCTTTGC GGTATAACCA ACGCCATAGT TACCAATGAA AGTCCTGACA GGCTTGCCAA AAAACTTTGC GCATTTTTCG ACAGGATACT TGTGGATGCT CCCTGTTCCG GCGAAGGAAT GTTCAGAAAA GACGAGGATG CCGCAAAGAG CTGGGGCAAG TTCAAATGTG ACAAATGCTG TGCCATGCAG CGGGAGATTC TCGAAAGTGC CGATGTGATG CTAAAGCCGG GGGGATATTT GGTCTACTCC ACATGTACTT TTTCTCCTGA GGAAAACGAG GGAATGATTT CCGAATTTTT AAGCAGGCAT AAAAACTATG ATATATTGGA AATACCTAAA GCATACGGTA TTGATAACGG ACGGCCCGAA TGGTGGGACA ACAACAAGGA ACTTTTGAAA ACCGCAAGAA TTTGGCCTCA CAAGGTAAGA GGAGAGGGAC ATTTTGTTGC CCTTCTTAAG AAAAAGGGCG ACAGAACTGT CAATGAAAAA AGGAGAAAAA ACGCGGACTC CAATGTAATT AAGCTTATGG AGCCGTTTTA TAAATTTGCC GGGGAAAACT TGAATATAAA TATAGACGGT TTTTTCACAG TCAAGGGAAA TAATTTATAC TGCCTTCCCG AAGAACCACC GGACCTTTCG GGCATAAAAG TGGCAAAATT TGGGTGGTAT CTGGGGGAAA TAGCAAAGGG CAGGTTTGAA CCGTCCCATT CTTTTGCTCT TTCCTTAAAA AAGGAAGATA TCAGGAAAAC GTTAAACTTC AGCGCGGATT CGGTTGAGGT GTTAAAATAC TTAAAAGGTG AAACCCTTAT GATAGAAGGA GAACCGGGAT ATACCGGCAT TTTGGTTGAC GGATATACGT TAGGCTGGGC AAAGCAGACC GGTGATATGC TAAAGAACTT GTATCCAAAG GGCTGGAGGA AAATGCAGTA G
|
Protein sequence | MKLPEEFLRK MEGLFDAGEF EEFLKSYDMP RFYGLRVNTL KIGVEEFKKL SPFELEPIPW TKDGFYYNEG ENPGKHPYYH AGLYYIQEPS AMLPGAVINA EEGDYVLDLC AAPGGKTVQM AAGMKGKGLL IANDISSDRV KALVKNIELC GITNAIVTNE SPDRLAKKLC AFFDRILVDA PCSGEGMFRK DEDAAKSWGK FKCDKCCAMQ REILESADVM LKPGGYLVYS TCTFSPEENE GMISEFLSRH KNYDILEIPK AYGIDNGRPE WWDNNKELLK TARIWPHKVR GEGHFVALLK KKGDRTVNEK RRKNADSNVI KLMEPFYKFA GENLNINIDG FFTVKGNNLY CLPEEPPDLS GIKVAKFGWY LGEIAKGRFE PSHSFALSLK KEDIRKTLNF SADSVEVLKY LKGETLMIEG EPGYTGILVD GYTLGWAKQT GDMLKNLYPK GWRKMQ
|
| |
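The gene and protein sequences are shown in space-separated blocks of ten characters. A minimal sketch of how such a record might be cleaned up, translated, and checked for GC content follows. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for elongation codons; alternative start codons are not handled, which is acceptable here because this gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence from the record.
    first_blocks = "ATGAAGCTTC CTGAAGAGTT TTTAAGAAAG"
    print(translate(first_blocks))
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field would give an end-to-end consistency check, and `gc_fraction` can be compared against the GC content field in the same way.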