Gene Cthe_3160 details

Gene Information

Locus tag: Cthe_3160
Symbol:
ID: 4809610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3733544
End bp: 3734914
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 44%
IMG OID: 640108593
Product: NOL1/NOP2/sun family RNA methylase
Protein accession: YP_001039548
Protein GI: 125975638
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
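
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the unspliced CDS ends in a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3733544
end_bp = 3734914
gene_length_bp = 1371
protein_length_aa = 456

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the 1371 bp span gives 457 codons, that is, 456 residues plus one stop codon, which agrees with the listed protein length.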


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTTC CTGAAGAGTT TTTAAGAAAG ATGGAAGGAC TTTTTGATGC CGGAGAATTT 
GAGGAATTTT TAAAATCCTA CGATATGCCA AGATTCTACG GACTTCGGGT AAACACACTT
AAAATCGGAG TGGAGGAGTT TAAAAAGCTT TCACCCTTTG AGCTTGAACC AATACCGTGG
ACAAAGGACG GTTTTTATTA TAATGAAGGG GAAAATCCGG GAAAGCATCC GTATTATCAT
GCCGGACTTT ATTATATTCA GGAACCCAGT GCCATGCTTC CGGGAGCTGT TATAAATGCC
GAAGAAGGGG ATTATGTACT GGACCTTTGT GCCGCCCCCG GAGGAAAAAC GGTGCAAATG
GCGGCCGGCA TGAAGGGGAA AGGCCTTTTG ATTGCCAATG ACATAAGCTC TGACAGGGTA
AAAGCTCTGG TGAAGAACAT TGAGCTTTGC GGTATAACCA ACGCCATAGT TACCAATGAA
AGTCCTGACA GGCTTGCCAA AAAACTTTGC GCATTTTTCG ACAGGATACT TGTGGATGCT
CCCTGTTCCG GCGAAGGAAT GTTCAGAAAA GACGAGGATG CCGCAAAGAG CTGGGGCAAG
TTCAAATGTG ACAAATGCTG TGCCATGCAG CGGGAGATTC TCGAAAGTGC CGATGTGATG
CTAAAGCCGG GGGGATATTT GGTCTACTCC ACATGTACTT TTTCTCCTGA GGAAAACGAG
GGAATGATTT CCGAATTTTT AAGCAGGCAT AAAAACTATG ATATATTGGA AATACCTAAA
GCATACGGTA TTGATAACGG ACGGCCCGAA TGGTGGGACA ACAACAAGGA ACTTTTGAAA
ACCGCAAGAA TTTGGCCTCA CAAGGTAAGA GGAGAGGGAC ATTTTGTTGC CCTTCTTAAG
AAAAAGGGCG ACAGAACTGT CAATGAAAAA AGGAGAAAAA ACGCGGACTC CAATGTAATT
AAGCTTATGG AGCCGTTTTA TAAATTTGCC GGGGAAAACT TGAATATAAA TATAGACGGT
TTTTTCACAG TCAAGGGAAA TAATTTATAC TGCCTTCCCG AAGAACCACC GGACCTTTCG
GGCATAAAAG TGGCAAAATT TGGGTGGTAT CTGGGGGAAA TAGCAAAGGG CAGGTTTGAA
CCGTCCCATT CTTTTGCTCT TTCCTTAAAA AAGGAAGATA TCAGGAAAAC GTTAAACTTC
AGCGCGGATT CGGTTGAGGT GTTAAAATAC TTAAAAGGTG AAACCCTTAT GATAGAAGGA
GAACCGGGAT ATACCGGCAT TTTGGTTGAC GGATATACGT TAGGCTGGGC AAAGCAGACC
GGTGATATGC TAAAGAACTT GTATCCAAAG GGCTGGAGGA AAATGCAGTA G
 
Protein sequence
MKLPEEFLRK MEGLFDAGEF EEFLKSYDMP RFYGLRVNTL KIGVEEFKKL SPFELEPIPW 
TKDGFYYNEG ENPGKHPYYH AGLYYIQEPS AMLPGAVINA EEGDYVLDLC AAPGGKTVQM
AAGMKGKGLL IANDISSDRV KALVKNIELC GITNAIVTNE SPDRLAKKLC AFFDRILVDA
PCSGEGMFRK DEDAAKSWGK FKCDKCCAMQ REILESADVM LKPGGYLVYS TCTFSPEENE
GMISEFLSRH KNYDILEIPK AYGIDNGRPE WWDNNKELLK TARIWPHKVR GEGHFVALLK
KKGDRTVNEK RRKNADSNVI KLMEPFYKFA GENLNINIDG FFTVKGNNLY CLPEEPPDLS
GIKVAKFGWY LGEIAKGRFE PSHSFALSLK KEDIRKTLNF SADSVEVLKY LKGETLMIEG
EPGYTGILVD GYTLGWAKQT GDMLKNLYPK GWRKMQ
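
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the tool used to produce this record. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard genetic code, and it ignores the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments with the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAAGCTTC CTGAAGAGTT TTTAAGAAAG ATGGAAGGAC TTTTTGATGC CGGAGAATTT"
print(translate(first_row))  # MKLPEEFLRKMEGLFDAGEF
```

The output corresponds to the first two ten-residue groups of the protein sequence. Passing the complete gene sequence to the same function would be expected to reproduce the full protein under the stated assumptions.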