Gene Cthe_1751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1751 
Symbol 
ID4810181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2070864 
End bp2072225 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content45% 
IMG OID640107164 
ProductRNA methyltransferase 
Protein accessionYP_001038165 
Protein GI125974255 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR00479] 23S rRNA (uracil-5-)-methyltransferase RumA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00216476 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATTTGG TAAAGAAAAA TGAGGTTTAT ACTATTGACA TTACCGGAAT GACCCATGAA 
GGGCAGGGAG TCGGAAGGAT AGACAATTTC ACGGTTTTTG TGGACGGCCC GGTTGAAGGG
GAAAAGGTGG AGATAAAGAT TATAAAGGTG AAAAAGAGTT ATGCCATTGG CAAACTCCTT
CAAGTTTTGG AGCCTTCCTC CGACAGGACG GAGCCTTTTT GCCCTTCGTA CAAAAGGTGT
GGAGGATGCA GTCTTCAGCA CATGAATTAT GAAGCAAGTC TTAGATTCAA GACCAACGTT
GTAAGGGAAA GCATCAGAAG AATCGGCGGG CTGGAGAATG TCGTGGTACA TGATACCGTT
GGGATGGAGC AGCCGGTAAA CTATAGGAAC AAAGCCCAAT ATCCGATAGG AAGGTATAAG
GATGGAATCA GGGCGGGATT TTACGCAAAA AGATCGCATG AGATTATTGA CTGTGCCACC
TGTACCATTC AGCATTCTGT AAGTGACAGG GTAAGGGCTG TTGTCAAGGA GTATATAGAG
AAAAACAATA TAAGTACATA TGATGAAATT ACCGGAGAAG GCTTGGTCCG GCACATTATG
ACCCGTGTGG GGTTTAAGAC CGGCGAGGTA ATGGTGGTTC TGGTCATTAA CGGAAGAAAC
ATCCCCAGTC AGAAAAAGCT GGTGGACAGG CTGGTAAAAG AGATTCCGGA GATAAAGAGT
GTTGTTTTGA ATGTTAATAC AAAGAGGACC AATGTTATTT TGGGTGATGA GAATATAGTC
GTCTATGGAA GGGATACCAT TACTGATTAT ATAGGAGAGT TTAAGTTTAA TATATCGCCG
CTGTCTTTTT TTCAGGTGAA TCCTGTTCAA ACGGAAGTGC TGTATCGAAA GGCGTTGGAT
TATGCGCAGC TTACAGGCAA GGAGACTGTT TTTGATTTGT ATTGCGGTAT AGGCACTATT
TCGCTGTTTT TGTCAAAGAA GGCGAAAAGG ATTTATGGTG TTGAAGTTGT TGAGGCGGCA
GTGAAGGATG CGTGGGAAAA TGCCAAGGTT AACGGAGTGG AAAATGCGGA GTTTATAGTT
GGAGAAGCGG AAAAGGTAAT ACCCGAGATG TATGAAAAGG GCGTGCGCGC GGATGTTGTG
GTGGTTGACC CTCCAAGGAA GGGATGCGAT GAGGCAGTGC TGAAGACTTT GGTGGATATG
AAGCCGGAGA GGATTGTTTA TGTGTCCTGC AATCCGGCGA CACTGGCAAG GGATTTGAAG
TACTTGGCGG AAGGCGGGTT TGAGGTTCGG GAGGTTCAAC CTGTGGACAT GTTCCCGTGG
ACGTATCACG TGGAGTGCGT TGCGTTGATA GAGAAGAAAT AG
 
Protein sequence
MDLVKKNEVY TIDITGMTHE GQGVGRIDNF TVFVDGPVEG EKVEIKIIKV KKSYAIGKLL 
QVLEPSSDRT EPFCPSYKRC GGCSLQHMNY EASLRFKTNV VRESIRRIGG LENVVVHDTV
GMEQPVNYRN KAQYPIGRYK DGIRAGFYAK RSHEIIDCAT CTIQHSVSDR VRAVVKEYIE
KNNISTYDEI TGEGLVRHIM TRVGFKTGEV MVVLVINGRN IPSQKKLVDR LVKEIPEIKS
VVLNVNTKRT NVILGDENIV VYGRDTITDY IGEFKFNISP LSFFQVNPVQ TEVLYRKALD
YAQLTGKETV FDLYCGIGTI SLFLSKKAKR IYGVEVVEAA VKDAWENAKV NGVENAEFIV
GEAEKVIPEM YEKGVRADVV VVDPPRKGCD EAVLKTLVDM KPERIVYVSC NPATLARDLK
YLAEGGFEVR EVQPVDMFPW TYHVECVALI EKK