Gene Cthe_0783 details

Gene Information

Locus tag: Cthe_0783
Symbol:
ID: 4810401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 946308
End bp: 947750
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 41%
IMG OID: 640106200
Product: RNA modification protein
Protein accession: YP_001037211
Protein GI: 125973301
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0621] 2-methylthioadenine synthetase
TIGRFAM ID: [TIGR00089] RNA modification enzyme, MiaB family
            [TIGR01574] tRNA-N(6)-(isopentenyl)adenosine-37 thiotransferase enzyme MiaB
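
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 946308
END_BP = 947750


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.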


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGTAAGG GTACAAGGGA CAGGAAAAAT ATTTATGTTT CTCCGGAGGA AATGGCAAGA 
CAGCAGCGGT TTATTGACGA GATTAAAGAA TTAAACTATC GGAAGGAAGT AAAGACCGGA
AAGAAGAAGC TTTATTGTCT TAATACTTTC GGTTGCCAGA TGAATGAACA TGATTCGGAG
AAACTTGCGG GAATGTTGGC TGAAATGGGA TATGCTGAAA CGGATAACGT AAACGAAAGC
GATTTGGTTA TTTACAATAC ATGCTGCGTA AGGGAAAATG CCGAGCTTAA GGTATACGGG
CATCTTGGAA TGTTAAAGCC CCTTAAAAAT CAAAAACCGG ATCTTGTGAT CGCTGTATGC
GGTTGTATGA TGCAGCAGCC GGAAGTTGTG GAGCATATAA AGAAGACATA CAGTCATGTT
GACCTGATAT TTGGAACGCA CAACCTGTAT AAGTTTCCTG AGCTTTTGTA CAGTGCGATG
GATTCTCAGA CAACTGTTGT TGATGTCTGG GATTGCGACG GCCAAATAGC TGAAAATGTG
GCAATTGAGA GAAAAGACGG GGTGAAGGCC TGGGTTACGG TAATGTACGG CTGCAATAAT
TTTTGCACCT ATTGTATTGT TCCTTACGTA CGAGGCAGGG AAAGAAGCAG ATCAATGGAT
GACATTCTTG AAGAAGTAAG GATGTTAGGA CGTCAAGGGT TTAAGGAGAT AACACTTCTG
GGGCAGAATG TAAACTCTTA CGGAAAAGAC ATTGGAGACG GTACAAGTTT TGCCGAGTTG
ATACGTGAGG TTAACAAGAT ACCCGGGATT GAAAGAATCA GGTTTACCAC ATCCCATCCG
AAAGATTTGT CCGATGATTT GATTTATGCC ATGAGAGACT GTGAAAAGGT ATGTGAACAT
TTGCATCTTC CGTTTCAGGC GGGAAGCACC AGAATACTGA AATTGATGAA CAGAAAGTAT
ACCAAGGAGG ATTATATTAA TCTTGTAGCA AAGATTAAGG AAAATATACC GGATATTGCA
CTTACCACTG ATATTATCGT GGGATTTCCC GGTGAGACGG AGGAAGATTT CTCAGACACA
CTGGATATTC TTGAAAAAGT CAGATTTGAC AACGCATATA CTTTCCTGTA TTCAAAGAGA
ACCGGTACGC CTGCGGCCAA AATGGAGGAT CAGGTTCCGG AAGAAGTGAA GAAGGAAAGA
TTCCAGAGAC TTCTTGAAAC GCAGAACAGG ATAAGCAAGG AAATAAATGA CACTTTTTTG
GGCAAAGTGG TTGAAGTTCT TGTTGAGGGT GTCAGCAAGA CAAATGATAA GATTTTTACA
GGAAGGACAA GGGGAAACAA AGTTGTTAAT TTTGAGGCTG ATGCAAGTTT GATAGGTAAG
TTGGTGAATG TAAGAATAAA TACTGTAAAA ACTTGGTCGC TGGAGGGCAG CATAGTAAGG
TGA
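
The gene sequence is displayed in space-separated blocks of 10 bases. A reported GC content can be recomputed once that grouping is stripped. This is a small sketch under the assumption that GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G/C bases; spaces and line breaks in the input are ignored."""
    seq = "".join(grouped_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two 10-base blocks of the gene sequence, used only as a demonstration.
print(gc_fraction("TTGAGTAAGG GTACAAGGGA"))
```

Passing the full gene sequence instead of the two demonstration blocks gives a value to compare with the GC content field in the Gene Information section.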
 
Protein sequence
MSKGTRDRKN IYVSPEEMAR QQRFIDEIKE LNYRKEVKTG KKKLYCLNTF GCQMNEHDSE 
KLAGMLAEMG YAETDNVNES DLVIYNTCCV RENAELKVYG HLGMLKPLKN QKPDLVIAVC
GCMMQQPEVV EHIKKTYSHV DLIFGTHNLY KFPELLYSAM DSQTTVVDVW DCDGQIAENV
AIERKDGVKA WVTVMYGCNN FCTYCIVPYV RGRERSRSMD DILEEVRMLG RQGFKEITLL
GQNVNSYGKD IGDGTSFAEL IREVNKIPGI ERIRFTTSHP KDLSDDLIYA MRDCEKVCEH
LHLPFQAGST RILKLMNRKY TKEDYINLVA KIKENIPDIA LTTDIIVGFP GETEEDFSDT
LDILEKVRFD NAYTFLYSKR TGTPAAKMED QVPEEVKKER FQRLLETQNR ISKEINDTFL
GKVVEVLVEG VSKTNDKIFT GRTRGNKVVN FEADASLIGK LVNVRINTVK TWSLEGSIVR
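
The gene sequence starts with TTG, yet the protein sequence starts with M. This is consistent with translation table 11, where TTG is among the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates that relationship. It assumes the codon assignments of table 11 match the standard code and that the input begins at the start codon; `translate_cds` is an illustrative name, not a tool used to produce this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence begins at the start codon, which is read as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGAGTAAGG GTACAAGGGA CAGGAAAAAT ATTTATGTTT CTCCGGAGGA AATGGCAAGA"
print(translate_cds(first_line))
```

Translating the first line of the gene sequence this way reproduces the first 20 residues of the protein sequence.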