Gene Cthe_2786 details

Gene Information

Locus tag: Cthe_2786
Symbol:
ID: 4810103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3286144
End bp: 3287211
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 43%
IMG OID: 640108206
Product: methylcobalamin:coenzyme M methyltransferase
Protein accession: YP_001039178
Protein GI: 125975268
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01463] methyltransferase, MtaA/CmuA family
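
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence listed in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3286144
END_BP = 3287211
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison printed False, the likely causes would be a different coordinate convention or a CDS without a terminal stop codon, rather than an error in the record itself.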


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGCA TAAGTCCCAA GGAAAGGATA TTGCGTGTGC TAAAAAAACA AAAAGTTGAC 
CGGCCGCCTG TAATTTGCCC CGGAGGCATG ATGAATGCTG CGATTGTTGA TATTATGAAG
ACTACCGGAC ATACCCTTCC GGAAGCACAT CACGATGACA GGCTTATGGC GGAGCTTTCC
CGGGATGTGC ATAAATACAC GGGCTTTGAG AACTTTGGCA TTCCCTTTTG CATGACCGTT
GAAGCAGAGG TGCTGGGCAG TAGCATAAAT TTTGGAACAC TGGCCTGCGA ACCAAAAATC
GAAAAAGAAG CCTTTGATTC GGTATCGAAT GTAGTGTATC AGGATATCGG CAAAATGCTT
AAAAAAGGCA GGATAGAATC TGTCATTCAG GCAGCCTGGC ACCTGTCCAA AAAAAATAAG
GATATACCTG TTGTTGGGAA TTTGACAGGG CCTTTGAGCA CTTCGGCGTC TATAGTAGAT
CCTGTGACCT TTCTTAAGGA GCTTAGAAAA GACAATGAAA ATGCCCATAG GGTAATAAAT
TATGTCACGG ACTTTTTAAT CGAATATGCA AAGCTTATGA TTGAGAACGG TGTGGACTTG
ATATCCATAG GAGATCCTAC TGCAACGGGC GAAATTTTAG GGCCGAAAAT GTTTGAAGAA
TATGCCGTAA GATACCTGAA CAAGCTGGTT GACGGGATAC ATTCCTTAAA TGCCCCGGTA
ATTGTACATA TTTGCGGAAA TATAAACACG GTAAAGCGCT TTATTCCCCA AATCAGGTCT
GACGCAATCA GCACCGATGC GATGATAAAT CTTCGGGCGC TGAAAGATGA GTTTCCTTAT
TTAACGACAA TGGGCAATTT AAGCACTTTT CTTCTTCAAT TCGGAACTCC TGAAAAAGTG
GCAGACCAGA CGCAGCGCCT GCTGAGGGAC GGAATAGATA TAATATCTCC GGCGTGCGGC
CTAAGTACAA CGTCTTCAAT TCAAAATATA AAGGCTCTGA CCAAAACTGT AAAGGAGCAT
GGAGAGTATG CCAGAAGTAG TGTTTTACCC GCAAAACAAG TCCATTAA
 
Protein sequence
MNSISPKERI LRVLKKQKVD RPPVICPGGM MNAAIVDIMK TTGHTLPEAH HDDRLMAELS 
RDVHKYTGFE NFGIPFCMTV EAEVLGSSIN FGTLACEPKI EKEAFDSVSN VVYQDIGKML
KKGRIESVIQ AAWHLSKKNK DIPVVGNLTG PLSTSASIVD PVTFLKELRK DNENAHRVIN
YVTDFLIEYA KLMIENGVDL ISIGDPTATG EILGPKMFEE YAVRYLNKLV DGIHSLNAPV
IVHICGNINT VKRFIPQIRS DAISTDAMIN LRALKDEFPY LTTMGNLSTF LLQFGTPEKV
ADQTQRLLRD GIDIISPACG LSTTSSIQNI KALTKTVKEH GEYARSSVLP AKQVH