Gene Cthe_2785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2785 
Symbol 
ID4810102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3284989 
End bp3286107 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content44% 
IMG OID640108205 
ProductMtaA/CmuA family methyltransferase 
Protein accessionYP_001039177 
Protein GI125975267 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01463] methyltransferase, MtaA/CmuA family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.271038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTCGTG ACCAGATGAC TCCCAAAGAA AGGATGGAAG CATACAATAA AGGACAGCCG 
ATAGATAGGC TTCCTTGTGT CCCGATTGTG GGAAACACTG CGGCAAGGGT AATAAATGTA
AAAGTATCGG AATTTAAAGG CAACGGCAAA TTGCTTGCCA AGGCCCATGT TGCTGCTTAC
AGGATGTTTG GGTACGACAT CATAAGGGTT TTTACGGACT TGTATGTTCA GGCTGAGGCA
ATGGGGGCCA AAGTGCATTA TCCCTATGAT CAAACGGCAT ACCTTGAAGC TCCTGCTATA
AATGATGTTT CGGAGATTGA TTCATTGGAG CCTGCAGACC CGTACAAAGA CGGTAATCTG
CCCCATCATC TTGAGGCAAT GAAAATAGTT GCGGAAGAAG TGGGAGATGA AGTAACAGTG
GCGGGAGCTG TCACGTGTCC TTTTACCAAT GCATCTTTCC TGATAGGTGC TGAGAACCTT
GTAAGGTTGA CCCTTAAAGA CCCTGAAAAA GTACATAGAT TGTGTGAGAT ATCTTTGGAA
ACCAGTTTGA GATATGCAAA GGCAATTATT GACGCGGGAT GCACGCCCAG CCTTACGGAC
CCCATGTCTT CAAATACTGT GATAAGTCCA AAACAATTTA AGGAGTTTTC GTTCCCGTAT
TTAAAAAGAC TGATTGACTA TATTCATTCA AGGGGCAAAA GCGTAACTTT GCACATCTGC
GGAAAGACAA ATAAAATATG GGAACTGATG GCCGAAGCCG GTGCCGACTG CATAAGTATC
GACAATGATG CCAGCCTTCT TGAAGCAAAG CAGAAAATAG GCCACAAGGT AAGATTGATG
GGAAATGTGA AACCTTCCGA GACCATGCTT CAGGGAACGG TTTCCGACGT TAAAAAAGCC
GTTTTTGAGT GCGTACGTCA GGCTTATGAC AACCCGAAAG GCTATATTGT GGCTTCAGGA
TGCAGCCTTC CCACAGACAC ACCTTTTAGT AATATTCATG CAATGATGGA TGCGGTAAGA
GAAATCGGAT ATCCTCCCAA TGAAGATTTG TTTAATTACA TGATTTACAA AGGGCATTTT
CCGTCGCAGT ACGACTCTTA TTGCTCCGAT TATATTTAG
 
Protein sequence
MLRDQMTPKE RMEAYNKGQP IDRLPCVPIV GNTAARVINV KVSEFKGNGK LLAKAHVAAY 
RMFGYDIIRV FTDLYVQAEA MGAKVHYPYD QTAYLEAPAI NDVSEIDSLE PADPYKDGNL
PHHLEAMKIV AEEVGDEVTV AGAVTCPFTN ASFLIGAENL VRLTLKDPEK VHRLCEISLE
TSLRYAKAII DAGCTPSLTD PMSSNTVISP KQFKEFSFPY LKRLIDYIHS RGKSVTLHIC
GKTNKIWELM AEAGADCISI DNDASLLEAK QKIGHKVRLM GNVKPSETML QGTVSDVKKA
VFECVRQAYD NPKGYIVASG CSLPTDTPFS NIHAMMDAVR EIGYPPNEDL FNYMIYKGHF
PSQYDSYCSD YI