Gene Cthe_1749 details

Gene Information

Locus tag: Cthe_1749
Symbol:
ID: 4810179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2069230
End bp: 2070282
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 36%
IMG OID: 640107162
Product: DNA-cytosine methyltransferase
Protein accession: YP_001038163
Protein GI: 125974253
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
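
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database itself; it assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2069230
end_bp = 2070282
gene_length_bp = 1053
protein_length_aa = 350

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon that does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop equals protein length
```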


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000535565
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTG AATTAGGAGA ACTGTTTTGT GGTCCTGGAG GGCTGGCTTA TGGAGCAAAA 
ACAGCAGAAA TAGAAAATAA GGAATATAGA ATAATTCATA AATGGGCAAA TGACTATGAC
AGAGATACTT GTGATACTTA CATACACAAT ATATGTCCGG ATAATCCTGA ATCGGTTATA
TGTCAGGATG TAAGGAAGTT GAACATAGAT TCTTTGCAAC CCATAGATGC CCTTGCATTT
GGTTTTCCAT GCAATGACTT TTCGGTTGTA GGCGAACAAA AAGGATTTAA TGGAGAATAT
GGACCATTGT ACACTTATGG GGTAAAAGTA TTGAAAAAGT TCAAACCAAT GTGGTTTCTT
GCTGAAAATG TAGGCGGTTT AAAGTCGGCA AATGATGGTG GAGCGTTTAA AAAGATACTT
AATGATTTAG GAGAGGCTGG ATATAGATTA TATCCTCATT TATATAAGTT TGAAGAATAT
GGTATACCGC AGGCACGTCA TAGAATAATT ATTATAGGTA TTAGAAAAGA TTTACCGTAT
GAATTTAAAG TTCCTTCTCC GGTTCCGTAT AAAGACTTAG ATAATACCTG TAGAACAGCA
TTAGAAGTAC CACCAATTCC GAAAGATGCA CCGAATAATG AATTGACAAG ACAATCAGCA
ATAGTAACAG AGAGATTAAA ATATATAAGA CCTGGAGAAA ATGCTTTCAC GGCAGATTTA
CCTGAGCATC TTCGATTGAA TGTTAAAGGG GCAAAGATAA GTCAAATATA TAAAAGATTA
GATCCTAATA GACCAGCCTA TACAGTAACA GGTTCAGGCG GCGGTGGAAC TCATATATAT
CATTATGCTG AGCCAAGAGC ATTAACTAAT AGGGAAAGAG CAAGACTGCA GACATTTCCA
GATGATTATT TATTTAAGGG TTCAAAGGAA AGTGTAAGAA GGCAGATAGG GATGGCAGTC
CCAGCTAAAG GAGCAAAGAT AATATTTGAA GCTGTATTAA GAACATTTGC AGGTATTGAA
TATGAAAATG TTCCATGTAA TGTTAATGAA TAG
 
Protein sequence
MKFELGELFC GPGGLAYGAK TAEIENKEYR IIHKWANDYD RDTCDTYIHN ICPDNPESVI 
CQDVRKLNID SLQPIDALAF GFPCNDFSVV GEQKGFNGEY GPLYTYGVKV LKKFKPMWFL
AENVGGLKSA NDGGAFKKIL NDLGEAGYRL YPHLYKFEEY GIPQARHRII IIGIRKDLPY
EFKVPSPVPY KDLDNTCRTA LEVPPIPKDA PNNELTRQSA IVTERLKYIR PGENAFTADL
PEHLRLNVKG AKISQIYKRL DPNRPAYTVT GSGGGGTHIY HYAEPRALTN RERARLQTFP
DDYLFKGSKE SVRRQIGMAV PAKGAKIIFE AVLRTFAGIE YENVPCNVNE
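
The sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The sketch below shows one possible way to strip the spacing, translate the DNA, and compute a GC percentage. It is an illustration under stated assumptions rather than the method used by the database: the helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares for elongation codons. Only the first line of the gene sequence is used here to keep the example short.

```python
# Compact codon table: bases in T, C, A, G order map onto the 64-letter string.
# Assumption: standard amino acid assignments, as shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAAATTTG AATTAGGAGA ACTGTTTTGT GGTCCTGGAG GGCTGGCTTA TGGAGCAAAA"
dna = clean_sequence(first_line)
print(translate(dna))
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence through the same helpers should allow the result to be compared with the listed protein sequence and GC content. Codons containing ambiguous bases such as N are not handled and would raise a `KeyError`.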