Gene Cthe_2320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2320 
Symbol 
ID4809248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2766647 
End bp2767651 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content37% 
IMG OID640107727 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001038715 
Protein GI125974805 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA TTGACTTAAA TAAAATAAGA AACATCAGAA AATACACTGT TGCGTCACTA 
TTTGCTGGTG CTGGCGGTCT AGACATGGGA TTAGAATTAG CAGGTTTCAA AACCGTTTGG
GCTAATGATA TTGATAAAGA TGCTTGTGCT ACATACCGTC TATGGAGTCA GGCAGATGTT
GTGCAAGGAG ATATTGCAAA AATTGATTAC TCTGATGTTC CAGATACAGA CGTGATAACG
GGTGGTTTTC CATGTCAAGG CTTTTCGTTG GCAGGACCAC GAAAAATTAA TGATGAAAGA
AATAAATTGT ATCGTTACTT TGTTAAATTA GTTGAATTAA AGCAACCTTA TGCTTTTATT
GCAGAAAATG TAAAAGGCAT CCTTACTCTT GGCGATGGCG AAATTATTGA AGCCATAATA
GAGGATTTTG CGAGCAAAGG ATATGATGTA TATCCTAATC TTGTTAATGC CGCTGACTAT
GGAGTTCCGC AAGACAGATG GCGTGTTATT CTATACGGAT TTAGAAAAGA TTTAGAAGTA
AAAGATTTTA AATTTCCTGA GCCGTTCCCA TATAAAGTTA CCTTGAGAGA GGCTATCGGG
GATATGCCTG AACCAAAACA GAGTGATATA TGCCATGCTT CTTATTCAAG TAGGTATATG
TCGAGAAACC GAAAAAGAGG TTGGGACGAA GTAAGTTATA CAATTCCAGC AATGGCGAAG
CAAGTTCCGT TGCACCCATC TTCTCCAGAC ATGATTAAGA TTGCTGAAGA TAAGTGGATA
TTTGGTGAAG GGAAAACAAG AAGATTTAGT TGGCAAGAAG CAGCAGTTAT ACAAACTTTC
CCTCGTGATA TGGAATTCGT AGGAAATCTA ACTTCTAAGT ATCGACAGAT AGGAAATGCT
GTCCCGGTAA AACTTGCAGA AGTAATAGGT AAGAAATTAT ATGAAATACT TGAAGAGAAG
TTAAATGCAA ATGTTAAAGA ATTAGAACAA AAAGTTGGTG AATGA
 
Protein sequence
MKIIDLNKIR NIRKYTVASL FAGAGGLDMG LELAGFKTVW ANDIDKDACA TYRLWSQADV 
VQGDIAKIDY SDVPDTDVIT GGFPCQGFSL AGPRKINDER NKLYRYFVKL VELKQPYAFI
AENVKGILTL GDGEIIEAII EDFASKGYDV YPNLVNAADY GVPQDRWRVI LYGFRKDLEV
KDFKFPEPFP YKVTLREAIG DMPEPKQSDI CHASYSSRYM SRNRKRGWDE VSYTIPAMAK
QVPLHPSSPD MIKIAEDKWI FGEGKTRRFS WQEAAVIQTF PRDMEFVGNL TSKYRQIGNA
VPVKLAEVIG KKLYEILEEK LNANVKELEQ KVGE