Gene Cthe_1728 details


Gene Information

Locus tag: Cthe_1728
Symbol:
ID: 4810158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2049221
End bp: 2050672
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 39%
IMG OID: 640107141
Product: DNA-cytosine methyltransferase
Protein accession: YP_001038142
Protein GI: 125974232
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
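
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2049221
end_bp = 2050672

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1452
print(protein_length)  # 483
```

Under these assumptions the computed values agree with the listed gene length (1452 bp) and protein length (483 aa).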


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.770689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTCC TTGATTTCTT TGCAGGAATA GGCGGCTTTC GATTGGGGTT AGAACTTGCA 
GGACATAAAT GTATAGGATT TTGTGAAAAA GATAAATTTG CAGTTAAAAG CTATAGAGCA
ATGTTTGATA CGGAAGGAGA GTGGTATGCA GATGACGTTA CAAAACTTAA AAGCGAAGAT
ATCCCATATG CAGACATCTG GTGTTTTGGA TTCCCATGTC AAGACATCTC AGTTGCAGGA
AAACAGCGAG GAATCAGAGG AAAAAGAAGC GGCTTATATT TTAGAATTAT TGACCTTATC
AAAGGCAAAG AAGAAAAAGA TAAACCCTCA TACCTTCTTA TTGAGAACGT TAAAAACCTG
CTGTCAATTA ATAACGGATG GGACTTTGCA GCCGTTCTCT CTGAGCTGGA TGAAGCAGGG
TATGATGCTT TCTGGCAGGT GCTTAACTCT AAAGATTTCG GAGTACCCCA AAACAGAGAG
CGTGTGTTCA TTACTGCAAA TCTTAGAAGC AGAGGCAGAC GAGAAATACT ACCTGTCAGA
GGAGAAAACA CAGCAGCTCT TAAGCAGATT ATAGGCGGCT GTCAAGGTGA GCGAGTTTAT
GATGCAGAAG GAGTAGCTTG CACACTTACA GGTTGTGGTG GAGGTGGAGG AGCAAAGACT
GGTCTTTACT GCGTAGGAAA TATAAATCCA AGCGGCAAAG GTATGAATGG AAATGTTTAT
TCATCAGAAG GAATTGCTCC AGCAGTTACA ACTAATAAGG GCGAAGGAAG CAAAATTTTT
ATAGACCAGT CTTACACAAA GCCAAAGCTC ACAGATACAT CAAGATGTAT TACATCACGT
TATACAGCAG GAGTGGTTAA TAGAACTGCA ATGAATAGCG GAGTTTTAGA GAGTGTTCCA
ATTGAGTTTA ATAGAAATGA TGGTATTCTT GATGAAATAA AAATAGCACA TACAATCAGT
GCCAGTGATT GGCGAGGACT AAACAGAAAT CAAACACAAA ATGCTGTTCT TGAAGCAAGG
GCTGTTATAA CTCCTGAAAG AGAAAATAAA CGTCAGAATG GAAGGAGATT TAAAAATGCA
GATGAACCAA TGTTTACTTT AACAAGTCAG GACAGACATG GAGTTGCAGT TAAAGAAGCC
ACAAAAAAAG GCTATGCTGA AGCTGAGATT GGAGACAGCA TAAATATTTC AGTACCTAAT
TCAAAAACAA GAAGAGGCAG GGTTGGAAAA GGTATATCAA ATACTTTAGA TACAGGATGC
CAAATGGCTA CATTGGACAA AAATTACCGT ATTAGAAGGC TTACACCAAA GGAATGCTTC
AGACTTCAAG GCTTCCCAGA TGAATTATTT GAAAAAGCAA GGGCTGTAAA TTCAGATGCT
CAGCTATATA AGCAGGCAGG TAATGCAGTA ACTGTAAATG TAGCTTTTGC AGTAGCACAA
TCTATTAAAT AA
 
Protein sequence
MTFLDFFAGI GGFRLGLELA GHKCIGFCEK DKFAVKSYRA MFDTEGEWYA DDVTKLKSED 
IPYADIWCFG FPCQDISVAG KQRGIRGKRS GLYFRIIDLI KGKEEKDKPS YLLIENVKNL
LSINNGWDFA AVLSELDEAG YDAFWQVLNS KDFGVPQNRE RVFITANLRS RGRREILPVR
GENTAALKQI IGGCQGERVY DAEGVACTLT GCGGGGGAKT GLYCVGNINP SGKGMNGNVY
SSEGIAPAVT TNKGEGSKIF IDQSYTKPKL TDTSRCITSR YTAGVVNRTA MNSGVLESVP
IEFNRNDGIL DEIKIAHTIS ASDWRGLNRN QTQNAVLEAR AVITPERENK RQNGRRFKNA
DEPMFTLTSQ DRHGVAVKEA TKKGYAEAEI GDSINISVPN SKTRRGRVGK GISNTLDTGC
QMATLDKNYR IRRLTPKECF RLQGFPDELF EKARAVNSDA QLYKQAGNAV TVNVAFAVAQ
SIK
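
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which this sketch does not model), and it is not the pipeline used to produce this record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGACCTTCC TTGATTTCTT TGCAGGAATA"))  # MTFLDFFAGI
print(translate("TCTATTAAAT AA"))                      # SIK
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.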