Gene Cthe_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1145 
Symbol 
ID4810813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1362344 
End bp1364230 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content38% 
IMG OID640106567 
ProductN-6 DNA methylase 
Protein accessionYP_001037570 
Protein GI125973660 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAAC TAAAGGATAA AATCAAAAAA CTTGGTTATG AGGAAATAAA GGATATTGAT 
GACGCTACCT TTATAGCCAG CCATAAAAAT GTATATGTAT ATGTAAAAAA AGTAGATGAA
GAACAGTTGA AACCAGAATT GGTTACTGCT ATAACATATG AAGCGATGGC AACAGACCCT
ATTTCTACAT ATGCTTGGAT TACAAACGGT ACAAGTAATG CCTATGTCCT TGTTGAAGAG
GAAAAGGCTG TTTCTGAAAT TCCATCAGTC TTTGAAGATG AAAACAAATT GTACTCAGGC
AAAAGGCAAC TTACTGATAG GGACAAATGG TCAATAAGAA AATATCAAGA GTTACAAGAG
AAATTTGATG GACTCCATGA AATGATTTAT GGAATGAAGG ACCATGTAAA TAACTCCAAT
GATGTAATCG ATGAATTCAG TAAACTTATT TTTTTGGAGA CCTTCAGGCT TTATCACCCT
GAATATAGAT TAACTAAGGG TAATGTAACA GGGAAACTAT TTAACGAAAT ATATAGATAC
GAATATGTAG AAAAACATAA GGATAAGGCA GTCCAGGAGA TAAGGGAAGC CTTTAAAGAA
ATAAAAGACC ATGCAGATTA TGTTGCTATT TTGGATAATG GGGAAAAGGC AAACATATTT
AGTGCAGATG AATATATAAA ACTGGAAAAT CCCAACATCT ACATTGCTGT TTTAAAGGCT
CTCCAGGATT TAGGGACAAT AATAATTGAC GGTGTAGAGA GACCTGCCAC TTTAAGGGAT
TTGACAGGGG ATGTATTGGG CAGGGTTTTT GATGTACTGC TTCGTGGAAA GTTTGAGAAT
AAAGGCGGTA TGGGTATCTA TCTTACCCCG AGACAGGTAA CGGAAGCAGC AGCCGAGATG
GTTTTACACG ACCTTACCAA AGACGGGGCA GCAAAACTAA TTGCTAAAGA CCCCAAAACA
GGAATACCTA CCCTCCGCAT AGGTGATTTG TGCTGTGGTT CGGGAGGATT TTTAATAAAG
ATGCTTCAGA AGATAGAGCA CTACCTTTTG AACAAATTGA CAGGAGACAA GAAGCAGTAT
GAAGAACTAT TTGAACAAAT GAAGGAACAC TGTTTTATAG GTGCGGATAA TGCTCCGGGA
ATGGTTCTCA AGGCGAGAAT CAATATGGCA CTGCACGGAG CACCTAAGTG CCCTATTTTC
CAAACGAGAA ATTCCCTTAT GAATACACGC CTTAAACCAG GGACATTCGA TGCAATCCTT
ACAAATCCCC CTTTTTCAAA AACTGGTATT TCAAAAACGA TTAAGAAGGG TAAAACAACA
GTAGAAAACC CAGAAGGTGC TGAGATTATC AAATATTATT CTTCTGACAT AGATGAGGAC
GGACAAAACA GGATGAGTCC TTATGGCTTA TCCCTCGGGT CAAAGCCAGA CAGCAGAGGT
AAGTGGAAAG AAGTAAATTC GGTAGACCCA GCAGTGTTAT TTATTGATAG AAATCTGCAA
CTGTTAAAAC CAGGCGGGCT ACTCATGATA GTTGTACCAG ATGGAATTCT TTCAAACTCA
GGAGATAAAT ATGTACGTGA ATACATCATG GGTAAAAAGA ACCCTGTTAC AGGTGAATTT
GAAGGTGGAA AAGCAATATT AAAAGCAGTT ATAAGTCTTC CGCAGGTAAC CTTTGCCCTT
TCAGGTGCAG GTGCAAAAAC GTCGCTGCTA TATTTAAAGA AGAAAGAACA TCCAGGAGAA
AAACAGGGTC CTGTATTTAT GGCAGTAGCA GATGAAGTGG GATTTACTGT AAAACAGAAT
GTAGAGGTAC AGTTAGGTGA TGACCATAAC GACTTATTAA AGATTGTGGA GGCTTATAAG
AAGGGTATGC CAGAGGATGT AGAATAG
 
Protein sequence
MQELKDKIKK LGYEEIKDID DATFIASHKN VYVYVKKVDE EQLKPELVTA ITYEAMATDP 
ISTYAWITNG TSNAYVLVEE EKAVSEIPSV FEDENKLYSG KRQLTDRDKW SIRKYQELQE
KFDGLHEMIY GMKDHVNNSN DVIDEFSKLI FLETFRLYHP EYRLTKGNVT GKLFNEIYRY
EYVEKHKDKA VQEIREAFKE IKDHADYVAI LDNGEKANIF SADEYIKLEN PNIYIAVLKA
LQDLGTIIID GVERPATLRD LTGDVLGRVF DVLLRGKFEN KGGMGIYLTP RQVTEAAAEM
VLHDLTKDGA AKLIAKDPKT GIPTLRIGDL CCGSGGFLIK MLQKIEHYLL NKLTGDKKQY
EELFEQMKEH CFIGADNAPG MVLKARINMA LHGAPKCPIF QTRNSLMNTR LKPGTFDAIL
TNPPFSKTGI SKTIKKGKTT VENPEGAEII KYYSSDIDED GQNRMSPYGL SLGSKPDSRG
KWKEVNSVDP AVLFIDRNLQ LLKPGGLLMI VVPDGILSNS GDKYVREYIM GKKNPVTGEF
EGGKAILKAV ISLPQVTFAL SGAGAKTSLL YLKKKEHPGE KQGPVFMAVA DEVGFTVKQN
VEVQLGDDHN DLLKIVEAYK KGMPEDVE