Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1145 |
Symbol | |
ID | 4810813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1362344 |
End bp | 1364230 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106567 |
Product | N-6 DNA methylase |
Protein accession | YP_001037570 |
Protein GI | 125973660 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAC TAAAGGATAA AATCAAAAAA CTTGGTTATG AGGAAATAAA GGATATTGAT GACGCTACCT TTATAGCCAG CCATAAAAAT GTATATGTAT ATGTAAAAAA AGTAGATGAA GAACAGTTGA AACCAGAATT GGTTACTGCT ATAACATATG AAGCGATGGC AACAGACCCT ATTTCTACAT ATGCTTGGAT TACAAACGGT ACAAGTAATG CCTATGTCCT TGTTGAAGAG GAAAAGGCTG TTTCTGAAAT TCCATCAGTC TTTGAAGATG AAAACAAATT GTACTCAGGC AAAAGGCAAC TTACTGATAG GGACAAATGG TCAATAAGAA AATATCAAGA GTTACAAGAG AAATTTGATG GACTCCATGA AATGATTTAT GGAATGAAGG ACCATGTAAA TAACTCCAAT GATGTAATCG ATGAATTCAG TAAACTTATT TTTTTGGAGA CCTTCAGGCT TTATCACCCT GAATATAGAT TAACTAAGGG TAATGTAACA GGGAAACTAT TTAACGAAAT ATATAGATAC GAATATGTAG AAAAACATAA GGATAAGGCA GTCCAGGAGA TAAGGGAAGC CTTTAAAGAA ATAAAAGACC ATGCAGATTA TGTTGCTATT TTGGATAATG GGGAAAAGGC AAACATATTT AGTGCAGATG AATATATAAA ACTGGAAAAT CCCAACATCT ACATTGCTGT TTTAAAGGCT CTCCAGGATT TAGGGACAAT AATAATTGAC GGTGTAGAGA GACCTGCCAC TTTAAGGGAT TTGACAGGGG ATGTATTGGG CAGGGTTTTT GATGTACTGC TTCGTGGAAA GTTTGAGAAT AAAGGCGGTA TGGGTATCTA TCTTACCCCG AGACAGGTAA CGGAAGCAGC AGCCGAGATG GTTTTACACG ACCTTACCAA AGACGGGGCA GCAAAACTAA TTGCTAAAGA CCCCAAAACA GGAATACCTA CCCTCCGCAT AGGTGATTTG TGCTGTGGTT CGGGAGGATT TTTAATAAAG ATGCTTCAGA AGATAGAGCA CTACCTTTTG AACAAATTGA CAGGAGACAA GAAGCAGTAT GAAGAACTAT TTGAACAAAT GAAGGAACAC TGTTTTATAG GTGCGGATAA TGCTCCGGGA ATGGTTCTCA AGGCGAGAAT CAATATGGCA CTGCACGGAG CACCTAAGTG CCCTATTTTC CAAACGAGAA ATTCCCTTAT GAATACACGC CTTAAACCAG GGACATTCGA TGCAATCCTT ACAAATCCCC CTTTTTCAAA AACTGGTATT TCAAAAACGA TTAAGAAGGG TAAAACAACA GTAGAAAACC CAGAAGGTGC TGAGATTATC AAATATTATT CTTCTGACAT AGATGAGGAC GGACAAAACA GGATGAGTCC TTATGGCTTA TCCCTCGGGT CAAAGCCAGA CAGCAGAGGT AAGTGGAAAG AAGTAAATTC GGTAGACCCA GCAGTGTTAT TTATTGATAG AAATCTGCAA CTGTTAAAAC CAGGCGGGCT ACTCATGATA GTTGTACCAG ATGGAATTCT TTCAAACTCA GGAGATAAAT ATGTACGTGA ATACATCATG GGTAAAAAGA ACCCTGTTAC AGGTGAATTT GAAGGTGGAA AAGCAATATT AAAAGCAGTT ATAAGTCTTC CGCAGGTAAC CTTTGCCCTT TCAGGTGCAG GTGCAAAAAC GTCGCTGCTA TATTTAAAGA AGAAAGAACA TCCAGGAGAA AAACAGGGTC CTGTATTTAT GGCAGTAGCA GATGAAGTGG GATTTACTGT AAAACAGAAT GTAGAGGTAC AGTTAGGTGA TGACCATAAC GACTTATTAA AGATTGTGGA GGCTTATAAG AAGGGTATGC CAGAGGATGT AGAATAG
|
Protein sequence | MQELKDKIKK LGYEEIKDID DATFIASHKN VYVYVKKVDE EQLKPELVTA ITYEAMATDP ISTYAWITNG TSNAYVLVEE EKAVSEIPSV FEDENKLYSG KRQLTDRDKW SIRKYQELQE KFDGLHEMIY GMKDHVNNSN DVIDEFSKLI FLETFRLYHP EYRLTKGNVT GKLFNEIYRY EYVEKHKDKA VQEIREAFKE IKDHADYVAI LDNGEKANIF SADEYIKLEN PNIYIAVLKA LQDLGTIIID GVERPATLRD LTGDVLGRVF DVLLRGKFEN KGGMGIYLTP RQVTEAAAEM VLHDLTKDGA AKLIAKDPKT GIPTLRIGDL CCGSGGFLIK MLQKIEHYLL NKLTGDKKQY EELFEQMKEH CFIGADNAPG MVLKARINMA LHGAPKCPIF QTRNSLMNTR LKPGTFDAIL TNPPFSKTGI SKTIKKGKTT VENPEGAEII KYYSSDIDED GQNRMSPYGL SLGSKPDSRG KWKEVNSVDP AVLFIDRNLQ LLKPGGLLMI VVPDGILSNS GDKYVREYIM GKKNPVTGEF EGGKAILKAV ISLPQVTFAL SGAGAKTSLL YLKKKEHPGE KQGPVFMAVA DEVGFTVKQN VEVQLGDDHN DLLKIVEAYK KGMPEDVE
|
| |