Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1729 |
Symbol | |
ID | 4810159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2050674 |
End bp | 2051909 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107142 |
Product | DNA methylase N-4/N-6 |
Protein accession | YP_001038143 |
Protein GI | 125974233 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase [COG1475] Predicted transcriptional regulators |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTTA GGAAGTTAAA AATAGACAGC CTTATACCTG CTAAATATAA TCCGAGAAAA GATTTAAAGC CGGGTGATAA GGAATATGAA AAGATAAAAA ACAGTTTAAC TGAATTTGGA TATGTAGATC CCATTATTGT AAATTCAGAC CTTACAATTA TTGGCGGTCA TCAAAGATGG AAGGTTTTAA AAAGCTTAGG CTATACAGAA GTTGATTGTG TTGTTATTGA TATAGATAAA ACAAAAGAAA AGGCTTTGAA TGTGGCACTT AATAAAATAA GCGGAGAGTG GAATGAAGCA CTTCTTGCTG AGCTTATTAA GGATTTGCAG AGTATAGATT ATGATGTTTC CTTTACAGGT TTTGAACCGC CGGAGATAGA AGAACTGTTT AGCAATGTTC ATGACAAGGA AATAAAAGAA GATGATTTTG ATGTTGAAGA TGCTTTAAAA GAACCTGTAA TTTCAAAGCA GGGAGATTTG TGGCTGCTTG GAAGGCACAG GCTTATTTGC GGGGATAGTA CTAAAGCTGA AACATATGAG GCTTTAATGG ATGGTAAAAA AGCTAATTTA GTGGTTACAG ACCCTCCCTA CAATGTTGCA TATGAAGCAA AAGCCGGAAA GATTCAAAAT GATAACCTTA AAGATGAGGA GTTTTATAAT TTCCTTTATA AGGCGTTCAC TAATATGTAT GATGCTATGG AGAAAGATGC TTCAATTTAT GTATTCCATG CAGATACAGA AGGATTAAAC TTTAGAAAGG CTTTTAAAGC TGTTGGATTT TATTTATCCG GAGTTTGTAT CTGGGCAAAG CAAAGCTTGG TACTGGGCAG AAGTCCTTAT CAGTGGAAAC ATGAACCTGT ACTCTTTGGT TGGAAGAAGG AAGGCAGGCA TAATTGGTAC TCTGATAGAA AACAAAGTAC TATATGGAGC TTTGACAGAC CATCTAAGAA TGCTCTCCAT CCAACAATGA AGCCAGTAGC TCTTTGTGCT TATCCAATTC AAAACAGCAG CATGAGCAAT TGTATTGTTC TTGACCCTTT TGGCGGCAGT GGTTCTACTT TGATTGCCTG TGAGCAGACT AATAGAATCT GCTATACCAT AGAGCTTGAT GAAAAGTATG CGGATGTTAT TGTAAAAAGA TATATAGAGC AGGTTGGTAC AGATGAAGAA GTATTTTTAG TTAGAGATGG AGTTAAAATT AAATATGCTG ATATAAAAAA GGAAGGTTGT GATTAA
|
Protein sequence | MQFRKLKIDS LIPAKYNPRK DLKPGDKEYE KIKNSLTEFG YVDPIIVNSD LTIIGGHQRW KVLKSLGYTE VDCVVIDIDK TKEKALNVAL NKISGEWNEA LLAELIKDLQ SIDYDVSFTG FEPPEIEELF SNVHDKEIKE DDFDVEDALK EPVISKQGDL WLLGRHRLIC GDSTKAETYE ALMDGKKANL VVTDPPYNVA YEAKAGKIQN DNLKDEEFYN FLYKAFTNMY DAMEKDASIY VFHADTEGLN FRKAFKAVGF YLSGVCIWAK QSLVLGRSPY QWKHEPVLFG WKKEGRHNWY SDRKQSTIWS FDRPSKNALH PTMKPVALCA YPIQNSSMSN CIVLDPFGGS GSTLIACEQT NRICYTIELD EKYADVIVKR YIEQVGTDEE VFLVRDGVKI KYADIKKEGC D
|
| |