Gene Information |
Locus tag | Cthe_1511 |
Symbol | |
ID | 4810549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1835392 |
End bp | 1836381 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106931 |
Product | DNA methylase N-4/N-6 |
Protein accession | YP_001037932 |
Protein GI | 125974022 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1041] Predicted DNA modification methylase |
TIGRFAM ID | |
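The Start bp, End bp, Gene Length and Protein Length fields above are tied together by simple arithmetic. A minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the function names are hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1835392, 1836381)
print(length, protein_length(length))  # 990 329
```

Both values agree with the Gene Length (990 bp) and Protein Length (329 aa) rows of the record.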
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAGAA AAGTCAAGTG CTTGTATTGC GGAGAAGAAT ATACCAGAAC GCAGTATCCT GAACATCTTG AAAAGAACCA TAAAGATGAG TATTTCAGGC TGATCGAAAA AATAAAAAGT GATATTGAGA GTGATTTCAG TATTAAAGAT ACAGCAGCTA ATAATGATGT AACTTGTGCA TTTGTCAAAA AAGTTGCAAA AGAATCAACA ATATGCCTCG AAGAGAAATC AAAAAGTTAT TTTGCGGACA AGCTTAATAT TAAATCGTGG GAGCCTGAAA ATTTCAATCT TGAAACAACT ACTGTATGGA GCTTTCCTGA CAGGGGTGAC TGGGCTACAC ACAGCGGAAA ATACAGGGGT AATTGGTCAC CTTTTATTCC CAGGAATGTT ATTTTGCGAT ATTCAAAAGA AGGAGAAACT GTGCTCGACC AATTTGTAGG TAGTGGAACG ACGCTTGTTG AAGCAAAACT TTTAAAGAGA AAAGGTATAG GTGTTGATAT AAACCCTGAA GCAGTAAATT TAACTTGTCG TAATATAAAC TTTGAAAAAG AAGATTGCGG AGAAACAGAA GTACATGTTG GAGATGCCAG GCATCTTGGA TTTATTAAAG ATGAGAGTGT AGATCTTATA TGCACCCATC CGCCTTACAG TAATATCATT AAGTACAGTG AAGACATAGA GGGAGATCTT TCCCATTGTG ATATCAATGA GTTTCTTGTA GAAATGGAAA AAGTCGCAAA AGAGAGCTAC AGGGTGCTCA AAAAAGGTAG ATTCTGTGCA ATTCTTATAG GGGATACACG GAGAAAAGGT CACATGATTC CTATCGGCTT CAATGTAATG CAGACGTTTT TACGGGCAGG CTTTAAGTTA AAAGAAATTG TAATAAAGGA ACAGCATAAT TGCAGTTCAA CCGGTTACTG GAGAAATCAG AGTATAAAGT ATAATTTTTT ATTAATAGCA CATGAGTATT TATTTATATT CAGGAAATGA
|
Protein sequence | MGRKVKCLYC GEEYTRTQYP EHLEKNHKDE YFRLIEKIKS DIESDFSIKD TAANNDVTCA FVKKVAKEST ICLEEKSKSY FADKLNIKSW EPENFNLETT TVWSFPDRGD WATHSGKYRG NWSPFIPRNV ILRYSKEGET VLDQFVGSGT TLVEAKLLKR KGIGVDINPE AVNLTCRNIN FEKEDCGETE VHVGDARHLG FIKDESVDLI CTHPPYSNII KYSEDIEGDL SHCDINEFLV EMEKVAKESY RVLKKGRFCA ILIGDTRRKG HMIPIGFNVM QTFLRAGFKL KEIVIKEQHN CSSTGYWRNQ SIKYNFLLIA HEYLFIFRK
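The sequences above are printed in space-separated blocks of 10 characters. A small Python sketch, offered as one possible way to join the blocks and compute a GC fraction like the one reported in the GC content field (the helper names are hypothetical, and only the first two blocks of the gene sequence are used as input here):

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a short example.
example = clean_sequence("ATGGGAAGAA AAGTCAAGTG")
print(len(example), gc_fraction(example))  # 20 0.4
```

Passing the full Gene sequence value through the same two functions would give the whole-gene figure to compare with the GC content row.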