Gene Information |
Locus tag | Cthe_2320 |
Symbol | |
ID | 4809248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2766647 |
End bp | 2767651 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107727 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_001038715 |
Protein GI | 125974805 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
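The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2766647
end_bp = 2767651

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # matches the reported Gene Length
print(protein_length_aa, "aa")   # matches the reported Protein Length
```

The strand ("-") does not affect this arithmetic; it only determines which strand of the replicon the coding sequence is read from.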
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA TTGACTTAAA TAAAATAAGA AACATCAGAA AATACACTGT TGCGTCACTA TTTGCTGGTG CTGGCGGTCT AGACATGGGA TTAGAATTAG CAGGTTTCAA AACCGTTTGG GCTAATGATA TTGATAAAGA TGCTTGTGCT ACATACCGTC TATGGAGTCA GGCAGATGTT GTGCAAGGAG ATATTGCAAA AATTGATTAC TCTGATGTTC CAGATACAGA CGTGATAACG GGTGGTTTTC CATGTCAAGG CTTTTCGTTG GCAGGACCAC GAAAAATTAA TGATGAAAGA AATAAATTGT ATCGTTACTT TGTTAAATTA GTTGAATTAA AGCAACCTTA TGCTTTTATT GCAGAAAATG TAAAAGGCAT CCTTACTCTT GGCGATGGCG AAATTATTGA AGCCATAATA GAGGATTTTG CGAGCAAAGG ATATGATGTA TATCCTAATC TTGTTAATGC CGCTGACTAT GGAGTTCCGC AAGACAGATG GCGTGTTATT CTATACGGAT TTAGAAAAGA TTTAGAAGTA AAAGATTTTA AATTTCCTGA GCCGTTCCCA TATAAAGTTA CCTTGAGAGA GGCTATCGGG GATATGCCTG AACCAAAACA GAGTGATATA TGCCATGCTT CTTATTCAAG TAGGTATATG TCGAGAAACC GAAAAAGAGG TTGGGACGAA GTAAGTTATA CAATTCCAGC AATGGCGAAG CAAGTTCCGT TGCACCCATC TTCTCCAGAC ATGATTAAGA TTGCTGAAGA TAAGTGGATA TTTGGTGAAG GGAAAACAAG AAGATTTAGT TGGCAAGAAG CAGCAGTTAT ACAAACTTTC CCTCGTGATA TGGAATTCGT AGGAAATCTA ACTTCTAAGT ATCGACAGAT AGGAAATGCT GTCCCGGTAA AACTTGCAGA AGTAATAGGT AAGAAATTAT ATGAAATACT TGAAGAGAAG TTAAATGCAA ATGTTAAAGA ATTAGAACAA AAAGTTGGTG AATGA
|
Protein sequence | MKIIDLNKIR NIRKYTVASL FAGAGGLDMG LELAGFKTVW ANDIDKDACA TYRLWSQADV VQGDIAKIDY SDVPDTDVIT GGFPCQGFSL AGPRKINDER NKLYRYFVKL VELKQPYAFI AENVKGILTL GDGEIIEAII EDFASKGYDV YPNLVNAADY GVPQDRWRVI LYGFRKDLEV KDFKFPEPFP YKVTLREAIG DMPEPKQSDI CHASYSSRYM SRNRKRGWDE VSYTIPAMAK QVPLHPSSPD MIKIAEDKWI FGEGKTRRFS WQEAAVIQTF PRDMEFVGNL TSKYRQIGNA VPVKLAEVIG KKLYEILEEK LNANVKELEQ KVGE
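As a sanity check, the two ends of the gene sequence can be translated and compared with the protein sequence above. The sketch below is a minimal illustration, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The function name `translate` is hypothetical.

```python
# Codon table in NCBI order: each base position cycles through T, C, A, G.
# Assumption: table 11 amino-acid assignments (identical to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First 30 bases and last 15 bases of the gene sequence shown above.
first_codons = "ATGAAAATAA TTGACTTAAA TAAAATAAGA"
last_codons = "AAAGTTGGTG AATGA"

print(translate(first_codons))  # compare with the start of the protein sequence
print(translate(last_codons))   # compare with the end of the protein, plus the stop
```

Spaces in the input are stripped, so blocks of ten bases can be pasted directly from the record. Translating the full 1005 bp sequence the same way should give the 334-residue protein followed by a single stop.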
|
| |